Preface


Since the last edition of this book appeared, more than five million scientific papers
have been published. There has been a parallel increase in the quantity of digital
information: new data on genome sequences, protein interactions, molecular structures,
and gene expression—all stored in vast databases. The challenge, for both scientists
and textbook writers, is to convert this overwhelming amount of information
into an accessible and up-to-date understanding of how cells work. 

Help comes from a large increase in the number of review articles that attempt
to make raw material easier to digest, although the vast majority of these reviews
are still quite narrowly focused. Meanwhile, a rapidly growing collection of online
resources tries to convince us that understanding is only a few mouse-clicks away.
In some areas this change in the way we access knowledge has been highly
successful—in discovering the latest information about our own medical problems, for
example. But to understand something of the beauty and complexity of how living
cells work, one needs more than just a wiki-this or wiki-that; it is enormously hard
to identify the valuable and enduring gems from so much confusing landfill. Much
more effective is a carefully wrought narrative that leads logically and progressively
through the key ideas, components, and experiments in such a way that readers
can build for themselves a memorable, conceptual framework for cell biology—
a framework that will allow them to critically evaluate all of the new science and,
more importantly, to understand it. That is what we have tried to do in Molecular
Biology of the Cell.

In preparing this new edition, we have inevitably had to make some difficult
decisions. In order to incorporate exciting new discoveries, while at the same time
keeping the book portable, much has had to be excised. We have added new sections,
such as those on new RNA functions, advances in stem cell biology, new
methods for studying proteins and genes and for imaging cells, advances in the 
genetics and treatment of cancer, and timing, growth control, and morphogenesis 
in development. 

The chemistry of cells is extremely complex, and any list of cell parts and their 
interactions—no matter how complete—will leave huge gaps in our understanding.
We now realize that to produce convincing explanations of cell behavior will require
quantitative information about cells that is coupled to sophisticated mathematical/
computational approaches—some not yet invented. As a consequence, an emerging
goal for cell biologists is to shift their studies more toward quantitative description
and mathematical deduction. We highlight this approach and some of its methods
in a new section at the end of Chapter 8.

Faced with the immensity of what we have learned about cell biology, it might
be tempting for a student to imagine that there is little left to discover. In fact, the
more we find out about cells, the more new questions emerge. To emphasize that
our understanding of cell biology is incomplete, we have highlighted some of the
major gaps in our knowledge by including What We Don't Know at the end of each
chapter. These brief lists include only a tiny sample of the critical unanswered questions
and challenges for the next generation of scientists. We derive great pleasure
from the knowledge that some of our readers will provide future answers. 

The more than 1500 illustrations have been designed to create a parallel narrative,
closely interwoven with the text. We have increased their consistency between
chapters, particularly in the use of color and of common icons; membrane pumps
and channels are a good example. To avoid interruptions to the text, some material
has been moved into new, readily accessible panels. Most of the important protein
structures depicted have now been redrawn and consistently colored. In each
case, we now provide the corresponding Protein Data Bank (PDB) code for the
protein, which can be used to access online tools that provide more information
about it, such as those on the RCSB PDB website (www.rcsb.org). These connections
allow readers of the book to explore more fully the proteins that lie at the core
of cell biology.

John Wilson and Tim Hunt have again contributed their distinctive and imaginative
problems to help students gain a more active understanding of the text.
The problems emphasize quantitative approaches and encourage critical thinking
about published experiments; they are now present at the end of all chapters. The
answers to these problems, plus more than 1800 additional problems and solutions,
all appear in the companion volume that John and Tim have written, Molecular
Biology of the Cell, Sixth Edition: The Problems Book.

We live in a world that presents us with many complex issues related to cell
biology: biodiversity, climate change, food security, environmental degradation,
resource depletion, and human disease. We hope that our textbook will help the
reader better understand and possibly contribute to meeting these challenges. 
Knowledge and understanding bring the power to intervene. 

We are indebted to a large number of scientists whose generous help we mention
separately in the detailed acknowledgments. Here we must mention some particularly
significant contributors. For Chapter 8, Hana El-Samad provided the core
of the section on Mathematical Analysis of Cell Functions, and Karen Hopkin made
valuable contributions to the section on Studying Gene Expression and Function.
Werner Kuhlbrandt helped to reorganize and rewrite Chapter 14 (Energy Conversion:
Mitochondria and Chloroplasts). Rebecca Heald did the same for Chapter 16
(The Cytoskeleton), as did Alexander Schier for Chapter 21 (Development of
Multicellular Organisms), and Matt Welch for Chapter 23 (Pathogens and Infection).
Lewis Lanier aided in the writing of Chapter 24 (The Innate and Adaptive Immune 
Systems). Hossein Amiri generated the enormous online instructor's question bank. 

Before starting out on the revision cycle for this edition, we asked a number of
scientists who had used the last edition to teach cell biology students to meet with
us and suggest improvements. They gave us useful feedback that has helped inform
the new edition. We also benefited from the valuable input of groups of students
who read most of the chapters in page proofs. 

Many people and much effort are needed to convert a long manuscript and a
large pile of sketches into a finished textbook. The team at Garland Science that
managed this conversion was outstanding. Denise Schanck, directing operations,
displayed forbearance, insight, tact, and energy throughout the journey; she guided
us all unerringly, ably assisted by Allie Bochicchio and Janette Scobie. Nigel Orme
oversaw our revamped illustration program, put all the artwork into its final form,
and again enhanced the back cover with his graphics skills. Tiago Barros helped us
refresh our presentation of protein structures. Matthew McClements designed the
book and its front cover. Emma Jeffcock again laid out the final pages, managing endless
rounds of proofs and last-minute changes with remarkable skill and patience;
Georgina Lucas provided her with help. Michael Morales, assisted by Leah Christians,
produced and assembled the complex web of videos, animations, and other
materials that form the core of the online resources that accompany the book. Adam
Sendroff provided us with the valuable feedback from book users around the world
that informed our revision cycle. Casting expert eyes over the manuscript, Elizabeth
Zayatz and Sherry Granum Lewis acted as development editors, Jo Clayton as
copyeditor, and Sally Huish as proofreader. Bill Johncocks compiled the index. In
London, Emily Preece fed us, while the Garland team's professional help, skills, and
energy, together with their friendship, nourished us in every other way throughout
the revision, making the whole process a pleasure. The authors are extremely fortunate
to be supported so generously.

We thank our spouses, families, friends, and colleagues for their continuing support,
which has once again made the writing of this book possible.

Just as we were completing this edition, Julian Lewis, our coauthor, friend, and
colleague, finally succumbed to the cancer that he had fought so heroically for ten
years. Starting in 1979, Julian made major contributions to all six editions, and,
as our most elegant wordsmith, he elevated and enhanced both the style and tone
of all the many chapters he touched. He was noted for his careful scholarly approach,
and clarity and simplicity were at the core of his writing. Julian is irreplaceable, and we
will all deeply miss his friendship and collaboration. We dedicate this Sixth Edition
to his memory. 



Note to the Reader 


Structure of the Book 

Although the chapters of this book can be read independently of one another, they 
are arranged in a logical sequence of five parts. The first three chapters of Part I 
cover elementary principles and basic biochemistry. They can serve either as an 
introduction for those who have not studied biochemistry or as a refresher course
for those who have. Part II deals with the storage, expression, and transmission
of genetic information. Part III presents the principles of the main experimental
methods for investigating and analyzing cells; here, a new section entitled
“Mathematical Analysis of Cell Functions” in Chapter 8 provides an extra dimension in
our understanding of cell regulation and function. Part IV describes the internal
organization of the cell. Part V follows the behavior of cells in multicellular systems,
starting with development of multicellular organisms and concluding with
chapters on pathogens and infection and on the innate and adaptive immune
systems.

End-of-Chapter Problems 

A selection of problems, written by John Wilson and Tim Hunt, appears in the text
at the end of each chapter. New to this edition are problems for the last four chapters
on multicellular organisms. The complete solutions to all of these problems
can be found in Molecular Biology of the Cell, Sixth Edition: The Problems Book.

References 

A concise list of selected references is included at the end of each chapter. These 
are arranged in alphabetical order under the main chapter section headings. 
These references sometimes include the original papers in which important
discoveries were first reported.

Glossary Terms 

Throughout the book, boldface type has been used to highlight key terms at the 
point in a chapter where the main discussion occurs. Italic type is used to set off
important terms with a lesser degree of emphasis. At the end of the book is an
expanded glossary, covering technical terms that are part of the common currency
of cell biology; it should be the first resort for a reader who encounters an
unfamiliar term. The complete glossary as well as a set of flashcards is available
on the Student Website. 

Nomenclature for Genes and Proteins 

Each species has its own conventions for naming genes; the only common feature
is that they are always set in italics. In some species (such as humans), gene
names are spelled out all in capital letters; in other species (such as zebrafish), all
in lowercase; in yet others (most mouse genes), with the first letter in uppercase
and the rest in lowercase; or (as in Drosophila) with different combinations of uppercase
and lowercase, according to whether the first mutant allele to be discovered
produced a dominant or recessive phenotype. Conventions for naming protein
products are equally varied. 

This typographical chaos drives everyone crazy. It is not just tiresome and 
absurd; it is also unsustainable. We cannot independently define a fresh convention
for each of the next few million species whose genes we may wish to study.
Moreover, there are many occasions, especially in a book such as this, where we
need to refer to a gene generically—without specifying the mouse version, the
human version, the chick version, or the hippopotamus version—because they
are all equivalent for the purposes of our discussion. What convention then
should we use? 

We have decided in this book to cast aside the different conventions that are
used in individual species and follow a uniform rule: we write all gene names, like
the names of people and places, with the first letter in uppercase and the rest in
lowercase, but all in italics, thus: Apc, Bazooka, Cdc2, Dishevelled, Egl1. The
corresponding protein, where it is named after the gene, will be written in the same
way, but in roman rather than italic letters: Apc, Bazooka, Cdc2, Dishevelled, Egl1.
When it is necessary to specify the organism, this can be done with a prefix to the 
gene name. 

For completeness, we list a few further details of naming rules that we shall 
follow. In some instances, an added letter in the gene name is traditionally used 
to distinguish between genes that are related by function or evolution; for those
genes, we put that letter in uppercase if it is usual to do so (LacZ, RecA, HoxA4).
We use no hyphen to separate added letters or numbers from the rest of the name.
Proteins are more of a problem. Many of them have names in their own right,
assigned to them before the gene was named. Such protein names take many
forms, although most of them traditionally begin with a lowercase letter (actin,
hemoglobin, catalase), like the names of ordinary substances (cheese, nylon),
unless they are acronyms (such as GFP, for Green Fluorescent Protein, or BMP4,
for Bone Morphogenetic Protein #4). To force all such protein names into a uniform
style would do too much violence to established usages, and we shall simply
write them in the traditional way (actin, GFP, and so on). For the corresponding
gene names in all these cases, we shall nevertheless follow our standard rule:
Actin, Hemoglobin, Catalase, Bmp4, Gfp. Occasionally in our book we need to
highlight a protein name by setting it in italics for emphasis; the intention will 
generally be clear from the context. 
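
The uniform rule described above can be illustrated with a short Python sketch. It is only an illustration, not part of the book's conventions: the function name `unified_gene_name` and the `EXCEPTIONS` table are hypothetical, and the exceptions listed are limited to names that this text itself writes with an uppercase added letter (LacZ, RecA, HoxA4, UvrA).

```python
# Hypothetical helper illustrating the unified naming rule; not an official tool.
# Names that the text writes with a traditional uppercase "added letter".
EXCEPTIONS = {"lacz": "LacZ", "reca": "RecA", "hoxa4": "HoxA4", "uvra": "UvrA"}


def unified_gene_name(name: str) -> str:
    """First letter uppercase, the rest lowercase, and no hyphens."""
    cleaned = name.replace("-", "")  # no hyphen before added letters or numbers
    key = cleaned.lower()
    if key in EXCEPTIONS:            # keep the customary uppercase added letter
        return EXCEPTIONS[key]
    return key[:1].upper() + key[1:]


for species_name in ["HOXA4", "unc-6", "cyclops", "CDC28", "uvrA"]:
    print(species_name, "->", unified_gene_name(species_name))
```

A lookup table is used for the added-letter cases because the text ties them to customary usage ("if it is usual to do so"), which cannot be inferred from the spelling alone.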

For those who wish to know them, the table below shows some of the official
conventions for individual species—conventions that we shall mostly violate in
this book, in the manner shown.



                           Species-Specific Convention              Unified Convention Used in This Book
Organism                   Gene                    Protein          Gene                 Protein
Mouse                      Hoxa4                   Hoxa4            HoxA4                HoxA4
                           Bmp4                    BMP4             Bmp4                 BMP4
                           integrin α-1, Itgα1     integrin α1      Integrin α1, Itgα1   integrin α1
Human                      HOXA4                   HOXA4            HoxA4                HoxA4
Zebrafish                  cyclops, cyc            Cyclops, Cyc     Cyclops, Cyc         Cyclops, Cyc
Caenorhabditis             unc-6                   UNC-6            Unc6                 Unc6
Drosophila                 sevenless, sev (named   Sevenless, SEV   Sevenless, Sev       Sevenless, Sev
                           after recessive
                           phenotype)
                           Deformed, Dfd (named    Deformed, DFD    Deformed, Dfd        Deformed, Dfd
                           after dominant mutant
                           phenotype)
Yeast
Saccharomyces cerevisiae   CDC28                   Cdc28, Cdc28p    Cdc28                Cdc28
(budding yeast)
Schizosaccharomyces pombe  Cdc2                    Cdc2, Cdc2p      Cdc2                 Cdc2
(fission yeast)
Arabidopsis                GAI                     GAI              Gai                  GAI
E. coli                    uvrA                    UvrA             UvrA                 UvrA
Molecular Biology of the Cell, Sixth Edition: The Problems Book

by John Wilson and Tim Hunt (ISBN: 978-0-8153-4453-7) 

The Problems Book is designed to help students appreciate the ways in which 
experiments and simple calculations can lead to an understanding of how cells 
work. It provides problems to accompany Chapters 1-20 of Molecular Biology
of the Cell. Each chapter of problems is divided into sections that correspond to
those of the main textbook and review key terms, test for understanding basic
concepts, pose research-based problems, and now include MCAT-style questions,
which help students to prepare for standardized medical school admission tests.
Molecular Biology of the Cell, Sixth Edition: The Problems Book should be useful
for homework assignments and as a basis for class discussion. It could even provide
ideas for exam questions. Solutions for all of the problems are provided in the
book. Solutions for the end-of-chapter problems for Chapters 1-24 in the main
textbook are also found in The Problems Book.

RESOURCES FOR INSTRUCTORS AND STUDENTS 

The teaching and learning resources for instructors and students are available 
online. The instructor's resources are password-protected and available only to 
adopting instructors. The student resources are available to everyone. We hope 
these resources will enhance student learning and make it easier for instructors to
prepare dynamic lectures and activities for the classroom.

Instructor Resources 

Instructor Resources are available on the Garland Science Instructor's Resource 
Site, located at www.garlandscience.com/instructors. The website provides access
not only to the teaching resources for this book but also to all other Garland Science
textbooks. Adopting instructors can obtain access to the site from their sales
representative or by emailing science@garland.com.

Art of Molecular Biology of the Cell, Sixth Edition

The images from the book are available in two convenient formats: PowerPoint® 
and JPEG. They have been optimized for display on a computer. Figures are
searchable by figure number, by figure name, or by keywords used in the figure 
legend from the book. 

Figure-Integrated Lecture Outlines 

The section headings, concept headings, and figures from the text have been integrated
into PowerPoint presentations. These will be useful for instructors who
would like a head start creating lectures for their course. Like all of our PowerPoint
presentations, the lecture outlines can be customized. For example, the content of
these presentations can be combined with videos and questions from the book or
Question Bank, in order to create unique lectures that facilitate interactive learning.

Animations and Videos 

The 174 animations and videos that are available to students are also available on 
the Instructor's Website in two formats. The WMV-formatted movies are created 
for instructors who wish to use the movies in PowerPoint presentations on Windows®
computers; the QuickTime-formatted movies are for use in PowerPoint
for Apple computers or Keynote® presentations. The movies can easily be downloaded
using the "download" button on the movie preview page. The movies are
correlated to each chapter and callouts are highlighted in color. 

Media Guide 

This document provides an overview of the multimedia available for students and
instructors and contains the text of the voice-over narration for all of the movies.

Question Bank 

Written by Hossein Amiri, University of California, Santa Cruz, this greatly 
expanded question bank includes a variety of question formats: multiple choice,
short answer, fill-in-the-blank, true-false, and matching. There are 35-60 questions
per chapter, and a large number of the multiple-choice questions will be
suitable for use with personal response systems (that is, clickers). The Question
Bank was created with the philosophy that a good exam should do much more
than simply test students' ability to memorize information; it should require them
to reflect upon and integrate information as a part of a sound understanding. This
resource provides a comprehensive sampling of questions that can be used either 
directly or as inspiration for instructors to write their own test questions. 

Diploma® Test Generator Software 

The questions from the Question Bank have been loaded into the Diploma Test
Generator software. The software is easy to use and can scramble questions to create
multiple tests. Questions are organized by chapter and type and can be additionally
categorized by the instructor according to difficulty or subject. Existing
questions can be edited and new ones added. The Test Generator is compatible
with several course management systems, including Blackboard®.

Medical Topics Guide 

This document highlights medically relevant topics covered throughout Molecular 
Biology of the Cell and The Problems Book. It will be particularly useful for instructors
with a large number of premedical, health science, or nursing students.

Blackboard and Learning Management System (LMS) Integration

The movies, book images, and student assessments that accompany the book
can be integrated into Blackboard or other LMSs. These resources are bundled
into a "Common Cartridge" or "Upload Package" that facilitates bulk uploading
of textbook resources into Blackboard and other LMSs. The LMS Common
Cartridge can be obtained on a DVD from your sales representative or by emailing
science@garland.com.

Resources for Students 

The resources for students are available on the Molecular Biology of the Cell 
Student Website, located at www.garlandscience.com/MBOC6-students.

Animations and Videos 

There are 174 movies, covering a wide range of cell biology topics, which review 
key concepts in the book and illuminate subcellular processes. The movies are
correlated to each chapter and callouts are highlighted in color. 

Cell Explorer Slides 

This application teaches cell morphology through interactive micrographs that 
highlight important cellular structures. 

Flashcards 

Each chapter contains a set of flashcards, built into the website, that allow students
to review key terms from the text.

Glossary 

The complete glossary from the book is available on the website and can be
searched and browsed. 
The surface of our planet is populated by living things—curious, intricately organized
chemical factories that take in matter from their surroundings and use these
raw materials to generate copies of themselves. These living organisms appear
extraordinarily diverse. What could be more different than a tiger and a piece of
seaweed, or a bacterium and a tree? Yet our ancestors, knowing nothing of cells or
DNA, saw that all these things had something in common. They called that something
“life,” marveled at it, struggled to define it, and despaired of explaining what
it was or how it worked in terms that relate to nonliving matter.

The discoveries of the past century have not diminished the marvel—quite the
contrary. But they have removed the central mystery regarding the nature of life.
We can now see that all living things are made of cells: small, membrane-enclosed
units filled with a concentrated aqueous solution of chemicals and endowed with
the extraordinary ability to create copies of themselves by growing and then dividing
in two.

Because cells are the fundamental units of life, it is to cell biology—the study
of the structure, function, and behavior of cells—that we must look for answers
to the questions of what life is and how it works. With a deeper understanding of
cells and their evolution, we can begin to tackle the grand historical problems of
life on Earth: its mysterious origins, its stunning diversity, and its invasion of every
conceivable habitat. Indeed, as emphasized long ago by the pioneering cell biologist
E. B. Wilson, "the key to every biological problem must finally be sought in the
cell; for every living organism is, or at some time has been, a cell."

Despite their apparent diversity, living things are fundamentally similar inside. 
The whole of biology is thus a counterpoint between two themes: astonishing
variety in individual particulars; astonishing constancy in fundamental mechanisms.
In this first chapter, we begin by outlining the universal features common
to all life on our planet. We then survey, briefly, the diversity of cells. And we see
how, thanks to the common molecular code in which the specifications for all
living organisms are written, it is possible to read, measure, and decipher these
specifications to help us achieve a coherent understanding of all the forms of life,
from the smallest to the greatest. 


IN THIS CHAPTER 

THE UNIVERSAL FEATURES OF 
CELLS ON EARTH 

THE DIVERSITY OF GENOMES 
AND THE TREE OF LIFE 

GENETIC INFORMATION IN 
EUKARYOTES 
Chapter 1: Cells and Genomes



THE UNIVERSAL FEATURES OF CELLS ON EARTH 

It is estimated that there are more than 10 million—perhaps 100 million—living 
species on Earth today. Each species is different, and each reproduces itself faithfully,
yielding progeny that belong to the same species: the parent organism hands
down information specifying, in extraordinary detail, the characteristics that the
offspring shall have. This phenomenon of heredity is central to the definition of
life: it distinguishes life from other processes, such as the growth of a crystal, or the
burning of a candle, or the formation of waves on water, in which orderly structures
are generated but without the same type of link between the peculiarities of
parents and the peculiarities of offspring. Like the candle flame, the living organism
must consume free energy to create and maintain its organization. But life
employs the free energy to drive a hugely complex system of chemical processes
that are specified by hereditary information. 

Most living organisms are single cells. Others, such as ourselves, are vast multicellular
cities in which groups of cells perform specialized functions linked by
intricate systems of communication. But even for the aggregate of more than 10^13
cells that form a human body, the whole organism has been generated by cell
divisions from a single cell. The single cell, therefore, is the vehicle for all of the
hereditary information that defines each species (Figure 1-1). This cell includes
the machinery to gather raw materials from the environment and to construct
from them a new cell in its own image, complete with a new copy of its hereditary
information. Each and every cell is truly amazing. 
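
To give a feel for the scale involved, the short Python calculation below estimates how many rounds of division are needed to go from one cell to the 10^13 cells quoted above. It rests on a deliberately idealized assumption that is not made in the text: every cell divides in every round and no cell dies, so the population exactly doubles each time. Real development does not proceed this way, so the result is only a lower bound.

```python
import math

# Figure quoted in the text for the number of cells in a human body.
target_cells = 10**13

# Idealized assumption (ours, not the book's): perfect doubling, no cell death.
# After n rounds one cell has become 2**n cells, so we need 2**n >= target_cells.
min_doublings = math.ceil(math.log2(target_cells))

print("Minimum rounds of synchronous division:", min_doublings)
print("Cells after that many rounds:", 2**min_doublings)
```

Under these assumptions a few dozen successive doublings suffice, which illustrates how quickly repeated cell division can build a large organism from a single cell.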

All Cells Store Their Hereditary Information in the Same Linear
Chemical Code: DNA 

Figure 1-1 The hereditary information in the fertilized egg cell determines
the nature of the whole multicellular organism. Although their starting cells
look superficially similar, as indicated, a sea urchin egg gives rise to a sea urchin
(A and B), a mouse egg gives rise to a mouse (C and D), and an egg of the seaweed
Fucus gives rise to a Fucus seaweed (E and F). (A, courtesy of David McClay;
B, courtesy of M. Gibbs, Oxford Scientific Films; C, courtesy of Patricia Calarco, from
G. Martin, Science 209:768-776, 1980. With permission from AAAS; D, courtesy of
O. Newman, Oxford Scientific Films; E and F, courtesy of Colin Brownlee.)

Computers have made us familiar with the concept of information as a measurable
quantity—a million bytes (to record a few hundred pages of text or an image
from a digital camera), 600 million bytes for the music on a CD, and so on. Computers
have also made us well aware that the same information can be recorded
in many different physical forms: the discs and tapes that we used 20 years ago for
our electronic archives have become unreadable on present-day machines. Living
cells, like computers, store information, and it is estimated that they have been
evolving and diversifying for over 3.5 billion years. It is scarcely to be expected
that they would all store their information in the same form, or that the archives
of one type of cell should be readable by the information-handling machinery of
another. And yet it is so. All living cells on Earth store their hereditary information
in the form of double-stranded molecules of DNA—long, unbranched, paired
polymer chains, formed always of the same four types of monomers. These monomers,
chemical compounds known as nucleotides, have nicknames drawn from
a four-letter alphabet—A, T, C, G—and they are strung together in a long linear
sequence that encodes the genetic information, just as the sequence of 1s and 0s
encodes the information in a computer file. We can take a piece of DNA from a
human cell and insert it into a bacterium, or a piece of bacterial DNA and insert
it into a human cell, and the information will be successfully read, interpreted,
and copied. Using chemical methods, scientists have learned how to read out the
complete sequence of monomers in any DNA molecule—extending for many millions
of nucleotides—and thereby decipher all of the hereditary information that
each organism contains.
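
The comparison drawn above between a nucleotide sequence and the 1s and 0s of a computer file can be made concrete. Because the alphabet has four letters, each nucleotide can be represented by two bits. The Python sketch below is an illustration only: the particular assignment of bit patterns to letters and the function names `pack`, `unpack`, and `size_in_bytes` are arbitrary choices made here, not a standard format and not how cells or sequencing software necessarily store DNA.

```python
# Arbitrary, hypothetical 2-bit code for the four nucleotides.
CODE = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
LETTER = {bits: base for base, bits in CODE.items()}


def pack(sequence: str) -> int:
    """Pack a nucleotide sequence into one integer, two bits per nucleotide."""
    value = 1  # sentinel bit, so that leading A's (00) are not lost
    for base in sequence:
        value = (value << 2) | CODE[base]
    return value


def unpack(value: int) -> str:
    """Recover the nucleotide sequence from an integer produced by pack()."""
    bases = []
    while value > 1:  # stop when only the sentinel bit remains
        bases.append(LETTER[value & 0b11])
        value >>= 2
    return "".join(reversed(bases))


def size_in_bytes(n_nucleotides: int) -> float:
    """Storage needed at two bits per nucleotide, eight bits per byte."""
    return n_nucleotides * 2 / 8


packed = pack("GTAA")
print(bin(packed), unpack(packed))
print(size_in_bytes(1_000_000), "bytes for a million nucleotides")
```

At two bits per nucleotide, four nucleotides fit in one byte, so a sequence of a million nucleotides corresponds to about a quarter of the "million bytes" used in the text as a yardstick.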

All Cells Replicate Their Hereditary Information by Templated
Polymerization 

The mechanisms that make life possible depend on the structure of the double-stranded
DNA molecule. Each monomer in a single DNA strand—that is, each
nucleotide—consists of two parts: a sugar (deoxyribose) with a phosphate group
attached to it, and a base, which may be either adenine (A), guanine (G), cytosine
(C), or thymine (T) (Figure 1-2). Each sugar is linked to the next via the phosphate
group, creating a polymer chain composed of a repetitive sugar-phosphate
backbone with a series of bases protruding from it. The DNA polymer is extended
by adding monomers at one end. For a single isolated strand, these monomers
can, in principle, be added in any order, because each one links to the next in the
same way, through the part of the molecule that is the same for all of them. In the
living cell, however, DNA is not synthesized as a free strand in isolation, but on
a template formed by a preexisting DNA strand. The bases protruding from the
existing strand bind to bases of the strand being synthesized, according to a strict
rule defined by the complementary structures of the bases: A binds to T, and C
binds to G. This base-pairing holds fresh monomers in place and thereby controls
the selection of which one of the four monomers shall be added to the growing
strand next. In this way, a double-stranded structure is created, consisting of
two exactly complementary sequences of As, Cs, Ts, and Gs. The two strands twist
around each other, forming a DNA double helix (Figure 1-2E).

Figure 1-2 DNA and its building blocks.
(A) DNA is made from simple subunits, called nucleotides, each consisting of a
sugar-phosphate molecule with a nitrogen-containing side group, or base, attached to it.
The bases are of four types (adenine, guanine, cytosine, and thymine), corresponding to
four distinct nucleotides, labeled A, G, C, and T. (B) A single strand of DNA consists
of nucleotides joined together by sugar-phosphate linkages. Note that the individual
sugar-phosphate units are asymmetric, giving the backbone of the strand a definite
directionality, or polarity. This directionality guides the molecular processes by which the
information in DNA is interpreted and copied in cells: the information is always “read” in
a consistent order, just as written English text is read from left to right. (C) Through
templated polymerization, the sequence of nucleotides in an existing DNA strand controls
the sequence in which nucleotides are joined together in a new DNA strand; T in one
strand pairs with A in the other, and G in one strand with C in the other. The new strand
has a nucleotide sequence complementary to that of the old strand, and a backbone
with opposite directionality: corresponding to the GTAA... of the original strand, it has
...TTAC. (D) A normal DNA molecule consists of two such complementary strands. The
nucleotides within each strand are linked by strong (covalent) chemical bonds; the
complementary nucleotides on opposite strands are held together more weakly, by
hydrogen bonds. (E) The two strands twist around each other to form a double helix—a
robust structure that can accommodate any sequence of nucleotides without altering its
basic structure (see Movie 4.1).

Figure 1-3 The copying of genetic information by DNA replication. In this
process, the two strands of a DNA double helix are pulled apart, and each serves
as a template for synthesis of a new complementary strand.

The bonds between the base pairs are weak compared with the sugar-phosphate
links, and this allows the two DNA strands to be pulled apart without breakage
of their backbones. Each strand then can serve as a template, in the way just
described, for the synthesis of a fresh DNA strand complementary to itself—a
fresh copy, that is, of the hereditary information (Figure 1-3). In different types
of cells, this process of DNA replication occurs at different rates, with different
controls to start it or stop it, and different auxiliary molecules to help it along. But
the basics are universal: DNA is the information store for heredity, and templated
polymerization is the way in which this information is copied throughout the
living world.
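
The pairing rule and the copying scheme described above can be sketched in a few
lines of Python. This is an illustrative model only: the names complement_strand
and replicate, and the choice to represent each strand as a string written in its
own reading direction, are assumptions made for the example and do not come from
the text.

```python
# Pairing rule from the text: A pairs with T, and C pairs with G.
PAIRING = {"A": "T", "T": "A", "C": "G", "G": "C"}


def complement_strand(template: str) -> str:
    """Return the strand templated by `template`.

    The new strand has the opposite directionality, so when it is written in
    its own reading direction the complementary bases appear in reverse order.
    """
    return "".join(PAIRING[base] for base in reversed(template))


def replicate(duplex: tuple[str, str]) -> list[tuple[str, str]]:
    """Pull the two strands apart and use each one as a template (hypothetical helper)."""
    strand_a, strand_b = duplex
    return [
        (strand_a, complement_strand(strand_a)),
        (complement_strand(strand_b), strand_b),
    ]


original = ("GTAA", complement_strand("GTAA"))
daughters = replicate(original)
print(original)   # ('GTAA', 'TTAC'), the example used in Figure 1-2C
print(daughters)  # two duplexes, each identical to the original
```

Because each daughter duplex keeps one old strand and gains one newly templated
strand, both copies carry the same sequence information as the parent.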


All Cells Transcribe Portions of Their Hereditary Information into
the Same Intermediary Form: RNA

To carry out its information-bearing function, DNA must do more than copy itself.
It must also express its information, by letting the information guide the synthesis
of other molecules in the cell. This expression occurs by a mechanism that is the
same in all living organisms, leading first and foremost to the production of two
other key classes of polymers: RNAs and proteins. The process (discussed in detail
in Chapters 6 and 7) begins with a templated polymerization called transcription,
in which segments of the DNA sequence are used as templates for the synthesis
of shorter molecules of the closely related polymer ribonucleic acid, or RNA.
Later, in the more complex process of translation, many of these RNA molecules
direct the synthesis of polymers of a radically different chemical class—the
proteins (Figure 1-4).

In RNA, the backbone is formed of a slightly different sugar from that of DNA—
ribose instead of deoxyribose—and one of the four bases is slightly different—uracil
(U) in place of thymine (T). But the other three bases—A, C, and G—are the
same, and all four bases pair with their complementary counterparts in DNA—the
A, U, C, and G of RNA with the T, A, G, and C of DNA. During transcription, the
RNA monomers are lined up and selected for polymerization on a template strand
of DNA, just as DNA monomers are selected during replication. The outcome is a
polymer molecule whose sequence of nucleotides faithfully represents a portion
of the cell's genetic information, even though it is written in a slightly different
alphabet—consisting of RNA monomers instead of DNA monomers.
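
As a minimal sketch of this pairing rule, the Python snippet below builds the RNA
that would be templated by a short DNA strand. The function name transcribe and
the convention of writing each strand in its own reading direction are assumptions
made for illustration; real transcription also involves start and stop signals
that are not modeled here.

```python
# Pairing stated in the text: the A, U, C, and G of RNA pair with
# the T, A, G, and C of DNA. Keys are DNA template bases, values are RNA bases.
DNA_TO_RNA = {"T": "A", "A": "U", "G": "C", "C": "G"}


def transcribe(template_dna: str) -> str:
    """Return the RNA templated by a DNA strand, written in the RNA's own direction."""
    return "".join(DNA_TO_RNA[base] for base in reversed(template_dna))


print(transcribe("TTAC"))  # GUAA
```

Note that the transcript of the template TTAC reads GUAA, which is the sequence of
the opposite DNA strand (GTAA) with U in place of T.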

The same segment of DNA can be used repeatedly to guide the synthesis of
many identical RNA molecules. Thus, whereas the cell's archive of genetic
information in the form of DNA is fixed and sacrosanct, these RNA transcripts are


[Figure 1-4 diagram labels: DNA; DNA synthesis (REPLICATION); nucleotides;
RNA synthesis (TRANSCRIPTION); RNA; protein synthesis (TRANSLATION); PROTEIN;
amino acids]

Figure 1-4 From DNA to protein.
Genetic information is read out and put to use through a two-step process. First,
in transcription, segments of the DNA sequence are used to guide the synthesis
of molecules of RNA. Then, in translation, the RNA molecules are used to guide the
synthesis of molecules of protein.














mass-produced and disposable (Figure 1-5). As we shall see, these transcripts
function as intermediates in the transfer of genetic information. Most notably,
they serve as messenger RNA (mRNA) molecules that guide the synthesis of
proteins according to the genetic instructions stored in the DNA.

RNA molecules have distinctive structures that can also give them other
specialized chemical capabilities. Being single-stranded, their backbone is flexible,
so that the polymer chain can bend back on itself to allow one part of the molecule
to form weak bonds with another part of the same molecule. This occurs when
segments of the sequence are locally complementary: a ...GGGG... segment, for
example, will tend to associate with a ...CCCC... segment. These types of internal
associations can cause an RNA chain to fold up into a specific shape that is
dictated by its sequence (Figure 1-6). The shape of the RNA molecule, in turn, may
enable it to recognize other molecules by binding to them selectively—and even,
in certain cases, to catalyze chemical changes in the molecules that are bound. In
fact, some chemical reactions catalyzed by RNA molecules are crucial for several
of the most ancient and fundamental processes in living cells, and it has been
suggested that an extensive catalysis by RNA played a central part in the early
evolution of life (discussed in Chapter 6).
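
The idea of locally complementary segments can be illustrated with a small search
for stretches of one RNA chain that could pair with each other. This is a toy
sketch, not a folding algorithm: the function find_stems, its parameters stem_len
and min_loop, and the restriction to the A-U and G-C pairs are all assumptions
introduced for the example.

```python
# Base pairs considered in this sketch: A with U, and G with C.
PAIR = {"A": "U", "U": "A", "G": "C", "C": "G"}


def pairing_partner(segment: str) -> str:
    """Sequence that would pair with `segment` when the chain folds back on itself."""
    return "".join(PAIR[base] for base in reversed(segment))


def find_stems(rna: str, stem_len: int = 4, min_loop: int = 3) -> list[tuple[int, int]]:
    """Return (i, j) start positions of two segments of length stem_len that could pair.

    min_loop is the assumed minimum number of unpaired bases between the segments.
    """
    stems = []
    for i in range(len(rna) - stem_len + 1):
        target = pairing_partner(rna[i:i + stem_len])
        j = rna.find(target, i + stem_len + min_loop)
        while j != -1:
            stems.append((i, j))
            j = rna.find(target, j + 1)
    return stems


print(find_stems("GGGGAAAACCCC"))  # [(0, 8)]
```

Here the GGGG segment at position 0 is matched with the CCCC segment at position 8,
leaving the intervening AAAA as an unpaired loop.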

All Cells Use Proteins as Catalysts

Protein molecules, like DNA and RNA molecules, are long unbranched polymer
chains, formed by stringing together monomeric building blocks drawn from a
standard repertoire that is the same for all living cells. Like DNA and RNA,
proteins carry information in the form of a linear sequence of symbols, in the same
way as a human message written in an alphabetic script. There are many different
protein molecules in each cell, and—leaving out the water—they form most of the
cell's mass.



Figure 1-5 How genetic information is broadcast for use inside the cell.
Each cell contains a fixed set of DNA molecules—its archive of genetic
information. A given segment of this DNA guides the synthesis of many identical RNA
transcripts, which serve as working copies of the information stored in the archive.
Many different sets of RNA molecules can be made by transcribing different parts of
a cell’s DNA sequences, allowing different types of cells to use the same information
store differently.


Figure 1-6 The conformation of an RNA molecule. (A) Nucleotide pairing
between different regions of the same RNA polymer chain causes the molecule
to adopt a distinctive shape. (B) The three-dimensional structure of an actual
RNA molecule produced by hepatitis delta virus; this RNA can catalyze RNA strand
cleavage. The blue ribbon represents the sugar-phosphate backbone and the bars
represent base pairs (see Movie 6.1). (B, based on A.R. Ferré-D’Amaré, K. Zhou,
and J.A. Doudna, Nature 395:567-574, 1998. With permission from Macmillan
Publishers Ltd.)

Figure 1-7 How a protein molecule acts as a catalyst for a chemical reaction. (A) In a protein
molecule, the polymer chain folds up into a specific shape defined by its amino acid sequence. A
groove in the surface of this particular folded molecule, the enzyme lysozyme, forms a catalytic site.
(B) A polysaccharide molecule (red)—a polymer chain of sugar monomers—binds to the catalytic site of
lysozyme and is broken apart, as a result of a covalent bond-breaking reaction catalyzed by the amino
acids lining the groove (see Movie 3.9). (PDB code: 1LYD.)

The monomers of protein, the amino acids, are quite different from those of
DNA and RNA, and there are 20 types instead of 4. Each amino acid is built around
the same core structure through which it can be linked in a standard way to any
other amino acid in the set; attached to this core is a side group that gives each
amino acid a distinctive chemical character. Each of the protein molecules is a
polypeptide, created by joining its amino acids in a particular sequence. Through
billions of years of evolution, this sequence has been selected to give the protein a
useful function. Thus, by folding into a precise three-dimensional form with
reactive sites on its surface (Figure 1-7A), these amino-acid polymers can bind with
high specificity to other molecules and can act as enzymes to catalyze reactions
that make or break covalent bonds. In this way they direct the vast majority of
chemical processes in the cell (Figure 1-7B).

Proteins have many other functions as well—maintaining structures, generating
movements, sensing signals, and so on—each protein molecule performing
a specific function according to its own genetically specified sequence of amino
acids. Proteins, above all, are the main molecules that put the cell's genetic
information into action.

Thus, polynucleotides specify the amino acid sequences of proteins. Proteins,
in turn, catalyze many chemical reactions, including those by which new DNA
molecules are synthesized. From the most fundamental point of view, a living cell
is a self-replicating collection of catalysts that takes in food, processes this food
to derive both the building blocks and energy needed to make more catalysts,
and discards the materials left over as waste (Figure 1-8A). A feedback loop that
connects proteins and polynucleotides forms the basis for this autocatalytic,
self-reproducing behavior of living organisms (Figure 1-8B).

All Cells Translate RNA into Protein in the Same Way

How the information in DNA specifies the production of proteins was a complete
mystery in the 1950s when the double-stranded structure of DNA was first
revealed as the basis of heredity. But in the intervening years, scientists have
discovered the elegant mechanisms involved. The translation of genetic information
from the 4-letter alphabet of polynucleotides into the 20-letter alphabet of
proteins is a complex process. The rules of this translation seem in some respects
neat and rational but in other respects strangely arbitrary, given that they are
(with minor exceptions) identical in all living things. These arbitrary features, it
is thought, reflect frozen accidents in the early history of life. They stem from the
chance properties of the earliest organisms that were passed on by heredity and
have become so deeply embedded in the constitution of all living cells that they
cannot be changed without disastrous effects.

It turns out that the information in the sequence of a messenger RNA molecule
is read out in groups of three nucleotides at a time: each triplet of nucleotides, or
codon, specifies (codes for) a single amino acid in a corresponding protein. Since
the number of distinct triplets that can be formed from four nucleotides is 4^3,
there are 64 possible codons, all of which occur in nature. However, there are only
20 naturally occurring amino acids. That means there are necessarily many cases
in which several codons correspond to the same amino acid. This genetic code is
read out by a special class of small RNA molecules, the transfer RNAs (tRNAs).
Each type of tRNA becomes attached at one end to a specific amino acid, and
displays at its other end a specific sequence of three nucleotides—an anticodon—
that enables it to recognize, through base-pairing, a particular codon or subset of
codons in mRNA. The intricate chemistry that enables these tRNAs to translate
a specific sequence of A, C, G, and U nucleotides in an mRNA molecule into a
specific sequence of amino acids in a protein molecule occurs on the ribosome, a
large multimolecular machine composed of both protein and ribosomal RNA. All
of these processes are described in detail in Chapter 6.
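
The counting argument above, and the many-to-one mapping from codons to amino
acids, can be checked with a short Python sketch. It assumes the standard genetic
code written with the conventional one-letter amino acid abbreviations (with "*"
for a stop codon); the compact table layout and the function name translate are
choices made for this example, and the tRNA and ribosome machinery is not modeled.

```python
from itertools import product

BASES = "UCAG"
# Standard genetic code, one letter per codon with codons enumerated in
# UCAG order (third position varying fastest); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(mrna: str) -> str:
    """Read an mRNA three nucleotides at a time until a stop codon is reached."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(len(CODON_TABLE))               # 64 codons, that is, 4 ** 3
print(len(set(AMINO_ACIDS) - {"*"}))  # 20 amino acids
print(translate("AUGUUUGGCUAA"))      # MFG
```

With 64 codons and only 20 amino acids plus stop signals, several codons must map
to the same amino acid, which is the redundancy the text describes.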

Each Protein Is Encoded by a Specific Gene

DNA molecules as a rule are very large, containing the specifications for
thousands of proteins. Special sequences in the DNA serve as punctuation, defining
where the information for each protein begins and ends. And individual segments
of the long DNA sequence are transcribed into separate mRNA molecules, coding
for different proteins. Each such DNA segment represents one gene. A complication
is that RNA molecules transcribed from the same DNA segment can often be
processed in more than one way, so as to give rise to a set of alternative versions
of a protein, especially in more complex cells such as those of plants and animals.
In addition, some DNA segments—a smaller number—are transcribed into RNA
molecules that are not translated but have catalytic, regulatory, or structural
functions; such DNA segments also count as genes. A gene therefore is defined as the
segment of DNA sequence corresponding to a single protein or set of alternative
protein variants or to a single catalytic, regulatory, or structural RNA molecule.

In all cells, the expression of individual genes is regulated: instead of
manufacturing its full repertoire of possible proteins at full tilt all the time, the cell
adjusts the rate of transcription and translation of different genes independently,
according to need. Stretches of regulatory DNA are interspersed among the
segments that code for protein, and these noncoding regions bind to special protein
molecules that control the local rate of transcription. The quantity and
organization of the regulatory DNA vary widely from one class of organisms to another,
but the basic strategy is universal. In this way, the genome of the cell—that is, the
totality of its genetic information as embodied in its complete DNA sequence—
dictates not only the nature of the cell's proteins, but also when and where they
are to be made.


Figure 1-8 Life as an autocatalytic process. (A) The cell as a self-replicating
collection of catalysts. (B) Polynucleotides (the nucleic acids DNA and RNA, which are
nucleotide polymers) provide the sequence information, while proteins (amino acid
polymers) provide most of the catalytic functions that serve—through a complex
set of chemical reactions—to bring about the synthesis of more polynucleotides and
proteins of the same types.

Life Requires Free Energy

A living cell is a dynamic chemical system, operating far from chemical
equilibrium. For a cell to grow or to make a new cell in its own image, it must take in
free energy from the environment, as well as raw materials, to drive the necessary
synthetic reactions. This consumption of free energy is fundamental to life. When
it stops, a cell decays toward chemical equilibrium and soon dies.

Genetic information is also fundamental to life, and free energy is required
for the propagation of this information. For example, to specify one bit of
information—that is, one yes/no choice between two equally probable alternatives—
costs a defined amount of free energy that can be calculated. The quantitative
relationship involves some deep reasoning and depends on a precise definition of
the term "free energy," as explained in Chapter 2. The basic idea, however, is not
difficult to understand intuitively.
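
The text defers the actual calculation to Chapter 2. As a rough illustration only,
the sketch below assumes that the cost in question is the standard thermodynamic
minimum of k_B T ln 2 per bit. That identification, the temperature of 310 K, and
the function name are assumptions made for this example, not values taken from
the text.

```python
import math

BOLTZMANN_J_PER_K = 1.380649e-23  # Boltzmann constant (exact SI value)
AVOGADRO_PER_MOL = 6.02214076e23  # Avogadro constant (exact SI value)


def min_free_energy_per_bit(temperature_k: float) -> float:
    """Free energy in joules to specify one bit, assuming the bound k_B * T * ln 2."""
    return BOLTZMANN_J_PER_K * temperature_k * math.log(2)


per_bit = min_free_energy_per_bit(310.0)         # 310 K is an assumed temperature
per_mole_kj = per_bit * AVOGADRO_PER_MOL / 1000  # same quantity per mole of bits, in kJ
print(f"{per_bit:.2e} J per bit, {per_mole_kj:.2f} kJ per mole of bits")
```

Under these assumptions the cost per bit is a few zeptojoules, which is small
compared with the energies of the covalent bonds formed during polymerization.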

Picture the molecules in a cell as a swarm of objects endowed with thermal
energy, moving around violently at random, buffeted by collisions with one
another. To specify genetic information—in the form of a DNA sequence, for
example—molecules from this wild crowd must be captured, arranged in a specific
order defined by some preexisting template, and linked together in a fixed
relationship. The bonds that hold the molecules in their proper places on the
template and join them together must be strong enough to resist the disordering
effect of thermal motion. The process is driven forward by consumption of free
energy, which is needed to ensure that the correct bonds are made, and made
robustly. In the simplest case, the molecules can be compared with spring-loaded
traps, ready to snap into a more stable, lower-energy attached state when they
meet their proper partners; as they snap together into the bonded arrangement,
their available stored energy—their free energy—like the energy of the spring
in the trap, is released and dissipated as heat. In a cell, the chemical processes
underlying information transfer are more complex, but the same basic principle
applies: free energy has to be spent on the creation of order.

To replicate its genetic information faithfully, and indeed to make all its complex
molecules according to the correct specifications, the cell therefore requires
free energy, which has to be imported somehow from the surroundings. As we
shall see in Chapter 2, the free energy required by animal cells is derived from
chemical bonds in food molecules that the animals eat, while plants get their free
energy from sunlight.

All Cells Function as Biochemical Factories Dealing with the Same
Basic Molecular Building Blocks

Because all cells make DNA, RNA, and protein, all cells have to contain and
manipulate a similar collection of small molecules, including simple sugars,
nucleotides, and amino acids, as well as other substances that are universally
required. All cells, for example, require the phosphorylated nucleotide ATP
(adenosine triphosphate), not only as a building block for the synthesis of DNA and
RNA, but also as a carrier of the free energy that is needed to drive a huge number
of chemical reactions in the cell.

Although all cells function as biochemical factories of a broadly similar type,
many of the details of their small-molecule transactions differ. Some organisms,
such as plants, require only the simplest of nutrients and harness the energy of
sunlight to make all their own small organic molecules. Other organisms, such as
animals, feed on living things and must obtain many of their organic molecules
ready-made. We return to this point later.

All Cells Are Enclosed in a Plasma Membrane Across Which
Nutrients and Waste Materials Must Pass

Another universal feature is that each cell is enclosed by a membrane—the 
plasma membrane. This container acts as a selective barrier that enables the cell
to concentrate nutrients gathered from its environment and retain the products it
synthesizes for its own use, while excreting its waste products. Without a plasma
membrane, the cell could not maintain its integrity as a coordinated chemical
system.

Figure 1-9 Formation of a membrane by amphiphilic phospholipid
molecules. Phospholipids have a hydrophilic (water-loving, phosphate) head
group and a hydrophobic (water-avoiding, hydrocarbon) tail. At an interface
between oil and water, they arrange themselves as a single sheet with their
head groups facing the water and their tail groups facing the oil. But when
immersed in water, they aggregate to form bilayers enclosing aqueous
compartments, as indicated.

The molecules that form a membrane have the simple physicochemical
property of being amphiphilic—that is, consisting of one part that is hydrophobic
(water-insoluble) and another part that is hydrophilic (water-soluble). Such
molecules placed in water aggregate spontaneously, arranging their hydrophobic
portions to be as much in contact with one another as possible to hide them
from the water, while keeping their hydrophilic portions exposed. Amphiphilic
molecules of appropriate shape, such as the phospholipid molecules that comprise
most of the plasma membrane, spontaneously aggregate in water to create
a bilayer that forms small closed vesicles (Figure 1-9). The phenomenon can be
demonstrated in a test tube by simply mixing phospholipids and water together;
under appropriate conditions, small vesicles form whose aqueous contents are
isolated from the external medium.

Although the chemical details vary, the hydrophobic tails of the predominant
membrane molecules in all cells are hydrocarbon polymers (-CH2-CH2-CH2-),
and their spontaneous assembly into a bilayered vesicle is but one of many examples
of an important general principle: cells produce molecules whose chemical
properties cause them to self-assemble into the structures that a cell needs.

The cell boundary cannot be totally impermeable. If a cell is to grow and reproduce,
it must be able to import raw materials and export waste across its plasma
membrane. All cells therefore have specialized proteins embedded in their membrane
that transport specific molecules from one side to the other. Some of these
membrane transport proteins, like some of the proteins that catalyze the fundamental
small-molecule reactions inside the cell, have been so well preserved over
the course of evolution that we can recognize the family resemblances between
them in comparisons of even the most distantly related groups of living organisms.

The transport proteins in the membrane largely determine which molecules
enter the cell, and the catalytic proteins inside the cell determine the reactions
that those molecules undergo. Thus, by specifying the proteins that the cell is to
manufacture, the genetic information recorded in the DNA sequence dictates the
entire chemistry of the cell; and not only its chemistry, but also its form and its
behavior, for these too are chiefly constructed and controlled by the cell's proteins.




A Living Cell Can Exist with Fewer Than 500 Genes 

The basic principles of biological information transfer are simple enough, but how 
complex are real living cells? In particular, what are the minimum requirements? 
We can get a rough indication by considering a species that has one of the smallest
known genomes—the bacterium Mycoplasma genitalium (Figure 1-10). This
organism lives as a parasite in mammals, and its environment provides it with
many of its small molecules ready-made. Nevertheless, it still has to make all the
large molecules—DNA, RNAs, and proteins—required for the basic processes of
heredity. It has about 530 genes, about 400 of which are essential. Its genome of
580,070 nucleotide pairs represents 145,018 bytes of information—about as much
as it takes to record the text of one chapter of this book. Cell biology may be
complicated, but it is not impossibly so.
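
The byte count quoted above follows from simple arithmetic: each nucleotide pair
is one of four possibilities and so carries two bits, and eight bits make a byte.
The short Python check below reproduces the figure; the function and constant
names are chosen for this example.

```python
import math

BITS_PER_NUCLEOTIDE_PAIR = 2  # four possibilities (A, C, G, T) need log2(4) = 2 bits
BITS_PER_BYTE = 8


def genome_size_bytes(nucleotide_pairs: int) -> int:
    """Bytes needed to store a sequence of the given length, rounded up."""
    return math.ceil(nucleotide_pairs * BITS_PER_NUCLEOTIDE_PAIR / BITS_PER_BYTE)


print(genome_size_bytes(580_070))  # 145018
```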

The minimum number of genes for a viable cell in today's environments is 
probably not less than 300, although there are only about 60 genes in the core set 
that is shared by all living species.

Summary

The individual cell is the minimal self-reproducing unit of living matter, and it
consists of a self-replicating collection of catalysts. Central to this reproduction is the
transmission of genetic information to progeny cells. Every cell on our planet stores
its genetic information in the same chemical form—as double-stranded DNA. The
cell replicates its information by separating the paired DNA strands and using each
as a template for polymerization to make a new DNA strand with a complementary
sequence of nucleotides. The same strategy of templated polymerization is used
to transcribe portions of the information from DNA into molecules of the closely
related polymer, RNA. These RNA molecules in turn guide the synthesis of protein
molecules by the more complex machinery of translation, involving a large
multimolecular machine, the ribosome. Proteins are the principal catalysts for almost
all the chemical reactions in the cell; their other functions include the selective
import and export of small molecules across the plasma membrane that forms the
cell's boundary. The specific function of each protein depends on its amino acid
sequence, which is specified by the nucleotide sequence of a corresponding segment
of the DNA—the gene that codes for that protein. In this way, the genome of the
cell determines its chemistry; and the chemistry of every living cell is fundamentally
similar, because it must provide for the synthesis of DNA, RNA, and protein. The
simplest known cells can survive with about 400 genes.

THE DIVERSITY OF GENOMES AND THE TREE OF LIFE 

The success of living organisms based on DNA, RNA, and protein has been
spectacular. Life has populated the oceans, covered the land, infiltrated the Earth's
crust, and molded the surface of our planet. Our oxygen-rich atmosphere, the
deposits of coal and oil, the layers of iron ores, the cliffs of chalk and limestone
and marble—all these are products, directly or indirectly, of past biological
activity on Earth.

Living things are not confined to the familiar temperate realm of land, water,
and sunlight inhabited by plants and plant-eating animals. They can be found in
the darkest depths of the ocean, in hot volcanic mud, in pools beneath the frozen
surface of the Antarctic, and buried kilometers deep in the Earth's crust. The
creatures that live in these extreme environments are generally unfamiliar, not
only because they are inaccessible, but also because they are mostly microscopic.
In more homely habitats, too, most organisms are too small for us to see without
special equipment: they tend to go unnoticed, unless they cause a disease or rot
the timbers of our houses. Yet microorganisms make up most of the total mass
of living matter on our planet. Only recently, through new methods of molecular
analysis and specifically through the analysis of DNA sequences, have we begun
to get a picture of life on Earth that is not grossly distorted by our biased
perspective as large animals living on dry land.

In this section, we consider the diversity of organisms and the relationships 
among them. Because the genetic information for every organism is written in 
the universal language of DNA sequences, and the DNA sequence of any given 
organism can be readily obtained by standard biochemical techniques, it is now 
possible to characterize, catalog, and compare any set of living organisms with 
reference to these sequences. From such comparisons we can estimate the place 
of each organism in the family tree of living species—the “tree of life.” But before
describing what this approach reveals, we need first to consider the routes by
which cells in different environments obtain the matter and energy they require to
survive and proliferate, and the ways in which some classes of organisms depend
on others for their basic chemical needs.

Cells Can Be Powered by a Variety of Free-Energy Sources 

Living organisms obtain their free energy in different ways. Some, such as animals,
fungi, and the many different bacteria that live in the human gut, get it by feeding
on other living things or the organic chemicals they produce; such organisms
are called organotrophic (from the Greek word trophe, meaning “food”). Others
derive their energy directly from the nonliving world. These primary energy
converters fall into two classes: those that harvest the energy of sunlight, and those
that capture their energy from energy-rich systems of inorganic chemicals in the
environment (chemical systems that are far from chemical equilibrium). Organisms
of the former class are called phototrophic (feeding on sunlight); those of the
latter are called lithotrophic (feeding on rock). Organotrophic organisms could
not exist without these primary energy converters, which are the most plentiful
form of life.

Figure 1-10 Mycoplasma genitalium.
(A) Scanning electron micrograph showing the irregular shape of this small bacterium,
reflecting the lack of any rigid cell wall. (B) Cross section (transmission electron
micrograph) of a Mycoplasma cell. Of the 530 genes of Mycoplasma genitalium,
43 code for transfer, ribosomal, and other non-messenger RNAs. Functions are
known, or can be guessed, for 339 of the genes coding for protein: of these, 154
are involved in replication, transcription, translation, and related processes
involving DNA, RNA, and protein; 98 in the membrane and surface structures of the
cell; 46 in the transport of nutrients and other molecules across the membrane;
71 in energy conversion and the synthesis and degradation of small molecules; and
12 in the regulation of cell division and other processes. Note that these
categories are partly overlapping, so that some genes feature twice. (A, from
S. Razin et al., Infect. Immun. 30:538-546, 1980. With permission from the American
Society for Microbiology; B, courtesy of Roger Cole, in Medical Microbiology, 4th
ed. [S. Baron ed.]. Galveston: University of Texas Medical Branch, 1996.)

Phototrophic organisms include many types of bacteria, as well as algae and
plants, on which we—and virtually all the living things that we ordinarily see
around us—depend. Phototrophic organisms have changed the whole chemistry
of our environment: the oxygen in the Earth's atmosphere is a by-product of their
biosynthetic activities.

Lithotrophic organisms are not such an obvious feature of our world, because
they are microscopic and mostly live in habitats that humans do not frequent—
deep in the ocean, buried in the Earth's crust, or in various other inhospitable
environments. But they are a major part of the living world, and they are especially
important in any consideration of the history of life on Earth.

Some lithotrophs get energy from aerobic reactions, which use molecular oxygen
from the environment; since atmospheric O2 is ultimately the product of living
organisms, these aerobic lithotrophs are, in a sense, feeding on the products
of past life. There are, however, other lithotrophs that live anaerobically, in places
where little or no molecular oxygen is present. These are circumstances similar
to those that existed in the early days of life on Earth, before oxygen had
accumulated.

The most dramatic of these sites are the hot hydrothermal vents on the floor of
the Pacific and Atlantic Oceans. They are located where the ocean floor is spreading
as new portions of the Earth's crust form by a gradual upwelling of material
from the Earth's interior (Figure 1-11). Downward-percolating seawater is heated
and driven back upward as a submarine geyser, carrying with it a current of
chemicals from the hot rocks below. A typical cocktail might include H2S, H2, CO,
Mn2+, Fe2+, Ni2+, CH2, NH4+, and phosphorus-containing compounds. A dense
population of microbes lives in the neighborhood of the vent, thriving on this austere
diet and harvesting free energy from reactions between the available chemicals.
Other organisms—clams, mussels, and giant marine worms—in turn live off
the microbes at the vent, forming an entire ecosystem analogous to the world of
plants and animals that we belong to, but powered by geochemical energy instead
of light (Figure 1-12).

Figure 1-11 The geology of a hot hydrothermal vent in the ocean floor. As
indicated, water percolates down toward the hot molten rock upwelling from the
Earth’s interior and is heated and driven back upward, carrying minerals leached
from the hot rock. A temperature gradient is set up, from more than 350°C near the
core of the vent, down to 2-3°C in the surrounding ocean. Minerals precipitate
from the water as it cools, forming a chimney. Different classes of organisms,
thriving at different temperatures, live in different neighborhoods of the chimney. A
typical chimney might be a few meters tall, spewing out hot, mineral-rich water at a
flow rate of 1-2 m/sec.

Figure 1-12 Organisms living at a depth of 2500 meters near a vent in
the ocean floor. Close to the vent, at temperatures up to about 120°C, various
lithotrophic species of bacteria and archaea (archaebacteria) live, directly fueled by
geochemical energy. A little further away, where the temperature is lower, various
invertebrate animals live by feeding on these microorganisms. Most remarkable
are these giant (2 meter) tube worms, Riftia pachyptila, which, rather than feed
on the lithotrophic cells, live in symbiosis with them: specialized organs in the
worms harbor huge numbers of symbiotic sulfur-oxidizing bacteria. These bacteria
harness geochemical energy and supply nourishment to their hosts, which have
no mouth, gut, or anus. The tube worms are thought to have evolved from more
conventional animals, and to have become secondarily adapted to life at hydrothermal
vents. (Courtesy of Monika Bright, University of Vienna, Austria.)

Some Cells Fix Nitrogen and Carbon Dioxide for Others 

To make a living cell requires matter, as well as free energy. DNA, RNA, and
protein are composed of just six elements: hydrogen, carbon, nitrogen, oxygen,
sulfur, and phosphorus. These are all plentiful in the nonliving environment, in
the Earth's rocks, water, and atmosphere. But they are not present in chemical
forms that allow easy incorporation into biological molecules. Atmospheric N₂
and CO₂, in particular, are extremely unreactive. A large amount of free energy
is required to drive the reactions that use these inorganic molecules to make
the organic compounds needed for further biosynthesis, that is, to fix nitrogen
and carbon dioxide, so as to make N and C available to living organisms. Many
types of living cells lack the biochemical machinery to achieve this fixation;
they instead rely on other classes of cells to do the job for them. We animals
depend on plants for our supplies of organic carbon and nitrogen compounds.
Plants in turn, although they can fix carbon dioxide from the atmosphere, lack
the ability to fix atmospheric nitrogen; they depend in part on nitrogen-fixing
bacteria to supply their need for nitrogen compounds. Plants of the pea family,
for example, harbor symbiotic nitrogen-fixing bacteria in nodules in their roots.

Living cells therefore differ widely in some of the most basic aspects of their
biochemistry. Not surprisingly, cells with complementary needs and capabilities
have developed close associations. Some of these associations, as we see below,
have evolved to the point where the partners have lost their separate identities
altogether: they have joined forces to form a single composite cell.

The Greatest Biochemical Diversity Exists Among Prokaryotic Cells 

From simple microscopy, it has long been clear that living organisms can be
classified on the basis of cell structure into two groups: the eukaryotes and the
prokaryotes. Eukaryotes keep their DNA in a distinct membrane-enclosed
intracellular compartment called the nucleus. (The name is from the Greek,
meaning "truly nucleated," from the words eu, "well" or "truly," and karyon,
"kernel" or "nucleus.") Prokaryotes have no distinct nuclear compartment to
house their DNA. Plants, fungi, and animals are eukaryotes; bacteria are
prokaryotes, as are archaea, a separate class of prokaryotic cells, discussed
below.

Most prokaryotic cells are small and simple in outward appearance (Figure
1-13), and they live mostly as independent individuals or in loosely organized
communities, rather than as multicellular organisms. They are typically spherical
or rod-shaped and measure a few micrometers in linear dimension. They often
have a tough protective coat, called a cell wall, beneath which a plasma
membrane encloses a single cytoplasmic compartment containing DNA, RNA,
proteins, and the many small molecules needed for life. In the electron
microscope, this cell interior appears as a matrix of varying texture without
any discernible organized internal structure (Figure 1-14).

Prokaryotic cells live in an enormous variety of ecological niches, and they are
astonishingly varied in their biochemical capabilities, far more so than
eukaryotic cells. Organotrophic species can utilize virtually any type of organic
molecule as food, from sugars and amino acids to hydrocarbons and methane gas.
Phototrophic species (Figure 1-15) harvest light energy in a variety of ways,
some of them generating oxygen as a by-product, others not. Lithotrophic species
can feed on a plain diet of inorganic nutrients, getting their carbon from CO₂,
and relying on H₂S to fuel their energy needs (Figure 1-16), or on H₂, or Fe²⁺,
or elemental sulfur, or any of a host of other chemicals that occur in the
environment.


Figure 1-13 Shapes and sizes of some bacteria. Although most are small, as
shown, measuring a few micrometers in linear dimension, there are also some
giant species. An extreme example (not shown) is the cigar-shaped bacterium
Epulopiscium fishelsoni, which lives in the gut of a surgeonfish and can be up
to 600 μm long.

Figure 1-14 The structure of a bacterium. (A) The bacterium Vibrio
cholerae, showing its simple internal organization. Like many other species,
Vibrio has a helical appendage at one end (a flagellum) that rotates as a
propeller to drive the cell forward. It can infect the human small intestine to
cause cholera; the severe diarrhea that accompanies this disease kills more
than 100,000 people a year. (B) An electron micrograph of a longitudinal
section through the widely studied bacterium Escherichia coli (E. coli). The
cell's DNA is concentrated in the lightly stained region. Part of our normal
intestinal flora, E. coli is related to Vibrio, and it has many flagella distributed
over its surface that are not visible in this section. (B, courtesy of
E. Kellenberger.)

Figure 1-15 The phototrophic bacterium Anabaena cylindrica viewed in the light
microscope. The cells of this species form long, multicellular filaments. Most
of the cells (labeled V) perform photosynthesis, while others become
specialized for nitrogen fixation (labeled H) or develop into resistant spores
(labeled S). (Courtesy of Dave G. Adams.)


Much of this world of microscopic organisms is virtually unexplored.
Traditional methods of bacteriology have given us an acquaintance with those
species that can be isolated and cultured in the laboratory. But DNA sequence
analysis of the populations of bacteria and archaea in samples from natural
habitats (such as soil or ocean water, or even the human mouth) has opened our
eyes to the fact that most species cannot be cultured by standard laboratory
techniques. According to one estimate, at least 99% of prokaryotic species
remain to be characterized. They have been detected only by their DNA; it has
not yet been possible to grow the vast majority of them in laboratories.


The Tree of Life Has Three Primary Branches: Bacteria, Archaea, 
and Eukaryotes 

The classification of living things has traditionally depended on comparisons of
their outward appearances: we can see that a fish has eyes, jaws, backbone, brain,
and so on, just as we do, and that a worm does not; that a rosebush is cousin
to an apple tree, but is less similar to a grass. As Darwin showed, we can
readily interpret such close family resemblances in terms of evolution from
common ancestors, and we can find the remains of many of these ancestors
preserved in the fossil record. In this way, it has been possible to begin to
draw a family tree of living organisms, showing the various lines of descent,
as well as branch points in the history, where the ancestors of one group of
species became different from those of another.

When the disparities between organisms become very great, however, these
methods begin to fail. How do we decide whether a fungus is closer kin to a plant
or to an animal? When it comes to prokaryotes, the task becomes harder still: one
microscopic rod or sphere looks much like another. Microbiologists have
therefore sought to classify prokaryotes in terms of their biochemistry and
nutritional requirements. But this approach also has its pitfalls. Amid the
bewildering variety of biochemical behaviors, it is difficult to know which
differences truly reflect differences of evolutionary history.

Genome analysis has now given us a simpler, more direct, and much more
powerful way to determine evolutionary relationships. The complete DNA
sequence of an organism defines its nature with almost perfect precision and in
exhaustive detail. Moreover, this specification is in a digital form, a string
of letters, that can be entered straightforwardly into a computer and compared
with the corresponding information for any other living thing. Because DNA is
subject to random changes that accumulate over long periods of time (as we
shall see shortly), the number of differences between the DNA sequences of two
organisms can provide a direct, objective, quantitative indication of the
evolutionary distance between them.
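
The idea of counting sequence differences can be made concrete with a short
sketch. The Python below is an illustration only: the three sequences are
invented, they are assumed to be already aligned and of equal length, and the
raw count of mismatches is a much cruder measure than the methods actually used
to build evolutionary trees.

```python
def count_differences(seq_a: str, seq_b: str) -> int:
    """Count the positions at which two aligned, equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)


def fraction_different(seq_a: str, seq_b: str) -> float:
    """Differences per site: a crude stand-in for evolutionary distance."""
    return count_differences(seq_a, seq_b) / len(seq_a)


# Hypothetical sequences, made up purely for illustration.
species_a = "ACGTACGTAC"
species_b = "ACGTACGTTC"
species_c = "TCGAACGTTG"

for name, (x, y) in {
    "a vs b": (species_a, species_b),
    "a vs c": (species_a, species_c),
    "b vs c": (species_b, species_c),
}.items():
    print(name, count_differences(x, y), fraction_different(x, y))
```

In this toy case, species a and b differ at fewer sites than either does from
species c, so a and b would be placed closer together on a tree.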

This approach has shown that the organisms that were traditionally classed
together as "bacteria" can be as widely divergent in their evolutionary origins as
is any prokaryote from any eukaryote. It is now clear that the prokaryotes
comprise two distinct groups that diverged early in the history of life on
Earth, before the eukaryotes diverged as a separate group. The two groups of
prokaryotes are called the bacteria (or eubacteria) and the archaea (or
archaebacteria). Detailed genome analyses have recently revealed that the first
eukaryotic cell formed after a particular type of ancient archaeal cell engulfed
an ancient bacterium (see Figure 12-3). Thus, the living world today is
considered to consist of three major divisions or domains: bacteria, archaea,
and eukaryotes (Figure 1-17).

Figure 1-16 A lithotrophic bacterium. Beggiatoa, which lives in sulfurous
environments, gets its energy by oxidizing H₂S and can fix carbon even in the
dark. Note the yellow deposits of sulfur inside the cells. (Courtesy of Ralph
W. Wolfe.)

Archaea are often found inhabiting environments that we humans avoid, such
as bogs, sewage treatment plants, ocean depths, salt brines, and hot acid springs,
although they are also widespread in less extreme and more homely
environments, from soils and lakes to the stomachs of cattle. In outward
appearance they are not easily distinguished from bacteria. At a molecular
level, archaea seem to resemble eukaryotes more closely in their machinery for
handling genetic information (replication, transcription, and translation), but
bacteria more closely in their apparatus for metabolism and energy conversion.
We discuss below how this might be explained.

Some Genes Evolve Rapidly; Others Are Highly Conserved 

Both in the storage and in the copying of genetic information, random accidents
and errors occur, altering the nucleotide sequence, that is, creating mutations.
Therefore, when a cell divides, its two daughters are often not quite identical
to one another or to their parent. On rare occasions, the error may represent a
change for the better; more probably, it will cause no significant difference in
the cell's prospects. But in many cases, the error will cause serious damage, for
example, by disrupting the coding sequence for a key protein. Changes due to
mistakes of the first type will tend to be perpetuated, because the altered cell has
an increased likelihood of reproducing itself. Changes due to mistakes of the
second type (selectively neutral changes) may be perpetuated or not: in the
competition for limited resources, it is a matter of chance whether the altered
cell or its cousins will succeed. But changes that cause serious damage lead
nowhere: the cell that suffers them dies, leaving no progeny. Through endless
repetition of this cycle of error and trial, of mutation and natural selection,
organisms evolve: their genetic specifications change, giving them new ways to
exploit the environment more effectively, to survive in competition with
others, and to reproduce successfully.

Some parts of the genome will change more easily than others in the course of
evolution. A segment of DNA that does not code for protein and has no significant
regulatory role is free to change at a rate limited only by the frequency of
random errors. In contrast, a gene that codes for a highly optimized essential
protein or RNA molecule cannot alter so easily: when mistakes occur, the faulty
cells are almost always eliminated. Genes of this latter sort are therefore
highly conserved. Through 3.5 billion years or more of evolutionary history,
many features of the genome have changed beyond all recognition, but the most
highly conserved genes remain perfectly recognizable in all living species.
These latter genes are the ones we must examine if we wish to trace family
relationships between the most distantly related organisms in the tree of life.
The initial studies that led to the classification of the living world into the
three domains of bacteria, archaea, and eukaryotes were based chiefly on
analysis of one of the rRNA components of the ribosome. Because the translation
of RNA into protein is fundamental to all living cells, this component of the
ribosome has been very well conserved since early in the history of life on
Earth (Figure 1-18).

Figure 1-17 The three major divisions (domains) of the living world. Note that
the word bacteria was originally used to refer to prokaryotes in general, but
more recently has been redefined to refer to eubacteria specifically. The tree
shown here is based on comparisons of the nucleotide sequence of a ribosomal
RNA (rRNA) subunit in the different species, and the distances in the diagram
represent estimates of the numbers of evolutionary changes that have occurred
in this molecule in each lineage (see Figure 1-18). The parts of the tree
shrouded in gray cloud represent uncertainties about details of the true
pattern of species divergence in the course of evolution: comparisons of
nucleotide or amino acid sequences of molecules other than rRNA, as well as
other arguments, can lead to somewhat different trees. As indicated, the
nucleus of the eukaryotic cell is now thought to have emerged from a sub-branch
within the archaea, so that in the beginning the tree of life had only two
branches: bacteria and archaea.

Most Bacteria and Archaea Have 1000-6000 Genes

Natural selection has generally favored those prokaryotic cells that can reproduce
the fastest by taking up raw materials from their environment and replicating
themselves most efficiently, at the maximal rate permitted by the available food
supplies. Small size implies a large ratio of surface area to volume, thereby helping
to maximize the uptake of nutrients across the plasma membrane and boosting a
cell's reproductive rate.
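
The link between small size and a large surface-to-volume ratio can be checked
with a few lines of arithmetic. The sketch below assumes, purely for
illustration, that a cell is a perfect sphere; the radii are arbitrary example
values and are not taken from the text.

```python
import math


def surface_to_volume(radius_um: float) -> float:
    """Surface-area-to-volume ratio (per micrometer) of a sphere."""
    area = 4.0 * math.pi * radius_um ** 2
    volume = (4.0 / 3.0) * math.pi * radius_um ** 3
    return area / volume  # algebraically this is 3 / radius


# Illustrative radii in micrometers (assumed values).
for radius in (0.5, 5.0, 50.0):
    ratio = surface_to_volume(radius)
    print(f"radius {radius:5.1f} um -> surface/volume = {ratio:.2f} per um")
```

For a sphere the ratio is simply 3 divided by the radius, so a tenfold smaller
cell has ten times more membrane area per unit of volume to be supplied.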

Presumably for these reasons, most prokaryotic cells carry very little
superfluous baggage; their genomes are small, with genes packed closely together
and minimal quantities of regulatory DNA between them. The small genome size has
made it easy to use modern DNA sequencing techniques to determine complete
genome sequences. We now have this information for thousands of species of
bacteria and archaea, as well as for hundreds of species of eukaryotes. Most
bacterial and archaeal genomes contain between 10⁶ and 10⁷ nucleotide pairs,
encoding 1000-6000 genes.
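
These ranges imply a rough gene density, which the following sketch works out.
It rests on an assumption not stated in the text: that the smallest genomes
carry the fewest genes and the largest carry the most, so that the ends of the
two ranges can be paired.

```python
# Ranges quoted in the text for most bacterial and archaeal genomes.
GENOME_SIZE_RANGE = (10**6, 10**7)  # nucleotide pairs
GENE_COUNT_RANGE = (1000, 6000)     # genes


def pairs_per_gene(genome_size: int, gene_count: int) -> float:
    """Average number of nucleotide pairs of genome per gene."""
    return genome_size / gene_count


# Assumption: pair the low ends together and the high ends together.
small = pairs_per_gene(GENOME_SIZE_RANGE[0], GENE_COUNT_RANGE[0])
large = pairs_per_gene(GENOME_SIZE_RANGE[1], GENE_COUNT_RANGE[1])
print(f"about {small:.0f} to {large:.0f} nucleotide pairs per gene")
```

Under that assumption there is on the order of one gene for every one to two
thousand nucleotide pairs, consistent with genes being packed closely together.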

A complete DNA sequence reveals both the genes an organism possesses and
the genes it lacks. When we compare the three domains of the living world, we
can begin to see which genes are common to all of them and must therefore have
been present in the cell that was ancestral to all present-day living things, and
which genes are peculiar to a single branch in the tree of life. To explain the
findings, however, we need to consider a little more closely how new genes arise
and genomes evolve.

New Genes Are Generated from Preexisting Genes 

The raw material of evolution is the DNA sequence that already exists: there is
no natural mechanism for making long stretches of new random sequence. In
this sense, no gene is ever entirely new. Innovation can, however, occur in several
ways (Figure 1-19):

1. Intragenic mutation: an existing gene can be randomly modified by changes
in its DNA sequence, through various types of error that occur mainly in the
process of DNA replication.

2. Gene duplication: an existing gene can be accidentally duplicated so as to
create a pair of initially identical genes within a single cell; these two genes
may then diverge in the course of evolution.

3. DNA segment shuffling: two or more existing genes can break and rejoin to
make a hybrid gene consisting of DNA segments that originally belonged to
separate genes.

4. Horizontal (intercellular) transfer: a piece of DNA can be transferred from
the genome of one cell to that of another, even to that of another species.
This process is in contrast with the usual vertical transfer of genetic
information from parent to progeny.

Each of these types of change leaves a characteristic trace in the DNA sequence
of the organism, and there is clear evidence that all four processes have
frequently occurred. In later chapters, we discuss the underlying mechanisms,
but for the present we focus on the consequences.

Figure 1-18 Genetic information conserved since the days of the last common
ancestor of all living things. A part of the gene for the smaller of the two
main rRNA components of the ribosome is shown. (The complete molecule is about
1500-1900 nucleotides long, depending on species.) Corresponding segments of
nucleotide sequence from an archaean (Methanococcus jannaschii), a bacterium
(Escherichia coli), and a eukaryote (Homo sapiens) are aligned. Sites where the
nucleotides are identical between species are indicated by a vertical line; the
human sequence is repeated at the bottom of the alignment so that all three
two-way comparisons can be seen. A dot halfway along the E. coli sequence
denotes a site where a nucleotide has been either deleted from the bacterial
lineage in the course of evolution or inserted in the other two lineages. Note
that the sequences from these three organisms, representative of the three
domains of the living world, still retain unmistakable similarities.

Figure 1-19 Four modes of genetic innovation and their effects on the DNA
sequence of an organism. A special form of horizontal transfer occurs when two
different types of cells enter into a permanent symbiotic association. Genes
from one of the cells then may be transferred to the genome of the other, as we
shall see below when we discuss mitochondria and chloroplasts.
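
The four modes of innovation listed above can be mimicked on toy data. The
sketch below is only a cartoon: genes are short invented strings, a genome is a
Python list of such strings, and the function names are hypothetical labels
chosen for this illustration rather than established terminology in any
software library.

```python
import random


def intragenic_mutation(gene: str, rng: random.Random) -> str:
    """Replace one randomly chosen nucleotide with a different one."""
    pos = rng.randrange(len(gene))
    choices = [n for n in "ACGT" if n != gene[pos]]
    return gene[:pos] + rng.choice(choices) + gene[pos + 1:]


def gene_duplication(genome: list[str], index: int) -> list[str]:
    """Insert a second, initially identical copy next to the chosen gene."""
    return genome[:index + 1] + [genome[index]] + genome[index + 1:]


def segment_shuffling(gene_a: str, gene_b: str, break_a: int, break_b: int) -> str:
    """Join the start of one gene to the end of another to make a hybrid."""
    return gene_a[:break_a] + gene_b[break_b:]


def horizontal_transfer(donor: list[str], recipient: list[str], index: int) -> list[str]:
    """Copy one gene from a donor genome into a recipient genome."""
    return recipient + [donor[index]]


rng = random.Random(0)            # fixed seed so the run is repeatable
genome = ["ATGGCC", "ATGTTT"]     # invented genes
donor = ["ATGAAA"]                # invented genome of another cell

mutated = intragenic_mutation(genome[0], rng)
duplicated = gene_duplication(genome, 0)
hybrid = segment_shuffling(genome[0], genome[1], 3, 3)
transferred = horizontal_transfer(donor, genome, 0)
print(mutated, duplicated, hybrid, transferred)
```

Each operation leaves a different kind of trace: a single changed letter, a
repeated gene, a gene whose two halves match two different parents, or a gene
whose closest relative lies in another genome.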

Gene Duplications Give Rise to Families of Related Genes Within a 
Single Cell 

A cell duplicates its entire genome each time it divides into two daughter cells.
However, accidents occasionally result in the inappropriate duplication of just
part of the genome, with retention of original and duplicate segments in a single
cell. Once a gene has been duplicated in this way, one of the two gene copies is
free to mutate and become specialized to perform a different function within the
same cell. Repeated rounds of this process of duplication and divergence, over
many millions of years, have enabled one gene to give rise to a family of genes that
may all be found within a single genome. Analysis of the DNA sequence of
prokaryotic genomes reveals many examples of such gene families: in the
bacterium Bacillus subtilis, for example, 47% of the genes have one or more
obvious relatives (Figure 1-20).

When genes duplicate and diverge in this way, the individuals of one species
become endowed with multiple variants of a primordial gene. This evolutionary
process has to be distinguished from the genetic divergence that occurs when one
species of organism splits into two separate lines of descent at a branch point in
the family tree, when the human line of descent became separate from that of
chimpanzees, for example. There, the genes gradually become different in the
course of evolution, but they are likely to continue to have corresponding
functions in the two sister species. Genes that are related by descent in this
way (that is, genes in two separate species that derive from the same ancestral
gene in the last common ancestor of those two species) are called orthologs.
Related genes that have resulted from a gene duplication event within a single
genome, and are likely to have diverged in their function, are called paralogs.
Genes that are related by descent in either way are called homologs, a general
term used to cover both types of relationship (Figure 1-21).

Figure 1-20 Families of evolutionarily related genes in the genome of Bacillus
subtilis. Chart labels:

2126 genes with no family relationship
568 genes in families with 2 gene members
273 genes in families with 3 gene members
764 genes in families with 4-19 gene members
283 genes in families with 38-77 gene members

The largest gene family in this bacterium consists of 77 genes coding for
varieties of ABC transporters, a class of membrane transport proteins found in
all three domains of the living world. (Adapted from F. Kunst et al., Nature
390:249-256, 1997. With permission from Macmillan Publishers Ltd.)
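
The 47% quoted for Bacillus subtilis can be checked against the gene counts
given in Figure 1-20. The short tally below uses those counts as transcribed
from the figure labels; the dictionary keys are just descriptive names chosen
for this sketch.

```python
# Gene counts from the labels of Figure 1-20 (Bacillus subtilis).
genes_by_family_size = {
    "no family relationship": 2126,
    "2 members": 568,
    "3 members": 273,
    "4-19 members": 764,
    "38-77 members": 283,
}

total_genes = sum(genes_by_family_size.values())
genes_with_relatives = total_genes - genes_by_family_size["no family relationship"]
percent_with_relatives = 100 * genes_with_relatives / total_genes

print(f"{genes_with_relatives} of {total_genes} genes "
      f"({percent_with_relatives:.0f}%) have one or more relatives")
```

The counts add up to 4014 genes, of which 1888 belong to a family of two or
more, which rounds to the 47% stated in the text.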

Genes Can Be Transferred Between Organisms, Both in the
Laboratory and in Nature

Prokaryotes provide good examples of the horizontal transfer of genes from one
species of cell to another. The most obvious tell-tale signs are sequences
recognizable as being derived from viruses, those infecting bacteria being
called bacteriophages (Figure 1-22). Viruses are small packets of genetic
material that have evolved as parasites on the reproductive and biosynthetic
machinery of host cells. Although not themselves living cells, they often serve
as vectors for gene transfer. A virus will replicate in one cell, emerge from it
with a protective wrapping, and then enter and infect another cell, which may be
of the same or a different species. Often, the infected cell will be killed by
the massive proliferation of virus particles inside it; but sometimes, the viral
DNA, instead of directly generating these particles, may persist in its host for
many cell generations as a relatively innocuous passenger, either as a separate
intracellular fragment of DNA, known as a plasmid, or as a sequence inserted
into the cell's regular genome. In their travels, viruses can accidentally pick
up fragments of DNA from the genome of one host cell and ferry them into another
cell. Such transfers of genetic material are very common in prokaryotes.

Horizontal transfers of genes between eukaryotic cells of different species
are very rare, and they do not seem to have played a significant part in eukaryote
evolution (although massive transfers from bacterial to eukaryotic genomes have
occurred in the evolution of mitochondria and chloroplasts, as we discuss below).
In contrast, horizontal gene transfers occur much more frequently between
different species of prokaryotes. Many prokaryotes have a remarkable capacity to
take up even nonviral DNA molecules from their surroundings and thereby capture
the genetic information these molecules carry. By this route, or by virus-mediated
transfer, bacteria and archaea in the wild can acquire genes from neighboring
cells relatively easily. Genes that confer resistance to an antibiotic or an ability
to produce a toxin, for example, can be transferred from species to species and
provide the recipient bacterium with a selective advantage. In this way, new and
sometimes dangerous strains of bacteria have been observed to evolve in the
bacterial ecosystems that inhabit hospitals or the various niches in the human
body. For example, horizontal gene transfer is responsible for the spread, over
the past 40 years, of penicillin-resistant strains of Neisseria gonorrhoeae, the
bacterium that causes gonorrhea. On a longer time scale, the results can be even
more profound; it has been estimated that at least 18% of all of the genes in
the present-day genome of E. coli have been acquired by horizontal transfer from
another species within the past 100 million years.

Figure 1-21 Paralogous genes and orthologous genes: two types of gene homology
based on different evolutionary pathways. (A) Orthologs. (B) Paralogs.

Figure 1-22 The viral transfer of DNA into a cell. (A) An electron micrograph
of particles of a bacterial virus, the T4 bacteriophage. The head of this virus
contains the viral DNA; the tail contains the apparatus for injecting the DNA
into a host bacterium. (B) A cross section of an E. coli bacterium with a T4
bacteriophage latched onto its surface. The large dark objects inside the
bacterium are the heads of new T4 particles in the course of assembly. When
they are mature, the bacterium will burst open to release them. (C-E) The
process of DNA injection into the bacterium, as visualized in unstained, frozen
samples by cryoelectron microscopy. (C) Attachment begins. (D) Attached state
during DNA injection. (E) Virus head has emptied all of its DNA into the
bacterium. (A, courtesy of James Paulson; B, courtesy of Jonathan King and
Erika Hartwig from G. Karp, Cell and Molecular Biology, 2nd ed. New York: John
Wiley & Sons, 1999. With permission from John Wiley & Sons; C-E, courtesy of
Ian Molineux, University of Texas at Austin and Jun Liu, University of Texas
Health Science Center, Houston.)

Sex Results in Horizontal Exchanges of Genetic Information Within 
a Species 

Horizontal gene transfer among prokaryotes has a parallel in a phenomenon
familiar to us all: sex. In addition to the usual vertical transfer of genetic
material from parent to offspring, sexual reproduction causes a large-scale
horizontal transfer of genetic information between two initially separate cell
lineages, those of the father and the mother. A key feature of sex, of course,
is that the genetic exchange normally occurs only between individuals of the
same species. But no matter whether they occur within a species or between
species, horizontal gene transfers leave a characteristic imprint: they result
in individuals who are related more closely to one set of relatives with respect
to some genes, and more closely to another set of relatives with respect to
others. By comparing the DNA sequences of individual human genomes, an
intelligent visitor from outer space could deduce that humans reproduce
sexually, even if it knew nothing about human behavior.

Sexual reproduction is widespread (although not universal), especially
among eukaryotes. Even bacteria indulge from time to time in controlled sexual
exchanges of DNA with other members of their own species. Natural selection
has clearly favored organisms that can reproduce sexually, although evolutionary
theorists dispute precisely what that selective advantage is.

The Function of a Gene Can Often Be Deduced from Its Sequence

Family relationships among genes are important not just for their historical
interest, but because they simplify the task of deciphering gene functions. Once
the sequence of a newly discovered gene has been determined, a scientist can tap
a few keys on a computer to search the entire database of known gene sequences
for genes related to it. In many cases, the function of one or more of these
homologs will have been already determined experimentally. Since gene sequence
determines gene function, one can frequently make a good guess at the function
of the new gene: it is likely to be similar to that of the already known homologs.
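
The logic of such a database search can be sketched in miniature. Everything in
the example below is hypothetical: the gene names, sequences, and functions are
invented, and the comparison simply counts identical positions in ungapped
sequences, which is far simpler than the alignment methods that real search
tools rely on.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions over the shorter sequence (no gaps)."""
    length = min(len(seq_a), len(seq_b))
    if length == 0:
        return 0.0
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / length


def best_homolog(query: str, database: dict) -> tuple:
    """Return (name, function, identity) for the closest database entry."""
    name, (seq, function) = max(
        database.items(),
        key=lambda item: percent_identity(query, item[1][0]),
    )
    return name, function, percent_identity(query, seq)


# Hypothetical database: names, sequences, and functions are invented.
known_genes = {
    "geneA": ("ATGGCTAAAGGT", "sugar transporter"),
    "geneB": ("ATGCCCTTTGGA", "DNA repair enzyme"),
    "geneC": ("TTGACATATAAT", "ribosomal protein"),
}
new_gene = "ATGGCTAAGGGT"

name, function, identity = best_homolog(new_gene, known_genes)
print(f"closest match: {name} ({identity:.1f}% identical); "
      f"guessed function: {function}")
```

The guessed function is only a hypothesis, to be tested by experiment, because
a similar sequence does not guarantee an identical role.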

In this way, it is possible to decipher a great deal of the biology of an organism
simply by analyzing the DNA sequence of its genome and using the information
we already have about the functions of genes in other organisms that have been
more intensively studied.

More Than 200 Gene Families Are Common to All Three Primary
Branches of the Tree of Life

Given the complete genome sequences of representative organisms from all three
domains (archaea, bacteria, and eukaryotes), we can search systematically for
homologies that span this enormous evolutionary divide. In this way we can begin
to take stock of the common inheritance of all living things. There are
considerable difficulties in this enterprise. For example, individual species
have often lost some of the ancestral genes; other genes have almost certainly
been acquired by horizontal transfer from another species and therefore are not
truly ancestral, even though shared. In fact, genome comparisons strongly
suggest that both lineage-specific gene loss and horizontal gene transfer, in
some cases between evolutionarily distant species, have been major factors of
evolution, at least among prokaryotes. Finally, in the course of 2 or 3 billion
years, some genes that were initially shared will have changed beyond
recognition through mutation.

Because of all these vagaries of the evolutionary process, it seems that only
a small proportion of ancestral gene families has been universally retained in
a recognizable form. Thus, out of 4873 protein-coding gene families defined by
comparing the genomes of 50 species of bacteria, 13 archaea, and 3 unicellular
eukaryotes, only 63 are truly ubiquitous (that is, represented in all the genomes
analyzed). The great majority of these universal families include components of
the translation and transcription systems. This is not likely to be a realistic
approximation of an ancestral gene set. A better, though still crude, idea of
the latter can be obtained by tallying the gene families that have
representatives in multiple, but not necessarily all, species from all three
major domains. Such an analysis reveals 264 ancient conserved families. Each
family can be assigned a function (at least in terms of general biochemical
activity, but usually with more precision). As shown in Table 1-1, the largest
number of shared gene families are involved in translation and in amino acid
metabolism and transport. However, this set of highly conserved gene families
represents only a very rough sketch of the common inheritance of all modern
life. A more precise reconstruction of the gene complement of the last universal
common ancestor will hopefully become feasible with further genome sequencing
and more sophisticated forms of comparative analysis.

TABLE 1-1


The Number of Gene Families, Classified by Function, Common to Ail Three Domains of the Living World 


Information Processing 

Metabolism 


Translation 

63 

Energy production and conversion 

19 

Transcription 

7 

Carbohydrate transport and metabolism 

16 

Réplication, recombination, and repair 

13 

Amino acid transport and metabolism 

43 

Cellular processes and signaling 

Nucléotide transport and metabolism 

15 

Cell-cycle control, mitosis, and meiosis 

2 

Coenzyme transport and metabolism 

22 

Defense mechanisms 

3 

Lipid transport and metabolism 

9 

Signal transduction mechanisms 

1 

Inorganic ion transport and metabolism 

8 

Cell wall/membrane biogenesis 

2 

Secondary métabolite biosynthesis, 
transport, and catabolism 

5 

Intracellular trafficking and sécrétion 

4 

Poorly characterized 

Post-translational modification, protein 
turnover, chaperones 

8 

General biochemical function predicted; 
spécifie biological rôle unknown 

24 


For the purpose of this analysis, gene families are defined as “universal” if they are represented in the genomes of at least two diverse archaea
(Archaeoglobus fulgidus and Aeropyrum pernix), two evolutionarily distant bacteria (Escherichia coli and Bacillus subtilis), and one eukaryote
(yeast, Saccharomyces cerevisiae). (Data from R.L. Tatusov, E.V. Koonin and D.J. Lipman, Science 278:631-637, 1997; R.L. Tatusov et al., BMC
Bioinformatics 4:41, 2003; and the COGs database at the US National Library of Medicine.)
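
As a quick consistency check on Table 1-1, the counts can be tallied by category and compared with the 264 conserved families quoted in the text. The sketch below is only illustrative: the numbers are transcribed from the table, while the dictionary layout and the function name `category_totals` are our own assumptions, and the assignment of rows to the four headings follows the table's grouping as best it can be read.

```python
# Counts transcribed from Table 1-1. The nested-dictionary layout is an
# illustrative choice, not something defined by the source.
TABLE_1_1 = {
    "Information processing": {
        "Translation": 63,
        "Transcription": 7,
        "Replication, recombination, and repair": 13,
    },
    "Cellular processes and signaling": {
        "Cell-cycle control, mitosis, and meiosis": 2,
        "Defense mechanisms": 3,
        "Signal transduction mechanisms": 1,
        "Cell wall/membrane biogenesis": 2,
        "Intracellular trafficking and secretion": 4,
        "Post-translational modification, protein turnover, chaperones": 8,
    },
    "Metabolism": {
        "Energy production and conversion": 19,
        "Carbohydrate transport and metabolism": 16,
        "Amino acid transport and metabolism": 43,
        "Nucleotide transport and metabolism": 15,
        "Coenzyme transport and metabolism": 22,
        "Lipid transport and metabolism": 9,
        "Inorganic ion transport and metabolism": 8,
        "Secondary metabolite biosynthesis, transport, and catabolism": 5,
    },
    "Poorly characterized": {
        "General biochemical function predicted; specific biological role unknown": 24,
    },
}


def category_totals(table):
    """Sum the family counts within each top-level functional category."""
    return {category: sum(families.values()) for category, families in table.items()}


totals = category_totals(TABLE_1_1)
grand_total = sum(totals.values())

# Print each category with its share of all conserved families.
for category, count in totals.items():
    print(f"{category:35s} {count:4d}  ({100 * count / grand_total:.0f}%)")
print(f"{'All categories':35s} {grand_total:4d}")
```

With the rows grouped this way the categories sum to 264, matching the text, and metabolism and information processing together account for the large majority of the conserved families.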


Mutations Reveal the Functions of Genes 

Without additional information, no amount of gazing at genome sequences will 
reveal the functions of genes. We may recognize that gene B is like gene A, but
how do we discover the function of gene A in the first place? And even if we know 
the function of gene A, how do we test whether the function of gene B is truly 
the same as the sequence similarity suggests? How do we connect the world of 
abstract genetic information with the world of real living organisms? 

The analysis of gene functions depends on two complementary approaches:
genetics and biochemistry. Genetics starts with the study of mutants: we either
find or make an organism in which a gene is altered, and then examine the effects
on the organism's structure and performance (Figure 1-23). Biochemistry more
directly examines the functions of molecules: here we extract molecules from an
organism and then study their chemical activities. By combining genetics and
biochemistry, it is possible to find those molecules whose production depends on
a given gene. At the same time, careful studies of the performance of the mutant
organism show us what role those molecules have in the operation of the organism
as a whole. Thus, genetics and biochemistry used in combination with cell
biology provide the best way to relate genes and molecules to the structure and
function of an organism.

Figure 1-23 A mutant phenotype reflecting the function of a gene. A normal
yeast (of the species Schizosaccharomyces pombe) is compared with a mutant in which
a change in a single gene has converted the cell from a cigar shape (left) to a
T shape (right). The mutant gene therefore has a function in the control of cell
shape. But how, in molecular terms, does the gene product perform that function?
That is a harder question, and it needs biochemical analysis to answer it.
(Courtesy of Kenneth Sawin and Paul Nurse.)

In recent years, DNA sequence information and the powerful tools of molecular
biology have accelerated progress. From sequence comparisons, we can often
identify particular subregions within a gene that have been preserved nearly
unchanged over the course of evolution. These conserved subregions are likely
to be the most important parts of the gene in terms of function. We can test their
individual contributions to the activity of the gene product by creating in the
laboratory mutations of specific sites within the gene, or by constructing artificial
hybrid genes that combine part of one gene with part of another. Organisms can
be engineered to make either the RNA or the protein specified by the gene in large
quantities to facilitate biochemical analysis. Specialists in molecular structure can
determine the three-dimensional conformation of the gene product, revealing
the exact position of every atom in it. Biochemists can determine how each of the
parts of the genetically specified molecule contributes to its chemical behavior.
Cell biologists can analyze the behavior of cells that are engineered to express a
mutant version of the gene.
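
The idea of finding conserved subregions by sequence comparison can be sketched in a few lines. The example below is a minimal illustration, not a real analysis pipeline: the four sequences are invented toy data assumed to be already aligned, and the function names `column_conservation` and `conserved_runs`, along with the threshold and minimum run length, are hypothetical choices made for this sketch.

```python
from collections import Counter

# Hypothetical, pre-aligned toy sequences of equal length (not real gene data).
ALIGNED = [
    "ATGGCCAAGTTGACC",
    "ATGGCTAAGTTGTCC",
    "ATGGCAAAGTTGGCC",
    "ATGGCGAAGTTGACA",
]


def column_conservation(aligned):
    """For each alignment column, return the fraction of sequences
    that carry the most common base at that position."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    scores = []
    for column in zip(*aligned):
        most_common_count = Counter(column).most_common(1)[0][1]
        scores.append(most_common_count / len(column))
    return scores


def conserved_runs(scores, threshold=1.0, min_length=3):
    """Return (start, end) index pairs, end exclusive, for runs of columns
    whose conservation score is at or above the threshold."""
    runs = []
    start = None
    # A trailing score of -1.0 closes any run that reaches the last column.
    for i, score in enumerate(scores + [-1.0]):
        if score >= threshold and start is None:
            start = i
        elif score < threshold and start is not None:
            if i - start >= min_length:
                runs.append((start, i))
            start = None
    return runs


scores = column_conservation(ALIGNED)
print(conserved_runs(scores))
```

In this toy alignment the perfectly conserved stretches are columns 0-4 and 6-11; in a real gene, such stretches would be natural candidates for the site-specific mutations described above.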

There is, however, no one simple recipe for discovering a gene's function, and
no simple standard universal format for describing it. We may discover, for example,
that the product of a given gene catalyzes a certain chemical reaction, and
yet have no idea how or why that reaction is important to the organism. The
functional characterization of each new family of gene products, unlike the
description of the gene sequences, presents a fresh challenge to the biologist's ingenuity.
Moreover, we will never fully understand the function of a gene until we learn its
role in the life of the organism as a whole. To make ultimate sense of gene
functions, therefore, we have to study whole organisms, not just molecules or cells.

Molecular Biology Began with a Spotlight on E. coli

Because living organisms are so complex, the more we learn about any particular
species, the more attractive it becomes as an object for further study. Each
discovery raises new questions and provides new tools with which to tackle general
questions in the context of the chosen organism. For this reason, large communities
of biologists have become dedicated to studying different aspects of the same
model organism.

In the early days of molecular biology, the spotlight focused intensely on just
one species: the Escherichia coli, or E. coli, bacterium (see Figures 1-13 and 1-14).
This small, rod-shaped bacterial cell normally lives in the gut of humans and other
vertebrates, but it can be grown easily in a simple nutrient broth in a culture
bottle. It adapts to variable chemical conditions and reproduces rapidly, and it can
evolve by mutation and selection at a remarkable speed. As with other bacteria,
different strains of E. coli, though classified as members of a single species,
differ genetically to a much greater degree than do different varieties of a sexually
reproducing organism such as a plant or animal. One E. coli strain may possess
many hundreds of genes that are absent from another, and the two strains could
have as little as 50% of their genes in common. The standard laboratory strain
E. coli K-12 has a genome of approximately 4.6 million nucleotide pairs, contained
in a single circular molecule of DNA that codes for about 4300 different kinds of
proteins (Figure 1-24).

In molecular terms, we know more about E. coli than about any other living
organism. Most of our understanding of the fundamental mechanisms of life—
for example, how cells replicate their DNA, or how they decode the instructions
represented in the DNA to direct the synthesis of specific proteins—initially came
from studies of E. coli. The basic genetic mechanisms have turned out to be highly
conserved throughout evolution: these mechanisms are essentially the same in
our own cells as in E. coli.

Summary 

Prokaryotes (cells without a distinct nucleus) are biochemically the most diverse
organisms and include species that can obtain all their energy and nutrients from
inorganic chemical sources, such as the reactive mixtures of minerals released at
hydrothermal vents on the ocean floor—the sort of diet that may have nourished the
first living cells 3.5 billion years ago. DNA sequence comparisons reveal the family
relationships of living organisms and show that the prokaryotes fall into two groups
that diverged early in the course of evolution: the bacteria (or eubacteria) and the
archaea. Together with the eukaryotes (cells with a membrane-enclosed nucleus),
these constitute the three primary branches of the tree of life.

Most bacteria and archaea are small unicellular organisms with compact
genomes comprising 1000-6000 genes. Many of the genes within a single organism
show strong family resemblances in their DNA sequences, implying that they
originated from the same ancestral gene through gene duplication and divergence.
Family resemblances (homologies) are also clear when gene sequences are compared
between different species, and more than 200 gene families have been so highly
conserved that they can be recognized as common to most species from all three
domains of the living world. Thus, given the DNA sequence of a newly discovered
gene, it is often possible to deduce the gene's function from the known function of a
homologous gene in an intensively studied model organism, such as the bacterium
E. coli.

Figure 1-24 The genome of E. coli. (A) A cluster of E. coli cells. (B) A diagram of the genome of
E. coli strain K-12. The diagram is circular because the DNA of E. coli, like that of other prokaryotes,
forms a single, closed loop. Protein-coding genes are shown as yellow or orange bars, depending
on the DNA strand from which they are transcribed; genes encoding only RNA molecules are
indicated by green arrows. Some genes are transcribed from one strand of the DNA double helix (in
a clockwise direction in this diagram), others from the other strand (counterclockwise). (A, courtesy
of Dr. Tony Brain and David Parker/Photo Researchers; B, adapted from F.R. Blattner et al., Science
277:1453-1462, 1997.)


GENETIC INFORMATION IN EUKARYOTES 

Eukaryotic cells, in general, are bigger and more elaborate than prokaryotic cells,
and their genomes are bigger and more elaborate, too. The greater size is
accompanied by radical differences in cell structure and function. Moreover, many
classes of eukaryotic cells form multicellular organisms that attain levels of
complexity unmatched by any prokaryote.

Because they are so complex, eukaryotes confront molecular biologists with a
special set of challenges that will concern us in the rest of this book. Increasingly,
biologists attempt to meet these challenges through the analysis and manipulation
of the genetic information within cells and organisms. It is therefore important
at the outset to know something of the special features of the eukaryotic
genome. We begin by briefly discussing how eukaryotic cells are organized, how
this reflects their way of life, and how their genomes differ from those of
prokaryotes. This leads us to an outline of the strategy by which cell biologists, by
exploiting genetic and biochemical information, are attempting to discover how
eukaryotic organisms work.

Eukaryotic Cells May Have Originated as Predators

By definition, eukaryotic cells keep their DNA in an internal compartment called
the nucleus. The nuclear envelope, a double layer of membrane, surrounds the
nucleus and separates the DNA from the cytoplasm. Eukaryotes also have other
features that set them apart from prokaryotes (Figure 1-25). Their cells are,
typically, 10 times bigger in linear dimension and 1000 times larger in volume. They
have an elaborate cytoskeleton—a system of protein filaments crisscrossing the
cytoplasm and forming, together with the many proteins that attach to them, a
system of girders, ropes, and motors that gives the cell mechanical strength,
controls its shape, and drives and guides its movements (Movie 1.1). And the nuclear
envelope is only one part of a set of internal membranes, each structurally similar
to the plasma membrane and enclosing different types of spaces inside the cell,
many of them involved in digestion and secretion. Lacking the tough cell wall of
most bacteria, animal cells and the free-living eukaryotic cells called protozoa can
change their shape rapidly and engulf other cells and small objects by
phagocytosis (Figure 1-26).

Figure 1-25 The major features of eukaryotic cells. The drawing depicts a typical animal cell, but almost all the same components are found in
plants and fungi as well as in single-celled eukaryotes such as yeasts and protozoa. Plant cells contain chloroplasts in addition to the components
shown here, and their plasma membrane is surrounded by a tough external wall formed of cellulose.

Figure 1-26 Phagocytosis. This series of stills from a movie shows a human white
blood cell (a neutrophil) engulfing a red blood cell (artificially colored red) that has
been treated with an antibody that marks it for destruction (see Movie 13.5).
(Courtesy of Stephen E. Malawista and Anne de Boisfleury Chevance.)

How all of the unique properties of eukaryotic cells evolved, and in what
sequence, is still a mystery. One plausible view, however, is that they are all
reflections of the way of life of a primordial cell that was a predator, living by capturing
other cells and eating them (Figure 1-27). Such a way of life requires a large cell
with a flexible plasma membrane, as well as an elaborate cytoskeleton to support
and move this membrane. It may also require that the cell's long, fragile DNA
molecules be sequestered in a separate nuclear compartment, to protect the genome
from damage by the movements of the cytoskeleton.

Modern Eukaryotic Cells Evolved from a Symbiosis

Figure 1-27 A single-celled eukaryote that eats other cells. (A) Didinium is a carnivorous
protozoan, belonging to the group known as ciliates. It has a globular body, about 150 µm in
diameter, encircled by two fringes of cilia—sinuous, whiplike appendages that beat continually; its
front end is flattened except for a single protrusion, rather like a snout. (B) A Didinium engulfing its
prey. Didinium normally swims around in the water at high speed by means of the synchronous
beating of its cilia. When it encounters a suitable prey (yellow), usually another type of protozoan, it
releases numerous small paralyzing darts from its snout region. Then, the Didinium attaches to and
devours the other cell by phagocytosis, inverting like a hollow ball to engulf its victim, which can be
almost as large as itself. (Courtesy of D. Barlow.)

A predatory way of life helps to explain another feature of eukaryotic cells. All
such cells contain (or at one time did contain) mitochondria (Figure 1-28). These
small bodies in the cytoplasm, enclosed by a double layer of membrane, take up
oxygen and harness energy from the oxidation of food molecules—such as
sugars—to produce most of the ATP that powers the cell's activities. Mitochondria are
similar in size to small bacteria, and, like bacteria, they have their own genome in
the form of a circular DNA molecule, their own ribosomes that differ from those
elsewhere in the eukaryotic cell, and their own transfer RNAs. It is now generally
accepted that mitochondria originated from free-living oxygen-metabolizing
(aerobic) bacteria that were engulfed by an ancestral cell that could otherwise
make no such use of oxygen (that is, was anaerobic). Escaping digestion, these
bacteria evolved in symbiosis with the engulfing cell and its progeny, receiving
shelter and nourishment in return for the power generation they performed for
their hosts. This partnership between a primitive anaerobic predator cell and an
aerobic bacterial cell is thought to have been established about 1.5 billion years
ago, when the Earth's atmosphere first became rich in oxygen.

As indicated in Figure 1-29, recent genomic analyses suggest that the first
eukaryotic cells formed after an archaeal cell engulfed an aerobic bacterium. This
would explain why all eukaryotic cells today, including those that live as strict
anaerobes, show clear evidence that they once contained mitochondria.

Many eukaryotic cells—specifically, those of plants and algae—also contain
another class of small membrane-enclosed organelles somewhat similar to
mitochondria—the chloroplasts (Figure 1-30). Chloroplasts perform photosynthesis,
using the energy of sunlight to synthesize carbohydrates from atmospheric carbon
dioxide and water, and deliver the products to the host cell as food. Like
mitochondria, chloroplasts have their own genome. They almost certainly originated
as symbiotic photosynthetic bacteria, acquired by eukaryotic cells that already
possessed mitochondria (Figure 1-31).

A eukaryotic cell equipped with chloroplasts has no need to chase after other
cells as prey; it is nourished by the captive chloroplasts it has inherited from its
ancestors. Correspondingly, plant cells, although they possess the cytoskeletal
equipment for movement, have lost the ability to change shape rapidly and
to engulf other cells by phagocytosis. Instead, they create around themselves a
tough, protective cell wall. If the first eukaryotic cells were predators on other
organisms, we can view plant cells as cells that have made the transition from
hunting to farming.

Figure 1-28 A mitochondrion. (A) A cross section, as seen in the electron microscope.
(B) A drawing of a mitochondrion with part of it cut away to show the
three-dimensional structure (Movie 1.2). (C) A schematic eukaryotic cell, with the interior
space of a mitochondrion, containing the mitochondrial DNA and ribosomes, colored.
Note the smooth outer membrane and the convoluted inner membrane, which houses
the proteins that generate ATP from the oxidation of food molecules. (A, courtesy
of Daniel S. Friend.)

Figure 1-29 The origin of mitochondria. An ancestral anaerobic predator cell (an
archaeon) is thought to have engulfed the bacterial ancestor of mitochondria, initiating
a symbiotic relationship. Clear evidence of a dual bacterial and archaeal inheritance
can be discerned today in the genomes of all eukaryotes.

Fungi represent yet another eukaryotic way of life. Fungal cells, like animal
cells, possess mitochondria but not chloroplasts; but in contrast with animal cells
and protozoa, they have a tough outer wall that limits their ability to move rapidly
or to swallow up other cells. Fungi, it seems, have turned from hunters into
scavengers: other cells secrete nutrient molecules or release them upon death, and
fungi feed on these leavings—performing whatever digestion is necessary
extracellularly, by secreting digestive enzymes to the exterior.

Eukaryotes Have Hybrid Genomes

The genetic information of eukaryotic cells has a hybrid origin—from the ancestral
anaerobic archaeal cell, and from the bacteria that it adopted as symbionts.
Most of this information is stored in the nucleus, but a small amount remains
inside the mitochondria and, for plant and algal cells, in the chloroplasts. When
mitochondrial DNA and the chloroplast DNA are separated from the nuclear DNA
and individually analyzed and sequenced, the mitochondrial and chloroplast
genomes are found to be degenerate, cut-down versions of the corresponding
bacterial genomes. In a human cell, for example, the mitochondrial genome
consists of only 16,569 nucleotide pairs, and codes for only 13 proteins, 2 ribosomal
RNA components, and 22 transfer RNAs.
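
To put "cut-down" into numbers, the short sketch below totals the genes of the human mitochondrial genome and compares its size with a free-living bacterial genome. This is only a rough illustration: E. coli K-12 is used as a convenient reference because its figures are quoted earlier in the chapter, not because it is the ancestor of mitochondria, and the variable names are our own.

```python
# Figures quoted in the text for the human mitochondrial genome.
MITO_NUCLEOTIDE_PAIRS = 16_569
MITO_GENES = {"proteins": 13, "ribosomal RNAs": 2, "transfer RNAs": 22}

# Approximate E. coli K-12 figures quoted earlier in the chapter, used here
# only as a stand-in for a free-living bacterium (an assumption of this sketch).
ECOLI_NUCLEOTIDE_PAIRS = 4.6e6
ECOLI_PROTEINS = 4300

total_mito_genes = sum(MITO_GENES.values())
size_fraction = MITO_NUCLEOTIDE_PAIRS / ECOLI_NUCLEOTIDE_PAIRS
protein_fraction = MITO_GENES["proteins"] / ECOLI_PROTEINS

print(f"Mitochondrial genes in total: {total_mito_genes}")
print(f"Genome size relative to E. coli: {size_fraction:.2%}")
print(f"Protein-coding genes relative to E. coli: {protein_fraction:.2%}")
```

On these figures the mitochondrial genome carries 37 genes and is well under 1% of the size of the E. coli genome, which gives a sense of how much has been lost or relocated.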
Figure 1-30 Chloroplasts. These organelles capture the energy of sunlight
in plant cells and some single-celled eukaryotes. (A) A single cell isolated
from a leaf of a flowering plant, seen in the light microscope, showing the
green chloroplasts (Movie 1.3 and see Movie 14.9). (B) A drawing of one of the
chloroplasts, showing the highly folded system of internal membranes containing
the chlorophyll molecules by which light is absorbed. (A, courtesy of Preeti Dahiya.)

Figure 1-31 The origin of chloroplasts. An early eukaryotic cell, already possessing
mitochondria, engulfed a photosynthetic bacterium (a cyanobacterium) and retained
it in symbiosis. Present-day chloroplasts are thought to trace their ancestry back to
a single species of cyanobacterium that was adopted as an internal symbiont (an
endosymbiont) over a billion years ago.


Many of the genes that are missing from the mitochondria and chloroplasts
have not been lost; instead, they have moved from the symbiont genome into the
DNA of the host cell nucleus. The nuclear DNA of humans contains many genes
coding for proteins that serve essential functions inside the mitochondria; in
plants, the nuclear DNA also contains many genes specifying proteins required in
chloroplasts. In both cases, the DNA sequences of these nuclear genes show clear
evidence of their origin from the bacterial ancestor of the respective organelle.

Eukaryotic Genomes Are Big 

Natural selection has evidently favored mitochondria with small genomes. By
contrast, the nuclear genomes of most eukaryotes seem to have been free to enlarge.
Perhaps the eukaryotic way of life has made large size an advantage: predators
typically need to be bigger than their prey, and cell size generally increases in
proportion to genome size. Whatever the reason, aided by a massive accumulation of
DNA segments derived from parasitic transposable elements (discussed in
Chapter 5), the genomes of most eukaryotes have become orders of magnitude larger
than those of bacteria and archaea (Figure 1-32).

Figure 1-32 Genome sizes compared. Genome size is measured in nucleotide
pairs of DNA per haploid genome, that is, per single copy of the genome. (The cells
of sexually reproducing organisms such as ourselves are generally diploid: they contain
two copies of the genome, one inherited from the mother, the other from the father.)
Closely related organisms can vary widely in the quantity of DNA in their genomes,
even though they contain similar numbers of functionally distinct genes. (Data from
W.H. Li, Molecular Evolution, pp. 380-383. Sunderland, MA: Sinauer, 1997.)

The freedom to be extravagant with DNA has had profound implications.
Eukaryotes not only have more genes than prokaryotes; they also have vastly more
DNA that does not code for protein. The human genome contains 1000 times as
many nucleotide pairs as the genome of a typical bacterium, perhaps 10 times as
many genes, and a great deal more noncoding DNA (~98.5% of the genome for a
human does not code for proteins, as opposed to 11% of the genome for the
bacterium E. coli). The estimated genome sizes and gene numbers for some eukaryotes
are compiled for easy comparison with E. coli in Table 1-2; we shall discuss how
each of these eukaryotes serves as a model organism shortly.

TABLE 1-2 Some Model Organisms and Their Genomes

Organism                              Genome size*          Approximate number
                                      (nucleotide pairs)    of genes

Escherichia coli (bacterium)          4.6 × 10^6            4300
Saccharomyces cerevisiae (yeast)      13 × 10^6             6600
Caenorhabditis elegans (roundworm)    130 × 10^6            21,000
Arabidopsis thaliana (plant)          220 × 10^6            29,000
Drosophila melanogaster (fruit fly)   200 × 10^6            15,000
Danio rerio (zebrafish)               1400 × 10^6           32,000
Mus musculus (mouse)                  2800 × 10^6           30,000
Homo sapiens (human)                  3200 × 10^6           30,000

*Genome size includes an estimate for the amount of highly repeated DNA sequence not in
genome databases.
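
The comparisons drawn from Table 1-2 are simple ratios, and a short script makes them explicit. The sketch below is an illustration only: the values are transcribed from the table, whereas the function name `compare_to_reference` and the choice of E. coli as the reference row are our own assumptions. Because E. coli has a somewhat larger genome than the "typical bacterium" mentioned in the text, the ratios come out a little lower than the round figures of 1000 and 10.

```python
# (organism, genome size in nucleotide pairs, approximate number of genes),
# transcribed from Table 1-2.
TABLE_1_2 = [
    ("Escherichia coli", 4.6e6, 4300),
    ("Saccharomyces cerevisiae", 13e6, 6600),
    ("Caenorhabditis elegans", 130e6, 21000),
    ("Arabidopsis thaliana", 220e6, 29000),
    ("Drosophila melanogaster", 200e6, 15000),
    ("Danio rerio", 1400e6, 32000),
    ("Mus musculus", 2800e6, 30000),
    ("Homo sapiens", 3200e6, 30000),
]


def compare_to_reference(table, reference="Escherichia coli"):
    """Return {organism: (genome-size ratio, gene-number ratio,
    nucleotide pairs per gene)} relative to the reference organism."""
    sizes = {name: size for name, size, _ in table}
    genes = {name: count for name, _, count in table}
    ref_size, ref_genes = sizes[reference], genes[reference]
    return {
        name: (sizes[name] / ref_size, genes[name] / ref_genes, sizes[name] / genes[name])
        for name in sizes
    }


ratios = compare_to_reference(TABLE_1_2)
for name, (size_ratio, gene_ratio, pairs_per_gene) in ratios.items():
    print(f"{name:26s} {size_ratio:6.0f}x genome  {gene_ratio:4.1f}x genes  "
          f"{pairs_per_gene:9,.0f} nucleotide pairs per gene")
```

Relative to E. coli, the human genome is roughly 700 times larger but has only about 7 times as many genes, so the average stretch of DNA per gene rises from about a thousand nucleotide pairs to about a hundred thousand. That gap is the noncoding DNA discussed in the text.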

Eukaryotic Genomes Are Rich in Regulatory DNA 

Much of our noncoding DNA is almost certainly dispensable junk, retained like
a mass of old papers because, when there is little pressure to keep an archive
small, it is easier to retain everything than to sort out the valuable information
and discard the rest. Certain exceptional eukaryotic species, such as the puffer
fish, bear witness to the profligacy of their relatives; they have somehow managed
to rid themselves of large quantities of noncoding DNA. Yet they appear similar in
structure, behavior, and fitness to related species that have vastly more such DNA
(see Figure 4-71).

Even in compact eukaryotic genomes such as that of puffer fish, there is more
noncoding DNA than coding DNA, and at least some of the noncoding DNA certainly
has important functions. In particular, it regulates the expression of adjacent
genes. With this regulatory DNA, eukaryotes have evolved distinctive ways of
controlling when and where a gene is brought into play. This sophisticated gene
regulation is crucial for the formation of complex multicellular organisms.

The Genome Defines the Program of Multicellular Development 

The cells in an individual animal or plant are extraordinarily varied. Fat cells, skin
cells, bone cells, nerve cells—they seem as dissimilar as any cells could be (Figure
1-33). Yet all these cell types are the descendants of a single fertilized egg cell, and
all (with minor exceptions) contain identical copies of the genome of the species.

Figure 1-33 Cell types can vary enormously in size and shape. An
animal nerve cell is compared here with a neutrophil, a type of white blood cell. Both
are drawn to scale.

The differences result from the way in which the cells make selective use of
their genetic instructions according to the cues they get from their surroundings
in the developing embryo. The DNA is not just a shopping list specifying the
molecules that every cell must have, and the cell is not an assembly of all the items
on the list. Rather, the cell behaves as a multipurpose machine, with sensors to
receive environmental signals and with highly developed abilities to call different
sets of genes into action according to the sequences of signals to which the cell
has been exposed. The genome in each cell is big enough to accommodate the
information that specifies an entire multicellular organism, but in any individual
cell only part of that information is used.

Figure 1-34 Genetic control of the program of multicellular development.
The role of a regulatory gene is demonstrated in the snapdragon
Antirrhinum. In this example, a mutation in a single gene coding for a regulatory
protein causes leafy shoots to develop in place of flowers: because a regulatory
protein has been changed, the cells adopt characters that would be appropriate to a
different location in the normal plant. The mutant is on the left, the normal plant on
the right. (Courtesy of Enrico Coen and Rosemary Carpenter.)

A large number of genes in the eukaryotic genome code for proteins that
regulate the activities of other genes. Most of these transcription regulators act by
binding, directly or indirectly, to the regulatory DNA adjacent to the genes that are
to be controlled, or by interfering with the abilities of other proteins to do so. The
expanded genome of eukaryotes therefore not only specifies the hardware of the
cell, but also stores the software that controls how that hardware is used (Figure
1-34).

Cells do not just passively receive signals; rather, they actively exchange signals
with their neighbors. Thus, in a developing multicellular organism, the same
control system governs each cell, but with different consequences depending on
the messages exchanged. The outcome, astonishingly, is a precisely patterned
array of cells in different states, each displaying a character appropriate to its
position in the multicellular structure.

Many Eukaryotes Live as Solitary Cells 

Many species of eukaryotic cells lead a solitary life—some as hunters (the
protozoa), some as photosynthesizers (the unicellular algae), some as scavengers
(the unicellular fungi, or yeasts). Figure 1-35 conveys something of the astonishing
variety of the single-celled eukaryotes. The anatomy of protozoa, especially,
is often elaborate and includes such structures as sensory bristles, photoreceptors,
sinuously beating cilia, leglike appendages, mouth parts, stinging darts, and
musclelike contractile bundles. Although they are single cells, protozoa can be
as intricate, as versatile, and as complex in their behavior as many multicellular
organisms (see Figure 1-27, Movie 1.4, and Movie 1.5).

In terms of their ancestry and DNA sequences, the unicellular eukaryotes are
far more diverse than the multicellular animals, plants, and fungi, which arose as
three comparatively late branches of the eukaryotic pedigree (see Figure 1-17). As
with prokaryotes, humans have tended to neglect them because they are
microscopic. Only now, with the help of genome analysis, are we beginning to
understand their positions in the tree of life, and to put into context the glimpses these
strange creatures can offer us of our distant evolutionary past.

A Yeast Serves as a Minimal Model Eukaryote 

The molecular and genetic complexity of eukaryotes is daunting. Even more than 
for prokaryotes, biologists need to concentrate their limited resources on a few 
selected model organisms to unravel this complexity. 
To analyze the internal workings of the eukaryotic cell without the additional
problems of multicellular development, it makes sense to use a species that is
unicellular and as simple as possible. The popular choice for this role of minimal
model eukaryote has been the yeast Saccharomyces cerevisiae (Figure 1-36)—the
same species that is used by brewers of beer and bakers of bread.

S. cerevisiae is a small, single-celled member of the kingdom of fungi and thus,
according to modern views, is at least as closely related to animals as it is to plants.
It is robust and easy to grow in a simple nutrient medium. Like other fungi, it has a
tough cell wall, is relatively immobile, and possesses mitochondria but not
chloroplasts. When nutrients are plentiful, it grows and divides almost as rapidly as a
bacterium. It can reproduce either vegetatively (that is, by simple cell division), or
sexually: two yeast cells that are haploid (possessing a single copy of the genome)
can fuse to create a cell that is diploid (containing a double genome); and the
diploid cell can undergo meiosis (a reduction division) to produce cells that are once
again haploid (Figure 1-37). In contrast with higher plants and animals, the yeast
can divide indefinitely in either the haploid or the diploid state, and the process
leading from one state to the other can be induced at will by changing the growth
conditions.
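
The reproductive options just described amount to a small state machine with two states and four allowed transitions. The sketch below models it in that way; it is a deliberately simplified illustration, and the event names (`divide`, `mate`, `meiosis`) and the function `run_cycle` are hypothetical labels chosen for this example rather than standard terminology.

```python
# Allowed transitions in the yeast life cycle as described in the text.
# Keys are (current state, event); values are the resulting state.
TRANSITIONS = {
    ("haploid", "divide"): "haploid",   # vegetative proliferation
    ("haploid", "mate"): "diploid",     # fusion of two haploid cells
    ("diploid", "divide"): "diploid",   # vegetative proliferation
    ("diploid", "meiosis"): "haploid",  # reduction division
}


def run_cycle(state, events):
    """Apply a sequence of events and return the list of states visited,
    starting with the initial state."""
    history = [state]
    for event in events:
        key = (state, event)
        if key not in TRANSITIONS:
            raise ValueError(f"{event!r} is not possible in the {state} state")
        state = TRANSITIONS[key]
        history.append(state)
    return history


print(run_cycle("haploid", ["divide", "mate", "divide", "meiosis"]))
```

The model captures the point made in the text: both states can proliferate indefinitely by division, and only mating and meiosis move a cell line from one state to the other. An impossible request, such as meiosis in a haploid cell, is rejected with an error.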

In addition to these features, the yeast has a further property that malces it a 
convenient organism for genetic studies: its genome, by eukaryotic standards, 
is exceptionally small. Nevertheless, it suffices for ail the basic taslcs that every 
eukaryotic cell must perform. Mutants are available for essentially every gene, 


Figure 1-35 An assortment of protozoa:
a small sample of an extremely diverse
class of organisms. The drawings are
done to different scales, but in each case
the scale bar represents 10 µm. The
organisms in (A), (C), and (G) are ciliates;
(B) is a heliozoan; (D) is an amoeba;
(E) is a dinoflagellate; and (F) is a euglenoid.
(From M.A. Sleigh, Biology of Protozoa.
Cambridge, UK: Cambridge University
Press, 1973.)

Figure 1-36 The yeast Saccharomyces
cerevisiae. (A) A scanning electron
micrograph of a cluster of the cells. This
species is also known as budding yeast;
it proliferates by forming a protrusion or
bud that enlarges and then separates
from the rest of the original cell. Many cells
with buds are visible in this micrograph.
(B) A transmission electron micrograph of
a cross section of a yeast cell, showing
its nucleus, mitochondrion, and thick cell
wall. (A, courtesy of Ira Herskowitz and Eric
Schabatach.)

Figure 1-37 The reproductive cycles of the yeast S. cerevisiae.
Depending on environmental conditions and on details of the genotype,
cells of this species can exist in either a diploid (2n) state, with a double
chromosome set, or a haploid (n) state, with a single chromosome set. The
diploid form can either proliferate by ordinary cell-division cycles or undergo
meiosis to produce haploid cells. The haploid form can either proliferate by
ordinary cell-division cycles or undergo sexual fusion with another haploid
cell to become diploid. Meiosis is triggered by starvation and gives rise to
spores—haploid cells in a dormant state, resistant to harsh environmental
conditions.


and studies on yeasts (using both S. cerevisiae and other species) have provided
a key to many crucial processes, including the eukaryotic cell-division cycle—the
critical chain of events by which the nucleus and all the other components of a cell
are duplicated and parceled out to create two daughter cells from one. The control
system that governs this process has been so well conserved over the course of
evolution that many of its components can function interchangeably in yeast and
human cells: if a mutant yeast lacking an essential yeast cell-division-cycle gene
is supplied with a copy of the homologous cell-division-cycle gene from a human,
the yeast is cured of its defect and becomes able to divide normally.

The Expression Levels of All the Genes of An Organism
Can Be Monitored Simultaneously

The complete genome sequence of S. cerevisiae, determined in 1997, consists
of approximately 13,117,000 nucleotide pairs, including the small contribution
(78,520 nucleotide pairs) of the mitochondrial DNA. This total is only about 2.5
times as much DNA as there is in E. coli, and it codes for only 1.5 times as many
distinct proteins (about 6600 in all). The way of life of S. cerevisiae is similar in
many ways to that of a bacterium, and it seems that this yeast has likewise been
subject to selection pressures that have kept its genome compact.

Knowledge of the complete genome sequence of any organism—be it a yeast
or a human—opens up new perspectives on the workings of the cell: things that
once seemed impossibly complex now seem within our grasp. Using techniques
described in Chapter 8, it is now possible, for example, to monitor,
simultaneously, the amount of mRNA transcript that is produced from every gene in the
yeast genome under any chosen conditions, and to see how this whole pattern of
gene activity changes when conditions change. The analysis can be repeated with
mRNA prepared from mutant cells lacking a chosen gene—any gene that we care
to test. In principle, this approach provides a way to reveal the entire system of
control relationships that govern gene expression—not only in yeast cells, but in
any organism whose genome sequence is known.
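
As a rough illustration of the comparison described above, the following Python sketch computes, for each gene, the log2 ratio of transcript abundance between two growth conditions. The gene names, the counts, and the pseudocount are hypothetical values chosen only for illustration; they are not yeast measurements, and a real analysis would add normalization and statistical testing that are omitted here.

```python
import math


def log2_fold_changes(reference, treated, pseudocount=1.0):
    """Return log2(treated / reference) for every gene present in both inputs.

    The pseudocount (an assumption of this sketch) avoids division by zero
    for genes with no detected transcripts.
    """
    changes = {}
    for gene, ref_count in reference.items():
        if gene in treated:
            ratio = (treated[gene] + pseudocount) / (ref_count + pseudocount)
            changes[gene] = math.log2(ratio)
    return changes


# Hypothetical transcript counts under two growth conditions.
condition_1 = {"GENE_A": 99, "GENE_B": 49, "GENE_C": 199}
condition_2 = {"GENE_A": 399, "GENE_B": 49, "GENE_C": 24}

changes = log2_fold_changes(condition_1, condition_2)
for gene, value in sorted(changes.items()):
    print(f"{gene}: {value:+.2f}")
```

A positive value means the gene is more strongly expressed in the second condition, a negative value that it is less strongly expressed, and a value near zero that it is unchanged. Repeating the same calculation with counts from a mutant strain would show which genes depend on the deleted gene.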


Arabidopsis Has Been Chosen Out of 300,000 Species 
As a Model Plant 

The large multicellular organisms that we see around us—the flowers and trees
and animals—seem fantastically varied, but they are much closer to one another
in their evolutionary origins, and more similar in their basic cell biology, than
the great host of microscopic single-celled organisms. Thus, while bacteria and
archaea are separated by perhaps 3.5 billion years of evolution, vertebrates and
insects are separated by about 700 million years, fish and mammals by about 450
million years, and the different species of flowering plants by only about 150
million years.

Because of the close evolutionary relationship between all flowering plants,
we can, once again, get insight into the cell and molecular biology of this whole
class of organisms by focusing on just one or a few species for detailed analysis.
Out of the several hundred thousand species of flowering plants on Earth today,
molecular biologists have chosen to concentrate their efforts on a small weed,
the common Thale cress Arabidopsis thaliana (Figure 1-38), which can be grown
indoors in large numbers and produces thousands of offspring per plant after
8-10 weeks. Arabidopsis has a total genome size of approximately 220 million
nucleotide pairs, about 17 times the size of yeast's (see Table 1-2).

The World of Animal Cells Is Represented By a Worm, a Fly, a 
Fish, a Mouse, and a Human 

Multicellular animals account for the majority of all named species of living
organisms, and for the largest part of the biological research effort. Five species
have emerged as the foremost model organisms for molecular genetic studies. In
order of increasing size, they are the nematode worm Caenorhabditis elegans, the
fly Drosophila melanogaster, the zebrafish Danio rerio, the mouse Mus musculus,
and the human, Homo sapiens. Each has had its genome sequenced.

Caenorhabditis elegans (Figure 1-39) is a small, harmless relative of the
eelworm that attacks crops. With a life cycle of only a few days, an ability to survive in
a freezer indefinitely in a state of suspended animation, a simple body plan, and
an unusual life cycle that is well suited for genetic studies (described in Chapter
21), it is an ideal model organism. C. elegans develops with clockwork precision
from a fertilized egg cell into an adult worm with exactly 959 body cells (plus a
variable number of egg and sperm cells)—an unusual degree of regularity for an
animal. We now have a minutely detailed description of the sequence of events by
which this occurs, as the cells divide, move, and change their character according
to strict and predictable rules. The genome of 130 million nucleotide pairs codes
for about 21,000 proteins, and many mutants and other tools are available for the
testing of gene functions. Although the worm has a body plan very different from
our own, the conservation of biological mechanisms has been sufficient for the
worm to be a model for many of the developmental and cell-biological processes
that occur in the human body. Thus, for example, studies of the worm have been
critical for helping us to understand the programs of cell division and cell death
that determine the number of cells in the body—a topic of great importance for
both developmental biology and cancer research.



Studies in Drosophila Provide a Key to Vertebrate Development 

The fruit fly Drosophila melanogaster (Figure 1-40) has been used as a model
genetic organism for longer than any other; in fact, the foundations of classical
genetics were built to a large extent on studies of this insect. Over 80 years ago, it
provided, for example, definitive proof that genes—the abstract units of
hereditary information—are carried on chromosomes, concrete physical objects whose
behavior had been closely followed in the eukaryotic cell with the light
microscope, but whose function was at first unknown. The proof depended on one of
the many features that make Drosophila peculiarly convenient for genetics—the
giant chromosomes, with characteristic banded appearance, that are visible in

Figure 1-38 Arabidopsis thaliana, the
plant chosen as the primary model
for studying plant molecular genetics.
(Courtesy of Toni Hayden and the John
Innes Foundation.)

Figure 1-39 Caenorhabditis elegans, the first multicellular organism to have its
complete genome sequence determined. This small nematode, about 1 mm long, lives in
the soil. Most individuals are hermaphrodites, producing both eggs and sperm. (Courtesy of
Maria Gallegos, University of Wisconsin, Madison.)

some of its cells (Figure 1-41). Specific changes in the hereditary information,
manifest in families of mutant flies, were found to correlate exactly with the loss
or alteration of specific giant-chromosome bands.

In more recent times, Drosophila, more than any other organism, has shown us
how to trace the chain of cause and effect from the genetic instructions encoded
in the chromosomal DNA to the structure of the adult multicellular body.
Drosophila mutants with body parts strangely misplaced or mispatterned provided
the key to the identification and characterization of the genes required to make
a properly structured body, with gut, limbs, eyes, and all the other parts in their
correct places. Once these Drosophila genes were sequenced, the genomes of
vertebrates could be scanned for homologs. These were found, and their functions
in vertebrates were then tested by analyzing mice in which the genes had been
mutated. The results, as we see later in the book, reveal an astonishing degree of
similarity in the molecular mechanisms that govern insect and vertebrate
development (discussed in Chapter 21).

The majority of all named species of living organisms are insects. Even if
Drosophila had nothing in common with vertebrates, but only with insects, it would
still be an important model organism. But if understanding the molecular
genetics of vertebrates is the goal, why not simply tackle the problem head-on? Why
sidle up to it obliquely, through studies in Drosophila?

Drosophila requires only 9 days to progress from a fertilized egg to an adult; it
is vastly easier and cheaper to breed than any vertebrate, and its genome is much
smaller—about 200 million nucleotide pairs, compared with 3200 million for a
human. This genome codes for about 15,000 proteins, and mutants can now be
obtained for essentially any gene. But there is also another, deeper reason why
genetic mechanisms that are hard to discover in a vertebrate are often
readily revealed in the fly. This relates, as we now explain, to the frequency of gene
duplication, which is substantially greater in vertebrate genomes than in the fly
genome and has probably been crucial in making vertebrates the complex and
subtle creatures that they are.
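
The genome sizes quoted so far in this chapter can be compared directly. The short Python sketch below is only an arithmetic check: the dictionary holds the approximate figures given in the text, and the helper name `size_ratio` is introduced here for illustration.

```python
# Approximate genome sizes in nucleotide pairs, as quoted in this chapter.
GENOME_SIZE_BP = {
    "S. cerevisiae": 13_117_000,
    "A. thaliana": 220_000_000,
    "C. elegans": 130_000_000,
    "D. melanogaster": 200_000_000,
    "H. sapiens": 3_200_000_000,
}


def size_ratio(larger, smaller):
    """Return how many times bigger one genome is than another."""
    return GENOME_SIZE_BP[larger] / GENOME_SIZE_BP[smaller]


arabidopsis_vs_yeast = size_ratio("A. thaliana", "S. cerevisiae")
human_vs_fly = size_ratio("H. sapiens", "D. melanogaster")

print(f"Arabidopsis / yeast: {arabidopsis_vs_yeast:.1f}")
print(f"Human / Drosophila:  {human_vs_fly:.1f}")
```

With these figures the Arabidopsis genome comes out at roughly 17 times the size of the yeast genome, and the human genome at 16 times the size of the fly genome.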

The Vertebrate Genome Is a Product of Repeated Duplications 

Almost every gene in the vertebrate genome has paralogs—other genes in the
same genome that are unmistakably related and must have arisen by gene
duplication. In many cases, a whole cluster of genes is closely related to similar clusters
present elsewhere in the genome, suggesting that genes have been duplicated in
linked groups rather than as isolated individuals. According to one hypothesis, at
an early stage in the evolution of the vertebrates, the entire genome underwent
duplication twice in succession, giving rise to four copies of every gene.

The precise course of vertebrate genome evolution remains uncertain, because
many further evolutionary changes have occurred since these ancient events.

Figure 1-40 Drosophila melanogaster.
Molecular genetic studies on this fly have
provided the main key to understanding
how all animals develop from a fertilized
egg into an adult. (From E.B. Lewis,
Science 221:cover, 1983. With permission
from AAAS.)

Figure 1-41 Giant chromosomes from
salivary gland cells of Drosophila.
Because many rounds of DNA replication
have occurred without an intervening cell
division, each of the chromosomes in
these unusual cells contains over 1000
identical DNA molecules, all aligned in
register. This makes them easy to see in
the light microscope, where they display
a characteristic and reproducible banding
pattern. Specific bands can be identified as
the locations of specific genes: a mutant
fly with a region of the banding pattern
missing shows a phenotype reflecting loss
of the genes in that region. Genes that are
being transcribed at a high rate correspond
to bands with a “puffed” appearance.
The bands stained dark brown in the
micrograph are sites where a particular
regulatory protein is bound to the DNA.
(Courtesy of B. Zink and R. Paro, from
R. Paro, Trends Genet. 6:416-421, 1990.
With permission from Elsevier.)

Genes that were once identical have diverged; many of the gene copies have been
lost through disruptive mutations; some have undergone further rounds of local
duplication; and the genome, in each branch of the vertebrate family tree, has
suffered repeated rearrangements, breaking up most of the original gene
orderings. Comparison of the gene order in two related organisms, such as the human
and the mouse, reveals that—on the time scale of vertebrate
evolution—chromosomes frequently fuse and fragment to move large blocks of DNA sequence
around. Indeed, it is possible, as discussed in Chapter 4, that the present state
of affairs is the result of many separate duplications of fragments of the genome,
rather than duplications of the genome as a whole.

There is, however, no doubt that such whole-genome duplications do occur
from time to time in evolution, for we can see recent instances in which
duplicated chromosome sets are still clearly identifiable as such. The frog genus
Xenopus, for example, comprises a set of closely similar species related to one another
by repeated duplications or triplications of the whole genome. Among these frogs
are X. tropicalis, with an ordinary diploid genome; the common laboratory
species X. laevis, with a duplicated genome and twice as much DNA per cell; and
X. ruwenzoriensis, with a sixfold reduplication of the original genome and six
times as much DNA per cell (108 chromosomes, compared with 36 in X. laevis, for
example). These species are estimated to have diverged from one another within
the past 120 million years (Figure 1-42).


The Frog and the Zebrafish Provide Accessible Models for 
Vertebrate Development 

Frogs have long been used to study the early steps of embryonic development
in vertebrates, because their eggs are big, easy to manipulate, and fertilized
outside of the animal, so that the subsequent development of the early embryo is
easily followed (Figure 1-43). Xenopus laevis, in particular, continues to be an
important model organism, even though it is poorly suited for genetic analysis
(Movie 1.6 and see Movie 21.1).

The zebrafish Danio rerio has similar advantages, but without this drawback.
Its genome is compact—only half as big as that of a mouse or a human—and it
has a generation time of only about three months. Many mutants are known, and
genetic engineering is relatively easy. The zebrafish has the added virtue that it is
transparent for the first two weeks of its life, so that one can watch the behavior
of individual cells in the living organism (see Movie 21.2). All this has made it an
increasingly important model vertebrate (Figure 1-44).

The Mouse Is the Predominant Mammalian Model Organism

Mammals have typically two times as many genes as Drosophila, a genome that
is 16 times larger, and millions or billions of times as many cells in their adult
bodies. In terms of genome size and function, cell biology, and molecular
mechanisms, mammals are nevertheless a highly uniform group of organisms. Even
anatomically, the differences among mammals are chiefly a matter of size and
proportions; it is hard to think of a human body part that does not have a
counterpart in elephants and mice, and vice versa. Evolution plays freely with
quantitative features, but it does not readily change the logic of the structure.


Figure 1-42 Two species of the frog genus Xenopus. X. tropicalis, above,
has an ordinary diploid genome; X. laevis, below, has twice as much DNA per
cell. From the banding patterns of their chromosomes and the arrangement
of genes along them, as well as from comparisons of gene sequences, it is
clear that the large-genome species have evolved through duplications of
the whole genome. These duplications are thought to have occurred in the
aftermath of matings between frogs of slightly divergent Xenopus species.
(Courtesy of E. Amaya, M. Offield, and R. Grainger, Trends Genet. 14:253-255,
1998. With permission from Elsevier.)

Figure 1-43 Stages in the normal development of a frog. These drawings
show the development of a Rana pipiens tadpole from a fertilized egg. The
entire process takes place outside of the mother, making the mechanisms
involved readily accessible for experimental studies. (From W. Shumway,
Anat. Rec. 78:139-147, 1940.)

For a more exact measure of how closely mammalian species resemble one
another genetically, we can compare the nucleotide sequences of corresponding
(orthologous) genes, or the amino acid sequences of the proteins that these genes
encode. The results for individual genes and proteins vary widely. But typically, if
we line up the amino acid sequence of a human protein with that of the
orthologous protein from, say, an elephant, about 85% of the amino acids are identical.
A similar comparison between human and bird shows an amino acid identity of
about 70%—twice as many differences, because the bird and the mammalian
lineages have had twice as long to diverge as those of the elephant and the human
(Figure 1-45).
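
The identity figures above come from counting matching positions in aligned sequences. As a minimal sketch, assuming the two sequences have already been aligned to equal length with no gaps, the calculation looks like this in Python. The two short fragments are hypothetical strings invented for illustration, not real protein sequences.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of aligned positions at which two sequences are identical."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


# Hypothetical aligned fragments that differ at one position out of ten.
fragment_1 = "MVLSPADKTN"
fragment_2 = "MVLSAADKTN"
example_identity = percent_identity(fragment_1, fragment_2)
print(f"identity: {example_identity:.0f}%")

# The typical identities quoted in the text, expressed as differences.
elephant_differences = 100 - 85
bird_differences = 100 - 70
difference_ratio = bird_differences / elephant_differences
print(f"bird vs. elephant differences: {difference_ratio:.0f}x")
```

Expressed as differences rather than identities, 85% and 70% correspond to 15% and 30% of positions changed, which is the twofold relationship noted in the text.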

The mouse, being small, hardy, and a rapid breeder, has become the foremost
model organism for experimental studies of vertebrate molecular genetics. Many
naturally occurring mutations are known, often mimicking the effects of
corresponding mutations in humans (Figure 1-46). Methods have been developed,
moreover, to test the function of any chosen mouse gene, or of any noncoding
portion of the mouse genome, by artificially creating mutations in it, as we explain
later in the book.

Just one made-to-order mutant mouse can provide a wealth of information for
the cell biologist. It reveals the effects of the chosen mutation in a host of different
contexts, simultaneously testing the action of the gene in all the different kinds of
cells in the body that could in principle be affected.




Humans Report on Their Own Peculiarities 

As humans, we have a special interest in the human genome. We want to know the
full set of parts from which we are made, and to discover how they work. But even

Figure 1-44 Zebrafish as a model for
studies of vertebrate development. These
small, hardy tropical fish are convenient
for genetic studies. Additionally, they have
transparent embryos that develop outside
of the mother, so that one can clearly
observe cells moving and changing their
character in the living organism throughout
its development. (A) Adult fish. (B) An
embryo 24 hours after fertilization. (A, with
permission from Steve Baskauf; B, from
M. Rhinn et al., Neural Dev. 4:12, 2009.)

if you were a mouse, preoccupied with the molecular biology of mice, humans
would be attractive as model genetic organisms, because of one special property:
through medical examinations and self-reporting, we catalog our own genetic
(and other) disorders. The human population is enormous, consisting today
of some 7 billion individuals, and this self-documenting property means that a
huge database of information exists on human mutations. The human genome
sequence of more than 3 billion nucleotide pairs has been determined for
thousands of different people, making it easier than ever before to identify at a
molecular level the precise genetic change responsible for any given human mutant
phenotype.

By drawing together the insights from humans, mice, fish, flies, worms, yeasts,
plants, and bacteria—using gene sequence similarities to map out the
correspondences between one model organism and another—we are enriching our
understanding of them all.


Figure 1-45 Times of divergence of
different vertebrates. The scale on the left
shows the estimated date and geological
era of the last common ancestor of each
specified pair of animals. Each time
estimate is based on comparisons of the
amino acid sequences of orthologous
proteins; the longer the animals of a pair
have had to evolve independently, the
smaller the percentage of amino acids
that remain identical. The time scale
has been calibrated to match the fossil
evidence showing that the last common
ancestor of mammals and birds lived
310 million years ago.
The figures on the right give data on
sequence divergence for one particular
protein—the α chain of hemoglobin. Note
that although there is a clear general trend
of increasing divergence with increasing
time for this protein, there are irregularities
that are thought to reflect the action of
natural selection driving especially rapid
changes of hemoglobin sequence when
the organisms experienced special
physiological demands. Some proteins,
subject to stricter functional constraints,
evolve much more slowly than hemoglobin,
others as much as five times faster. All this
gives rise to substantial uncertainties in
estimates of divergence times, and some
experts believe that the major groups of
mammals diverged from one another as
much as 60 million years more recently
than shown here. (Adapted from S. Kumar
and S.B. Hedges, Nature 392:917-920,
1998. With permission from Macmillan
Publishers Ltd.)



Figure 1-46 Human and mouse: similar
genes and similar development. The
human baby and the mouse shown
here have similar white patches on their
foreheads because both have mutations in
the same gene (called Kit), required for the
development and maintenance of pigment
cells. (Courtesy of R.A. Fleischman.)

We Are All Different in Detail

What precisely do we mean when we speak of the human genome? Whose
genome? On average, any two people taken at random differ in about one or two
in every 1000 nucleotide pairs in their DNA sequence. The genome of the human
species is, properly speaking, a very complex thing, embracing the entire pool of
variant genes found in the human population. Knowledge of this variation is
helping us to understand, for example, why some people are prone to one disease,
others to another; why some respond well to a drug, others badly. It is also
providing clues to our history—the population movements and minglings of our
ancestors, the infections they suffered, the diets they ate. All these things have left traces
in the variant forms of genes that survive today in the human communities that
populate the globe.
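
To give a sense of scale, the rate of one or two differences per 1000 nucleotide pairs can be multiplied by the genome size of about 3200 million nucleotide pairs quoted earlier in this chapter. The Python sketch below does only this arithmetic; it assumes a single copy of the genome and treats the quoted rate as uniform, which is a simplification.

```python
# Approximate size of one copy of the human genome, as quoted in this chapter.
HUMAN_GENOME_BP = 3_200_000_000


def expected_differences(differences_per_thousand):
    """Estimate the number of differing nucleotide pairs between two people."""
    return HUMAN_GENOME_BP * differences_per_thousand // 1000


low_estimate = expected_differences(1)
high_estimate = expected_differences(2)
print(f"{low_estimate:,} to {high_estimate:,} differing nucleotide pairs")
```

On these assumptions, two people taken at random differ at several million positions in their DNA sequence.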

To Understand Cells and Organisms Will Require Mathematics,
Computers, and Quantitative Information

Empowered by knowledge of complete genome sequences, we can list the genes,
proteins, and RNA molecules in a cell, and we have methods that allow us to begin
to depict the complex web of interactions between them. But how are we to turn
all this information into an understanding of how cells work? Even for a single cell
type belonging to a single species of organism, the current deluge of data seems
overwhelming. The sort of informal reasoning on which biologists usually rely
seems totally inadequate in the face of such complexity.

In fact, the difficulty is more than just a matter of information overload.
Biological systems are, for example, full of feedback loops, and the behavior of even
the simplest of systems with feedback is remarkably difficult to predict by
intuition alone (Figure 1-47); small changes in parameters can cause radical changes
in outcome. To go from a circuit diagram to a prediction of the behavior of the
system, we need detailed quantitative information, and to draw deductions from
that information we need mathematics and computers.
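
The point about feedback can be made concrete with a toy calculation. The Python sketch below follows the concentration of a single protein that regulates its own gene, using simple Euler steps on the equation dp/dt = basal + beta * response(p) - gamma * p. The Hill-type response functions and every parameter value are assumptions chosen for illustration; they are not taken from this book or from any measured gene.

```python
def repressed(p, k, n):
    """Fraction of maximal transcription when the protein inhibits its own gene."""
    return 1.0 / (1.0 + (p / k) ** n)


def activated(p, k, n):
    """Fraction of maximal transcription when the protein stimulates its own gene."""
    x = (p / k) ** n
    return x / (1.0 + x)


def final_level(p0, response, basal=0.2, beta=10.0, gamma=1.0, k=5.0, n=4,
                dt=0.01, steps=5000):
    """Integrate dp/dt = basal + beta*response(p) - gamma*p from p0 by Euler steps."""
    p = p0
    for _ in range(steps):
        p += dt * (basal + beta * response(p, k, n) - gamma * p)
    return p


# Same circuit diagram, two starting concentrations, two signs of feedback.
neg_from_low = final_level(1.0, repressed)
neg_from_high = final_level(8.0, repressed)
pos_from_low = final_level(1.0, activated)
pos_from_high = final_level(8.0, activated)

print(f"negative feedback: {neg_from_low:.2f} and {neg_from_high:.2f}")
print(f"positive feedback: {pos_from_low:.2f} and {pos_from_high:.2f}")
```

With these assumed parameters, negative feedback brings both starting points to the same steady level, whereas positive feedback leaves the system at either a low or a high level depending on where it started, so that it behaves like a switch. Which behavior occurs cannot be read from the circuit diagram alone; it depends on the sign and steepness of the response and on the rate constants.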

Such tools for quantitative reasoning are essential, but they are not
all-powerful. You might think that, knowing how each protein influences each other
protein, and how the expression of each gene is regulated by the products of others,
we should soon be able to calculate how the cell as a whole will behave, just as
an astronomer can calculate the orbits of the planets, or a chemical engineer can
calculate the flows through a chemical plant. But any attempt to perform this feat
for anything close to an entire living cell rapidly reveals the limits of our present
knowledge. The information we have, plentiful as it is, is full of gaps and
uncertainties. Moreover, it is largely qualitative rather than quantitative. Most often, cell
biologists studying the cell's control systems sum up their knowledge in simple
schematic diagrams—this book is full of them—rather than in numbers, graphs,
and differential equations.

To progress from qualitative descriptions and intuitive reasoning to
quantitative descriptions and mathematical deduction is one of the biggest challenges for
contemporary cell biology. So far, the challenge has been met only for a few very
simple fragments of the machinery of living cells—subsystems involving a
handful of different proteins, or two or three cross-regulatory genes, where theory and
experiment go closely hand in hand. We discuss some of these examples later in
the book and devote the entire final section of Chapter 8 to the role of quantitation
in cell biology.

Knowledge and understanding bring the power to intervene—with humans,
to avoid or prevent disease; with plants, to create better crops; with bacteria, to
turn them to our own uses. All these biological enterprises are linked, because the
genetic information of all living organisms is written in the same language. The
new-found ability of molecular biologists to read and decipher this language has
already begun to transform our relationship to the living world. The account of
cell biology in the subsequent chapters will, we hope, equip the reader to
understand, and possibly to contribute to, the great scientific adventure of the
twenty-first century.

Figure 1-47 A very simple regulatory
circuit—a single gene regulating its
own expression by the binding of its
protein product to its own regulatory
DNA. Simple schematic diagrams such
as this are found throughout this book.
They are often used to summarize what
we know, but they leave many questions
unanswered. When the protein binds, does
it inhibit or stimulate transcription from the
gene? How steeply does the transcription
rate depend on the protein concentration?
How long, on average, does a molecule of
the protein remain bound to the DNA? How
long does it take to make each molecule
of mRNA or protein, and how quickly does
each type of molecule get degraded?
As explained in Chapter 8, mathematical
modeling shows that we need quantitative
answers to all these and other questions
before we can predict the behavior of
even this single-gene system. For different
parameter values, the system may settle
to a unique steady state; or it may behave
as a switch, capable of existing in one or
another of a set of alternative states; or it
may oscillate; or it may show large random
fluctuations.

Summary

Eukaryotic cells, by definition, keep their DNA in a separate membrane-enclosed
compartment, the nucleus. They have, in addition, a cytoskeleton for support and
movement, elaborate intracellular compartments for digestion and secretion, the
capacity (in many species) to engulf other cells, and a metabolism that depends on
the oxidation of organic molecules by mitochondria. These properties suggest that
eukaryotes may have originated as predators on other cells. Mitochondria—and,
in plants, chloroplasts—contain their own genetic material, and they evidently
evolved from bacteria that were taken up into the cytoplasm of ancient cells and
survived as symbionts.

Eukaryotic cells typically have 3-30 times as many genes as prokaryotes, and
often thousands of times more noncoding DNA. The noncoding DNA allows for
great complexity in the regulation of gene expression, as required for the
construction of complex multicellular organisms. Many eukaryotes are, however,
unicellular—among them the yeast Saccharomyces cerevisiae, which serves as a simple
model organism for eukaryotic cell biology, revealing the molecular basis of many
fundamental processes that have been strikingly conserved during a billion years of
evolution. A small number of other organisms have also been chosen for intensive
study: a worm, a fly, a fish, and the mouse serve as "model organisms" for
multicellular animals; and a small weed serves as a model for plants.

Powerful new technologies such as genome sequencing are producing striking
advances in our knowledge of human beings, and they are helping to advance our
understanding of human health and disease. But living systems are incredibly
complex, and mammalian genomes contain multiple closely related homologs of most
genes. This genetic redundancy has allowed diversification and specialization of
genes for new purposes, but it also makes biological mechanisms harder to
decipher. For this reason, simpler model organisms have played a key part in revealing
universal genetic mechanisms of animal development, and research using these
systems remains critical for driving scientific and medical advances.


WHAT WE DON'T KNOW


• What new approaches might
provide a clearer view of the anaerobic
archaeon that is thought to have
formed the nucleus of the first
eukaryotic cell? How did its symbiosis
with an aerobic bacterium lead to the
mitochondrion? Somewhere on Earth,
are there cells not yet identified that
can fill in the details of how eukaryotic
cells originated?

• DNA sequencing has revealed a rich
and previously undiscovered world
of microbial cells, the vast majority
of which fail to grow in a laboratory.
How might these cells be made more
accessible for detailed study?

• What new model cells or organisms 
should be developed for scientists to 
study? Why might a concerted focus 
on these models speed progress 
toward understanding a critical 
aspect of cell function that is poorly 
understood? 

• How did the first cell membranes 
arise? 


PROBLEMS 

Which statements are true? Explain why or why not. 

1-1 Each member of the human hemoglobin gene
family, which consists of seven genes arranged in two
clusters on different chromosomes, is an ortholog to all of the
other members.

1-2 Horizontal gene transfer is more prevalent in
single-celled organisms than in multicellular organisms.

1-3 Most of the DNA sequences in a bacterial genome
code for proteins, whereas most of the DNA sequences in
the human genome do not.

Discuss the following problems. 

1-4 Since it was deciphered four decades ago, some
have claimed that the genetic code must be a frozen
accident, while others have argued that it was shaped by
natural selection. A striking feature of the genetic code is its
inherent resistance to the effects of mutation. For example,
a change in the third position of a codon often specifies the
same amino acid or one with similar chemical properties.
The natural code resists mutation more effectively (is less
susceptible to error) than most other possible versions, as
illustrated in Figure Q1-1. Only one in a million
computer-generated “random” codes is more error-resistant than
the natural genetic code. Does the extraordinary mutation
resistance of the genetic code argue in favor of its origin as
a frozen accident or as a result of natural selection? Explain
your reasoning.



Figure Q1-1 Susceptibility to mutation of the natural code shown
relative to that of millions of computer-generated alternative genetic
codes (Problem 1-4). Susceptibility measures the average change in
amino acid properties caused by random mutations in a genetic code.
A small value indicates that mutations tend to cause minor changes.
(Data courtesy of Steve Freeland.)

1-5 You have begun to characterize a sample obtained from the depths of the
oceans on Europa, one of Jupiter's moons. Much to your surprise, the sample
contains a life-form that grows well in a rich broth. Your preliminary analysis
shows that it is cellular and contains DNA, RNA, and protein. When you show
your results to a colleague, she suggests that your sample was contaminated
with an organism from Earth. What approaches might you try to distinguish
between contamination and a novel cellular life-form based on DNA, RNA, and
protein?

1-6 It is not so difficult to imagine what it means to feed on the organic
molecules that living things produce. That is, after all, what we do. But what
does it mean to "feed" on sunlight, as phototrophs do? Or, even stranger, to
"feed" on rocks, as lithotrophs do? Where is the "food," for example, in the
mixture of chemicals (H₂S, H₂, CO, Mn⁺, Fe²⁺, Ni²⁺, CH₄, and NH₄⁺) that spews
from a hydrothermal vent?

1-7 How many possible different trees (branching patterns) can in theory be
drawn to display the evolution of bacteria, archaea, and eukaryotes, assuming
that they all arose from a common ancestor?

1-8 The genes for ribosomal RNA are highly conserved (relatively few sequence
changes) in all organisms on Earth; thus, they have evolved very slowly over
time. Were ribosomal RNA genes "born" perfect?

1-9 Genes participating in informational processes such as replication,
transcription, and translation are transferred between species much less often
than are genes involved in metabolism. The basis for this inequality is unclear
at present, but one suggestion is that it relates to the underlying complexity
of the two types of processes. Informational processes tend to involve large
aggregates of different gene products, whereas metabolic reactions are usually
catalyzed by enzymes composed of a single protein. Why would the complexity of
the underlying process, informational or metabolic, have any effect on the rate
of horizontal gene transfer?

1-10 Animal cells have neither cell walls nor chloroplasts, whereas plant cells
have both. Fungal cells are somewhere in between; they have cell walls but lack
chloroplasts. Are fungal cells more likely to be animal cells that gained the
ability to make cell walls, or plant cells that lost their chloroplasts? This
question represented a difficult issue for early investigators who sought to
assign evolutionary relationships based solely on cell characteristics and
morphology. How do you suppose that this question was eventually decided?





Figure Q1-2 Phylogenetic tree for hemoglobin genes from a variety 
of species (Problem 1-11). The legumes are highlighted in green. The 
lengths of lines that connect the present-day species represent the 
evolutionary distances that separate them. 

1-11 When plant hemoglobin genes were first discovered in legumes, it was so
surprising to find a gene typical of animal blood that it was hypothesized that
the plant gene arose by horizontal transfer from an animal. Many more
hemoglobin genes have now been sequenced, and a phylogenetic tree based on some
of these sequences is shown in Figure Q1-2.

A. Does this tree support or refute the hypothesis that the plant hemoglobins
arose by horizontal gene transfer?

B. Supposing that the plant hemoglobin genes were originally derived from a
parasitic nematode, for example, what would you expect the phylogenetic tree to
look like?

1-12 Rates of evolution appear to vary in different lineages. For example, the
rate of evolution in the rat lineage is significantly higher than in the human
lineage. These rate differences are apparent whether one looks at changes in
nucleotide sequences that encode proteins and are subject to selective pressure
or at changes in noncoding nucleotide sequences, which are not under obvious
selection pressure. Can you offer one or more possible explanations for the
slower rate of evolutionary change in the human lineage versus the rat lineage?


It is at first sight difficult to accept the idea that living creatures are
merely chemical systems. Their incredible diversity of form, their seemingly
purposeful behavior, and their ability to grow and reproduce all seem to set
them apart from the world of solids, liquids, and gases that chemistry normally
describes. Indeed, until the nineteenth century animals were believed to
contain a Vital Force (an "animus") that was responsible for their distinctive
properties.

We now know that there is nothing in living organisms that disobeys chemical
or physical laws. However, the chemistry of life is indeed special. First, it is
based overwhelmingly on carbon compounds, the study of which is known as organic
chemistry. Second, cells are 70% water, and life depends largely on chemical
reactions that take place in aqueous solution. Third, and most important, cell
chemistry is enormously complex: even the simplest cell is vastly more
complicated in its chemistry than any other chemical system known. In
particular, although cells contain a variety of small carbon-containing
molecules, most of the carbon atoms present are incorporated into enormous
polymeric molecules: chains of chemical subunits linked end-to-end. It is the
unique properties of these macromolecules that enable cells and organisms to
grow and reproduce, as well as to do all the other things that are
characteristic of life.

THE CHEMICAL COMPONENTS OF A CELL 

Living organisms are made of only a small selection of the 92 naturally occurring
elements, four of which, carbon (C), hydrogen (H), nitrogen (N), and oxygen (O),
make up 96.5% of an organism's weight (Figure 2-1). The atoms of these elements
are linked together by covalent bonds to form molecules (see Panel 2-1, pp.
90-91). Because covalent bonds are typically 100 times stronger than the thermal
energies within a cell, they resist being pulled apart by thermal motions, and they
are normally broken only during specific chemical reactions with other atoms and
molecules. Two different molecules can be held together by noncovalent bonds,
which are much weaker (Figure 2-2). We shall see later that noncovalent bonds
are important in the many situations where molecules have to associate and
dissociate readily to carry out their biological functions.

Figure 2-1 The main elements in cells, highlighted in the periodic table. When
ordered by their atomic number and arranged in this manner, elements fall into
vertical columns that show similar properties. Atoms in the same vertical
column must gain (or lose) the same number of electrons to attain a filled outer
shell, and they thus behave similarly in bond or ion formation. Thus, for
example, Mg and Ca tend to give away the two electrons in their outer shells.
C, N, and O occur in the same horizontal row, and tend to complete their second
shells by sharing electrons.

The four elements highlighted in red constitute 99% of the total number of
atoms present in the human body. An additional seven elements, highlighted in
blue, together represent about 0.9% of the total. The elements shown in green
are required in trace amounts by humans. It remains unclear whether those
elements shown in yellow are essential in humans. The chemistry of life, it
seems, is therefore predominantly the chemistry of lighter elements. The atomic
weights shown here are those of the most common isotope of each element.

Water Is Held Together by Hydrogen Bonds 

The reactions inside a cell occur in an aqueous environment. Life on Earth began
in the ocean, and the conditions in that primeval environment put a permanent
stamp on the chemistry of living things. Life therefore hinges on the chemical
properties of water, which are reviewed in Panel 2-2, pp. 92-93.

In each water molecule (H₂O) the two H atoms are linked to the O atom by
covalent bonds. The two bonds are highly polar because the O is strongly
attractive for electrons, whereas the H is only weakly attractive. Consequently,
there is an unequal distribution of electrons in a water molecule, with a
preponderance of positive charge on the two H atoms and of negative charge on
the O. When a positively charged region of one water molecule (that is, one of
its H atoms) approaches a negatively charged region (that is, the O) of a second
water molecule, the electrical attraction between them can result in a hydrogen
bond. These bonds are much weaker than covalent bonds and are easily broken by
the random thermal motions that reflect the heat energy of the molecules. Thus,
each bond lasts only a short time. But the combined effect of many weak bonds
can be profound. For example, each water molecule can form hydrogen bonds
through its two H atoms to two other water molecules, producing a network in
which hydrogen bonds are being continually broken and formed. It is only because
of the hydrogen bonds that link water molecules together that water is a liquid
at room temperature, with a high boiling point and high surface tension, rather
than a gas.

Molecules, such as alcohols, that contain polar bonds and that can form
hydrogen bonds with water dissolve readily in water. Molecules carrying charges
(ions) likewise interact favorably with water. Such molecules are termed
hydrophilic, meaning that they are water-loving. Many of the molecules in the
aqueous environment of a cell necessarily fall into this category, including
sugars, DNA, RNA, and most proteins. Hydrophobic (water-hating) molecules, by
contrast, are uncharged and form few or no hydrogen bonds, and so do not
dissolve in water. Hydrocarbons are an important example. In these molecules all
of the H atoms are covalently linked to C atoms by a largely nonpolar bond; thus
they cannot form effective hydrogen bonds to other molecules (see Panel 2-1,
p. 90). This makes the hydrocarbon as a whole hydrophobic, a property that is
exploited in cells, whose membranes are constructed from molecules that have
long hydrocarbon tails, as we see in Chapter 10.


Figure 2-2 Some energies important for cells. A crucial property of any bond,
covalent or noncovalent, is its strength. Bond strength is measured by the
amount of energy that must be supplied to break it, expressed in units of
either kilojoules per mole (kJ/mole) or kilocalories per mole (kcal/mole). Thus
if 100 kJ of energy must be supplied to break 6 × 10²³ bonds of a specific type
(that is, 1 mole of these bonds), then the strength of that bond is
100 kJ/mole. Note that, in this diagram, energies are compared on a logarithmic
scale. Typical strengths and lengths of the main classes of chemical bonds are
given in Table 2-1.

One joule (J) is the amount of energy required to move an object a distance of
one meter against a force of one newton. This measure of energy is derived from
the SI units (Système International d'Unités) universally employed by physical
scientists. A second unit of energy, often used by cell biologists, is the
kilocalorie (kcal); one calorie is the amount of energy needed to raise the
temperature of 1 gram of water by 1 °C. One kJ is equal to 0.239 kcal (1 kcal
= 4.18 kJ).
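
The unit relationships in this caption can be checked with a few lines of Python. This is only a sketch: the function names are illustrative, the factor 1 kcal = 4.18 kJ is the one quoted in the caption, and Avogadro's number is rounded to 6 × 10²³ as in the caption.

```python
KJ_PER_KCAL = 4.18      # conversion factor quoted in the caption
BONDS_PER_MOLE = 6e23   # Avogadro's number, rounded as in the caption


def kj_to_kcal(kj):
    """Convert an energy in kilojoules to kilocalories."""
    return kj / KJ_PER_KCAL


def kcal_to_kj(kcal):
    """Convert an energy in kilocalories to kilojoules."""
    return kcal * KJ_PER_KCAL


def energy_per_bond_joules(kj_per_mole):
    """Energy needed to break ONE bond, in joules, given a strength in kJ/mole."""
    return kj_per_mole * 1000.0 / BONDS_PER_MOLE


print(round(kj_to_kcal(1.0), 3))        # 1 kJ expressed in kcal
print(energy_per_bond_joules(100.0))    # joules per bond for a 100 kJ/mole bond
```

A bond of 100 kJ/mole therefore corresponds to roughly 1.7 × 10⁻¹⁹ J per individual bond, which is why bond strengths are quoted per mole rather than per bond.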


Four Types of Noncovalent Attractions Help Bring Molecules
Together in Cells

Much of biology depends on the specific binding of different molecules caused by
three types of noncovalent bonds: electrostatic attractions (ionic bonds),
hydrogen bonds, and van der Waals attractions; and on a fourth factor that can
push molecules together: the hydrophobic force. The properties of the four types
of noncovalent attractions are presented in Panel 2-3 (pp. 94-95). Although each
individual noncovalent attraction would be much too weak to be effective in the
face of thermal motions, their energies can sum to create a strong force between
two separate molecules. Thus sets of noncovalent attractions often allow the
complementary surfaces of two macromolecules to hold those two macromolecules
together (Figure 2-3).

Figure 2-3 Schematic indicating how two macromolecules with
complementary surfaces can bind tightly to one another through
noncovalent interactions. Noncovalent chemical bonds have less than
1/20 the strength of a covalent bond. They are able to produce tight
binding only when many of them are formed simultaneously. Although only
electrostatic attractions are illustrated here, in reality all four noncovalent
forces often contribute to holding two macromolecules together (Movie 2.1).

Table 2-1 compares noncovalent bond strengths to that of a typical covalent
bond, both in the presence and in the absence of water. Note that, by forming
competing interactions with the involved molecules, water greatly reduces the
strength of both electrostatic attractions and hydrogen bonds.
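
To make the comparison concrete, the sketch below takes the values listed in Table 2-1 and computes how much water weakens each bond type, and how many bonds of each type would be needed for their summed strength in water to match one covalent bond. The dictionary and function names are illustrative, and simple addition of bond energies is a simplifying assumption, not a claim about real binding energetics.

```python
import math

# (strength in vacuum, strength in water), in kJ/mole, as listed in Table 2-1
BOND_STRENGTH_KJ_PER_MOLE = {
    "covalent": (377.0, 377.0),
    "ionic": (335.0, 12.6),
    "hydrogen": (16.7, 4.2),
    "van der Waals": (0.4, 0.4),
}


def weakening_by_water(bond):
    """Fold reduction in bond strength on going from vacuum to water."""
    in_vacuum, in_water = BOND_STRENGTH_KJ_PER_MOLE[bond]
    return in_vacuum / in_water


def bonds_to_match_covalent(bond):
    """Bonds of this type (in water) whose summed strength reaches one covalent bond."""
    covalent_in_water = BOND_STRENGTH_KJ_PER_MOLE["covalent"][1]
    return math.ceil(covalent_in_water / BOND_STRENGTH_KJ_PER_MOLE[bond][1])


for name in BOND_STRENGTH_KJ_PER_MOLE:
    print(name, round(weakening_by_water(name), 1), bonds_to_match_covalent(name))
```

On these numbers, water weakens an ionic bond far more than a hydrogen bond and leaves van der Waals attractions unchanged, and tens of hydrogen bonds are needed to rival a single covalent bond.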

The structure of a typical hydrogen bond is illustrated in Figure 2-4. This bond
represents a special form of polar interaction in which an electropositive
hydrogen atom is shared by two electronegative atoms. Its hydrogen can be viewed
as a proton that has partially dissociated from a donor atom, allowing it to be
shared by a second acceptor atom. Unlike a typical electrostatic interaction,
this bond is highly directional, being strongest when a straight line can be
drawn between all three of the involved atoms.

The fourth effect that often brings molecules together in water is not, strictly
speaking, a bond at all. However, a very important hydrophobic force is caused by
a pushing of nonpolar surfaces out of the hydrogen-bonded water network, where
they would otherwise physically interfere with the highly favorable interactions
between water molecules. Bringing any two nonpolar surfaces together reduces
their contact with water; in this sense, the force is nonspecific. Nevertheless, we
shall see in Chapter 3 that hydrophobic forces are central to the proper folding of
protein molecules.

Some Polar Molecules Form Acids and Bases in Water

One of the simplest kinds of chemical reaction, and one that has profound
significance in cells, takes place when a molecule containing a highly polar
covalent bond between a hydrogen and another atom dissolves in water. The
hydrogen atom in such a molecule has given up its electron almost entirely to
the companion atom, and so exists as an almost naked positively charged hydrogen
nucleus; in other words, a proton (H⁺). When the polar molecule becomes
surrounded by water molecules, the proton will be attracted to the partial
negative charge on the O atom of an adjacent water molecule. This proton can
easily dissociate from its original partner and associate instead with the
oxygen atom of the water molecule, generating a hydronium ion (H₃O⁺)
(Figure 2-5A). The reverse reaction also takes place very readily, so in the
aqueous solution protons are constantly flitting to and fro between one molecule
and another.

TABLE 2-1 Covalent and Noncovalent Chemical Bonds

                                                   Strength kJ/mole**
Bond type                          Length (nm)     in vacuum     in water
Covalent                           0.15            377 (90)      377 (90)
Noncovalent
  ionic*                           0.25            335 (80)      12.6 (3)
  hydrogen                         0.30            16.7 (4)      4.2 (1)
  van der Waals attraction
  (per atom)                       0.35            0.4 (0.1)     0.4 (0.1)

*An ionic bond is an electrostatic attraction between two fully charged atoms.
**Values in parentheses are kcal/mole. 1 kJ = 0.239 kcal and 1 kcal = 4.18 kJ.

Figure 2-4 Hydrogen bonds. (A) Ball-and-stick model of a typical hydrogen bond.
The distance between the hydrogen and the oxygen atom here is less than the sum
of their van der Waals radii, indicating a partial sharing of electrons. (B) The
most common hydrogen bonds in cells.

Figure 2-5 Protons readily move in aqueous solutions. (A) The reaction that
takes place when a molecule of acetic acid dissolves in water. At pH 7, nearly
all of the acetic acid is present as acetate ion. (B) Water molecules are
continuously exchanging protons with each other to form hydronium and hydroxyl
ions. These ions in turn rapidly recombine to form water molecules.

Substances that release protons when they dissolve in water, thus forming
H₃O⁺, are termed acids. The higher the concentration of H₃O⁺, the more acidic
the solution. H₃O⁺ is present even in pure water, at a concentration of 10⁻⁷ M,
as a result of the movement of protons from one water molecule to another
(Figure 2-5B). By convention, the H₃O⁺ concentration is usually referred to as
the H⁺ concentration, even though most protons in an aqueous solution are
present as H₃O⁺. To avoid the use of unwieldy numbers, the concentration of H₃O⁺
is expressed using a logarithmic scale called the pH scale. Pure water has a pH
of 7.0 and is said to be neutral; that is, neither acidic (pH <7) nor basic
(pH >7).
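
The logarithmic scale referred to here is the standard definition pH = −log₁₀[H₃O⁺], with the concentration in moles per liter. A minimal Python sketch of that relationship follows; the function names and the small tolerance used to decide neutrality are illustrative choices, not part of the text.

```python
import math


def ph_from_h3o(concentration_molar):
    """pH for a hydronium ion concentration given in mol/L."""
    if concentration_molar <= 0:
        raise ValueError("concentration must be positive")
    return -math.log10(concentration_molar)


def h3o_from_ph(ph):
    """Hydronium ion concentration in mol/L for a given pH."""
    return 10.0 ** (-ph)


def classify(ph, tolerance=1e-9):
    """Label a solution as acidic, neutral, or basic relative to pH 7."""
    if abs(ph - 7.0) <= tolerance:
        return "neutral"
    return "acidic" if ph < 7.0 else "basic"


for conc in (1e-3, 1e-7, 1e-10):
    ph = ph_from_h3o(conc)
    print(f"[H3O+] = {conc:g} M -> pH {ph:.1f} ({classify(ph)})")
```

Because the scale is logarithmic, each unit of pH corresponds to a tenfold change in H₃O⁺ concentration.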

Acids are characterized as being strong or weak, depending on how readily
they give up their protons to water. Strong acids, such as hydrochloric acid (HCl),
lose their protons readily. Acetic acid, on the other hand, is a weak acid because
it holds on to its proton more tightly when dissolved in water. Many of the acids
important in the cell, such as molecules containing a carboxyl (COOH) group,
are weak acids (see Panel 2-2, pp. 92-93).

Because the proton of a hydronium ion can be passed readily to many types of
molecules in cells, altering their character, the concentration of H₃O⁺ inside a cell
(the acidity) must be closely regulated. Acids, especially weak acids, will give up
their protons more readily if the concentration of H₃O⁺ in solution is low and will
tend to receive them back if the concentration in solution is high.
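
This dependence on H₃O⁺ concentration can be put in numbers with the Henderson-Hasselbalch relation, which is not stated in the text and is used here as an assumption. The sketch below computes the fraction of a weak acid that has given up its proton at a given pH; the pKa of about 4.76 for acetic acid is a commonly quoted value, not a figure from this chapter.

```python
ACETIC_ACID_PKA = 4.76  # assumed textbook value, not taken from this chapter


def fraction_deprotonated(ph, pka):
    """Fraction of a weak acid present in its proton-free form at a given pH.

    Follows from the Henderson-Hasselbalch relation:
    [A-] / [HA] = 10 ** (pH - pKa)
    """
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


for ph in (3.0, 4.76, 7.0):
    share = fraction_deprotonated(ph, ACETIC_ACID_PKA)
    print(f"pH {ph}: {share:.1%} present as acetate")
```

Under these assumptions more than 99% of acetic acid is present as acetate at pH 7, whereas in a more acidic solution most of it holds on to its proton.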

The opposite of an acid is a base. Any molecule capable of accepting a proton
from a water molecule is called a base. Sodium hydroxide (NaOH) is basic (the
term alkaline is also used) because it dissociates readily in aqueous solution to
form Na⁺ ions and OH⁻ ions. Because of this property, NaOH is called a strong
base. More important in living cells, however, are the weak bases: those that
have a weak tendency to reversibly accept a proton from water. Many biologically
important molecules contain an amino (NH₂) group. This group is a weak base
that can generate OH⁻ by taking a proton from water: -NH₂ + H₂O → -NH₃⁺ + OH⁻
(see Panel 2-2, pp. 92-93).

Because an OH⁻ ion combines with a H₃O⁺ ion to form two water molecules,
an increase in the OH⁻ concentration forces a decrease in the concentration of
H₃O⁺, and vice versa. A pure solution of water contains an equal concentration
(10⁻⁷ M) of both ions, rendering it neutral. The interior of a cell is also kept close
to neutrality by the presence of buffers: weak acids and bases that can release or
take up protons near pH 7, keeping the environment of the cell relatively constant
under a variety of conditions.
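
The reciprocal relationship between the two ions can be sketched numerically. The example assumes the standard ion product of water, [H₃O⁺][OH⁻] = 10⁻¹⁴ M² at about 25 °C; this constant is consistent with the 10⁻⁷ M figure given above for pure water but is not stated in the text, and the names are illustrative.

```python
ION_PRODUCT_OF_WATER = 1e-14  # [H3O+][OH-] in M^2; assumed value for ~25 degrees C


def hydroxide_from_hydronium(h3o_molar):
    """OH- concentration (mol/L) implied by a given H3O+ concentration."""
    return ION_PRODUCT_OF_WATER / h3o_molar


for h3o in (1e-3, 1e-7, 1e-11):
    print(f"[H3O+] = {h3o:g} M  ->  [OH-] = {hydroxide_from_hydronium(h3o):g} M")
```

Raising one concentration tenfold lowers the other tenfold, and the two are equal only at 10⁻⁷ M, the neutral point.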
A Cell Is Formed from Carbon Compounds

Having reviewed the ways atoms combine into molecules and how these molecules
behave in an aqueous environment, we now examine the main classes of
small molecules found in cells. We shall see that a few categories of molecules,
formed from a handful of different elements, give rise to all the extraordinary
richness of form and behavior shown by living things.

If we disregard water and inorganic ions such as potassium, nearly all the
molecules in a cell are based on carbon. Carbon is outstanding among all the
elements in its ability to form large molecules; silicon is a poor second. Because
carbon is small and has four electrons and four vacancies in its outermost shell, a
carbon atom can form four covalent bonds with other atoms. Most important, one
carbon atom can join to other carbon atoms through highly stable covalent C-C
bonds to form chains and rings and hence generate large and complex molecules
with no obvious upper limit to their size. The carbon compounds made by cells
are called organic molecules. In contrast, all other molecules, including water, are
said to be inorganic.

Certain combinations of atoms, such as the methyl (-CH₃), hydroxyl (-OH),
carboxyl (-COOH), carbonyl (-C=O), phosphate (-PO₃²⁻), sulfhydryl (-SH), and
amino (-NH₂) groups, occur repeatedly in the molecules made by cells. Each such
chemical group has distinct chemical and physical properties that influence the
behavior of the molecule in which the group occurs. The most common chemical
groups and some of their properties are summarized in Panel 2-1, pp. 90-91.

Cells Contain Four Major Families of Small Organic Molecules

The small organic molecules of the cell are carbon-based compounds that have
molecular weights in the range of 100-1000 and contain up to 30 or so carbon
atoms. They are usually found free in solution and have many different fates. Some
are used as monomer subunits to construct giant polymeric macromolecules:
proteins, nucleic acids, and large polysaccharides. Others act as energy sources
and are broken down and transformed into other small molecules in a maze of
intracellular metabolic pathways. Many small molecules have more than one role
in the cell, for example acting both as a potential subunit for a macromolecule
and as an energy source. Small organic molecules are much less abundant than
the organic macromolecules, accounting for only about one-tenth of the total
mass of organic matter in a cell. As a rough guess, there may be a thousand
different kinds of these small molecules in a typical cell.

All organic molecules are synthesized from and are broken down into the
same set of simple compounds. As a consequence, the compounds in a cell are
chemically related and most can be classified into a few distinct families. Broadly
speaking, cells contain four major families of small organic molecules: the sugars,
the fatty acids, the nucleotides, and the amino acids (Figure 2-6). Although many
compounds present in cells do not fit into these categories, these four families
of small organic molecules, together with the macromolecules made by linking
them into long chains, account for a large fraction of the cell mass.

Amino acids and the proteins that they form will be the subject of Chapter
3. A summary of the structures and properties of the remaining three families
(sugars, fatty acids, and nucleotides) is presented in Panels 2-4, 2-5, and 2-6,
respectively (see pages 96-101).

The Chemistry of Cells Is Dominated by Macromolecules with 
Remarkable Properties 

By weight, macromolecules are the most abundant carbon-containing molecules
in a living cell (Figure 2-7). They are the principal building blocks from which
a cell is constructed and also the components that confer the most distinctive
properties of living things. The macromolecules in cells are polymers that are
constructed by covalently linking small organic molecules (called monomers) into
long chains (Figure 2-8). They have remarkable properties that could not have
been predicted from their simple constituents.

Figure 2-6 The four main families of small organic molecules in
cells. These small molecules form the monomeric building blocks, or
subunits, for most of the macromolecules and other assemblies of the
cell. Some, such as the sugars and the fatty acids, are also energy
sources. Their structures are outlined here and shown in more detail in
the Panels at the end of this chapter and in Chapter 3.

Proteins are abundant and spectacularly versatile, performing thousands of
distinct functions in cells. Many proteins serve as enzymes, the catalysts that
facilitate the many covalent bond-making and bond-breaking reactions that the cell
needs. Enzymes catalyze all of the reactions whereby cells extract energy from
food molecules, for example, and an enzyme called ribulose bisphosphate
carboxylase helps to convert CO₂ to sugars in photosynthetic organisms, producing
most of the organic matter needed for life on Earth. Other proteins are used to
build structural components, such as tubulin, a protein that self-assembles to
make the cell's long microtubules, or histones, proteins that compact the DNA in
chromosomes. Yet other proteins act as molecular motors to produce force and
movement, as for myosin in muscle. Proteins perform many other functions, and
we shall examine the molecular basis for many of them later in this book.

Figure 2-7 The distribution of molecules in cells. The approximate composition
of a bacterial cell is shown by weight. The composition of an animal cell is
similar, even though its volume is roughly 1000 times greater. Note that
macromolecules dominate. The major inorganic ions include Na⁺, K⁺, Mg²⁺, Ca²⁺,
and Cl⁻.

Although the chemical reactions for adding subunits to each polymer are
different in detail for proteins, nucleic acids, and polysaccharides, they share
important features. Each polymer grows by the addition of a monomer onto the end
of a growing chain in a condensation reaction, in which one molecule of water is
lost with each subunit added (Figure 2-9). The stepwise polymerization of
monomers into a long chain is a simple way to manufacture a large, complex
molecule, since the subunits are added by the same reaction performed over and
over again by the same set of enzymes. Apart from some of the polysaccharides,
most macromolecules are made from a limited set of monomers that are slightly
different from one another, for example the 20 different amino acids from which
proteins are made. It is critical to life that the polymer chain is not assembled
at random from these subunits; instead the subunits are added in a precise order,
or sequence. The elaborate mechanisms that allow enzymes to accomplish this task
are described in detail in Chapters 5 and 6.
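
Two consequences of this paragraph are simple to count: how many water molecules are released in building a linear chain, and how many distinct sequences a limited set of monomers allows. The sketch below uses illustrative function names and assumes an unbranched chain built by adding one subunit at a time to a single starting monomer.

```python
def waters_released(n_monomers):
    """Water molecules lost in assembling a linear chain of n monomers.

    One water is lost per subunit added to the growing chain, so a chain
    of n monomers requires n - 1 condensation reactions.
    """
    if n_monomers < 1:
        raise ValueError("a chain needs at least one monomer")
    return n_monomers - 1


def possible_sequences(chain_length, n_monomer_types=20):
    """Number of distinct linear sequences of a given length."""
    return n_monomer_types ** chain_length


print(waters_released(100))             # condensations for a 100-subunit chain
print(possible_sequences(3))            # tripeptides from 20 amino acids
print(len(str(possible_sequences(100))))  # digits in the count for 100 residues
```

Even a short chain has far more possible sequences than a cell could ever sample at random, which is why the order of addition has to be specified.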


Figure 2-8 Three families of macromolecules. Each is a polymer formed from
small molecules (called monomers) linked together by covalent bonds.


Noncovalent Bonds Specify Both the Precise Shape of a
Macromolecule and Its Binding to Other Molecules

Most of the covalent bonds in a macromolecule allow rotation of the atoms they
join, giving the polymer chain great flexibility. In principle, this allows a
macromolecule to adopt an almost unlimited number of shapes, or conformations, as
random thermal energy causes the polymer chain to writhe and rotate. However,
the shapes of most biological macromolecules are highly constrained because of
the many weak noncovalent bonds that form between different parts of the same
molecule. If these noncovalent bonds are formed in sufficient numbers, the
polymer chain can strongly prefer one particular conformation, determined by the
linear sequence of monomers in its chain. Most protein molecules and many of
the small RNA molecules found in cells fold tightly into a highly preferred
conformation in this way (Figure 2-10).

The four types of noncovalent interactions important in biological molecules
were presented earlier, and they are discussed further in Panel 2-3 (pp. 94-95). In
addition to folding biological macromolecules into unique shapes, they can also
add up to create a strong attraction between two different molecules (see Figure
2-3). This form of molecular interaction provides for great specificity, inasmuch
as the close multipoint contacts required for strong binding make it possible for a
macromolecule to select out, through binding, just one of the many thousands
of other types of molecules present inside a cell. Moreover, because the strength of
the binding depends on the number of noncovalent bonds that are formed,
interactions of almost any affinity are possible, allowing rapid dissociation where
appropriate.

As we discuss next, binding of this type underlies all biological catalysis,
making it possible for proteins to function as enzymes. In addition, noncovalent
interactions allow macromolecules to be used as building blocks for the formation
of larger structures, thereby forming intricate machines with multiple moving parts
that perform such complex tasks as DNA replication and protein synthesis
(Figure 2-11).

Figure 2-9 Condensation and hydrolysis as opposite reactions. The macromolecules of the cell
are polymers that are formed from subunits (or monomers) by a condensation reaction, and they
are broken down by hydrolysis. The condensation reactions are all energetically unfavorable; thus
polymer formation requires an energy input, as will be described in the text.
[Scheme: CONDENSATION, A-H + HO-B → A-B + H₂O (energetically unfavorable);
HYDROLYSIS, A-B + H₂O → A-H + HO-B (energetically favorable).]

Figure 2-10 The folding of proteins and RNA molecules into a particularly
stable three-dimensional shape, or conformation. If the noncovalent bonds
maintaining the stable conformation are disrupted, the molecule becomes a flexible
chain that loses its biological activity.

Summary 

Living organisms are autonomous, self-propagating chemical systems. They are
formed from a distinctive and restricted set of small carbon-based molecules that
are essentially the same for every living species. Each of these small molecules is
composed of a small set of atoms linked to each other in a precise configuration
through covalent bonds. The main categories are sugars, fatty acids, amino acids,
and nucleotides. Sugars are a primary source of chemical energy for cells and can be
incorporated into polysaccharides for energy storage. Fatty acids are also
important for energy storage, but their most critical function is in the formation
of cell membranes. Long chains of amino acids form the remarkably diverse and
versatile macromolecules known as proteins. Nucleotides play a central part in
energy transfer, while also serving as the subunits for the informational
macromolecules, RNA and DNA.

Most of the dry mass of a cell consists of macromolecules that have been
produced as linear polymers of amino acids (proteins) or nucleotides (DNA and RNA),
covalently linked to each other in an exact order. Most of the protein molecules
and many of the RNAs fold into a unique conformation that is determined by their
sequence of subunits. This folding process creates unique surfaces, and it depends
on a large set of weak attractions produced by noncovalent forces between atoms.
These forces are of four types: electrostatic attractions, hydrogen bonds, van der
Waals attractions, and an interaction between nonpolar groups caused by their
hydrophobic expulsion from water. The same set of weak forces governs the specific
binding of other molecules to macromolecules, making possible the myriad
associations between biological molecules that produce the structure and the
chemistry of a cell.

Figure 2-11 Small molecules become covalently linked to form macromolecules,
which in turn assemble through noncovalent interactions to form large complexes.
Small molecules, proteins, and a ribosome are drawn approximately to scale.
Ribosomes are a central part of the machinery that the cell uses to make
proteins: each ribosome is formed as a complex of about 90 macromolecules
(protein and RNA molecules).


CATALYSIS AND THE USE OF ENERGY BY CELLS 

One property of living things above all makes them seem almost miraculously different 
from nonliving matter: they create and maintain order, in a universe that is 
tending always to greater disorder (Figure 2-12). To create this order, the cells in 
a living organism must perform a never-ending stream of chemical reactions. In 
some of these reactions, small organic molecules—amino acids, sugars, nucleotides, 
and lipids—are being taken apart or modified to supply the many other 
small molecules that the cell requires. In other reactions, small molecules are 
being used to construct an enormously diverse range of proteins, nucleic acids, 
and other macromolecules that endow living systems with all of their most distinctive 
properties. Each cell can be viewed as a tiny chemical factory, performing 
many millions of reactions every second. 

Cell Metabolism Is Organized by Enzymes 

The chemical reactions that a cell carries out would normally occur only at much 
higher temperatures than those existing inside cells. For this reason, each reaction 
requires a specific boost in chemical reactivity. This requirement is crucial, 
because it allows the cell to control its chemistry. The control is exerted through 
specialized biological catalysts. These are almost always proteins called enzymes, 
although RNA catalysts also exist, called ribozymes. Each enzyme accelerates, or 
catalyzes, just one of the many possible kinds of reactions that a particular molecule 
might undergo. Enzyme-catalyzed reactions are connected in series, so 
that the product of one reaction becomes the starting material, or substrate, for 
the next (Figure 2-13). Long linear reaction pathways are in turn linked to one 
another, forming a maze of interconnected reactions that enable the cell to survive, 
grow, and reproduce. 

Two opposing streams of chemical reactions occur in cells: (1) the catabolic 
pathways break down foodstuffs into smaller molecules, thereby generating both 
a useful form of energy for the cell and some of the small molecules that the cell 
needs as building blocks, and (2) the anabolic, or biosynthetic, pathways use the 
small molecules and the energy harnessed by catabolism to drive the synthesis of 
the many other molecules that form the cell. Together these two sets of reactions 
constitute the metabolism of the cell (Figure 2-14). 

Figure 2-12 Biological structures are highly ordered. Well-defined, ornate, and beautiful spatial patterns can be found at every level of 
organization in living organisms. In order of increasing size: (A) protein molecules in the coat of a virus (a parasite that, although not technically alive, 
contains the same types of molecules as those found in living cells); (B) the regular array of microtubules seen in a cross section of a sperm tail; 
(C) surface contours of a pollen grain (a single cell); (D) cross section of a fern stem, showing the patterned arrangement of cells; and (E) a spiral 
arrangement of leaves in a succulent plant. (A, courtesy of Robert Grant, Stéphane Crainic, and James M. Hogle; B, courtesy of Lewis Tilney; 
C, courtesy of Colin MacFarlane and Chris Jeffree; D, courtesy of Jim Haseloff.) 

Figure 2-13 How a set of enzyme-catalyzed reactions generates a metabolic pathway. Each enzyme catalyzes a particular 
chemical reaction, leaving the enzyme unchanged. In this example, a set of enzymes acting in series converts molecule A to 
molecule F, forming a metabolic pathway. (For a diagram of many of the reactions in a human cell, abbreviated as shown, see 
Figure 2-63.) 

The details of cell metabolism form the traditional subject of biochemistry and 
most of them need not concern us here. But the general principles by which cells 
obtain energy from their environment and use it to create order are central to cell 
biology. We begin with a discussion of why a constant input of energy is needed 
to sustain all living things. 

Biological Order Is Made Possible by the Release of Heat Energy 
from Cells 

The universal tendency of things to become disordered is a fundamental law of 
physics—the second law of thermodynamics—which states that in the universe, or 
in any isolated system (a collection of matter that is completely isolated from the 
rest of the universe), the degree of disorder always increases. This law has such 
profound implications for life that we will restate it in several ways. 

For example, we can present the second law in terms of probability by stating 
that systems will change spontaneously toward those arrangements that have the 
greatest probability. If we consider a box of 100 coins all lying heads up, a series 
of accidents that disturbs the box will tend to move the arrangement toward a 
mixture of 50 heads and 50 tails. The reason is simple: there is a huge number 
of possible arrangements of the individual coins in the mixture that can achieve 
the 50-50 result, but only one possible arrangement that keeps all of the coins 
oriented heads up. Because the 50-50 mixture is therefore the most probable, we 
say that it is more "disordered." For the same reason, it is a common experience 
that one's living space will become increasingly disordered without intentional 
effort: the movement toward disorder is a spontaneous process, requiring a periodic 
effort to reverse it (Figure 2-15). 
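
The counting behind the coin example can be checked directly. The short Python sketch below is an illustration added here, not part of the original argument; the function names are hypothetical, and the probability calculation assumes that a thoroughly disturbed coin is equally likely to end up heads or tails, independently of the others.

```python
from math import comb


def arrangements(n_coins: int, n_heads: int) -> int:
    """Number of distinct ways to have exactly n_heads heads among n_coins."""
    return comb(n_coins, n_heads)


def probability(n_coins: int, n_heads: int) -> float:
    """Chance of exactly n_heads heads, assuming fair and independent coins."""
    return comb(n_coins, n_heads) / 2 ** n_coins


if __name__ == "__main__":
    n = 100
    print("arrangements with all heads:", arrangements(n, n))
    print("arrangements with 50 heads: ", arrangements(n, n // 2))
    print("probability of exactly 50 heads:", probability(n, n // 2))
```

Under these assumptions there is a single all-heads arrangement against roughly 10^29 arrangements that give an exact 50-50 split, which is why random disturbance moves the box toward the mixed state and essentially never back.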

The amount of disorder in a system can be quantified and expressed as the 
entropy of the system: the greater the disorder, the greater the entropy. Thus, 
another way to express the second law of thermodynamics is to say that systems 
will change spontaneously toward arrangements with greater entropy. 

Living cells—by surviving, growing, and forming complex organisms—are 
generating order and thus might appear to defy the second law of thermodynamics. 
How is this possible? The answer is that a cell is not an isolated system: it takes 
in energy from its environment in the form of food, or as photons from the sun (or 
even, as in some chemosynthetic bacteria, from inorganic molecules alone). It 
then uses this energy to generate order within itself. In the course of the chemical 
reactions that generate order, the cell converts part of the energy it uses into heat. 
The heat is discharged into the cell's environment and disorders the surroundings. 
As a result, the total entropy—that of the cell plus its surroundings—increases, as 
demanded by the second law of thermodynamics. 
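
This bookkeeping can be written compactly. The symbols below are notation introduced here for illustration and do not appear in the text: the entropy changes of the cell and of its surroundings, and the heat q released by the cell at absolute temperature T. The relation between released heat and the entropy gained by the surroundings assumes surroundings that stay at a constant temperature.

```latex
\Delta S_{\text{total}} = \Delta S_{\text{cell}} + \Delta S_{\text{surroundings}} > 0,
\qquad
\Delta S_{\text{cell}} < 0,
\qquad
\Delta S_{\text{surroundings}} = \frac{q}{T} > \left| \Delta S_{\text{cell}} \right|
```

Read from left to right: the total must rise, the cell's own entropy falls as it builds order, and the heat it releases raises the entropy of the surroundings by more than the cell's entropy fell.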

To understand the principles governing these energy conversions, think 
of a cell surrounded by a sea of matter representing the rest of the universe. As 
the cell lives and grows, it creates internal order. But it constantly releases heat 
energy as it synthesizes molecules and assembles them into cell structures. Heat 
is energy in its most disordered form—the random jostling of molecules. When 
the cell releases heat to the sea, it increases the intensity of molecular motions 
there (thermal motion)—thereby increasing the randomness, or disorder, of the 
sea. The second law of thermodynamics is satisfied because the increase in the 
amount of order inside the cell is always more than compensated for by an even 
greater decrease in order (increase in entropy) in the surrounding sea of matter 
(Figure 2-16). 

Figure 2-14 Schematic representation of the relationship between catabolic and 
anabolic pathways in metabolism. As suggested in this diagram, a major portion 
of the energy stored in the chemical bonds of food molecules is dissipated as heat. In 
addition, the mass of food required by any organism that derives all of its energy from 
catabolism is much greater than the mass of the molecules that it can produce by 
anabolism. 

Figure 2-15 An everyday illustration of the spontaneous drive toward disorder. 
Reversing this tendency toward disorder requires an intentional effort and an input of 
energy: it is not spontaneous. In fact, from the second law of thermodynamics, we 
can be certain that the human intervention required will release enough heat to the 
environment to more than compensate for the reordering of the items in this room. 

Where does the heat that the cell releases come from? Here we encounter 
another important law of thermodynamics. The first law of thermodynamics 
states that energy can be converted from one form to another, but that it cannot 
be created or destroyed. Figure 2-17 illustrates some interconversions between 
different forms of energy. The amount of energy in different forms will change 
as a result of the chemical reactions inside the cell, but the first law tells us that 
the total amount of energy must always be the same. For example, an animal cell 
takes in foodstuffs and converts some of the energy present in the chemical bonds 
between the atoms of these food molecules (chemical-bond energy) into the random 
thermal motion of molecules (heat energy). 
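
The falling brick of Figure 2-17 gives a simple numerical check of the first law: the potential energy the brick has at its starting height must reappear, in full, as heat once it has come to rest on the floor. The Python sketch below is only an illustration; the mass and height are made-up values, the function name is hypothetical, and the standard value of gravitational acceleration is assumed.

```python
G = 9.81  # m/s^2, assumed standard gravitational acceleration at Earth's surface


def heat_released_joules(mass_kg: float, height_m: float) -> float:
    """Potential energy m*g*h of a raised brick.

    By conservation of energy, this is also the heat released after the
    brick has fallen and come to rest.
    """
    return mass_kg * G * height_m


if __name__ == "__main__":
    # Illustrative values only: a 2 kg brick dropped from 1.5 m.
    print(f"{heat_released_joules(2.0, 1.5):.2f} J")
```

The same accounting applies to a cell: the chemical-bond energy that disappears from food molecules must equal the energy that reappears in other forms, heat included.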

The cell cannot derive any benefit from the heat energy it releases unless the 
heat-generating reactions inside the cell are directly linked to the processes that 
generate molecular order. It is the tight coupling of heat production to an increase 
in order that distinguishes the metabolism of a cell from the wasteful burning of 
fuel in a fire. Later, we illustrate how this coupling occurs. For now, it is sufficient 
to recognize that a direct linkage of the "controlled burning" of food molecules to 
the generation of biological order is required for cells to create and maintain an 
island of order in a universe tending toward chaos. 

Figure 2-16 A simple thermodynamic analysis of a living cell. In the diagram on 
the left, the molecules of both the cell and the rest of the universe (the sea of matter) 
are depicted in a relatively disordered state. In the diagram on the right, the cell 
has taken in energy from food molecules and released heat through reactions that 
order the molecules the cell contains. The heat released increases the disorder in the 
environment around the cell (depicted by jagged arrows and distorted molecules, 
indicating increased molecular motions caused by heat). As a result, the second 
law of thermodynamics—which states that the amount of disorder in the universe 
must always increase—is satisfied as the cell grows and divides. For a detailed 
discussion, see Panel 2-7 (pp. 102-103). 

Figure 2-17 Some interconversions between different forms of energy. 
All energy forms are, in principle, interconvertible. In all these processes the 
total amount of energy is conserved. Thus, for example, from the height and weight 
of the brick in (1), we can predict exactly how much heat will be released when it hits 
the floor. In (2), note that the large amount of chemical-bond energy released when 
water is formed is initially converted to very rapid thermal motions in the two new 
water molecules; but collisions with other molecules almost instantaneously spread 
this kinetic energy evenly throughout the surroundings (heat transfer), making the 
new molecules indistinguishable from all the rest. 

Cells Obtain Energy by the Oxidation of Organic Molecules 

All animal and plant cells are powered by energy stored in the chemical bonds 
of organic molecules, whether they are sugars that a plant has photosynthesized 
as food for itself or the mixture of large and small molecules that an animal has 
eaten. Organisms must extract this energy in usable form to live, grow, and reproduce. 
In both plants and animals, energy is extracted from food molecules by a 
process of gradual oxidation, or controlled burning. 

The Earth's atmosphere contains a great deal of oxygen, and in the presence of 
oxygen the most energetically stable form of carbon is CO₂ and that of hydrogen 
is H₂O. A cell is therefore able to obtain energy from sugars or other organic molecules 
by allowing their carbon and hydrogen atoms to combine with oxygen to 
produce CO₂ and H₂O, respectively—a process called aerobic respiration. 

Photosynthesis (discussed in detail in Chapter 14) and respiration are complementary 
processes (Figure 2-18). This means that the transactions between 
plants and animals are not all one way. Plants, animals, and microorganisms have 
existed together on this planet for so long that many of them have become an 
essential part of the others' environments. The oxygen released by photosynthesis 
is consumed in the combustion of organic molecules during aerobic respiration. 
And some of the CO₂ molecules that are fixed today into organic molecules 
by photosynthesis in a green leaf were yesterday released into the atmosphere 
by the respiration of an animal—or by the respiration of a fungus or bacterium 
decomposing dead organic matter. We therefore see that carbon utilization forms 
a huge cycle that involves the biosphere (all of the living organisms on Earth) as a 
whole (Figure 2-19). Similarly, atoms of nitrogen, phosphorus, and sulfur move 
between the living and nonliving worlds in cycles that involve plants, animals, 
fungi, and bacteria. 

Oxidation and Reduction Involve Electron Transfers 

The cell does not oxidize organic molecules in one step, as occurs when organic 
material is burned in a fire. Through the use of enzyme catalysts, metabolism takes 
these molecules through a large number of reactions that only rarely involve the 
direct addition of oxygen. Before we consider some of these reactions and their 
purpose, we discuss what is meant by the process of oxidation. 


Figure 2-18 Photosynthesis and respiration as complementary processes 
in the living world. Photosynthesis converts the electromagnetic energy in 
sunlight into chemical-bond energy in sugars and other organic molecules. 
Plants, algae, and cyanobacteria obtain the carbon atoms that they need for this 
purpose from atmospheric CO₂ and the hydrogen from water, releasing O₂ gas 
as a by-product. The organic molecules produced by photosynthesis in turn serve 
as food for other organisms. Many of these organisms carry out aerobic respiration, 
a process that uses O₂ to form CO₂ from the same carbon atoms that had been 
taken up as CO₂ and converted into sugars by photosynthesis. In the process, the 
organisms that respire obtain the chemical-bond energy that they need to survive. 
The first cells on the Earth are thought to have been capable of neither 
photosynthesis nor respiration (discussed in Chapter 14). However, photosynthesis 
must have preceded respiration on the Earth, since there is strong evidence that 
billions of years of photosynthesis were required before O₂ had been released in 
sufficient quantity to create an atmosphere rich in this gas. (The Earth's atmosphere 
currently contains 20% O₂.) 



Figure 2-19 The carbon cycle. Individual carbon atoms are incorporated into organic molecules of 
the living world by the photosynthetic activity of bacteria, algae, and plants. They pass to animals, 
microorganisms, and organic material in soil and oceans in cyclic paths. CO₂ is restored to the 
atmosphere when organic molecules are oxidized by cells or burned by humans as fuels. 

Oxidation refers to more than the addition of oxygen atoms; the term applies 
more generally to any reaction in which electrons are transferred from one atom 
to another. Oxidation in this sense refers to the removal of electrons, and reduction—the 
converse of oxidation—means the addition of electrons. Thus, Fe²⁺ is 
oxidized if it loses an electron to become Fe³⁺, and a chlorine atom is reduced 
if it gains an electron to become Cl⁻. Since the number of electrons is conserved 
(no loss or gain) in a chemical reaction, oxidation and reduction always occur 
simultaneously: that is, if one molecule gains an electron in a reaction (reduction), 
a second molecule loses the electron (oxidation). When a sugar molecule is 
oxidized to CO₂ and H₂O, for example, the O₂ molecules involved in forming H₂O 
gain electrons and thus are said to have been reduced. 

The terms "oxidation" and "reduction" apply even when there is only a partial 
shift of electrons between atoms linked by a covalent bond (Figure 2-20). When 
a carbon atom becomes covalently bonded to an atom with a strong affinity for 
electrons, such as oxygen, chlorine, or sulfur, for example, it gives up more than 
its equal share of electrons and forms a polar covalent bond. Because the positive 
charge of the carbon nucleus is now somewhat greater than the negative charge of 
its electrons, the atom acquires a partial positive charge and is said to be oxidized. 
Conversely, a carbon atom in a C-H linkage has slightly more than its share of 
electrons, and so it is said to be reduced. 

When a molecule in a cell picks up an electron (e⁻), it often picks up a proton 
(H⁺) at the same time (protons being freely available in water). The net effect in 
this case is to add a hydrogen atom to the molecule. 

A + e⁻ + H⁺ → AH 

Even though a proton plus an electron is involved (instead of just an electron), 
such hydrogenation reactions are reductions, and the reverse, dehydrogenation 
reactions are oxidations. It is especially easy to tell whether an organic molecule 
is being oxidized or reduced: reduction is occurring if its number of C-H bonds 
increases, whereas oxidation is occurring if its number of C-H bonds decreases 
(see Figure 2-20B). 
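
The C-H counting rule lends itself to a small worked check. The Python sketch below assumes the one-carbon series methane, methanol, formaldehyde, formic acid, carbon dioxide as the successive steps between methane and carbon dioxide; these intermediates are named here for illustration (the text itself specifies only the two end points), and the function names are hypothetical.

```python
# Number of C-H bonds on the single carbon atom of each compound.
# The intermediates are an assumption made for this illustration.
ONE_CARBON_SERIES = [
    ("methane", 4),
    ("methanol", 3),
    ("formaldehyde", 2),
    ("formic acid", 1),
    ("carbon dioxide", 0),
]


def classify(ch_bonds_before: int, ch_bonds_after: int) -> str:
    """Apply the rule: fewer C-H bonds means oxidation, more means reduction."""
    if ch_bonds_after < ch_bonds_before:
        return "oxidation"
    if ch_bonds_after > ch_bonds_before:
        return "reduction"
    return "undetermined by this rule"


def classify_steps(series):
    """Classify each consecutive conversion in a list of (name, C-H count)."""
    return [
        (before[0], after[0], classify(before[1], after[1]))
        for before, after in zip(series, series[1:])
    ]


if __name__ == "__main__":
    for start, end, kind in classify_steps(ONE_CARBON_SERIES):
        print(f"{start} -> {end}: {kind}")
```

Every step from methane toward carbon dioxide removes one C-H bond and is therefore an oxidation; running the same series in reverse classifies every step as a reduction.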

Cells use enzymes to catalyze the oxidation of organic molecules in small 
steps, through a sequence of reactions that allows useful energy to be harvested. 
We now need to explain how enzymes work and some of the constraints under 
which they operate. 

Figure 2-20 Oxidation and reduction. (A) When two atoms form a polar 
covalent bond, the atom ending up with a greater share of electrons is said 
to be reduced, while the other atom acquires a lesser share of electrons and 
is said to be oxidized. The reduced atom has acquired a partial negative 
charge (δ⁻) as the positive charge on the atomic nucleus is now more than 
equaled by the total charge of the electrons surrounding it, and conversely, 
the oxidized atom has acquired a partial positive charge (δ⁺). (B) The single 
carbon atom of methane can be converted to that of carbon dioxide by 
the successive replacement of its covalently bonded hydrogen atoms 
with oxygen atoms. With each step, electrons are shifted away from the 
carbon (as indicated by the blue shading), and the carbon atom becomes 
progressively more oxidized. Each of these steps is energetically favorable 
under the conditions present inside a cell. 

Enzymes Lower the Activation-Energy Barriers That Block 
Chemical Reactions 

Consider the reaction 

paper + O₂ → smoke + ashes + heat + CO₂ + H₂O 

Once ignited, the paper burns readily, releasing to the atmosphere both energy 
as heat and water and carbon dioxide as gases. The reaction is irreversible, since 
the smoke and ashes never spontaneously retrieve these entities from the heated 
atmosphere and reconstitute themselves into paper. When the paper burns, its 
chemical energy is dissipated as heat—not lost from the universe, since energy 
can never be created or destroyed, but irretrievably dispersed in the chaotic random 
thermal motions of molecules. At the same time, the atoms and molecules of 
the paper become dispersed and disordered. In the language of thermodynamics, 
there has been a loss of free energy; that is, of energy that can be harnessed to do 
work or drive chemical reactions. This loss reflects a reduction of orderliness in 
the way the energy and molecules were stored in the paper. 

We shall discuss free energy in more detail shortly, but the general principle 
is clear enough intuitively: chemical reactions proceed spontaneously only in 
the direction that leads to a loss of free energy. In other words, the spontaneous 
direction for any reaction is the direction that goes "downhill," where a "downhill" 
reaction is one that is energetically favorable. 

Although the most energetically favorable form of carbon under ordinary conditions 
is CO₂, and that of hydrogen is H₂O, a living organism does not disappear 
in a puff of smoke, and the paper book in your hands does not burst into flames. 
This is because the molecules both in the living organism and in the book are in a 
relatively stable state, and they cannot be changed to a state of lower energy without 
an input of energy: in other words, a molecule requires activation energy—a 
kick over an energy barrier—before it can undergo a chemical reaction that leaves 
it in a more stable state (Figure 2-21). In the case of a burning book, the activation 
energy can be provided by the heat of a lighted match. For the molecules in the 
watery solution inside a cell, the kick is delivered by an unusually energetic random 
collision with surrounding molecules—collisions that become more violent 
as the temperature is raised. 

Figure 2-21 The important principle of activation energy. (A) Compound Y 
(a reactant) is in a relatively stable state, and energy is required to convert it to 
compound X (a product), even though X is at a lower overall energy level than Y. This 
conversion will not take place, therefore, unless compound Y can acquire enough 
activation energy (energy a minus energy b) from its surroundings to undergo the 
reaction that converts it into compound X. This energy may be provided by means of 
an unusually energetic collision with other molecules. For the reverse reaction, 
X → Y, the activation energy will be much larger (energy a minus energy c); 
this reaction will therefore occur much more rarely. Activation energies are 
always positive; note, however, that the total energy change for the energetically 
favorable reaction Y → X is energy c minus energy b, a negative number. 
(B) Energy barriers for specific reactions can be lowered by catalysts, as indicated 
by the line marked d. Enzymes are particularly effective catalysts because they 
greatly reduce the activation energy for the reactions they perform. 

The chemistry in a living cell is tightly controlled, because the kick over energy 
barriers is greatly aided by a specialized class of proteins—the enzymes. Each 
enzyme binds tightly to one or more molecules, called substrates, and holds 
them in a way that greatly reduces the activation energy of a particular chemical 
reaction that the bound substrates can undergo. A substance that can lower the 
activation energy of a reaction is termed a catalyst; catalysts increase the rate of 
chemical reactions because they allow a much larger proportion of the random 
collisions with surrounding molecules to kick the substrates over the energy barrier, 
as illustrated in Figure 2-22. Enzymes are among the most effective catalysts 
known: some are capable of speeding up reactions by factors of 10¹⁴ or more. 
Enzymes thereby allow reactions that would not otherwise occur to proceed rapidly 
at normal temperatures. 
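
To get a feel for what a 10¹⁴-fold acceleration means in energy terms, one can use a rough estimate that is not part of the text. The Python sketch below assumes an Arrhenius-type rate law, in which the rate is proportional to exp(-Ea/RT), and assumes that the catalyst changes only the activation energy Ea and nothing else about the reaction. The temperature of 310 K (about body temperature) and the function names are likewise illustrative choices.

```python
import math

R = 8.314  # gas constant, J/(mol*K)


def rate_enhancement(barrier_drop_kj_per_mol: float, temperature_k: float = 310.0) -> float:
    """Factor by which the rate rises when the barrier falls by the given amount.

    Assumes rate ~ exp(-Ea / (R*T)) with everything else unchanged.
    """
    return math.exp(barrier_drop_kj_per_mol * 1000.0 / (R * temperature_k))


def barrier_drop_for(enhancement: float, temperature_k: float = 310.0) -> float:
    """Barrier reduction (kJ/mol) needed for a given rate enhancement."""
    return R * temperature_k * math.log(enhancement) / 1000.0


if __name__ == "__main__":
    drop = barrier_drop_for(1e14)
    print(f"barrier reduction for a 1e14-fold speed-up: {drop:.1f} kJ/mol")
```

Under these assumptions, a speed-up of 10¹⁴ corresponds to lowering the barrier by roughly 83 kJ/mol. Because the dependence is exponential, even a modest reduction in activation energy changes the rate enormously.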


Figure 2-22 Lowering the activation energy greatly increases the probability 
of a reaction. At any given instant, a population of identical substrate molecules 
will have a range of energies, distributed as shown on the graph. The varying energies 
come from collisions with surrounding molecules, which make the substrate 
molecules jiggle, vibrate, and spin. For a molecule to undergo a chemical reaction, 
the energy of the molecule must exceed the activation-energy barrier for that 
reaction (dashed lines). For most biological reactions, this almost never happens 
without enzyme catalysis. Even with enzyme catalysis, the substrate molecules 
must experience a particularly energetic collision to react (red shaded area). Raising 
the temperature will also increase the number of molecules with sufficient energy 
to overcome the activation energy needed for a reaction; but in marked contrast to 
enzyme catalysis, this effect is nonselective, speeding up all reactions (Movie 2.2). 


Enzymes Can Drive Substrate Molecules Along Specific Reaction 
Pathways 

An enzyme cannot change the equilibrium point for a reaction. The reason is simple: 
when an enzyme (or any catalyst) lowers the activation energy for the reaction 
Y → X, of necessity it also lowers the activation energy for the reaction X → Y by 
exactly the same amount (see Figure 2-21). The forward and backward reactions 
will therefore be accelerated by the same factor by an enzyme, and the equilibrium 
point for the reaction will be unchanged (Figure 2-23). Thus no matter how 
much an enzyme speeds up a reaction, it cannot change its direction. 
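
This argument can be tried out numerically. The Python sketch below models a reversible conversion between X and Y with simple first-order kinetics, which is an assumption made for this illustration. The rate constants are made-up values chosen so that equilibrium lies at 3 molecules of Y for every molecule of X, the ratio used in Figure 2-23; the hundredfold speed-up and the function names are also hypothetical.

```python
def equilibrium_ratio(k_xy: float, k_yx: float) -> float:
    """[Y]/[X] at equilibrium, where k_xy*[X] equals k_yx*[Y]."""
    return k_xy / k_yx


def simulate(k_xy: float, k_yx: float, total_time: float,
             dt: float = 0.01, x0: float = 1.0, y0: float = 0.0):
    """Follow X <-> Y over time with small fixed time steps (Euler method)."""
    x, y = x0, y0
    for _ in range(int(round(total_time / dt))):
        net = (k_xy * x - k_yx * y) * dt  # net amount converted X -> Y
        x -= net
        y += net
    return x, y


# Illustrative rate constants (per second): X -> Y and Y -> X.
UNCATALYZED = (0.03, 0.01)
SPEEDUP = 100.0  # the enzyme multiplies both rate constants by the same factor
CATALYZED = (UNCATALYZED[0] * SPEEDUP, UNCATALYZED[1] * SPEEDUP)

if __name__ == "__main__":
    for label, rates in (("uncatalyzed", UNCATALYZED), ("catalyzed", CATALYZED)):
        x, y = simulate(*rates, total_time=5.0)
        print(f"{label}: Y/X after 5 s = {y / x:.3f}, "
              f"equilibrium Y/X = {equilibrium_ratio(*rates):.3f}")
```

In this sketch the catalyzed reaction has essentially reached the 3-to-1 ratio after 5 seconds, whereas the uncatalyzed reaction is still far from it; given enough time, however, the uncatalyzed reaction settles at exactly the same ratio.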

Despite the above limitation, enzymes steer all of the reactions in cells through 
specific reaction paths. This is because enzymes are both highly selective and 
very precise, usually catalyzing only one particular reaction. In other words, each 
enzyme selectively lowers the activation energy of only one of the several possible 
chemical reactions that its bound substrate molecules could undergo. In this way, 
sets of enzymes can direct each of the many different molecules in a cell along a 
particular reaction pathway (Figure 2-24). 

The success of living organisms is attributable to a cell's ability to make 
enzymes of many types, each with precisely specified properties. Each enzyme 
has a unique shape containing an active site, a pocket or groove in the enzyme 
into which only particular substrates will fit (Figure 2-25). Like all other catalysts, 
enzyme molecules themselves remain unchanged after participating in a reaction 
and therefore can function over and over again. In Chapter 3, we discuss further 
how enzymes work. 

Figure 2-23 Enzymes cannot change the equilibrium point for reactions. Enzymes, like all 
catalysts, speed up the forward and backward rates of a reaction by the same factor. Therefore, for 
both the catalyzed and the uncatalyzed reactions shown here, the number of molecules undergoing 
the transition X → Y is equal to the number of molecules undergoing the transition Y → X when the 
ratio of Y molecules to X molecules is 3 to 1. In other words, the two reactions reach equilibrium at 
exactly the same point. 

Figure 2-24 Directing substrate molecules through a specific reaction 
pathway by enzyme catalysis. A substrate molecule in a cell (green ball) 
is converted into a different molecule (blue ball) by means of a series of 
enzyme-catalyzed reactions. As indicated (yellow box), several reactions 
are energetically favorable at each step, but only one is catalyzed by each 
enzyme. Sets of enzymes thereby determine the exact reaction pathway that 
is followed by each molecule inside the cell. 

How Enzymes Find Their Substrates: The Enormous Rapidity of 
Molecular Motions 

An enzyme will often catalyze the reaction of thousands of substrate molecules 
every second. This means that it must be able to bind a new substrate molecule 
in a fraction of a millisecond. But both enzymes and their substrates are present 
in relatively small numbers in a cell. How do they find each other so fast? Rapid 
binding is possible because the motions caused by heat energy are enormously 
fast at the molecular level. These molecular motions can be classified broadly into 
three kinds: (1) the movement of a molecule from one place to another (translational 
motion), (2) the rapid back-and-forth movement of covalently linked atoms 
with respect to one another (vibrations), and (3) rotations. All of these motions 
help to bring the surfaces of interacting molecules together. 

The rates of molecular motions can be measured by a variety of spectroscopic 
techniques. A large globular protein is constantly tumbling, rotating about its axis 
about a million times per second. Molecules are also in constant translational 
motion, which causes them to explore the space inside the cell very efficiently by 
wandering through it—a process called diffusion. In this way, every molecule in 
a cell collides with a huge number of other molecules each second. As the molecules 
in a liquid collide and bounce off one another, an individual molecule 
moves first one way and then another, its path constituting a random walk 
(Figure 2-26). In such a walk, the average net distance that each molecule travels (as the 
"crow flies") from its starting point is proportional to the square root of the time 
involved: that is, if it takes a molecule 1 second on average to travel 1 µm, it takes 
4 seconds to travel 2 µm, 100 seconds to travel 10 µm, and so on. 
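
The square-root relationship can be checked both by arithmetic and by simulation. In the Python sketch below, the first function simply rescales the travel times quoted above, while the second simulates many independent walkers on a two-dimensional grid. The grid, the unit step length, the number of walkers, and the function names are all assumptions made for illustration; a real molecule moves in three dimensions with steps of varying size.

```python
import random


def time_to_travel(distance_um: float, reference_time_s: float = 1.0,
                   reference_distance_um: float = 1.0) -> float:
    """Average time to reach a given net distance, scaling as distance squared."""
    return reference_time_s * (distance_um / reference_distance_um) ** 2


def mean_squared_displacement(n_steps: int, n_walkers: int = 2000, seed: int = 0) -> float:
    """Average squared net distance after n_steps unit steps on a 2D grid."""
    rng = random.Random(seed)
    moves = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    total = 0
    for _walker in range(n_walkers):
        x = y = 0
        for _step in range(n_steps):
            dx, dy = rng.choice(moves)
            x += dx
            y += dy
        total += x * x + y * y
    return total / n_walkers


if __name__ == "__main__":
    for d in (1, 2, 10):
        print(f"{d} um takes about {time_to_travel(d):.0f} s")
    for n in (100, 400):
        rms = mean_squared_displacement(n) ** 0.5
        print(f"{n} steps: root-mean-square net distance is about {rms:.1f} steps")
```

Quadrupling the number of steps should roughly double the typical net distance from the starting point, which is the same square-root law expressed in terms of steps rather than seconds.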

The inside of a cell is very crowded (Figure 2-27). Nevertheless, experiments 
in which fluorescent dyes and other labeled molecules are injected into cells show 
that small organic molecules diffuse through the watery gel of the cytosol nearly 
as rapidly as they do through water. A small organic molecule, for example, takes 
only about one-fifth of a second on average to diffuse a distance of 10 µm. Diffusion 
is therefore an efficient way for small molecules to move the limited distances 
in the cell (a typical animal cell is 15 µm in diameter). 

Figure 2-25 How enzymes work. Each enzyme has an active site to which one or more substrate 
molecules bind, forming an enzyme-substrate complex. A reaction occurs at the active site, 
producing an enzyme-product complex. The product is then released, allowing the enzyme to bind 
further substrate molecules. 

Since enzymes move more slowly than substrates in cells, we can think of them 
as sitting still. The rate of encounter of each enzyme molecule with its substrate 
will depend on the concentration of the substrate molecule. For example, some 
abundant substrates are present at a concentration of 0.5 mM. Since pure water 
is 55.5 M, there is only about one such substrate molecule in the cell for every 
10^5 water molecules. Nevertheless, the active site on an enzyme molecule that 
binds this substrate will be bombarded by about 500,000 random collisions with 
the substrate molecule per second. (For a substrate concentration tenfold lower, 
the number of collisions drops to 50,000 per second, and so on.) A random 
collision between the active site of an enzyme and the matching surface of its 
substrate molecule often leads immediately to the formation of an enzyme-substrate 
complex. A reaction in which a covalent bond is broken or formed can now occur 
extremely rapidly. When one appreciates how quickly molecules move and react, 
the observed rates of enzymatic catalysis do not seem so amazing. 
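
The arithmetic behind these figures is easy to reproduce. The sketch below uses only the numbers quoted above; the function names are hypothetical, and the linear scaling of collision rate with concentration is taken from the "tenfold lower" example rather than from a detailed kinetic model.

```python
WATER_MOLAR = 55.5               # concentration of pure water, mol/L
REF_SUBSTRATE_MOLAR = 0.5e-3     # 0.5 mM, the abundant-substrate example
REF_COLLISIONS_PER_S = 500_000   # collisions per second quoted for 0.5 mM


def water_per_substrate(substrate_molar):
    """Number of water molecules per substrate molecule."""
    return WATER_MOLAR / substrate_molar


def collisions_per_second(substrate_molar):
    """Collisions with one active site, assuming the rate is simply
    proportional to substrate concentration."""
    return REF_COLLISIONS_PER_S * substrate_molar / REF_SUBSTRATE_MOLAR


print(f"{water_per_substrate(0.5e-3):.3g} water molecules per substrate")
print(f"{collisions_per_second(0.05e-3):.0f} collisions/s at 0.05 mM")
```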

Two molecules that are held together by noncovalent bonds can also dissociate. 
The multiple weak noncovalent bonds that they form with each other will 
persist until random thermal motion causes the two molecules to separate. In 
general, the stronger the binding of the enzyme and substrate, the slower their 
rate of dissociation. In contrast, whenever two colliding molecules have poorly 
matching surfaces, they form few noncovalent bonds and the total energy of 
association will be negligible compared with that of thermal motion. In this case, the 
two molecules dissociate as rapidly as they come together, preventing incorrect 
and unwanted associations between mismatched molecules, such as between an 
enzyme and the wrong substrate. 

The Free-Energy Change for a Reaction, ΔG, Determines Whether 
It Can Occur Spontaneously 

Although enzymes speed up reactions, they cannot by themselves force 
energetically unfavorable reactions to occur. In terms of a water analogy, enzymes 
by themselves cannot make water run uphill. Cells, however, must do just that in 
order to grow and divide: they must build highly ordered and energy-rich 
molecules from small and simple ones. We shall see that this is done through enzymes 
that directly couple energetically favorable reactions, which release energy and 
produce heat, to energetically unfavorable reactions, which produce biological 
order. 

What do cell biologists mean by the term "energetically favorable," and how 
can this be quantified? According to the second law of thermodynamics, the 
universe tends toward maximum disorder (largest entropy or greatest probability). 
Thus, a chemical reaction can proceed spontaneously only if it results in a net 
increase in the disorder of the universe (see Figure 2-16). This disorder of the 
universe can be expressed most conveniently in terms of the free energy of a system, a 
concept we touched on earlier. 

Free energy, G, is an expression of the energy available to do work: for example, 
the work of driving chemical reactions. The value of G is of interest only when 
a system undergoes a change, denoted ΔG (delta G). The change in G is critical 
because, as explained in Panel 2-7 (pp. 102-103), ΔG is a direct measure of the 
amount of disorder created in the universe when a reaction takes place. 
Energetically favorable reactions, by definition, are those that decrease free energy; in 
other words, they have a negative ΔG and disorder the universe (Figure 2-28). 

Figure 2-26 A random walk. Molecules in solution move in a random fashion as a 
result of the continual buffeting they receive in collisions with other molecules. 
This movement allows small molecules to diffuse rapidly from one part of the 
cell to another, as described in the text (Movie 2.3). 

Figure 2-27 The structure of the cytoplasm. The drawing is approximately 
to scale and emphasizes the crowding in the cytoplasm. Only the 
macromolecules are shown: RNAs are shown in blue, ribosomes in green, 
and proteins in red. Enzymes and other macromolecules diffuse relatively 
slowly in the cytoplasm, in part because they interact with many other 
macromolecules; small molecules, by contrast, diffuse nearly as rapidly as 
they do in water (Movie 2.4). (Adapted from D.S. Goodsell, Trends Biochem. 
Sci. 16:203-206, 1991. With permission from Elsevier.) 

An example of an energetically favorable reaction on a macroscopic scale is 
the "reaction" by which a compressed spring relaxes to an expanded state, 
releasing its stored elastic energy as heat to its surroundings; an example on a 
microscopic scale is salt dissolving in water. Conversely, energetically unfavorable 
reactions with a positive ΔG (such as the joining of two amino acids to form a peptide 
bond) by themselves create order in the universe. Therefore, these reactions can 
take place only if they are coupled to a second reaction with a negative ΔG so large 
that the ΔG of the overall process is negative (Figure 2-29). 

The Concentration of Reactants Influences the Free-Energy 
Change and a Reaction’s Direction 

As we have just described, a reaction Y ↔ X will go in the direction Y → X when 
the associated free-energy change, ΔG, is negative, just as a tensed spring left to 
itself will relax and lose its stored energy to its surroundings as heat. For a 
chemical reaction, however, ΔG depends not only on the energy stored in each 
individual molecule, but also on the concentrations of the molecules in the reaction 
mixture. Remember that ΔG reflects the degree to which a reaction creates a more 
disordered (in other words, a more probable) state of the universe. Recalling our 
coin analogy, it is very likely that a coin will flip from a head to a tail orientation if 
a jiggling box contains 90 heads and 10 tails, but this is a less probable event if the 
box has 10 heads and 90 tails. 

The same is true for a chemical reaction. For a reversible reaction Y ↔ X, a 
large excess of Y over X will tend to drive the reaction in the direction Y → X. 
Therefore, as the ratio of Y to X increases, the ΔG becomes more negative for the 
transition Y → X (and more positive for the transition X → Y). 

The amount of concentration difference that is needed to compensate for a 
given decrease in chemical-bond energy (and accompanying heat release) is not 
intuitively obvious. In the late nineteenth century, the relationship was 
determined through a thermodynamic analysis that makes it possible to separate 
the concentration-dependent and the concentration-independent parts of the 
free-energy change, as we describe next. 

The Standard Free-Energy Change, ΔG°, Makes It Possible to 
Compare the Energetics of Different Reactions 

Because ΔG depends on the concentrations of the molecules in the reaction 
mixture at any given time, it is not a particularly useful value for comparing the 
relative energies of different types of reactions. To place reactions on a comparable 
basis, we need to turn to the standard free-energy change of a reaction, ΔG°. 
The ΔG° is the change in free energy under a standard condition, defined as that 
where the concentrations of all the reactants are set to the same fixed value of 1 
mole/liter. Defined in this way, ΔG° depends only on the intrinsic characters of 
the reacting molecules. 

For the simple reaction Y → X at 37°C, ΔG° is related to ΔG as follows: 

$$\Delta G = \Delta G^\circ + RT \ln \frac{[X]}{[Y]}$$

where ΔG is in kilojoules per mole, [Y] and [X] denote the concentrations of Y and 
X in moles/liter, ln is the natural logarithm, and RT is the product of the gas 
constant, R, and the absolute temperature, T. At 37°C, RT = 2.58 kJ mole^-1. (A mole is 
6 × 10^23 molecules of a substance.) 
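
To see how concentration shifts the free-energy change, the relationship ΔG = ΔG° + RT ln([X]/[Y]) can be evaluated directly. The following is a minimal sketch: the function name is hypothetical, and the ΔG° of -10 kJ/mole used in the loop is an arbitrary illustrative value, not a figure for any real reaction.

```python
import math

R_KJ = 8.314e-3   # gas constant, kJ per mole per kelvin
T_37C = 310.15    # 37 °C expressed in kelvin


def delta_g(delta_g0, conc_x, conc_y, rt=R_KJ * T_37C):
    """Free-energy change (kJ/mole) for Y -> X at the given concentrations."""
    return delta_g0 + rt * math.log(conc_x / conc_y)


print(f"RT at 37 °C = {R_KJ * T_37C:.2f} kJ/mole")

# Hypothetical reaction with a standard free-energy change of -10 kJ/mole.
for x, y in ((1.0, 10.0), (1.0, 1.0), (10.0, 1.0)):
    print(f"[X]={x}, [Y]={y}: ΔG = {delta_g(-10.0, x, y):+.2f} kJ/mole")
```

An excess of Y makes ΔG more negative than ΔG°, an excess of X makes it less negative, and at equal concentrations the two are identical.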

A large body of thermodynamic data has been collected that has made it 
possible to determine the standard free-energy change, ΔG°, for the important 
metabolic reactions of a cell. Given these ΔG° values, combined with additional 
information about metabolite concentrations and reaction pathways, it is possible to 
quantitatively predict the course of most biological reactions. 


Figure 2-28 The distinction between energetically favorable and energetically 
unfavorable reactions. Energetically favorable reaction: the free energy of Y 
is greater than the free energy of X. Therefore ΔG < 0, and the disorder of the 
universe increases during the reaction Y → X. This reaction can occur 
spontaneously. Energetically unfavorable reaction: if the reaction X → Y occurred, 
ΔG would be > 0, and the universe would become more ordered. This reaction can 
occur only if it is coupled to a second, energetically favorable reaction. 

Figure 2-29 How reaction coupling is used to drive energetically unfavorable 
reactions. The energetically unfavorable reaction X → Y is driven by the 
energetically favorable reaction C → D, because the net free-energy change for the 
pair of coupled reactions is less than zero. 

Figure 2-30 Chemical equilibrium. When a reaction reaches equilibrium, the forward 
and backward fluxes of reacting molecules are equal and opposite. 

For the energetically favorable reaction Y → X, when X and Y are at equal 
concentrations, [Y] = [X], the formation of X is energetically favored. In other 
words, the ΔG of Y → X is negative and the ΔG of X → Y is positive. But because of 
thermal bombardments, there will always be some X converting to Y. 

Thus, for each individual molecule, conversion of Y to X will occur often. 
Conversion of X to Y will occur less often than the transition Y → X, because it 
requires a more energetic collision. Therefore the ratio of X to Y molecules will 
increase with time. 

Eventually, there will be a large enough excess of X over Y to just 
compensate for the slow rate of X → Y, such that the number of Y molecules 
being converted to X molecules each second is exactly equal to the number 
of X molecules being converted to Y molecules each second. At this point, 
the reaction will be at equilibrium. 

At equilibrium, there is no net change in the ratio of Y to X, and the 
ΔG for both forward and backward reactions is zero. 


The Equilibrium Constant and ΔG° Are Readily Derived from 
Each Other 

Inspection of the above equation reveals that the ΔG equals the value of ΔG° 
when the concentrations of Y and X are equal. But as any favorable reaction 
proceeds, the concentrations of the products will increase as the concentration of the 
substrates decreases. This change in relative concentrations will cause [X]/[Y] to 
become increasingly large, making the initially favorable ΔG less and less negative 
(the logarithm of a number x is positive for x > 1, negative for x < 1, and zero for 
x = 1). Eventually, when ΔG = 0, a chemical equilibrium will be attained; here there 
is no net change in free energy to drive the reaction in either direction, inasmuch 
as the concentration effect just balances the push given to the reaction by ΔG°. 
As a result, the ratio of product to substrate reaches a constant value at chemical 
equilibrium (Figure 2-30). 
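
The approach to equilibrium can be illustrated numerically. The sketch below assumes the simplest possible kinetics, in which each conversion is first order, and uses made-up rate constants (k_forward and k_backward are hypothetical values chosen only for illustration). Under that assumption the forward and backward fluxes balance when [X]/[Y] equals k_forward/k_backward.

```python
def simulate(y0, x0, k_forward, k_backward, dt=0.001, steps=20000):
    """Step the reversible reaction Y <-> X forward in time (Euler method).

    k_forward is the rate constant for Y -> X and k_backward for X -> Y;
    both are illustrative numbers, not values for a real reaction.
    """
    y, x = y0, x0
    for _ in range(steps):
        # Net amount of Y converted to X during this small time step.
        net = (k_forward * y - k_backward * x) * dt
        y -= net
        x += net
    return y, x


# Start with only Y; the favorable direction Y -> X is ten times faster.
y, x = simulate(y0=1.0, x0=0.0, k_forward=10.0, k_backward=1.0)
print(f"[X]/[Y] after the run: {x / y:.4f}")
print(f"forward flux {10.0 * y:.4f}, backward flux {1.0 * x:.4f}")
```

Once the ratio stops changing, the two fluxes are equal and opposite, which is the condition described for chemical equilibrium.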

We can define the equilibrium constant, K, for the reaction Y → X as 

$$K = \frac{[X]}{[Y]}$$

where [X] is the concentration of the product and [Y] is the concentration of the 
reactant at equilibrium. Remembering that ΔG = ΔG° + RT ln([X]/[Y]), and that 
ΔG = 0 at equilibrium, we see that 

$$\Delta G^\circ = -RT \ln \frac{[X]}{[Y]} = -RT \ln K$$

At 37°C, where RT = 2.58, the equilibrium equation is therefore: 

$$\Delta G^\circ = -2.58 \ln K$$

Converting this equation from the natural logarithm (ln) to the more commonly 
used base 10 logarithm (log), we get 

$$\Delta G^\circ = -5.94 \log K$$

The above equation reveals how the equilibrium ratio of X to Y (expressed as 
the equilibrium constant, K) depends on the intrinsic character of the molecules 
(as expressed in the value of ΔG° in kilojoules per mole). Note that for every 5.94 
kJ/mole difference in free energy at 37°C, the equilibrium constant changes by 
a factor of 10 (Table 2-2). Thus, the more energetically favorable a reaction, the 
more product will accumulate if the reaction proceeds to equilibrium. 
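
The conversion between the equilibrium constant and the standard free-energy change is simple enough to compute directly. The sketch below uses RT = 2.58 kJ/mole at 37°C as given in the text; the function names are hypothetical.

```python
import math

RT_37C = 2.58  # kJ/mole at 37 °C, as given in the text


def delta_g0_from_k(k):
    """Standard free-energy change (kJ/mole) from the equilibrium constant."""
    return -RT_37C * math.log(k)


def k_from_delta_g0(delta_g0):
    """Equilibrium constant from the standard free-energy change (kJ/mole)."""
    return math.exp(-delta_g0 / RT_37C)


# Each tenfold change in K shifts ΔG° by RT ln 10, about 5.94 kJ/mole.
print(f"RT ln 10 = {RT_37C * math.log(10):.2f} kJ/mole")
for exponent in range(5, -6, -1):
    k = 10.0 ** exponent
    print(f"K = 1e{exponent:+d}   ΔG° = {delta_g0_from_k(k):.1f} kJ/mole")
```

The printed values can be compared against the kilojoule column of Table 2-2.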

More generally, for a reaction that has multiple reactants and products, such 
as A + B → C + D, 

$$K = \frac{[C][D]}{[A][B]}$$

The concentrations of the two reactants and the two products are multiplied 
because the rate of the forward reaction depends on the collision of A and B and 
the rate of the backward reaction depends on the collision of C and D. Thus, at 
37°C, 

$$\Delta G^\circ = -5.94 \log \frac{[C][D]}{[A][B]}$$

where ΔG° is in kilojoules per mole, and [A], [B], [C], and [D] denote the 
concentrations of the reactants and products in moles/liter. 
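
As a brief illustration of the two-reactant case, the following sketch computes ΔG° for A + B → C + D from a set of equilibrium concentrations. The function name and the concentrations passed to it are hypothetical and chosen only to make the arithmetic easy to follow.

```python
import math


def delta_g0_bimolecular(a, b, c, d):
    """ΔG° (kJ/mole, 37 °C) for A + B -> C + D.

    a, b, c, d are the equilibrium concentrations in moles/liter.
    """
    k = (c * d) / (a * b)          # equilibrium constant K
    return -5.94 * math.log10(k)   # base 10 form of -RT ln K at 37 °C


# Hypothetical equilibrium mixture: products tenfold in excess of reactants.
print(f"{delta_g0_bimolecular(1.0, 1.0, 10.0, 10.0):.2f} kJ/mole")
```

Because the concentrations are multiplied, a tenfold excess of each product gives K = 100, twice the free-energy difference of a single tenfold ratio.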


The Free-Energy Changes of Coupled Reactions Are Additive 

We have pointed out that unfavorable reactions can be coupled to favorable ones 
to drive the unfavorable ones forward (see Figure 2-29). In thermodynamic terms, 
this is possible because the overall free-energy change for a set of coupled 
reactions is the sum of the free-energy changes in each of its component steps. 
Consider, as a simple example, two sequential reactions 

X → Y and Y → Z 

whose ΔG° values are +5 and -13 kJ/mole, respectively. If these two reactions 
occur sequentially, the ΔG° for the coupled reaction will be -8 kJ/mole. This 
means that, with appropriate conditions, the unfavorable reaction X → Y can be 
driven by the favorable reaction Y → Z, provided that this second reaction follows 
the first. For example, several of the reactions in the long pathway that converts 
sugars into CO2 and H2O have positive ΔG° values. But the pathway nevertheless 
proceeds because the total ΔG° for the series of sequential reactions has a large 
negative value. 
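
The additivity can be checked with the numbers from this example. In the sketch below the function names are hypothetical, and RT = 2.58 kJ/mole is the 37°C value used earlier in the chapter. It also shows the counterpart of additivity: when free-energy changes add, the corresponding equilibrium constants multiply.

```python
import math

RT_37C = 2.58  # kJ/mole at 37 °C


def coupled_delta_g0(*steps):
    """Overall ΔG° (kJ/mole) for reactions that occur in sequence."""
    return sum(steps)


def equilibrium_constant(delta_g0):
    """Equilibrium constant implied by a ΔG° value (kJ/mole) at 37 °C."""
    return math.exp(-delta_g0 / RT_37C)


total = coupled_delta_g0(+5.0, -13.0)   # X -> Y followed by Y -> Z
print(f"overall ΔG° = {total:+.1f} kJ/mole")

k_xy = equilibrium_constant(+5.0)       # less than 1: unfavorable alone
k_yz = equilibrium_constant(-13.0)      # much greater than 1: favorable
print(f"K(X->Y) * K(Y->Z) = {k_xy * k_yz:.2f}")
print(f"K(X->Z)           = {equilibrium_constant(total):.2f}")
```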

Forming a sequential pathway is not adequate for many purposes. Often the 
desired pathway is simply X → Y, without further conversion of Y to some other 
product. Fortunately, there are other more general ways of using enzymes to 
couple reactions together. These often involve the activated carrier molecules that we 
discuss next. 


Table 2-2 Relationship Between the Standard Free-Energy Change, ΔG°, 
and the Equilibrium Constant 

Equilibrium constant     Free energy of X minus free energy of Y 
[X]/[Y] = K              [kJ/mole (kcal/mole)] 

10^5                     -29.7 (-7.1) 
10^4                     -23.8 (-5.7) 
10^3                     -17.8 (-4.3) 
10^2                     -11.9 (-2.8) 
10^1                     -5.9 (-1.4) 
1                        0 (0) 
10^-1                    5.9 (1.4) 
10^-2                    11.9 (2.8) 
10^-3                    17.8 (4.3) 
10^-4                    23.8 (5.7) 
10^-5                    29.7 (7.1) 

Values of the equilibrium constant were calculated for the simple chemical 
reaction Y ↔ X using the equation given in the text. The ΔG° given here 
is in kilojoules per mole at 37°C, with kilocalories per mole in parentheses. 
One kilojoule (kJ) is equal to 0.239 kilocalories (kcal) (1 kcal = 4.18 kJ). As 
explained in the text, ΔG° represents the free-energy difference under standard 
conditions (where all components are present at a concentration of 1.0 mole/liter). 
From this table, we see that if there is a favorable standard free-energy 
change (ΔG°) of -17.8 kJ/mole (-4.3 kcal/mole) for the transition Y → X, 
there will be 1000 times more molecules in state X than in state Y at equilibrium 
(K = 1000). 



Activated Carrier Molecules Are Essential for Biosynthesis 

The energy released by the oxidation of food molecules must be stored 
temporarily before it can be channeled into the construction of the many other molecules 
needed by the cell. In most cases, the energy is stored as chemical-bond energy 
in a small set of activated "carrier molecules," which contain one or more 
energy-rich covalent bonds. These molecules diffuse rapidly throughout the cell and 
thereby carry their bond energy from sites of energy generation to the sites where 
the energy will be used for biosynthesis and other cell activities (Figure 2-31). 

The activated carriers store energy in an easily exchangeable form, either as 
a readily transferable chemical group or as electrons held at a high energy level, 
and they can serve a dual role as a source of both energy and chemical groups in 
biosynthetic reactions. For historical reasons, these molecules are also sometimes 
referred to as coenzymes. The most important of the activated carrier molecules 
are ATP and two molecules that are closely related to each other, NADH and 
NADPH. Cells use such activated carrier molecules like money to pay for 
reactions that otherwise could not take place. 

Figure 2-31 Energy transfer and the role of activated carriers in metabolism. 
By serving as energy shuttles, activated carrier molecules perform their function 
as go-betweens that link the breakdown of food molecules and the release of 
energy (catabolism) to the energy-requiring biosynthesis of small and large organic 
molecules (anabolism). 

The Formation of an Activated Carrier Is Coupled to an 
Energetically Favorable Reaction 

Coupling mechanisms require enzymes and are fundamental to all the energy 
transactions of the cell. The nature of a coupled reaction is illustrated by a 
mechanical analogy in Figure 2-32, in which an energetically favorable chemical 
reaction is represented by rocks falling from a cliff. The energy of falling rocks 
would normally be entirely wasted in the form of heat generated by friction when 
the rocks hit the ground (see the falling-brick diagram in Figure 2-17). By careful 
design, however, part of this energy could be used instead to drive a paddle wheel 
that lifts a bucket of water (Figure 2-32B). Because the rocks can now reach the 
ground only after moving the paddle wheel, we say that the energetically 
favorable reaction of rock falling has been directly coupled to the energetically 
unfavorable reaction of lifting the bucket of water. Note that because part of the energy is 
used to do work in Figure 2-32B, the rocks hit the ground with less velocity than in 
Figure 2-32A, and correspondingly less energy is dissipated as heat. 

Similar processes occur in cells, where enzymes play the role of the paddle 
wheel. By mechanisms that we discuss later in this chapter, enzymes couple an 
energetically favorable reaction, such as the oxidation of foodstuffs, to an 
energetically unfavorable reaction, such as the generation of an activated carrier 
molecule. In this example, the amount of heat released by the oxidation reaction is 
reduced by exactly the amount of energy stored in the energy-rich covalent bonds 
of the activated carrier molecule. And the activated carrier molecule picks up a 
packet of energy of a size sufficient to power a chemical reaction elsewhere in the 
cell. 

Figure 2-32 A mechanical model illustrating the principle of coupled 
chemical reactions. The spontaneous reaction shown in (A) could serve as an 
analogy for the direct oxidation of glucose to CO2 and H2O, which produces heat 
only. In (B), the same reaction is coupled to a second reaction; this second reaction 
is analogous to the synthesis of activated carrier molecules. The energy produced in 
(B) is in a more useful form than in (A) and can be used to drive a variety of otherwise 
energetically unfavorable reactions (C). Panel labels: (A) kinetic energy of falling 
rocks is transformed into heat energy only; (B) part of the kinetic energy is used to 
lift a bucket of water, and a correspondingly smaller amount is transformed into heat; 
(C) the potential kinetic energy stored in the raised bucket of water can be used to 
drive hydraulic machines that carry out a variety of useful tasks. 

Figure 2-33 The hydrolysis of ATP to ADP and inorganic phosphate. The two 
outermost phosphates in ATP are held to the rest of the molecule by high-energy 
phosphoanhydride bonds and are readily transferred. As indicated, water can be 
added to ATP to form ADP and inorganic phosphate (Pi). Hydrolysis of the terminal 
phosphate of ATP yields between 46 and 54 kJ/mole of usable energy, depending 
on the intracellular conditions. The large negative ΔG of this reaction arises from 
several factors: release of the terminal phosphate group removes an unfavorable 
repulsion between adjacent negative charges, and the inorganic phosphate ion 
(Pi) released is stabilized by resonance and by favorable hydrogen-bond formation with 
water. 

ATP Is the Most Widely Used Activated Carrier Molecule 

The most important and versatile of the activated carriers in cells is ATP 
(adenosine triphosphate). Just as the energy stored in the raised bucket of water in 
Figure 2-32B can drive a wide variety of hydraulic machines, ATP is a convenient 
and versatile store, or currency, of energy used to drive a variety of chemical 
reactions in cells. ATP is synthesized in an energetically unfavorable phosphorylation 
reaction in which a phosphate group is added to ADP (adenosine diphosphate). 
When required, ATP gives up its energy packet through its energetically favorable 
hydrolysis to ADP and inorganic phosphate (Figure 2-33). The regenerated ADP 
is then available to be used for another round of the phosphorylation reaction that 
forms ATP. 

The energetically favorable reaction of ATP hydrolysis is coupled to many 
otherwise unfavorable reactions through which other molecules are synthesized. 
Many of these coupled reactions involve the transfer of the terminal phosphate in 
ATP to another molecule, as illustrated by the phosphorylation reaction in Figure 
2-34. 

As the most abundant activated carrier in cells, ATP is the principal energy 
currency. To give just two examples, it supplies energy for many of the pumps 
that transport substances into and out of the cell (discussed in Chapter 11), and it 
powers the molecular motors that enable muscle cells to contract and nerve cells 
to transport materials from one end of their long axons to another (discussed in 
Chapter 16). 

Energy Stored in ATP Is Often Harnessed to Join Two Molecules 
Together 

We have previously discussed one way in which an energetically favorable 
reaction can be coupled to an energetically unfavorable reaction, X → Y, so as to 
enable it to occur. In that scheme, a second enzyme catalyzes the energetically 
favorable reaction Y → Z, pulling all of the X to Y in the process. But when the 
required product is Y and not Z, this mechanism is not useful. 

Figure 2-34 An example of a phosphate transfer reaction. Because an 
energy-rich phosphoanhydride bond in ATP is converted to a phosphoester bond, 
this reaction is energetically favorable, having a large negative ΔG. Reactions of 
this type are involved in the synthesis of phospholipids and in the initial steps of 
reactions that catabolize sugars. 

A typical biosynthetic reaction is one in which two molecules, A and B, are 
joined together to produce A-B in the energetically unfavorable condensation 
reaction 

A-H + B-OH → A-B + H2O 

There is an indirect pathway that allows A-H and B-OH to form A-B, in which 
a coupling to ATP hydrolysis makes the reaction go. Here, energy from ATP 
hydrolysis is first used to convert B-OH to a higher-energy intermediate compound, 
which then reacts directly with A-H to give A-B. The simplest possible 
mechanism involves the transfer of a phosphate from ATP to B-OH to make B-O-PO3, in 
which case the reaction pathway contains only two steps: 

1. B-OH + ATP → B-O-PO3 + ADP 

2. A-H + B-O-PO3 → A-B + Pi 

Net result: B-OH + ATP + A-H → A-B + ADP + Pi 
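
The bookkeeping behind the net result can be made explicit by adding the two steps and cancelling whatever appears on both sides. The sketch below is only an illustration of that accounting: the species names are plain text labels and the helper function is hypothetical.

```python
from collections import Counter


def net_reaction(*steps):
    """Add reaction steps and cancel species that appear on both sides.

    Each step is a (reactants, products) pair of lists of species labels.
    """
    reactants, products = Counter(), Counter()
    for lhs, rhs in steps:
        reactants.update(lhs)
        products.update(rhs)
    cancelled = reactants & products   # made in one step, used in another
    return reactants - cancelled, products - cancelled


step1 = (["B-OH", "ATP"], ["B-O-PO3", "ADP"])
step2 = (["A-H", "B-O-PO3"], ["A-B", "Pi"])

lhs, rhs = net_reaction(step1, step2)
print(" + ".join(sorted(lhs)), "->", " + ".join(sorted(rhs)))
```

The phosphorylated intermediate B-O-PO3 is produced in step 1 and consumed in step 2, so it drops out of the overall equation.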


The condensation reaction, which by itself is energetically unfavorable, is forced 
to occur by being directly coupled to ATP hydrolysis in an enzyme-catalyzed 
reaction pathway (Figure 2-35A). 

A biosynthetic reaction of exactly this type synthesizes the amino acid 
glutamine (Figure 2-35B). We will see shortly that similar (but more complex) 
mechanisms are also used to produce nearly all of the large molecules of the cell. 

Figure 2-35 An example of an energetically unfavorable biosynthetic 
reaction driven by ATP hydrolysis. (A) Schematic illustration of the formation of A-B 
in the condensation reaction described in the text. (B) The biosynthesis of the common 
amino acid glutamine from glutamic acid and ammonia. Glutamic acid is first converted to 
a high-energy phosphorylated intermediate (corresponding to the compound B-O-PO3 
described in the text), which then reacts with ammonia (corresponding to A-H) to 
form glutamine. In this example, both steps occur on the surface of the same enzyme, 
glutamine synthetase. The high-energy bonds are shaded red; here, as elsewhere 
throughout the book, the symbol Pi = HPO4^2-, and a yellow "circled P" = PO3^2-. 

NADH and NADPH Are Important Electron Carriers 

Other important activated carrier molecules participate in oxidation-reduction 
reactions and are commonly part of coupled reactions in cells. These activated 
carriers are specialized to carry electrons held at a high energy level (sometimes 
called "high-energy" electrons) and hydrogen atoms. The most important of these 
electron carriers are NAD+ (nicotinamide adenine dinucleotide) and the closely 
related molecule NADP+ (nicotinamide adenine dinucleotide phosphate). Each 
picks up a "packet of energy" corresponding to two electrons plus a proton (H+), 
and they are thereby converted to NADH (reduced nicotinamide adenine 
dinucleotide) and NADPH (reduced nicotinamide adenine dinucleotide phosphate), 
respectively (Figure 2-36). These molecules can therefore be regarded as carriers 
of hydride ions (the H+ plus two electrons, or H⁻). 

Like ATP, NADPH is an activated carrier that participates in many important 
biosynthetic reactions that would otherwise be energetically unfavorable. The 
NADPH is produced according to the general scheme shown in Figure 2-36A. 
During a special set of energy-yielding catabolic reactions, two hydrogen atoms 
are removed from a substrate molecule. Both electrons but just one proton (that is, 
a hydride ion, H⁻) are added to the nicotinamide ring of NADP+ to form NADPH; 
the second proton (H+) is released into solution. This is a typical 
oxidation-reduction reaction, in which the substrate is oxidized and NADP+ is reduced. 

NADPH readily gives up the hydride ion it carries in a subsequent 
oxidation-reduction reaction, because the nicotinamide ring can achieve a more 
stable arrangement of electrons without it. In this subsequent reaction, which 
regenerates NADP+, it is the NADPH that is oxidized and the substrate that is 
reduced. The NADPH is an effective donor of its hydride ion to other molecules for 
the same reason that ATP readily transfers a phosphate: in both cases the transfer 
is accompanied by a large negative free-energy change. One example of the use of 
NADPH in biosynthesis is shown in Figure 2-37. 

Figure 2-36 NADPH, an important carrier of electrons. 
(A) NADPH is produced in reactions of the general type shown on 
the left, in which two hydrogen atoms are removed from a substrate. 
The oxidized form of the carrier molecule, NADP+, receives one 
hydrogen atom plus an electron (a hydride ion); the proton (H+) from 
the other H atom is released into solution. Because NADPH holds 
its hydride ion in a high-energy linkage, the hydride ion can easily 
be transferred to other molecules, as shown on the right. (B) and 
(C) The structures of NADP+ and NADPH. The part of the NADP+ 
molecule known as the nicotinamide ring accepts the hydride ion, 
H⁻, forming NADPH. The molecules NAD+ and NADH are identical in 
structure to NADP+ and NADPH, respectively, except that they lack 
the indicated phosphate group. 

The extra phosphate group on NADPH has no effect on the electron-transfer 
properties of NADPH compared with NADH, being far away from the region 
involved in electron transfer (see Figure 2-36C). It does, however, give a molecule 
of NADPH a slightly different shape from that of NADH, making it possible for 
NADPH and NADH to bind as substrates to completely different sets of enzymes. 
Thus, the two types of carriers are used to transfer electrons (or hydride ions) 
between two different sets of molecules. 

Why should there be this division of labor? The answer lies in the need to 
regulate two sets of electron-transfer reactions independently. NADPH operates 
chiefly with enzymes that catalyze anabolic reactions, supplying the high-energy 
electrons needed to synthesize energy-rich biological molecules. NADH, by 
contrast, has a special role as an intermediate in the catabolic system of reactions that 
generate ATP through the oxidation of food molecules, as we will discuss shortly. 
The genesis of NADH from NAD+, and of NADPH from NADP+, occur by different 
pathways and are independently regulated, so that the cell can adjust the supply 
of electrons for these two contrasting purposes. Inside the cell the ratio of NAD+ 
to NADH is kept high, whereas the ratio of NADP+ to NADPH is kept low. This 
provides plenty of NAD+ to act as an oxidizing agent and plenty of NADPH to act as 
a reducing agent (Figure 2-37B), as required for their special roles in catabolism 
and anabolism, respectively. 

There Are Many Other Activated Carrier Molecules in Cells 

Other activated carriers also pick up and carry a chemical group in an easily 
transferred, high-energy linkage. For example, coenzyme A carries a readily transferable 
acetyl group in a thioester linkage, and in this activated form is known as acetyl 
CoA (acetyl coenzyme A). Acetyl CoA (Figure 2-38) is used to add two carbon 
units in the biosynthesis of larger molecules. 

Figure 2-37 NADPH as a reducing agent. (A) The final stage in the biosynthetic route leading to 
cholesterol. As in many other biosynthetic reactions, the reduction of the C=C bond is achieved by 
the transfer of a hydride ion from the carrier molecule NADPH, plus a proton (H+) from the solution. 
(B) Keeping NADPH levels high and NADH levels low alters their affinities for electrons (see 
Panel 14-1, p. 765). This causes NADPH to be a much stronger electron donor (reducing 
agent) than NADH, and NAD+ therefore to be a much better electron acceptor (oxidizing 
agent) than NADP+, as indicated. 

Figure 2-38 The structure of the important activated carrier molecule 
acetyl CoA. A ball-and-stick model is shown above the structure. The sulfur 
atom (yellow) forms a thioester bond to acetate. Because this is a high-energy 
linkage, releasing a large amount of free energy when it is hydrolyzed, the acetate 
molecule can be readily transferred to other molecules. 

In acetyl CoA, as in other carrier molecules, the transferable group makes up 
only a small part of the molecule. The rest consists of a large organic portion that 
serves as a convenient "handle," facilitating the recognition of the carrier 
molecule by specific enzymes. As with acetyl CoA, this handle portion very often 
contains a nucleotide (usually adenosine), a curious fact that may be a relic from an 
early stage of evolution. It is currently thought that the main catalysts for early 
life-forms, before DNA or proteins, were RNA molecules (or their close 
relatives), as described in Chapter 6. It is tempting to speculate that many of the 
carrier molecules that we find today originated in this earlier RNA world, where their 
nucleotide portions could have been useful for binding them to RNA enzymes 
(ribozymes). 

Thus, ATP transfers phosphate, NADPH transfers electrons and hydrogen, and 
acetyl CoA transfers two-carbon acetyl groups. FADH2 (reduced flavin adenine 
dinucleotide) is used like NADH in electron and proton transfers (Figure 2-39). 
The reactions of other activated carrier molecules involve the transfer of a methyl, 
carboxyl, or glucose group for biosyntheses (Table 2-3). These activated carriers 


Table 2-3 Some Activated Carrier Molecules Widely Used in Metabolism 

Activated carrier          Group carried in high-energy linkage 
ATP                        Phosphate 
NADH, NADPH, FADH2         Electrons and hydrogens 
Acetyl CoA                 Acetyl group 
Figure 2-39 FADH2 is a carrier of 
hydrogens and high-energy electrons, 
like NADH and NADPH. (A) Structure of 
FADH2, with its hydrogen-carrying atoms 
highlighted in yellow. (B) The formation of 
FADH2 from FAD. 
are generated in reactions that are coupled to ATP hydrolysis, as in the example 
in Figure 2-40. Therefore, the energy that enables their groups to be used for 
biosynthesis ultimately comes from the catabolic reactions that generate ATP. 
Similar processes occur in the synthesis of the very large molecules of the cell (the 
nucleic acids, proteins, and polysaccharides) that we discuss next. 

The Synthesis of Biological Polymers Is Driven by ATP Hydrolysis 

As discussed previously, the macromolecules of the cell constitute most of its dry 
mass (see Figure 2-7). These molecules are made from subunits (or monomers) 
that are linked together in a condensation reaction, in which the constituents of a 
water molecule (OH plus H) are removed from the two reactants. Consequently, 
the reverse reaction, the breakdown of all three types of polymers, occurs by the 
enzyme-catalyzed addition of water (hydrolysis). This hydrolysis reaction is 
energetically favorable, whereas the biosynthetic reactions require an energy input 
(see Figure 2-9). 

The nucleic acids (DNA and RNA), proteins, and polysaccharides are all polymers 
that are produced by the repeated addition of a monomer onto one end of 
a growing chain. The synthesis reactions for these three types of macromolecules 
are outlined in Figure 2-41. As indicated, the condensation step in each case 
depends on energy from nucleoside triphosphate hydrolysis. And yet, except for 
the nucleic acids, there are no phosphate groups left in the final product molecules. 
How are the reactions that release the energy of ATP hydrolysis coupled to 
polymer synthesis? 

For each type of macromolecule, an enzyme-catalyzed pathway exists which 
resembles that discussed previously for the synthesis of the amino acid glutamine 
(see Figure 2-35). The principle is exactly the same, in that the -OH group that will 
Figure 2-40 A carboxyl group-transfer reaction using an activated carrier molecule. Carboxylated biotin is used by the enzyme pyruvate 
carboxylase to transfer a carboxyl group in the production of oxaloacetate, a molecule needed for the citric acid cycle. The acceptor molecule for 
this group-transfer reaction is pyruvate. Other enzymes use biotin, a B-complex vitamin, to transfer carboxyl groups to other acceptor molecules. 
Note that synthesis of carboxylated biotin requires energy that is derived from ATP, a general feature of many activated carriers. 
Figure 2-41 The synthesis of polysaccharides, proteins, and nucleic acids. Synthesis 
of each kind of biological polymer involves the loss of water in a condensation reaction. 
Not shown is the consumption of high-energy nucleoside triphosphates that is required to 
activate each monomer before its addition. In contrast, the reverse reaction (the breakdown 
of all three types of polymers) occurs by the simple addition of water (hydrolysis). 


be removed in the condensation reaction is first activated by becoming involved 
in a high-energy linkage to a second molecule. However, the actual mechanisms 
used to link ATP hydrolysis to the synthesis of proteins and polysaccharides are 
more complex than that used for glutamine synthesis, since a series of high-energy 
intermediates is required to generate the final high-energy bond that is broken 
during the condensation step (discussed in Chapter 6 for protein synthesis). 

Each activated carrier has limits in its ability to drive a biosynthetic reaction. 
The ΔG for the hydrolysis of ATP to ADP and inorganic phosphate (Pi) depends 
on the concentrations of all of the reactants, but under the usual conditions in a 
cell it is between -46 and -54 kJ/mole. In principle, this hydrolysis reaction could 
drive an unfavorable reaction with a ΔG of, perhaps, +40 kJ/mole, provided that a 
suitable reaction path is available. For some biosynthetic reactions, however, even 
-50 kJ/mole does not provide enough of a driving force. In these cases, the path 
of ATP hydrolysis can be altered so that it initially produces AMP and pyrophosphate 
(PPi), which is itself then hydrolyzed in a subsequent step (Figure 2-42). 
The whole process makes available a total free-energy change of about -100 kJ/mole. 
An important type of biosynthetic reaction that is driven in this way is the 
Figure 2-42 An alternative pathway of 
ATP hydrolysis, in which pyrophosphate 
is first formed and then hydrolyzed. This 
route releases about twice as much free 
energy (approximately -100 kJ/mole) as 
the reaction shown earlier in Figure 2-33, 
and it forms AMP instead of ADP. (A) In 
the two successive hydrolysis reactions, 
oxygen atoms from the participating water 
molecules are retained in the products, as 
indicated, whereas the hydrogen atoms 
dissociate to form free hydrogen ions 
(H+, not shown). (B) Summary of overall 
reaction. 




synthesis of nucleic acids (polynucleotides) from nucleoside triphosphates, as 
illustrated on the right side of Figure 2-43. 
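
The energy bookkeeping in the preceding paragraph can be sketched in a few lines of Python. This is only an illustration: the two driver values are the approximate cellular figures quoted in the text (about -50 kJ/mole for ATP to ADP + Pi, and about -100 kJ/mole in total for the route through AMP and pyrophosphate), while the function names and the +70 kJ/mole example reaction are assumptions made for the sketch.

```python
# Approximate free-energy changes under cellular conditions (kJ/mole),
# rounded from the ranges quoted in the text; not measured constants.
DG_ATP_TO_ADP = -50.0         # ATP -> ADP + Pi
DG_ATP_TO_AMP_TOTAL = -100.0  # ATP -> AMP + PPi, followed by PPi -> 2 Pi


def coupled_delta_g(dg_unfavorable, dg_driver):
    """Net free-energy change of two coupled reactions (the values add)."""
    return dg_unfavorable + dg_driver


def can_drive(dg_unfavorable, dg_driver):
    """A coupled process can proceed only if the net change is negative."""
    return coupled_delta_g(dg_unfavorable, dg_driver) < 0


# Hypothetical reactions requiring +40 and +70 kJ/mole.
print(can_drive(40.0, DG_ATP_TO_ADP))        # ordinary hydrolysis is enough
print(can_drive(70.0, DG_ATP_TO_ADP))        # ordinary hydrolysis falls short
print(can_drive(70.0, DG_ATP_TO_AMP_TOTAL))  # the pyrophosphate route suffices
```

The sketch ignores the requirement, noted in the text, that a suitable reaction path must exist; a negative sum is necessary for coupling but does not by itself supply the mechanism.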

Note that the repetitive condensation reactions that produce macromolecules 
can be oriented in one of two ways, giving rise to either the head polymerization 
or the tail polymerization of monomers. In so-called head polymerization, the 
reactive bond required for the condensation reaction is carried on the end of the 



Figure 2-43 Synthesis of a polynucleotide, RNA or DNA, is a multistep process driven by ATP 
hydrolysis. In the first step, a nucleoside monophosphate is activated by the sequential transfer of 
the terminal phosphate groups from two ATP molecules. The high-energy intermediate formed, a 
nucleoside triphosphate, exists free in solution until it reacts with the growing end of an RNA or a 
DNA chain with release of pyrophosphate. Hydrolysis of the latter to inorganic phosphate is highly 
favorable and helps to drive the overall reaction in the direction of polynucleotide synthesis. For 
details, see Chapter 5. 
growing polymer, and it must therefore be regenerated each time that a monomer 
is added. In this case, each monomer brings with it the reactive bond that will be 
used in adding the next monomer in the series. In tail polymerization, the reactive 
bond carried by each monomer is instead used immediately for its own addition 
(Figure 2-44). 

We shall see in later chapters that both of these types of polymerization are 
used. The synthesis of polynucleotides and some simple polysaccharides occurs 
by tail polymerization, for example, whereas the synthesis of proteins occurs by a 
head polymerization process. 


Figure 2-44 The orientation of the 
active intermediates in the repetitive 
condensation reactions that form 
biological polymers. The head growth of 
polymers is compared with its alternative, 
tail growth. As indicated, these two 
mechanisms are used to produce different 
types of biological macromolecules. 


Summary 

Living cells need to create and maintain order within themselves to survive and 
grow. This is thermodynamically possible only because of a continual input of 
energy, part of which must be released from the cells to their environment as heat 
that disorders the surroundings. The only chemical reactions possible are those that 
increase the total amount of disorder in the universe. The free-energy change for a 
reaction, ΔG, measures this disorder, and it must be less than zero for a reaction 
to proceed spontaneously. This ΔG depends both on the intrinsic properties of the 
reacting molecules and their concentrations, and it can be calculated from these 
concentrations if either the equilibrium constant (K) for the reaction or its standard 
free-energy change, ΔG°, is known. 
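
As a rough numerical companion to this point, the sketch below uses the standard relations ΔG° = -RT ln K and ΔG = ΔG° + RT ln([products]/[reactants]). The temperature of 310 K (about 37°C), the function names, and the example equilibrium constant are assumptions chosen for illustration.

```python
import math

R_KJ = 8.314e-3  # gas constant in kJ/(mole*K)


def standard_delta_g_from_k(k_eq, temp_k=310.0):
    """Standard free-energy change (kJ/mole) from the equilibrium constant K."""
    return -R_KJ * temp_k * math.log(k_eq)


def delta_g(delta_g0, products_over_reactants, temp_k=310.0):
    """Free-energy change (kJ/mole) at a given product/reactant concentration ratio."""
    return delta_g0 + R_KJ * temp_k * math.log(products_over_reactants)


# Hypothetical reaction with K = 10: favorable when products are scarce,
# at equilibrium when the ratio equals K, unfavorable when products pile up.
dg0 = standard_delta_g_from_k(10.0)
for ratio in (0.1, 10.0, 1000.0):
    print(ratio, round(delta_g(dg0, ratio), 2))
```

When the concentration ratio equals K, the two terms cancel and ΔG is zero, which is the definition of equilibrium.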

The energy needed for life comes ultimately from the electromagnetic radiation 
of the sun, which drives the formation of organic molecules in photosynthetic 
organisms such as green plants. Animals obtain their energy by eating organic molecules 
and oxidizing them in a series of enzyme-catalyzed reactions that are coupled to the 
formation of ATP, a common currency of energy in all cells. 

To make possible the continual generation of order in cells, energetically favorable 
reactions, such as the hydrolysis of ATP, are coupled to energetically unfavorable 
reactions. In the biosynthesis of macromolecules, ATP is used to form reactive 
phosphorylated intermediates. Because the energetically unfavorable reaction of 
biosynthesis now becomes energetically favorable, ATP hydrolysis is said to drive 
the reaction. Polymeric molecules such as proteins, nucleic acids, and polysaccharides 
are assembled from small activated precursor molecules by repetitive condensation 
reactions that are driven in this way. Other reactive molecules, called either 
activated carriers or coenzymes, transfer other chemical groups in the course of 
biosynthesis: NADPH transfers hydrogen as a proton plus two electrons (a hydride 
ion), for example, whereas acetyl CoA transfers an acetyl group. 


HOW CELLS OBTAIN ENERGY FROM FOOD 

The constant supply of energy that cells need to generate and maintain the 
biological order that keeps them alive comes from the chemical-bond energy in food 
molecules. 

The proteins, lipids, and polysaccharides that make up most of the food we eat 
must be broken down into smaller molecules before our cells can use them, either 









Chapter 2: Cell Chemistry and Bioenergetics 


as a source of energy or as building blocks for other molecules. Enzymatic digestion 
breaks down the large polymeric molecules in food into their monomer subunits: 
proteins into amino acids, polysaccharides into sugars, and fats into fatty 
acids and glycerol. After digestion, the small organic molecules derived from food 
enter the cytosol of cells, where their gradual oxidation begins. 

Sugars are particularly important fuel molecules, and they are oxidized in 
small controlled steps to carbon dioxide (CO2) and water (Figure 2-45). In this 
section, we trace the major steps in the breakdown, or catabolism, of sugars and 
show how they produce ATP, NADH, and other activated carrier molecules in animal 
cells. A very similar pathway also operates in plants, fungi, and many bacteria. 
As we shall see, the oxidation of fatty acids is equally important for cells. Other 
molecules, such as proteins, can also serve as energy sources when they are funneled 
through appropriate enzymatic pathways. 

Glycolysis Is a Central ATP-Producing Pathway 

The major process for oxidizing sugars is the sequence of reactions known as 
glycolysis (from the Greek glukus, “sweet,” and lusis, “rupture”). Glycolysis produces 
ATP without the involvement of molecular oxygen (O2 gas). It occurs in the 
cytosol of most cells, including many anaerobic microorganisms. Glycolysis probably 
evolved early in the history of life, before photosynthetic organisms introduced 
oxygen into the atmosphere. During glycolysis, a glucose molecule with six 
carbon atoms is converted into two molecules of pyruvate, each of which contains 
three carbon atoms. For each glucose molecule, two molecules of ATP are hydrolyzed 
to provide energy to drive the early steps, but four molecules of ATP are produced 
in the later steps. At the end of glycolysis, there is consequently a net gain 
of two molecules of ATP for each glucose molecule broken down. Two molecules 
of the activated carrier NADH are also produced. 
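
The accounting in this paragraph is simple enough to write down directly. The sketch below tallies the per-glucose figures stated above; the constant and function names are illustrative assumptions.

```python
# Per-glucose tallies for glycolysis, as stated in the text.
ATP_INVESTED = 2     # hydrolyzed to drive the early steps
ATP_PRODUCED = 4     # made in the later steps
NADH_PRODUCED = 2
PYRUVATE_PRODUCED = 2


def glycolysis_yield(glucose_molecules):
    """Net products of glycolysis for a given number of glucose molecules."""
    return {
        "pyruvate": PYRUVATE_PRODUCED * glucose_molecules,
        "net_atp": (ATP_PRODUCED - ATP_INVESTED) * glucose_molecules,
        "nadh": NADH_PRODUCED * glucose_molecules,
    }


print(glycolysis_yield(1))
```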

The glycolytic pathway is outlined in Figure 2-46 and shown in more detail in 
Panel 2-8 (pp. 104-105) and Movie 2.5. Glycolysis involves a sequence of 10 separate 
reactions, each producing a different sugar intermediate and each catalyzed 
by a different enzyme. Like most enzymes, these have names ending in ase (such 
as isomerase and dehydrogenase) to indicate the type of reaction they catalyze. 

Although no molecular oxygen is used in glycolysis, oxidation occurs, in that 
electrons are removed by NAD+ (producing NADH) from some of the carbons 
derived from the glucose molecule. The stepwise nature of the process releases 
the energy of oxidation in small packets, so that much of it can be stored in activated 
carrier molecules rather than all of it being released as heat (see Figure 
2-45). Thus, some of the energy released by oxidation drives the direct synthesis 
of ATP molecules from ADP and Pi, and some remains with the electrons in the 
electron carrier NADH. 


Figure 2-45 Schematic representation 
of the controlled stepwise oxidation of 
sugar in a cell, compared with ordinary 
burning. (A) If the sugar were oxidized to 
CO2 and H2O in a single step, it would 
release an amount of energy much larger 
than could be captured for useful purposes. 
(B) In the cell, enzymes catalyze oxidation 
via a series of small steps in which free 
energy is transferred in conveniently sized 
packets to carrier molecules, most often 
ATP and NADH. At each step, an enzyme 
controls the reaction by reducing the 
activation-energy barrier that has to be 
surmounted before the specific reaction 
can occur. The total free energy released is 
exactly the same in (A) and (B). 


(A) DIRECT BURNING OF SUGAR 
IN NONLIVING SYSTEM 


(B) STEPWISE OXIDATION OF SUGAR IN CELLS 
Two molecules of NADH are formed per molecule of glucose in the course of 
glycolysis. In aerobic organisms, these NADH molecules donate their electrons to 
the electron-transport chain described in Chapter 14, and the NAD+ formed from 
the NADH is used again for glycolysis (see step 6 in Panel 2-8, pp. 104-105). 

Fermentations Produce ATP in the Absence of Oxygen 

For most animal and plant cells, glycolysis is only a prelude to the final stage of 
the breakdown of food molecules. In these cells, the pyruvate formed by glycolysis 
is rapidly transported into the mitochondria, where it is converted into CO2 plus 
acetyl CoA, whose acetyl group is then completely oxidized to CO2 and H2O. 

In contrast, for many anaerobic organisms, which do not utilize molecular 
oxygen and can grow and divide without it, glycolysis is the principal source of 
the cell's ATP. Certain animal tissues, such as skeletal muscle, can also continue 
to function when molecular oxygen is limited. In these anaerobic conditions, the 
pyruvate and the NADH electrons stay in the cytosol. The pyruvate is converted 
into products excreted from the cell: for example, into ethanol and CO2 in the 
yeasts used in brewing and breadmaking, or into lactate in muscle. In this process, 
the NADH gives up its electrons and is converted back into NAD+. This regeneration 
of NAD+ is required to maintain the reactions of glycolysis (Figure 2-47). 

Energy-yielding pathways like these, in which organic molecules both donate 
and accept electrons (and which are often, as in these cases, anaerobic), are called 
Figure 2-46 An outline of glycolysis. 
Each of the 10 steps shown is catalyzed 
by a different enzyme. Note that step 4 
cleaves a six-carbon sugar into two three- 
carbon sugars, so that the number of 
molecules at every stage after this doubles. 
As indicated, step 6 begins the energy- 
generation phase of glycolysis. Because 
two molecules of ATP are hydrolyzed in the 
early, energy-investment phase, glycolysis 
results in the net synthesis of 2 ATP and 2 
NADH molecules per molecule of glucose 
(see also Panel 2-8). 
(A) FERMENTATION LEADING TO EXCRETION OF LACTATE 
Figure 2-47 Two pathways for the 
anaerobic breakdown of pyruvate. 
(A) When there is inadequate oxygen, 
for example, in a muscle cell undergoing 
vigorous contraction, the pyruvate 
produced by glycolysis is converted to 
lactate as shown. This reaction regenerates 
the NAD+ consumed in step 6 of glycolysis, 
but the whole pathway yields much less 
energy overall than complete oxidation. 
(B) In some organisms that can grow 
anaerobically, such as yeasts, pyruvate is 
converted via acetaldehyde into carbon 
dioxide and ethanol. Again, this pathway 
regenerates NAD+ from NADH, as required 
to enable glycolysis to continue. Both (A) 
and (B) are examples of fermentations. 


(B) FERMENTATION LEADING TO EXCRETION OF ETHANOL AND CO2 
fermentations. Studies of the commercially important fermentations carried out 
by yeasts inspired much of early biochemistry. Work in the nineteenth century 
led in 1896 to the then startling recognition that these processes could be studied 
outside living organisms, in cell extracts. This revolutionary discovery eventually 
made it possible to dissect out and study each of the individual reactions in the 
fermentation process. The piecing together of the complete glycolytic pathway in 
the 1930s was a major triumph of biochemistry, and it was quickly followed by the 
recognition of the central role of ATP in cell processes. 

Glycolysis Illustrates How Enzymes Couple Oxidation to Energy 
Storage 

The formation of ATP during glycolysis provides a particularly clear demonstration 
of how enzymes couple energetically unfavorable reactions with favorable 
ones, thereby driving the many chemical reactions that make life possible. Two 
central reactions in glycolysis (steps 6 and 7) convert the three-carbon sugar intermediate 
glyceraldehyde 3-phosphate (an aldehyde) into 3-phosphoglycerate (a 
carboxylic acid; see Panel 2-8, pp. 104-105), thus oxidizing an aldehyde group to a 
carboxylic acid group. The overall reaction releases enough free energy to convert 
a molecule of ADP to ATP and to transfer two electrons (and a proton) from the 
aldehyde to NAD+ to form NADH, while still liberating enough heat to the environment 
to make the overall reaction energetically favorable (ΔG° for the overall 
reaction is -12.5 kJ/mole). 

Figure 2-48 outlines this remarkable feat of energy harvesting. The chemical 
reactions are precisely guided by two enzymes to which the sugar intermediates 
(A) STEPS 6 AND 7 OF GLYCOLYSIS 
A short-lived covalent bond is 
formed between glyceraldehyde 
3-phosphate and the -SH group of 
a cysteine side chain of the enzyme 
glyceraldehyde 3-phosphate 
dehydrogenase. The enzyme also 
binds noncovalently to NAD+. 


Glyceraldehyde 3-phosphate is 
oxidized as the enzyme removes a 
hydrogen atom (yellow) and 
transfers it, along with an electron, 
to NAD+, forming NADH (see 
Figure 2-37). Part of the energy 
released by the oxidation of the 
aldehyde is thus stored in NADH, 
and part is stored in the high- 
energy thioester bond that links 
glyceraldehyde 3-phosphate to the 
enzyme. 


A molecule of inorganic phosphate 
displaces the high-energy thioester 
bond to create 1,3-bisphosphoglycerate, 
which contains a 
high-energy phosphate bond. 
Figure 2-48 Energy storage in steps 
6 and 7 of glycolysis. (A) In step 6, the 
enzyme glyceraldehyde 3-phosphate 
dehydrogenase couples the energetically 
favorable oxidation of an aldehyde to 
the energetically unfavorable formation 
of a high-energy phosphate bond. At 
the same time, it enables energy to be 
stored in NADH. The formation of the 
high-energy phosphate bond is driven by 
the oxidation reaction, and the enzyme 
thereby acts like the “paddle wheel” 
coupler in Figure 2-32B. In step 7, the 
newly formed high-energy phosphate bond 
in 1,3-bisphosphoglycerate is transferred 
to ADP, forming a molecule of ATP and 
leaving a free carboxylic acid group on the 
oxidized sugar. The part of the molecule 
that undergoes a change is shaded in 
blue; the rest of the molecule remains 
unchanged throughout all these reactions. 
(B) Summary of the overall chemical 
change produced by reactions 6 and 7. 


SUMMARY OF STEPS 6 AND 7 



The oxidation of an aldehyde to a 
carboxylic acid releases energy, 
much of which is captured in the 
activated carriers ATP and NADH. 
are tightly bound. As detailed in Figure 2-48, the first enzyme (glyceraldehyde 
3-phosphate dehydrogenase) forms a short-lived covalent bond to the aldehyde 
through a reactive -SH group on the enzyme, and catalyzes its oxidation by NAD+ 
in this attached state. The reactive enzyme-substrate bond is then displaced by 
an inorganic phosphate ion to produce a high-energy phosphate intermediate, 
which is released from the enzyme. This intermediate binds to the second enzyme 
(phosphoglycerate kinase), which catalyzes the energetically favorable transfer of 
the high-energy phosphate just created to ADP, forming ATP and completing the 
process of oxidizing an aldehyde to a carboxylic acid. Note that the C-H bond 
oxidation energy in step 6 drives the formation of both NADH and a high-energy 
phosphate bond. The breakage of the high-energy bond then drives ATP formation. 

We have shown this particular oxidation process in some detail because it provides 
a clear example of enzyme-mediated energy storage through coupled reactions 
(Figure 2-49). Steps 6 and 7 are the only reactions in glycolysis that create a 
high-energy phosphate linkage directly from inorganic phosphate. As such, they 
account for the net yield of two ATP molecules and two NADH molecules per molecule 
of glucose (see Panel 2-8, pp. 104-105). 

As we have just seen, ATP can be formed readily from ADP when a reaction 
intermediate is formed with a phosphate bond of higher energy than the terminal 
phosphate bond in ATP. Phosphate bonds can be ordered in energy by comparing 
the standard free-energy change (ΔG°) for the breakage of each bond by hydrolysis. 
Figure 2-50 compares the high-energy phosphoanhydride bonds in ATP with 
the energy of some other phosphate bonds, several of which are generated during 
glycolysis. 
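
The ordering rule can be expressed as a small comparison. In the sketch below, the standard free energies of hydrolysis are rounded, textbook-style values supplied for illustration only; they are assumptions, not figures from this section, and the dictionary keys and function name are likewise hypothetical.

```python
# Approximate standard free energies of hydrolysis (kJ/mole).
# Rounded illustrative values, assumed for this sketch.
DG0_HYDROLYSIS = {
    "phosphoenolpyruvate": -62.0,
    "1,3-bisphosphoglycerate": -49.0,
    "ATP (to ADP)": -30.5,
    "glucose 6-phosphate": -14.0,
}


def transfer_delta_g0(donor, product):
    """Standard free-energy change for moving a phosphate from `donor`
    to an acceptor, yielding the phosphorylated `product`.

    Equivalent to hydrolyzing the donor bond and reversing the
    hydrolysis of the product bond.
    """
    return DG0_HYDROLYSIS[donor] - DG0_HYDROLYSIS[product]


# 1,3-bisphosphoglycerate -> ADP (forming ATP): negative, so favorable.
print(transfer_delta_g0("1,3-bisphosphoglycerate", "ATP (to ADP)"))
# Glucose 6-phosphate -> ADP (forming ATP): positive, so unfavorable.
print(transfer_delta_g0("glucose 6-phosphate", "ATP (to ADP)"))
```

These are standard-state comparisons; in a living cell the actual concentrations of reactants and products shift the real free-energy change.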

Organisms Store Food Molecules in Special Reservoirs 

All organisms need to maintain a high ATP/ADP ratio to maintain biological order 
in their cells. Yet animals have only periodic access to food, and plants need to 
survive overnight without sunlight, when they are unable to produce sugar from 
photosynthesis. For this reason, both plants and animals convert sugars and fats 
to special forms for storage (Figure 2-51). 

To compensate for long periods of fasting, animals store fatty acids as fat 
droplets composed of water-insoluble triacylglycerols (also called triglycerides). 
The triacylglycerols in animals are mostly stored in the cytoplasm of specialized 
fat cells called adipocytes. For shorter-term storage, sugar is stored as glucose 
Figure 2-49 Schematic view of the 
coupled reactions that form NADH and 
ATP in steps 6 and 7 of glycolysis. The 

C-H bond oxidation energy drives the 
formation of both NADH and a high-energy 
phosphate bond. The breakage of the high- 
energy bond then drives ATP formation. 
Figure 2-50 Phosphate bonds have different energies. Examples of different types of phosphate bonds with 
their sites of hydrolysis are shown in the molecules depicted on the left. Those starting with a gray carbon atom 
show only part of a molecule. Examples of molecules containing such bonds are given on the right, with the 
standard free-energy change for hydrolysis in kilojoules. The transfer of a phosphate group from one molecule 
to another is energetically favorable if the free-energy change (ΔG) for hydrolysis of the phosphate bond of 
the first molecule is more negative than that for hydrolysis of the phosphate bond in the second. Thus, under 
standard conditions, a phosphate group is readily transferred from 1,3-bisphosphoglycerate to ADP to form 
ATP. (Standard conditions often do not pertain to living cells, where the relative concentrations of reactants and 
products will influence the actual change in free energy.) The hydrolysis reaction can be viewed as the transfer 
of the phosphate group to water. 


subunits in the large branched polysaccharide glycogen, which is present as 
small granules in the cytoplasm of many cells, including liver and muscle. The 
synthesis and degradation of glycogen are rapidly regulated according to need. 
When cells need more ATP than they can generate from the food molecules taken 
in from the bloodstream, they break down glycogen in a reaction that produces 
glucose 1-phosphate, which is rapidly converted to glucose 6-phosphate for 
glycolysis (Figure 2-52). 

Quantitatively, fat is far more important than glycogen as an energy store for 
animals, presumably because it provides for more efficient storage. The oxidation 
of a gram of fat releases about twice as much energy as the oxidation of a gram 
of glycogen. Moreover, glycogen differs from fat in binding a great deal of water, 
producing a sixfold difference in the actual mass of glycogen required to store the 
same amount of energy as fat. An average adult human stores enough glycogen 
Figure 2-51 The storage of sugars and fats in animal and plant cells. (A) The structures of starch and glycogen, the 
storage form of sugars in plants and animals, respectively. Both are storage polymers of the sugar glucose and differ only in the 
frequency of branch points. There are many more branches in glycogen than in starch. (B) An electron micrograph of glycogen 
granules in the cytoplasm of a liver cell. (C) A thin section of a chloroplast from a plant cell, showing the starch granules and lipid 
(fat droplets) that have accumulated as a result of the biosyntheses occurring there. (D) Fat droplets (stained red) beginning to 
accumulate in developing fat cells of an animal. (B, courtesy of Robert Fletterick and Daniel S. Friend; C, courtesy of K. Plaskitt; 
D, courtesy of Ronald M. Evans and Peter Tontonoz.) 


for only about a day of normal activities, but enough fat to last for nearly a month. 
If our main fuel reservoir had to be carried as glycogen instead of fat, body weight 
would increase by an average of about 60 pounds. 
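
The mass penalty described above follows directly from the sixfold ratio. The sketch below takes that ratio from the text; the function names are illustrative, and the 12-pound input is an assumed figure back-calculated so that the result matches the roughly 60 pounds quoted, not a value given in the source.

```python
# Mass of hydrated glycogen needed to store the same energy as a unit
# mass of fat, as stated in the text.
GLYCOGEN_TO_FAT_MASS_RATIO = 6


def glycogen_mass_equivalent(fat_mass):
    """Mass of glycogen storing the same energy as `fat_mass` of fat."""
    return GLYCOGEN_TO_FAT_MASS_RATIO * fat_mass


def extra_mass_if_stored_as_glycogen(fat_mass):
    """Added body mass if a fat reserve were replaced by glycogen."""
    return glycogen_mass_equivalent(fat_mass) - fat_mass


# Assumed reserve of 12 pounds of fat (hypothetical input).
print(extra_mass_if_stored_as_glycogen(12))
```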

The sugar and ATP needed by plant cells are largely produced in separate 
organelles: sugars in chloroplasts (the organelles specialized for photosynthesis), 
Figure 2-52 How sugars are produced 
from glycogen. Glucose subunits are 
released from glycogen by the enzyme 
glycogen phosphorylase. This produces 
glucose 1-phosphate, which is rapidly 
converted to glucose 6-phosphate for 
glycolysis. 
Figure 2-53 Some plant seeds that 
serve as important foods for humans. 
Corn, nuts, and peas all contain rich stores 
of starch and fat that provide the young 
plant embryo in the seed with energy and 
building blocks for biosynthesis. (Courtesy 
of the John Innes Foundation.) 


and ATP in mitochondria. Although plants produce abundant amounts of both 
ATP and NADPH in their chloroplasts, this organelle is isolated from the rest of its 
plant cell by a membrane that is impermeable to both types of activated carrier 
molecules. Moreover, the plant contains many cells (such as those in the roots) 
that lack chloroplasts and therefore cannot produce their own sugars. Thus, sugars 
are exported from chloroplasts to the mitochondria present in all cells of the 
plant. Most of the ATP needed for general plant cell metabolism is synthesized in 
these mitochondria, using exactly the same pathways for the oxidative breakdown 
of sugars as in nonphotosynthetic organisms; this ATP is then passed to the rest of 
the cell (see Figure 14-42). 

During periods of excess photosynthetic capacity during the day, chloroplasts 
convert some of the sugars that they make into fats and into starch, a polymer of 
glucose analogous to the glycogen of animals. The fats in plants are triacylglycerols 
(triglycerides), just like the fats in animals, and differ only in the types of 
fatty acids that predominate. Fat and starch are both stored inside the chloroplast 
until needed for energy-yielding oxidation during periods of darkness (see Figure 
2-51C). 

The embryos inside plant seeds must live on stored sources of energy for a 
prolonged period, until they germinate and produce leaves that can harvest the 
energy in sunlight. For this reason plant seeds often contain especially large 
amounts of fats and starch, which makes them a major food source for animals, 
including ourselves (Figure 2-53). 

Most Animal Cells Derive Their Energy from Fatty Acids Between 
Meals 

After a meal, most of the energy that an animal needs is derived from sugars 
obtained from food. Excess sugars, if any, are used to replenish depleted glycogen 
stores, or to synthesize fats as a food store. But soon the fat stored in adipose tissue 
is called into play, and by the morning after an overnight fast, fatty acid oxidation 
generates most of the ATP we need. 

Low glucose levels in the blood trigger the breakdown of fats for energy production. 
As illustrated in Figure 2-54, the triacylglycerols stored in fat droplets 
in adipocytes are hydrolyzed to produce fatty acids and glycerol, and the fatty 
acids released are transferred to cells in the body through the bloodstream. While 
animals readily convert sugars to fats, they cannot convert fatty acids to sugars. 
Instead, the fatty acids are oxidized directly. 

Sugars and Fats Are Both Degraded to Acetyl CoA in 
Mitochondria 

In aerobic metabolism, the pyruvate that was produced by glycolysis from sugars 
in the cytosol is transported into the mitochondria of eukaryotic cells. There, it is 



Figure 2-54 How stored fats are 
mobilized for energy production in 
animals. Low glucose levels in the blood 
trigger the hydrolysis of the triacylglycerol 
molecules in fat droplets to free fatty 
acids and glycerol. These fatty acids enter 
the bloodstream, where they bind to the 
abundant blood protein, serum albumin. 
Special fatty acid transporters in the 
plasma membrane of cells that oxidize fatty 
acids, such as muscle cells, then pass 
these fatty acids into the cytosol, from 
which they are moved into mitochondria for 
energy production. 


rapidly decarboxylated by a giant complex of three enzymes, called the pyruvate 
dehydrogenase complex. The products of pyruvate decarboxylation are a molecule 
of CO2 (a waste product), a molecule of NADH, and acetyl CoA (see Panel 2-9). 

The fatty acids imported from the bloodstream are moved into mitochondria,
where all of their oxidation takes place (Figure 2-55). Each molecule of fatty acid
(as the activated molecule fatty acyl CoA) is broken down completely by a cycle of
reactions that trims two carbons at a time from its carboxyl end, generating one
molecule of acetyl CoA for each turn of the cycle. A molecule of NADH and a
molecule of FADH2 are also produced in this process (Figure 2-56).
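
The two-carbon bookkeeping described above can be sketched in a few lines of Python. This is only an illustration: the function name `beta_oxidation_products` is hypothetical, the sketch covers only saturated fatty acids with an even number of carbons, and it assumes the standard detail (not spelled out in the text) that the final turn splits a four-carbon chain into two acetyl CoA molecules, so a chain of n carbons needs n/2 - 1 turns.

```python
def beta_oxidation_products(n_carbons: int) -> dict:
    """Tally the products of oxidizing a saturated, even-chain fatty acyl CoA to acetyl CoA."""
    if n_carbons < 4 or n_carbons % 2 != 0:
        raise ValueError("this sketch handles only even chains of at least 4 carbons")
    # Each turn removes two carbons; the last turn leaves a second acetyl CoA behind.
    turns = n_carbons // 2 - 1
    return {
        "turns": turns,
        "acetyl_CoA": n_carbons // 2,
        "NADH": turns,    # one NADH per turn
        "FADH2": turns,   # one FADH2 per turn
    }


# Example: a 16-carbon fatty acid (palmitate is used here as an assumed example).
palmitate = beta_oxidation_products(16)
print(palmitate)
```

Each acetyl CoA counted here then enters the citric acid cycle, so the tally gives a feel for how much reducing power a single long-chain fatty acid can deliver.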

Sugars and fats are the major energy sources for most nonphotosynthetic
organisms, including humans. However, most of the useful energy that can be
extracted from the oxidation of both types of foodstuffs remains stored in the
acetyl CoA molecules that are produced by the two types of reactions just described.
The citric acid cycle of reactions, in which the acetyl group (-COCH3) in acetyl
CoA is oxidized to CO2 and H2O, is therefore central to the energy metabolism
of aerobic organisms. In eukaryotes, these reactions all take place in
mitochondria. We should therefore not be surprised to discover that the mitochondrion is
the place where most of the ATP is produced in animal cells. In contrast, aerobic
bacteria carry out all of their reactions, including the citric acid cycle, in a single
compartment, the cytosol.


The Citric Acid Cycle Generates NADH by Oxidizing Acetyl Groups 
to CO2 


In the nineteenth century, biologists noticed that in the absence of air cells
produce lactic acid (for example, in muscle) or ethanol (for example, in yeast), while
in its presence they consume O2 and produce CO2 and H2O. Efforts to define the
pathways of aerobic metabolism eventually focused on the oxidation of pyruvate
and led in 1937 to the discovery of the citric acid cycle, also known as the
tricarboxylic acid cycle or the Krebs cycle. The citric acid cycle accounts for about
two-thirds of the total oxidation of carbon compounds in most cells, and its major
end products are CO2 and high-energy electrons in the form of NADH. The CO2
is released as a waste product, while the high-energy electrons from NADH are
passed to a membrane-bound electron-transport chain (discussed in Chapter
14), eventually combining with O2 to produce H2O. The citric acid cycle itself does
not use gaseous O2 (it uses oxygen atoms from H2O). But the cycle does require O2
in subsequent reactions to keep it going. This is because there is no other efficient
way for the NADH to get rid of its electrons and thus regenerate the NAD+ that is
needed.

Figure 2-55 Pathways for the production of acetyl CoA from sugars and fats. The
mitochondrion in eukaryotic cells is where acetyl CoA is produced from both types
of major food molecules. It is therefore the place where most of the cell's
oxidation reactions occur and where most of its ATP is made. Amino acids (not
shown) can also enter the mitochondria, to be converted there into acetyl CoA or
another intermediate of the citric acid cycle. The structure and function of
mitochondria are discussed in detail in Chapter 14.

The citric acid cycle takes place inside mitochondria in eukaryotic cells. It
results in the complete oxidation of the carbon atoms of the acetyl groups in
acetyl CoA, converting them into CO2. But the acetyl group is not oxidized directly.
Instead, this group is transferred from acetyl CoA to a larger, four-carbon
molecule, oxaloacetate, to form the six-carbon tricarboxylic acid, citric acid, for which
the subsequent cycle of reactions is named. The citric acid molecule is then
gradually oxidized, allowing the energy of this oxidation to be harnessed to produce
energy-rich activated carrier molecules. The chain of eight reactions forms a cycle
because at the end the oxaloacetate is regenerated and enters a new turn of the
cycle, as shown in outline in Figure 2-57.

We have thus far discussed only one of the three types of activated carrier
molecules that are produced by the citric acid cycle: NADH, the reduced form of
the NAD+/NADH electron carrier system (see Figure 2-36). In addition to three
molecules of NADH, each turn of the cycle also produces one molecule of FADH2
(reduced flavin adenine dinucleotide) from FAD (see Figure 2-39), and one
molecule of the ribonucleoside triphosphate GTP from GDP. The structure of GTP is
illustrated in Figure 2-58. GTP is a close relative of ATP, and the transfer of its
terminal phosphate group to ADP produces one ATP molecule in each cycle. As
we discuss shortly, the energy that is stored in the readily transferred electrons of
NADH and FADH2 will be utilized subsequently for ATP production through the
process of oxidative phosphorylation, the only step in the oxidative catabolism of
foodstuffs that directly requires gaseous oxygen (O2) from the atmosphere.

Figure 2-56 The oxidation of fatty acids to acetyl CoA. (A) Electron micrograph
of a lipid droplet in the cytoplasm. (B) The structure of fats. Fats are
triacylglycerols. The glycerol portion, to which three fatty acids are linked through
ester bonds, is shown in blue. Fats are insoluble in water and form large lipid
droplets in the specialized fat cells (adipocytes) in which they are stored. (C) The
fatty acid oxidation cycle. The cycle is catalyzed by a series of four enzymes in
mitochondria. Each turn of the cycle shortens the fatty acid chain by two carbons
(shown in red) and generates one molecule of acetyl CoA and one molecule each of
NADH and FADH2. (A, courtesy of Daniel S. Friend.)

Figure 2-57 Simple overview of the citric acid cycle. The reaction of acetyl CoA with
oxaloacetate starts the cycle by producing citrate (citric acid). In each turn of the
cycle, two molecules of CO2 are produced as waste products, plus three molecules
of NADH, one molecule of GTP, and one molecule of FADH2. The number of carbon
atoms in each intermediate is shown in a yellow box. For details, see Panel 2-9
(pp. 106-107). Net result: one turn of the cycle produces three NADH, one GTP, and
one FADH2 molecule, and releases two molecules of CO2.

Panel 2-9 (pp. 106-107) and Movie 2.6 present the complete citric acid cycle.
Water, rather than molecular oxygen, supplies the extra oxygen atoms required
to make CO2 from the acetyl groups entering the citric acid cycle. As illustrated in
the panel, three molecules of water are split in each cycle, and the oxygen atoms
of some of them are ultimately used to make CO2.
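
To make the per-turn accounting concrete, the following Python sketch scales the stated yields up to one molecule of glucose. It is a rough illustration rather than a full metabolic model: the per-turn and per-pyruvate numbers come from the text, whereas the constant `PYRUVATE_PER_GLUCOSE = 2` is an assumption drawn from glycolysis, and all names are hypothetical.

```python
# Products of one turn of the citric acid cycle (one acetyl CoA), as stated in the text.
PER_TURN = {"NADH": 3, "FADH2": 1, "GTP": 1, "CO2": 2}

# Products of the pyruvate dehydrogenase step (pyruvate -> acetyl CoA), as stated in the text.
PER_PYRUVATE = {"NADH": 1, "CO2": 1}

# Assumption: glycolysis splits one glucose into two pyruvate molecules.
PYRUVATE_PER_GLUCOSE = 2


def mitochondrial_yield_per_glucose() -> dict:
    """Sum the mitochondrial products obtained from the pyruvate derived from one glucose."""
    totals: dict = {}
    for step in (PER_PYRUVATE, PER_TURN):
        for product, count in step.items():
            totals[product] = totals.get(product, 0) + count * PYRUVATE_PER_GLUCOSE
    return totals


yield_per_glucose = mitochondrial_yield_per_glucose()
print(yield_per_glucose)
```

The CO2 total serves as a consistency check on the assumptions: it should account for all six carbon atoms of the original glucose molecule.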

In addition to pyruvate and fatty acids, some amino acids pass from the cytosol
into mitochondria, where they are also converted into acetyl CoA or one of the
other intermediates of the citric acid cycle. Thus, in the eukaryotic cell, the
mitochondrion is the center toward which all energy-yielding processes lead, whether
they begin with sugars, fats, or proteins.

Both the citric acid cycle and glycolysis also function as starting points for
important biosynthetic reactions by producing vital carbon-containing
intermediates, such as oxaloacetate and α-ketoglutarate. Some of these substances
produced by catabolism are transferred back from the mitochondria to the cytosol,
where they serve in anabolic reactions as precursors for the synthesis of many
essential molecules, such as amino acids (Figure 2-59).


Electron Transport Drives the Synthesis of the Majority of the ATP 
in Most Cells 

Most chemical energy is released in the last stage in the degradation of a food
molecule. In this final process, NADH and FADH2 transfer the electrons that they
gained when oxidizing food-derived organic molecules to the electron-transport
chain, which is embedded in the inner membrane of the mitochondrion (see
Figure 14-10). As the electrons pass along this long chain of specialized electron
acceptor and donor molecules, they fall to successively lower energy states. The
energy that the electrons release in this process pumps H+ ions (protons) across
the membrane, from the innermost mitochondrial compartment (the matrix)
to the intermembrane space (and then to the cytosol), generating a gradient of
H+ ions (Figure 2-60). This gradient serves as a major source of energy for cells,
being tapped like a battery to drive a variety of energy-requiring reactions. The
most prominent of these reactions is the generation of ATP by the
phosphorylation of ADP.

Figure 2-58 The structure of GTP. GTP and GDP are close relatives of ATP and
ADP, respectively.


At the end of this series of electron transfers, the electrons are passed to
molecules of oxygen gas (O2) that have diffused into the mitochondrion, which
simultaneously combine with protons (H+) from the surrounding solution to produce
water. The electrons have now reached a low energy level, and all the available
energy has been extracted from the oxidized food molecule. This process, termed
oxidative phosphorylation (Figure 2-61), also occurs in the plasma membrane
of bacteria. As one of the most remarkable achievements of cell evolution, it is a
central topic of Chapter 14.

In total, the complete oxidation of a molecule of glucose to H2O and CO2 is
used by the cell to produce about 30 molecules of ATP. In contrast, only 2
molecules of ATP are produced per molecule of glucose by glycolysis alone.


Amino Acids and Nucleotides Are Part of the Nitrogen Cycle

So far we have concentrated mainly on carbohydrate metabolism and have not yet
considered the metabolism of nitrogen or sulfur. These two elements are
important constituents of biological macromolecules. Nitrogen and sulfur atoms pass
from compound to compound and between organisms and their environment in
a series of reversible cycles.

Figure 2-59 Glycolysis and the citric acid cycle provide the precursors
needed to synthesize many important biological molecules. The amino acids,
nucleotides, lipids, sugars, and other molecules (shown here as products) in
turn serve as the precursors for the many macromolecules of the cell. Each black
arrow in this diagram denotes a single enzyme-catalyzed reaction; the red arrows
generally represent pathways with many steps that are required to produce the
indicated products.

Although molecular nitrogen is abundant in the Earth's atmosphere, nitrogen
is chemically unreactive as a gas. Only a few living species are able to
incorporate it into organic molecules, a process called nitrogen fixation. Nitrogen
fixation occurs in certain microorganisms and by some geophysical processes, such
as lightning discharge. It is essential to the biosphere as a whole, for without it life
could not exist on this planet. Only a small fraction of the nitrogenous compounds
in today's organisms, however, is due to fresh products of nitrogen fixation from
the atmosphere. Most organic nitrogen has been in circulation for some time,
passing from one living organism to another. Thus, present-day nitrogen-fixing
reactions can be said to perform a "topping-up" function for the total nitrogen
supply.

Vertebrates receive virtually all of their nitrogen from their dietary intake of
proteins and nucleic acids. In the body, these macromolecules are broken down
to amino acids and the components of nucleotides, and the nitrogen they contain
is used to produce new proteins and nucleic acids, or other molecules. About half
of the 20 amino acids found in proteins are essential amino acids for vertebrates
(Figure 2-62), which means that they cannot be synthesized from other
ingredients of the diet. The other amino acids can be so synthesized, using a variety of
raw materials, including intermediates of the citric acid cycle. The essential amino
acids are made by plants and other organisms, usually by long and energetically
expensive pathways that have been lost in the course of vertebrate evolution.

The nucleotides needed to make RNA and DNA can be synthesized using
specialized biosynthetic pathways. All of the nitrogens in the purine and pyrimidine
bases (as well as some of the carbons) are derived from the plentiful amino acids
glutamine, aspartic acid, and glycine, whereas the ribose and deoxyribose sugars
are derived from glucose. There are no "essential nucleotides" that must be
provided in the diet.

Amino acids not used in biosynthesis can be oxidized to generate metabolic
energy. Most of their carbon and hydrogen atoms eventually form CO2 or H2O,
whereas their nitrogen atoms are shuttled through various forms and eventually
appear as urea, which is excreted. Each amino acid is processed differently, and a
whole constellation of enzymatic reactions exists for their catabolism.

Sulfur is abundant on Earth in its most oxidized form, sulfate (SO4^2-). To be
useful for life, sulfate must be reduced to sulfide (S^2-), the oxidation state of sulfur
required for the synthesis of essential biological molecules, including the amino
acids methionine and cysteine, coenzyme A (see Figure 2-39), and the iron-sulfur
centers essential for electron transport (see Figure 14-16). The sulfur-reduction
process begins in bacteria, fungi, and plants, where a special group of enzymes
use ATP and reducing power to create a sulfate assimilation pathway. Humans
and other animals cannot reduce sulfate and must therefore acquire the sulfur
they need for their metabolism in the food that they eat.





Figure 2-60 The generation of an H+ gradient across a membrane by
electron-transport reactions. An electron held in a high-energy state (derived, for
example, from the oxidation of a metabolite) is passed sequentially by carriers A, B,
and C to a lower energy state. In this diagram, carrier B is arranged in the membrane
in such a way that it takes up H+ from one side and releases it to the other as
the electron passes. The result is an H+ gradient. As discussed in Chapter 14, this
gradient is an important form of energy that is harnessed by other membrane proteins
to drive the formation of ATP (for an actual example, see Figure 14-21).

Figure 2-61 The final stages of oxidation of food molecules. Molecules of NADH
and FADH2 (FADH2 is not shown) are produced by the citric acid cycle. These
activated carriers donate high-energy electrons that are eventually used to reduce
oxygen gas to water. A major portion of the energy released during the transfer of
these electrons along an electron-transfer chain in the mitochondrial inner membrane
(or in the plasma membrane of bacteria) is harnessed to drive the synthesis of ATP,
hence the name oxidative phosphorylation (discussed in Chapter 14).

Metabolism Is Highly Organized and Regulated

One gets a sense of the intricacy of a cell as a chemical machine from the relation
of glycolysis and the citric acid cycle to the other metabolic pathways sketched
out in Figure 2-63. This chart represents only some of the enzymatic pathways in
a human cell. It is obvious that our discussion of cell metabolism has dealt with
only a tiny fraction of the broad field of cell chemistry.

All these reactions occur in a cell that is less than 0.1 mm in diameter, and each
requires a different enzyme. As is clear from Figure 2-63, the same molecule can
often be part of many different pathways. Pyruvate, for example, is a substrate for
half a dozen or more different enzymes, each of which modifies it chemically in
a different way. One enzyme converts pyruvate to acetyl CoA, another to
oxaloacetate; a third enzyme changes pyruvate to the amino acid alanine, a fourth to
lactate, and so on. All of these different pathways compete for the same pyruvate
molecule, and similar competitions for thousands of other small molecules go on
at the same time.

The situation is further complicated in a multicellular organism. Different cell
types will in general require somewhat different sets of enzymes. And different
tissues make distinct contributions to the chemistry of the organism as a whole.
In addition to differences in specialized products such as hormones or
antibodies, there are significant differences in the "common" metabolic pathways among
various types of cells in the same organism.

Although virtually all cells contain the enzymes of glycolysis, the citric acid
cycle, lipid synthesis and breakdown, and amino acid metabolism, the levels
of these processes required in different tissues are not the same. For example,
nerve cells, which are probably the most fastidious cells in the body, maintain
almost no reserves of glycogen or fatty acids and rely almost entirely on a constant
supply of glucose from the bloodstream. In contrast, liver cells supply glucose to
actively contracting muscle cells and recycle the lactic acid produced by muscle
cells back into glucose. All types of cells have their distinctive metabolic traits, and
they cooperate extensively in the normal state, as well as in response to stress and
starvation. One might think that the whole system would need to be so finely
balanced that any minor upset, such as a temporary change in dietary intake, would
be disastrous.

Figure 2-62 The nine essential amino acids. These cannot be synthesized by
human cells and so must be supplied in the diet.

Figure 2-63 Glycolysis and the citric acid cycle are at the center of an elaborate
set of metabolic pathways in human cells. Some 2000 metabolic reactions are shown
schematically with the reactions of glycolysis and the citric acid cycle in red. Many
other reactions either lead into these two central pathways, delivering small
molecules to be catabolized with production of energy, or they lead outward and
thereby supply carbon compounds for the purpose of biosynthesis. (Adapted with
permission from Kanehisa Laboratories.)

In fact, the metabolic balance of a cell is amazingly stable. Whenever the
balance is perturbed, the cell reacts so as to restore the initial state. The cell can adapt
and continue to function during starvation or disease. Mutations of many kinds
can damage or even eliminate particular reaction pathways, and yet, provided
that certain minimum requirements are met, the cell survives. It does so because
an elaborate network of control mechanisms regulates and coordinates the rates
of all of its reactions. These controls rest, ultimately, on the remarkable abilities
of proteins to change their shape and their chemistry in response to changes in
their immediate environment. The principles that underlie how large molecules
such as proteins are built and the chemistry behind their regulation will be our
next concern.


WHAT WE DON'T KNOW

• Did chemiosmosis precede fermentation as the source of biological energy, or
did some form of fermentation come first, as had been assumed for many years?

• What is the minimum number of components required to make a living cell from
scratch? How might we find out?

• Are other life chemistries possible besides the single one known on Earth (and
described in this chapter)? When screening for life on other planets, what type of
chemical signatures should we search for?

• Is the shared chemistry inside all living cells a clue for deciphering the
environment on Earth where the first cells originated? For example, what might we
conclude from the universally shared high K+/Na+ ratio, neutral pH, and central
role of phosphates?


Summary

Glucose and other food molecules are broken down by controlled stepwise oxidation
to provide chemical energy in the form of ATP and NADH. There are three main sets
of reactions that act in series, the products of each being the starting material for the
next: glycolysis (which occurs in the cytosol), the citric acid cycle (in the
mitochondrial matrix), and oxidative phosphorylation (on the inner mitochondrial
membrane). The intermediate products of glycolysis and the citric acid cycle are used
both as sources of metabolic energy and to produce many of the small molecules
used as the raw materials for biosynthesis. Cells store sugar molecules as glycogen
in animals and starch in plants; both plants and animals also use fats extensively
as a food store. These storage materials in turn serve as a major source of food for
humans, along with the proteins that comprise the majority of the dry mass of most
of the cells in the foods we eat.


PROBLEMS 

Which statements are true? Explain why or why not. 

2-1 A 10^-8 M solution of HCl has a pH of 8.

2-2 Most of the interactions between macromolecules
could be mediated just as well by covalent bonds as by
noncovalent bonds.

2-3 Animals and plants use oxidation to extract energy
from food molecules.

2-4 If an oxidation occurs in a reaction, it must be
accompanied by a reduction.

2-5 Linking the energetically unfavorable reaction A
→ B to a second, favorable reaction B → C will shift the
equilibrium constant for the first reaction.

2-6 The criterion for whether a reaction proceeds
spontaneously is ΔG, not ΔG°, because ΔG takes into
account the concentrations of the substrates and products.

2-7 The oxygen consumed during the oxidation of
glucose in animal cells is returned as CO2 to the atmosphere.


Discuss the following problems. 

2-8 The organic chemistry of living cells is said to be
special for two reasons: it occurs in an aqueous
environment and it accomplishes some very complex reactions.
But do you suppose it is really all that much different from
the organic chemistry carried out in the top laboratories in
the world? Why or why not?

2-9 The molecular weight of ethanol (CH3CH2OH) is
46 and its density is 0.789 g/cm^3.

A. What is the molarity of ethanol in beer that is 5%
ethanol by volume? [Alcohol content of beer varies from
about 4% (lite beer) to 8% (stout beer).]

B. The legal limit for a driver's blood alcohol content
varies, but 80 mg of ethanol per 100 mL of blood (usually
referred to as a blood alcohol level of 0.08) is typical. What
is the molarity of ethanol in a person at this legal limit?

C. How many 12-oz (355-mL) bottles of 5% beer could
a 70-kg person drink and remain under the legal limit? A
70-kg person contains about 40 liters of water. Ignore the
metabolism of ethanol, and assume that the water content
of the person remains constant.

D. Ethanol is metabolized at a constant rate of about
120 mg per hour per kg body weight, regardless of its
concentration. If a 70-kg person were at twice the legal limit
(160 mg/100 mL), how long would it take for their blood
alcohol level to fall below the legal limit?
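
One way to set up the unit conversions for Problem 2-9 is sketched below in Python. It is offered as a possible scaffold, not an official solution: the variable and function names are hypothetical, and parts C and D assume that ethanol is distributed evenly through the 40 liters of body water at the same concentration as in blood.

```python
ETHANOL_MW = 46.0         # g/mol, from the problem
ETHANOL_DENSITY = 0.789   # g/mL (equivalent to g/cm^3), from the problem
LEGAL_LIMIT_G_PER_L = 0.8 # 80 mg per 100 mL, expressed in g/L
BODY_WATER_L = 40.0       # liters of water in a 70-kg person
BODY_MASS_KG = 70.0


def molarity_from_volume_fraction(fraction: float) -> float:
    """Molarity (mol/L) of ethanol in a drink with the given ethanol volume fraction."""
    grams_per_liter = fraction * 1000.0 * ETHANOL_DENSITY
    return grams_per_liter / ETHANOL_MW


# Part A: 5% ethanol by volume.
beer_molarity = molarity_from_volume_fraction(0.05)

# Part B: the legal limit expressed as a molarity.
limit_molarity = LEGAL_LIMIT_G_PER_L / ETHANOL_MW

# Part C: grams of ethanol per bottle versus grams in the body at the limit.
grams_per_bottle = 355.0 * 0.05 * ETHANOL_DENSITY
grams_at_limit = LEGAL_LIMIT_G_PER_L * BODY_WATER_L
bottles_at_limit = grams_at_limit / grams_per_bottle

# Part D: falling from twice the limit to the limit removes one limit's worth of ethanol.
clearance_g_per_hour = 0.120 * BODY_MASS_KG
hours_to_limit = grams_at_limit / clearance_g_per_hour

print(beer_molarity, limit_molarity, bottles_at_limit, hours_to_limit)
```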

2-10 A histidine side chain is known to play an
important role in the catalytic mechanism of an enzyme;
however, it is not clear whether histidine is required in its
protonated (charged) or unprotonated (uncharged) state. To
answer this question you measure enzyme activity over a
range of pH, with the results shown in Figure Q2-1. Which
form of histidine is required for enzyme activity?

Figure Q2-1 Enzyme activity as a function of pH (Problem 2-10).


2-11 The three molecules in Figure Q2-2 contain the
seven most common reactive groups in biology. Most
molecules in the cell are built from these functional groups.
Indicate and name the functional groups in these
molecules.

Figure Q2-2 Three molecules that illustrate the
seven most common functional groups in biology
(Problem 2-11). 1,3-Bisphosphoglycerate and
pyruvate are intermediates in glycolysis and
cysteine is an amino acid.

2-12 "Diffusion" sounds slow, and over everyday
distances it is, but on the scale of a cell it is very fast. The
average instantaneous velocity of a particle in solution (that is,
the velocity between the very frequent collisions) is

v = (kT/m)^{1/2}

where k = 1.38 × 10^{-16} g cm^2/K sec^2, T = temperature in K
(37°C is 310 K), and m = mass in g/molecule.

Calculate the instantaneous velocity of a water
molecule (molecular mass = 18 daltons), a glucose
molecule (molecular mass = 180 daltons), and a myoglobin
molecule (molecular mass = 15,000 daltons) at 37°C. Just
for fun, convert these numbers into kilometers/hour.
Before you do any calculations, try to guess whether the
molecules are moving at a slow crawl (<1 km/hr), an easy
walk (5 km/hr), or a record-setting sprint (40 km/hr).
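
For readers who want to check their guesses, the velocity formula in Problem 2-12 can be evaluated with a short Python sketch. The function names are hypothetical, and Avogadro's number (used to convert daltons to grams per molecule) is an assumed constant that the problem does not state.

```python
import math

K_CGS = 1.38e-16     # g cm^2 / (K sec^2), from the problem
AVOGADRO = 6.022e23  # molecules per mole; assumed, not given in the problem
T_KELVIN = 310.0     # 37 degrees C


def instantaneous_velocity_cm_per_s(mass_daltons: float, temperature_k: float = T_KELVIN) -> float:
    """Evaluate v = (kT/m)^(1/2) with m converted from daltons to grams per molecule."""
    mass_g = mass_daltons / AVOGADRO
    return math.sqrt(K_CGS * temperature_k / mass_g)


def cm_per_s_to_km_per_h(v_cm_per_s: float) -> float:
    """Convert cm/sec to km/hr (1 km = 1e5 cm, 1 hr = 3600 sec)."""
    return v_cm_per_s * 3600.0 / 1.0e5


molecules = {"water": 18.0, "glucose": 180.0, "myoglobin": 15000.0}
speeds_km_per_h = {
    name: cm_per_s_to_km_per_h(instantaneous_velocity_cm_per_s(mass))
    for name, mass in molecules.items()
}

for name, speed in speeds_km_per_h.items():
    print(f"{name}: {speed:.0f} km/hr")
```

Because the velocity depends on the inverse square root of the mass, a tenfold heavier molecule moves only about threefold more slowly.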

2-13 Polymerization of tubulin subunits into
microtubules occurs with an increase in the orderliness of the
subunits. Yet tubulin polymerization occurs with an increase
in entropy (decrease in order). How can that be?

2-14 A 70-kg adult human (154 lb) could meet his or
her entire energy needs for one day by eating 3 moles of
glucose (540 g). (We do not recommend this.) Each
molecule of glucose generates 30 molecules of ATP when it is
oxidized to CO2. The concentration of ATP is maintained in
cells at about 2 mM, and a 70-kg adult has about 25 liters
of intracellular fluid. Given that the ATP concentration
remains constant in cells, calculate how many times per
day, on average, each ATP molecule in the body is
hydrolyzed and resynthesized.
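
The reasoning in Problem 2-14 reduces to dividing the ATP made per day by the size of the ATP pool. A minimal Python sketch of that ratio follows; the variable names are hypothetical and all numbers are taken from the problem statement.

```python
glucose_mol_per_day = 3.0       # moles of glucose eaten per day
atp_per_glucose = 30            # ATP molecules generated per glucose
atp_concentration_molar = 2e-3  # 2 mM
intracellular_volume_l = 25.0   # liters of intracellular fluid

# Moles of ATP synthesized per day from the glucose consumed.
atp_made_mol_per_day = glucose_mol_per_day * atp_per_glucose

# Moles of ATP present in the body at any moment.
atp_pool_mol = atp_concentration_molar * intracellular_volume_l

# Average number of times each ATP molecule is hydrolyzed and resynthesized per day.
turnovers_per_day = atp_made_mol_per_day / atp_pool_mol
print(turnovers_per_day)
```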

2-15 Assuming that there are 5 × 10^13 cells in the human
body and that ATP is turning over at a rate of 10^9 ATP
molecules per minute in each cell, how many watts is the
human body consuming? (A watt is a joule per second.)
Assume that hydrolysis of ATP yields 50 kJ/mole.
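
The unit chain in Problem 2-15 (molecules per minute, to moles per second, to joules per second) can be laid out as in the Python sketch below. The names are hypothetical, and Avogadro's number is an assumed constant that the problem leaves implicit.

```python
AVOGADRO = 6.022e23          # molecules per mole; assumed, not given in the problem
cells = 5e13                 # cells in the human body
atp_per_cell_per_min = 1e9   # ATP molecules turned over per minute per cell
joules_per_mole_atp = 50e3   # 50 kJ/mole

# Convert molecules per minute (whole body) to moles per second.
atp_molecules_per_s = cells * atp_per_cell_per_min / 60.0
atp_moles_per_s = atp_molecules_per_s / AVOGADRO

# Power in watts (joules per second).
watts = atp_moles_per_s * joules_per_mole_atp
print(f"{watts:.1f} W")
```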

2-16 Does a Snickers™ candy bar (65 g, 1360 kJ)
provide enough energy to climb from Zermatt (elevation 1660
m) to the top of the Matterhorn (4478 m, Figure Q2-3),
or might you need to stop at Hörnli Hut (3260 m) to eat
another one? Imagine that you and your gear have a mass
of 75 kg, and that all of your work is done against gravity
(that is, you are just climbing straight up). Remember from
your introductory physics course that

work (J) = mass (kg) × g (m/sec^2) × height gained (m)

where g is acceleration due to gravity (9.8 m/sec^2). One
joule is 1 kg m^2/sec^2.

What assumptions made here will greatly
underestimate how much candy you need?
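
The work formula in Problem 2-16 is simple enough to evaluate directly, as in the Python sketch below. The function name `climb_work_kj` is hypothetical, and the calculation deliberately keeps the problem's idealization that every joule in the candy bar is converted to work against gravity, which is exactly the assumption the final question asks you to criticize.

```python
G = 9.8          # m/sec^2, acceleration due to gravity
MASS_KG = 75.0   # climber plus gear
BAR_KJ = 1360.0  # energy content of one candy bar


def climb_work_kj(start_m: float, end_m: float) -> float:
    """Work against gravity, in kJ, for climbing from start_m to end_m."""
    return MASS_KG * G * (end_m - start_m) / 1000.0


work_to_hut_kj = climb_work_kj(1660.0, 3260.0)     # Zermatt to Hornli Hut
work_to_summit_kj = climb_work_kj(1660.0, 4478.0)  # Zermatt to the summit
bars_needed = work_to_summit_kj / BAR_KJ

print(work_to_hut_kj, work_to_summit_kj, bars_needed)
```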



Figure Q2-3 The Matterhorn (Problem 2-16). (Courtesy of Zermatt Tourism.)


2-17 In the absence of oxygen, cells consume glucose
at a high, steady rate. When oxygen is added, glucose
consumption drops precipitously and is then maintained at
the lower rate. Why is glucose consumed at a high rate in
the absence of oxygen and at a low rate in its presence?

PANEL 2-1: Chemical Bonds and Groups Commonly Encountered in Biological Molecules




COVALENT BONDS 

A covalent bond forms when two atoms come very close
together and share one or more of their electrons. In a single
bond, one electron from each of the two atoms is shared; in
a double bond, a total of four electrons are shared.

Each atom forms a fixed number of covalent bonds in a
defined spatial arrangement. For example, carbon forms four
single bonds arranged tetrahedrally, whereas nitrogen forms
three single bonds and oxygen forms two single bonds arranged
as shown below.

Double bonds exist and have a different spatial arrangement.


Atoms joined by two 
or more covalent bonds 
cannot rotate freely 
around the bond axis. 
This restriction is a 
major influence on the 
three-dimensional shape 
of many macromolecules. 


HYDROCARBONS 


Carbon and hydrogen combine
together to make stable
compounds (or chemical groups)
called hydrocarbons. These are
nonpolar, do not form
hydrogen bonds, and are
generally insoluble in water.


ALTERNATING DOUBLE BONDS

The carbon chain can include double
bonds. If these are on alternate carbon
atoms, the bonding electrons move
within the molecule, stabilizing the
structure by a phenomenon called
resonance.

Alternating double bonds in a ring
can generate a very stable structure.

[Structure diagram: part of the hydrocarbon "tail" of a fatty acid molecule]























C–O CHEMICAL GROUPS

Many biological compounds contain a carbon
bonded to an oxygen. For example:

alcohol: The –OH is called a hydroxyl group.

aldehyde, ketone: The C=O is called a carbonyl group.

carboxylic acid: The –COOH is called a carboxyl group. In water
this loses an H+ ion to become –COO-.

esters: Esters are formed by a condensation reaction
between an acid and an alcohol.


C–N CHEMICAL GROUPS

Amines and amides are two important examples of
compounds containing a carbon linked to a nitrogen.

Amines in water combine with an H+ ion to become
positively charged.

Amides are formed by combining an acid and an
amine. Unlike amines, amides are uncharged in water.
An example is the peptide bond that joins amino acids
in a protein.

Nitrogen also occurs in several ring compounds, including
important constituents of nucleic acids: purines and pyrimidines.

[Structure diagram: cytosine (a pyrimidine)]


SULFHYDRYL GROUP 


The –C–SH is called a sulfhydryl group. In the amino acid cysteine, the sulfhydryl group
may exist in the reduced form, –C–SH,
or more rarely in an oxidized, cross-bridging form, –C–S–S–C–.


PHOSPHATES 

Inorganic phosphate is a stable ion formed from
phosphoric acid, H3PO4. It is also written as Pi.

Phosphate esters can form between a phosphate and a free hydroxyl group.
Phosphate groups are often attached to proteins in this way.

[Structure diagram: a hydroxyl group (C–OH) and a phosphate condense, releasing H2O, to form a phosphate ester.]

The combination of a phosphate and a carboxyl group, or two or more phosphate groups, gives an acid anhydride.
Because compounds of this kind are easily hydrolyzed in the cell, they are sometimes said to contain a "high-energy" bond.

[Structure diagrams: a high-energy acyl phosphate bond (carboxylic-phosphoric acid anhydride) found in some metabolites; a phosphoanhydride, a high-energy bond found in molecules such as ATP.]


PANEL 2-2: Water and Its Influence on the Behavior of Biological Molecules



WATER 

Two atoms, connected by a covalent bond, may exert different attractions for
the electrons of the bond. In such cases the bond is polar, with one end
slightly negatively charged (δ-) and the other slightly positively charged (δ+).



WATER STRUCTURE 

Molecules of water join together transiently
in a hydrogen-bonded lattice. Even at 37°C,
15% of the water molecules are joined to
four others in a short-lived assembly known
as a "flickering cluster."



Although a water molecule has an overall neutral charge (having the same
number of electrons and protons), the electrons are asymmetrically distributed,
which makes the molecule polar. The oxygen nucleus draws electrons away
from the hydrogen nuclei, leaving these nuclei with a small net positive charge.
The excess of electron density on the oxygen atom creates weakly negative
regions at the other two corners of an imaginary tetrahedron.


The cohesive nature of water is
responsible for many of its unusual
properties, such as high surface tension,
specific heat, and heat of vaporization.


HYDROGEN BONDS 

Because they are polarized, two
adjacent H₂O molecules can form
a linkage known as a hydrogen
bond. Hydrogen bonds have
only about 1/20 the strength
of a covalent bond.

Hydrogen bonds are strongest when
the three atoms lie in a straight line.

[Figure: hydrogen bond between two water molecules]


HYDROPHILIC MOLECULES 

Substances that dissolve readily in water are termed hydrophilic. They are
composed of ions or polar molecules that attract water molecules through
electrical charge effects. Water molecules surround each ion or polar molecule
on the surface of a solid substance and carry it into solution.

[Figure: water molecules surrounding Na⁺ and Cl⁻ ions, and water molecules hydrogen-bonded to urea]

Ionic substances such as sodium chloride
dissolve because water molecules are
attracted to the positive (Na⁺) or negative
(Cl⁻) charge of each ion.

Polar substances such as urea
dissolve because their molecules
form hydrogen bonds with the
surrounding water molecules.

bond lengths: hydrogen bond 0.17 nm; covalent bond 0.10 nm


HYDROPHOBIC MOLECULES 

Molecules that contain a preponderance
of nonpolar bonds are usually insoluble in
water and are termed hydrophobic. This is
true, especially, of hydrocarbons, which
contain many C-H bonds. Water molecules
are not attracted to such molecules and so
have little tendency to surround them and
carry them into solution.

WATER AS A SOLVENT 

Many substances, such as household sugar, dissolve in water. That is, their
molecules separate from each other, each becoming surrounded by water molecules.

[Figure: a sugar crystal dissolves as water molecules surround the individual sugar molecules]

When a substance dissolves in a
liquid, the mixture is termed a solution.
The dissolved substance (in this case
sugar) is the solute, and the liquid that
does the dissolving (in this case water)
is the solvent. Water is an excellent
solvent for many substances because
of its polar bonds.


ACIDS 

Substances that release hydrogen ions into solution
are called acids.

HCl → H⁺ + Cl⁻
hydrochloric acid (strong acid) → hydrogen ion + chloride ion

Many of the acids important in the cell are only partially
dissociated, and they are therefore weak acids; an example is
the carboxyl group (-COOH), which dissociates to give a
hydrogen ion in solution.

-COOH ⇌ -COO⁻ + H⁺
(weak acid)

Note that this is a reversible reaction.


pH

The acidity of a solution is defined by the concentration
of H⁺ ions it possesses. For convenience we
use the pH scale, where

pH = −log₁₀[H⁺]

For pure water

[H⁺] = 10⁻⁷ moles/liter

[Figure: scale relating H⁺ concentration (moles/liter) to pH]
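The pH definition above can be tried out numerically. The following is a minimal sketch in Python; the function names are illustrative choices of ours, not part of the panel, and concentrations are assumed to be given in moles/liter.

```python
import math


def ph_from_concentration(h_conc_molar: float) -> float:
    """Return pH = -log10([H+]) for an H+ concentration in moles/liter."""
    if h_conc_molar <= 0:
        raise ValueError("concentration must be positive")
    return -math.log10(h_conc_molar)


def concentration_from_ph(ph: float) -> float:
    """Invert the definition: [H+] = 10**(-pH), in moles/liter."""
    return 10.0 ** (-ph)


# Pure water, using the value quoted in the panel ([H+] = 1e-7 moles/liter).
print(ph_from_concentration(1e-7))
# A tenfold increase in [H+] lowers the pH by one unit.
print(ph_from_concentration(1e-6))
```

Because the scale is logarithmic, each pH unit corresponds to a tenfold change in hydrogen ion concentration.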


HYDROGEN ION EXCHANGE 

Positively charged hydrogen ions (H⁺) can spontaneously
move from one water molecule to another, thereby creating
two ionic species.

H₂O + H₂O ⇌ H₃O⁺ + OH⁻
hydronium ion (water acting as a weak base) and hydroxyl ion (water acting as a weak acid)

often written as: H₂O ⇌ H⁺ + OH⁻
hydrogen ion and hydroxyl ion

Since the process is rapidly reversible, hydrogen ions are
continually shuttling between water molecules. Pure water
contains a steady-state concentration of hydrogen ions and
hydroxyl ions (both 10⁻⁷ M).


BASES 

Substances that reduce the number of hydrogen ions in
solution are called bases. Some bases, such as ammonia,
combine directly with hydrogen ions.

NH₃ + H⁺ → NH₄⁺
ammonia + hydrogen ion → ammonium ion

Other bases, such as sodium hydroxide, reduce the number of
H⁺ ions indirectly, by making OH⁻ ions that then combine
directly with H⁺ ions to make H₂O.

NaOH → Na⁺ + OH⁻
sodium hydroxide (strong base) → sodium ion + hydroxyl ion

Many bases found in cells are partially associated with H⁺ ions
and are termed weak bases. This is true of compounds that
contain an amino group (-NH₂), which has a weak tendency to
reversibly accept an H⁺ ion from water, increasing the quantity
of free OH⁻ ions.

-NH₂ + H⁺ ⇌ -NH₃⁺
PANEL 2-3: The Principal Types of Weak Noncovalent Bonds that Hold Macromolecules Together



WEAK NONCOVALENT CHEMICAL BONDS 

Organic molecules can interact with other molecules through three
types of short-range attractive forces known as noncovalent bonds:
van der Waals attractions, electrostatic attractions, and hydrogen
bonds. The repulsion of hydrophobic groups from water is also
important for the folding of biological macromolecules.

Weak noncovalent chemical bonds have less than 1/20 the strength
of a strong covalent bond. They are strong enough to provide
tight binding only when many of them are formed simultaneously.


HYDROGEN BONDS 

As already described for water (see Panel 2-2), 
hydrogen bonds form when a hydrogen atom is 
"sandwiched" between two electron-attracting atoms 
(usually oxygen or nitrogen). 


VAN DER WAALS ATTRACTIONS 

If two atoms are too close together they repel each other 
very strongly. For this reason, an atom can often be 
treated as a sphere with a fixed radius. The characteristic 
"size" for each atom is specified by a unique van der 
Waals radius. The contact distance between any two 
noncovalently bonded atoms is the sum of their van der 
Waals radii. 


[Figure: four atoms drawn as spheres with van der Waals radii of 0.12 nm, 0.2 nm, 0.15 nm, and 0.14 nm]


At very short distances any two atoms show a weak 
bonding interaction due to their fluctuating electrical 
charges. The two atoms will be attracted to each other 
in this way until the distance between their nuclei is 
approximately equal to the sum of their van der Waals 
radii. Although they are individually very weak, van der 
Waals attractions can become important when two 
macromolecular surfaces fit very close together, 
because many atoms are involved. 

Note that when two atoms form a covalent bond, the
centers of the two atoms (the two atomic nuclei) are
much closer together than the sum of the two van der
Waals radii. Thus, two non-bonded carbon atoms are 0.4 nm apart,
whereas single-bonded carbon atoms are 0.15 nm apart and
double-bonded carbon atoms are 0.13 nm apart.

Hydrogen bonds are strongest when the three atoms are
in a straight line.

[Figure: linear hydrogen bonds donated by O-H and N-H groups]


Examples in macromolecules:

Amino acids in a polypeptide chain can be hydrogen-bonded
together. These hydrogen bonds stabilize the structure of folded proteins.

Two bases, G and C, are hydrogen-bonded in a
DNA double helix.

HYDROGEN BONDS IN WATER

Any molecules that can form hydrogen bonds to each other
can alternatively form hydrogen bonds to water molecules.
Because of this competition with water molecules, the
hydrogen bonds formed between two molecules dissolved
in water are relatively weak.

[Figure: peptide groups hydrogen-bonded to each other or to water]

HYDROPHOBIC FORCES 


Water forces hydrophobic groups together,
because doing so minimizes their disruptive
effects on the hydrogen-bonded water
network. Hydrophobic groups held
together in this way are sometimes said
to be held together by "hydrophobic
bonds," even though the apparent attraction
is actually caused by a repulsion from the
water.


ELECTROSTATIC ATTRACTIONS 

Attractive forces occur both between fully charged
groups (ionic bond) and between the partially charged
groups on polar molecules.

The force of attraction between the two charges, δ⁺
and δ⁻, falls off rapidly as the distance between the
charges increases.

In the absence of water, electrostatic forces are very strong.
They are responsible for the strength of such minerals as
marble and agate, and for crystal formation in common
table salt, NaCl.


ELECTROSTATIC ATTRACTIONS IN 
AQUEOUS SOLUTIONS 

Charged groups are shielded by their
interactions with water molecules.
Electrostatic attractions are therefore
quite weak in water.

Similarly, ions in solution can cluster around
charged groups and further weaken
these attractions.

Despite being weakened by water and salt,
electrostatic attractions are very important in
biological systems. For example, an enzyme that
binds a positively charged substrate will often
have a negatively charged amino acid side chain
at the appropriate place.

[Figure: an enzyme binding its charged substrate]

[Figure: a crystal of salt, NaCl; scale bar 1 mm]
PANEL 2-4: An Outline of Some of the Types of Sugars Commonly Found in Cells



MONOSACCHARIDES 

Monosaccharides usually have the general formula (CH₂O)ₙ, where n can be 3, 4, 5, 6, 7, or 8, and have two or more hydroxyl groups.
They either contain an aldehyde group (-CHO) and are called aldoses, or a ketone group (C=O) and are called ketoses.
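As a small illustration of the general formula, the sketch below expands (CH2O)n into a molecular formula for the values of n listed above. The function name is our own, and the code assumes only the general formula; as the word "usually" signals, some sugars do not follow it.

```python
def monosaccharide_formula(n: int) -> str:
    """Expand the general formula (CH2O)n, e.g. n = 6 gives C6H12O6."""
    if n < 3 or n > 8:
        raise ValueError("the panel lists n = 3 to 8")
    return f"C{n}H{2 * n}O{n}"


# Print the formula for every value of n listed in the panel.
for n in range(3, 9):
    print(n, monosaccharide_formula(n))
```

For n = 6 this gives the formula shared by glucose, galactose, and mannose.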



RING FORMATION 

In aqueous solution, the aldehyde or ketone group of a sugar
molecule tends to react with a hydroxyl group of the same
molecule, thereby closing the molecule into a ring.

[Figure: open-chain and ring forms of glucose and ribose, with the carbon atoms numbered]

Note that each carbon atom
has a number.


ISOMERS 

Many monosaccharides differ only in the spatial arrangement
of atoms; that is, they are isomers. For example, glucose,
galactose, and mannose have the same formula (C₆H₁₂O₆) but
differ in the arrangement of groups around one or two carbon
atoms.

These small differences make only minor changes in the
chemical properties of the sugars. But they are recognized by
enzymes and other proteins and therefore can have major
biological effects.


































α AND β LINKS

The hydroxyl group on the carbon that carries the
aldehyde or ketone can rapidly change from one
position to the other. These two positions are
called α and β.

As soon as one sugar is linked to another, the α or
β form is frozen.


SUGAR DERIVATIVES



DISACCHARIDES 

The carbon that carries the aldehyde
or the ketone can react with any
hydroxyl group on a second sugar
molecule to form a disaccharide.

The linkage is called a glycosidic
bond.

Three common disaccharides are

maltose (glucose + glucose)
lactose (galactose + glucose)
sucrose (glucose + fructose)

The reaction forming sucrose is
shown here.



OLIGOSACCHARIDES AND POLYSACCHARIDES 

Large linear and branched molecules can be made from simple repeating sugar
subunits. Short chains are called oligosaccharides, while long chains are called
polysaccharides. Glycogen, for example, is a polysaccharide made entirely of
glucose units joined together.

[Figure: glycogen, with its branch points labeled]


COMPLEX OLIGOSACCHARIDES

In many cases a sugar sequence
is nonrepetitive. Many different
molecules are possible. Such
complex oligosaccharides are
usually linked to proteins or to lipids,
as is this oligosaccharide, which is
part of a cell-surface molecule
that defines a particular blood group.
PANEL 2-5: Fatty Acids and Other Lipids



COMMON FATTY ACIDS

These are carboxylic acids 
with long hydrocarbon tails. 


[Figure: structures of palmitic acid (C₁₆), stearic acid (C₁₈), and oleic acid (C₁₈), each a long hydrocarbon chain ending in -COOH]


Hundreds of different kinds of fatty acids exist. Some have one or more double bonds in their
hydrocarbon tail and are said to be unsaturated. Fatty acids with no double bonds are saturated.

[Figure: space-filling model and carbon skeleton of an UNSATURATED fatty acid and of the SATURATED fatty acid stearic acid]

This double bond
is rigid and creates
a kink in the chain.
The rest of the chain
is free to rotate
about the other C-C
bonds.

TRIACYLGLYCEROLS 

Fatty acids are stored as an energy reserve (fats and
oils) through an ester linkage to glycerol to form
triacylglycerols, also known as triglycerides.

[Figure: glycerol, HOCH₂-CH(OH)-CH₂OH, and a triacylglycerol in which each of the three -OH groups is esterified to a fatty acid]


CARBOXYL GROUP

If free, the carboxyl group of a
fatty acid will be ionized.

But more usually it is linked to
other groups to form either esters
or amides.
PHOSPHOLIPIDS

Phospholipids are the major constituents
of cell membranes.

[Figure: general structure of a phospholipid and space-filling model of the phospholipid phosphatidylcholine, showing the hydrophilic head (with choline) and the hydrophobic fatty acid tails]

In phospholipids, two of the -OH groups in glycerol are
linked to fatty acids, while the third -OH group is linked
to phosphoric acid. The phosphate is further linked to
one of a variety of small polar groups, such as choline.



















LIPID AGGREGATES 

Fatty acids have a hydrophilic head
and a hydrophobic tail.

In water they can form a surface film
or form small micelles.

[Figure: a surface film and a micelle]

Their derivatives can form larger aggregates held together by hydrophobic forces:

Triacylglycerols (triglycerides) can form large spherical fat droplets in the cell
cytoplasm (figure label: 200 nm or more).

Phospholipids and glycolipids form self-sealing lipid bilayers that are the basis
for all cell membranes (figure label: 4 nm).


OTHER LIPIDS

Lipids are defined as the water-insoluble
molecules in cells that are soluble in organic
solvents. Two other common types of lipids
are steroids and polyisoprenoids. Both are
made from isoprene units.

[Figure: isoprene, CH₂=C(CH₃)-CH=CH₂]


STEROIDS

Steroids have a common multiple-ring structure.


GLYCOLIPIDS

Like phospholipids, these compounds are composed of a hydrophobic
region, containing two long hydrocarbon tails, and a polar region,
which contains one or more sugars and, unlike phospholipids,
no phosphate.

[Figure: a simple glycolipid, with galactose as the sugar]


POLYISOPRENOIDS

long-chain polymers of isoprene

dolichol phosphate: used
to carry activated sugars
in the membrane-associated
synthesis of glycoproteins
and some polysaccharides
PANEL 2-6: A Survey of the Nucleotides



BASES 




The bases are nitrogen-containing ring
compounds, either pyrimidines or purines.

PYRIMIDINE: cytosine (C), uracil (U), thymine (T)

PURINE: adenine (A), guanine (G)

PHOSPHATES 

The phosphates are normally joined to
the C5 hydroxyl of the ribose or
deoxyribose sugar (designated 5′). Mono-,
di-, and triphosphates are common.

[Figure: one phosphate joined to the 5′ CH₂ group as in AMP, two as in ADP, and three as in ATP]

The phosphate makes a nucleotide
negatively charged.

NUCLEOTIDES 

A nucleotide consists of a nitrogen-containing
base, a five-carbon sugar, and one or more
phosphate groups.

[Figure: a nucleotide, with its BASE, SUGAR, and phosphate labeled]

Nucleotides
are the
subunits of
the nucleic acids.


BASE-SUGAR LINKAGE

The base is linked to
the same carbon (C1)
used in sugar-sugar
bonds, through an N-glycosidic bond.


SUGARS 


PENTOSE: a five-carbon sugar; two kinds are used

β-D-ribose, used in ribonucleic acid
β-D-2-deoxyribose, used in deoxyribonucleic acid

Each numbered carbon on the sugar of a nucleotide is
followed by a prime mark; therefore, one speaks of the
"5-prime carbon," etc.












































NOMENCLATURE 


BASE        NUCLEOSIDE    ABBR.
adenine     adenosine     A
guanine     guanosine     G
cytosine    cytidine      C
uracil      uridine       U
thymine     thymidine     T


A nucleoside or nucleotide is named according to its nitrogenous base.

Single-letter abbreviations are used variously as
shorthand for (1) the base alone, (2) the
nucleoside, or (3) the whole nucleotide;
the context will usually make clear which of
the three entities is meant. When the context
is not sufficient, we will add the terms "base",
"nucleoside", "nucleotide", or, as in the
examples below, use the full 3-letter nucleotide
code.

AMP = adenosine monophosphate
dAMP = deoxyadenosine monophosphate
UDP = uridine diphosphate
ATP = adenosine triphosphate

BASE + SUGAR = NUCLEOSIDE
BASE + SUGAR + PHOSPHATE = NUCLEOTIDE


NUCLEIC ACIDS

Nucleotides are joined together by a
phosphodiester linkage between 5' and
3' carbon atoms to form nucleic acids.
The linear sequence of nucleotides in a
nucleic acid chain is commonly
abbreviated by a one-letter code, such as
A-G-C-T-T-A-C-A, with the 5'
end of the chain at the left.

[Figure: sugar-phosphate backbone of a nucleic acid chain, with the 3' end of the chain labeled]
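The naming scheme in the nomenclature table and the 3-letter codes can be expressed as a small lookup. The sketch below is only an illustration: the function and dictionary names are hypothetical, and it covers just the code pattern shown in the examples (an optional "d" prefix, a base letter, then MP, DP, or TP).

```python
# Nucleoside names keyed by the single-letter abbreviation from the table.
NUCLEOSIDES = {
    "A": "adenosine",
    "G": "guanosine",
    "C": "cytidine",
    "U": "uridine",
    "T": "thymidine",
}

# Second letter of the code: number of phosphate groups.
PHOSPHATES = {"M": "monophosphate", "D": "diphosphate", "T": "triphosphate"}


def expand_nucleotide_code(code: str) -> str:
    """Expand a code such as 'AMP' or 'dAMP' into its full name."""
    deoxy = code.startswith("d")
    core = code[1:] if deoxy else code
    if (
        len(core) != 3
        or core[0] not in NUCLEOSIDES
        or core[1] not in PHOSPHATES
        or core[2] != "P"
    ):
        raise ValueError(f"unrecognized nucleotide code: {code!r}")
    name = NUCLEOSIDES[core[0]]
    if deoxy:
        name = "deoxy" + name
    return f"{name} {PHOSPHATES[core[1]]}"


for code in ("AMP", "dAMP", "UDP", "ATP"):
    print(code, "=", expand_nucleotide_code(code))
```

The four codes in the loop reproduce the four examples listed in the panel.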


NUCLEOTIDES HAVE MANY OTHER FUNCTIONS 

1 They carry chemical energy in their easily hydrolyzed phosphoanhydride bonds.

2 They combine with other groups to form coenzymes.
example: coenzyme A (CoA)

3 They are used as specific signaling molecules in the cell.
example: cyclic AMP (cAMP)
PANEL 2-7: Free Energy and Biological Reactions



THE IMPORTANCE OF FREE ENERGY FOR CELLS 

Life is possible because of the complex network of interacting
chemical reactions occurring in every cell. In viewing the
metabolic pathways that comprise this network, one might
suspect that the cell has had the ability to evolve an enzyme to
carry out any reaction that it needs. But this is not so. Although
enzymes are powerful catalysts, they can speed up only those
reactions that are thermodynamically possible; other reactions
proceed in cells only because they are coupled to very favorable
reactions that drive them. The question of whether a reaction
can occur spontaneously, or instead needs to be coupled to
another reaction, is central to cell biology. The answer is
obtained by reference to a quantity called the free energy: the
total change in free energy during a set of reactions determines
whether or not the entire reaction sequence can occur. In this
panel, we shall explain some of the fundamental ideas, derived
from a special branch of chemistry and physics called
thermodynamics, that are required for understanding what free
energy is and why it is so important to cells.


ENERGY RELEASED BY CHANGES IN CHEMICAL BONDING IS CONVERTED INTO HEAT 



[Figure: the "cell in a box", an enclosed system surrounded by the sea]

An enclosed system is defined as a collection of molecules that
does not exchange matter with the rest of the universe (for
example, the "cell in a box" shown above). Any such system will
contain molecules with a total energy E. This energy will be
distributed in a variety of ways: some as the translational energy
of the molecules, some as their vibrational and rotational energies,
but most as the bonding energies between the individual atoms
that make up the molecules. Suppose that a reaction occurs in
the system. The first law of thermodynamics places a constraint
on what types of reactions are possible: it states that "in any
process, the total energy of the universe remains constant."

For example, suppose that reaction A → B occurs somewhere in
the box and releases a great deal of chemical-bond energy. This
energy will initially increase the intensity of molecular motions
(translational, vibrational, and rotational) in the system, which
is equivalent to raising its temperature. However, these increased
motions will soon be transferred out of the system by a series
of molecular collisions that heat up first the walls of the box
and then the outside world (represented by the sea in
our example). In the end, the system returns to its initial
temperature, by which time all the chemical-bond energy
released in the box has been converted into heat energy and
transferred out of the box to the surroundings. According to
the first law, the change in the energy in the box (ΔE_box, which
we shall denote as ΔE) must be equal and opposite to the
amount of heat energy transferred, which we shall designate
as h: that is, ΔE = −h. Thus, the energy in the box (E) decreases
when heat leaves the system.

E also can change during a reaction as a result of work being
done on the outside world. For example, suppose that there is
a small increase in the volume (ΔV) of the box during a reaction.
Since the walls of the box must push against the constant
pressure (P) in the surroundings in order to expand, this does
work on the outside world and requires energy. The energy
used is P(ΔV), which according to the first law must decrease
the energy in the box (E) by the same amount. In most reactions,
chemical-bond energy is converted into both work and heat.
Enthalpy (H) is a composite function that includes both of these
(H = E + PV). To be rigorous, it is the change in enthalpy
(ΔH) in an enclosed system, and not the change in energy, that
is equal to the heat transferred to the outside world during a
reaction. Reactions in which H decreases release heat to the
surroundings and are said to be "exothermic," while reactions
in which H increases absorb heat from the surroundings and
are said to be "endothermic." Thus, −h = ΔH. However, the
volume change is negligible in most biological reactions, so to
a good approximation

−h = ΔH ≈ ΔE
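To see why the volume term can usually be neglected, the relation H = E + PV can be evaluated at constant pressure, where the change in enthalpy is the change in energy plus P times the change in volume. The sketch below uses made-up illustrative numbers (an energy change of -50 kJ and an expansion of 1 milliliter at atmospheric pressure); they are assumptions for the sake of the arithmetic, not values from the panel.

```python
def enthalpy_change(delta_e_joules: float, pressure_pascals: float,
                    delta_v_m3: float) -> float:
    """Change in H = E + PV at constant pressure: dH = dE + P * dV."""
    return delta_e_joules + pressure_pascals * delta_v_m3


def heat_released(delta_h_joules: float) -> float:
    """Heat h transferred to the surroundings, from -h = dH."""
    return -delta_h_joules


# Hypothetical reaction in the box (all three numbers are assumptions).
delta_e = -50_000.0   # J, chemical-bond energy released
pressure = 101_325.0  # Pa, one atmosphere
delta_v = 1.0e-6      # m^3, an expansion of 1 milliliter

delta_h = enthalpy_change(delta_e, pressure, delta_v)
print("P * dV =", pressure * delta_v, "J")
print("dH =", delta_h, "J; heat released h =", heat_released(delta_h), "J")
```

With these numbers the work term is about 0.1 J against 50,000 J of bond energy, which is the sense in which the change in enthalpy is approximately the change in energy.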


THE SECOND LAW OF THERMODYNAMICS 

Consider a container in which 1000 coins are all lying heads up.

If the container is shaken vigorously, subjecting the coins to
the types of random motions that all molecules experience due
to their frequent collisions with other molecules, one will end
up with about half the coins oriented heads down. The
reason for this reorientation is that there is only a single way in
which the original orderly state of the coins can be reinstated
(every coin must lie heads up), whereas there are many different
ways (about 10²⁹⁸) to achieve a disorderly state in which there is
an equal mixture of heads and tails; in fact, there are more ways
to achieve a 50-50 state than to achieve any other state. Each
state has a probability of occurrence that is proportional to the
number of ways it can be realized. The second law of
thermodynamics states that "systems will change spontaneously from
states of lower probability to states of higher probability."
Since states of lower probability are more "ordered" than
states of high probability, the second law can be restated:

"the universe constantly changes so as to become more
disordered."













THE ENTROPY, S

The second law (but not the first law) allows one to predict the
direction of a particular reaction. But to make it useful for this
purpose, one needs a convenient measure of the probability or,
equivalently, the degree of disorder of a state. The entropy (S)
is such a measure. It is a logarithmic function of the probability
such that the change in entropy (ΔS) that occurs when the
reaction A → B converts one mole of A into one mole of B is

ΔS = R ln(p_B/p_A)

where p_A and p_B are the probabilities of the two states A and B,
R is the gas constant (8.31 J K⁻¹ mole⁻¹), and ΔS is measured
in entropy units (eu). In our initial example of 1000 coins, the
relative probability of all heads (state A) versus half heads and
half tails (state B) is equal to the ratio of the number of different
ways that the two results can be obtained. One can calculate
that p_A = 1 and p_B = 1000!/(500! × 500!) = 10²⁹⁹. Therefore,
the entropy change for the reorientation of the coins when their
container is vigorously shaken and an equal mixture of heads
and tails is obtained is R ln(10²⁹⁸), or about 1370 eu per mole of
such containers (6 × 10²³ containers). We see that, because ΔS
defined above is positive for the transition from state A to
state B (p_B/p_A > 1), reactions with a large increase in S (that is,
for which ΔS > 0) are favored and will occur spontaneously.
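The coin example can be checked by direct counting. The sketch below computes the number of half-heads arrangements exactly and applies the entropy formula with the two relative weights 1 and 1000!/(500! × 500!). One assumption is ours rather than the panel's: we take an entropy unit (eu) to be 1 cal K⁻¹ mole⁻¹, so the gas constant is also entered as 1.987 cal K⁻¹ mole⁻¹ alongside the 8.31 J K⁻¹ mole⁻¹ quoted in the text.

```python
import math

R_JOULE = 8.31  # gas constant in J K^-1 mole^-1, as quoted in the panel
R_CAL = 1.987   # same constant in cal K^-1 mole^-1 (assumed unit for "eu")

ways_all_heads = 1                       # state A: every coin heads up
ways_half_heads = math.comb(1000, 500)   # state B: 1000!/(500! x 500!)

# Order of magnitude of the number of ways to reach the 50-50 state.
log10_ways = math.log10(ways_half_heads)

# Entropy change per mole of containers: R * ln(p_B / p_A).
ln_ratio = math.log(ways_half_heads) - math.log(ways_all_heads)
delta_s_cal = R_CAL * ln_ratio
delta_s_joule = R_JOULE * ln_ratio

print(f"log10(ways) = {log10_ways:.2f}")
print(f"entropy change = {delta_s_cal:.0f} cal/K/mole "
      f"= {delta_s_joule:.0f} J/K/mole")
```

Under that unit assumption the result lands close to the figure of about 1370 eu quoted above; expressed in joules the same quantity is roughly four times larger.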

As discussed in Chapter 2, heat energy causes the random
commotion of molecules. Because the transfer of heat from an
enclosed system to its surroundings increases the number of
different arrangements that the molecules in the outside world
can have, it increases their entropy. It can be shown that the
release of a fixed quantity of heat energy has a greater
disordering effect at low temperature than at high temperature, and
that the value of ΔS for the surroundings, as defined above
(ΔS_sea), is precisely equal to h, the amount of heat transferred to
the surroundings from the system, divided by the absolute
temperature (T):

ΔS_sea = h/T
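The statement that the entropy change of the surroundings equals the heat transferred divided by the absolute temperature can be illustrated with a short calculation. In this sketch the function name and the numbers (1000 J released at 277 K and at 310 K) are illustrative assumptions.

```python
def entropy_change_of_sea(heat_joules: float, temperature_kelvin: float) -> float:
    """Entropy change of the surroundings: heat transferred divided by T."""
    if temperature_kelvin <= 0:
        raise ValueError("absolute temperature must be positive")
    return heat_joules / temperature_kelvin


h = 1000.0  # J of heat released to the surroundings (hypothetical)
cold = entropy_change_of_sea(h, 277.0)  # about 4 degrees C
warm = entropy_change_of_sea(h, 310.0)  # about 37 degrees C

print(f"at 277 K: {cold:.3f} J/K; at 310 K: {warm:.3f} J/K")
```

The same quantity of heat produces the larger entropy increase at the lower temperature, as the text describes.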


THE GIBBS FREE ENERGY, G 

When dealing with an enclosed biological system, one would
like to have a simple way of predicting whether a given reaction
will or will not occur spontaneously in the system. We have
seen that the crucial question is whether the entropy change for
the universe is positive or negative when that reaction occurs.

In our idealized system, the cell in a box, there are two separate
components to the entropy change of the universe, the entropy
change for the system enclosed in the box and the entropy
change for the surrounding "sea", and both must be added
together before any prediction can be made. For example, it is
possible for a reaction to absorb heat and thereby decrease the
entropy of the sea (ΔS_sea < 0) and at the same time to cause
such a large degree of disordering inside the box (ΔS_box > 0)
that the total ΔS_universe = ΔS_sea + ΔS_box is greater than 0. In this
case, the reaction will occur spontaneously, even though the
sea gives up heat to the box during the reaction. An example of
such a reaction is the dissolving of sodium chloride in a beaker
containing water (the "box"), which is a spontaneous process
even though the temperature of the water drops as the salt
goes into solution.

Chemists have found it useful to define a number of new
"composite functions" that describe combinations of physical
properties of a system. The properties that can be combined
include the temperature (T), pressure (P), volume (V), energy
(E), and entropy (S). The enthalpy (H) is one such composite
function. But by far the most useful composite function for
biologists is the Gibbs free energy, G. It serves as an accounting
device that allows one to deduce the entropy change of the
universe resulting from a chemical reaction in the box, while
avoiding any separate consideration of the entropy change in
the sea. The definition of G is

G = H − TS

where, for a box of volume V, H is the enthalpy described above
(E + PV), T is the absolute temperature, and S is the entropy.

Each of these quantities applies to the inside of the box only. The
change in free energy during a reaction in the box (the G of the
products minus the G of the starting materials) is denoted as ΔG
and, as we shall now demonstrate, it is a direct measure of the
amount of disorder that is created in the universe when the
reaction occurs.


At constant temperature the change in free energy (ΔG)
during a reaction equals ΔH − TΔS. Remembering that
ΔH = −h, the heat absorbed from the sea, we have

−ΔG = −ΔH + TΔS

−ΔG = h + TΔS, so −ΔG/T = h/T + ΔS

But h/T is equal to the entropy change of the sea (ΔS_sea), and
the ΔS in the above equation is ΔS_box. Therefore

−ΔG/T = ΔS_sea + ΔS_box = ΔS_universe

We conclude that the free-energy change is a direct measure
of the entropy change of the universe. A reaction will proceed
in the direction that causes the change in the free energy (ΔG)
to be less than zero, because in this case there will be a positive
entropy change in the universe when the reaction occurs.
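The bookkeeping in this derivation can be verified numerically: the free-energy change computed from the enthalpy and entropy changes in the box should equal minus T times the total entropy change of the universe. The sketch below uses rough illustrative values for a heat-absorbing but spontaneous process, in the spirit of the salt-dissolving example; the specific numbers (+3900 J, +43 J/K, 298 K) and the function names are assumptions, not data from the panel.

```python
def gibbs_free_energy_change(delta_h: float, temperature: float,
                             delta_s_box: float) -> float:
    """Free-energy change of the box: dG = dH - T * dS_box."""
    return delta_h - temperature * delta_s_box


def entropy_change_of_universe(delta_h: float, temperature: float,
                               delta_s_box: float) -> float:
    """dS_universe = dS_sea + dS_box, with dS_sea = h / T and h = -dH."""
    heat_to_sea = -delta_h
    delta_s_sea = heat_to_sea / temperature
    return delta_s_sea + delta_s_box


# Hypothetical endothermic reaction with a large gain in disorder in the box.
delta_h = 3900.0    # J per mole; positive, so heat is absorbed from the sea
delta_s_box = 43.0  # J per K per mole
temperature = 298.0  # K

delta_g = gibbs_free_energy_change(delta_h, temperature, delta_s_box)
delta_s_universe = entropy_change_of_universe(delta_h, temperature, delta_s_box)

print("dG =", delta_g, "J per mole")
print("dS_universe =", delta_s_universe, "J per K per mole")
print("-dG / T =", -delta_g / temperature, "J per K per mole")
```

Here the sea loses entropy, yet the free-energy change is negative and the entropy of the universe rises, so the reaction is predicted to proceed.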

For a complex set of coupled reactions involving many
different molecules, the total free-energy change can be
computed simply by adding up the free energies of all the different
molecular species after the reaction and comparing this value
with the sum of free energies before the reaction; for common
substances the required free-energy values can be found from
published tables. In this way, one can predict the direction of
a reaction and thereby readily check the feasibility of any
proposed mechanism. Thus, for example, from the observed
values for the magnitude of the electrochemical proton gradient
across the inner mitochondrial membrane and the ΔG for ATP
hydrolysis inside the mitochondrion, one can be certain that ATP
synthase requires the passage of more than one proton for each
molecule of ATP that it synthesizes.

The value of ΔG for a reaction is a direct measure of how far
the reaction is from equilibrium. The large negative value for
ATP hydrolysis in a cell merely reflects the fact that cells keep
the ATP hydrolysis reaction as much as 10 orders of magnitude
away from equilibrium. If a reaction reaches equilibrium,
ΔG = 0, the reaction then proceeds at precisely equal rates
in the forward and backward direction. For ATP hydrolysis,
equilibrium is reached when the vast majority of the ATP
has been hydrolyzed, as occurs in a dead cell.
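The link between the free-energy change and the distance from equilibrium can be made quantitative with a standard thermodynamic relation that this panel does not state explicitly: the free-energy change equals RT times the natural logarithm of Q/K, where Q is the actual ratio of product to reactant concentrations and K is that ratio at equilibrium. The sketch below assumes this relation, a temperature of 310 K, and the "10 orders of magnitude" figure from the text; the function name is our own.

```python
import math

R = 8.31  # gas constant, J K^-1 mole^-1 (value quoted earlier in the panel)


def free_energy_from_equilibrium_distance(q_over_k: float,
                                          temperature_kelvin: float) -> float:
    """Free-energy change in J per mole: R * T * ln(Q / K) (assumed relation)."""
    if q_over_k <= 0:
        raise ValueError("Q/K must be positive")
    return R * temperature_kelvin * math.log(q_over_k)


# A reaction held 10 orders of magnitude away from equilibrium at 37 degrees C.
delta_g_cell = free_energy_from_equilibrium_distance(1e-10, 310.0)
# The same reaction at equilibrium, where Q equals K.
delta_g_equilibrium = free_energy_from_equilibrium_distance(1.0, 310.0)

print(f"10 orders from equilibrium: {delta_g_cell / 1000:.1f} kJ per mole")
print(f"at equilibrium: {delta_g_equilibrium} J per mole")
```

Each factor of ten away from equilibrium contributes roughly 6 kJ per mole at this temperature, and the value falls to zero when Q equals K.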
PANEL 2-8: Details of the 10 Steps of Glycolysis



For each step, the part of the molecule that undergoes a change is shadowed in blue,
and the name of the enzyme that catalyzes the reaction is in a yellow box.


Step 1 Glucose is
phosphorylated by ATP to
form a sugar phosphate.
The negative charge of the
phosphate prevents passage
of the sugar
phosphate through the
plasma membrane,
trapping glucose inside
the cell.

glucose + ATP → glucose 6-phosphate + ADP + H⁺ (enzyme: hexokinase)


Step 2 A readily
reversible
rearrangement of
the chemical
structure
(isomerization)
moves the
carbonyl oxygen
from carbon 1 to
carbon 2, forming a
ketose from an
aldose sugar.
(See Panel 2-3,
pp. 70-71.)

glucose 6-phosphate (ring form) ⇌ glucose 6-phosphate (open-chain form) ⇌ fructose 6-phosphate (open-chain form) ⇌ fructose 6-phosphate (ring form) (enzyme: phosphoglucose isomerase)


Step 3 The new hydroxyl group on carbon 1 is phosphorylated by ATP, in 
preparation for the formation of two three-carbon sugar phosphates. The 
entry of sugars into glycolysis is controlled at this step, through 
regulation of the enzyme phosphofructokinase. 

[Reaction: fructose 6-phosphate + ATP → fructose 1,6-bisphosphate + ADP + H⁺; 
enzyme: phosphofructokinase] 


Step 4 The six-carbon sugar is cleaved to produce two three-carbon 
molecules. Only the glyceraldehyde 3-phosphate can proceed immediately 
through glycolysis. 

[Reaction: fructose 1,6-bisphosphate (ring form ⇌ open-chain form) → 
dihydroxyacetone phosphate + glyceraldehyde 3-phosphate; enzyme: aldolase] 


Step 5 The other product of step 4, dihydroxyacetone phosphate, is 
isomerized to form glyceraldehyde 3-phosphate. 

[Reaction: dihydroxyacetone phosphate ⇌ glyceraldehyde 3-phosphate; 
enzyme: triose phosphate isomerase] 

Step 6 The two molecules of glyceraldehyde 3-phosphate are oxidized. The 
energy-generation phase of glycolysis begins, as NADH and a new high-energy 
anhydride linkage to phosphate are formed (see Figure 13-5). 

[Reaction: glyceraldehyde 3-phosphate + NAD⁺ + Pi → 1,3-bisphosphoglycerate + NADH + H⁺; 
enzyme: glyceraldehyde 3-phosphate dehydrogenase] 


Step 7 The transfer to ADP of the high-energy phosphate group that was 
generated in step 6 forms ATP. 

[Reaction: 1,3-bisphosphoglycerate + ADP → 3-phosphoglycerate + ATP; 
enzyme: phosphoglycerate kinase] 


Step 8 The remaining phosphate ester linkage in 3-phosphoglycerate, 
which has a relatively low free energy of hydrolysis, is moved from 
carbon 3 to carbon 2 to form 2-phosphoglycerate. 

[Reaction: 3-phosphoglycerate ⇌ 2-phosphoglycerate; enzyme: phosphoglycerate mutase] 


Step 9 The removal of water from 2-phosphoglycerate creates a 
high-energy enol phosphate linkage. 

[Reaction: 2-phosphoglycerate → phosphoenolpyruvate + H₂O; enzyme: enolase] 


Step 10 The transfer to ADP of the high-energy phosphate group that was 
generated in step 9 forms ATP, completing glycolysis. 

[Reaction: phosphoenolpyruvate + ADP + H⁺ → pyruvate + ATP; enzyme: pyruvate kinase] 


NET RESULT OF GLYCOLYSIS 

glucose → two molecules of pyruvate 

In addition to the pyruvate, the net products are 
two molecules of ATP and two molecules of NADH. 
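
The net result can be checked by tallying the ATP and NADH changes given in the ten steps above. The following is a minimal bookkeeping sketch; the table layout and the function name `net_yield` are illustrative choices. It relies on the point made in steps 4 to 6 that each glucose gives two molecules of glyceraldehyde 3-phosphate, so steps 6 through 10 run twice per glucose.

```python
# Each entry: (step number, ATP change, NADH change, times the step runs per glucose)
STEPS = [
    (1, -1, 0, 1),   # hexokinase: one ATP consumed
    (3, -1, 0, 1),   # phosphofructokinase: one ATP consumed
    (6, 0, +1, 2),   # glyceraldehyde 3-phosphate dehydrogenase: NADH formed
    (7, +1, 0, 2),   # phosphoglycerate kinase: ATP formed
    (10, +1, 0, 2),  # pyruvate kinase: ATP formed
]

def net_yield(steps):
    """Return (net ATP, net NADH) per molecule of glucose."""
    atp = sum(d_atp * runs for _, d_atp, _, runs in steps)
    nadh = sum(d_nadh * runs for _, _, d_nadh, runs in steps)
    return atp, nadh

print(net_yield(STEPS))  # (2, 2)
```

Two ATP are invested in steps 1 and 3 and four are recovered in steps 7 and 10, which leaves the net gain of two ATP and two NADH stated in the panel.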
PANEL 2-9: The Complete Citric Acid Cycle 

[Diagram: pyruvate is converted to acetyl CoA (2C), which enters the cycle. 
Step 1: oxaloacetate (4C) → citrate (6C); Step 2: → isocitrate (6C); 
Step 3: → α-ketoglutarate (5C); Step 4: → succinyl CoA (4C); 
Step 5: → succinate (4C); Step 6: → fumarate (4C); Step 7: → malate (4C); 
Step 8: → oxaloacetate (4C), which enters the next cycle.] 

Overview of the complete citric acid cycle. 
The two carbons from acetyl CoA that 
enter this turn of the cycle (shadowed in 
red) will be converted to CO₂ in 
subsequent turns of the cycle: it is the two 
carbons shadowed in blue that are 
converted to CO₂ in this cycle. 


Details of these eight steps are shown below. In this part of the panel, for each step, the part of the molecule that undergoes 
a change is shadowed in blue, and the name of the enzyme that catalyzes the reaction is in a yellow box. 


Step 1 After the enzyme removes a proton from the CH₃ group on 
acetyl CoA, the negatively charged CH₂⁻ forms a bond to a carbonyl carbon 
of oxaloacetate. The subsequent loss by hydrolysis of the coenzyme 
A (HS-CoA) drives the reaction strongly forward. 


Step 2 An isomerization reaction, in which water is first removed and then 
added back, moves the hydroxyl group from one carbon atom to its neighbor. 

[Reaction: citrate ⇌ cis-aconitate intermediate ⇌ isocitrate; enzyme: aconitase] 

Step 3 In the first of four oxidation steps in the cycle, the carbon carrying 
the hydroxyl group is converted to a carbonyl group. The immediate 
product is unstable, losing CO₂ while still bound to the enzyme. 

[Reaction: isocitrate + NAD⁺ → oxalosuccinate intermediate + NADH + H⁺ → 
α-ketoglutarate + CO₂; enzyme: isocitrate dehydrogenase] 


Step 4 The α-ketoglutarate dehydrogenase complex closely 
resembles the large enzyme complex that converts pyruvate 
to acetyl CoA, the pyruvate dehydrogenase complex in 
Figure 13-10. It likewise catalyzes an oxidation that 
produces NADH, CO₂, and a high-energy thioester bond to 
coenzyme A (CoA). 

[Reaction: α-ketoglutarate + NAD⁺ + HS-CoA → succinyl CoA + NADH + CO₂; 
enzyme: α-ketoglutarate dehydrogenase complex] 


Step 5 A phosphate molecule from solution displaces the CoA, forming a 
high-energy phosphate linkage to succinate. This phosphate is then passed to 
GDP to form GTP. (In bacteria and plants, ATP is formed instead.) 


Step 6 In the third oxidation step in the cycle, FAD accepts two 
hydrogen atoms from succinate. 

[Reaction: succinate + FAD → fumarate + FADH₂; enzyme: succinate dehydrogenase] 


Step 7 The addition of water to fumarate places a hydroxyl group next to a 
carbonyl carbon. 

[Reaction: fumarate + H₂O → malate; enzyme: fumarase] 


Step 8 In the last of four oxidation steps in the cycle, the 
carbon carrying the hydroxyl group is converted 
to a carbonyl group, regenerating the oxaloacetate 
needed for step 1. 

[Reaction: malate + NAD⁺ → oxaloacetate + NADH + H⁺; enzyme: malate dehydrogenase] 
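
The eight steps above can be summarized by tallying what one turn of the cycle produces and by following the carbon count. This is a small bookkeeping sketch, not part of the panel: the dictionary layout and function names are illustrative, and FADH₂ is used here as the name for FAD after it has accepted two hydrogen atoms in step 6.

```python
from collections import Counter

# Products released at each step of one turn, as described in the panel.
STEP_PRODUCTS = {
    3: {"NADH": 1, "CO2": 1},  # isocitrate dehydrogenase
    4: {"NADH": 1, "CO2": 1},  # alpha-ketoglutarate dehydrogenase complex
    5: {"GTP": 1},             # ATP instead in bacteria and plants
    6: {"FADH2": 1},           # succinate dehydrogenase
    8: {"NADH": 1},            # malate dehydrogenase
}

def products_per_turn(step_products):
    """Sum the products of every step in one turn of the cycle."""
    total = Counter()
    for products in step_products.values():
        total.update(products)
    return dict(total)

def carbons_after_turn(acetyl=2, oxaloacetate=4, co2_lost=2):
    """Carbon atoms left in the cycle intermediate after one full turn."""
    return acetyl + oxaloacetate - co2_lost

print(products_per_turn(STEP_PRODUCTS))
print(carbons_after_turn())
```

The tally gives three NADH, one FADH₂, one GTP, and two CO₂ per turn. The carbon count returns to four, consistent with the regeneration of oxaloacetate (4C) in step 8.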
CHAPTER 3 

Proteins 

When we look at a cell through a microscope or analyze its electrical or 
biochemical activity, we are, in essence, observing proteins. Proteins constitute most 
of a cell's dry mass. They are not only the cell's building blocks; they also execute 
the majority of the cell's functions. Thus, proteins that are enzymes provide 
the intricate molecular surfaces inside a cell that catalyze its many chemical 
reactions. Proteins embedded in the plasma membrane form channels and 
pumps that control the passage of small molecules into and out of the cell. Other 
proteins carry messages from one cell to another, or act as signal integrators that 
relay sets of signals inward from the plasma membrane to the cell nucleus. Yet 
others serve as tiny molecular machines with moving parts: kinesin, for example, 
propels organelles through the cytoplasm; topoisomerase can untangle knotted 
DNA molecules. Other specialized proteins act as antibodies, toxins, hormones, 
antifreeze molecules, elastic fibers, ropes, or sources of luminescence. Before 
we can hope to understand how genes work, how muscles contract, how nerves 
conduct electricity, how embryos develop, or how our bodies function, we must 
attain a deep understanding of proteins. 


IN THIS CHAPTER 

THE SHAPE AND STRUCTURE 
OF PROTEINS 

PROTEIN FUNCTION 


THE SHAPE AND STRUCTURE OF PROTEINS 

From a chemical point of view, proteins are by far the most structurally complex 
and functionally sophisticated molecules known. This is perhaps not surprising, 
once we realize that the structure and chemistry of each protein has been 
developed and fine-tuned over billions of years of evolutionary history. The 
theoretical calculations of population geneticists reveal that, over evolutionary time 
periods, a surprisingly small selective advantage is enough to cause a randomly 
altered protein sequence to spread through a population of organisms. Yet, even 
to experts, the remarkable versatility of proteins can seem truly amazing. 

In this section, we consider how the location of each amino acid in the long 
string of amino acids that forms a protein determines its three-dimensional shape. 
Later in the chapter, we use this understanding of protein structure at the atomic 
level to describe how the precise shape of each protein molecule determines its 
function in a cell. 

The Shape of a Protein Is Specified by Its Amino Acid Sequence 

There are 20 different types of amino acids in proteins that are coded for directly in an 
organism's DNA, each with different chemical properties. A protein molecule 
is made from a long unbranched chain of these amino acids, each linked to its 
neighbor through a covalent peptide bond. Proteins are therefore also known as 
polypeptides. Each type of protein has a unique sequence of amino acids, and 
there are many thousands of different proteins in a cell. 

The repeating sequence of atoms along the core of the polypeptide chain is 
referred to as the polypeptide backbone. Attached to this repetitive chain are 
those portions of the amino acids that are not involved in making a peptide bond 
and that give each amino acid its unique properties: the 20 different amino acid 
side chains (Figure 3-1). Some of these side chains are nonpolar and hydrophobic 

Figure 3-1 The components of a protein. 
A protein consists of a polypeptide 
backbone with attached side chains. Each 
type of protein differs in its sequence and 
number of amino acids; therefore, it is the 
sequence of the chemically different side 
chains that makes each protein distinct. 
The two ends of a polypeptide chain are 
chemically different: the end carrying the 
free amino group (NH₃⁺, also written NH₂) 
is the amino terminus, or N-terminus, 
and that carrying the free carboxyl group 
(COO⁻, also written COOH) is the carboxyl 
terminus or C-terminus. The amino acid 
sequence of a protein is always presented 
in the N-to-C direction, reading from left 
to right. 
[Figure labels: methionine (Met), aspartic acid (Asp), leucine (Leu), 
tyrosine (Tyr); carboxyl terminus (C-terminus)] 


(“water-fearing”), others are negatively or positively charged, some readily form 
covalent bonds, and so on. Panel 3-1 (pp. 112-113) shows their atomic structures 
and Figure 3-2 lists their abbreviations. 

As discussed in Chapter 2, atoms behave almost as if they were hard spheres 
with a definite radius (their van der Waals radius). The requirement that no two 
atoms overlap plus other constraints limit the possible bond angles in a 
polypeptide chain (Figure 3-3), severely restricting the possible three-dimensional 
arrangements (or conformations) of atoms. Nevertheless, a long flexible chain 
such as a protein can still fold in an enormous number of ways. 

The folding of a protein chain is also determined by many different sets of 
weak noncovalent bonds that form between one part of the chain and another. 
These involve atoms in the polypeptide backbone, as well as atoms in the amino 
acid side chains. There are three types of these weak bonds: hydrogen bonds, 
electrostatic attractions, and van der Waals attractions, as explained in Chapter 2 (see 
p. 44). Individual noncovalent bonds are 30-300 times weaker than the typical 
covalent bonds that create biological molecules. But many weak bonds acting in 
parallel can hold two regions of a polypeptide chain tightly together. In this way, 
the combined strength of large numbers of such noncovalent bonds determines 
the stability of each folded shape (Figure 3-4). 

Figure 3-2 The 20 amino acids commonly found in proteins. Each amino acid has a three-letter and a 
one-letter abbreviation. There are equal numbers of polar and nonpolar side chains; however, some side chains listed 
here as polar are large enough to have some nonpolar properties (for example, Tyr, Thr, Arg, Lys). For atomic 
structures, see Panel 3-1 (pp. 112-113). 

Figure 3-3 Steric limitations on the bond angles in a polypeptide 
chain. (A) Each amino acid contributes three bonds (red) to the backbone 
of the chain. The peptide bond is planar (gray shading) and does not permit 
rotation. By contrast, rotation can occur about the Cα-C bond, whose 
angle of rotation is called psi (ψ), and about the N-Cα bond, whose angle of 
rotation is called phi (φ). By convention, an R group is often used to denote 
an amino acid side chain (purple circles). (B) The conformation of the 
main-chain atoms in a protein is determined by one pair of φ and ψ angles for each 
amino acid; because of steric collisions between atoms within each amino 
acid, most of the possible pairs of φ and ψ angles do not occur. In this 
so-called Ramachandran plot, each dot represents an observed pair of angles in 
a protein. The three differently shaded clusters of dots reflect three different 
“secondary structures” repeatedly found in proteins, as will be described in 
the text. (B, from J. Richardson, Adv. Prot. Chem. 34:174-175, 1981. 
© Academic Press.) 
[Figure labels: (A) amino acid, peptide bonds] 
A fourth weak force (a hydrophobic clustering force) also has a central role 
in determining the shape of a protein. As described in Chapter 2, hydrophobic 
molecules, including the nonpolar side chains of particular amino acids, tend to 
be forced together in an aqueous environment in order to minimize their 
disruptive effect on the hydrogen-bonded network of water molecules (see Panel 2-2, 
pp. 92-93). Therefore, an important factor governing the folding of any protein is 

Figure 3-4 Three types of noncovalent bonds help proteins fold. Although a single one of these bonds is quite weak, many 
of them act together to create a strong bonding arrangement, as in the example shown. As in the previous figure, R is used as a 
general designation for an amino acid side chain. 
[Figure label: glutamic acid] 
PANEL 3-1: The 20 Amino Acids Found in Proteins 

Proteins consist exclusively of L-amino acids. 

Amino acids are commonly joined together by an amide linkage, 
called a peptide bond. 

Peptide bond: The four atoms in each gray box form a rigid 
planar unit. There is no rotation around the C-N bond. 

Proteins are long polymers of amino acids linked by 
peptide bonds, and they are always written with the 
N-terminus toward the left. 
The sequence of this tripeptide is histidine-cysteine-valine. 

These two single bonds allow rotation, so that long chains of 
amino acids are very flexible. 


Figure 3-5 How a protein folds into a 
compact conformation. The polar amino 
acid side chains tend to lie on the outside 
of the protein, where they can interact with 
water; the nonpolar amino acid side chains 
are buried on the inside forming a tightly 
packed hydrophobic core of atoms that 
are hidden from water. In this schematic 
drawing, the protein contains only about 
35 amino acids. 
[Figure labels: unfolded polypeptide (nonpolar side chains, polar side chains, 
polypeptide backbone); folded conformation in aqueous environment (polar side 
chains on the outside of the molecule can form hydrogen bonds to water; 
hydrophobic core region contains nonpolar side chains)] 


the distribution of its polar and nonpolar amino acids. The nonpolar (hydrophobic) 
side chains in a protein, belonging to such amino acids as phenylalanine, 
leucine, valine, and tryptophan, tend to cluster in the interior of the molecule 
(just as hydrophobic oil droplets coalesce in water to form one large droplet). This 
enables them to avoid contact with the water that surrounds them inside a cell. 
In contrast, polar groups, such as those belonging to arginine, glutamine, and 
histidine, tend to arrange themselves near the outside of the molecule, where 
they can form hydrogen bonds with water and with other polar molecules (Figure 
3-5). Polar amino acids buried within the protein are usually hydrogen-bonded to 
other polar amino acids or to the polypeptide backbone. 

Proteins Fold into a Conformation of Lowest Energy 

As a result of all of these interactions, most proteins have a particular 
three-dimensional structure, which is determined by the order of the amino acids in the 
chain. The final folded structure, or conformation, of any polypeptide chain is 
generally the one that minimizes its free energy. Biologists have studied 
protein folding in a test tube using highly purified proteins. Treatment with certain 
solvents, which disrupt the noncovalent interactions holding the folded chain 
together, unfolds, or denatures, a protein. This treatment converts the protein into 
a flexible polypeptide chain that has lost its natural shape. When the denaturing 
solvent is removed, the protein often refolds spontaneously, or renatures, into its 
original conformation. This indicates that the amino acid sequence contains all of 
the information needed for specifying the three-dimensional shape of a protein, a 
critical point for understanding cell biology. 

Most proteins fold up into a single stable conformation. However, this 
conformation changes slightly when the protein interacts with other molecules in the 
cell. This change in shape is often crucial to the function of the protein, as we see 
later. 

Although a protein chain can fold into its correct conformation without 
outside help, in a living cell special proteins called molecular chaperones often assist 
in protein folding. Molecular chaperones bind to partly folded polypeptide chains 
and help them progress along the most energetically favorable folding pathway. In 
the crowded conditions of the cytoplasm, chaperones are required to prevent the 
temporarily exposed hydrophobic regions in newly synthesized protein chains 
from associating with each other to form protein aggregates (see p. 355). However, 
the final three-dimensional shape of the protein is still specified by its amino acid 
sequence: chaperones simply make reaching the folded state more reliable. 

Figure 3-6 Four representations 
describing the structure of a small 
protein domain. Constructed from a string 
of 100 amino acids, the SH2 domain is part 
of many different proteins (see, for example, 
Figure 3-61). Here, the structure of the 
SH2 domain is displayed as (A) a 
polypeptide backbone model, (B) a ribbon 
model, (C) a wire model that includes the 
amino acid side chains, and (D) a 
space-filling model (Movie 3.1). These images 
are colored in a way that allows the 
polypeptide chain to be followed from its 
N-terminus (purple) to its C-terminus (red) 
(PDB code: 1SHA). 


Proteins come in a wide variety of shapes, and most are between 50 and 
2000 amino acids long. Large proteins usually consist of several distinct protein 
domains, structural units that fold more or less independently of each other, as 
we discuss below. The structure of even a small domain is complex, and for clarity, 
several different representations are conventionally used, each of which 
emphasizes distinct features. As an example, Figure 3-6 presents four representations 
of a protein domain called SH2, a structure present in many different proteins in 
eukaryotic cells and involved in cell signaling (see Figure 15-46). 

Descriptions of protein structures are aided by the fact that proteins are built 
up from combinations of several common structural motifs, as we discuss next. 

The α Helix and the β Sheet Are Common Folding Patterns 

When we compare the three-dimensional structures of many different protein 
molecules, it becomes clear that, although the overall conformation of each 
protein is unique, two regular folding patterns are often found within them. Both 
patterns were discovered more than 60 years ago from studies of hair and silk. The 
first folding pattern to be discovered, called the α helix, was found in the protein 
α-keratin, which is abundant in skin and its derivatives, such as hair, nails, and 
horns. Within a year of the discovery of the α helix, a second folded structure, 
called a β sheet, was found in the protein fibroin, the major constituent of silk. 
These two patterns are particularly common because they result from 
hydrogen-bonding between the N-H and C=O groups in the polypeptide backbone, 
without involving the side chains of the amino acids. Thus, although incompatible 
with some amino acid side chains, many different amino acid sequences can form 
them. In each case, the protein chain adopts a regular, repeating conformation. 
Figure 3-7 illustrates the detailed structures of these two important 
conformations, which in ribbon models of proteins are represented by a helical ribbon and 
by a set of aligned arrows, respectively. 

Figure 3-7 The regular conformation of the polypeptide backbone in the α helix and the β sheet. The α helix is shown in 
(A) and (B). The N-H of every peptide bond is hydrogen-bonded to the C=O of a neighboring peptide bond located four peptide 
bonds away in the same chain. Note that all of the N-H groups point up in this diagram and that all of the C=O groups point 
down (toward the C-terminus); this gives a polarity to the helix, with the C-terminus having a partial negative and the N-terminus 
a partial positive charge (Movie 3.2). The β sheet is shown in (C) and (D). In this example, adjacent peptide chains run in 
opposite (antiparallel) directions. Hydrogen-bonding between peptide bonds in different strands holds the individual polypeptide 
chains (strands) together in a β sheet, and the amino acid side chains in each strand alternately project above and below the 
plane of the sheet (Movie 3.3). (A) and (C) show all the atoms in the polypeptide backbone, but the amino acid side chains are 
truncated and denoted by R. In contrast, (B) and (D) show only the carbon and nitrogen backbone atoms. 


The cores of many proteins contain extensive regions of β sheet. As shown 
in Figure 3-8, these β sheets can form either from neighboring segments of the 
polypeptide backbone that run in the same orientation (parallel chains) or from 
a polypeptide backbone that folds back and forth upon itself, with each section 
of the chain running in the direction opposite to that of its immediate 
neighbors (antiparallel chains). Both types of β sheet produce a very rigid structure, 
held together by hydrogen bonds that connect the peptide bonds in neighboring 
chains (see Figure 3-7C). 

An α helix is generated when a single polypeptide chain twists around on itself 
to form a rigid cylinder. A hydrogen bond forms between every fourth peptide 
bond, linking the C=O of one peptide bond to the N-H of another (see Figure 
3-7A). This gives rise to a regular helix with a complete turn every 3.6 amino acids. 
The SH2 protein domain illustrated in Figure 3-6 contains two α helices, as well as 
a three-stranded antiparallel β sheet. 

Regions of α helix are abundant in proteins located in cell membranes, such 
as transport proteins and receptors. As we discuss in Chapter 10, those portions 
of a transmembrane protein that cross the lipid bilayer usually cross as α 
helices composed largely of amino acids with nonpolar side chains. The polypeptide 
backbone, which is hydrophilic, is hydrogen-bonded to itself in the α helix and 
shielded from the hydrophobic lipid environment of the membrane by its 
protruding nonpolar side chains (see also Figure 3-75A). 

In other proteins, α helices wrap around each other to form a particularly 
stable structure, known as a coiled-coil. This structure can form when the two (or in 
some cases, three or four) α helices have most of their nonpolar (hydrophobic) 
side chains on one side, so that they can twist around each other with these side 
chains facing inward (Figure 3-9). Long rodlike coiled-coils provide the structural 
framework for many elongated proteins. Examples are α-keratin, which forms the 
intracellular fibers that reinforce the outer layer of the skin and its appendages, 
and the myosin molecules responsible for muscle contraction. 
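
A short calculation shows why nonpolar side chains at regular intervals end up on one side of an α helix. The sketch below is idealized and illustrative: it assumes exactly 3.6 residues per turn (100 degrees of rotation per residue) and uses the repeating seven-residue labels "abcdefg" from Figure 3-9, with nonpolar residues at positions "a" and "d". The function names are hypothetical.

```python
DEGREES_PER_RESIDUE = 100  # 360 degrees / 3.6 residues per turn (idealized)
HEPTAD = "abcdefg"         # repeating seven-residue labels, as in Figure 3-9

def helix_angle(residue_index):
    """Angular position (degrees) of a residue around the helix axis."""
    return (residue_index * DEGREES_PER_RESIDUE) % 360

def core_positions(n_heptads):
    """List (label, residue index, angle) for the 'a' and 'd' positions."""
    positions = []
    for i in range(n_heptads * len(HEPTAD)):
        label = HEPTAD[i % len(HEPTAD)]
        if label in "ad":
            positions.append((label, i, helix_angle(i)))
    return positions

for label, index, angle in core_positions(2):
    print(label, index, angle)
```

In this idealized model the "a" and "d" residues of the first two repeats fall at 0, 300, 340, and 280 degrees, all within the same 80-degree arc of the cylinder surface. Each seven-residue repeat shifts the pattern by only 20 degrees, which is consistent with a nonpolar stripe that winds slowly around the helix.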

Protein Domains Are Modular Units from Which Larger Proteins 
Are Built 

Even a small protein molecule is built from thousands of atoms linked together by 
precisely oriented covalent and noncovalent bonds. Biologists are aided in 
visualizing these extremely complicated structures by various graphic and 
computer-based three-dimensional displays. The student resource site that accompanies 
this book contains computer-generated images of selected proteins, displayed 
and rotated on the screen in a variety of formats. 

Scientists distinguish four levels of organization in the structure of a protein. 
The amino acid sequence is known as the primary structure. Stretches of 
polypeptide chain that form α helices and β sheets constitute the protein's 
secondary structure. The full three-dimensional organization of a polypeptide chain is 
sometimes referred to as the tertiary structure, and if a particular protein 
molecule is formed as a complex of more than one polypeptide chain, the complete 
structure is designated as the quaternary structure. 

Studies of the conformation, function, and evolution of proteins have also 
revealed the central importance of a unit of organization distinct from these four. 
This is the protein domain, a substructure produced by any contiguous part of 
a polypeptide chain that can fold independently of the rest of the protein into a 
compact, stable structure. A domain usually contains between 40 and 350 amino 
acids, and it is the modular unit from which many larger proteins are constructed. 

The different domains of a protein are often associated with different 
functions. Figure 3-10 shows an example, the Src protein kinase, which functions in 
signaling pathways inside vertebrate cells (Src is pronounced “sarc”). This protein 




Figure 3-8 Two types of β sheet 
structures. (A) An antiparallel β sheet (see 
Figure 3-7C). (B) A parallel β sheet. Both of 
these structures are common in proteins. 

Figure 3-9 A coiled-coil. (A) A single α 
helix, with successive amino acid side 
chains labeled in a sevenfold sequence, 
“abcdefg” (from bottom to top). Amino 
acids “a” and “d” in such a sequence lie 
close together on the cylinder surface, 
forming a “stripe” (green) that winds 
slowly around the α helix. Proteins that 
form coiled-coils typically have nonpolar 
amino acids at positions “a” and “d.” 
Consequently, as shown in (B), the two α 
helices can wrap around each other with 
the nonpolar side chains of one α helix 
interacting with the nonpolar side chains 
of the other. (C) The atomic structure 
of a coiled-coil determined by x-ray 
crystallography. The alpha helical backbone 
is shown in red and the nonpolar side 
chains in green, while the more hydrophilic 
amino acid side chains, shown in gray, are 
left exposed to the aqueous environment 
(Movie 3.4). (PDB code: 3NMD.) 

Figure 3-10 A protein formed from 
multiple domains. In the Src protein 
shown, a C-terminal domain with two lobes 
(yellow and orange) forms a protein kinase 
enzyme, while the SH2 and SH3 domains 
perform regulatory functions. (A) A ribbon 
model, with ATP substrate in red. (B) A 
space-filling model, with ATP substrate in 
red. Note that the site that binds ATP is 
positioned at the interface of the two lobes 
that form the kinase. The structure of the 
SH2 domain was illustrated in Figure 3-6. 
(PDB code: 2SRC.) 
[Figure labels: SH3 domain, SH2 domain] 


is considered to have three domains: the SH2 and SH3 domains have regulatory 
roles, while the C-terminal domain is responsible for the kinase catalytic activity. 
Later in the chapter, we shall return to this protein, in order to explain how 
proteins can form molecular switches that transmit information throughout cells. 

Figure 3-11 presents ribbon models of three differently organized protein 
domains. As these examples illustrate, the central core of a domain can be 
constructed from α helices, from β sheets, or from various combinations of these two 
fundamental folding elements. 

The smallest protein molecules contain only a single domain, whereas larger 
proteins can contain several dozen domains, often connected to each other by 
short, relatively unstructured lengths of polypeptide chain that can act as flexible 
hinges between domains. 

Few of the Many Possible Polypeptide Chains Will Be Useful 
to Cells 

Since each of the 20 amino acids is chemically distinct and each can, in 
principle, occur at any position in a protein chain, there are 20 × 20 × 20 × 20 = 160,000 
different possible polypeptide chains four amino acids long, or 20^n different 
possible polypeptide chains n amino acids long. For a typical protein length of about 
300 amino acids, a cell could theoretically make more than 10^390 (20^300) different 
polypeptide chains. This is such an enormous number that to produce just one 
molecule of each kind would require many more atoms than exist in the universe. 
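
The arithmetic in this paragraph is easy to verify with exact integer arithmetic. The snippet below is a minimal check; the function name `chain_count` is an illustrative choice.

```python
AMINO_ACIDS = 20  # number of chemically distinct amino acids

def chain_count(n):
    """Number of possible polypeptide chains that are n amino acids long."""
    return AMINO_ACIDS ** n

print(chain_count(4))                  # chains four amino acids long
print(len(str(chain_count(300))))      # number of decimal digits in 20**300
print(chain_count(300) > 10 ** 390)    # compare with 10**390
```

A chain of four amino acids gives 160,000 possibilities, and 20^300 is a 391-digit number, so it does exceed 10^390.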

Only a very small fraction of this vast set of conceivable polypeptide chains 
would adopt a stable three-dimensional conformation: by some estimates, less 


Figure 3-11 Ribbon models of three
different protein domains. (A) Cytochrome
b562, a single-domain protein involved in
electron transport in mitochondria. This
protein is composed almost entirely of
α helices. (B) The NAD-binding domain of
the enzyme lactic dehydrogenase, which
is composed of a mixture of α helices and
parallel β sheets. (C) The variable domain
of an immunoglobulin (antibody) light
chain, composed of a sandwich of two
antiparallel β sheets. In these examples, the
α helices are shown in green, while strands
organized as β sheets are denoted by red
arrows. Note how the polypeptide chain
generally traverses back and forth across
the entire domain, making sharp turns only
at the protein surface (Movie 3.5). It is the
protruding loop regions (yellow) that often
form the binding sites for other molecules.
(Adapted from drawings courtesy of Jane
Richardson.)
than one in a billion. And yet the majority of proteins present in cells do adopt
unique and stable conformations. How is this possible? The answer lies in natural
selection. A protein with an unpredictably variable structure and biochemical
activity is unlikely to help the survival of a cell that contains it. Such proteins would
therefore have been eliminated by natural selection through the enormously long
trial-and-error process that underlies biological evolution.

Because evolution has selected for protein function in living organisms, the
amino acid sequence of most present-day proteins is such that a single conformation
is stable. In addition, this conformation has its chemical properties finely
tuned to enable the protein to perform a particular catalytic or structural function
in the cell. Proteins are so precisely built that the change of even a few atoms in
one amino acid can sometimes disrupt the structure of the whole molecule so
severely that all function is lost. And, as discussed later in this chapter, when certain
rare protein misfolding accidents occur, the results can be disastrous for the
organisms that contain them.

Proteins Can Be Classified into Many Families 

Once a protein had evolved that folded up into a stable conformation with useful
properties, its structure could be modified during evolution to enable it to
perform new functions. This process has been greatly accelerated by genetic
mechanisms that occasionally duplicate genes, allowing one gene copy to evolve
independently to perform a new function (discussed in Chapter 4). This type of
event has occurred very often in the past; as a result, many present-day proteins
can be grouped into protein families, each family member having an amino acid
sequence and a three-dimensional conformation that resemble those of the other
family members.

Consider, for example, the serine proteases, a large family of protein-cleaving
(proteolytic) enzymes that includes the digestive enzymes chymotrypsin, trypsin,
and elastase, and several proteases involved in blood clotting. When the protease
portions of any two of these enzymes are compared, parts of their amino acid
sequences are found to match. The similarity of their three-dimensional conformations
is even more striking: most of the detailed twists and turns in their
polypeptide chains, which are several hundred amino acids long, are virtually
identical (Figure 3-12). The many different serine proteases nevertheless have
distinct enzymatic activities, each cleaving different proteins or the peptide bonds
between different types of amino acids. Each therefore performs a distinct function
in an organism.

The story we have told for the serine proteases could be repeated for hundreds
of other protein families. In general, the structure of the different members of a 



Figure 3-12 A comparison of the
conformations of two serine proteases.
The backbone conformations of elastase
and chymotrypsin. Although only those
amino acids in the polypeptide chain
shaded in green are the same in the two
proteins, the two conformations are very
similar nearly everywhere. The active site of
each enzyme is circled in red; this is where
the peptide bonds of the proteins that
serve as substrates are bound and cleaved
by hydrolysis. The serine proteases derive
their name from the amino acid serine,
whose side chain is part of the active site
of each enzyme and directly participates
in the cleavage reaction. The two dots on
the right side of the chymotrypsin molecule
mark the new ends created when this
enzyme cuts its own backbone.
Figure 3-13 A comparison of a class of DNA-binding domains, called homeodomains, in a pair of proteins from
two organisms separated by more than a billion years of evolution. (A) A ribbon model of the structure common to
both proteins. (B) A trace of the α-carbon positions. The three-dimensional structures shown were determined by x-ray
crystallography for the yeast α2 protein (green) and the Drosophila engrailed protein (red). (C) A comparison of amino acid
sequences for the region of the proteins shown in (A) and (B). Black dots mark sites with identical amino acids. Orange dots
indicate the position of a three-amino-acid insert in the α2 protein. (Adapted from C. Wolberger et al., Cell 67:517-528, 1991.
With permission from Elsevier.)


protein family has been more highly conserved than has the amino acid sequence.
In many cases, the amino acid sequences have diverged so far that we cannot be
certain of a family relationship between two proteins without determining their
three-dimensional structures. The yeast α2 protein and the Drosophila engrailed
protein, for example, are both gene regulatory proteins in the homeodomain family
(discussed in Chapter 7). Because they are identical in only 17 of their 60 amino
acid residues, their relationship became certain only by comparing their three-dimensional
structures (Figure 3-13). Many similar examples show that two proteins
with more than 25% identity in their amino acid sequences usually share the
same overall structure.
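
The percent identity quoted above is a simple count over aligned positions. The sketch below shows one way to compute it in Python for two sequences that have already been aligned. The function name, the gap character, and the short toy sequences are hypothetical and are used only for illustration; they are not the α2 or engrailed sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned, ungapped positions that carry the same amino acid."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip insertions and deletions
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped positions to compare")
    return 100.0 * matches / compared


# Toy aligned sequences (hypothetical): 8 of 10 positions match.
print(percent_identity("ACDEFGHIKL", "ACDQFGHIRL"))  # 80.0

# The case described in the text: 17 identical residues out of 60.
print(round(100.0 * 17 / 60, 1))  # 28.3
```

Different tools normalize identity differently (for example, by alignment length or by the shorter sequence), so a value is only comparable when the same convention is used throughout.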

The various members of a large protein family often have distinct functions.
Some of the amino acid changes that make family members different were no
doubt selected in the course of evolution because they resulted in useful changes
in biological activity, giving the individual family members the different functional
properties they have today. But many other amino acid changes are effectively
“neutral,” having neither a beneficial nor a damaging effect on the basic structure
and function of the protein. In addition, since mutation is a random process, there
must also have been many deleterious changes that altered the three-dimensional
structure of these proteins sufficiently to harm them. Such faulty proteins would
have been lost whenever the individual organisms making them were at enough
of a disadvantage to be eliminated by natural selection.

Protein families are readily recognized when the genome of any organism is
sequenced; for example, the determination of the DNA sequence for the entire
human genome has revealed that we contain about 21,000 protein-coding genes.
(Note, however, that as a result of alternative RNA splicing, human cells can produce
much more than 21,000 different proteins, as will be explained in Chapter
6.) Through sequence comparisons, we can assign the products of at least 40% of
our protein-coding genes to known protein structures, belonging to more than
500 different protein families. Most of the proteins in each family have evolved
to perform somewhat different functions, as for the enzymes elastase and chymotrypsin
illustrated previously in Figure 3-12. As explained in Chapter 1 (see
Figure 1-21), these are sometimes called paralogs to distinguish them from the
many corresponding proteins in different organisms (orthologs, such as mouse
and human elastase).
As described in Chapter 8, because of the powerful techniques of x-ray crystallography
and nuclear magnetic resonance (NMR), we now know the three-dimensional
shapes, or conformations, of more than 100,000 proteins. By carefully
comparing the conformations of these proteins, structural biologists (that is,
experts on the structure of biological molecules) have concluded that there are
a limited number of ways in which protein domains fold up in nature, maybe as
few as 2000, if we consider all organisms. For most of these so-called protein folds,
representative structures have been determined.

The present database of known protein sequences contains more than twenty
million entries, and it is growing very rapidly as more and more genomes are
sequenced, revealing huge numbers of new genes that encode proteins. The
encoded polypeptides range widely in size, from 6 amino acids to a gigantic protein
of 33,000 amino acids. Protein comparisons are important because related
structures often imply related functions. Many years of experimentation can be
saved by discovering that a new protein has an amino acid sequence similarity
with a protein of known function. Such sequence relationships, for example, first
indicated that certain genes that cause mammalian cells to become cancerous
encode protein kinases (discussed in Chapter 20).

Some Protein Domains Are Found in Many Different Proteins 

As previously stated, most proteins are composed of a series of protein domains,
in which different regions of the polypeptide chain fold independently to form
compact structures. Such multidomain proteins are believed to have originated
from the accidental joining of the DNA sequences that encode each domain, creating
a new gene. In an evolutionary process called domain shuffling, many large
proteins have evolved through the joining of preexisting domains in new combinations
(Figure 3-14). Novel binding surfaces have often been created at the
juxtaposition of domains, and many of the functional sites where proteins bind to
small molecules are found to be located there.

A subset of protein domains has been especially mobile during evolution;
these seem to have particularly versatile structures and are sometimes referred to
as protein modules. The structure of one, the SH2 domain, was illustrated in Figure 
3-6. Three other abundant protein domains are illustrated in Figure 3-15. 

Each of the domains shown has a stable core structure formed from strands
of β sheets, from which less-ordered loops of polypeptide chain protrude. The
loops are ideally situated to form binding sites for other molecules, as most clearly
demonstrated for the immunoglobulin fold, which forms the basis for antibody
molecules. Such β-sheet-based domains may have achieved their evolutionary
success because they provide a convenient framework for the generation of new
binding sites for ligands, requiring only small changes to their protruding loops
(see Figure 3-42).
Figure 3-14 Domain shuffling. An
extensive shuffling of blocks of protein
sequence (protein domains) has occurred
during protein evolution. Those portions
of a protein denoted by the same shape
and color in this diagram are evolutionarily
related. Serine proteases like chymotrypsin
are formed from two domains (brown). In
the three other proteases shown, which
are highly regulated and more specialized,
these two protease domains are connected
to one or more domains that are similar to
domains found in epidermal growth factor
(EGF; green), to a calcium-binding protein
(yellow), or to a “kringle” domain (blue).
Chymotrypsin is illustrated in Figure 3-12.
Figure 3-15 The three-dimensional
structures of three commonly used
protein domains. In these ribbon
diagrams, β-sheet strands are shown
as arrows, and the N- and C-termini are
indicated by red spheres. Many more such
“modules” exist in nature. (Adapted from
M. Baron, D.G. Norman and I.D. Campbell,
Trends Biochem. Sci. 16:13-17, 1991, with
permission from Elsevier, and D.J. Leahy
et al., Science 258:987-991, 1992, with
permission from AAAS.)
Figure 3-16 An extended structure formed from a series of protein
domains. Four fibronectin type 3 domains (see Figure 3-15) from the
extracellular matrix molecule fibronectin are illustrated in (A) ribbon and (B)
space-filling models. (Adapted from D.J. Leahy, I. Aukhil and H.P. Erickson,
Cell 84:155-164, 1996. With permission from Elsevier.)


A second feature of these protein domains that explains their utility is the ease
with which they can be integrated into other proteins. Two of the three domains
illustrated in Figure 3-15 have their N- and C-terminal ends at opposite poles of
the domain. When the DNA encoding such a domain undergoes tandem duplication,
which is not unusual in the evolution of genomes (discussed in Chapter 4),
the duplicated domains with this “in-line” arrangement can be readily linked in
series to form extended structures, either with themselves or with other in-line
domains (Figure 3-16). Stiff extended structures composed of a series of domains
are especially common in extracellular matrix molecules and in the extracellular
portions of cell-surface receptor proteins. Other frequently used domains, including
the kringle domain illustrated in Figure 3-15 and the SH2 domain, are of a
“plug-in” type, with their N- and C-termini close together. After genomic rearrangements,
such domains are usually accommodated as an insertion into a loop
region of a second protein.

A comparison of the relative frequency of domain utilization in different
eukaryotes reveals that, for many common domains, such as protein kinases, this
frequency is similar in organisms as diverse as yeast, plants, worms, flies, and
humans. But there are some notable exceptions, such as the Major Histocompatibility
Complex (MHC) antigen-recognition domain (see Figure 24-36) that
is present in 57 copies in humans, but absent in the other four organisms just
mentioned. Domains such as these have specialized functions that are not shared
with the other eukaryotes; they are assumed to have been strongly selected for
during recent evolution to produce the multiple copies observed. Similarly, the
SH2 domain shows an unusual increase in its numbers in higher eukaryotes; such
domains might be assumed to be especially useful for multicellularity.

Certain Pairs of Domains Are Found Together in Many Proteins 

We can construct a large table displaying domain usage for each organism whose 
genome sequence is known. For example, the human genome contains the DNA 
sequences for about 1000 immunoglobulin domains, 500 protein kinase domains, 
250 DNA-binding homeodomains, 300 SH3 domains, and 120 SH2 domains. In 
addition, we find that more than two-thirds of all proteins consist of two or more
domains, and that the same pairs of domains occur repeatedly in the same relative
arrangement in a protein. Although half of all domain families are common
to archaea, bacteria, and eukaryotes, only about 5% of the two-domain combinations
are similarly shared. This pattern suggests that most proteins containing
especially useful two-domain combinations arose through domain shuffling
relatively late in evolution.
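
A table of domain usage like the one described above can be tallied with a few lines of code. The sketch below is a minimal Python illustration that counts how often each ordered pair of neighboring domains occurs across a set of proteins. The protein names and domain architectures in `toy_proteome` are invented for illustration and are not taken from any genome annotation.

```python
from collections import Counter


def adjacent_domain_pairs(architecture):
    """Ordered pairs of neighboring domains, read from N- to C-terminus."""
    return list(zip(architecture, architecture[1:]))


def count_domain_pairs(proteome):
    """Count in how many proteins each ordered domain pair occurs."""
    counts = Counter()
    for domains in proteome.values():
        # A set makes each pair count at most once per protein.
        counts.update(set(adjacent_domain_pairs(domains)))
    return counts


# Hypothetical architectures, listed N-terminus to C-terminus.
toy_proteome = {
    "protein_1": ["SH3", "SH2", "kinase"],
    "protein_2": ["SH3", "SH2"],
    "protein_3": ["EGF", "protease"],
    "protein_4": ["kinase"],
}

pair_counts = count_domain_pairs(toy_proteome)
multidomain = sum(1 for d in toy_proteome.values() if len(d) >= 2)

print(pair_counts.most_common(1))       # [(('SH3', 'SH2'), 2)]
print(multidomain / len(toy_proteome))  # 0.75
```

Keeping the pairs ordered matters here, because the text refers to domains recurring in the same relative arrangement, not merely in the same protein.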

The Human Genome Encodes a Complex Set of Proteins, 
Revealing That Much Remains Unknown 

The result of sequencing the human genome has been surprising, because it
reveals that our chromosomes contain only about 21,000 protein-coding genes.
Based on this number alone, we would appear to be no more complex than the
tiny mustard weed, Arabidopsis, and only about 1.3-fold more complex than a
nematode worm. The genome sequences also reveal that vertebrates have inherited
nearly all of their protein domains from invertebrates, with only 7% of identified
human domains being vertebrate-specific.

Each of our proteins is on average more complicated, however (Figure 3-17). 
Domain shuffling during vertebrate evolution has given rise to many novel
Figure 3-17 Domain structure of a group
of evolutionarily related proteins that
are thought to have a similar function. In
general, there is a tendency for the proteins
in more complex organisms, such as
humans, to contain additional domains, as
is the case for the DNA-binding protein
compared here.
Figure 3-18 Two identical protein
subunits binding together to form
a symmetric protein dimer. The Cro
repressor protein from bacteriophage
lambda binds to DNA to turn off a specific
subset of viral genes. Its two identical
subunits bind head-to-head, held together
by a combination of hydrophobic forces
(blue) and a set of hydrogen bonds (yellow
region). (Adapted from D.H. Ohlendorf,
D.E. Tronrud and B.W. Matthews, J. Mol.
Biol. 280:129-136, 1998. With permission
from Academic Press.)


combinations of protein domains, with the result that there are nearly twice as
many combinations of domains found in human proteins as in a worm or a fly.
Thus, for example, the trypsinlike serine protease domain is linked to at least 18
other types of protein domains in human proteins, whereas it is found covalently
joined to only 5 different domains in the worm. This extra variety in our proteins
greatly increases the range of protein-protein interactions possible (see Figure
3-79), but how it contributes to making us human is not known.

The complexity of living organisms is staggering, and it is quite sobering to
note that we currently lack even the tiniest hint of what the function might be
for more than 10,000 of the proteins that have thus far been identified through
examining the human genome. There are certainly enormous challenges ahead
for the next generation of cell biologists, with no shortage of fascinating mysteries
to solve.


Larger Protein Molecules Often Contain More Than One
Polypeptide Chain 

The same weak noncovalent bonds that enable a protein chain to fold into a specific
conformation also allow proteins to bind to each other to produce larger
structures in the cell. Any region of a protein's surface that can interact with
another molecule through sets of noncovalent bonds is called a binding site. A
protein can contain binding sites for various large and small molecules. If a binding
site recognizes the surface of a second protein, the tight binding of two folded
polypeptide chains at this site creates a larger protein molecule with a precisely
defined geometry. Each polypeptide chain in such a protein is called a protein
subunit.

In the simplest case, two identical folded polypeptide chains bind to each
other in a “head-to-head” arrangement, forming a symmetric complex of two
protein subunits (a dimer) held together by interactions between two identical
binding sites. The Cro repressor protein, a viral gene regulatory protein that binds
to DNA to turn specific viral genes off in an infected bacterial cell, provides an
example (Figure 3-18). Cells contain many other types of symmetric protein complexes,
formed from multiple copies of a single polypeptide chain (for example,
see Figure 3-20 below).

Many of the proteins in cells contain two or more types of polypeptide chains. 
Hemoglobin, the protein that carries oxygen in red blood cells, contains two 
identical α-globin subunits and two identical β-globin subunits, symmetrically
arranged (Figure 3-19). Such multisubunit proteins are very common in cells, 
and they can be very large (Movie 3.6). 

Some Globular Proteins Form Long Helical Filaments 

Most of the proteins that we have discussed so far are globular proteins, in which
the polypeptide chain folds up into a compact shape like a ball with an irregular
surface. Some of these protein molecules can nevertheless assemble to form
filaments that may span the entire length of a cell. Most simply, a long chain of
identical protein molecules can be constructed if each molecule has a binding



Figure 3-19 A protein formed as a
symmetric assembly using two each of
two different subunits. Hemoglobin is an
abundant protein in red blood cells that
contains two copies of α-globin (green)
and two copies of β-globin (blue). Each of
these four polypeptide chains contains a
heme molecule (red), which is the site that
binds oxygen (O2). Thus, each molecule
of hemoglobin in the blood carries four
molecules of oxygen. (PDB code: 2DHB.)
Figure 3-20 Protein assemblies. (A) A protein with just one binding site can
form a dimer with another identical protein. (B) Identical proteins with two
different binding sites often form a long helical filament. (C) If the two binding
sites are disposed appropriately in relation to each other, the protein subunits
may form a closed ring instead of a helix. (For an example of A, see
Figure 3-18; for an example of B, see Figure 3-21; for examples of C, see
Figures 5-14 and 14-31.)


site complementary to another region of the surface of the same molecule (Figure
3-20). An actin filament, for example, is a long helical structure produced from
many molecules of the protein actin (Figure 3-21). Actin is a globular protein that
is very abundant in eukaryotic cells, where it forms one of the major filament systems
of the cytoskeleton (discussed in Chapter 16).

We will encounter many helical structures in this book. Why is a helix such a
common structure in biology? As we have seen, biological structures are often
formed by linking similar subunits into long, repetitive chains. If all the subunits
are identical, the neighboring subunits in the chain can often fit together in only
one way, adjusting their relative positions to minimize the free energy of the contact
between them. As a result, each subunit is positioned in exactly the same way
in relation to the next, so that subunit 3 fits onto subunit 2 in the same way that
subunit 2 fits onto subunit 1, and so on. Because it is very rare for subunits to join
up in a straight line, this arrangement generally results in a helix, a regular structure
that resembles a spiral staircase, as illustrated in Figure 3-22. Depending on
the twist of the staircase, a helix is said to be either right-handed or left-handed
(see Figure 3-22E). Handedness is not affected by turning the helix upside down,
but it is reversed if the helix is reflected in the mirror.

The observation that helices occur commonly in biological structures holds
true whether the subunits are small molecules linked together by covalent bonds
(for example, the amino acids in an α helix) or large protein molecules that are
linked by noncovalent forces (for example, the actin molecules in actin filaments).
This is not surprising. A helix is an unexceptional structure, and it is generated
simply by placing many similar subunits next to each other, each in the same
strictly repeated relationship to the one before (that is, with a fixed rotation
followed by a fixed translation along the helix axis, as in a spiral staircase).
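
The rule of a fixed rotation followed by a fixed translation can be turned directly into coordinates. The following Python sketch is a minimal illustration under stated assumptions: the helix axis is taken to be the z axis, each subunit is reduced to a single point, and the function names and numerical values are chosen here for illustration only. In this convention a positive rotation angle combined with a positive rise gives a right-handed helix.

```python
import math


def helix_positions(n_subunits, rotation_deg, rise, radius=1.0):
    """Place subunits by repeating one fixed rotation plus one fixed translation.

    The helix axis is the z axis (a choice made for this sketch).
    """
    theta = math.radians(rotation_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    x, y, z = radius, 0.0, 0.0  # position of the first subunit
    positions = [(x, y, z)]
    for _ in range(n_subunits - 1):
        # Every subunit has the same relationship to the one before it:
        # rotate about the axis, then move along the axis.
        x, y = x * cos_t - y * sin_t, x * sin_t + y * cos_t
        z += rise
        positions.append((x, y, z))
    return positions


def mirror(positions):
    """Reflect the structure in the x-z plane (y becomes -y)."""
    return [(x, -y, z) for x, y, z in positions]


# Six subunits per turn: a 60 degree rotation between neighbors.
right_handed = helix_positions(7, rotation_deg=60.0, rise=1.0)
left_handed = helix_positions(7, rotation_deg=-60.0, rise=1.0)
```

With six subunits per turn, the seventh subunit sits directly above the first, and reflecting the right-handed helix in a mirror plane reproduces the left-handed one, in line with the statement that handedness is reversed by reflection.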

Many Protein Molecules Have Elongated, Fibrous Shapes

Enzymes tend to be globular proteins: even though many are large and complicated,
with multiple subunits, most have an overall rounded shape. In Figure 3-21,
we saw that a globular protein can also associate to form long filaments. But there
are also functions that require each individual protein molecule to span a large
distance. These proteins generally have a relatively simple, elongated three-dimensional
structure and are commonly referred to as fibrous proteins.

One large family of intracellular fibrous proteins consists of α-keratin, introduced
when we presented the α helix, and its relatives. Keratin filaments are
extremely stable and are the main component in long-lived structures such as
hair, horn, and nails. An α-keratin molecule is a dimer of two identical subunits,
with the long α helices of each subunit forming a coiled-coil (see Figure 3-9). The
coiled-coil regions are capped at each end by globular domains containing binding
sites. This enables this class of protein to assemble into ropelike intermediate
filaments, an important component of the cytoskeleton that creates the cell's
internal structural framework (see Figure 16-67).

Fibrous proteins are especially abundant outside the cell, where they are a
main component of the gel-like extracellular matrix that helps to bind collections
of cells together to form tissues. Cells secrete extracellular matrix proteins into
their surroundings, where they often assemble into sheets or long fibrils. Collagen
is the most abundant of these proteins in animal tissues. A collagen molecule
consists of three long polypeptide chains, each containing the nonpolar amino
Figure 3-21 Actin filaments.
(A) Transmission electron micrographs of
negatively stained actin filaments. (B) The
helical arrangement of actin molecules in an
actin filament. (A, courtesy of Roger Craig.)
Figure 3-22 Some properties of a helix.
(A-D) A helix forms when a series of
subunits bind to each other in a regular
way. At the bottom, each of these helices
is viewed from directly above the helix and
seen to have two (A), three (B), and six
(C and D) subunits per helical turn. Note
that the helix in (D) has a wider path
than that in (C), but the same number of
subunits per turn. (E) As discussed in the
text, a helix can be either right-handed or
left-handed. As a reference, it is useful to
remember that standard metal screws,
which insert when turned clockwise, are
right-handed. Note that a helix retains the
same handedness when it is turned upside
down. (PDB code: 2DHB.)


acid glycine at every third position. This regular structure allows the chains to 
wind around one another to generate a long regular triple helix (Figure 3-23A).
Many collagen molecules then bind to one another side-by-side and end-to-end
to create long overlapping arrays, thereby generating the extremely tough
collagen fibrils that give connective tissues their tensile strength, as described in 
Chapter 19. 

Proteins Contain a Surprisingly Large Amount of Intrinsically 
Disordered Polypeptide Chain 

It has been well known for a long time that, in complete contrast to collagen,
another abundant protein in the extracellular matrix, elastin, is formed as a highly
disordered polypeptide. This disorder is essential for elastin's function. Its relatively
loose and unstructured polypeptide chains are covalently cross-linked to
Figure 3-23 Collagen and elastin. (A) Collagen is a triple helix formed by three extended protein chains that wrap around one another (bottom).
Many rodlike collagen molecules are cross-linked together in the extracellular space to form unextendable collagen fibrils (top) that have the tensile
strength of steel. The striping on the collagen fibril is caused by the regular repeating arrangement of the collagen molecules within the fibril.
(B) Elastin polypeptide chains are cross-linked together in the extracellular space to form rubberlike, elastic fibers. Each elastin molecule uncoils into a
more extended conformation when the fiber is stretched and recoils spontaneously as soon as the stretching force is relaxed. The cross-linking in the
extracellular space mentioned creates covalent linkages between lysine side chains, but the chemistry is different for collagen and elastin.
produce a rubberlike, elastic meshwork that can be reversibly pulled from one
conformation to another, as illustrated in Figure 3-23B. The elastic fibers that
result enable skin and other tissues, such as arteries and lungs, to stretch and
recoil without tearing.

Intrinsically disordered regions of proteins are frequent in nature, and they
have important functions in the interior of cells. As we have already seen, proteins
often have loops of polypeptide chain that protrude from the core region of a protein
domain to bind other molecules. Some of these loops remain largely unstructured
until they bind to a target molecule, adopting a specific folded conformation
only when this other molecule is bound. Many proteins were also known to
have intrinsically disordered tails at one or the other end of a structured domain
(see, for example, the histones in Figure 4-24). But the extent of such disordered
structure only became clear when genomes were sequenced. This allowed bioinformatic
methods to be used to analyze the amino acid sequences that genes
encode, searching for disordered regions based on their unusually low hydrophobicity
and relatively high net charge. Combining these results with other data, it is
now thought that perhaps a quarter of all eukaryotic proteins can adopt structures
that are mostly disordered, fluctuating rapidly between many different conformations.
Many such intrinsically disordered regions contain repeated sequences of
amino acids. What do these disordered regions do?
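
Before turning to that question, the sequence-based search described above can be sketched in a few lines of Python. This is only a toy illustration of scoring a sequence by low hydrophobicity and high net charge. The choice of the Kyte-Doolittle hydropathy scale, the two cutoff values, the function names, and both example sequences are assumptions made here; they are not taken from the text, and real disorder predictors are considerably more elaborate.

```python
# Kyte-Doolittle hydropathy values (higher means more hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def mean_hydropathy(seq):
    """Average hydropathy per residue."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def net_charge_per_residue(seq):
    """Absolute net charge per residue, counting K/R as +1 and D/E as -1."""
    positive = sum(1 for aa in seq if aa in "KR")
    negative = sum(1 for aa in seq if aa in "DE")
    return abs(positive - negative) / len(seq)


def looks_disordered(seq, max_hydropathy=-1.0, min_charge=0.1):
    """Crude flag: low hydrophobicity AND high net charge.

    Both cutoffs are hypothetical values chosen for this sketch.
    """
    return (mean_hydropathy(seq) < max_hydropathy
            and net_charge_per_residue(seq) > min_charge)


# Invented example sequences, not real proteins.
charged_tail = "KKSPKKSPEKSPKK"
hydrophobic_core = "ILVAFLVIAG"

print(looks_disordered(charged_tail))      # True
print(looks_disordered(hydrophobic_core))  # False
```

In practice such scores are usually computed over a sliding window along the sequence, so that a disordered tail can be detected next to a well-folded domain in the same protein.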

Some known functions are illustrated in Figure 3-24. One predominant function
is to form specific binding sites for other protein molecules that are of high
specificity, but readily altered by protein phosphorylation, protein dephosphorylation,
or any of the other covalent modifications that are triggered by cell signaling
events (Figure 3-24A and B). We shall see, for example, that the eukaryotic
RNA polymerase enzyme that produces mRNAs contains a long, unstructured
C-terminal tail that is covalently modified as its RNA synthesis proceeds, thereby
attracting specific other proteins to the transcription complex at different times
(see Figure 6-22). And this unstructured tail interacts with a different type of low
complexity domain when the RNA polymerase is recruited to the specific sites on
the DNA where it begins synthesis.

As illustrated in Figure 3-24C, an unstructured region can also serve as a
“tether” to hold two protein domains in close proximity to facilitate their interaction.
For example, it is this tethering function that allows substrates to move
between active sites in large multienzyme complexes (see Figure 3-54). A similar
tethering function allows large scaffold proteins with multiple protein-binding
sites to concentrate sets of interacting proteins, both increasing reaction rates and
confining their reaction to a particular site in a cell (see Figure 3-78).

Like elastin, other proteins have a function that directly requires that they
remain largely unstructured. Thus, large numbers of disordered protein chains
in close proximity can create micro-regions of gel-like consistency inside the cell
that restrict diffusion. For example, the abundant nucleoporins that coat the inner
surface of the nuclear pore complex form a random coil meshwork (Figure 3-24D)
that is critical for selective nuclear transport (see Figure 12-8).
Figure 3-24 Some important functions
for intrinsically disordered protein
sequences. (A) Unstructured regions
of polypeptide chain often form binding
sites for other proteins. Although these
binding events are of high specificity,
they are often of low affinity due to the
free-energy cost of folding the normally
unfolded partner (and they are thus readily
reversible). (B) Unstructured regions can
be easily modified covalently to change
their binding preferences, and they are
therefore frequently involved in cell signaling
processes. In this schematic, multiple sites
of protein phosphorylation are indicated.
(C) Unstructured regions frequently create
“tethers” that hold interacting protein
domains in close proximity. (D) A dense
network of unstructured proteins can form
a diffusion barrier, as the nucleoporins do
for the nuclear pore.
Figure 3-25 Disulfide bonds. Covalent
disulfide bonds form between adjacent
cysteine side chains. These cross-linkages
can join either two parts of the
same polypeptide chain or two different
polypeptide chains. Since the energy
required to break one covalent bond is
much larger than the energy required to
break even a whole set of noncovalent
bonds (see Table 2-1, p. 45), a disulfide
bond can have a major stabilizing effect on
a protein (Movie 3.7).


Covalent Cross-Linkages Stabilize Extracellular Proteins 

Many protein molecules are either attached to the outside of a cell's plasma
membrane or secreted as part of the extracellular matrix. All such proteins are directly
exposed to extracellular conditions. To help maintain their structures, the
polypeptide chains in such proteins are often stabilized by covalent cross-linkages.
These linkages can either tie together two amino acids in the same protein, or
connect different polypeptide chains in a multisubunit protein. Although many
other types exist, the most common cross-linkages in proteins are covalent
sulfur-sulfur bonds. These disulfide bonds (also called S-S bonds) form as cells prepare
newly synthesized proteins for export. As described in Chapter 12, their formation
is catalyzed in the endoplasmic reticulum by an enzyme that links together two
pairs of -SH groups of cysteine side chains that are adjacent in the folded protein
(Figure 3-25). Disulfide bonds do not change the conformation of a protein but
instead act as atomic staples to reinforce its most favored conformation. For
example, lysozyme—an enzyme in tears that dissolves bacterial cell walls—retains its
antibacterial activity for a long time because it is stabilized by such cross-linkages.

Disulfide bonds generally fail to form in the cytosol, where a high concentration
of reducing agents converts S-S bonds back to cysteine -SH groups.
Apparently, proteins do not require this type of reinforcement in the relatively mild
environment inside the cell.

Protein Molecules Often Serve as Subunits for the Assembly of
Large Structures 

The same principles that enable a protein molecule to associate with itself to form
rings or a long filament also operate to generate much larger structures formed
from a set of different macromolecules, such as enzyme complexes, ribosomes,
viruses, and membranes. These large objects are not made as single, giant,
covalently linked molecules. Instead they are formed by the noncovalent assembly of
many separately manufactured molecules, which serve as the subunits of the final
structure.

The use of smaller subunits to build larger structures has several advantages: 

1. A large structure built from one or a few repeating smaller subunits requires 
only a small amount of genetic information. 

2. Both assembly and disassembly can be readily controlled reversible
processes, because the subunits associate through multiple bonds of relatively
low energy.

3. Errors in the synthesis of the structure can be more easily avoided, since 
correction mechanisms can operate during the course of assembly to 
exclude malformed subunits. 

Figure 3-26 Single protein subunits form
protein assemblies that feature multiple
protein-protein contacts. Hexagonally
packed globular protein subunits are
shown here forming either flat sheets or
tubes. Generally, such large structures are
not considered to be single “molecules.”
Instead, like the actin filament described
previously, they are viewed as assemblies
formed of many different molecules.


Some protein subunits assemble into flat sheets in which the subunits are
arranged in hexagonal patterns. Specialized membrane proteins are sometimes
arranged this way in lipid bilayers. With a slight change in the geometry of the
individual subunits, a hexagonal sheet can be converted into a tube (Figure 3-26)
or, with more changes, into a hollow sphere. Protein tubes and spheres that bind
specific RNA and DNA molecules in their interior form the coats of viruses.

The formation of closed structures, such as rings, tubes, or spheres, provides 
additional stability because it increases the number of bonds between the protein 
subunits. Moreover, because such a structure is created by mutually dependent,
cooperative interactions between subunits, a relatively small change that affects
each subunit individually can cause the structure to assemble or disassemble.
These principles are dramatically illustrated in the protein coat or capsid of many
simple viruses, which takes the form of a hollow sphere based on an icosahedron
(Figure 3-27). Capsids are often made of hundreds of identical protein subunits
that enclose and protect the viral nucleic acid (Figure 3-28). The protein in such
a capsid must have a particularly adaptable structure: not only must it make
several different kinds of contacts to create the sphere, it must also change this
arrangement to let the nucleic acid out to initiate viral replication once the virus
has entered a cell. 


Many Structures in Cells Are Capable of Self-Assembly 

The information for forming many of the complex assemblies of macromolecules 
in cells must be contained in the subunits themselves, because purified subunits 
can spontaneously assemble into the final structure under the appropriate
conditions. The first large macromolecular aggregate shown to be capable of
self-assembly from its component parts was tobacco mosaic virus (TMV). This virus is
a long rod in which a cylinder of protein is arranged around a helical RNA core 
(Figure 3-29). If the dissociated RNA and protein subunits are mixed together in 
solution, they recombine to form fully active viral particles. The assembly process 
is unexpectedly complex and includes the formation of double rings of protein, 
which serve as intermediates that add to the growing viral coat. 

Another complex macromolecular aggregate that can reassemble from its 
component parts is the bacterial ribosome. This structure is composed of about 
55 different protein molecules and 3 different rRNA molecules. Incubating a
mixture of the individual components under appropriate conditions in a test tube
causes them to spontaneously re-form the original structure. Most importantly,
such reconstituted ribosomes are able to catalyze protein synthesis. As might be
expected, the reassembly of ribosomes follows a specific pathway: after certain
proteins have bound to the RNA, this complex is then recognized by other
proteins, and so on, until the structure is complete.

It is still not clear how some of the more elaborate self-assembly processes 
are regulated. Many structures in the cell, for example, seem to have a precisely
defined length that is many times greater than that of their component
macromolecules. How such length determination is achieved is in many cases a mystery. In

Figure 3-27 The protein capsid of a
virus. The structure of the simian virus 
SV40 capsid has been determined by x-ray 
crystallography and, as for the capsids of 
many other viruses, it is known in atomic 
detail. (Courtesy of Robert Grant, Stephan 
Crainic, and James M. Hogle.) 

Figure 3-28 The structure of a spherical
virus. In viruses, many copies of a single 
protein subunit often pack together 
to create a spherical shell (a capsid). 

This capsid encloses the viral genome, 
composed of either RNA or DNA (see also 
Figure 3-27). For geometric reasons, no
more than 60 identical subunits can pack 
together in a precisely symmetric way. If 
slight irregularities are allowed, however, 
more subunits can be used to produce 
a larger capsid that retains icosahedral 
symmetry. The tomato bushy stunt virus 
(TBSV) shown here, for example, is a 
spherical virus about 33 nm in diameter 
formed from 180 identical copies of a 
386-amino-acid capsid protein plus an 
RNA genome of 4500 nucleotides. To
construct such a large capsid, the protein 
must be able to fit into three somewhat 
different environments. This requires three 
slightly different conformations, each of 
which is differently colored in the virus 
particle shown here. The postulated 
pathway of assembly is shown; the precise
three-dimensional structure has been 
determined by x-ray diffraction. (Courtesy 
of Steve Harrison.) 
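
The subunit counts quoted in the Figure 3-28 caption (60 for a strictly symmetric shell, 180 for TBSV) can be reproduced with the Caspar-Klug triangulation number. That rule is not stated in this chapter and is brought in here as an outside assumption: a quasi-equivalent icosahedral shell has T = h^2 + hk + k^2 and is built from 60T subunits occupying T slightly different environments. The function names are illustrative only; a minimal sketch in Python:

```python
def triangulation_number(h: int, k: int) -> int:
    """Caspar-Klug triangulation number T = h^2 + h*k + k^2 (assumed rule, not from the chapter)."""
    if h < 0 or k < 0 or (h == 0 and k == 0):
        raise ValueError("h and k must be non-negative integers, not both zero")
    return h * h + h * k + k * k


def capsid_subunits(h: int, k: int) -> int:
    """Number of identical protein subunits in an icosahedral shell with indices (h, k)."""
    return 60 * triangulation_number(h, k)


if __name__ == "__main__":
    # (1, 0) is the strictly symmetric 60-subunit shell;
    # (1, 1) gives T = 3, matching the 180 copies and three conformations cited for TBSV.
    for h, k in [(1, 0), (1, 1), (2, 0), (2, 1)]:
        t = triangulation_number(h, k)
        print(f"h={h} k={k}  T={t}  subunits={capsid_subunits(h, k)}")
```

Under this assumption, the "slight irregularities" mentioned in the caption correspond to T greater than 1, and the number of distinct conformations the subunit must adopt equals T.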


the simplest case, a long core protein or other macromolecule provides a scaffold
that determines the extent of the final assembly. This is the mechanism that
determines the length of the TMV particle, where the RNA chain provides the core.
Similarly, a core protein interacting with actin is thought to determine the length
of the thin filaments in muscle.

Figure 3-29 The structure of tobacco mosaic virus (TMV). (A) An electron micrograph of the viral particle, which consists of
a single long RNA molecule enclosed in a cylindrical protein coat composed of identical protein subunits. (B) A model showing
part of the structure of TMV. A single-stranded RNA molecule of 6395 nucleotides is packaged in a helical coat constructed
from 2130 copies of a coat protein 158 amino acids long. Fully infective viral particles can self-assemble in a test tube from
purified RNA and protein molecules. (A, courtesy of Robley Williams; B, courtesy of Richard J. Feldmann.)
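
The numbers in the Figure 3-29 caption give a quick check on two points made in the text: the RNA core sets the length of the particle, and a structure built from one repeating subunit needs little genetic information. The sketch below uses only the caption's figures plus the standard three nucleotides per codon; the variable names are illustrative, and the stop codon is ignored for simplicity.

```python
# Figures taken from the Figure 3-29 caption.
RNA_NUCLEOTIDES = 6395
COAT_PROTEIN_COPIES = 2130
COAT_PROTEIN_LENGTH_AA = 158

# Each coat protein subunit covers a fixed stretch of RNA, so the RNA length
# fixes how many subunits fit and hence how long the rod is.
nucleotides_per_subunit = RNA_NUCLEOTIDES / COAT_PROTEIN_COPIES

# Genetic cost of the coat: one gene of 3 nucleotides per amino acid
# (stop codon ignored in this rough estimate).
coding_nucleotides = 3 * COAT_PROTEIN_LENGTH_AA
fraction_of_genome = coding_nucleotides / RNA_NUCLEOTIDES

# Total amino acids in the assembled coat, all specified by that single gene.
total_coat_residues = COAT_PROTEIN_COPIES * COAT_PROTEIN_LENGTH_AA

print(f"nucleotides bound per subunit: {nucleotides_per_subunit:.2f}")
print(f"coat gene: {coding_nucleotides} nt ({fraction_of_genome:.1%} of the genome)")
print(f"amino acids in the assembled coat: {total_coat_residues}")
```

The ratio comes out very close to three nucleotides per subunit, and a gene of under 500 nucleotides specifies a coat containing several hundred thousand amino acids.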

Figure 3-30 Proteolytic cleavage in insulin assembly. The polypeptide
hormone insulin cannot spontaneously re-form efficiently if its disulfide bonds
are disrupted. It is synthesized as a larger protein (proinsulin) that is cleaved
by a proteolytic enzyme after the protein chain has folded into a specific
shape. Excision of part of the proinsulin polypeptide chain removes some of
the information needed for the protein to fold spontaneously into its normal
conformation. Once insulin has been denatured and its two polypeptide
chains have separated, its ability to reassemble is lost.


Assembly Factors Often Aid the Formation of Complex Biological 
Structures 

Not all cellular structures held together by noncovalent bonds self-assemble. A
cilium, or a myofibril of a muscle cell, for example, cannot form spontaneously from
a solution of its component macromolecules. In these cases, part of the assembly
information is provided by special enzymes and other proteins that perform the
function of templates, serving as assembly factors that guide construction but take
no part in the final assembled structure.

Even relatively simple structures may lack some of the ingredients necessary
for their own assembly. In the formation of certain bacterial viruses, for example,
the head, which is composed of many copies of a single protein subunit, is
assembled on a temporary scaffold composed of a second protein that is produced by
the virus. Because the second protein is absent from the final viral particle, the
head structure cannot spontaneously reassemble once it has been taken apart.
Other examples are known in which proteolytic cleavage is an essential and
irreversible step in the normal assembly process. This is even the case for some small
protein assemblies, including the structural protein collagen and the hormone 
insulin (Figure 3-30). From these relatively simple examples, it seems certain that 
the assembly of a structure as complex as a cilium will involve a temporal and 
spatial ordering that is imparted by numerous other components. 

Amyloid Fibrils Can Form from Many Proteins

A special class of protein structures, utilized for some normal cell functions, can
also contribute to human diseases when not controlled. These are
self-propagating, stable β-sheet aggregates called amyloid fibrils. These fibrils are built from a
series of identical polypeptide chains that become layered one over the other to
create a continuous stack of β sheets, with the β strands oriented perpendicular
to the fibril axis to form a cross-beta filament (Figure 3-31). Typically, hundreds of
monomers will aggregate to form an unbranched fibrous structure that is several
micrometers long and 5 to 15 nm in width. A surprisingly large fraction of
proteins have the potential to form such structures, because the short segment of the
polypeptide chain that forms the spine of the fibril can have a variety of different
sequences and follow one of several different paths (Figure 3-32). However, very
few proteins will actually form this structure inside cells. 

In normal humans, the quality control mechanisms governing proteins
gradually decline with age, occasionally permitting normal proteins to form
pathological aggregates. The protein aggregates may be released from dead cells and
accumulate as amyloid in the extracellular matrix. In extreme cases, the
accumulation of such amyloid fibrils in the cell interior can kill the cells and damage
tissues. Because the brain is composed of a highly organized collection of nerve cells
that cannot regenerate, the brain is especially vulnerable to this sort of
cumulative damage. Thus, although amyloid fibrils may form in different tissues, and are
known to cause pathologies in several sites in the body, the most severe amyloid
pathologies are neurodegenerative diseases. For example, the abnormal
formation of highly stable amyloid fibrils is thought to play a central causative role in
both Alzheimer's and Parkinson's diseases.

Prion diseases are a special type of these pathologies. They have attained
special notoriety because, unlike Parkinson's or Alzheimer's, prion diseases can
spread from one organism to another, providing that the second organism eats a

Figure 3-31 Detailed structure of the core of an amyloid fibril. Illustrated here
is the cross-beta spine of the amyloid fibril that is formed by a peptide of seven
amino acids from the protein Sup35, an extensively studied yeast prion. Consisting
of the sequence glycine-asparagine-asparagine-glutamine-glutamine-asparagine-tyrosine
(GNNQQNY), its structure was determined by x-ray crystallography.
Although the cross-beta spines of other amyloids are similar, being composed of
two long β sheets held together by a “steric zipper,” different detailed structures
are observed depending on the short peptide sequence involved. (A) One half
of the spine is illustrated. Here, a standard parallel β-sheet structure (see
p. 116) is held together by a set of hydrogen bonds between two side chains plus
hydrogen bonds between two backbone atoms, as illustrated (oxygen atoms red
and nitrogen atoms blue). Note that in this example, the adjacent peptides are
exactly in register. Although only five layers are shown (each layer depicted as an
arrow), the actual structure would extend for many tens of thousands of layers
in the plane of the paper. (B) The complete cross-beta spine. A second, identical
β-sheet is paired with the first one to form a two-sheet motif that runs the entire
length of the fibril. (C) View of the complete spine in (B) from the top. The closely
interdigitated side chains form a tight, water-free junction known as a steric zipper.
(Courtesy of David Eisenberg and Michael Sawaya, UCLA; based on R. Nelson et
al., Nature 435:773-778, 2005. With permission from Macmillan Publishers Ltd.)


tissue containing the protein aggregate. A set of closely related diseases—scrapie
in sheep, Creutzfeldt-Jakob disease (CJD) in humans, Kuru in humans, and
bovine spongiform encephalopathy (BSE) in cattle—are caused by a misfolded,
aggregated form of a particular protein called PrP (for prion protein). PrP is
normally located on the outer surface of the plasma membrane, most prominently in
neurons, and it has the unfortunate property of forming amyloid fibrils that are
“infectious” because they convert normally folded molecules of PrP to the same
pathological form (Figure 3-33). This property creates a positive feedback loop
that propagates the abnormal form of PrP, called PrP*, and allows the pathological
conformation to spread rapidly from cell to cell in the brain, eventually causing
death. It can be dangerous to eat the tissues of animals that contain PrP*, as
witnessed by the spread of BSE (commonly referred to as “mad cow disease”) from
cattle to humans. Fortunately, in the absence of PrP*, PrP is extraordinarily
difficult to convert to its abnormal form.
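
The positive feedback loop described above can be illustrated with a deliberately simple toy model. The model is an assumption made here for illustration and does not come from the chapter: the fraction x of PrP in the abnormal form grows at a rate proportional to x(1 - x), meaning conversion requires contact between an abnormal and a normal molecule. The rate constant, time step, and seed values are arbitrary, and spontaneous conversion is set to zero.

```python
def converted_fraction(seed_fraction: float, rate: float = 1.0,
                       dt: float = 0.01, steps: int = 3000) -> float:
    """Toy autocatalytic model: dx/dt = rate * x * (1 - x), integrated by Euler steps.

    x is the fraction of PrP in the abnormal (PrP*) form. All parameters are
    hypothetical and unitless; this is not a quantitative model of prion disease.
    """
    x = seed_fraction
    for _ in range(steps):
        x += rate * x * (1.0 - x) * dt
    return x


if __name__ == "__main__":
    print("no seed, long time:     ", converted_fraction(0.0))
    print("tiny seed, short time:  ", converted_fraction(1e-6, steps=500))
    print("tiny seed, long time:   ", converted_fraction(1e-6))
```

With no PrP* present the converted fraction stays at zero, whereas even a one-in-a-million seed eventually converts nearly the whole pool. That is the qualitative behavior the text attributes to the feedback loop.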

A closely related “protein-only inheritance” has been observed in yeast cells.
The ability to study infectious proteins in yeast has clarified another remarkable
feature of prions. These protein molecules can form several distinctively different
types of amyloid fibrils from the same polypeptide chain. Moreover, each type of
aggregate can be infectious, forcing normal protein molecules to adopt the same
type of abnormal structure. Thus, several different “strains” of infectious particles
can arise from the same polypeptide chain.

Figure 3-32 The structure of an amyloid
fibril. (A) Schematic diagram of the
structure of an amyloid fibril that is formed
by the aggregation of a protein. Only
the cross-beta spine of an amyloid fibril
resembles the structure shown in Figure
3-31. (B) A cut-away view of a structure
proposed for the amyloid fibril that can
be formed in a test tube by the enzyme
ribonuclease A, illustrating how the core
of the fibril—formed by a short segment— 
relates to the rest of the structure. 

(C) Electron micrograph of amyloid fibrils. 
(A, from L. Esposito, C. Pedone and 
L. Vitagliano, Proc. Natl Acad. Sci. USA 
103:11533-11538, 2006; B, from S. 
Sambashivan et al., Nature 437:266-269, 
2005; C, courtesy of David Eisenberg.) 

Figure 3-33 The special protein aggregates that cause prion diseases.

(A) Schematic illustration of the type of conformational change in the PrP 
protein (prion protein) that produces material for an amyloid fibril. (B) The self- 
infectious nature of the protein aggregation that is central to prion diseases. 
PrP is highly unusual because the misfolded version of the protein, called 
PrP*, induces the normal PrP protein it contacts to change its conformation, 
as shown. 

Amyloid Structures Can Perform Useful Functions in Cells 

Amyloid fibrils were initially studied because they cause disease. But the same type
of structure is now known to be exploited by cells for useful purposes. Eukaryotic
cells, for example, store many different peptide and protein hormones that they
will secrete in specialized “secretory granules,” which package a high concentration
of their cargo in dense cores with a regular structure (see Figure 13-65). We
now know that these structured cores consist of amyloid fibrils, which in this case
have a structure that causes them to dissolve to release soluble cargo after being
secreted by exocytosis to the cell exterior (Figure 3-34A). Many bacteria use the
amyloid structure in a very different way, secreting proteins that form long
amyloid fibrils projecting from the cell exterior that help to bind bacterial neighbors
into biofilms (Figure 3-34B). Because these biofilms help bacteria to survive in
adverse environments (including in humans treated with antibiotics), new drugs
that specifically disrupt the fibrous networks formed by bacterial amyloids have
promise for treating human infections.

Many Proteins Contain Low-complexity Domains that Can Form 
“Reversible Amyloids”

Until recently, those amyloids with useful functions were thought to be either 
confined to the interior of specialized vesicles or expressed on the exterior of cells, 
as in Figure 3-34. However, new experiments reveal that a large set of low
complexity domains can form amyloid fibers that have functional roles in both the
cell nucleus and the cell cytoplasm. These domains are normally unstructured
and consist of stretches of amino acid sequence that can span hundreds of amino
acids, while containing only a small subset of the 20 different amino acids. In
contrast to the disease-associated amyloid in Figure 3-33, these newly discovered
structures are held together by weaker noncovalent bonds and readily dissociate
in response to signals—hence their name reversible amyloids.

Many proteins with such domains also contain a different set of domains that 
bind to specific other protein or RNA molecules. Thus, their controlled aggregation

Figure 3-34 Two normal functions for
amyloid fibrils. (A) In eukaryotic cells, 
protein cargo can be packed very densely 
in secretory vesicles and stored until 
signals cause a release of this cargo by
exocytosis. For example, proteins and
peptide hormones of the endocrine system,
such as glucagon and calcitonin, are 
efficiently stored as short amyloid fibrils, 
which dissociate when they reach the cell 
exterior. (B) Bacteria produce amyloid fibrils 
on their surface by secreting the precursor 
proteins; these fibrils then create biofilms 
that link together, and help to protect, large 
numbers of individual bacteria. 

Figure 3-35 Measuring the association between “reversible amyloids.” (A) Experimental setup. The fiber-forming domains
from proteins that contain a low-complexity domain were produced in large quantities by cloning the DNA sequence that encodes 
them into an E. coli plasmid so as to allow overproduction of that domain (see p. 483). After these protein domains were purified 
by affinity chromatography, a tiny droplet of concentrated solution of one of the domains (here the FUS low-complexity domain) 
was deposited onto a microscope dish and allowed to gel. The gel was then soaked in a dilute solution of a fluorescently 
labeled low-complexity domain from the same or a different protein, making the gel fluorescent. After replacing the dilute protein 
solution with buffer, the relative strength of binding of the various domains to each other could then be measured by the decay of 
fluorescence, as indicated. (B) Results. The low-complexity domain from the FUS protein binds more tightly to itself than it does 
to the low-complexity domains from the proteins hnRNPA1 or hnRNPA2. A separate experiment reveals that these three different
RNA-binding proteins associate by forming mixed amyloid fibrils. (Adapted from M. Kato et al., Cell 149:753-767, 2012.)


in the cell can form a hydrogel that pulls these and other molecules into punctate
structures called intracellular bodies, or granules. Specific mRNAs can be
sequestered in such granules, where they are stored until made available by a controlled
disassembly of the core amyloid structure that holds them together.

Consider the FUS protein, an essential nuclear protein with roles in the
transcription, processing, and transport of specific mRNA molecules. Over 80
percent of its C-terminal domain of two hundred amino acids is composed of only
four amino acids: glycine, serine, glutamine, and tyrosine. This low complexity
domain is attached to several other domains that bind to RNA molecules. At high
enough concentrations in a test tube, it forms a hydrogel that will associate with
either itself or with the low complexity domains from other proteins. As illustrated
by the experiment in Figure 3-35, although different low complexity domains
bind to each other, homotypic interactions appear to be of greatest affinity (thus,
the FUS low complexity domain binds most tightly to itself). Further experiments
reveal that both the homotypic and the heterotypic bindings are mediated
through a β-sheet core structure forming amyloid fibrils, and that these structures
bind to other types of repeat sequences in the manner indicated in Figure 3-36.
Many of these interactions appear to be controlled by the phosphorylation of
serine side chains in one or both of the interacting partners. However, a great
deal remains to be learned concerning these newly discovered structures and the
varied roles that they play in the cell biology of eukaryotic cells.
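
The description of a low-complexity domain (most of the sequence drawn from a small subset of the 20 amino acids) lends itself to a simple composition check. The sketch below is illustrative only: the function names are invented here, and the example sequence is a made-up string in one-letter code, not the real FUS sequence.

```python
from collections import Counter


def fraction_from_subset(sequence: str, subset: str) -> float:
    """Fraction of residues in `sequence` that belong to the amino acids in `subset`."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    counts = Counter(seq)
    return sum(counts[aa] for aa in set(subset.upper())) / len(seq)


def most_common_residues(sequence: str, top_n: int = 4):
    """The `top_n` most frequent residues with their counts."""
    return Counter(sequence.upper()).most_common(top_n)


if __name__ == "__main__":
    # Hypothetical toy sequence, rich in glycine (G), serine (S),
    # glutamine (Q), and tyrosine (Y); NOT taken from any real protein.
    toy_domain = "SYGQSGYSQQGSYGSDQTGYSQ"
    share = fraction_from_subset(toy_domain, "GSQY")
    print(f"G/S/Q/Y share: {share:.0%}")
    print("most common residues:", most_common_residues(toy_domain))
```

Applied to a real sequence, a share above roughly 80 percent for four residues would match the text's description of the FUS domain. The threshold itself is only the figure quoted in the paragraph, not a formal definition of low complexity.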

Figure 3-36 One type of complex that
is formed by reversible amyloids. The
structure shown is based on the observed
interaction of RNA polymerase with a
low-complexity domain of a protein that
regulates DNA transcription. (Adapted from
I. Kwon et al., Cell 155:1049-1060, 2013.)

Summary

A protein molecule's amino acid sequence determines its three-dimensional
conformation. Noncovalent interactions between different parts of the polypeptide
chain stabilize its folded structure. The amino acids with hydrophobic side chains
tend to cluster in the interior of the molecule, and local hydrogen-bond interactions
between neighboring peptide bonds give rise to α helices and β sheets.

Regions of amino acid sequence known as domains are the modular units from
which many proteins are constructed. Such domains generally contain 40-350
amino acids, often folded into a globular shape. Small proteins typically consist
of only a single domain, while large proteins are formed from multiple domains
linked together by various lengths of polypeptide chain, some of which can be
relatively disordered. As proteins have evolved, domains have been modified and
combined with other domains to construct large numbers of new proteins.

Proteins are brought together into larger structures by the same noncovalent
forces that determine protein folding. Proteins with binding sites for their own
surface can assemble into dimers, closed rings, spherical shells, or helical polymers. The
amyloid fibril is a long unbranched structure assembled through a repeating
aggregate of β sheets. Although some mixtures of proteins and nucleic acids can assemble
spontaneously into complex structures in a test tube, not all structures in the cell are
capable of spontaneous reassembly after they have been dissociated into their
component parts, because many biological assembly processes involve assembly factors
that are not present in the final structure.

PROTEIN FUNCTION 

We have seen that each type of protein consists of a precise sequence of amino
acids that allows it to fold up into a particular three-dimensional shape, or
conformation. But proteins are not rigid lumps of material. They often have precisely
engineered moving parts whose mechanical actions are coupled to chemical
events. It is this coupling of chemistry and movement that gives proteins the
extraordinary capabilities that underlie the dynamic processes in living cells.

In this section, we explain how proteins bind to other selected molecules and
how a protein's activity depends on such binding. We show that the ability to bind
to other molecules enables proteins to act as catalysts, signal receptors, switches,
motors, or tiny pumps. The examples we discuss in this chapter by no means
exhaust the vast functional repertoire of proteins. You will encounter the
specialized functions of many other proteins elsewhere in this book, based on similar
principles.

All Proteins Bind to Other Molecules

A protein molecule's physical interaction with other molecules determines its
biological properties. Thus, antibodies attach to viruses or bacteria to mark them
for destruction, the enzyme hexokinase binds glucose and ATP so as to catalyze a
reaction between them, actin molecules bind to each other to assemble into actin
filaments, and so on. Indeed, all proteins stick, or bind, to other molecules. In
some cases, this binding is very tight; in others it is weak and short-lived. But the
binding always shows great specificity, in the sense that each protein molecule can
usually bind just one or a few molecules out of the many thousands of different
types it encounters. The substance that is bound by the protein—whether it is an
ion, a small molecule, or a macromolecule such as another protein—is referred to
as a ligand for that protein (from the Latin word ligare, meaning "to bind").

The ability of a protein to bind selectively and with high affinity to a ligand
depends on the formation of a set of weak noncovalent bonds—hydrogen bonds,
electrostatic attractions, and van der Waals attractions—plus favorable
hydrophobic interactions (see Panel 2-3, pp. 94-95). Because each individual bond is weak,
effective binding occurs only when many of these bonds form simultaneously.
Such binding is possible only if the surface contours of the ligand molecule fit very
closely to the protein, matching it like a hand in a glove (Figure 3-37).
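
Why many weak bonds are needed can be made concrete with the standard thermodynamic relation between binding free energy and equilibrium constant, K = exp(-ΔG°/RT). The sketch below rests on assumptions that are not in the text: each weak bond is taken to contribute a net 1 kcal/mole, contributions are treated as simply additive, and the temperature is 37°C. The function name and default value are illustrative.

```python
import math

R_KCAL_PER_MOL_K = 1.987e-3   # gas constant in kcal/(mole*K)
TEMPERATURE_K = 310.15        # 37 degrees C


def equilibrium_constant(n_bonds: int, energy_per_bond_kcal: float = 1.0) -> float:
    """Relative equilibrium constant for binding through n additive weak bonds.

    Assumes each bond lowers the free energy by `energy_per_bond_kcal`
    (a hypothetical, illustrative value) and applies K = exp(-dG / RT).
    """
    delta_g = -n_bonds * energy_per_bond_kcal
    return math.exp(-delta_g / (R_KCAL_PER_MOL_K * TEMPERATURE_K))


if __name__ == "__main__":
    for n in (1, 5, 10):
        print(f"{n:2d} bonds -> K is about {equilibrium_constant(n):.3g}")
```

Because the free energies add while K depends on them exponentially, ten such bonds favor the bound state by the tenth power of what one bond achieves. This is why a close surface fit that allows many simultaneous contacts is decisive.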

Figure 3-37 The selective binding of a
protein to another molecule. Many weak
bonds are needed to enable a protein
to bind tightly to a second molecule, or
ligand. A ligand must therefore fit precisely 
into a protein’s binding site, like a hand 
into a glove, so that a large number of 
noncovalent bonds form between the 
protein and the ligand. (A) Schematic; 

(B) space-filling model. (PDB code: 1G6N.) 


The region of a protein that associates with a ligand, known as the ligand's
binding site, usually consists of a cavity in the protein surface formed by a particular
arrangement of amino acids. These amino acids can belong to different portions
of the polypeptide chain that are brought together when the protein folds (Figure
3-38). Separate regions of the protein surface generally provide binding sites for
different ligands, allowing the protein's activity to be regulated, as we shall see
later. And other parts of the protein act as a handle to position the protein in the
cell—an example is the SH2 domain discussed previously, which often moves a
protein containing it to particular intracellular sites in response to signals.

Although the atoms buried in the interior of the protein have no direct contact
with the ligand, they form the framework that gives the surface its contours and
its chemical and mechanical properties. Even small changes to the amino acids in
the interior of a protein molecule can change its three-dimensional shape enough
to destroy a binding site on the surface.

The Surface Conformation of a Protein Determines Its Chemistry

The impressive chemical capabilities of proteins often require that the chemical
groups on their surface interact in ways that enhance the chemical reactivity of
one or more amino acid side chains. These interactions fall into two main
categories.

First, the interaction of neighboring parts of the polypeptide chain may restrict 
the access of water molecules to that protein's ligand-binding sites. Because water
molecules readily form hydrogen bonds that can compete with ligands for sites


Figure 3-38 The binding site of a 
protein. (A) The folding of the polypeptide 
chain typically creates a crevice or cavity on
the protein surface. This crevice contains a
set of amino acid side chains disposed in 
such a way that they can form noncovalent 
bonds only with certain ligands. (B) A 
close-up of an actual binding site, showing 
the hydrogen bonds and electrostatic 
interactions formed between a protein and 
its ligand. In this example, a molecule of
cyclic AMP is the bound ligand. 

on the protein surface, a ligand will form tighter hydrogen bonds (and
electrostatic interactions) with a protein if water molecules are kept away. It might be
hard to imagine a mechanism that would exclude a molecule as small as water
from a protein surface without affecting the access of the ligand itself. However,
because of the strong tendency of water molecules to form water-water hydrogen
bonds, water molecules exist in a large hydrogen-bonded network (see Panel 2-2,
pp. 92-93). In effect, a protein can keep a ligand-binding site dry, increasing that
site's reactivity, because it is energetically unfavorable for individual water
molecules to break away from this network—as they must do to reach into a crevice on
a protein's surface.

Second, the clustering of neighboring polar amino acid side chains can alter
their reactivity. If protein folding forces together a number of negatively charged
side chains against their mutual repulsion, for example, the affinity of the site for
a positively charged ion is greatly increased. In addition, when amino acid side
chains interact with one another through hydrogen bonds, normally unreactive
groups (such as the -CH2OH on the serine shown in Figure 3-39) can become
reactive, enabling them to be used to make or break selected covalent bonds.

The surface of each protein molecule therefore has a unique chemical
reactivity that depends not only on which amino acid side chains are exposed, but
also on their exact orientation relative to one another. For this reason, two slightly
different conformations of the same protein molecule can differ greatly in their
chemistry.


Figure 3-39 An unusually reactive amino
acid at the active site of an enzyme.
This example is the “catalytic triad” Asp-His-Ser
found in chymotrypsin, elastase,
and other serine proteases (see Figure
3-12). The aspartic acid side chain (Asp)
induces the histidine (His) to remove the
proton from a particular serine (Ser). This
activates the serine and enables it to form
a covalent bond with an enzyme substrate,
hydrolyzing a peptide bond. The many
convolutions of the polypeptide chain are
omitted here.


Sequence Comparisons Between Protein Family Members 
Highlight Crucial Ligand-Binding Sites 

As we have described previously, genome sequences allow us to group many of
the domains in proteins into families that show clear evidence of their evolution
from a common ancestor. The three-dimensional structures of members of the
same domain family are remarkably similar. For example, even when the amino
acid sequence identity falls to 25%, the backbone atoms in a domain can follow a
common protein fold within 0.2 nanometers (2 Å).

We can use a method called evolutionary tracing to identify those sites in a
protein domain that are the most crucial to the domain's function. Those sites
that bind to other molecules are the most likely to be maintained unchanged as
organisms evolve. Thus, in this method, those amino acids that are unchanged, or
nearly unchanged, in all of the known protein family members are mapped onto
a model of the three-dimensional structure of one family member. When this is
done, the most invariant positions often form one or more clusters on the protein
surface, as illustrated in Figure 3-40A for the SH2 domain described previously
(see Figure 3-6). These clusters generally correspond to ligand-binding sites.
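
The first step of this idea, finding the positions that are unchanged or nearly unchanged across a family, can be sketched in a few lines. This is only a simplified conservation scan, not the full evolutionary trace procedure, and it stops short of mapping positions onto a three-dimensional structure. The function name `conserved_positions`, the `min_fraction` threshold, and the toy sequences are all hypothetical and are not real SH2 data.

```python
from collections import Counter


def conserved_positions(aligned_seqs, min_fraction=1.0):
    """Return 0-based alignment columns whose most common residue occurs
    in at least `min_fraction` of the sequences (gap columns excluded)."""
    if not aligned_seqs:
        return []
    length = len(aligned_seqs[0])
    if any(len(seq) != length for seq in aligned_seqs):
        raise ValueError("sequences must be aligned to the same length")
    conserved = []
    for col in range(length):
        column = [seq[col] for seq in aligned_seqs]
        residue, count = Counter(column).most_common(1)[0]
        # A column dominated by gaps ('-') is never reported as conserved.
        if residue != "-" and count / len(aligned_seqs) >= min_fraction:
            conserved.append(col)
    return conserved


# Hypothetical toy alignment of four family members (illustrative only).
toy_family = [
    "GWRHAKLSE",
    "GWKHGRLTE",
    "GWRHSKMSE",
    "GWQHAKLDE",
]

invariant = conserved_positions(toy_family)                   # strictly unchanged
nearly = conserved_positions(toy_family, min_fraction=0.75)   # nearly unchanged
```

In a real analysis the reported columns would then be located on a known structure of one family member to see whether they cluster on the surface.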

The SH2 domain functions to link two proteins together. It binds the protein
containing it to a second protein that contains a phosphorylated tyrosine side
chain in a specific amino acid sequence context, as shown in Figure 3-40B. The
amino acids located at the binding site for the phosphorylated polypeptide have
been the slowest to change during the long evolutionary process that produced
the large SH2 family of peptide recognition domains. Mutation is a random
process; survival is not. Thus, natural selection (random mutation followed by
nonrandom survival) produces the sequence conservation by preferentially
eliminating organisms whose SH2 domains become altered in a way that inactivates the
SH2 binding site, destroying SH2 function.

Genome sequencing has revealed huge numbers of proteins whose functions
are unknown. Once a three-dimensional structure has been determined for one
member of a protein family, evolutionary tracing allows biologists to determine
binding sites for the members of that family, providing a useful start in
deciphering protein function.

Proteins Bind to Other Proteins Through Several Types of 
Interfaces 

Proteins can bind to other proteins in multiple ways. In many cases, a portion
of the surface of one protein contacts an extended loop of polypeptide chain (a
“string”) on a second protein (Figure 3-41A). Such a surface-string interaction,
for example, allows the SH2 domain to recognize a phosphorylated polypeptide
loop on a second protein, as just described, and it also enables a protein kinase to
recognize the proteins that it will phosphorylate (see below).

A second type of protein-protein interface forms when two α helices, one from
each protein, pair together to form a coiled-coil (Figure 3-41B). This type of
protein interface is found in several families of gene regulatory proteins, as discussed
in Chapter 7.

The most common way for proteins to interact, however, is by the precise
matching of one rigid surface with that of another (Figure 3-41C). Such
interactions can be very tight, since a large number of weak bonds can form between two
surfaces that match well. For the same reason, such surface-surface interactions
can be extremely specific, enabling a protein to select just one partner from the
many thousands of different proteins found in a cell.


Figure 3-40 The evolutionary trace
method applied to the SH2 domain.
(A) Front and back views of a space-filling
model of the SH2 domain, with
evolutionarily conserved amino acids on the
protein surface colored yellow, and those
more toward the protein interior colored
red. (B) The structure of one specific SH2
domain with its bound polypeptide. Here,
those amino acids located within 0.4 nm
of the bound ligand are colored blue. The
two key amino acids of the ligand are
yellow, and the others are purple. Note the
high degree of correspondence between
(A) and (B). (Adapted from O. Lichtarge,
H.R. Bourne and F.E. Cohen, J. Mol. Biol.
257:342-358, 1996. With permission from
Elsevier; PDB codes: 1SPR, 1SPS.)



Figure 3-41 Three ways in which two
proteins can bind to each other. Only
the interacting parts of the two proteins
are shown. (A) A rigid surface on one
protein can bind to an extended loop of
polypeptide chain (a “string”) on a
second protein. (B) Two α helices can
bind together to form a coiled-coil.
(C) Two complementary rigid surfaces
often link two proteins together. Binding
interactions can also involve the pairing of
β strands (see, for example, Figure 3-18).


Figure 3-42 An antibody molecule.
A typical antibody molecule is Y-shaped
and has two identical binding sites for
its antigen, one on each arm of the Y. As
explained in Chapter 24, the protein is
composed of four polypeptide chains (two
identical heavy chains and two identical
and smaller light chains) held together
by disulfide bonds. Each chain is made
up of several different immunoglobulin
domains, here shaded either blue or gray.
The antigen-binding site is formed where
a heavy-chain variable domain (VH) and
a light-chain variable domain (VL) come
close together. These are the domains that
differ most in their sequence and structure
in different antibodies. At the end of each
of the two arms of the antibody molecule,
these two domains form loops that bind to
the antigen (see Movie 24.5).

[Figure 3-42 labels: heavy chain; VL; hypervariable loops]


[Figure 3-42 labels: disulfide bond; variable domain of light chain (VL)]


Antibody Binding Sites Are Especially Versatile 

All proteins must bind to particular ligands to carry out their various functions.
The antibody family is notable for its capacity for tight, highly selective binding
(discussed in detail in Chapter 24).

Antibodies, or immunoglobulins, are proteins produced by the immune
system in response to foreign molecules, such as those on the surface of an
invading microorganism. Each antibody binds tightly to a particular target molecule,
thereby either inactivating the target molecule directly or marking it for
destruction. An antibody recognizes its target (called an antigen) with remarkable
specificity. Because there are potentially billions of different antigens that humans
might encounter, we have to be able to produce billions of different antibodies.

Antibodies are Y-shaped molecules with two identical binding sites that are
complementary to a small portion of the surface of the antigen molecule. A
detailed examination of the antigen-binding sites of antibodies reveals that they
are formed from several loops of polypeptide chain that protrude from the ends
of a pair of closely juxtaposed protein domains (Figure 3-42). Different
antibodies generate an enormous diversity of antigen-binding sites by changing only the
length and amino acid sequence of these loops, without altering the basic protein
structure.

[Figure 3-42 labels: COOH; constant domain of light chain (CL)]


Loops of this kind are ideal for grasping other molecules. They allow a large
number of chemical groups to surround a ligand so that the protein can link to it
with many weak bonds. For this reason, loops often form the ligand-binding sites
in proteins.


The Equilibrium Constant Measures Binding Strength 

Molecules in the cell encounter each other very frequently because of their
continual random thermal movements. Colliding molecules with poorly matching
surfaces form few noncovalent bonds with one another, and the two molecules
dissociate as rapidly as they come together. At the other extreme, when many
noncovalent bonds form between two colliding molecules, the association can
persist for a very long time (Figure 3-43). Strong interactions occur in cells
whenever a biological function requires that molecules remain associated for a long
time, for example, when a group of RNA and protein molecules come together to
make a subcellular structure such as a ribosome.

[Figure 3-43 panel text: The surfaces of molecules A and B, and A and C, are a
poor match and are capable of forming only a few weak bonds; thermal motion
rapidly breaks them apart. The surfaces of molecules A and D match well and
therefore can form enough weak bonds to withstand thermal jolting; they
therefore stay bound to each other.]


We can measure the strength with which any two molecules bind to each
other. As an example, consider a population of identical antibody molecules that
suddenly encounters a population of ligands diffusing in the fluid surrounding
them. At frequent intervals, one of the ligand molecules will bump into the
binding site of an antibody and form an antibody-ligand complex. The population of
antibody-ligand complexes will therefore increase, but not without limit: over
time, a second process, in which individual complexes break apart because of
thermally induced motion, will become increasingly important. Eventually, any
population of antibody molecules and ligands will reach a steady state, or
equilibrium, in which the number of binding (association) events per second is precisely
equal to the number of “unbinding” (dissociation) events (see Figure 2-30).

From the concentrations of the ligand, antibody, and antibody-ligand complex
at equilibrium, we can calculate a convenient measure of the strength of
binding, the equilibrium constant (K) (Figure 3-44A). This constant was described
in detail in Chapter 2, where its connection to free-energy differences was derived
(see p. 62). The equilibrium constant for a reaction in which two molecules (A and
B) bind to each other to form a complex (AB) has units of liters/mole, and half
of the binding sites will be occupied by ligand when that ligand's concentration
(in moles/liter) reaches a value that is equal to 1/K. This equilibrium constant is
larger the greater the binding strength, and it is a direct measure of the
free-energy difference between the bound and free states (Figure 3-44B). Even a change
of a few noncovalent bonds can have a striking effect on a binding interaction,
as shown by the example in Figure 3-45. (Note that the equilibrium constant, as
defined here, is also known as the association or affinity constant, Ka.)

Figure 3-43 How noncovalent bonds
mediate interactions between
macromolecules (see Movie 2.1).


Figure 3-44 Relating standard
free-energy difference (ΔG°) to the
equilibrium constant (K). (A) The
equilibrium between molecules A and
B and the complex AB is maintained by
a balance between the two opposing
reactions shown in panels 1 and 2.
Molecules A and B must collide if they
are to react, and the association rate is
therefore proportional to the product of their
individual concentrations [A] × [B]. (Square
brackets indicate concentration.) As shown
in panel 3, the ratio of the rate constants
for the association and the dissociation
reactions is equal to the equilibrium
constant (K) for the reaction (see also p.
63). (B) The equilibrium constant in panel
3 is that for the reaction A + B ⇌ AB, and
the larger its value, the stronger the binding
between A and B. Note that for every 5.91
kJ/mole decrease in standard free energy,
the equilibrium constant increases by a
factor of 10 at 37°C.

The equilibrium constant here has units of
liters/mole; for simple binding interactions
it is also called the affinity constant or
association constant, denoted Ka. The
reciprocal of Ka is called the dissociation
constant, Kd (in units of moles/liter).
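
As a rough numerical illustration of two relationships stated here (half-occupancy when the ligand concentration equals 1/K, and the link between K and the standard free-energy difference), the sketch below assumes simple one-to-one binding and treats the ligand concentration as the free concentration at equilibrium. The function names and the example value of K are illustrative assumptions, not taken from the text; the free-energy relation used is the standard ΔG° = -RT ln K.

```python
import math

R = 8.314  # gas constant in J/(mol*K)


def fraction_occupied(K, ligand_conc):
    """Fraction of binding sites occupied at equilibrium for 1:1 binding.

    K is the association constant in liters/mole; ligand_conc is the free
    ligand concentration in moles/liter (an assumption of this sketch).
    """
    return K * ligand_conc / (1.0 + K * ligand_conc)


def standard_free_energy(K, temp_kelvin=310.15):
    """Standard free-energy difference in J/mol from dG0 = -R*T*ln(K)."""
    return -R * temp_kelvin * math.log(K)


K_example = 1e8  # hypothetical association constant, liters/mole

# Occupancy when the ligand concentration equals 1/K (approximately one half).
half = fraction_occupied(K_example, 1.0 / K_example)

# Free-energy change that accompanies a tenfold increase in K at 37 degrees C.
per_decade_kj = (standard_free_energy(1e8) - standard_free_energy(1e9)) / 1000.0
```

Under these assumptions a tenfold change in K corresponds to roughly 5.9 kJ/mole at 37°C, in line with the value quoted for Figure 3-44.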

We have used the case of an antibody binding to its ligand to illustrate the
effect of binding strength on the equilibrium state, but the same principles apply
to any molecule and its ligand. Many proteins are enzymes, which, as we now
discuss, first bind to their ligands and then catalyze the breakage or formation of
covalent bonds in these molecules.

Enzymes Are Powerful and Highly Specific Catalysts

Many proteins can perform their function simply by binding to another molecule.
An actin molecule, for example, need only associate with other actin molecules
to form a filament. There are other proteins, however, for which ligand binding is
only a necessary first step in their function. This is the case for the large and very
important class of proteins called enzymes. As described in Chapter 2, enzymes
are remarkable molecules that cause the chemical transformations that make and
break covalent bonds in cells. They bind to one or more ligands, called substrates,
and convert them into one or more chemically modified products, doing this over
and over again with amazing rapidity. Enzymes speed up reactions, often by a
factor of a million or more, without themselves being changed; that is, they act
as catalysts that permit cells to make or break covalent bonds in a controlled way.
It is the catalysis of organized sets of chemical reactions by enzymes that creates
and maintains the cell, making life possible.

We can group enzymes into functional classes that perform similar chemical
reactions (Table 3-1). Each type of enzyme within such a class is highly specific,


Consider 1000 molecules of A and 1000 molecules of B in a eukaryotic
cell. The concentration of both will be about 10⁻⁹ M.

If the equilibrium constant (K) for A + B ⇌ AB is 10¹⁰, then one can
calculate that at equilibrium there will be 270 molecules of A, 270 molecules
of B, and 730 molecules of AB.

If the equilibrium constant is a little weaker at 10⁸, which represents
a loss of 11.9 kilojoule/mole of binding energy from the example
above, or 2-3 fewer hydrogen bonds, then there will be 915 molecules of A,
915 molecules of B, and 85 molecules of AB.


Figure 3-45 Small changes in the
number of weak bonds can have drastic
effects on a binding interaction. This
example illustrates the dramatic effect of
the presence or absence of a few weak
noncovalent bonds in a biological context.
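
The molecule counts in Figure 3-45 can be checked with a short calculation. The sketch below is one way to do it, assuming equal total concentrations of A and B (10⁻⁹ M each, as in the figure) and solving the equilibrium condition K = [AB]/([A][B]) as a quadratic in the bound fraction; the function names are hypothetical.

```python
import math


def bound_fraction(K, total_conc):
    """Fraction of A (and of B) present as AB at equilibrium.

    Assumes A and B have the same total concentration (mol/L) and
    K = [AB] / ([A][B]) in L/mol. With x the bound fraction,
    K * total * (1 - x)**2 = x, i.e. a*x**2 - (2a + 1)*x + a = 0
    with a = K * total; the root between 0 and 1 is returned.
    """
    a = K * total_conc
    b = 2.0 * a + 1.0
    return (b - math.sqrt(b * b - 4.0 * a * a)) / (2.0 * a)


def molecule_counts(K, total_conc, n_molecules):
    """Return (free A or B, AB complexes), rounded to whole molecules."""
    bound = round(n_molecules * bound_fraction(K, total_conc))
    return n_molecules - bound, bound


counts_tight = molecule_counts(1e10, 1e-9, 1000)  # K = 10^10
counts_weak = molecule_counts(1e8, 1e-9, 1000)    # K = 10^8
```

With these inputs the tighter case gives 270 free and 730 bound molecules, and the weaker case gives about 916 free and 84 bound, within a molecule of the rounded values in the figure.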


Table 3-1 Some Common Types of Enzymes

Enzyme: Reaction catalyzed

Hydrolases: General term for enzymes that catalyze a hydrolytic cleavage
reaction; nucleases and proteases are more specific names for subclasses of
these enzymes

Nucleases: Break down nucleic acids by hydrolyzing bonds between nucleotides.
Endo- and exonucleases cleave nucleic acids within and from the ends of the
polynucleotide chains, respectively

Proteases: Break down proteins by hydrolyzing bonds between amino acids

Synthases: Synthesize molecules in anabolic reactions by condensing two
smaller molecules together

Ligases: Join together (ligate) two molecules in an energy-dependent process.
DNA ligase, for example, joins two DNA molecules together end-to-end through
phosphodiester bonds

Isomerases: Catalyze the rearrangement of bonds within a single molecule

Polymerases: Catalyze polymerization reactions such as the synthesis of DNA
and RNA

Kinases: Catalyze the addition of phosphate groups to molecules. Protein
kinases are an important group of kinases that attach phosphate groups to
proteins

Phosphatases: Catalyze the hydrolytic removal of a phosphate group from a
molecule

Oxido-Reductases: General name for enzymes that catalyze reactions in which
one molecule is oxidized while the other is reduced. Enzymes of this type are
often more specifically named oxidases, reductases, or dehydrogenases

ATPases: Hydrolyze ATP. Many proteins with a wide range of roles have an
energy-harnessing ATPase activity as part of their function; for example,
motor proteins such as myosin and membrane transport proteins such as the
sodium-potassium pump

GTPases: Hydrolyze GTP. A large family of GTP-binding proteins are GTPases
with central roles in the regulation of cell processes

Enzyme names typically end in “-ase,” with the exception of some enzymes, such as pepsin, trypsin, thrombin, and lysozyme, that were
discovered and named before the convention became generally accepted at the end of the nineteenth century. The common name of an enzyme
usually indicates the substrate or product and the nature of the reaction catalyzed. For example, citrate synthase catalyzes the synthesis of citrate
by a reaction between acetyl CoA and oxaloacetate.


Figure 3-46 Enzyme kinetics. The rate
of an enzyme reaction (V) increases as
the substrate concentration increases
until a maximum value (Vmax) is reached.
At this point all substrate-binding sites on
the enzyme molecules are fully occupied,
and the rate of reaction is limited by
the rate of the catalytic process on the
enzyme surface. For most enzymes,
the concentration of substrate at which
the reaction rate is half-maximal (Km) is
a measure of how tightly the substrate
is bound, with a large value of Km
corresponding to weak binding.


catalyzing only a single type of reaction. Thus, hexokinase adds a phosphate group
to D-glucose but ignores its optical isomer L-glucose; the blood-clotting enzyme
thrombin cuts one type of blood protein between a particular arginine and its
adjacent glycine and nowhere else, and so on. As discussed in detail in Chapter 2,
enzymes work in teams, with the product of one enzyme becoming the substrate
for the next. The result is an elaborate network of metabolic pathways that
provides the cell with energy and generates the many large and small molecules that
the cell needs (see Figure 2-63).

Substrate Binding Is the First Step in Enzyme Catalysis 

For a protein that catalyzes a chemical reaction (an enzyme), the binding of each
substrate molecule to the protein is an essential prelude. In the simplest case, if
we denote the enzyme by E, the substrate by S, and the product by P, the basic
reaction path is E + S → ES → EP → E + P. There is a limit to the amount of
substrate that a single enzyme molecule can process in a given time. Although an
increase in the concentration of substrate increases the rate at which product is
formed, this rate eventually reaches a maximum value (Figure 3-46). At that point
the enzyme molecule is saturated with substrate, and the rate of reaction (Vmax)
depends only on how rapidly the enzyme can process the substrate molecule. This
maximum rate divided by the enzyme concentration is called the turnover
number. Turnover numbers are often about 1000 substrate molecules processed per
second per enzyme molecule, although turnover numbers between 1 and 10,000
are known.

The other kinetic parameter frequently used to characterize an enzyme is its
Km, the concentration of substrate that allows the reaction to proceed at one-half
its maximum rate (0.5 Vmax) (see Figure 3-46). A low Km value means that the
enzyme reaches its maximum catalytic rate at a low concentration of substrate and
generally indicates that the enzyme binds to its substrate very tightly, whereas a
high Km value corresponds to weak binding. The methods used to characterize
enzymes in this way are explained in Panel 3-2 (pp. 142-143).
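
As a rough illustration of how Vmax and Km might be estimated from rate measurements, the sketch below generates idealized, noise-free rates from the Michaelis-Menten relation V = Vmax[S]/(Km + [S]) derived in Panel 3-2, then recovers both parameters from a straight-line fit of 1/V against 1/[S] (the double-reciprocal rearrangement described in that Panel). The parameter values, units, and function names are hypothetical, and real data with experimental noise would generally call for a more careful fitting method.

```python
def mm_rate(substrate, vmax, km):
    """Michaelis-Menten rate for a given substrate concentration."""
    return vmax * substrate / (km + substrate)


def double_reciprocal_fit(s_values, v_values):
    """Least-squares line through the points (1/[S], 1/V).

    The slope of that line is Km/Vmax and the intercept is 1/Vmax;
    returns the pair (vmax, km).
    """
    xs = [1.0 / s for s in s_values]
    ys = [1.0 / v for v in v_values]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / intercept
    km = slope * vmax
    return vmax, km


# Hypothetical enzyme: Vmax = 100 (arbitrary rate units), Km = 2 (arbitrary
# concentration units).
true_vmax, true_km = 100.0, 2.0
substrate_concs = [0.5, 1.0, 2.0, 5.0, 10.0, 50.0]
rates = [mm_rate(s, true_vmax, true_km) for s in substrate_concs]

half_max_rate = mm_rate(true_km, true_vmax, true_km)  # rate when [S] = Km
fit_vmax, fit_km = double_reciprocal_fit(substrate_concs, rates)
```

Because the rate at [S] = Km is exactly half of Vmax, a low Km means the enzyme approaches saturation at a low substrate concentration.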

Enzymes Speed Reactions by Selectively Stabilizing Transition 
States 

Enzymes achieve extremely high rates of chemical reaction, rates that are far
higher than for any synthetic catalysts. There are several reasons for this
efficiency. First, when two molecules need to react, the enzyme greatly increases the
local concentration of both of these substrate molecules at the catalytic site,
holding them in the correct orientation for the reaction that is to follow. More
importantly, however, some of the binding energy contributes directly to the catalysis.
Substrate molecules must pass through a series of intermediate states of altered
geometry and electron distribution before they form the ultimate products of the
reaction. The free energy required to attain the most unstable intermediate state,
called the transition state, is known as the activation energy for the reaction, and
it is the major determinant of the reaction rate. Enzymes have a much higher
affinity for the transition state of the substrate than they have for the stable form.

WHY ANALYZE THE KINETICS OF ENZYMES?


Enzymes are the most selective and powerful catalysts known.
An understanding of their detailed mechanisms provides a
critical tool for the discovery of new drugs, for the large-scale
industrial synthesis of useful chemicals, and for appreciating
the chemistry of cells and organisms. A detailed study of the
rates of the chemical reactions that are catalyzed by a purified
enzyme (more specifically, how these rates change with
changes in conditions such as the concentrations of substrates,
products, inhibitors, and regulatory ligands) allows
biochemists to figure out exactly how each enzyme works.
For example, this is the way that the ATP-producing reactions
of glycolysis, shown previously in Figure 2-48, were
deciphered, allowing us to appreciate the rationale for this
critical enzymatic pathway.

In this Panel, we introduce the important field of enzyme
kinetics, which has been indispensable for deriving much of
the detailed knowledge that we now have about cell
chemistry.


STEADY-STATE ENZYME KINETICS 


Many enzymes have only one substrate, which they bind and
then process to produce products according to the scheme
outlined in Figure 3-50A. In this case, the reaction is written as

$$\mathrm{E} + \mathrm{S} \;\underset{k_{-1}}{\overset{k_{1}}{\rightleftharpoons}}\; \mathrm{ES} \;\xrightarrow{k_{\mathrm{cat}}}\; \mathrm{E} + \mathrm{P}$$

Here we have assumed that the reverse reaction, in which E + P
recombine to form EP and then ES, occurs so rarely that we can
ignore it. In this case, EP need not be represented, and we can
express the rate of the reaction, known as its velocity, V, as

$$V = k_{\mathrm{cat}}[\mathrm{ES}]$$

where [ES] is the concentration of the enzyme-substrate complex,
and kcat is the turnover number, a rate constant that has a value
equal to the number of substrate molecules processed per
enzyme molecule each second.

But how does the value of [ES] relate to the concentrations that
we know directly, which are the total concentration of the
enzyme, [E₀], and the concentration of the substrate, [S]? When
enzyme and substrate are first mixed, the concentration [ES] will
rise rapidly from zero to a so-called steady-state level, as
illustrated below.

[Graph: [ES] versus time, showing a pre-steady state (ES forming) followed by
a steady state (ES almost constant).]

At this steady state, [ES] is nearly constant, so that

rate of ES breakdown = rate of ES formation

$$k_{-1}[\mathrm{ES}] + k_{\mathrm{cat}}[\mathrm{ES}] = k_{1}[\mathrm{E}][\mathrm{S}]$$

or, since the concentration of the free enzyme, [E], is equal
to [E₀] - [ES],

$$[\mathrm{ES}] = \left(\frac{k_{1}}{k_{-1} + k_{\mathrm{cat}}}\right)[\mathrm{E}][\mathrm{S}] = \left(\frac{k_{1}}{k_{-1} + k_{\mathrm{cat}}}\right)\left([\mathrm{E}_0] - [\mathrm{ES}]\right)[\mathrm{S}]$$

Rearranging, and defining the constant Km as

$$K_m = \frac{k_{-1} + k_{\mathrm{cat}}}{k_{1}}$$

we get

$$[\mathrm{ES}] = \frac{[\mathrm{E}_0][\mathrm{S}]}{K_m + [\mathrm{S}]}$$

or, remembering that V = kcat[ES], we obtain the famous
Michaelis-Menten equation

$$V = \frac{k_{\mathrm{cat}}[\mathrm{E}_0][\mathrm{S}]}{K_m + [\mathrm{S}]}$$

As [S] is increased to higher and higher levels, essentially all of
the enzyme will be bound to substrate at steady state; at this
point, a maximum rate of reaction, Vmax, will be reached where
V = Vmax = kcat[E₀]. Thus, it is convenient to rewrite the
Michaelis-Menten equation as

$$V = \frac{V_{\max}[\mathrm{S}]}{K_m + [\mathrm{S}]}$$
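
The steady-state expression for [ES] can be checked numerically. The sketch below is a minimal simulation, assuming the one-substrate scheme above with no reverse reaction from product; it integrates the rate equations with a simple forward-Euler step and compares the simulated [ES] with [E₀][S]/(Km + [S]). The rate constants, concentrations, units (µM and seconds), and function name are hypothetical values chosen only so that substrate greatly exceeds enzyme.

```python
def simulate_es(k1, k_minus1, kcat, e0, s0, t_end, dt=1e-4):
    """Forward-Euler integration of E + S <-> ES -> E + P.

    Returns ([ES], [S]) at time t_end. Free enzyme is e0 - [ES].
    """
    es, s = 0.0, s0
    steps = int(round(t_end / dt))
    for _ in range(steps):
        e = e0 - es
        formation = k1 * e * s                 # rate of ES formation
        breakdown = (k_minus1 + kcat) * es     # rate of ES breakdown
        d_es = formation - breakdown
        d_s = -k1 * e * s + k_minus1 * es      # substrate consumed and released
        es += d_es * dt
        s += d_s * dt
    return es, s


# Hypothetical parameters: concentrations in uM, time in seconds.
k1, k_minus1, kcat = 1.0, 1.0, 1.0   # k1 in 1/(uM*s); k_minus1 and kcat in 1/s
e0, s0 = 0.01, 10.0

km = (k_minus1 + kcat) / k1
es_sim, s_now = simulate_es(k1, k_minus1, kcat, e0, s0, t_end=1.0)
es_predicted = e0 * s_now / (km + s_now)   # steady-state formula
velocity = kcat * es_sim                   # V = kcat [ES]
```

With substrate in large excess, the simulated [ES] settles within about a second to a value very close to the steady-state prediction, and the velocity then changes only slowly as substrate is consumed.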


















THE DOUBLE-RECIPROCAL PLOT


In a typical plot of V versus [S] for an enzyme that follows
Michaelis-Menten kinetics (see Figure 3-46), neither the value of
Km nor of Vmax is immediately clear. To obtain Vmax and Km
from such data, a double-reciprocal
plot is often used, in which the Michaelis-Menten equation
has merely been rearranged, so that 1/V can be plotted
versus 1/[S].

$$\frac{1}{V} = \left(\frac{K_m}{V_{\max}}\right)\left(\frac{1}{[\mathrm{S}]}\right) + \frac{1}{V_{\max}}$$


THE SIGNIFICANCE OF Km, kcat, AND kcat/Km


As described in the text, Km is an approximate measure of
substrate affinity for the enzyme: it is numerically equal to
the substrate concentration [S] at V = 0.5 Vmax. In general, a lower
value of Km means tighter substrate binding. In fact, for
those cases where kcat is much smaller than k₋₁, the Km will
be equal to Kd, the dissociation constant for substrate
binding to the enzyme (Kd = 1/Ka; see Figure 3-44).

We have seen that kcat is the turnover number for the
enzyme. At very low substrate concentrations, where
[S] ≪ Km, most of the enzyme is free. Thus we can think of
[E] = [E₀], so that the Michaelis-Menten equation becomes
V = (kcat/Km)[E][S]. Thus, the ratio kcat/Km is equivalent to the
rate constant for the reaction between free enzyme and
free substrate.

A comparison of kcat/Km for the same enzyme with
different substrates, or for two enzymes with their
different substrates, is widely used as a measure of enzyme
effectiveness.


For simplicity, in this Panel we have discussed enzymes
that have only one substrate, such as the lysozyme enzyme
described in the text (see p. 144). Most enzymes have two
substrates, one of which is often an active carrier
molecule, such as NADH or ATP.

A similar, but more complex, analysis is used to determine
the kinetics of such enzymes, allowing the order of substrate
binding and the presence of covalent intermediates along
the pathway to be revealed.

SOME ENZYMES ARE DIFFUSION LIMITED


The values of kcat, Km, and kcat/Km for some selected
enzymes are given below:

enzyme                 substrate        kcat (sec⁻¹)    Km (M)       kcat/Km (sec⁻¹M⁻¹)
acetylcholinesterase   acetylcholine    1.4 × 10⁴       9 × 10⁻⁵     1.6 × 10⁸
catalase               H₂O₂             4 × 10⁷         1            4 × 10⁷
fumarase               fumarate         8 × 10²         5 × 10⁻⁶     1.6 × 10⁸

Because an enzyme and its substrate must collide before
they can react, kcat/Km has a maximum possible value that is
limited by collision rates. If every collision forms an
enzyme-substrate complex, one can calculate from diffusion
theory that kcat/Km will be between 10⁸ and 10⁹ sec⁻¹M⁻¹, in
the case where all subsequent steps proceed immediately.
Thus, it is claimed that enzymes like acetylcholinesterase and
fumarase are "perfect enzymes," each enzyme having
evolved to the point where nearly every collision with its
substrate converts the substrate to a product.

Figure 3-47 Enzymatic acceleration of chemical reactions by decreasing
the activation energy. There is a single transition state in this example.
However, often both the uncatalyzed reaction (A) and the enzyme-catalyzed
reaction (B) go through a series of transition states. In that case, it is the
transition state with the highest energy (S‡ and ES‡) that determines the
activation energy and limits the rate of the reaction. (S = substrate;
P = product of the reaction; ES = enzyme-substrate complex; EP = enzyme-product
complex.)


Because this tight binding greatly lowers the energy of the transition state, the
enzyme greatly accelerates a particular reaction by lowering the activation energy
that is required (Figure 3-47).
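
To get a feel for how strongly the rate depends on the activation energy, the sketch below assumes a simple exponential dependence of the form rate ∝ exp(-Ea/RT), with everything else about the reaction unchanged. That simplification, the temperature of 25°C, and the function names are assumptions of this sketch, not statements from the text. It converts between a lowering of the activation energy and the resulting rate factor.

```python
import math

R = 8.314  # gas constant in J/(mol*K)


def barrier_reduction_kj(rate_factor, temp_kelvin=298.15):
    """Lowering of the activation energy (kJ/mol) needed for a given
    fold acceleration, assuming rate is proportional to exp(-Ea / RT)."""
    return R * temp_kelvin * math.log(rate_factor) / 1000.0


def rate_factor(reduction_kj, temp_kelvin=298.15):
    """Fold acceleration produced by lowering the barrier by reduction_kj."""
    return math.exp(reduction_kj * 1000.0 / (R * temp_kelvin))


# Barrier reductions corresponding to accelerations of 10^9 and 10^23,
# the range of enzyme rate accelerations cited for Figure 3-48.
low_end_kj = barrier_reduction_kj(1e9)
high_end_kj = barrier_reduction_kj(1e23)

# Each additional 5.7 kJ/mol or so gives roughly another factor of ten.
per_decade_kj = barrier_reduction_kj(10.0)
```

Under this assumption, accelerations of 10⁹ to 10²³ correspond to lowering the barrier by roughly 51 to 131 kJ/mole.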

Enzymes Can Use Simultaneous Acid and Base Catalysis 

Figure 3-48 compares the spontaneous reaction rates and the corresponding
enzyme-catalyzed rates for five enzymes. Rate accelerations range from 10⁹ to
10²³. Enzymes not only bind tightly to a transition state, they also contain precisely
positioned atoms that alter the electron distributions in the atoms that participate
directly in the making and breaking of covalent bonds. Peptide bonds, for
example, can be hydrolyzed in the absence of an enzyme by exposing a polypeptide to
either a strong acid or a strong base. Enzymes are unique, however, in being able
to use acid and base catalysis simultaneously, because the rigid framework of the
protein constrains the acidic and basic residues and prevents them from
combining with each other, as they would do in solution (Figure 3-49).

The fit between an enzyme and its substrate needs to be precise. A small
change introduced by genetic engineering in the active site of an enzyme can
therefore have a profound effect. Replacing a glutamic acid with an aspartic acid
in one enzyme, for example, shifts the position of the catalytic carboxylate ion by
only 1 Å (about the radius of a hydrogen atom); yet this is enough to decrease the
activity of the enzyme a thousandfold.

Lysozyme Illustrates How an Enzyme Works

To demonstrate how enzymes catalyze chemical reactions, we examine an enzyme
that acts as a natural antibiotic in egg white, saliva, tears, and other secretions.
Lysozyme catalyzes the cutting of polysaccharide chains in the cell walls of
bacteria. The bacterial cell is under pressure from osmotic forces, and cutting even a
small number of these chains causes the cell wall to rupture and the cell to burst.
A relatively small and stable protein that can be easily isolated in large quantities,
lysozyme was the first enzyme to have its structure worked out in atomic detail by
x-ray crystallography (in the mid-1960s).

The reaction that lysozyme catalyzes is a hydrolysis: it adds a molecule of water
to a single bond between two adjacent sugar groups in the polysaccharide chain,
thereby causing the bond to break (see Figure 2-9). The reaction is energetically
favorable because the free energy of the severed polysaccharide chain is lower
than the free energy of the intact chain. However, there is an energy barrier to the
reaction, and a colliding water molecule can break a bond linking two sugars only
if the polysaccharide molecule is distorted into a particular shape, the transition
state, in which the atoms around the bond have an altered geometry and
electron distribution. Because of this requirement, random collisions must supply a
very large activation energy for the reaction to take place. In an aqueous solution
at room temperature, the energy of collisions almost never exceeds the activation
energy. The pure polysaccharide can therefore remain for years in water without
being hydrolyzed to any detectable degree.

Figure 3-48 The rate accelerations
caused by five different enzymes.
(Adapted from A. Radzicka and
R. Wolfenden, Science 267:90-93, 1995.)

[Figure panel label: both acid and base catalyses]

This situation changes drastically when the polysaccharide binds to lysozyme. 
The active site of lysozyme, because its substrate is a polymer, is a long groove that 
holds six linked sugars at the same time. As soon as the polysaccharide binds to 
form an enzyme-substrate complex, the enzyme cuts the polysaccharide by adding 
a water molecule across one of its sugar-sugar bonds. The product chains are 
then quickly released, freeing the enzyme for further cycles of reaction (Figure 
3-50). 

An impressive increase in hydrolysis rate is possible because conditions are 
created in the microenvironment of the lysozyme active site that greatly reduce 
the activation energy necessary for the hydrolysis to take place. In particular, lysozyme 
distorts one of the two sugars connected by the bond to be broken from its 
normal, most stable conformation. The bond to be broken is also held close to two 
amino acids with acidic side chains (a glutamic acid and an aspartic acid) that 
participate directly in the reaction. Figure 3-51 shows the three central steps in 
this enzymatically catalyzed reaction, which occurs millions of times faster than 
uncatalyzed hydrolysis. 
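
To get a feel for how much the activation energy must fall to give a speed-up of this size, the short sketch below uses the Arrhenius relation, rate ∝ exp(-Ea/RT). It is an illustration only: it assumes that the enzyme changes nothing but the barrier height, and the temperature (298.15 K) and the function name are choices made here, not values from the text.

```python
import math

R = 8.314462618   # gas constant, J/(mol*K)
T = 298.15        # assumed room temperature, K

def barrier_reduction_kj_per_mol(rate_enhancement, temperature=T):
    """Drop in activation energy (kJ/mol) that yields the given rate enhancement,
    assuming rate is proportional to exp(-Ea / RT) with an unchanged prefactor."""
    return R * temperature * math.log(rate_enhancement) / 1000.0

for fold in (1e3, 1e6, 1e9):
    print(f"{fold:.0e}-fold faster -> barrier lowered by "
          f"{barrier_reduction_kj_per_mol(fold):.1f} kJ/mol")
```

Under these assumptions a millionfold acceleration corresponds to lowering the barrier by roughly 34 kJ/mole (about 8 kcal/mole), a modest amount compared with the energy of a covalent bond.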

Other enzymes use similar mechanisms to lower activation energies and 
speed up the reactions they catalyze. In reactions involving two or more reactants, 
the active site also acts like a template, or mold, that brings the substrates together 
in the proper orientation for a reaction to occur between them (Figure 3-52A). As 
we saw for lysozyme, the active site of an enzyme contains precisely positioned 

(A) S + E → ES → EP 



Figure 3-49 Acid catalysis and base 
catalysis. (A) The start of the uncatalyzed 
reaction that hydrolyzes a peptide bond, 
with blue shading used to indicate electron 
distribution in the water and carbonyl 
bonds. (B) An acid likes to donate a proton 
(H⁺) to other atoms. By pairing with the 
carbonyl oxygen, an acid causes electrons 
to move away from the carbonyl carbon, 
making this atom much more attractive to 
the electronegative oxygen of an attacking 
water molecule. (C) A base likes to take 
up H⁺. By pairing with a hydrogen of the 
attacking water molecule, a base causes 
electrons to move toward the water oxygen, 
making it a better attacking group for the 
carbonyl carbon. (D) By having appropriately 
positioned atoms on its surface, an enzyme 
can perform both acid catalysis and base 
catalysis at the same time. 



Figure 3-50 The reaction catalyzed by lysozyme. (A) The enzyme lysozyme (E) catalyzes the cutting of a polysaccharide 
chain, which is its substrate (S). The enzyme first binds to the chain to form an enzyme-substrate complex (ES) and then 
catalyzes the cleavage of a specific covalent bond in the backbone of the polysaccharide, forming an enzyme-product complex 
(EP) that rapidly dissociates. Release of the severed chain (the products P) leaves the enzyme free to act on another substrate 
molecule. (B) A space-filling model of the lysozyme molecule bound to a short length of polysaccharide chain before cleavage 
(Movie 3.8). (B, courtesy of Richard J. Feldmann; PDB code: 3AB6.) 
SUBSTRATE 
This substrate is an oligosaccharide of six sugars, labeled A through F. Only sugars D and E are shown in detail. 

PRODUCTS 
The final products are an oligosaccharide of four sugars (left) and a disaccharide (right), produced by hydrolysis. 



In the enzyme-substrate complex (ES), the 
enzyme forces sugar D into a strained 
conformation. The Glu35 in the enzyme is 
positioned to serve as an acid that attacks the 
adjacent sugar-sugar bond by donating a proton 
(H⁺) to sugar E; Asp52 is poised to attack the 
C1 carbon atom. 


The Asp52 has formed a covalent bond between 
the enzyme and the C1 carbon atom of sugar D. 
The Glu35 then polarizes a water molecule (red), 
so that its oxygen can readily attack the C1 
carbon atom and displace Asp52. 


The reaction of the water molecule (red) 
completes the hydrolysis and returns the enzyme 
to its initial state, forming the final 
enzyme-product complex (EP). 


atoms that speed up a reaction by using charged groups to alter the distribution of 
electrons in the substrates (Figure 3-52B). And as we have also seen, when a substrate 
binds to an enzyme, bonds in the substrate are often distorted, changing the 
substrate shape. These changes, along with mechanical forces, drive a substrate 
toward a particular transition state (Figure 3-52C). Finally, like lysozyme, many 
enzymes participate intimately in the reaction by transiently forming a covalent 
bond between the substrate and a side chain of the enzyme. Subsequent steps in 
the reaction restore the side chain to its original state, so that the enzyme remains 
unchanged after the reaction (see also Figure 2-48). 


Tightly Bound Small Molecules Add Extra Functions to Proteins 


Although we have emphasized the versatility of enzymes (and proteins in general) 
as chains of amino acids that perform remarkable functions, there are many 
instances in which the amino acids by themselves are not enough. Just as humans 



(A) enzyme binds to two 
substrate molecules and 
orients them precisely to 
encourage a reaction to 
occur between them 



(B) binding of substrate 
to enzyme rearranges 
electrons in the substrate, 
creating partial negative 
and positive charges 
that favor a reaction 



(C) enzyme strains the 
bound substrate 
molecule, forcing it 
toward a transition 
state to favor a reaction 


Figure 3-51 Events at the active site 
of lysozyme. The top left and top right 
drawings show the free substrate and the 
free products, respectively, whereas the 
other three drawings show the sequential 
events at the enzyme active site. Note 
the change in the conformation of sugar 
D in the enzyme-substrate complex; this 
shape change stabilizes the oxocarbenium 
ion-like transition states required for 
formation and hydrolysis of the covalent 
intermediate shown in the middle panel. 
It is also possible that a carbonium ion 
intermediate forms in step 2, but the 
covalent intermediate shown in the middle 
panel has been detected with a synthetic 
substrate (Movie 3.9). (See D.J. Vocadlo et 
al., Nature 412:835-838, 2001.) 


Figure 3-52 Some general strategies of 
enzyme catalysis. (A) Holding substrates 
together in a precise alignment. (B) Charge 
stabilization of reaction intermediates. 
(C) Applying forces that distort bonds in the 
substrate to increase the rate of a particular 
reaction. 
Table 3-2 Many Vitamin Derivatives Are Critical Coenzymes for Human Cells 

Vitamin                    Coenzyme                  Enzyme-catalyzed reactions requiring these coenzymes 
Thiamine (vitamin B₁)      Thiamine pyrophosphate    Activation and transfer of aldehydes 
Riboflavin (vitamin B₂)    FADH                      Oxidation-reduction 
Niacin                     NADH, NADPH               Oxidation-reduction 
Pantothenic acid           Coenzyme A                Acyl group activation and transfer 
Pyridoxine                 Pyridoxal phosphate       Amino acid activation; also glycogen phosphorylase 
Biotin                     Biotin                    CO₂ activation and transfer 
Lipoic acid                Lipoamide                 Acyl group activation; oxidation-reduction 
Folic acid                 Tetrahydrofolate          Activation and transfer of single carbon groups 
Vitamin B₁₂                Cobalamin coenzymes       Isomerization and methyl group transfers 


employ tools to enhance and extend the capabilities of their hands, enzymes and 
other proteins often use small nonprotein molecules to perform functions that 
would be difficult or impossible to do with amino acids alone. Thus, enzymes frequently 
have a small molecule or metal atom tightly associated with their active 
site that assists with their catalytic function. Carboxypeptidase, for example, an 
enzyme that cuts polypeptide chains, carries a tightly bound zinc ion in its active 
site. During the cleavage of a peptide bond by carboxypeptidase, the zinc ion forms 
a transient bond with one of the substrate atoms, thereby assisting the hydrolysis 
reaction. In other enzymes, a small organic molecule serves a similar purpose. 
Such organic molecules are often referred to as coenzymes. An example is biotin, 
which is found in enzymes that transfer a carboxylate group (-COO⁻) from one 
molecule to another (see Figure 2-40). Biotin participates in these reactions by 
forming a transient covalent bond to the -COO⁻ group to be transferred, being 
better suited to this function than any of the amino acids used to make proteins. 
Because it cannot be synthesized by humans, and must therefore be supplied in 
small quantities in our diet, biotin is a vitamin. Many other coenzymes are either 
vitamins or derivatives of vitamins (Table 3-2). 

Other proteins also frequently require specific small-molecule adjuncts to 
function properly. Thus, the signal receptor protein rhodopsin, which is made by 
the photoreceptor cells in the retina, detects light by means of a small molecule, 
retinal, embedded in the protein (Figure 3-53A). Retinal, which is derived from 
vitamin A, changes its shape when it absorbs a photon of light, and this change 
causes the protein to trigger a cascade of enzymatic reactions that eventually lead 
to an electrical signal being carried to the brain. 




Figure 3-53 Retinal and heme. (A) The 
structure of retinal, the light-sensitive 
molecule attached to rhodopsin in the eye. 
The structure shown isomerizes when it 
absorbs light. (B) The structure of a heme 
group. The carbon-containing heme ring 
is red and the iron atom at its center is 
orange. A heme group is tightly bound 
to each of the four polypeptide chains in 
hemoglobin, the oxygen-carrying protein 
whose structure is shown in Figure 3-19. 

Another example of a protein with a nonprotein portion is hemoglobin (see 
Figure 3-19). Each molecule of hemoglobin carries four heme groups, ring-shaped 
molecules each with a single central iron atom (Figure 3-53B). Heme gives hemoglobin 
(and blood) its red color. By binding reversibly to oxygen gas through its 
iron atom, heme enables hemoglobin to pick up oxygen in the lungs and release 
it in the tissues. 

Sometimes these small molecules are attached covalently and permanently 
to their protein, thereby becoming an integral part of the protein molecule itself. 
We shall see in Chapter 10 that proteins are often anchored to cell membranes 
through covalently attached lipid molecules. And membrane proteins exposed on 
the surface of the cell, as well as proteins secreted outside the cell, are often modified 
by the covalent addition of sugars and oligosaccharides. 

Multienzyme Complexes Help to Increase the Rate of Cell 
Metabolism 

The efficiency of enzymes in accelerating chemical reactions is crucial to the 
maintenance of life. Cells, in effect, must race against the unavoidable processes 
of decay, which, if left unattended, cause macromolecules to run downhill 
toward greater and greater disorder. If the rates of desirable reactions were not 
greater than the rates of competing side reactions, a cell would soon die. We can 
get some idea of the rate at which cell metabolism proceeds by measuring the 
rate of ATP utilization. A typical mammalian cell “turns over” (i.e., hydrolyzes and 
restores by phosphorylation) its entire ATP pool once every 1 or 2 minutes. For 
each cell, this turnover represents the utilization of roughly 10⁷ molecules of ATP 
per second (or, for the human body, about 1 gram of ATP every minute). 

The rates of reactions in cells are rapid because enzyme catalysis is so effective. 
Some enzymes have become so efficient that there is no possibility of further useful 
improvement. The factor that limits the reaction rate is no longer the enzyme's 
intrinsic speed of action; rather, it is the frequency with which the enzyme collides 
with its substrate. Such a reaction is said to be diffusion-limited (see Panel 3-2, 
pp. 142-143). 

The amount of product produced by an enzyme will depend on the concentration 
of both the enzyme and its substrate. If a sequence of reactions is to occur 
extremely rapidly, each metabolic intermediate and enzyme involved must be 
present in high concentration. However, given the enormous number of different 
reactions performed by a cell, there are limits to the concentrations that can be 
achieved. In fact, most metabolites are present in micromolar (10⁻⁶ M) concentrations, 
and most enzyme concentrations are much lower. How is it possible, therefore, 
to maintain very fast metabolic rates? 

The answer lies in the spatial organization of cell components. The cell can 
increase reaction rates without raising substrate concentrations by bringing the 
various enzymes involved in a reaction sequence together to form a large protein 
assembly known as a multienzyme complex (Figure 3-54). Because this assembly 
is organized in a way that allows the product of enzyme A to be passed directly 
to enzyme B, and so on, diffusion rates need not be limiting, even when the concentrations 
of the substrates in the cell as a whole are very low. It is perhaps not 
surprising, therefore, that such enzyme complexes are very common, and they 
are involved in nearly all aspects of metabolism, including the central genetic 
processes of DNA, RNA, and protein synthesis. In fact, few enzymes in eukaryotic 
cells diffuse freely in solution; instead, most seem to have evolved binding sites 
that concentrate them with other proteins of related function in particular regions 
of the cell, thereby increasing the rate and efficiency of the reactions that they 
catalyze (see p. 331). 

Eukaryotic cells have yet another way of increasing the rate of metabolic reactions: 
using their intracellular membrane systems. These membranes can segregate 
particular substrates and the enzymes that act on them into the same 
membrane-enclosed compartment, such as the endoplasmic reticulum or the cell 
nucleus. If, for example, a compartment occupies a total of 10% of the volume of 

Figure 3-54 How unstructured regions of polypeptide chain serving as tethers allow reaction intermediates to be 
passed from one active site to another in large multienzyme complexes. (A-C) The fatty acid synthase in mammals. (A) The 
location of seven protein domains with different activities in this 270 kilodalton protein. The numbers refer to the order in which 
each enzyme domain must function to complete each two-carbon addition step. After multiple cycles of two-carbon addition, 
the termination domain releases the final product once the desired length of fatty acid has been synthesized. (B) The structure 
of the dimeric enzyme, with the location of the five active sites in one monomer indicated. (C) How a flexible tether allows the 
substrate that remains linked to the acyl carrier domain (red) to be passed from one active site to another in each monomer, 
sequentially elongating and modifying the bound fatty acid intermediate (yellow). The five steps are repeated until the final length 
of fatty acid chain has been synthesized. (Only steps 1 through 4 are illustrated here.) 
(D) Multiple tethered subunits in the giant pyruvate dehydrogenase complex (9500 kilodaltons, larger than a ribosome) that 
catalyzes the conversion of pyruvate to acetyl CoA. As in (C), a covalently bound substrate held on a flexible tether (red balls 
with yellow substrate) is serially passed through active sites on subunits (here labeled 1 through 3) to produce the final products. 
Here, subunit 1 catalyzes the decarboxylation of pyruvate accompanied by the reductive acetylation of a lipoyl group linked to 
one of the red balls. Subunit 2 transfers this acetyl group to CoA, forming acetyl CoA, and subunit 3 reoxidizes the lipoyl group 
to prepare it for the next cycle. Only one-tenth of the subunits labeled 1 and 3, attached to the core formed by subunit 2, are 
illustrated here. This important reaction takes place in the mammalian mitochondrion, as part of the pathway that oxidizes 
sugars to CO₂ and H₂O (see page 82). (A-C, adapted from T. Maier et al., Quart. Rev. Biophys. 43:373-422, 2010; 
D, from J.L.S. Milne et al., J. Biol. Chem. 281:4364-4370, 2006.) 


the cell, the concentration of reactants in that compartment may be increased by 
10 times compared with a cell with the same number of enzyme and substrate 
molecules, but no compartmentalization. Reactions limited by the speed of diffusion 
can thereby be speeded up by a factor of 10. 
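
The factor of 10 can be checked with a few lines of arithmetic. The sketch below assumes a simple mass-action picture in which the number of enzyme-substrate encounters per unit volume is proportional to the product of the two concentrations; the function name, the rate constant k, and the molecule counts are illustrative choices, not values from the text.

```python
def encounter_rate(n_enzyme, n_substrate, volume, k=1.0):
    """Total enzyme-substrate encounters per unit time, assuming the rate per
    unit volume is k * [E] * [S] (mass action) and scaling by the volume."""
    conc_e = n_enzyme / volume
    conc_s = n_substrate / volume
    return k * conc_e * conc_s * volume

cell_volume = 1.0   # arbitrary units
fraction = 0.10     # the compartment occupies 10% of the cell volume

whole_cell = encounter_rate(1000, 1000, cell_volume)
compartment = encounter_rate(1000, 1000, fraction * cell_volume)

print("concentration increase:", 1 / fraction)
print("overall rate increase: ", compartment / whole_cell)
```

Confining both partners raises each concentration tenfold, so encounters per unit volume rise a hundredfold; but the reaction now occupies only one-tenth of the volume, leaving a net tenfold gain in the overall rate.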


The Cell Regulates the Catalytic Activities of Its Enzymes 

A living cell contains thousands of enzymes, many of which operate at the same 
time and in the same small volume of the cytosol. By their catalytic action, these 
enzymes generate a complex web of metabolic pathways, each composed of 
chains of chemical reactions in which the product of one enzyme becomes the 
substrate of the next. In this maze of pathways, there are many branch points 
(nodes) where different enzymes compete for the same substrate. The system is 
complex (see Figure 2-63), and elaborate controls are required to regulate when 
and how rapidly each reaction occurs. 

Regulation occurs at many levels. At one level, the cell controls how many 
molecules of each enzyme it makes by regulating the expression of the gene that 
encodes that enzyme (discussed in Chapter 7). The cell also controls enzymatic 
activities by confining sets of enzymes to particular subcellular compartments, 
whether by enclosing them in a distinct membrane-bounded compartment (discussed 
in Chapters 12 and 14) or by concentrating them on a protein scaffold (see 
Figure 3-77). As will be explained later in this chapter, enzymes are also covalently 
modified to control their activity. The rate of protein destruction by targeted 
proteolysis represents yet another important regulatory mechanism (see Figure 
6-86). But the most general process that adjusts reaction rates operates through 
a direct, reversible change in the activity of an enzyme in response to the specific 
small molecules that it binds. 

The most common type of control occurs when an enzyme binds a molecule 
that is not a substrate to a special regulatory site outside the active site, thereby 
altering the rate at which the enzyme converts its substrates to products. For example, 
in feedback inhibition, a product produced late in a reaction pathway inhibits 
an enzyme that acts earlier in the pathway. Thus, whenever large quantities of 
the final product begin to accumulate, this product binds to the enzyme and slows 
down its catalytic action, thereby limiting the further entry of substrates into that 
reaction pathway (Figure 3-55). Where pathways branch or intersect, there are 
usually multiple points of control by different final products, each of which works 
to regulate its own synthesis (Figure 3-56). Feedback inhibition can work almost 
instantaneously, and it is rapidly reversed when the level of the product falls. 
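
The stabilizing effect of feedback inhibition can be illustrated with a toy simulation. The sketch below is a deliberately minimal model, not a description of any real pathway: an end product Z is made at a rate that is either constant or reduced by Z itself (using the assumed form v_max / (1 + Z/K_i)), and Z is consumed at a rate proportional to its level. All parameter values and names are hypothetical.

```python
def steady_state_level(k_use, feedback, v_max=10.0, k_i=1.0, dt=0.01, t_end=400.0):
    """Euler-integrate dZ/dt = synthesis - k_use * Z and return the final Z.

    With feedback, synthesis = v_max / (1 + Z / k_i), so the end product Z
    slows the enzyme that makes it; without feedback, synthesis = v_max.
    """
    z = 0.0
    for _ in range(int(t_end / dt)):
        synthesis = v_max / (1.0 + z / k_i) if feedback else v_max
        z += dt * (synthesis - k_use * z)
    return z

for feedback in (False, True):
    high_demand = steady_state_level(k_use=1.0, feedback=feedback)
    low_demand = steady_state_level(k_use=0.1, feedback=feedback)
    print(f"feedback={feedback}: Z rises {low_demand / high_demand:.1f}-fold "
          "when demand drops tenfold")
```

In this model a tenfold drop in demand lets the product pile up tenfold when synthesis is unregulated, but only about 3.5-fold when the product inhibits its own synthesis, which is the sense in which the product "controls its own level."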



Figure 3-55 Feedback inhibition of a 
single biosynthetic pathway. The end 
product Z inhibits the first enzyme that is 
unique to its synthesis and thereby controls 
its own level in the cell. This is an example 
of negative regulation. 





Figure 3-56 Multiple feedback inhibition. 
In this example, which shows the 
biosynthetic pathways for four different 
amino acids in bacteria, the red lines 
indicate positions at which products feed 
back to inhibit enzymes. Each amino acid 
controls the first enzyme specific to its 
own synthesis, thereby controlling its own 
levels and avoiding a wasteful, or even 
dangerous, buildup of intermediates. The 
products can also separately inhibit the 
initial set of reactions common to all the 
syntheses; in this case, three different 
enzymes catalyze the initial reaction, each 
inhibited by a different product. 

Feedback inhibition is negative regulation: it prevents an enzyme from acting. 
Enzymes can also be subject to positive regulation, in which a regulatory molecule 
stimulates the enzyme's activity rather than shutting the enzyme down. Positive 
regulation occurs when a product in one branch of the metabolic network stimulates 
the activity of an enzyme in another pathway. As one example, the accumulation 
of ADP activates several enzymes involved in the oxidation of sugar molecules, 
thereby stimulating the cell to convert more ADP to ATP. 

Allosteric Enzymes Have Two or More Binding Sites That Interact 

A striking feature of both positive and negative feedback regulation is that the regulatory 
molecule often has a shape totally different from the shape of the substrate 
of the enzyme. This is why the effect on a protein is termed allostery (from the 
Greek words allos, meaning “other,” and stereos, meaning “solid” or “three-dimensional”). 
As biologists learned more about feedback regulation, they recognized 
that the enzymes involved must have at least two different binding sites on their 
surface: an active site that recognizes the substrates, and a regulatory site that 
recognizes a regulatory molecule. These two sites must somehow communicate 
so that the catalytic events at the active site can be influenced by the binding of the 
regulatory molecule at its separate site on the protein's surface. 

The interaction between separated sites on a protein molecule is now known 
to depend on a conformational change in the protein: binding at one of the sites 
causes a shift from one folded shape to a slightly different folded shape. During 
feedback inhibition, for example, the binding of an inhibitor at one site on the 
protein causes the protein to shift to a conformation that incapacitates its active 
site located elsewhere in the protein. 

It is thought that most protein molecules are allosteric. They can adopt two or 
more slightly different conformations, and a shift from one to another caused by 
the binding of a ligand can alter their activity. This is true not only for enzymes 
but also for many other proteins, including receptors, structural proteins, and 
motor proteins. In all instances of allosteric regulation, each conformation of the 
protein has somewhat different surface contours, and the protein's binding sites 
for ligands are altered when the protein changes shape. Moreover, as we discuss 
next, each ligand will stabilize the conformation that it binds to most strongly, and 
thus, at high enough concentrations, will tend to “switch” the protein toward 
the conformation that the ligand prefers. 

Two Ligands Whose Binding Sites Are Coupled Must Reciprocally 
Affect Each Other’s Binding 

The effects of ligand binding on a protein follow from a fundamental chemical 
principle known as linkage. Suppose, for example, that a protein that binds glucose 
also binds another molecule, X, at a distant site on the protein's surface. If 
the binding site for X changes shape as part of the conformational change in the 
protein induced by glucose binding, the binding sites for X and for glucose are 
said to be coupled. Whenever two ligands prefer to bind to the same conformation 
of an allosteric protein, it follows from basic thermodynamic principles that each 
ligand must increase the affinity of the protein for the other. For example, if the 
shift of a protein to a conformation that binds glucose best also causes the binding 
site for X to fit X better, then the protein will bind glucose more tightly when X is 
present than when X is absent. In other words, X will positively regulate the protein's 
binding of glucose (Figure 3-57). 

Conversely, linkage operates in a negative way if two ligands prefer to bind 
to different conformations of the same protein. In this case, the binding of the 
first ligand discourages the binding of the second ligand. Thus, if a shape change 
caused by glucose binding decreases the affinity of a protein for molecule X, the 
binding of X must also decrease the protein's affinity for glucose (Figure 3-58). 
The linkage relationship is quantitatively reciprocal, so that, for example, if glucose 
has a very large effect on the binding of X, X has a very large effect on the 
binding of glucose. 
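
The reciprocity of linkage can be made concrete with a small two-state calculation. The sketch below assumes a protein that flips between an open and a closed conformation, with each ligand having a different affinity for each form; the model, the function names, and all the numbers are illustrative assumptions rather than values taken from the text. The "coupling factor" is the fold change in one ligand's apparent affinity when the other ligand is present at saturating concentration.

```python
def apparent_affinity(k_open, k_closed, closed_over_open):
    """Apparent association constant of a ligand for a protein that flips between
    an open and a closed conformation (population-weighted average)."""
    ratio = closed_over_open
    return (k_open + ratio * k_closed) / (1.0 + ratio)

def coupling_factors(ratio, glc_open, glc_closed, x_open, x_closed):
    """Fold change in each ligand's apparent affinity when the other ligand is
    bound at saturating concentration."""
    # A saturating ligand shifts the closed/open equilibrium by the ratio of
    # its affinities for the two conformations.
    ratio_with_x = ratio * x_closed / x_open
    ratio_with_glc = ratio * glc_closed / glc_open
    effect_of_x_on_glc = (apparent_affinity(glc_open, glc_closed, ratio_with_x)
                          / apparent_affinity(glc_open, glc_closed, ratio))
    effect_of_glc_on_x = (apparent_affinity(x_open, x_closed, ratio_with_glc)
                          / apparent_affinity(x_open, x_closed, ratio))
    return effect_of_x_on_glc, effect_of_glc_on_x

# Both ligands prefer the closed form: each raises the other's affinity.
print(coupling_factors(1.0, glc_open=1.0, glc_closed=10.0, x_open=1.0, x_closed=10.0))
# X prefers the open form instead: each lowers the other's affinity.
print(coupling_factors(1.0, glc_open=1.0, glc_closed=10.0, x_open=10.0, x_closed=1.0))
```

In both cases the two numbers returned are equal: whatever factor X imposes on glucose binding, glucose imposes on the binding of X, which is what "quantitatively reciprocal" means.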

The relationships shown in Figures 3-57 and 3-58 apply to all proteins, and 
they underlie all of cell biology. The principle seems so obvious in retrospect 
that we now take it for granted. But the discovery of linkage in studies of a few 
enzymes in the 1950s, followed by an extensive analysis of allosteric mechanisms 
in proteins in the early 1960s, had a revolutionary effect on our understanding of 
biology. Since molecule X in these examples binds at a site on the enzyme that 
is distinct from the site where catalysis occurs, it need not have any chemical 
relationship to the substrate that binds at the active site. Moreover, as we have 
just seen, for enzymes that are regulated in this way, molecule X can either turn 
the enzyme on (positive regulation) or turn it off (negative regulation). By such a 
mechanism, allosteric proteins serve as general switches that, in principle, can 
allow one molecule in a cell to affect the fate of any other. 

Symmetric Protein Assemblies Produce Cooperative Allosteric 
Transitions 

A single-subunit enzyme that is regulated by negative feedback can at most 
decrease from 90% to about 10% activity in response to a 100-fold increase in 
the concentration of an inhibitory ligand that it binds (Figure 3-59, red line). 
Responses of this type are apparently not sharp enough for optimal cell regulation, 
and most enzymes that are turned on or off by ligand binding consist of symmetric 
assemblies of identical subunits. With this arrangement, the binding of a molecule 
of ligand to a single site on one subunit can promote an allosteric change in 
the entire assembly that helps the neighboring subunits bind the same ligand. As 
a result, a cooperative allosteric transition occurs (Figure 3-59, blue line), allowing 





Figure 3-57 Positive regulation caused 
by conformational coupling between 
two separate binding sites. In this 
example, both glucose and molecule X 
bind best to the closed conformation of a 
protein with two domains. Because both 
glucose and molecule X drive the protein 
toward its closed conformation, each 
ligand helps the other to bind. Glucose 
and molecule X are therefore said to bind 
cooperatively to the protein. 


Figure 3-58 Negative regulation caused 
by conformational coupling between 
two separate binding sites. The scheme 
here resembles that in the previous figure, 
but here molecule X prefers the open 
conformation, while glucose prefers the 
closed conformation. Because glucose 
and molecule X drive the protein toward 
opposite conformations (closed and open, 
respectively), the presence of either ligand 
interferes with the binding of the other. 

a relatively small change in ligand concentration in the cell to switch the whole 
assembly from an almost fully active to an almost fully inactive conformation (or 
vice versa). 
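
How much sharper a cooperative response is can be estimated with a short calculation. The sketch below assumes an idealized all-or-none binding of n inhibitor molecules, so that the active fraction is 1 / (1 + ([I]/K)^n); this Hill-type form, the function names, and the value K = 1 are assumptions made for illustration. With n = 1 it reduces to the simple equilibrium K = [IP]/[I][P] used for a single-subunit enzyme.

```python
def activity(inhibitor, k_half=1.0, n=1):
    """Fraction of enzyme still active at a given inhibitor concentration,
    assuming idealized all-or-none binding of n inhibitor molecules."""
    return 1.0 / (1.0 + (inhibitor / k_half) ** n)

def fold_change_90_to_10(n):
    """Fold increase in inhibitor needed to go from 90% to 10% activity
    (with k_half = 1)."""
    conc_at_90 = (1.0 / 9.0) ** (1.0 / n)   # where activity equals 0.9
    conc_at_10 = 9.0 ** (1.0 / n)           # where activity equals 0.1
    return conc_at_10 / conc_at_90

for n in (1, 2, 4):
    print(f"{n} binding site(s): {fold_change_90_to_10(n):.1f}-fold "
          "increase in inhibitor")
```

For a single site the calculation gives an 81-fold increase, in line with the roughly 100-fold range quoted for a single-subunit enzyme; for idealized two-site and four-site cooperative binding the same drop in activity requires only a 9-fold and a 3-fold increase, respectively.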

The principles involved in a cooperative “all-or-none” transition are the same 
for all proteins, whether or not they are enzymes. Thus, for example, they are critical 
for the efficient uptake and release of O₂ by hemoglobin in our blood. But 
they are perhaps easiest to visualize for an enzyme that forms a symmetric dimer. 
In the example shown in Figure 3-60, the first molecule of an inhibitory ligand 
binds with great difficulty since its binding disrupts an energetically favorable 
interaction between the two identical monomers in the dimer. A second molecule 
of inhibitory ligand now binds more easily, however, because its binding restores 
the energetically favorable monomer-monomer contacts of a symmetric dimer 
(this also completely inactivates the enzyme). 

As an alternative to this induced fit model for a cooperative allosteric transition, 
we can view such a symmetric enzyme as having only two possible conformations, 
corresponding to the “enzyme on” and “enzyme off” structures in Figure 
3-60. In this view, ligand binding perturbs an all-or-none equilibrium between 
these two states, thereby changing the proportion of active molecules. Both models 
represent true and useful concepts. 

Many Changes in Proteins Are Driven by Protein Phosphorylation 

Proteins are regulated by more than the reversible binding of other molecules. A 
second method that eukaryotic cells use extensively to regulate a protein's function 
is the covalent addition of a smaller molecule to one or more of its amino acid 
side chains. The most common such regulatory modification in higher eukaryotes 
is the addition of a phosphate group. We shall therefore use protein phosphorylation 
to illustrate some of the general principles involved in the control of protein 
function through the modification of amino acid side chains. 

A phosphorylation event can affect the protein that is modified in three 
important ways. First, because each phosphate group carries two negative 
charges, the enzyme-catalyzed addition of a phosphate group to a protein can 
cause a major conformational change in the protein by, for example, attracting a 
cluster of positively charged amino acid side chains. This can, in turn, affect the 
binding of ligands elsewhere on the protein surface, dramatically changing the 

Figure 3-60 A cooperative allosteric transition in an enzyme composed of 
two identical subunits. This diagram illustrates how the conformation of one 
subunit can influence that of its neighbor. The binding of a single molecule of 
an inhibitory ligand (yellow) to one subunit of the enzyme occurs with difficulty 
because it changes the conformation of this subunit and thereby disrupts the 
symmetry of the enzyme. Once this conformational change has occurred, 
however, the energy gained by restoring the symmetric pairing interaction 
between the two subunits makes it especially easy for the second subunit 
to bind the inhibitory ligand and undergo the same conformational change. 
Because the binding of the first molecule of ligand increases the affinity with 
which the other subunit binds the same ligand, the response of the enzyme to 
changes in the concentration of the ligand is much steeper than the response 
of an enzyme with only one subunit (see Figure 3-59 and Movie 3.10). 


Figure 3-59 Enzyme activity versus 
the concentration of inhibitory ligand 
for single-subunit and multisubunit 
allosteric enzymes. For an enzyme with a 
single subunit (red line), a drop from 90% 
enzyme activity to 10% activity (indicated 
by the two dots on the curve) requires a 
100-fold increase in the concentration of 
inhibitor. The enzyme activity is calculated 
from the simple equilibrium relationship 
K = [IP]/[I][P], where P is active protein,
I is inhibitor, and IP is the inactive protein
bound to inhibitor. An identical curve
applies to any simple binding interaction
between two molecules, A and B. In
contrast, a multisubunit allosteric enzyme
can respond in a switchlike manner to a
change in ligand concentration: the steep
response is caused by a cooperative
binding of the ligand molecules, as
explained in Figure 3-60. Here, the green
line represents the idealized result expected
for the cooperative binding of two inhibitory
ligand molecules to an allosteric enzyme
with two subunits, and the blue line shows 
the idealized response of an enzyme with 
four subunits. As indicated by the two dots 
on each of these curves, the more complex 
enzymes drop from 90% to 10% activity 
over a much narrower range of inhibitor 
concentration than does the enzyme 
composed of a single subunit. 
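
The 90%-to-10% ranges described in this caption can be checked with a short calculation. The sketch below is illustrative only. The single-subunit curve follows directly from K = [IP]/[I][P]; the idealized cooperative curves are assumed here to follow a Hill-type relation whose exponent n equals the number of subunits, which is an assumption of this sketch rather than something stated in the caption. All function and parameter names are hypothetical.

```python
def fraction_active(inhibitor, K=1.0, n=1):
    """Fraction of enzyme still active at a given inhibitor concentration.

    For n = 1 this follows from K = [IP]/([I][P]): active = 1 / (1 + K[I]).
    For n > 1 an idealized all-or-none (Hill-type) binding of n ligands
    is ASSUMED: active = 1 / (1 + (K[I])**n).
    """
    return 1.0 / (1.0 + (K * inhibitor) ** n)


def inhibitor_at(activity, K=1.0, n=1):
    """Inhibitor concentration at which the given fraction is active."""
    return ((1.0 - activity) / activity) ** (1.0 / n) / K


def fold_change_90_to_10(n):
    """Fold increase in inhibitor needed to go from 90% to 10% activity."""
    return inhibitor_at(0.10, n=n) / inhibitor_at(0.90, n=n)


for subunits in (1, 2, 4):
    print(subunits, "subunit(s):", round(fold_change_90_to_10(subunits), 1))
```

Under these assumptions the single-subunit enzyme needs about an 81-fold increase in inhibitor, the same order of magnitude as the 100-fold change quoted in the caption, whereas the two-subunit and four-subunit models need only about 9-fold and 3-fold increases.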

protein's activity. When a second enzyme removes the phosphate group, the
protein returns to its original conformation and restores its initial activity.

Second, an attached phosphate group can form part of a structure that the 
binding sites of other proteins recognize. As previously discussed, the SH2 domain 
binds to a short peptide sequence containing a phosphorylated tyrosine side 
chain (see Figure 3-40B). More than ten other common domains provide binding 
sites for attaching their protein to phosphorylated peptides in other protein
molecules, each recognizing a phosphorylated amino acid side chain in a different
protein context. Third, the addition of a phosphate group can mask a binding site
that otherwise holds two proteins together, and thereby disrupt protein-protein
interactions. As a result, protein phosphorylation and dephosphorylation very
often drive the regulated assembly and disassembly of protein complexes (see, for
example, Figure 15-11).

Reversible protein phosphorylation controls the activity, structure, and cellular
localization of both enzymes and many other types of proteins in eukaryotic
cells. In fact, this regulation is so extensive that more than one-third of the 10,000
or so proteins in a typical mammalian cell are thought to be phosphorylated at
any given time, many with more than one phosphate. As might be expected, the
addition and removal of phosphate groups from specific proteins often occur in
response to signals that specify some change in a cell's state. For example, the
complicated series of events that takes place as a eukaryotic cell divides is largely
timed in this way (discussed in Chapter 17), and many of the signals mediating
cell-cell interactions are relayed from the plasma membrane to the nucleus by a
cascade of protein phosphorylation events (discussed in Chapter 15).

A Eukaryotic Cell Contains a Large Collection of Protein Kinases 
and Protein Phosphatases 

Protein phosphorylation involves the enzyme-catalyzed transfer of the terminal 
phosphate group of an ATP molecule to the hydroxyl group on a serine, threonine,
or tyrosine side chain of the protein (Figure 3-61). A protein kinase catalyzes
this reaction, and the reaction is essentially unidirectional because of the large
amount of free energy released when the phosphate-phosphate bond in ATP is
broken to produce ADP (discussed in Chapter 2). A protein phosphatase
catalyzes the reverse reaction of phosphate removal, or dephosphorylation. Cells
contain hundreds of different protein kinases, each responsible for phosphorylating a
different protein or set of proteins. There are also many different protein
phosphatases; some are highly specific and remove phosphate groups from only one or a
few proteins, whereas others act on a broad range of proteins and are targeted to
specific substrates by regulatory subunits. The state of phosphorylation of a
protein at any moment, and thus its activity, depends on the relative activities of the
protein kinases and phosphatases that modify it.
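
The dependence on relative activities can be made concrete with a minimal two-state sketch. It assumes, purely for illustration, that a protein is either unphosphorylated or phosphorylated and that the kinase and phosphatase each act with a simple first-order rate constant. The function name and the rate values are hypothetical.

```python
def steady_state_fraction(k_kinase, k_phosphatase):
    """Steady-state fraction of a protein population that is phosphorylated.

    ASSUMED two-state model: unphosphorylated <-> phosphorylated, with
    first-order rate constants (per second) for each direction. At steady
    state, k_kinase * (1 - p) = k_phosphatase * p.
    """
    return k_kinase / (k_kinase + k_phosphatase)


# Hypothetical rate constants, chosen only to show the trend.
for k_kin, k_phos in ((1.0, 9.0), (1.0, 1.0), (9.0, 1.0)):
    fraction = steady_state_fraction(k_kin, k_phos)
    print(f"kinase={k_kin}, phosphatase={k_phos}: {fraction:.2f} phosphorylated")
```

In this simplified picture only the ratio of the two activities sets the steady-state level; multiplying both rates by the same factor leaves it unchanged.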

The protein kinases that phosphorylate proteins in eukaryotic cells belong to a 
very large family of enzymes that share a catalytic (kinase) sequence of about 290 
amino acids. The various family members contain different amino acid sequences 
on either end of the kinase sequence (for example, see Figure 3-10), and often 
have short amino acid sequences inserted into loops within it. Some of these
additional amino acid sequences enable each kinase to recognize the specific set
of proteins it phosphorylates, or to bind to structures that localize it in specific
regions of the cell. Other parts of the protein regulate the activity of each kinase,
so it can be turned on and off in response to different specific signals, as described
below. 

By comparing the number of amino acid sequence differences between the
various members of a protein family, we can construct an “evolutionary tree” that 
is thought to reflect the pattern of gene duplication and divergence that gave rise 
to the family. Figure 3-62 shows an evolutionary tree of protein kinases. Kinases 
with related functions are often located on nearby branches of the tree: the
protein kinases involved in cell signaling that phosphorylate tyrosine side chains, for
example, are all clustered in the top left corner of the tree. The other kinases shown

Figure 3-61 Protein phosphorylation. Many
thousands of proteins in a typical eukaryotic 
cell are modified by the covalent addition of 
a phosphate group. (A) The general reaction 
transfers a phosphate group from ATP to an 
amino acid side chain of the target protein by 
a protein kinase. Removal of the phosphate 
group is catalyzed by a second enzyme, a 
protein phosphatase. In this example, the 
phosphate is added to a serine side chain; in 
other cases, the phosphate is instead linked 
to the -OH group of a threonine or a tyrosine 
in the protein. (B) The phosphorylation of a 
protein by a protein kinase can either increase 
or decrease the protein’s activity, depending 
on the site of phosphorylation and the 
structure of the protein. 

Figure 3-62 An evolutionary tree of 
selected protein kinases. A higher 
eukaryotic cell contains hundreds of such 
enzymes, and the human genome codes 
for more than 500. Note that only some of 
these, those discussed in this book, are 
shown. 


phosphorylate either a serine or a threonine side chain, and many are organized 
into clusters that seem to reflect their function (in transmembrane signal
transduction, intracellular signal amplification, cell-cycle control, and so on).
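
The first step in building such a tree, counting sequence differences between family members, can be sketched as follows. This is a toy illustration, not the method used to construct Figure 3-62: the four sequences are invented, pre-aligned placeholders rather than real kinase sequences, and real tree-building methods use more elaborate models of sequence change. All names are hypothetical.

```python
from itertools import combinations

# HYPOTHETICAL, pre-aligned toy sequences; not real kinase sequences.
aligned = {
    "kinase_A": "MKVLAGQWTR",
    "kinase_B": "MKVLAGQWSR",
    "kinase_C": "MRVIAGEWSK",
    "kinase_D": "MRVIAGEYSK",
}


def differences(seq1, seq2):
    """Count the positions at which two aligned sequences differ."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    return sum(a != b for a, b in zip(seq1, seq2))


def closest_pairs(sequences):
    """Return every pair of names, sorted from fewest to most differences."""
    pairs = [
        (differences(sequences[a], sequences[b]), a, b)
        for a, b in combinations(sorted(sequences), 2)
    ]
    return sorted(pairs)


for count, a, b in closest_pairs(aligned):
    print(f"{a} vs {b}: {count} differences")
```

Pairs with the fewest differences would be placed on neighboring branches; in this toy set, A groups with B and C groups with D.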

As a result of the combined activities of protein kinases and protein
phosphatases, the phosphate groups on proteins are continually turning over, being
added and then rapidly removed. Such phosphorylation cycles may seem wasteful,
but they are important in allowing the phosphorylated proteins to switch rapidly
from one state to another: the more rapid the cycle, the faster a population of
protein molecules can change its state of phosphorylation in response to a sudden
change in its phosphorylation rate (see Figure 15-14). The energy required to
drive this phosphorylation cycle is derived from the free energy of ATP hydrolysis,
one molecule of which is consumed for each phosphorylation event.
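
Why a faster cycle responds more quickly can be illustrated with a simple kinetic sketch. It assumes a two-state model with first-order kinase and phosphatase rate constants, in which the phosphorylated fraction relaxes exponentially toward its new steady state at a rate equal to the sum of the two constants. The model, names, and numbers are illustrative assumptions, not taken from the text.

```python
import math


def phosphorylated_fraction(t, p0, k_kinase, k_phosphatase):
    """Phosphorylated fraction at time t, starting from fraction p0.

    ASSUMED model: dp/dt = k_kinase * (1 - p) - k_phosphatase * p,
    whose solution relaxes exponentially with rate k_kinase + k_phosphatase.
    """
    k_total = k_kinase + k_phosphatase
    p_steady = k_kinase / k_total
    return p_steady + (p0 - p_steady) * math.exp(-k_total * t)


def half_time(k_kinase, k_phosphatase):
    """Time needed to cover half the distance to the new steady state."""
    return math.log(2) / (k_kinase + k_phosphatase)


# Two hypothetical cycles with the same steady state but different turnover.
slow = half_time(k_kinase=1.0, k_phosphatase=1.0)
fast = half_time(k_kinase=10.0, k_phosphatase=10.0)
print(f"slow cycle half-time: {slow:.3f} s")
print(f"fast cycle half-time: {fast:.3f} s")
```

Both cycles settle at the same phosphorylated fraction, but the cycle that turns over ten times faster reaches it ten times sooner, at the cost of consuming ATP ten times as quickly.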

The Regulation of the Src Protein Kinase Reveals How a Protein
Can Function as a Microprocessor 

The hundreds of different protein kinases in a eukaryotic cell are organized into 
complex networks of signaling pathways that help to coordinate the cell's
activities, drive the cell cycle, and relay signals into the cell from the cell's
environment. Many of the extracellular signals involved need to be both integrated and
amplified by the cell. Individual protein kinases (and other signaling proteins)
serve as input-output devices, or “microprocessors,” in the integration process.
An important part of the input to these signal-processing proteins comes from the
control that is exerted by phosphates added and removed from them by protein 
kinases and protein phosphatases, respectively. 

The Src family of protein kinases (see Figure 3-10) exhibits such behavior. The 
Src protein (pronounced "sarc" and named for the type of tumor, a sarcoma, that 
its deregulation can cause) was the first tyrosine kinase to be discovered. It is now 
known to be part of a subfamily of nine very similar protein kinases, which are 
found only in multicellular animals. As indicated by the evolutionary tree in
Figure 3-62, sequence comparisons suggest that tyrosine kinases as a group were
a relatively late innovation that branched off from the serine/threonine kinases, 
with the Src subfamily being only one subgroup of the tyrosine kinases created in 
this way. 

The Src protein and its relatives contain a short N-terminal region that becomes
covalently linked to a strongly hydrophobic fatty acid, which anchors the kinase at
the cytoplasmic face of the plasma membrane. Next along the linear sequence of

Figure 3-63 The domain structure of the
Src family of protein kinases, mapped
along the amino acid sequence. For the
three-dimensional structure of Src, see
Figure 3-13.


amino acids come two peptide-binding domains, a Src homology 3 (SH3) domain
and an SH2 domain, followed by the kinase catalytic domain (Figure 3-63). These
kinases normally exist in an inactive conformation, in which a phosphorylated
tyrosine near the C-terminus is bound to the SH2 domain, and the SH3 domain
is bound to an internal peptide in a way that distorts the active site of the enzyme
and helps to render it inactive. 

As shown in Figure 3-64, turning the kinase on involves at least two specific
inputs: removal of the C-terminal phosphate and the binding of the SH3 domain
by a specific activating protein. In this way, the activation of the Src kinase
signals the completion of a particular set of separate upstream events (Figure 3-65).
Thus, the Src family of protein kinases serves as specific signal integrators,
contributing to the web of information-processing events that enable the cell to
compute useful responses to a complex set of different conditions.
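
The integrating behavior just described resembles a logical AND of the two inputs. The sketch below is a deliberately binary simplification: real kinase activity is graded, the text speaks of "at least" two inputs, and the function and argument names are hypothetical.

```python
def src_kinase_fully_active(c_terminal_phosphate_removed, sh3_bound_by_activator):
    """Toy logic model of a Src-type kinase as a signal integrator.

    Full activity is ASSUMED to require both inputs named in the text:
    removal of the C-terminal phosphate and binding of the SH3 domain
    by an activating protein.
    """
    return c_terminal_phosphate_removed and sh3_bound_by_activator


# Print the full truth table for the two inputs.
for phosphate_removed in (False, True):
    for sh3_bound in (False, True):
        state = src_kinase_fully_active(phosphate_removed, sh3_bound)
        print(f"phosphate removed={phosphate_removed}, "
              f"SH3 bound={sh3_bound} -> fully active={state}")
```

Only the last row of the truth table gives full activity, which is the sense in which the kinase reports that both upstream events have occurred.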

Proteins That Bind and Hydrolyze GTP Are Ubiquitous Cell 
Regulators 

We have described how the addition or removal of phosphate groups on a protein
can be used by a cell to control the protein's activity. In the example just discussed,
a kinase transfers a phosphate from an ATP molecule to an amino acid side chain
of a target protein. Eukaryotic cells also have another way to control protein
activity by phosphate addition and removal. In this case, the phosphate is not attached
directly to the protein; instead, it is a part of the guanine nucleotide GTP, which
binds very tightly to a class of proteins known as GTP-binding proteins. In general,
proteins regulated in this way are in their active conformations with GTP bound.
The loss of a phosphate group occurs when the bound GTP is hydrolyzed to GDP
in a reaction catalyzed by the protein itself, and in its GDP-bound state the protein
is inactive. In this way, GTP-binding proteins act as on-off switches whose activity
is determined by the presence or absence of an additional phosphate on a bound
GDP molecule (Figure 3-66).

GTP-binding proteins (also called GTPases because of the GTP hydrolysis 
they catalyze) comprise a large family of proteins that all contain variations on
the same GTP-binding globular domain. When a tightly bound GTP is hydrolyzed 
by the GTP-binding protein to GDP, this domain undergoes a conformational 

Figure 3-64 The activation of a Src-type protein kinase by two sequential events. As described in the text, the requirement for multiple
upstream events to trigger these processes allows the kinase to serve as a signal integrator (Movie 3.11). (Adapted from S.C. Harrison et al., Cell
112:737-740, 2003. With permission from Elsevier.)

Figure 3-65 How a Src-type protein kinase acts as a signal-integrating
device. A disruption of the inhibitory interaction illustrated for the SH3
domain (green) occurs when its binding to the indicated orange linker region
is replaced with its higher-affinity binding to an activating ligand.


change that inactivates the protein. The three-dimensional structure of a
prototypical member of this family, the monomeric GTPase called Ras, is shown in
Figure 3-67.

The Ras protein has an important role in cell signaling (discussed in Chapter
15). In its GTP-bound form, it is active and stimulates a cascade of protein
phosphorylations in the cell. Most of the time, however, the protein is in its inactive,
GDP-bound form. It becomes active when it exchanges its GDP for a GTP
molecule in response to extracellular signals, such as growth factors, that bind to
receptors in the plasma membrane (see Figure 15-47).

Regulatory Proteins GAP and GEF Control the Activity of GTP-Binding
Proteins by Determining Whether GTP or GDP Is Bound

GTP-binding proteins are controlled by regulatory proteins that determine
whether GTP or GDP is bound, just as phosphorylated proteins are turned on
and off by protein kinases and protein phosphatases. Thus, Ras is inactivated by a
GTPase-activating protein (GAP), which binds to the Ras protein and induces Ras
to hydrolyze its bound GTP molecule to GDP (which remains tightly bound)
and inorganic phosphate (Pi), which is rapidly released. The Ras protein stays in
its inactive, GDP-bound conformation until it encounters a guanine nucleotide
exchange factor (GEF), which binds to GDP-Ras and causes Ras to release its GDP.
Because the empty nucleotide-binding site is immediately filled by a GTP
molecule (GTP is present in large excess over GDP in cells), the GEF activates Ras
by indirectly adding back the phosphate removed by GTP hydrolysis. Thus, in a
sense, the roles of GAP and GEF are analogous to those of a protein phosphatase
and a protein kinase, respectively (Figure 3-68).
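
The GAP and GEF cycle can be summarized as a two-state switch. The sketch below is a minimal illustration under stated simplifications: it ignores the slow intrinsic hydrolysis and exchange reactions, treats each encounter as instantaneous, and uses hypothetical class and method names.

```python
from dataclasses import dataclass


@dataclass
class GTPBindingProtein:
    """Toy model of a Ras-like molecular switch."""

    nucleotide: str = "GDP"  # the bound nucleotide, either "GTP" or "GDP"

    @property
    def active(self):
        # The protein is taken to be active only with GTP bound.
        return self.nucleotide == "GTP"

    def encounter_gap(self):
        """GAP induces hydrolysis of bound GTP; GDP stays bound, Pi leaves."""
        if self.nucleotide == "GTP":
            self.nucleotide = "GDP"

    def encounter_gef(self):
        """GEF causes release of GDP; the empty site is then refilled by GTP,
        which is assumed to be in large excess over GDP."""
        if self.nucleotide == "GDP":
            self.nucleotide = "GTP"


ras = GTPBindingProtein()
print("start:", ras.nucleotide, "active =", ras.active)
ras.encounter_gef()
print("after GEF:", ras.nucleotide, "active =", ras.active)
ras.encounter_gap()
print("after GAP:", ras.nucleotide, "active =", ras.active)
```

In this picture the GEF plays the part of the "on" input and the GAP the part of the "off" input, mirroring the kinase and phosphatase of a phosphorylation switch.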

[Figure 3-65 output label: Src-type protein kinase activity turns on fully only if the answers to all of the above questions are yes.]


Proteins Can Be Regulated by the Covalent Addition of Other 
Proteins 


Cells contain a special family of small proteins whose members are covalently
attached to many other proteins to determine the activity or fate of the second
protein. In each case, the carboxyl end of the small protein becomes linked to
the amino group of a lysine side chain of a “target” protein through an isopeptide
bond. The first such protein discovered, and the most abundantly used, is
ubiquitin (Figure 3-69A). Ubiquitin can be covalently attached to target proteins in a
variety of ways, each of which has a different meaning for cells. The major form of
ubiquitin addition produces polyubiquitin chains in which, once the first
ubiquitin molecule is attached to the target, each subsequent ubiquitin molecule
links to Lys48 of the previous ubiquitin, creating a chain of Lys48-linked
ubiquitins that are attached to a single lysine side chain of the target protein. This form
of polyubiquitin directs the target protein to the interior of a proteasome, where it
is digested to small peptides (see Figure 6-84). In other circumstances, only single
molecules of ubiquitin are added to proteins. In addition, some target proteins are



Figure 3-66 GTP-binding proteins 
as molecular switches. The activity 
of a GTP-binding protein (also called a 
GTPase) generally requires the presence 
of a tightly bound GTP molecule (switch
“on”). Hydrolysis of this GTP molecule
by the GTP-binding protein produces
GDP and inorganic phosphate (Pi), and it
causes the protein to convert to a different,
usually inactive, conformation (switch “off”).
Resetting the switch requires that the tightly
bound GDP dissociates. This is a slow
step that is greatly accelerated by specific
signals; once the GDP has dissociated, a
molecule of GTP is quickly rebound.

modified with a different type of polyubiquitin chain. These modifications have
different functional consequences for the protein that is targeted (Figure 3-69B).

Related structures are created when a different member of the ubiquitin
family, such as SUMO (small ubiquitin-related modifier), is covalently attached to a
lysine side chain of target proteins. Not surprisingly, all such modifications are
reversible. Cells contain sets of ubiquitylating and deubiquitylating (and
sumoylating and desumoylating) enzymes that manipulate these covalent adducts,
thereby playing roles analogous to the protein kinases and phosphatases that add
and remove phosphates from protein side chains.

An Elaborate Ubiquitin-Conjugating System Is Used to Mark 
Proteins 




How do cells select target proteins for ubiquitin addition? As an initial step, the 
carboxyl end of ubiquitin needs to be activated. This activation is accomplished 
when a protein called a ubiquitin-activating enzyme (E1) uses ATP hydrolysis
energy to attach ubiquitin to itself through a high-energy covalent bond (a
thioester). E1 then passes this activated ubiquitin to one of a set of
ubiquitin-conjugating (E2) enzymes, each of which acts in conjunction with a set of accessory
(E3) proteins called ubiquitin ligases. There are roughly 30 structurally similar
but distinct E2 enzymes in mammals, and hundreds of different E3 proteins that
form complexes with specific E2 enzymes.

Figure 3-70 illustrates the process used to mark proteins for proteasomal
degradation. [Similar mechanisms are used to attach ubiquitin (and SUMO) to
other types of target proteins.] Here, the ubiquitin ligase binds to specific
degradation signals, called degrons, in protein substrates, thereby helping E2 to form a
polyubiquitin chain linked to a lysine of the substrate protein. This
polyubiquitin chain on a target protein will then be recognized by a specific receptor in the
proteasome, causing the target protein to be destroyed. Distinct ubiquitin ligases
recognize different degradation signals, thereby targeting distinct subsets of
intracellular proteins for destruction, often in response to specific signals (see
Figure 6-86).


Figure 3-67 The structure of the Ras 
protein in its GTP-bound form. This 
monomeric GTPase illustrates the structure
of a GTP-binding domain, which is present
in a large family of GTP-binding proteins.
The red regions change their conformation
when the GTP molecule is hydrolyzed
to GDP and inorganic phosphate by the
protein; the GDP remains bound to the
protein, while the inorganic phosphate is
released. The special role of the “switch
helix” in proteins related to Ras is
explained in the text (see Figure 3-72 
and Movie 15.7). 



[Figure 3-68 panels: SIGNALING BY PHOSPHORYLATED PROTEIN; SIGNALING BY GTP-BINDING PROTEIN]


Figure 3-68 A comparison of two major intracellular signaling mechanisms in eukaryotic 
cells. In both cases, a signaling protein is activated by the addition of a phosphate group and 
inactivated by the removal of this phosphate. Note that the addition of a phosphate to a protein 
can also be inhibitory. (Adapted from E.R. Kantrowitz and W.N. Lipscomb, Trends Biochem. Sci. 
15:53-59, 1990.) 

Protein Complexes with Interchangeable Parts Make Efficient Use
of Genetic Information 

The SCF ubiquitin ligase is a protein complex that binds different “target proteins”
at different times in the cell cycle, covalently adding polyubiquitin polypeptide
chains to these targets. Its C-shaped structure is formed from five protein
subunits, the largest of which serves as a scaffold on which the rest of the complex
is built. The structure underlies a remarkable mechanism (Figure 3-71). At one
end of the C is an E2 ubiquitin-conjugating enzyme. At the other end is a
substrate-binding arm, a subunit known as an F-box protein. These two subunits are
separated by a gap of about 5 nm. When this protein complex is activated, the
F-box protein binds to a specific site on a target protein, positioning the protein
in the gap so that some of its lysine side chains contact the ubiquitin-conjugating
enzyme. The enzyme can then catalyze repeated additions of ubiquitin
polypeptide to these lysines (see Figure 3-71C), producing polyubiquitin chains that mark
the target proteins for rapid destruction in a proteasome.


Figure 3-69 The marking of proteins 
by ubiquitin. (A) The three-dimensional 
structure of ubiquitin, a small protein of 76 
amino acids. A family of special enzymes
couples its carboxyl end to the amino
group of a lysine side chain in a target
protein molecule, forming an isopeptide
bond. (B) Some modification patterns that
have specific meanings to the cell. Note
that the two types of polyubiquitylation
differ in the way the ubiquitin molecules
are linked together. Linkage through
Lys48 signifies degradation by the
proteasome (see Figure 6-84), whereas
that through Lys63 has other meanings. 
Ubiquitin markings are “read” by proteins 
that specifically recognize each type of 
modification. 

Figure 3-70 The marking of proteins with ubiquitin. (A) The C-terminus of ubiquitin is initially activated by being linked via a high-energy thioester
bond to a cysteine side chain on the E1 protein. This reaction requires ATP, and it proceeds via a covalent AMP-ubiquitin intermediate. The
activated ubiquitin on E1, also known as the ubiquitin-activating enzyme, is then transferred to the cysteine on an E2 molecule. (B) The addition of
a polyubiquitin chain to a target protein. In a mammalian cell, there are several hundred distinct E2-E3 complexes. The E2s are called
ubiquitin-conjugating enzymes. The E3s are referred to as ubiquitin ligases. (Adapted from D.R. Knighton et al., Science 253:407-414, 1991.)

In this manner, specific proteins are targeted for rapid destruction in response
to specific signals, thereby helping to drive the cell cycle (discussed in Chapter 17).
The timing of the destruction often involves creating a specific pattern of
phosphorylation on the target protein that is required for its recognition by the F-box
subunit. It also requires the activation of an SCF ubiquitin ligase that carries the 
appropriate substrate-binding arm. Many of these arms (the F-box subunits) are 
interchangeable in the protein complex (see Figure 3-71B), and there are more 
than 70 human genes that encode them. 

As emphasized previously, once a successful protein has evolved, its genetic 
information tends to be duplicated to produce a family of related proteins. Thus, 
for example, not only are there many F-box proteins (making possible the
recognition of different sets of target proteins), but there is also a family of scaffolds
(known as cullins) that give rise to a family of SCF-like ubiquitin ligases. 

A protein machine like the SCF ubiquitin ligase, with its interchangeable parts,
makes economical use of the genetic information in cells. It also creates
opportunities for “rapid” evolution, inasmuch as new functions can evolve for the entire
complex simply by producing an alternative version of one of its subunits.
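
The idea of interchangeable parts can be pictured with a small data-structure sketch. It is only an analogy: the subunit names are those given in the caption of Figure 3-71, but the class, its fields, and the notion of swapping a field are hypothetical conveniences, not a model of how the complex assembles.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class SCFLigase:
    """Toy record of the E3 portion of an SCF ubiquitin ligase."""

    scaffold: str    # the cullin subunit
    adaptors: tuple  # the two adaptor proteins
    f_box: str       # the interchangeable substrate-binding arm


# One shared core, described once.
with_skp2 = SCFLigase(
    scaffold="Cul1",
    adaptors=("Rbx1/Hrt1", "Skp1"),
    f_box="Skp2",
)

# A second ligase is obtained by exchanging a single part.
with_beta_trcp1 = replace(with_skp2, f_box="beta-trCP1")

print(with_skp2)
print(with_beta_trcp1)
```

Each new F-box gene yields a ligase with a new substrate-binding arm while the genes for the scaffold and adaptors are reused unchanged.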

Ubiquitin ligases form a diverse family of protein complexes. Some of these 
complexes are far larger and more complicated than SCF, but their underlying 
enzymatic function remains the same (Figure 3-71D). 

A GTP-Binding Protein Shows How Large Protein Movements 
Can Be Generated 

Detailed structures obtained for one of the GTP-binding protein family members, 
the EF-Tu protein, provide a good example of how allosteric changes in protein 
conformations can produce large movements by amplifying a small, local
conformational change. As will be discussed in Chapter 6, EF-Tu is an abundant
molecule that serves as an elongation factor (hence the EF) in protein synthesis,
loading each aminoacyl-tRNA molecule onto the ribosome. EF-Tu contains a Ras-like
domain (see Figure 3-67), and the tRNA molecule forms a tight complex with its
GTP-bound form. This tRNA molecule can transfer its amino acid to the growing


Figure 3-71 The structure and mode of 
action of an SCF ubiquitin ligase. (A) The
structure of the five-protein ubiquitin ligase
complex that includes an E2
ubiquitin-conjugating enzyme. Four proteins form the
E3 portion. The protein denoted here as
adaptor protein 1 is the Rbx1/Hrt1 protein,
adaptor protein 2 is the Skp1 protein, and
the cullin is the Cul1 protein. One of the
many different F-box proteins completes
the complex. (B) Comparison of the same
complex with two different substrate-binding
arms, the F-box proteins Skp2 (top) and
β-trCP1 (bottom), respectively. (C) The
binding and ubiquitylation of a target protein
by the SCF ubiquitin ligase. If, as indicated,
a chain of ubiquitin molecules is added to
the same lysine of the target protein, that
protein is marked for rapid destruction by the
proteasome. (D) Comparison of SCF (bottom)
with a low-resolution electron microscopy
structure of a ubiquitin ligase called the 
anaphase-promoting complex (APC/C; top) 
at the same scale. The APC/C is a large, 
15-protein complex. As discussed in Chapter 
17, its ubiquitylations control the late stages 
of mitosis. It is distantly related to SCF and 
contains a cullin subunit (green) that lies along 
the side of the complex at right, only partly 
visible in this view. E2 proteins are not shown 
here, but their binding sites are indicated in 
orange, along with substrate-binding sites 
in purple. (A and B, adapted from G. Wu 
et al., Mol. Cell 11:1445-1456, 2003. With 
permission from Elsevier; D, adapted from 
P. da Fonseca et al., Nature 470:274-278,
2011. With permission from Macmillan
Publishers Ltd.) 

Figure 3-72 The large conformational change in EF-Tu caused by GTP hydrolysis. (A and B) The three-dimensional
structure of EF-Tu with GTP bound. The domain at the top has a structure similar to the Ras protein, and its red α helix is the
switch helix, which moves after GTP hydrolysis. (C) The change in the conformation of the switch helix in domain 1 allows
domains 2 and 3 to rotate as a single unit by about 90 degrees toward the viewer, which releases the tRNA that was bound to 
this structure (see also Figure 3-73). (A, adapted from H. Berchtold et al., Nature 365:126-132, 1993. With permission from 
Macmillan Publishers Ltd. B, courtesy of Mathias Sprinzl and Rolf Hilgenfeld. PDB code: 1EFT.) 


polypeptide chain only after the GTP bound to EF-Tu is hydrolyzed, dissociating 
the EF-Tu. Since this GTP hydrolysis is triggered by a proper fit of the tRNA to the 
mRNA molecule on the ribosome, the EF-Tu serves as a factor that discriminates
between correct and incorrect mRNA-tRNA pairings (see Figure 6-65). 

By comparing the three-dimensional structure of EF-Tu in its GTP-bound and 
GDP-bound forms, we can see how the repositioning of the tRNA occurs. The 
dissociation of the inorganic phosphate group (Pi), which follows the reaction 
GTP → GDP + Pi, causes a shift of a few tenths of a nanometer at the GTP-binding
site, just as it does in the Ras protein. This tiny movement, equivalent to a few
times the diameter of a hydrogen atom, causes a conformational change to
propagate along a crucial piece of α helix, called the switch helix, in the Ras-like domain
of the protein. The switch helix seems to serve as a latch that adheres to a specific
site in another domain of the molecule, holding the protein in a “shut”
conformation. The conformational change triggered by GTP hydrolysis causes the switch
helix to detach, allowing separate domains of the protein to swing apart, through
a distance of about 4 nm (Figure 3-72). This releases the bound tRNA molecule,
allowing its attached amino acid to be used (Figure 3-73).

Notice in this example how cells have exploited a simple chemical change that
occurs on the surface of a small protein domain to create a movement 50 times 
larger. Dramatic shape changes of this type also cause the very large movements 
that occur in motor proteins, as we discuss next. 

Motor Proteins Produce Large Movements in Cells 

We have seen that conformational changes in proteins have a central role in
enzyme regulation and cell signaling. We now discuss proteins whose major
function is to move other molecules. These motor proteins generate the forces
responsible for muscle contraction and the crawling and swimming of cells.
Motor proteins also power smaller-scale intracellular movements: they help to
move chromosomes to opposite ends of the cell during mitosis (discussed in
Chapter 17), to move organelles along molecular tracks within the cell (discussed
in Chapter 16), and to move enzymes along a DNA strand during the synthesis of
a new DNA molecule (discussed in Chapter 5). All these fundamental processes
depend on proteins with moving parts that operate as force-generating machines.

How do these machines work? In other words, how do cells use shape changes
in proteins to generate directed movements? If, for example, a protein is required
to walk along a narrow thread such as a DNA molecule, it can do this by
undergoing a series of conformational changes, such as those shown in Figure 3-74.
But with nothing to drive these changes in an orderly sequence, they are perfectly 

Figure 3-73 An aminoacyl tRNA molecule
bound to EF-Tu. Note how the bound 
protein blocks the use of the tRNA-linked 
amino acid (green) for protein synthesis until 
GTP hydrolysis triggers the conformational 
changes shown in Figure 3-72C, 
dissociating the protein-tRNA complex. 
EF-Tu is a bacterial protein; however, a very 
similar protein exists in eukaryotes, where 
it is called EF-1 (Movie 3.12). (Coordinates 
determined by P. Nissen et al., Science 
270:1464-1472, 1995. PDB code: 1B23.) 

reversible, and the protein can only wander randomly back and forth along the
thread. We can look at this situation in another way. Since the directional
movement of a protein does work, the laws of thermodynamics (discussed in Chapter 2)
demand that such movement use free energy from some other source (otherwise
the protein could be used to make a perpetual motion machine). Therefore,
without an input of energy, the protein molecule can only wander aimlessly.

How can the cell make such a series of conformational changes unidirectional?
To force the entire cycle to proceed in one direction, it is enough to make any one
of the changes in shape irreversible. Most proteins that are able to walk in one
direction for long distances achieve this motion by coupling one of the
conformational changes to the hydrolysis of an ATP molecule that is tightly bound to the
protein. The mechanism is similar to the one just discussed that drives allosteric
protein shape changes by GTP hydrolysis. Because ATP (or GTP) hydrolysis
releases a great deal of free energy, it is very unlikely that the nucleotide-binding
protein will undergo the reverse shape change needed for moving backward,
since this would require that it also reverse the ATP hydrolysis by adding a
phosphate molecule to ADP to form ATP.

In the model shown in Figure 3-75A, ATP binding shifts a motor protein from
conformation 1 to conformation 2. The bound ATP is then hydrolyzed to produce
ADP and inorganic phosphate (Pi), causing a change from conformation 2 to
conformation 3. Finally, the release of the bound ADP and Pi drives the protein
back to conformation 1. Because the energy provided by ATP hydrolysis drives
the transition 2 → 3, this series of conformational changes is effectively
irreversible. Thus, the entire cycle goes in only one direction, causing the
protein molecule to walk continuously to the right in this example.
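
The contrast between the random wandering of Figure 3-74 and the ratchet of
Figure 3-75A can be illustrated with a toy simulation. The sketch below is not
taken from the text: it assumes a protein that attempts a forward or a backward
conformational change with equal probability at each time step, and it models
ATP hydrolysis simply by forbidding the reverse of the 2 → 3 transition. The
function name net_steps, the number of ticks, and the 50:50 probabilities are
illustrative assumptions, not measured kinetics.

```python
import random

def net_steps(n_ticks, ratchet, seed=0):
    """Toy model of a protein cycling through conformations 1 -> 2 -> 3 -> 1.

    One full forward cycle corresponds to one step to the right along the
    thread. At every tick the protein attempts a forward or a backward
    transition with equal probability (an assumption of this sketch).
    If `ratchet` is True, the backward transition 3 -> 2 is forbidden,
    standing in for the irreversibility of ATP hydrolysis.
    Returns the net displacement in steps (net forward transitions / 3).
    """
    rng = random.Random(seed)
    state = 1
    progress = 0  # net number of forward conformational transitions
    for _ in range(n_ticks):
        if rng.random() < 0.5:
            state = state % 3 + 1            # forward: 1->2, 2->3, 3->1
            progress += 1
        else:
            if ratchet and state == 3:
                continue                     # cannot undo the hydrolysis step
            state = (state - 2) % 3 + 1      # backward: 1->3, 2->1, 3->2
            progress -= 1
    return progress / 3

if __name__ == "__main__":
    print("no energy input:   ", net_steps(3000, ratchet=False))
    print("ATP-driven ratchet:", net_steps(3000, ratchet=True))
```

Under these assumptions the unratcheted protein typically ends within a few
tens of steps of its starting point, whereas the ratcheted protein accumulates
a net displacement of hundreds of steps, because every passage through
conformation 3 locks in the progress made so far.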

Many motor proteins generate directional movement through the use of 
a similar unidirectional ratchet, including the muscle motor protein myosin, 




Figure 3-74 An allosteric “walking” 
protein. Although its three different 
conformations allow it to wander randomly 
back and forth while bound to a thread or a 
filament, the protein cannot move uniformly 
in a single direction. 


Figure 3-75 How a protein can walk
in one direction. (A) An allosteric
motor protein driven by ATP hydrolysis.
The transition between three different
conformations includes a step driven by
the hydrolysis of a bound ATP molecule,
creating a “unidirectional ratchet” that
makes the entire cycle essentially
irreversible. By repeated cycles, the protein
therefore moves continuously to the right
along the thread. (B) Direct visualization
of a walking myosin motor protein by
high-speed atomic force microscopy; the
elapsed time between steps was less than
0.5 sec (see Movie 16.3). (B, modified from
N. Kodera et al., Nature 468:72-76, 2010.
With permission from Macmillan
Publishers Ltd.)
which walks along actin filaments (Figure 3-75B), and the kinesin proteins that
walk along microtubules (both discussed in Chapter 16). These movements can
be rapid: some of the motor proteins involved in DNA replication (the DNA
helicases) propel themselves along a DNA strand at rates as high as 1000
nucleotides per second.

Membrane-Bound Transporters Harness Energy to Pump
Molecules Through Membranes

We have thus far seen how proteins that undergo allosteric shape changes can act
as microprocessors (Src family kinases), as assembly factors (EF-Tu), and as
generators of mechanical force and motion (motor proteins). Allosteric proteins
can also harness energy derived from ATP hydrolysis, ion gradients, or
electron-transport processes to pump specific ions or small molecules across a
membrane. We consider one example here that will be discussed in more detail in
Chapter 11.

The ABC transporters (ATP-binding cassette transporters) constitute an
important class of membrane-bound pump proteins. In humans, at least 48
different genes encode them. These transporters mostly function to export
hydrophobic molecules from the cytoplasm, serving to remove toxic molecules at
the mucosal surface of the intestinal tract, for example, or at the blood-brain
barrier. The study of ABC transporters is of intense interest in clinical
medicine, because the overproduction of proteins in this class contributes to
the resistance of tumor cells to chemotherapeutic drugs. In bacteria, the same
types of proteins primarily function to import essential nutrients into the
cell.

A typical ABC transporter contains a pair of membrane-spanning subunits
linked to a pair of ATP-binding subunits located just below the plasma
membrane. As in other examples we have discussed, the hydrolysis of the bound
ATP molecules drives conformational changes in the protein, transmitting forces
that cause the membrane-spanning subunits to move their bound molecules across
the lipid bilayer (Figure 3-76).

Humans have invented many different types of mechanical pumps, and it
should not be surprising that cells also contain membrane-bound pumps that 

Figure 3-76 The ABC (ATP-binding
cassette) transporter, a protein machine
that pumps molecules through a
membrane. (A) How this large family of
transporters pumps molecules into the
cell in bacteria. As indicated, the binding
of two molecules of ATP causes the two
ATP-binding domains to clamp together
tightly, opening a channel to the cell exterior.
The binding of a substrate molecule to the
extracellular face of the protein complex then
triggers ATP hydrolysis followed by ADP
release, which opens the cytoplasmic gate;
the pump is then reset for another cycle.
(B) As discussed in Chapter 11, in
eukaryotes an opposite process occurs,
causing selected substrate molecules to be
pumped out of the cell. (C) The structure
of a bacterial ABC transporter (see Movie
11.5). (C, from R.J. Dawson and K.P. Locher,
Nature 443:180-185, 2006. With permission
from Macmillan Publishers Ltd;
PDB code: 2HYD.)
function in other ways. Among the most notable are the rotary pumps that couple
the hydrolysis of ATP to the transport of H⁺ ions (protons). These pumps
resemble miniature turbines, and they are used to acidify the interior of
lysosomes and other eukaryotic organelles. Like other ion pumps that create ion
gradients, they can function in reverse to catalyze the reaction ADP + Pi → ATP,
if the gradient across their membrane of the ion that they transport is steep
enough.

One such pump, the ATP synthase, harnesses a gradient of proton concentration
produced by electron-transport processes to produce most of the ATP used in
the living world. This ubiquitous pump has a central role in energy conversion,
and we shall discuss its three-dimensional structure and mechanism in
Chapter 14.


Proteins Often Form Large Complexes That Function as Protein 
Machines 

Large proteins formed from many domains are able to perform more elaborate
functions than small, single-domain proteins. But large protein assemblies
formed from many protein molecules linked together by noncovalent bonds perform
the most impressive tasks. Now that it is possible to reconstruct most
biological processes in cell-free systems in the laboratory, it is clear that
each of the central processes in a cell (such as DNA replication, protein
synthesis, vesicle budding, or transmembrane signaling) is catalyzed by a
highly coordinated, linked set of 10 or more proteins. In most such protein
machines, an energetically favorable reaction such as the hydrolysis of bound
nucleoside triphosphates (ATP or GTP) drives an ordered series of
conformational changes in one or more of the individual protein subunits,
enabling the ensemble of proteins to move coordinately. In this way, each
enzyme can be moved directly into position, as the machine catalyzes successive
reactions in a series (Figure 3-77). This is what occurs, for example, in
protein synthesis on a ribosome (discussed in Chapter 6) or in DNA replication,
where a large multiprotein complex moves rapidly along the DNA (discussed in
Chapter 5).

Cells have evolved protein machines for the same reason that humans have
invented mechanical and electronic machines. For accomplishing almost any
task, manipulations that are spatially and temporally coordinated through linked
processes are much more efficient than the use of many separate tools.

Scaffolds Concentrate Sets of Interacting Proteins 

As scientists have learned more of the details of cell biology, they have
recognized an increasing degree of sophistication in cell chemistry. Thus, not
only do we now know that protein machines play a predominant role, but it has
also become clear that they are very often localized to specific sites in the
cell, being assembled and activated only where and when they are needed. As one
example, when extracellular signaling molecules bind to receptor proteins in
the plasma membrane, the activated receptors often recruit a set of other
proteins to the inside surface of the plasma membrane to form a large protein
complex that passes the signal on (discussed in Chapter 15).

The mechanisms frequently involve scaffold proteins. These are proteins with
binding sites for multiple other proteins, and they serve both to link together
specific sets of interacting proteins and to position them at specific
locations inside a cell. At one extreme are rigid scaffolds, such as the cullin
in SCF ubiquitin ligase (see Figure 3-71). At the other extreme are the large,
flexible scaffold proteins that often underlie regions of specialized plasma
membrane. These include the

Figure 3-77 How “protein machines” carry out complex functions.
These machines are made of individual proteins that collaborate to perform
a specific task (Movie 3.13). The movement of these proteins is often
coordinated by the hydrolysis of a bound nucleotide such as ATP or GTP.
Directional allosteric conformational changes of proteins that are driven in this
way often occur in a large protein assembly in which the activities of several
different protein molecules are coordinated by such movements within the
complex.
Discs-large protein (Dlg), a protein of about 900 amino acids that is
concentrated in special regions beneath the plasma membrane in epithelial cells
and at synapses. Dlg contains binding sites for at least seven other proteins,
interspersed with regions of more flexible polypeptide chain. An ancient
protein, conserved in organisms as diverse as sponges, worms, flies, and
humans, Dlg derives its name from the mutant phenotype of the organism in which
it was first discovered; the cells in the imaginal discs of a Drosophila embryo
with a mutation in the Dlg gene fail to stop proliferating when they should,
and they produce unusually large discs whose epithelial cells can form tumors.

Although incompletely studied, Dlg and a large number of similar scaffold
proteins are thought to function like the protein that is schematically
illustrated in Figure 3-78. By binding a specific set of interacting proteins,
these scaffolds can enhance the rate of critical reactions, while also
confining them to the particular region of the cell that contains the scaffold.
For similar reasons, cells also make extensive use of scaffold RNA molecules,
as discussed in Chapter 7.


Figure 3-78 How the proximity created
by scaffold proteins can greatly speed
reactions in a cell. In this example, long
unstructured regions of polypeptide chain
in a large scaffold protein connect a series
of structured domains that bind a set of
reacting proteins. The unstructured regions
serve as flexible “tethers” that greatly speed
reaction rates by causing a rapid, random
collision of all of the proteins that are bound
to the scaffold. (For specific examples of
protein tethering, see Figure 3-54 and
Figure 16-18; for scaffold RNA molecules,
see Figure 7-49B.)


Many Proteins Are Controlled by Covalent Modifications That
Direct Them to Specific Sites Inside the Cell

We have thus far described only a few ways in which proteins are
post-translationally modified. A large number of other such modifications also
occur, more than 200 distinct types being known. To give a sense of the
variety, Table 3-3 presents


Table 3-3 Some Molecules Covalently Attached to Proteins Regulate Protein
Function

Modifying group                  Some prominent functions

Phosphate on Ser, Thr, or Tyr    Drives the assembly of a protein into larger
                                 complexes (see Figure 15-11)

Methyl on Lys                    Helps to create distinct regions in chromatin
                                 through forming either mono-, di-, or
                                 trimethyl lysine in histones (see Figure 4-36)

Acetyl on Lys                    Helps to activate genes in chromatin by
                                 modifying histones (see Figure 4-33)

Palmityl group on Cys            This fatty acid addition drives protein
                                 association with membranes (see Figure 10-18)

N-acetylglucosamine on           Controls enzyme activity and gene expression
Ser or Thr                       in glucose homeostasis

Ubiquitin on Lys                 Monoubiquitin addition regulates the transport
                                 of membrane proteins in vesicles (see
                                 Figure 13-50)

                                 A polyubiquitin chain targets a protein for
                                 degradation (see Figure 3-70)

Ubiquitin is a 76-amino-acid polypeptide; there are at least 10 other
ubiquitin-related proteins in mammalian cells.
Figure 3-79 Multisite protein modification
and its effects. (A) A protein that carries 
a post-translational addition to more than 
one of its amino acid side chains can 
be considered to carry a combinatorial 
regulatory code. Multisite modifications 
are added to (and removed from) a protein 
through signaling networks, and the 
resulting combinatorial regulatory code on 
the protein is read to alter its behavior in 
the cell. (B) The pattern of some covalent 
modifications to the protein p53. 
a few of the modifying groups with known regulatory roles. As in phosphate
and ubiquitin additions described previously, these groups are added and then
removed from proteins according to the needs of the cell.

A large number of proteins are now known to be modified on more than one
amino acid side chain, with different regulatory events producing a different
pattern of such modifications. A striking example is the protein p53, which
plays a central part in controlling a cell's response to adverse circumstances
(see Figure 17-62). Through one of four different types of molecular additions,
this protein can be modified at 20 different sites. Because an enormous number
of different combinations of these 20 modifications are possible, the protein's
behavior can in principle be altered in a huge number of ways. Such
modifications will often create a site on the modified protein that binds it to
a scaffold protein in a specific region of the cell, thereby connecting it, via
the scaffold, to the other proteins required for a reaction at that site.
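
To get a feel for the size of this combinatorial space, one can count the
possible modification patterns. The sketch below assumes, for simplicity, that
each site is independently either modified or unmodified, so that n sites give
2^n patterns; real regulation need not treat the sites as independent. The
function names are illustrative.

```python
from itertools import product

def count_patterns(n_sites):
    """Number of modified/unmodified patterns for n independent sites."""
    return 2 ** n_sites

def list_patterns(n_sites):
    """Enumerate every pattern as a tuple of booleans (True = modified).

    Only practical for small n, since the list doubles with each added site.
    """
    return list(product((False, True), repeat=n_sites))

if __name__ == "__main__":
    print(count_patterns(20))        # patterns for the 20 sites on p53
    print(len(list_patterns(3)))     # a small case that can be listed in full
```

Under this assumption, 20 sites allow 2^20 = 1,048,576 distinct patterns, which
gives a sense of what "an enormous number of different combinations" means here.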

One can view each protein's set of covalent modifications as a combinatorial
regulatory code. Specific modifying groups are added to or removed from a
protein in response to signals, and the code then alters protein behavior,
changing the activity or stability of the protein, its binding partners, and/or
its specific location within the cell (Figure 3-79). As a result, the cell is
able to respond rapidly and with great versatility to changes in its condition
or environment.

A Complex Network of Protein Interactions Underlies Cell Function 

There are many challenges facing cell biologists in this information-rich era
when a large number of complete genome sequences are known. One is the need to
dissect and reconstruct each one of the thousands of protein machines that exist
in an organism such as ourselves. To understand these remarkable protein
complexes, each will need to be reconstituted from its purified protein parts,
so that we can study its detailed mode of operation under controlled conditions
in a test tube, free from all other cell components. This alone is a massive
task. But we now know that each of these subcomponents of a cell also interacts
with other sets of macromolecules, creating a large network of protein-protein
and protein-nucleic acid interactions throughout the cell. To understand the
cell, therefore, we will need to analyze most of these other interactions as
well.
We can gain some idea of the complexity of intracellular protein networks
from a particularly well-studied example described in Chapter 16: the many
dozens of proteins that interact with the actin cytoskeleton to control actin
filament behavior (see Panel 16-3, p. 905).

The extent of such protein-protein interactions can also be estimated more
generally. An enormous amount of valuable information is now freely available in
protein databases on the Internet: tens of thousands of three-dimensional protein
structures plus tens of millions of protein sequences derived from the nucleotide
sequences of genes. Scientists have been developing new methods for mining
this great resource to increase our understanding of cells. In particular,
computer-based bioinformatics tools are being combined with robotics and other
technologies to allow thousands of proteins to be investigated in a single set
of experiments. Proteomics is a term that is often used to describe such
research focused on the analysis of large sets of proteins, analogous to the
term genomics describing the large-scale analysis of DNA sequences and genes.

A biochemical method based on affinity tagging and mass spectroscopy
has proven especially powerful for determining the direct binding interactions
between the many different proteins in a cell (discussed in Chapter 8). The results
are being tabulated and organized in Internet databases. This allows a cell
biologist studying a small set of proteins to readily discover which other
proteins in the same cell are likely to bind to, and thus interact with, that
set of proteins. When displayed graphically as a protein interaction map, each
protein is represented by a box or dot in a two-dimensional network, with a
straight line connecting those proteins that have been found to bind to each
other.
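
In computational terms, a protein interaction map of this kind is an undirected
graph. The sketch below shows one minimal way to store such a map and to look
up the binding partners of a small set of proteins. The protein labels and the
interaction list are made up for illustration and do not come from any
database; the function names are likewise hypothetical.

```python
from collections import defaultdict

def build_map(interactions):
    """Build an undirected interaction map from (protein, protein) pairs."""
    partners = defaultdict(set)
    for a, b in interactions:
        partners[a].add(b)
        partners[b].add(a)   # binding is symmetric
    return partners

def partners_of(query, interaction_map):
    """Return, sorted, all proteins found to bind any protein in `query`."""
    found = set()
    for protein in query:
        found |= interaction_map.get(protein, set())
    return sorted(found - set(query))

# Hypothetical observed binding interactions (illustrative labels only).
observed = [("P1", "P2"), ("P2", "P3"), ("P2", "P4"), ("P5", "P1")]
interaction_map = build_map(observed)

if __name__ == "__main__":
    print(partners_of({"P1"}, interaction_map))
    print(partners_of({"P1", "P3"}, interaction_map))
```

Each key of the map corresponds to a dot in the diagram and each stored partner
to a line, so a query on a few proteins of interest extracts the small
subsection of the map around them.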

When hundreds or thousands of proteins are displayed on the same map, the
network diagram becomes bewilderingly complicated, serving to illustrate the
enormous challenges that face scientists attempting to understand the cell
(Figure 3-80). Much more useful are small subsections of these maps, centered
on a few proteins of interest.

We have previously described the structure and mode of action of the SCF
ubiquitin ligase, using it to illustrate how protein complexes are constructed from
interchangeable parts (see Figure 3-71). Figure 3-81 shows a network of
protein-protein interactions for the five proteins that form this protein
complex in a yeast cell. Four of the subunits of this ligase are located at the
bottom right of this figure. The remaining subunit, the F-box protein that
serves as its substrate-binding arm, appears as a set of 15 different gene
products that bind to adaptor protein 2 (the Skp1 protein). Along the top and
left of the figure are sets of additional protein interactions marked with
yellow and green shading: as indicated, these protein sets function at the
origin of DNA replication, in cell cycle regulation, in methionine synthesis,
in the kinetochore, and in vacuolar H⁺-ATPase assembly. We shall use this
figure to explain how such protein interaction maps are used, and what they do
and do not mean.

1. Protein interaction maps are useful for identifying the likely function of
previously uncharacterized proteins. Examples are the products of the
genes that have thus far only been inferred to exist from the yeast genome
sequence, which are the three proteins in the figure that lack a simple
three-letter abbreviation (white letters beginning with Y). The three in this
diagram are F-box proteins that bind to Skp1; these are therefore likely to
function as part of the ubiquitin ligase, serving as substrate-binding arms
that recognize different target proteins. However, as we discuss next,
neither assignment can be considered certain without additional data.

2. Protein interaction networks need to be interpreted with caution because,
as a result of evolution making efficient use of each organism's genetic
information, the same protein can be used as part of different protein
complexes that have different types of functions. Thus, although protein A
binds to protein B and protein B binds to protein C, proteins A and C need
not function in the same process. For example, we know from detailed
biochemical studies that the functions of Skp1 in the kinetochore and in



Figure 3-80 A network of protein- 
binding interactions in a yeast cell. 

Each line connecting a pair of dots 
(proteins) indicates a protein-protein 
interaction. (From A. Guimerâ and 
M. Sales-Pardo, Mol. Syst. Biol. 2:42, 
2006. With permission from Macmillan 
Publishers Ltd.) 



Figure 3-81 A map of some protein-protein interactions of the SCF ubiquitin ligase and other proteins in the yeast
S. cerevisiae. The symbols and/or colors used for the five proteins of the ligase are those in Figure 3-71. Note that 15 different
F-box proteins are shown (purple); those with white lettering (beginning with Y) are known from the genome sequence as
open reading frames. For additional details, see text. (Courtesy of Peter Bowers and David Eisenberg, UCLA-DOE Institute for
Genomics and Proteomics, UCLA.)


vacuolar H⁺-ATPase assembly (yellow shading) are separate from its function
in the SCF ubiquitin ligase. In fact, only the remaining three functions
of Skp1 illustrated in the diagram (methionine synthesis, cell cycle
regulation, and origin of replication; green shading) involve ubiquitylation.

3. In cross-species comparisons, those proteins displaying similar patterns of
interactions in the two protein interaction maps are likely to have the same
function in the cell. Thus, as scientists generate more and more highly
detailed maps for multiple organisms, the results will become increasingly
useful for inferring protein function. These map comparisons will be a
particularly powerful tool for deciphering the functions of human proteins,
because a vast amount of direct information about protein function can
be obtained from genetic engineering, mutational, and genetic analyses in
experimental organisms (such as yeast, worms, and flies) that are not
feasible in humans.
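
Points 1 and 2 above can be sketched together as a simple "guilt by
association" tally: an uncharacterized protein is given the annotated functions
of its direct binding partners as candidates, and nothing more. The names,
interactions, and annotations below are invented for illustration, and
candidate_functions is a hypothetical helper rather than an established tool.

```python
from collections import Counter

def candidate_functions(protein, interactions, annotations):
    """Rank functions by how many direct partners of `protein` carry them.

    The result is a list of candidates to test experimentally, not a
    functional assignment: a partner that serves in several complexes
    contributes every one of its annotated functions.
    """
    partners = set()
    for a, b in interactions:
        if a == protein:
            partners.add(b)
        elif b == protein:
            partners.add(a)
    votes = Counter()
    for partner in partners:
        votes.update(annotations.get(partner, ()))
    return votes.most_common()

# Hypothetical data: "Hub" takes part in two unrelated processes.
interactions = [("Unknown1", "Hub"), ("Hub", "K1"), ("Hub", "L1"), ("Unknown1", "L2")]
annotations = {
    "Hub": ("ubiquitin ligase", "kinetochore"),
    "L2": ("ubiquitin ligase",),
}

if __name__ == "__main__":
    print(candidate_functions("Unknown1", interactions, annotations))
    print(candidate_functions("K1", interactions, annotations))
```

In this made-up example the hub protein's two roles both appear as candidates
for each of its partners, which is why such a tally can only suggest hypotheses
for the additional biochemical data that the text calls for.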

What does the future hold? There are likely to be on the order of 10,000
different proteins in a typical human cell, each of which interacts with 5 to
10 different partners. Despite the enormous progress made in recent years, we
cannot yet claim to understand even the simplest known cells, such as the small
Mycoplasma bacterium formed from only about 500 gene products (see
Figure 1-10). How then
can we hope to understand a human? Clearly, a great deal of new biochemistry
will be essential, in which each protein in a particular interacting set is purified so
that its chemistry and interactions can be dissected in a test tube. But in addition,
more powerful ways of analyzing networks will be needed based on mathematical
and computational tools not yet invented, as we shall emphasize in Chapter 8.
Clearly, there are many wonderful challenges that remain for future generations
of cell biologists.
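
The scale of the problem can be made concrete with a rough count. The sketch
below takes the figures quoted above (about 10,000 proteins, each with 5 to 10
partners) and assumes that every interaction is pairwise, so that it is counted
once from each side; the function name is illustrative.

```python
def pairwise_interactions(n_proteins, partners_per_protein):
    """Estimate the number of distinct pairwise interactions in a cell.

    Each interaction involves two proteins, so summing partners over all
    proteins counts every interaction twice; hence the division by 2.
    """
    return n_proteins * partners_per_protein // 2

if __name__ == "__main__":
    low = pairwise_interactions(10_000, 5)
    high = pairwise_interactions(10_000, 10)
    print(f"between {low:,} and {high:,} interactions")
```

Under these assumptions, a typical human cell would contain on the order of
25,000 to 50,000 distinct binding interactions to be analyzed.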

Summary 

Proteins can form enormously sophisticated chemical devices, whose functions
largely depend on the detailed chemical properties of their surfaces. Binding sites
for ligands are formed as surface cavities in which precisely positioned amino acid
side chains are brought together by protein folding. In this way, normally
unreactive amino acid side chains can be activated to make and break covalent
bonds. Enzymes are catalytic proteins that greatly speed up reaction rates by
binding the high-energy transition states for a specific reaction path; they
also can perform acid catalysis and base catalysis simultaneously. The rates of
enzyme reactions are often so fast that they are limited only by diffusion.
Rates can be further increased only if enzymes that act sequentially on a
substrate are joined into a single multienzyme complex, or if the enzymes and
their substrates are attached to protein scaffolds, or otherwise confined to
the same part of the cell.

Proteins reversibly change their shape when ligands bind to their surface. The
allosteric changes in protein conformation produced by one ligand affect the
binding of a second ligand, and this linkage between two ligand-binding sites
provides a crucial mechanism for regulating cell processes. Metabolic pathways,
for example, are controlled by feedback regulation: some small molecules
inhibit and other small molecules activate enzymes early in a pathway. Enzymes
controlled in this way generally form symmetric assemblies, allowing
cooperative conformational changes to create a steep response to changes in the
concentrations of the ligands that regulate them.

The expenditure of chemical energy can drive unidirectional changes in protein
shape. By coupling allosteric shape changes to ATP hydrolysis, for example,
proteins can do useful work, such as generating a mechanical force or moving
for long distances in a single direction. The three-dimensional structures of
proteins have revealed how a small local change caused by nucleoside
triphosphate hydrolysis is amplified to create major changes elsewhere in the
protein. By such means, these proteins can serve as input-output devices that
transmit information, as assembly factors, as motors, or as membrane-bound
pumps. Highly efficient protein machines are formed by incorporating many
different protein molecules into larger assemblies that coordinate the
allosteric movements of the individual components. Such machines perform most
of the important reactions in cells.

Proteins are subjected to many reversible, post-translational modifications, such
as the covalent addition of a phosphate or an acetyl group to a specific amino acid
side chain. The addition of these modifying groups is used to regulate the activity of
a protein, changing its conformation, its binding to other proteins, and its location
inside the cell. A typical protein in a cell will interact with more than five different
partners. Through proteomics, biologists can analyze thousands of proteins in one
set of experiments. One important result is the production of detailed protein
interaction maps, which aim at describing all of the binding interactions
between the thousands of distinct proteins in a cell. However, understanding
these networks will require new biochemistry, through which small sets of
interacting proteins can be purified and their chemistry dissected in detail.
In addition, new computational techniques will be required to deal with the
enormous complexity.


WHAT WE DON’T KNOW


• What are the functions of the 
surprisingly large amount of unfolded 
polypeptide chain found in proteins? 

• How many types of protein functions 
remain to be discovered? What are 
the most promising approaches for 
discovering them? 

• When will scientists be able to
take any amino acid sequence and
accurately predict both that protein’s
three-dimensional conformations
and its chemical properties? What
breakthroughs will be needed to
accomplish this important goal?

• Are there ways to reveal the detailed
workings of a protein machine that do
not require the purification of each of
its component parts in large amounts,
so that the machine’s functions can
be reconstituted and dissected using
chemical techniques in a test tube?

• What are the roles of the dozens
of different types of covalent
modifications of proteins that have
been found in addition to those listed
in Table 3-3? Which ones are critical
for cell function and why?

• Why is amyloid toxic to cells 
and how does it contribute to 
neurodegenerative diseases such as 
Alzheimer’s disease? 
PROBLEMS

Which statements are true? Explain why or why not. 


3-1 Each strand in a β sheet is a helix with two amino
acids per turn.

3-2 Intrinsically disordered regions of proteins can be
identified using bioinformatic methods to search genes for
encoded amino acid sequences that possess high
hydrophobicity and low net charge.

3-3 Loops of polypeptide that protrude from the surface
of a protein often form the binding sites for other
molecules.

3-4 An enzyme reaches a maximum rate at high substrate
concentration because it has a fixed number of
active sites where substrate binds.

3-5 Higher concentrations of enzyme give rise to a
higher turnover number.

3-6 Enzymes that undergo cooperative allosteric transitions
invariably consist of symmetric assemblies of multiple
subunits.

3-7 Continual addition and removal of phosphates
by protein kinases and protein phosphatases is wasteful
of energy (since their combined action consumes ATP),
but it is a necessary consequence of effective regulation by
phosphorylation.

Discuss the following problems.

3-8 Consider the following statement. "To produce
one molecule of each possible kind of polypeptide chain,
300 amino acids in length, would require more atoms than
exist in the universe." Given the size of the universe, do you
suppose this statement could possibly be correct? Since
counting atoms is a tricky business, consider the problem
from the standpoint of mass. The mass of the observable
universe is estimated to be about 10⁸⁰ grams, give or take
an order of magnitude or so. Assuming that the average
mass of an amino acid is 110 daltons, what would be the
mass of one molecule of each possible kind of polypeptide
chain 300 amino acids in length? Is this greater than the
mass of the universe?

3-9 A common strategy for identifying distantly related
homologous proteins is to search the database using a short
signature sequence indicative of the particular protein
function. Why is it better to search with a short sequence
than with a long sequence? Do you not have more chances
for a "hit" in the database with a long sequence?

3-10 The so-called kelch motif consists of a four-stranded
β sheet, which forms what is known as a β propeller.
It is usually found to be repeated four to seven times,
forming a kelch repeat domain in a multidomain protein.
One such kelch repeat domain is shown in Figure Q3-1.
Would you classify this domain as an "in-line" or "plug-in"
type domain?



Figure Q3-1 The
kelch repeat domain of
galactose oxidase from
D. dendroides (Problem
3-10). The seven individual
β propellers are color
coded and labeled. The
N- and C-termini are
indicated by N and C.


3-11 Titin, which has a molecular weight of about
3 × 10⁶, is the largest polypeptide yet described. Titin
molecules extend from muscle thick filaments to the
Z disc; they are thought to act as springs to keep the thick
filaments centered in the sarcomere. Titin is composed of a
large number of repeated immunoglobulin (Ig) sequences
of 89 amino acids, each of which is folded into a domain
about 4 nm in length (Figure Q3-2A).

You suspect that the springlike behavior of titin is
caused by the sequential unfolding (and refolding) of
individual Ig domains. You test this hypothesis using the atomic
force microscope, which allows you to pick up one end of
a protein molecule and pull with an accurately measured
force. For a fragment of titin containing seven repeats of the
Ig domain, this experiment gives the sawtooth
force-versus-extension curve shown in Figure Q3-2B. If the
experiment is repeated in a solution of 8 M urea (a protein
denaturant), the peaks disappear and the measured extension
becomes much longer for a given force. If the experiment
is repeated after the protein has been cross-linked by
treatment with glutaraldehyde, once again the peaks disappear
but the extension becomes much smaller for a given force.




Figure Q3-2 Springlike behavior of titin (Problem 3-11). (A) The 
structure of an individual Ig domain. (B) Force in piconewtons versus 
extension in nanometers obtained by atomic force microscopy. 
A. Are the data consistent with your hypothesis that
titin's springlike behavior is due to the sequential
unfolding of individual Ig domains? Explain your reasoning.

B. Is the extension for each putative domain-unfolding
event the magnitude you would expect? (In an
extended polypeptide chain, amino acids are spaced at
intervals of 0.34 nm.)

C. Why is each successive peak in Figure Q3-2B a
little higher than the one before?

D. Why does the force collapse so abruptly after each
peak?

3-12 Rous sarcoma virus (RSV) carries an oncogene called Src, which
encodes a continuously active protein tyrosine kinase that leads to
unchecked cell proliferation. Normally, Src carries an attached fatty
acid (myristoylate) group that allows it to bind to the cytoplasmic side
of the plasma membrane. A mutant version of Src that does not allow
attachment of myristoylate does not bind to the membrane. Infection of
cells with RSV encoding either the normal or the mutant form of Src
leads to the same high level of protein tyrosine kinase activity, but
the mutant Src does not cause cell proliferation.

A. Assuming that the normal Src is all bound to the plasma membrane and
that the mutant Src is distributed throughout the cytoplasm, calculate
their relative concentrations in the neighborhood of the plasma
membrane. For the purposes of this calculation, assume that the cell is
a sphere with a radius (r) of 10 μm and that the mutant Src is
distributed throughout the cell, whereas the normal Src is confined to a
4-nm-thick layer immediately beneath the membrane. [For this problem,
assume that the membrane has no thickness. The volume of a sphere is
$(4/3)\pi r^3$.]

B. The target (X) for phosphorylation by Src resides in the membrane.
Explain why the mutant Src does not cause cell proliferation.
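The geometry in part A can be checked numerically. The sketch below uses only the values stated in the problem (a 10 μm radius and a 4 nm layer) and assumes, in addition, that a cell contains the same number of normal or mutant Src molecules, which the equal kinase activities suggest but do not prove. The function and variable names are illustrative.

```python
import math

def sphere_volume(radius_um: float) -> float:
    """Volume of a sphere in cubic micrometers."""
    return (4.0 / 3.0) * math.pi * radius_um ** 3

CELL_RADIUS_UM = 10.0        # cell radius given in the problem
SHELL_THICKNESS_UM = 0.004   # 4 nm layer beneath the membrane

cell_volume = sphere_volume(CELL_RADIUS_UM)
# The shell is the whole cell minus the slightly smaller sphere inside it.
shell_volume = cell_volume - sphere_volume(CELL_RADIUS_UM - SHELL_THICKNESS_UM)

# Assumption: equal molecule numbers, so concentration scales as 1/volume.
enrichment = cell_volume / shell_volume

print(f"cell volume:  {cell_volume:.0f} um^3")
print(f"shell volume: {shell_volume:.2f} um^3")
print(f"normal Src is roughly {enrichment:.0f}-fold more concentrated near the membrane")
```

Under these assumptions the shell holds only about 5 μm³ of a cell volume of about 4200 μm³, so confinement to the membrane raises the local concentration several hundredfold.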

3-13 An antibody binds to another protein with an equilibrium constant,
K, of $5 \times 10^9\ \mathrm{M}^{-1}$. When it binds to a second,
related protein, it forms three fewer hydrogen bonds, reducing its
binding affinity by 11.9 kJ/mole. What is the K for its binding to the
second protein? (Free-energy change is related to the equilibrium
constant by the equation $\Delta G^\circ = -2.3\,RT \log K$, where R is
$8.3 \times 10^{-3}$ kJ/(mole K) and T is 310 K.)
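One way to set up this calculation is sketched below. It relies only on the relation and constants given in the problem; the function name and its arguments are illustrative, not part of the original text.

```python
R_KJ_PER_MOL_K = 8.3e-3   # gas constant in kJ/(mole K), as given
T_KELVIN = 310.0          # temperature in K, as given

def shifted_equilibrium_constant(k_initial: float, ddg_kj_per_mol: float) -> float:
    """Equilibrium constant after binding becomes less favorable by ddg.

    From dG = -2.3 RT log10(K), a loss of ddg kJ/mole lowers log10(K)
    by ddg / (2.3 RT).
    """
    delta_log_k = ddg_kj_per_mol / (2.3 * R_KJ_PER_MOL_K * T_KELVIN)
    return k_initial / 10 ** delta_log_k

k_second = shifted_equilibrium_constant(5e9, 11.9)
print(f"K for the second protein: {k_second:.1e} M^-1")
```

Because 2.3RT is close to 5.9 kJ/mole at 310 K, a loss of 11.9 kJ/mole corresponds to about two log units, that is, roughly a 100-fold drop in K.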

3-14 The protein SmpB binds to a special species of tRNA, tmRNA, to
eliminate the incomplete proteins made from truncated mRNAs in bacteria.
If the binding of SmpB to tmRNA is plotted as fraction tmRNA bound
versus SmpB concentration, one obtains a symmetrical S-shaped curve as
shown in Figure Q3-3. This curve is a visual display of a very useful
relationship between $K_d$ and concentration, which has broad
applicability. The general expression for fraction of ligand bound is
derived from the equation for $K_d$
($K_d = [\text{Pr}][\text{L}]/[\text{Pr-L}]$) by substituting
($[\text{L}]_{\text{TOT}} - [\text{L}]$) for $[\text{Pr-L}]$ and
rearranging. Because the total concentration of ligand
($[\text{L}]_{\text{TOT}}$) is equal to the free ligand ($[\text{L}]$)
plus bound ligand ($[\text{Pr-L}]$),

$$\text{fraction bound} = \frac{[\text{Pr-L}]}{[\text{L}]_{\text{TOT}}} = \frac{[\text{Pr}]}{[\text{Pr}] + K_d}$$

Figure Q3-3 Fraction of tmRNA bound versus SmpB concentration, in M
(Problem 3-14).

For SmpB and tmRNA, the fraction bound =
$[\text{SmpB-tmRNA}]/[\text{tmRNA}]_{\text{TOT}} = [\text{SmpB}]/([\text{SmpB}] + K_d)$.
Using this relationship, calculate the fraction of tmRNA bound for SmpB
concentrations equal to $10^4 K_d$, $10^3 K_d$, $10^2 K_d$, $10^1 K_d$,
$K_d$, $10^{-1} K_d$, $10^{-2} K_d$, $10^{-3} K_d$, and $10^{-4} K_d$.
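The relationship above is easy to tabulate. The following sketch evaluates the fraction bound over the requested range; because only the ratio [SmpB]/K_d enters the expression, K_d is set to 1 in arbitrary units, which is a convenience assumed here rather than a value from the problem.

```python
def fraction_bound(protein_conc: float, kd: float) -> float:
    """Fraction of ligand bound: [Pr] / ([Pr] + Kd)."""
    return protein_conc / (protein_conc + kd)

KD = 1.0  # arbitrary units; only the ratio [SmpB]/Kd matters

for exponent in range(4, -5, -1):
    smpb = KD * 10.0 ** exponent
    bound = fraction_bound(smpb, KD)
    print(f"[SmpB] = 10^{exponent:+d} Kd -> fraction bound = {bound:.4f}")
```

The values for concentrations of $10^{n} K_d$ and $10^{-n} K_d$ always sum to 1, which is why the curve looks symmetrical about $K_d$ when concentration is plotted on a logarithmic axis.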

3-15 Many enzymes obey simple Michaelis-Menten kinetics, which are
summarized by the equation

$$\text{rate} = \frac{V_{\max}[\text{S}]}{[\text{S}] + K_m}$$

where $V_{\max}$ = maximum velocity, [S] = concentration of substrate,
and $K_m$ = the Michaelis constant.

It is instructive to plug a few values of [S] into the equation to see
how rate is affected. What are the rates for [S] equal to zero, equal to
$K_m$, and equal to infinite concentration?
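The three cases can be worked directly from the equation. This is a short sketch of the algebra; for the infinite-concentration case the equation is first divided through by [S], a rearrangement introduced here for convenience.

```latex
\begin{align*}
[\text{S}] = 0:&\quad \text{rate} = \frac{V_{\max}\cdot 0}{0 + K_m} = 0 \\
[\text{S}] = K_m:&\quad \text{rate} = \frac{V_{\max} K_m}{K_m + K_m} = \tfrac{1}{2}\,V_{\max} \\
[\text{S}] \to \infty:&\quad \text{rate} = \frac{V_{\max}}{1 + K_m/[\text{S}]} \to V_{\max}
\end{align*}
```

The middle case shows why $K_m$ can be read as the substrate concentration at which the enzyme works at half its maximum rate.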

3-16 The enzyme hexokinase adds a phosphate to D-glucose but ignores its
mirror image, L-glucose. Suppose that you were able to synthesize
hexokinase entirely from D-amino acids, which are the mirror image of
the normal L-amino acids.

A. Assuming that the "D" enzyme would fold to a stable conformation,
what relationship would you expect it to bear to the normal "L" enzyme?

B. Do you suppose the "D" enzyme would add a phosphate to L-glucose, and
ignore D-glucose?

3-17 How do you suppose that a molecule of hemoglobin is able to bind
oxygen efficiently in the lungs, and yet release it efficiently in the
tissues?

3-18 Synthesis of the purine nucleotides AMP and GMP proceeds by a
branched pathway starting with ribose 5-phosphate (R5P), as shown
schematically in Figure Q3-4. Using the principles of feedback
inhibition, propose a regulatory strategy for this pathway that ensures
an adequate supply of both AMP and GMP and minimizes the buildup of the
intermediates (A-I) when supplies of AMP and GMP are adequate.


R5P → A → B → C → D → E
                      E → F → G → AMP
                      E → H → I → GMP

Figure Q3-4 Schematic diagram of the metabolic pathway for synthesis of
AMP and GMP from R5P (Problem 3-18).


BASIC GENETIC MECHANISMS

CHAPTER 4

DNA, Chromosomes, and Genomes

Life depends on the ability of cells to store, retrieve, and translate
the genetic instructions required to make and maintain a living
organism. This hereditary information is passed on from a cell to its
daughter cells at cell division, and from one generation of an organism
to the next through the organism's reproductive cells. The instructions
are stored within every living cell as its genes, the
information-containing elements that determine the characteristics of a
species as a whole and of the individuals within it.

As soon as genetics emerged as a science at the beginning of the
twentieth century, scientists became intrigued by the chemical structure
of genes. The information in genes is copied and transmitted from cell
to daughter cell millions of times during the life of a multicellular
organism, and it survives the process essentially unchanged. What form
of molecule could be capable of such accurate and almost unlimited
replication and also be able to exert precise control, directing
multicellular development as well as the daily life of every cell? What
kind of instructions does the genetic information contain? And how can
the enormous amount of information required for the development and
maintenance of an organism fit within the tiny space of a cell?

The answers to several of these questions began to emerge in the 1940s.
At this time researchers discovered, from studies in simple fungi, that
genetic information consists largely of instructions for making
proteins. Proteins are phenomenally versatile macromolecules that
perform most cell functions. As we saw in Chapter 3, they serve as
building blocks for cell structures and form the enzymes that catalyze
most of the cell's chemical reactions. They also regulate gene
expression (Chapter 7), and they enable cells to communicate with each
other (Chapter 15) and to move (Chapter 16). The properties and
functions of cells and organisms are determined to a great extent by the
proteins that they are able to make.

Painstaking observations of cells and embryos in the late nineteenth
century had led to the recognition that the hereditary information is
carried on chromosomes, threadlike structures in the nucleus of a
eukaryotic cell that become visible by light microscopy as the cell
begins to divide (Figure 4-1). Later, when biochemical analysis became
possible, chromosomes were found to consist of deoxyribonucleic acid
(DNA) and protein, with both being present in roughly the same amounts.
For many decades, the DNA was thought to be merely a structural element.
However, the other crucial advance made in the 1940s was the
identification of DNA as the likely carrier of genetic information. This
breakthrough in our understanding of cells came from studies of
inheritance in bacteria (Figure 4-2). But still, as the 1950s began,
both how proteins could be specified by instructions in the DNA and how
this information might be copied for transmission from cell to cell
seemed completely mysterious. The puzzle was suddenly solved in 1953,
when James Watson and Francis Crick derived the mechanism from their
model of DNA structure. As outlined in Chapter 1, the determination of
the double-helical structure of DNA immediately solved the problem of
how the information in this molecule might be copied, or replicated. It
also provided the first clues as to how a molecule of DNA might use the
sequence of its subunits to encode the instructions for making proteins.
Today, the fact that DNA is the genetic material is so fundamental to
biological thought that it is difficult to appreciate the enormous
intellectual gap that was filled by this breakthrough discovery.

Figure 4-1 Chromosomes in cells. (A) Two adjacent plant cells
photographed through a light microscope. The DNA has been stained with a
fluorescent dye (DAPI) that binds to it. The DNA is present in
chromosomes, which become visible as distinct structures in the light
microscope only when they become compact, sausage-shaped structures in
preparation for cell division, as shown on the left. The cell on the
right, which is not dividing, contains identical chromosomes, but they
cannot be clearly distinguished at this phase in the cell's life cycle,
because they are in a more extended conformation. (B) Schematic diagram
of the outlines of the two cells along with their chromosomes.
(A, courtesy of Peter Shaw.)

We begin this chapter by describing the structure of DNA. We see how,
despite its chemical simplicity, the structure and chemical properties
of DNA make it ideally suited as the raw material of genes. We then
consider how the many proteins in chromosomes arrange and package this
DNA. The packing has to be done in an orderly fashion so that the
chromosomes can be replicated and apportioned correctly between the two
daughter cells at each cell division. And it must also allow access to
chromosomal DNA, both for the enzymes that repair DNA damage and for the
specialized proteins that direct the expression of its many genes.

In the past two decades, there has been a revolution in our ability to
determine the exact order of subunits in DNA molecules. As a result, we
now know the sequence of the 3.2 billion nucleotide pairs that provide
the information for producing a human adult from a fertilized egg, as
well as having the DNA sequences for thousands of other organisms.
Detailed analyses of these sequences are providing exciting insights
into the process of evolution, and it is with this subject that the
chapter ends.

[Figure 4-2 schematic. (A) S strain (smooth pathogenic bacterium, causes
pneumonia) gives rise by RANDOM MUTATION to R strain (rough
nonpathogenic mutant bacterium). Live R strain cells grown in presence
of either heat-killed S strain cells or cell-free extract of S strain
cells undergo TRANSFORMATION: some R strain cells are transformed to
S strain cells, whose daughters are pathogenic and cause pneumonia.
CONCLUSION: Molecules that can carry heritable information are present
in S strain cells. (B) FRACTIONATION OF CELL-FREE EXTRACT INTO CLASSES
OF PURIFIED MOLECULES: RNA, protein, DNA, lipid, carbohydrate. Molecules
tested for transformation of R strain cells: RNA, R strain; protein,
R strain; DNA, S strain; lipid, R strain; carbohydrate, R strain.
CONCLUSION: The molecule that carries the heritable information is DNA.]

Figure 4-2 The first experimental demonstration that DNA is the genetic
material. These experiments, carried out in the 1920s (A) and 1940s (B),
showed that adding purified DNA to a bacterium changed the bacterium's
properties and that this change was faithfully passed on to subsequent
generations. Two closely related strains of the bacterium Streptococcus
pneumoniae differ from each other in both their appearance under the
microscope and their pathogenicity. One strain appears smooth (S) and
causes death when injected into mice, and the other appears rough (R)
and is nonlethal. (A) An initial experiment shows that some substance
present in the S strain can change (or transform) the R strain into the
S strain and that this change is inherited by subsequent generations of
bacteria. (B) This experiment, in which the R strain has been incubated
with various classes of biological molecules purified from the S strain,
identifies the active substance as DNA.

This is the first of four chapters that deal with basic genetic
mechanisms: the ways in which the cell maintains, replicates, and
expresses the genetic information carried in its DNA. In the next
chapter (Chapter 5), we shall discuss the mechanisms by which the cell
accurately replicates and repairs DNA; we also describe how DNA
sequences can be rearranged through the process of genetic
recombination. Gene expression, the process through which the
information encoded in DNA is interpreted by the cell to guide the
synthesis of proteins, is the main topic of Chapter 6. In Chapter 7, we
describe how this gene expression is controlled by the cell to ensure
that each of the many thousands of proteins and RNA molecules encoded in
its DNA is manufactured only at the proper time and place in the life of
a cell.


THE STRUCTURE AND FUNCTION OF DNA 

Biologists in the 1940s had difficulty in conceiving how DNA could be
the genetic material. The molecule seemed too simple: a long polymer
composed of only four types of nucleotide subunits, which resemble one
another chemically. Early in the 1950s, DNA was examined by x-ray
diffraction analysis, a technique for determining the three-dimensional
atomic structure of a molecule (discussed in Chapter 8). The early x-ray
diffraction results indicated that DNA was composed of two strands of
the polymer wound into a helix. The observation that DNA was
double-stranded provided one of the major clues that led to the
Watson-Crick model for DNA structure that, as soon as it was proposed in
1953, made DNA's potential for replication and information storage
apparent.

A DNA Molecule Consists of Two Complementary Chains of
Nucleotides

A deoxyribonucleic acid (DNA) molecule consists of two long
polynucleotide chains composed of four types of nucleotide subunits.
Each of these chains is known as a DNA chain, or a DNA strand. The
chains run antiparallel to each other, and hydrogen bonds between the
base portions of the nucleotides hold the two chains together
(Figure 4-3). As we saw in Chapter 2 (Panel 2-6, pp. 100-101),
nucleotides are composed of a five-carbon sugar to which are attached
one or more phosphate groups and a nitrogen-containing base. In the case
of the nucleotides in DNA, the sugar is deoxyribose attached to a single
phosphate group (hence the name deoxyribonucleic acid), and the base may
be either adenine (A), cytosine (C), guanine (G), or thymine (T). The
nucleotides are covalently linked together in a chain through the sugars
and phosphates, which thus form a "backbone" of alternating
sugar-phosphate-sugar-phosphate. Because only the base differs in each
of the four types of nucleotide subunit, each polynucleotide chain in
DNA is analogous to a sugar-phosphate necklace (the backbone), from
which hang the four types of beads (the bases A, C, G, and T). These
same symbols (A, C, G, and T) are commonly used to denote either the
four bases or the four entire nucleotides, that is, the bases with their
attached sugar and phosphate groups.

The way in which the nucleotides are linked together gives a DNA strand
a chemical polarity. If we think of each sugar as a block with a
protruding knob (the 5' phosphate) on one side and a hole (the
3' hydroxyl) on the other (see Figure 4-3), each completed chain, formed
by interlocking knobs with holes, will have all of its subunits lined up
in the same orientation. Moreover, the two ends of the chain will be
easily distinguishable, as one has a hole (the 3' hydroxyl) and the
other a knob (the 5' phosphate) at its terminus. This polarity in a DNA
chain is indicated by referring to one end as the 3' end and the other
as the 5' end, names derived from the orientation of the deoxyribose
sugar. With respect to DNA's information-carrying capacity, the chain of
nucleotides in a DNA strand, being both directional and linear, can be
read in much the same way as the letters on this page.

Figure 4-3 DNA and its building blocks. DNA is made of four types of
nucleotides, which are linked covalently into a polynucleotide chain (a
DNA strand) with a sugar-phosphate backbone from which the bases (A, C,
G, and T) extend. A DNA molecule is composed of two antiparallel DNA
strands held together by hydrogen bonds between the paired bases. The
arrowheads at the ends of the DNA strands indicate the polarities of the
two strands. In the diagram at the bottom left of the figure, the DNA
molecule is shown straightened out; in reality, it is twisted into a
double helix, as shown on the right. For details, see Figure 4-5 and
Movie 4.1.

The three-dimensional structure of DNA, the DNA double helix, arises
from the chemical and structural features of its two polynucleotide
chains. Because these two chains are held together by hydrogen-bonding
between the bases on the different strands, all the bases are on the
inside of the double helix, and the sugar-phosphate backbones are on the
outside (see Figure 4-3). In each case, a bulkier two-ring base (a
purine; see Panel 2-6, pp. 100-101) is paired with a single-ring base (a
pyrimidine): A always pairs with T, and G with C (Figure 4-4). This
complementary base-pairing enables the base pairs to be packed in the
energetically most favorable arrangement in the interior of the double
helix. In this arrangement, each base pair is of similar width, thus
holding the sugar-phosphate backbones a constant distance apart along
the DNA molecule. To maximize the efficiency of base-pair packing, the
two sugar-phosphate backbones wind around each other to form a
right-handed double helix, with one complete turn roughly every ten base
pairs (Figure 4-5).
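The pairing rules (A with T, G with C), together with the antiparallel orientation of the two chains noted earlier in this section, mean that the sequence of one strand fixes the sequence of the other. The sketch below illustrates this for a short stretch of sequence; the function names are illustrative, and the example sequence is taken from the opening bases shown in Figure 4-8.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def complement(strand: str) -> str:
    """Base-by-base complement, kept in the same left-to-right order."""
    return "".join(COMPLEMENT[base] for base in strand)

def reverse_complement(strand: str) -> str:
    """Partner strand written in its own 5'-to-3' direction."""
    return complement(strand)[::-1]

strand = "CCCTGTGGAG"  # written 5' to 3'
print("5'-" + strand + "-3'")
print("3'-" + complement(strand) + "-5'")
print("partner read 5' to 3':", reverse_complement(strand))
```

Applying `reverse_complement` twice returns the original sequence, which is the computational counterpart of saying that each strand is exactly complementary to its partner.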

The members of each base pair can fit together within the double helix
only if the two strands of the helix are antiparallel, that is, only if
the polarity of one strand is oriented opposite to that of the other
strand (see Figures 4-3 and 4-4). A consequence of DNA's structure and
base-pairing requirements is that each strand of a DNA molecule contains
a sequence of nucleotides that is exactly complementary to the
nucleotide sequence of its partner strand.

The Structure of DNA Provides a Mechanism for Heredity

The discovery of the structure of DNA immediately suggested answers to
the two most fundamental questions about heredity. First, how could the
information to specify an organism be carried in a chemical form? And
second, how could this information be duplicated and copied from
generation to generation?

The answer to the first question came from the realization that DNA is a
linear polymer of four different kinds of monomer, strung out in a
defined sequence like the letters of a document written in an alphabetic
script.

Figure 4-4 Complementary base pairs in the DNA double helix. The shapes
and chemical structures of the bases allow hydrogen bonds to form
efficiently only between A and T and between G and C, because atoms that
are able to form hydrogen bonds (see Panel 2-3, pp. 94-95) can then be
brought close together without distorting the double helix. As
indicated, two hydrogen bonds form between A and T, while three form
between G and C. The bases can pair in this way only if the two
polynucleotide chains that contain them are antiparallel to each other.

Figure 4-5 The DNA double helix. (A) A space-filling model of 1.5 turns
of the DNA double helix. Each turn of DNA is made up of 10.4 nucleotide
pairs, and the center-to-center distance between adjacent nucleotide
pairs is 0.34 nm. The coiling of the two strands around each other
creates two grooves in the double helix: the wider groove is called the
major groove, and the smaller the minor groove, as indicated. (B) A
short section of the double helix viewed from its side, showing four
base pairs. The nucleotides are linked together covalently by
phosphodiester bonds that join the 3'-hydroxyl (-OH) group of one sugar
to the 5'-hydroxyl group of the next sugar. Thus, each polynucleotide
strand has a chemical polarity; that is, its two ends are chemically
different. The 5' end of the DNA polymer is by convention often
illustrated carrying a phosphate group, while the 3' end is shown with a
hydroxyl.

Figure 4-6 DNA as a template for its own duplication. Because the
nucleotide A successfully pairs only with T, and G pairs with C, each
strand of DNA can act as a template to specify the sequence of
nucleotides in its complementary strand. In this way, double-helical DNA
can be copied precisely, with each parental DNA helix producing two
identical daughter DNA helices.

The answer to the second question came from the double-stranded nature
of the structure: because each strand of DNA contains a sequence of
nucleotides that is exactly complementary to the nucleotide sequence of
its partner strand, each strand can act as a template, or mold, for the
synthesis of a new complementary strand. In other words, if we designate
the two DNA strands as S and S', strand S can serve as a template for
making a new strand S', while strand S' can serve as a template for
making a new strand S (Figure 4-6). Thus, the genetic information in DNA
can be accurately copied by the beautifully simple process in which
strand S separates from strand S', and each separated strand then serves
as a template for the production of a new complementary partner strand
that is identical to its former partner.
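The template logic can be expressed as a toy model. The sketch below treats a double helix as a pair of strings, S and S', each written 5' to 3', and builds each new strand purely from the pairing rules; it is an illustration of the principle only, not a description of the replication machinery, and all names in it are illustrative.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def synthesize_partner(template: str) -> str:
    """New strand specified by `template`, returned in 5'-to-3' order."""
    return "".join(COMPLEMENT[base] for base in reversed(template))

def replicate(duplex: tuple[str, str]) -> list[tuple[str, str]]:
    """Separate strands S and S' and use each one as a template."""
    s, s_prime = duplex
    new_s_prime = synthesize_partner(s)   # templated by S
    new_s = synthesize_partner(s_prime)   # templated by S'
    return [(s, new_s_prime), (new_s, s_prime)]

s = "ATGGTGCACCTG"
parent = (s, synthesize_partner(s))
daughters = replicate(parent)
print(all(daughter == parent for daughter in daughters))
```

Each daughter pair keeps one parental strand and gains one newly made strand, and both daughters carry the same sequence as the parent.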

The ability of each strand of a DNA molecule to act as a template for
producing a complementary strand enables a cell to copy, or replicate,
its genome before passing it on to its descendants. We shall describe
the elegant machinery that the cell uses to perform this task in
Chapter 5.

Organisms differ from one another because their respective DNA molecules
have different nucleotide sequences and, consequently, carry different
biological messages. But how is the nucleotide alphabet used to make
messages, and what do they spell out?

As discussed above, it was known well before the structure of DNA was
determined that genes contain the instructions for producing proteins.
If genes are made of DNA, the DNA must therefore somehow encode proteins
(Figure 4-7). As discussed in Chapter 3, the properties of a protein,
which are responsible for its biological function, are determined by its
three-dimensional structure. This structure is determined in turn by the
linear sequence of the amino acids of which it is composed. The linear
sequence of nucleotides in a gene must therefore somehow spell out the
linear sequence of amino acids in a protein. The exact correspondence
between the four-letter nucleotide alphabet of DNA and the twenty-letter
amino acid alphabet of proteins, the genetic code, is not at all obvious
from the DNA structure, and it took over a decade after the discovery of
the double helix before it was worked out. In Chapter 6, we will
describe this code in detail in the course of elaborating the process of
gene expression, through which a cell converts the nucleotide sequence
of a gene first into the nucleotide sequence of an RNA molecule, and
then into the amino acid sequence of a protein.

The complete store of information in an organism's DNA is called its
genome, and it specifies all the RNA molecules and proteins that the
organism will ever synthesize. (The term genome is also used to describe
the DNA that carries this information.) The amount of information
contained in genomes is staggering. The nucleotide sequence of a very
small human gene, written out in the four-letter nucleotide alphabet,
occupies a quarter of a page of text (Figure 4-8), while the complete
sequence of nucleotides in the human genome would fill more than a
thousand books the size of this one. In addition to other critical
information, it includes roughly 21,000 protein-coding genes, which
(through alternative splicing; see p. 415) give rise to a much greater
number of distinct proteins.

In Eukaryotes, DNA Is Enclosed in a Cell Nucleus

As described in Chapter 1, nearly all the DNA in a eukaryotic cell is
sequestered in a nucleus, which in many cells occupies about 10% of the
total cell volume. This compartment is delimited by a nuclear envelope
formed by two concentric lipid bilayer membranes (Figure 4-9). These
membranes are punctured at intervals by large nuclear pores, through
which molecules move between the nucleus and the cytosol. The nuclear
envelope is directly connected to the extensive system of intracellular
membranes called the endoplasmic reticulum, which extend out from it
into the cytoplasm. And it is mechanically supported by a network of
intermediate filaments called the nuclear lamina, a thin feltlike mesh
just beneath the inner nuclear membrane (see Figure 4-9B).

[Figure 4-7 schematic: gene A, gene B, and gene C lie along a DNA double
helix; through GENE EXPRESSION they give rise to protein A, protein B,
and protein C, respectively.]

Figure 4-7 The relationship between genetic information carried in DNA
and proteins. (Discussed in Chapter 1.)

Figure 4-8 The nucleotide sequence of the human β-globin gene. By
convention, a nucleotide sequence is written from its 5' end to its
3' end, and it should be read from left to right in successive lines
down the page as though it were normal English text. This gene carries
the information for the amino acid sequence of one of the two types of
subunits of the hemoglobin molecule; a different gene, the α-globin
gene, carries the information for the other. (Hemoglobin, the protein
that carries oxygen in the blood, has four subunits, two of each type.)
Only one of the two strands of the DNA double helix containing the
β-globin gene is shown; the other strand has the exact complementary
sequence. The DNA sequences highlighted in yellow show the three regions
of the gene that specify the amino acid sequence for the β-globin
protein. We shall see in Chapter 6 how the cell splices these three
sequences together at the level of messenger RNA in order to synthesize
a full-length β-globin protein.

The nuclear envelope allows the many proteins that act on DNA to be
concentrated where they are needed in the cell, and, as we see in
subsequent chapters, it also keeps nuclear and cytosolic enzymes
separate, a feature that is crucial for the proper functioning of
eukaryotic cells.

Summary 

Genetic information is carried in the linear sequence of nucleotides in
DNA. Each molecule of DNA is a double helix formed from two
complementary antiparallel strands of nucleotides held together by
hydrogen bonds between G-C and A-T base pairs. Duplication of the
genetic information occurs by the use of one DNA strand as a template
for the formation of a complementary strand. The genetic information
stored in an organism's DNA contains the instructions for all the RNA
molecules and proteins that the organism will ever synthesize and is
said to comprise its genome. In eukaryotes, DNA is contained in the cell
nucleus, a large membrane-bound compartment.

CHROMOSOMAL DNA AND ITS PACKAGING IN THE 
CHROMATIN FIBER 

The most important function of DNA is to carry genes, the information
that specifies all the RNA molecules and proteins that make up an
organism, including information about when, in what types of cells, and
in what quantity each RNA molecule and protein is to be made. The
nuclear DNA of eukaryotes is divided up into chromosomes, and in this
section we see how genes are typically arranged on each chromosome. In
addition, we describe the specialized DNA sequences that are required
for a chromosome to be accurately duplicated as a separate entity and
passed on from one generation to the next.

We also confront the serious challenge of DNA packaging. If the double
helices comprising all 46 chromosomes in a human cell could be laid end
to end, they would reach approximately 2 meters; yet the nucleus, which
contains the DNA, is only about 6 μm in diameter. This is geometrically
equivalent to packing 40 km (24 miles) of extremely fine thread into a
tennis ball. The complex task of packaging DNA is accomplished by
specialized proteins that bind to the DNA and fold it, generating a
series of organized coils and loops that provide increasingly higher
levels of organization, and prevent the DNA from becoming an
unmanageable tangle. Amazingly, although the DNA is very tightly
compacted, it nevertheless remains accessible to the many enzymes in the
cell that replicate it, repair it, and use its genes to produce RNA
molecules and proteins.
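The 2-meter figure can be recovered from numbers given earlier in the chapter. The sketch below multiplies the genome size by the 0.34 nm spacing between nucleotide pairs (Figure 4-5); it assumes that the 46 chromosomes together hold two copies of the 3.2 billion nucleotide pairs, and, for the thread comparison, it assumes a tennis ball diameter of about 6.7 cm, a value not stated in the text.

```python
HAPLOID_GENOME_BP = 3.2e9    # nucleotide pairs, from the text
RISE_PER_BP_NM = 0.34        # spacing between adjacent pairs, Figure 4-5
NUCLEUS_DIAMETER_M = 6e-6    # about 6 micrometers, from the text

# Assumption: 46 chromosomes correspond to two copies of the genome.
total_bp = 2 * HAPLOID_GENOME_BP
dna_length_m = total_bp * RISE_PER_BP_NM * 1e-9

# Assumption (not from the text): a tennis ball is about 6.7 cm across.
THREAD_LENGTH_M = 40e3
TENNIS_BALL_DIAMETER_M = 0.067

dna_ratio = dna_length_m / NUCLEUS_DIAMETER_M
thread_ratio = THREAD_LENGTH_M / TENNIS_BALL_DIAMETER_M

print(f"total DNA length:              {dna_length_m:.2f} m")
print(f"DNA length / nucleus diameter: {dna_ratio:.1e}")
print(f"thread length / ball diameter: {thread_ratio:.1e}")
```

Under these assumptions the DNA comes to a little over 2 m, and both length-to-diameter ratios are a few hundred thousand, which is the sense in which the two packing problems are comparable.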


Nucleotide sequence of the human β-globin gene (Figure 4-8):

CCCTGTGGAGCCACACCCTAGGGTTGGCCA
ATCTACTCCCAGGAGCAGGGAGGGCAGGAG
CCAGGGCTGGGCATAAAAGTCAGGGCAGAG
CCATCTATTGCTTACATTTGCTTCTGACAC
AACTGTGTTCACTAGCAACTCAAACAGACA
CCATGGTGCACCTGACTCCTGAGGAGAAGT
CTGCCGTTACTGCCCTGTGGGGCAAGGTGA
ACGTGGATGAAGTTGGTGGTGAGGCCCTGG
GCAGGTTGGTATCAAGGTTACAAGACAGGT
TTAAGGAGACCAATAGAAACTGGGCATGTG
GAGACAGAGAAGACTCTTGGGTTTCTGATA
GGCACTGACTCTCTCTGCCTATTGGTCTAT
TTTCCCACCCTTAGGCTGCTGGTGGTCTAC
CCTTGGACCCAGAGGTTCTTTGAGTCCTTT 
GGGGATCTGTCCACTCCTGATGCTGTTATG 
GGCAACCCTAAGGTGAAGGCTCATGGCAAG 
AAAGTGCTCGGTGCCTTTAGTGATGGCCTG 
GCTCACCTGGACAACCTCAAGGGCACCTTT 
GCCACACTGAGTGAGCTGCACTGTGACAAG 
CTGCACGTGGATCCTGAGAACTTCAGGGTG
AGTCTATGGGACCCTTGATGTTTTCTTTCC
CCTTCTTTTCTATGGTTAAGTTCATGTCAT
AGGAAGGGGAGAAGTAACAGGGTACAGTTT
AGAATGGGAAACAGACGAATGATTGCATCA
GTGTGGAAGTCTCAGGATCGTTTTAGTTTC
TTTTATTTGCTGTTCATAACAATTGTTTTC
TTTTGTTTAATTCTTGCTTTCTTTTTTTTT
CTTCTCCGCAATTTTTACTATTATACTTAA
TGCCTTAACATTGTGTATAACAAAAGGAAA
TATCTCTGAGATACATTAAGTAACTTAAAA
AAAAACTTTACACAGTCTGCCTAGTACATT
ACTATTTGGAATATATGTGTGCTTATTTGC 
ATATTCATAATCTCCCTACTTTATTTTCTT 
TTATTTTTAATTGATACATAATCATTATAC 
ATATTTATGGGTTAAAGTGTAATGTTTTAA 
TATGTGTACACATATTGACCAAATCAGGGT 
AATTTTGCATTTGTAATTTTAAAAAATGCT 
TTCTTCTTTTAATATACTTTTTTGTTTATC 
TTATTTCTAATACTTTCCCTAATCTCTTTC 
TTTCAGGGCAATAATGATACAATGTATCAT 
GCCTCTTTGCACCATTCTAAAGAATAACAG 
TGATAATTTCTGGGTTAAGGCAATAGCAAT 
ATTTCTGCATATAAATATTTCTGCATATAA 
ATTGTAACTGATGTAAGAGGTTTCATATTG 
CTAATAGCAGCTACAATCCAGCTACCATTC 
TGCTTTTATTTTATGGTTGGGATAAGGCTG 
GATTATTCTGAGTCCAAGCTAGGCCCTTTT 
GCTAATCATGTTCATACCTCTTATCTTCCT 
CCCACAGCTCCTGGGCAACGTGCTGGTCTG 

TGTGCTGGCCCATCACTTTGGCAAAGAATT
CACCCCACCAGTGCAGGCTGCCTATCAGAA 
AGTGGTGGCTGGTGTGGCTAATGCCCTGGC 
CCACAAGTATCACTAAGCTCGCTTTCTTGC 
TGTCCAATTTCTATTAAAGGTTCCTTTGTT 
CCCTAAGTCCAACTACTAAACTGGGGGATA 
TTATGAAGGGCCTTGAGCATCTGGATTCTG 
CCTAATAAAAAACATTTATTTTCATTGCAA 
TGATGTATTTAAATTATTTCTGAATATTTT 
ACTAAAAAGGGAATGTGGGAGGTCAGTGCA 
TTTAAAACATAAAGAAATGATGAGCTGTTC 
AAACCTTGGGAAAATACACTATATCTTAAA 
CTCCATGAAAGAAGGTGAGGCTGCAACCAG
CTAATGCACATTGGCAACAGCCCCTGATGC 
CTATGCCTTATTCATCCCTCAGAAAAGGAT 
TCTTGTAGAGGCTTGATTTGCAGGTTAAAG 
TTTTGCTATGCTGTATTTTACATTACTTAT 
TGTTTTAGCTGTCCTCATGAATGTCTTTTC

Figure 4-9 A cross-sectional view of a typical cell nucleus. (A) Electron micrograph of a thin section through the nucleus of
a human fibroblast. (B) Schematic drawing, showing that the nuclear envelope consists of two membranes, the outer one being
continuous with the endoplasmic reticulum (ER) membrane (see also Figure 12-7). The space inside the endoplasmic reticulum
(the ER lumen) is colored yellow; it is continuous with the space between the two nuclear membranes. The lipid bilayers of the
inner and outer nuclear membranes are connected at each nuclear pore. A sheetlike network of intermediate filaments
inside the nucleus forms the nuclear lamina (brown), providing mechanical support for the nuclear envelope (for details, see
Chapter 12). The dark-staining heterochromatin contains specially condensed regions of DNA that will be discussed later. (A,
courtesy of E.G. Jordan and J. McGovern.)


Eukaryotic DNA Is Packaged into a Set of Chromosomes 

Each chromosome in a eukaryotic cell consists of a single, enormously long linear
DNA molecule along with the proteins that fold and pack the fine DNA thread into
a more compact structure. In addition to the proteins involved in packaging,
chromosomes are also associated with many other proteins (as well as numerous RNA
molecules). These are required for the processes of gene expression, DNA
replication, and DNA repair. The complex of DNA and tightly bound protein is called
chromatin (from the Greek chroma, "color," because of its staining properties).

Bacteria lack a special nuclear compartment, and they generally carry their
genes on a single DNA molecule, which is often circular (see Figure 1-24). This
DNA is also associated with proteins that package and condense it, but they are
different from the proteins that perform these functions in eukaryotes. Although
the bacterial DNA with its attendant proteins is often called the bacterial
"chromosome," it does not have the same structure as eukaryotic chromosomes, and
less is known about how the bacterial DNA is packaged. Therefore, our discussion
of chromosome structure will focus almost entirely on eukaryotic chromosomes.

With the exception of the gametes (eggs and sperm) and a few highly specialized
cell types that cannot multiply and either lack DNA altogether (for example,
red blood cells) or have replicated their DNA without completing cell division (for
example, megakaryocytes), each human cell nucleus contains two copies of each
chromosome, one inherited from the mother and one from the father. The maternal
and paternal chromosomes of a pair are called homologous chromosomes
(homologs). The only nonhomologous chromosome pairs are the sex chromosomes
in males, where a Y chromosome is inherited from the father and an X
chromosome from the mother. Thus, each human cell contains a total of 46
chromosomes: 22 pairs common to both males and females, plus two so-called sex
chromosomes (X and Y in males, two Xs in females). These human chromosomes
can be readily distinguished by "painting" each one a different color using a
technique based on DNA hybridization (Figure 4-10). In this method (described in
detail in Chapter 8), a short strand of nucleic acid tagged with a fluorescent dye
serves as a "probe" that picks out its complementary DNA sequence, lighting up
the target chromosome at any site where it binds. Chromosome painting is most
frequently done at the stage in the cell cycle called mitosis, when chromosomes
are especially compacted and easy to visualize (see below).
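
The base-pairing rule that lets a probe find its target can be sketched in a few lines of Python. This is only a toy model: real probes tolerate some mismatches and are far longer, and the function names, the ten-nucleotide probe, and the short target strand here are all invented for illustration.

```python
# Toy model of probe hybridization; sequences and names are illustrative only.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the strand that base-pairs with seq, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def probe_binding_sites(target_strand, probe):
    """Return every position on target_strand where the probe pairs perfectly."""
    site = reverse_complement(probe)  # the stretch of target the probe pairs with
    hits = []
    start = target_strand.find(site)
    while start != -1:
        hits.append(start)
        start = target_strand.find(site, start + 1)
    return hits


# One strand of the target, made single-stranded before hybridization.
target = "GGCAACCCTAAGGTGAAGGCTCATGGCAAG"
# Hypothetical labeled probe, complementary to positions 9-18 of the target.
probe = reverse_complement("AAGGTGAAGG")

print(probe_binding_sites(target, probe))
```

A probe with no complementary stretch on the target returns an empty list, which corresponds to a chromosome that stays unlabeled.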

Another more traditional way to distinguish one chromosome from another
is to stain them with dyes that reveal a striking and reproducible pattern of bands
along each mitotic chromosome (Figure 4-11). These banding patterns presumably
reflect variations in chromatin structure, but their basis is not well
understood. Nevertheless, the pattern of bands on each type of chromosome is unique,
and it provided the initial means to identify and number each human chromosome
reliably.



Figure 4-10 The complete set of human
chromosomes. These chromosomes, 
from a female, were isolated from a cell 
undergoing nuclear division (mitosis) 
and are therefore highly compacted. 

Each chromosome has been “painted” a 
different color to permit its unambiguous 
identification under the fluorescence 
microscope, using a technique called 
“spectral karyotyping.” Chromosome 
painting can be performed by exposing 
the chromosomes to a large collection of 
DNA molecules whose sequence matches
known DNA sequences from the human 
genome. The set of sequences matching 
each chromosome is coupled to a different 
combination of fluorescent dyes. DNA 
molecules derived from chromosome 1 are
labeled with one specific dye combination,
those from chromosome 2 with another, 
and so on. Because the labeled DNA can 
form base pairs, or hybridize, only to the 
chromosome from which it was derived, 
each chromosome becomes labeled 
with a different combination of dyes. For 
such experiments, the chromosomes are 
subjected to treatments that separate 
the two strands of double-helical DNA in 
a way that permits base-pairing with the 
single-stranded labeled DNA, but keeps 
the overall chromosome structure relatively 
intact. (A) The chromosomes visualized as 
they originally spilled from the lysed cell. 

(B) The same chromosomes artificially 
lined up in their numerical order. This 
arrangement of the full chromosome 
set is called a karyotype. (Adapted from 
N. McNeil and T. Ried, Expert Rev. Mol. 
Med. 2:1-14, 2000. With permission from 
Cambridge University Press.) 


Figure 4-11 The banding patterns of 
human chromosomes. Chromosomes 
1-22 are numbered in approximate order 
of size. A typical human cell contains two 
of each of these chromosomes, plus two 
sex chromosomes—two X chromosomes 
in a female, one X and one Y chromosome 
in a male. The chromosomes used to 
make these maps were stained at an early 
stage in mitosis, when the chromosomes 
are incompletely compacted. The 
horizontal red line represents the position 
of the centromere (see Figure 4-19), 
which appears as a constriction on 
mitotic chromosomes. The red knobs on 
chromosomes 13, 14, 15, 21, and 22 
indicate the positions of genes that code 
for the large ribosomal RNAs (discussed 
in Chapter 6). These banding patterns are 
obtained by staining chromosomes with 
Giemsa stain, and they can be observed 
under the light microscope. (Adapted from 
U. Francke, Cytogenet. Cell Genet. 31:24-32,
1981. With permission from the author.)
Figure 4-12 Aberrant human chromosomes. (A) Two normal human
chromosomes, 4 and 6. (B) In an individual carrying a balanced chromosomal
translocation, the DNA double helix in one chromosome has crossed over
with the DNA double helix in the other chromosome due to an abnormal
recombination event. The chromosome painting technique used on the
chromosomes in each of the sets allows the identification of even short
pieces of chromosomes that have become translocated, a frequent event in
cancer cells. (Courtesy of Zhenya Tang and the NIGMS Human Genetic Cell
Repository at the Coriell Institute for Medical Research: GM21880.)

The display of the 46 human chromosomes at mitosis is called the human
karyotype. If parts of chromosomes are lost or are switched between chromosomes,
these changes can be detected either by changes in the banding patterns
or, with greater sensitivity, by changes in the pattern of chromosome painting
(Figure 4-12). Cytogeneticists use these alterations to detect inherited
chromosome abnormalities and to reveal the chromosome rearrangements that occur in
cancer cells as they progress to malignancy (discussed in Chapter 20).

Chromosomes Contain Long Strings of Genes 

Chromosomes carry genes, the functional units of heredity. A gene is often
defined as a segment of DNA that contains the instructions for making a particular
protein (or a set of closely related proteins), but this definition is too narrow.
Genes that code for protein are indeed the majority, and most of the genes with
clear-cut mutant phenotypes fall under this heading. In addition, however, there
are many "RNA genes": segments of DNA that generate a functionally significant
RNA molecule, instead of a protein, as their final product. We shall say more about
the RNA genes and their products later.

As might be expected, some correlation exists between the complexity of
an organism and the number of genes in its genome (see Table 1-2, p. 29). For
example, some simple bacteria have only 500 genes, compared to about 30,000
for humans. Bacteria, archaea, and some single-celled eukaryotes, such as yeast,
have concise genomes, consisting of little more than strings of closely packed
genes. However, the genomes of multicellular plants and animals, as well as many
other eukaryotes, contain, in addition to genes, a large quantity of interspersed
DNA whose function is poorly understood (Figure 4-13). Some of this additional
DNA is crucial for the proper control of gene expression, and this may in part
explain why there is so much of it in multicellular organisms, whose genes have to
be switched on and off according to complicated rules during development
(discussed in Chapters 7 and 21).

Differences in the amount of DNA interspersed between genes, far more than
differences in numbers of genes, account for the astonishing variations in genome
size that we see when we compare one species with another (see Figure 1-32). For
example, the human genome is 200 times larger than that of the yeast Saccharomyces
cerevisiae, but 30 times smaller than that of some plants and amphibians
and 200 times smaller than that of a species of amoeba. Moreover, because of
differences in the amount of noncoding DNA, the genomes of closely related
organisms (bony fish, for example) can vary several hundredfold in their DNA content,
even though they contain roughly the same number of genes. Whatever the excess
DNA may do, it seems clear that it is not a great handicap for a eukaryotic cell to
carry a large amount of it.

Figure 4-13 The arrangement of genes in the genome of S. cerevisiae
compared to humans. (A) S. cerevisiae is a budding yeast widely used for brewing
and baking. The genome of this single-celled eukaryote is distributed over 16
chromosomes. A small region of one chromosome has been arbitrarily selected
to show its high density of genes. (B) A region of the human genome of equal
length to the yeast segment in (A). The human genes are much less densely
packed and the amount of interspersed DNA sequence is far greater. Not shown in
this sample of human DNA is the fact that most human genes are much longer than
yeast genes (see Figure 4-15).

How the genome is divided into chromosomes also differs from one eukaryotic
species to the next. For example, while the cells of humans have 46 chromosomes,
those of some small deer have only 6, while those of the common carp contain
over 100. Even closely related species with similar genome sizes can have very
different numbers and sizes of chromosomes (Figure 4-14). Thus, there is no
simple relationship between chromosome number, complexity of the organism, and
total genome size. Rather, the genomes and chromosomes of modern-day species
have each been shaped by a unique history of seemingly random genetic events,
acted on by poorly understood selection pressures over long evolutionary times.


Figure 4-14 Two closely related species 
of deer with very different chromosome 
numbers. In the evolution of the Indian
muntjac, initially separate chromosomes 
fused, without having a major effect on the 
animal. These two species contain a similar 
number of genes. (Chinese muntjac photo 
courtesy of Deborah Carreno, Natural 
Wonders Photography.) 


The Nucleotide Sequence of the Human Genome Shows How
Our Genes Are Arranged

With the publication of the full DNA sequence of the human genome in 2004, it
became possible to see in detail how the genes are arranged along each of our
chromosomes (Figure 4-15). It will be many decades before the information
contained in the human genome sequence is fully analyzed, but it has already
stimulated new experiments that have had major effects on the content of every chapter
in this book.


(A) human chromosome 22 in its mitotic conformation, composed of two
double-stranded DNA molecules, each 48 × 10^6 nucleotide pairs long

Figure 4-15 The organization of genes on a human chromosome. (A) Chromosome
22, one of the smallest human chromosomes, contains 48 × 10^6 nucleotide pairs and
makes up approximately 1.5% of the human genome. Most of the left arm of chromosome
22 consists of short repeated sequences of DNA that are packaged in a particularly
compact form of chromatin (heterochromatin) discussed later in this chapter. (B) A tenfold
expansion of a portion of chromosome 22, with about 40 genes indicated. Those in dark
brown are known genes and those in red are predicted genes. (C) An expanded portion of
(B) showing four genes. (D) The intron-exon arrangement of a typical gene is shown
after a further tenfold expansion. Each exon (red) codes for a portion of the protein, while
the DNA sequence of the introns (gray) is relatively unimportant, as discussed in detail
in Chapter 6.

The human genome (3.2 × 10^9 nucleotide pairs) is the totality of genetic information
belonging to our species. Almost all of this genome is distributed over the 22 different
autosomes and 2 sex chromosomes (see Figures 4-10 and 4-11) found within the
nucleus. A minute fraction of the human genome (16,569 nucleotide pairs, in multiple
copies per cell) is found in the mitochondria (introduced in Chapter 1, and discussed
in detail in Chapter 14). The term human genome sequence refers to the complete
nucleotide sequence of DNA in the 24 nuclear chromosomes and the mitochondria.
Being diploid, a human somatic cell nucleus contains roughly twice the haploid amount of
DNA, or 6.4 × 10^9 nucleotide pairs, when not duplicating its chromosomes in preparation
for division. (Adapted from International Human Genome Sequencing Consortium,
Nature 409:860-921, 2001. With permission from Macmillan Publishers Ltd.)
Table 4-1 Some Vital Statistics for the Human Genome

Human genome

DNA length                                                      3.2 × 10^9 nucleotide pairs*
Number of genes coding for proteins                             Approximately 21,000
Largest gene coding for protein                                 2.4 × 10^6 nucleotide pairs
Mean size for protein-coding genes                              27,000 nucleotide pairs
Smallest number of exons per gene                               1
Largest number of exons per gene                                178
Mean number of exons per gene                                   10.4
Largest exon size                                               17,106 nucleotide pairs
Mean exon size                                                  145 nucleotide pairs
Number of noncoding RNA genes                                   Approximately 9000**
Number of pseudogenes***                                        More than 20,000
Percentage of DNA sequence in exons (protein-coding sequences)  1.5%
Percentage of DNA in other highly conserved sequences****       3.5%
Percentage of DNA in high-copy-number repetitive elements       Approximately 50%

* The sequence of 2.85 billion nucleotides is known precisely (error rate of only about 1 in
100,000 nucleotides). The remaining DNA primarily consists of short sequences that are
tandemly repeated many times over, with repeat numbers differing from one individual to the
next. These highly repetitive blocks are hard to sequence accurately.

** This number is only a very rough estimate.

*** A pseudogene is a DNA sequence closely resembling that of a functional gene, but
containing numerous mutations that prevent its proper expression or function. Most
pseudogenes arise from the duplication of a functional gene followed by the accumulation of
damaging mutations in one copy.

**** These conserved functional regions include DNA encoding 5' and 3' UTRs (untranslated
regions of mRNA), DNA specifying structural and functional RNAs, and DNA with conserved
protein-binding sites.


The first striking feature of the human genome is how little of it (only a few
percent) codes for proteins (Table 4-1 and Figure 4-16). It is also notable that
nearly half of the chromosomal DNA is made up of mobile pieces of DNA that
have gradually inserted themselves in the chromosomes over evolutionary time,
multiplying like parasites in the genome (see Figure 4-62). We discuss these
transposable elements in detail in later chapters.

A second notable feature of the human genome is the large average gene
size: about 27,000 nucleotide pairs. As discussed above, a typical gene carries in
its linear sequence of nucleotides the information for the linear sequence of the
amino acids of a protein. Only about 1300 nucleotide pairs are required to encode
a protein of average size (about 430 amino acids in humans). Most of the remaining
sequence in a gene consists of long stretches of noncoding DNA that interrupt
the relatively short segments of DNA that code for protein. As will be discussed in
detail in Chapter 6, the coding sequences are called exons; the intervening
(noncoding) sequences in genes are called introns (see Figure 4-15 and Table 4-1).
The majority of human genes thus consist of a long string of alternating exons and
introns, with most of the gene consisting of introns. In contrast, the majority of
genes from organisms with concise genomes lack introns. This accounts for the
much smaller size of their genes (about one-twentieth that of human genes), as
well as for the much higher fraction of coding DNA in their chromosomes.
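
The arithmetic behind these figures can be checked with a short Python sketch. The gene and protein sizes are the ones quoted above; the three nucleotide pairs per amino acid come from the triplet genetic code rather than from this section. The variable names are illustrative, and the results are order-of-magnitude estimates, not measurements.

```python
# Back-of-the-envelope check of the gene-size figures quoted in the text.
MEAN_GENE_BP = 27_000      # mean size of a human protein-coding gene (from the text)
MEAN_PROTEIN_AA = 430      # average human protein length (from the text)
BP_PER_AMINO_ACID = 3      # assumption: one three-nucleotide codon per amino acid

coding_bp = MEAN_PROTEIN_AA * BP_PER_AMINO_ACID   # about 1300 nucleotide pairs
coding_fraction = coding_bp / MEAN_GENE_BP        # share of an average gene that codes for protein
compact_gene_bp = MEAN_GENE_BP / 20               # "one-twentieth that of human genes"

print(f"coding sequence: {coding_bp} nucleotide pairs")
print(f"coding share of an average gene: {coding_fraction:.1%}")
print(f"typical gene in a concise genome: {compact_gene_bp:.0f} nucleotide pairs")
```

Under these assumptions only about 5% of an average human gene codes for protein, and a gene one-twentieth the human size is roughly as long as the coding sequence alone, which fits the statement that most such genes lack introns.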


Figure 4-16 Scale of the human genome.

If drawn with a 1 mm space between each
nucleotide pair, as in (A), the human genome
would extend 3200 km (approximately
2000 miles), far enough to stretch across
the center of Africa, the site of our human
origins (red line in B). At this scale, there
would be, on average, a protein-coding
gene every 150 m. An average gene would
extend for 30 m, but the coding sequences
in this gene would add up to only just over
a meter.
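
The distances in Figure 4-16 follow directly from the numbers in Table 4-1. The Python sketch below repeats the conversion, assuming the 1 mm per nucleotide pair drawing scale and taking genes to be spread evenly along the genome; the constant names are illustrative.

```python
# Reproducing the map-scale figures: 1 mm of drawing per nucleotide pair.
GENOME_BP = 3_200_000_000       # haploid human genome (Table 4-1)
PROTEIN_CODING_GENES = 21_000   # approximate count (Table 4-1)
MEAN_GENE_BP = 27_000           # mean protein-coding gene size (Table 4-1)
MEAN_CODING_BP = 1_300          # coding sequence of an average gene (from the text)

MM_PER_M = 1_000
MM_PER_KM = 1_000_000

genome_km = GENOME_BP / MM_PER_KM                               # total length of the drawing
gene_spacing_m = GENOME_BP / PROTEIN_CODING_GENES / MM_PER_M    # average distance between genes
gene_length_m = MEAN_GENE_BP / MM_PER_M                         # length of an average gene
coding_length_m = MEAN_CODING_BP / MM_PER_M                     # coding part of that gene

print(f"genome: {genome_km:.0f} km")
print(f"one gene every {gene_spacing_m:.0f} m")
print(f"average gene: {gene_length_m:.0f} m, of which coding: {coding_length_m:.1f} m")
```

The computed gene length of 27 m matches the caption's rounded figure of 30 m, and the spacing of about 152 m matches its figure of 150 m.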
In addition to introns and exons, each gene is associated with regulatory DNA
sequences, which are responsible for ensuring that the gene is turned on or off at
the proper time, expressed at the appropriate level, and only in the proper type of
cell. In humans, the regulatory sequences for a typical gene are spread out over
tens of thousands of nucleotide pairs. As would be expected, these regulatory
sequences are much more compressed in organisms with concise genomes. We
discuss how regulatory DNA sequences work in Chapter 7.

Research in the last decade has surprised biologists with the discovery that,
in addition to 21,000 protein-coding genes, the human genome contains many
thousands of genes that encode RNA molecules that do not produce proteins, but
instead have a variety of other important functions. What is thus far known about
these molecules will be presented in Chapters 6 and 7. Last, but not least, the
nucleotide sequence of the human genome has revealed that the archive of
information needed to produce a human seems to be in an alarming state of chaos. As
one commentator described our genome, "In some ways it may resemble your
garage/bedroom/refrigerator/life: highly individualistic, but unkempt; little
evidence of organization; much accumulated clutter (referred to by the uninitiated
as 'junk'); virtually nothing ever discarded; and the few patently valuable items
indiscriminately, apparently carelessly, scattered throughout." We shall discuss
how this is thought to have come about in the final sections of this chapter entitled
"How Genomes Evolve."

Each DNA Molecule That Forms a Linear Chromosome Must
Contain a Centromere, Two Telomeres, and Replication Origins

To form a functional chromosome, a DNA molecule must be able to do more than
simply carry genes: it must be able to replicate, and the replicated copies must be
separated and reliably partitioned into daughter cells at each cell division. This
process occurs through an ordered series of stages, collectively known as the cell
cycle, which provides for a temporal separation between the duplication of
chromosomes and their segregation into two daughter cells. The cell cycle is
summarized in Figure 4-17, and it is discussed in detail in Chapter 17. Briefly,
during a long interphase, genes are expressed and chromosomes are replicated,
with the two replicas remaining together as a pair of sister chromatids. Throughout
this time, the chromosomes are extended and much of their chromatin exists
as long threads in the nucleus so that individual chromosomes cannot be easily
distinguished. It is only during a much briefer period of mitosis that each
chromosome condenses so that its two sister chromatids can be separated and
distributed to the two daughter nuclei. The highly condensed chromosomes in a
dividing cell are known as mitotic chromosomes (Figure 4-18). This is the form
in which chromosomes are most easily visualized; in fact, the images of
chromosomes shown so far in the chapter are of chromosomes in mitosis.

Each chromosome operates as a distinct structural unit: for a copy to be passed
on to each daughter cell at division, each chromosome must be able to replicate,
and the newly replicated copies must subsequently be separated and partitioned
correctly into the two daughter cells. These basic functions are controlled by three
types of specialized nucleotide sequences in the DNA, each of which binds
specific proteins that guide the machinery that replicates and segregates
chromosomes (Figure 4-19).

Figure 4-17 A simplified view of the eukaryotic cell cycle. During interphase,
the cell is actively expressing its genes and is therefore synthesizing proteins.
Also, during interphase and before cell division, the DNA is replicated and each
chromosome is duplicated to produce two closely paired sister DNA molecules (called
sister chromatids). A cell with only one type of chromosome, present in maternal and
paternal copies, is illustrated here. Once DNA replication is complete, the cell can
enter M phase, when mitosis occurs and the nucleus is divided into two daughter
nuclei. During this stage, the chromosomes condense, the nuclear envelope breaks
down, and the mitotic spindle forms from microtubules and other proteins. The
condensed mitotic chromosomes are captured by the mitotic spindle, and one
complete set of chromosomes is then pulled to each end of the cell by separating
the members of each sister-chromatid pair. A nuclear envelope re-forms around each
chromosome set, and in the final step of M phase, the cell divides to produce two
daughter cells. Most of the time in the cell cycle is spent in interphase; M phase is
brief in comparison, occupying only about an hour in many mammalian cells.

Experiments in yeasts, whose chromosomes are relatively small and easy to
manipulate, have identified the minimal DNA sequence elements responsible for
each of these functions. One type of nucleotide sequence acts as a DNA
replication origin, the location at which duplication of the DNA begins. Eukaryotic
chromosomes contain many origins of replication to ensure that the entire
chromosome can be replicated rapidly, as discussed in detail in Chapter 5.

After DNA replication, the two sister chromatids that form each chromosome
remain attached to one another and, as the cell cycle proceeds, are condensed
further to produce mitotic chromosomes. The presence of a second specialized
DNA sequence, called a centromere, allows one copy of each duplicated and
condensed chromosome to be pulled into each daughter cell when a cell divides. A
protein complex called a kinetochore forms at the centromere and attaches the
duplicated chromosomes to the mitotic spindle, allowing them to be pulled apart
(discussed in Chapter 17).

The third specialized DNA sequence forms telomeres, the ends of a
chromosome. Telomeres contain repeated nucleotide sequences that enable the ends of
chromosomes to be efficiently replicated. Telomeres also perform another
function: the repeated telomere DNA sequences, together with the regions adjoining
them, form structures that protect the end of the chromosome from being
mistaken by the cell for a broken DNA molecule in need of repair. We discuss both this
type of repair and the structure and function of telomeres in Chapter 5.

In yeast cells, the three types of sequences required to propagate a
chromosome are relatively short (typically less than 1000 base pairs each) and therefore
use only a tiny fraction of the information-carrying capacity of a chromosome.
Although telomere sequences are fairly simple and short in all eukaryotes, the
DNA sequences that form centromeres and replication origins in more complex
organisms are much longer than their yeast counterparts. For example,
experiments suggest that a human centromere can contain up to a million nucleotide
pairs and that it may not require a stretch of DNA with a defined nucleotide
sequence. Instead, as we shall discuss later in this chapter, a human centromere
is thought to consist of a large, regularly repeating protein-nucleic acid structure
that can be inherited when a chromosome replicates.
Figure 4-18 A mitotic chromosome.

A mitotic chromosome is a condensed
duplicated chromosome in which the
two new chromosomes, called sister
chromatids, are still linked together (see
Figure 4-17). The constricted region
indicates the position of the centromere.
(Courtesy of Terry D. Allen.)


Figure 4-19 The three DNA sequences 
required to produce a eukaryotic 
chromosome that can be replicated and 
then segregated accurately at mitosis. 

Each chromosome has multiple origins 
of replication, one centromere, and two
telomeres. Shown here is the sequence of 
events that a typical chromosome follows 
during the cell cycle. The DNA replicates 
in interphase, beginning at the origins of 
replication and proceeding bidirectionally
from the origins across the chromosome. 

In M phase, the centromere attaches the 
duplicated chromosomes to the mitotic 
spindle so that a copy of the entire genome 
is distributed to each daughter cell during 
mitosis; the special structure that attaches
the centromere to the spindle is a protein 
complex called the kinetochore (dark 
green). The centromere also helps to hold 
the duplicated chromosomes together 
until they are ready to be moved apart. 

The telomeres form special caps at each
chromosome end. 
DNA Molecules Are Highly Condensed in Chromosomes

All eukaryotic organisms have special ways of packaging DNA into chromosomes.
For example, if the 48 million nucleotide pairs of DNA in human chromosome
22 could be laid out as one long perfect double helix, the molecule would extend
for about 1.5 cm if stretched out end to end. But chromosome 22 measures only
about 2 µm in length in mitosis (see Figures 4-10 and 4-11), representing an
end-to-end compaction ratio of over 7000-fold. This remarkable feat of compression
is performed by proteins that successively coil and fold the DNA into higher and
higher levels of organization. Although much less condensed than mitotic
chromosomes, the DNA of human interphase chromosomes is still tightly packed.
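
The compaction ratio can be recomputed in a few lines of Python. The sketch assumes a rise of 0.34 nm per nucleotide pair, the standard value for B-form DNA, which is not stated in this section; the other two numbers are taken from the paragraph above, and the variable names are illustrative.

```python
# Rough check of the compaction ratio quoted for human chromosome 22.
NM_PER_BASE_PAIR = 0.34          # assumption: rise per nucleotide pair in B-form DNA
CHR22_BASE_PAIRS = 48_000_000    # from the text
MITOTIC_LENGTH_UM = 2.0          # length of mitotic chromosome 22, from the text

extended_nm = CHR22_BASE_PAIRS * NM_PER_BASE_PAIR
extended_cm = extended_nm * 1e-7     # 1 nm = 1e-7 cm
extended_um = extended_nm * 1e-3     # 1 nm = 1e-3 micrometers
compaction = extended_um / MITOTIC_LENGTH_UM

print(f"extended length: {extended_cm:.2f} cm")
print(f"compaction ratio: about {compaction:,.0f}-fold")
```

With this rise per nucleotide pair the extended helix comes out slightly longer than the rounded 1.5 cm, and the ratio is a little above 8000, which agrees with "over 7000-fold."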

In reading these sections it is important to keep in mind that chromosome
structure is dynamic. We have seen that each chromosome condenses to an
extreme degree in the M phase of the cell cycle. Much less visible, but of enormous
interest and importance, specific regions of interphase chromosomes
decondense to allow access to specific DNA sequences for gene expression, DNA repair,
and replication, and then recondense when these processes are completed. The
packaging of chromosomes is therefore accomplished in a way that allows rapid
localized, on-demand access to the DNA. In the next sections, we discuss the
specialized proteins that make this type of packaging possible.

Nucleosomes Are a Basic Unit of Eukaryotic Chromosome
Structure

The proteins that bind to the DNA to form eukaryotic chromosomes are
traditionally divided into two classes: the histones and the non-histone chromosomal
proteins, each contributing about the same mass to a chromosome as the DNA.
The complex of both classes of protein with the nuclear DNA of eukaryotic cells is
known as chromatin (Figure 4-20).

Histones are responsible for the first and most basic level of chromosome
packing, the nucleosome, a protein-DNA complex discovered in 1974. When
interphase nuclei are broken open very gently and their contents examined under
the electron microscope, most of the chromatin appears to be in the form of a
fiber with a diameter of about 30 nm (Figure 4-21A). If this chromatin is
subjected to treatments that cause it to unfold partially, it can be seen under the
electron microscope as a series of "beads on a string" (Figure 4-21B). The string is
DNA, and each bead is a "nucleosome core particle" that consists of DNA wound
around a histone core (Movie 4.2).

The structural organization of nucleosomes was determined after first isolating
them from unfolded chromatin by digestion with particular enzymes (called
nucleases) that break down DNA by cutting between the nucleosomes. After
digestion for a short period, the exposed DNA between the nucleosome core
particles, the linker DNA, is degraded. Each individual nucleosome core particle
consists of a complex of eight histone proteins (two molecules each of histones H2A,
H2B, H3, and H4) and double-stranded DNA that is 147 nucleotide pairs long.
The histone octamer forms a protein core around which the double-stranded DNA
is wound (Figure 4-22).

Figure 4-20 Chromatin. As illustrated, chromatin consists of DNA bound to both
histone and non-histone proteins. The mass of histone protein present is about
equal to the total mass of non-histone protein, but, as schematically indicated
here, the latter class is composed of an enormous number of different species. In
total, a chromosome is about one-third DNA and two-thirds protein by mass.

Figure 4-21 Nucleosomes as seen in the electron microscope. (A) Chromatin
isolated directly from an interphase nucleus appears in the electron microscope as a
thread about 30 nm thick. (B) This electron micrograph shows a length of chromatin
that has been experimentally unpacked, or decondensed, after isolation to show
the nucleosomes. (A, courtesy of Barbara Hamkalo; B, courtesy of Victoria Foe.)

The region of linker DNA that separates each nucleosome core particle from
the next can vary in length from a few nucleotide pairs up to about 80. (The term
nucleosome technically refers to a nucleosome core particle plus one of its adjacent
DNA linkers, but it is often used synonymously with nucleosome core particle.)
On average, therefore, nucleosomes repeat at intervals of about 200 nucleotide
pairs. For example, a diploid human cell with 6.4 × 10^9 nucleotide pairs contains
approximately 30 million nucleosomes. The formation of nucleosomes converts a
DNA molecule into a chromatin thread about one-third of its initial length.
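
The nucleosome count follows from dividing the genome length by the repeat length, as this short Python sketch shows. The numbers are those quoted above; the names are illustrative, and the average linker length is simply the difference between the repeat and the core, not a separately measured value.

```python
# Estimating the number of nucleosomes in a diploid human nucleus.
DIPLOID_BASE_PAIRS = 6_400_000_000   # diploid human genome (from the text)
CORE_DNA_BP = 147                    # DNA wrapped around one histone octamer
REPEAT_BP = 200                      # average nucleosome repeat length (from the text)

average_linker_bp = REPEAT_BP - CORE_DNA_BP      # implied average linker length
nucleosomes = DIPLOID_BASE_PAIRS // REPEAT_BP    # one nucleosome per repeat

print(f"implied average linker: {average_linker_bp} nucleotide pairs")
print(f"nucleosomes per diploid nucleus: about {nucleosomes:,}")
```

The implied linker of 53 nucleotide pairs lies within the stated range of a few to about 80, and the total of 32 million rounds to the "approximately 30 million" given in the text.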

The Structure of the Nucleosome Core Particle Reveals How DNA
Is Packaged

The high-resolution structure of a nucleosome core particle, solved in 1997,
revealed a disc-shaped histone core around which the DNA was tightly wrapped
in a left-handed coil of 1.7 turns (Figure 4-23). All four of the histones that make
up the core of the nucleosome are relatively small proteins (102-135 amino acids),
and they share a structural motif, known as the histone fold, formed from three α
helices connected by two loops (Figure 4-24). In assembling a nucleosome, the
histone folds first bind to each other to form H3-H4 and H2A-H2B dimers, and
the H3-H4 dimers combine to form tetramers. An H3-H4 tetramer then further
combines with two H2A-H2B dimers to form the compact octamer core, around
which the DNA is wound.

The interface between DNA and histone is extensive: 142 hydrogen bonds are
formed between DNA and the histone core in each nucleosome. Nearly half of
these bonds form between the amino acid backbone of the histones and the
sugar-phosphate backbone of the DNA. Numerous hydrophobic interactions and salt
linkages also hold DNA and protein together in the nucleosome. More than
one-fifth of the amino acids in each of the core histones are either lysine or arginine
(two amino acids with basic side chains), and their positive charges can effectively


Figure 4-22 Structural organization of the nucleosome. A nucleosome
contains a protein core made of eight histone molecules. In biochemical
experiments, the nucleosome core particle can be released from isolated
chromatin by digestion of the linker DNA with a nuclease, an enzyme that
breaks down DNA. (The nuclease can degrade the exposed linker DNA but
cannot attack the DNA wound tightly around the nucleosome core.) After
dissociation of the isolated nucleosome into its protein core and DNA, the
length of the DNA that was wound around the core can be determined.
This length of 147 nucleotide pairs is sufficient to wrap 1.7 times around the
histone core.
Figure 4-23 The structure of a nucleosome core particle, as determined by x-ray
diffraction analyses of crystals. Each histone is colored according to the scheme
in Figure 4-22, with the DNA double helix in light gray. Panels: side view; edge
view. (Adapted from K. Luger et al., Nature 389:251-260, 1997. With permission
from Macmillan Publishers Ltd.)

neutralize the negatively charged DNA backbone. These numerous interactions
explain in part why DNA of virtually any sequence can be bound on a histone
octamer core. The path of the DNA around the histone core is not smooth; rather,
several kinks are seen in the DNA, as expected from the nonuniform surface of the



Figure 4-24 The overall structural organization of the core histones. (A) Each of the core
histones contains an N-terminal tail, which is subject to several forms of covalent modification, and
a histone fold region, as indicated. (B) The structure of the histone fold, which is formed by all four
of the core histones. (C) Histones H2A and H2B form a dimer through an interaction known as the
"handshake." Histones H3 and H4 form a dimer through the same type of interaction. (D) The final
histone octamer on DNA. Note that all eight N-terminal tails of the histones protrude from the
disc-shaped core structure. Their conformations are highly flexible, and they serve as binding sites for
sets of other proteins.
core. The bending requires a substantial compression of the minor groove of the
DNA helix. Certain dinucleotides in the minor groove are especially easy to
compress, and some nucleotide sequences bind the nucleosome more tightly than
others (Figure 4-25). This probably explains some striking, but unusual, cases
of very precise positioning of nucleosomes along a stretch of DNA. However, the
sequence preference of nucleosomes must be weak enough to allow other factors
to dominate, inasmuch as nucleosomes can occupy any one of a number of
positions relative to the DNA sequence in most chromosomal regions.

In addition to its histone fold, each of the core histones has an N-terminal
amino acid "tail," which extends out from the DNA-histone core (see Figure
4-24D). These histone tails are subject to several different types of covalent
modifications that in turn control critical aspects of chromatin structure and function,
as we shall discuss shortly.

As a reflection of their fundamental role in DNA function through controlling
chromatin structure, the histones are among the most highly conserved
eukaryotic proteins. For example, the amino acid sequence of histone H4 from a pea
differs from that of a cow at only 2 of the 102 positions. This strong
evolutionary conservation suggests that the functions of histones involve nearly all of their
amino acids, so that a change in any position is deleterious to the cell. But in
addition to this remarkable conservation, eukaryotic organisms also produce smaller
amounts of specialized variant core histones that differ in amino acid sequence
from the main ones. As discussed later, these variants, combined with the
surprisingly large number of covalent modifications that can be added to the histones in
nucleosomes, give rise to a variety of chromatin structures in cells.

Nucleosomes Have a Dynamic Structure, and Are Frequently
Subjected to Changes Catalyzed by ATP-Dependent Chromatin
Remodeling Complexes

For many years biologists thought that, once formed in a particular position on
DNA, a nucleosome would remain fixed in place because of the very tight
association between its core histones and DNA. If true, this would pose problems for
genetic readout mechanisms, which in principle require easy access to many
specific DNA sequences. It would also hinder the rapid passage of the DNA
transcription and replication machinery through chromatin. But kinetic experiments
show that the DNA in an isolated nucleosome unwraps from each end at a rate of
about four times per second, remaining exposed for 10 to 50 milliseconds before
the partially unwrapped structure recloses. Thus, most of the DNA in an isolated
nucleosome is in principle available for binding other proteins.
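
To get a feel for these kinetics, one can estimate what fraction of the time one
end of the nucleosomal DNA is unwrapped. The sketch below is a back-of-the-envelope
calculation, not a result from the text: it assumes that the quoted rate and
exposure times apply to a single DNA end and that unwrapping events never overlap.
The function and variable names are illustrative.

```python
# Estimate how much of the time one DNA end of an isolated nucleosome is exposed.
UNWRAP_EVENTS_PER_SECOND = 4    # "about four times per second"
EXPOSED_SECONDS_MIN = 0.010     # 10 milliseconds
EXPOSED_SECONDS_MAX = 0.050     # 50 milliseconds


def exposed_fraction(events_per_second, seconds_per_event):
    """Fraction of time spent unwrapped, assuming events never overlap."""
    return events_per_second * seconds_per_event


low = exposed_fraction(UNWRAP_EVENTS_PER_SECOND, EXPOSED_SECONDS_MIN)
high = exposed_fraction(UNWRAP_EVENTS_PER_SECOND, EXPOSED_SECONDS_MAX)

print(f"DNA end exposed roughly {low:.0%} to {high:.0%} of the time")
```

Under these assumptions the end is exposed for somewhere between a few percent
and about one-fifth of the time, which makes plausible the statement that the DNA
is in principle available for binding other proteins.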

For the chromatin in a cell, a further loosening of DNA-histone contacts is
clearly required, because eukaryotic cells contain a large variety of
ATP-dependent chromatin remodeling complexes. These complexes include a subunit that
hydrolyzes ATP (an ATPase evolutionarily related to the DNA helicases discussed
in Chapter 5). This subunit binds both to the protein core of the nucleosome and
to the double-stranded DNA that winds around it. By using the energy of ATP
hydrolysis to move this DNA relative to the core, the protein complex changes the
structure of a nucleosome temporarily, making the DNA less tightly bound to the
histone core. Through repeated cycles of ATP hydrolysis that pull the nucleosome
core along the DNA double helix, the remodeling complexes can catalyze
nucleosome sliding. In this way, they can reposition nucleosomes to expose specific
regions of DNA, thereby making them available to other proteins in the cell
(Figure 4-26). In addition, by cooperating with a variety of other proteins that bind to
histones and serve as histone chaperones, some remodeling complexes are able to
remove either all or part of the nucleosome core from a nucleosome, catalyzing
either an exchange of its H2A-H2B histones or the complete removal of the
octameric core from the DNA (Figure 4-27). Measurements reveal that, as a result of
such processes, a typical nucleosome is replaced on the DNA every one or two
hours inside the cell.
Figure 4-25 The bending of DNA in a nucleosome. The DNA helix makes
1.7 tight turns around the histone octamer. This diagram illustrates how the minor
groove is compressed on the inside of the turn. Owing to structural features of the
DNA molecule, the indicated dinucleotides are preferentially accommodated in such
a narrow minor groove, which helps to explain why certain DNA sequences
will bind more tightly than others to the nucleosome core.

Cells contain dozens of different ATP-dependent chromatin remodeling
complexes that are specialized for different roles. Most are large protein complexes
that can contain 10 or more subunits, some of which bind to specific
modifications on histones (see Figure 4-26C). The activity of these complexes is carefully
controlled by the cell. As genes are turned on and off, chromatin remodeling
complexes are brought to specific regions of DNA where they act locally to influence
chromatin structure (discussed in Chapter 7; see also Figure 4-40, below).

Although some DNA sequences bind more tightly than others to the
nucleosome core (see Figure 4-25), the most important influence on nucleosome
positioning appears to be the presence of other tightly bound proteins on the DNA.
Some bound proteins favor the formation of a nucleosome adjacent to them.
Others create obstacles that force the nucleosomes to move elsewhere. The exact
positions of nucleosomes along a stretch of DNA therefore depend mainly on the
presence and nature of other proteins bound to the DNA. And because of the presence
of ATP-dependent chromatin remodeling complexes, the arrangement of
nucleosomes on DNA can be highly dynamic, changing rapidly according to the needs
of the cell.

Nucleosomes Are Usually Packed Together into a Compact
Chromatin Fiber

Although enormously long strings of nucleosomes form on the chromosomal
DNA, chromatin in a living cell probably rarely adopts the extended
"beads-on-a-string" form. Instead, the nucleosomes are packed on top of one another,
generating arrays in which the DNA is even more highly condensed. Thus, when nuclei
are very gently lysed onto an electron microscope grid, much of the chromatin is
seen to be in the form of a fiber with a diameter of about 30 nm, which is
considerably wider than chromatin in the "beads-on-a-string" form (see Figure 4-21).





Figure 4-26 The nucleosome sliding catalyzed by ATP-dependent chromatin
remodeling complexes. (A) Using the energy of ATP hydrolysis, the remodeling
complex is thought to push on the DNA of its bound nucleosome and loosen its
attachment to the nucleosome core. Each cycle of ATP binding, ATP hydrolysis, and
release of the ADP and Pi products thereby moves the DNA with respect to the
histone octamer in the direction of the arrow in this diagram. It requires many such
cycles to produce the nucleosome sliding shown. (B) The structure of a
nucleosome-bound dimer of the two identical ATPase subunits (green) that slide
nucleosomes back and forth in the ISW1 family of chromatin remodeling complexes.
(C) The structure of a large chromatin remodeling complex, showing how it is
thought to wrap around a nucleosome. Modeled in green is the yeast RSC complex,
which contains 15 subunits, including an ATPase and at least four subunits with
domains that recognize specific covalently modified histones. (B, from
L.R. Racki et al., Nature 462:1016-1021, 2009. With permission from Macmillan
Publishers Ltd; C, adapted from A.E. Leschziner et al., Proc. Natl Acad. Sci.
USA 104:4913-4918, 2007.)
Figure 4-27 Nucleosome removal and histone exchange catalyzed by ATP-dependent
chromatin remodeling complexes. By cooperating with specific members of a large
family of different histone chaperones, some chromatin remodeling complexes can
remove the H2A-H2B dimers from a nucleosome (top series of reactions) and replace
them with dimers that contain a variant histone, such as the H2AZ-H2B dimer (see
Figure 4-35). Other remodeling complexes are attracted to specific sites on
chromatin and cooperate with histone chaperones to remove the histone octamer
completely and/or to replace it with a different nucleosome core (bottom series of
reactions). Highly simplified views of the processes are illustrated here.



How nucleosomes are organized into condensed arrays is unclear. The
structure of a tetranucleosome (a complex of four nucleosomes) obtained by x-ray
crystallography and high-resolution electron microscopy of reconstituted
chromatin have been used to support a zigzag model for the stacking of nucleosomes
in a 30-nm fiber (Figure 4-28). But cryoelectron microscopy of carefully prepared
nuclei suggests that most regions of chromatin are less regularly structured.

What causes nucleosomes to stack so tightly on each other?
Nucleosome-to-nucleosome linkages that involve histone tails, most notably the H4 tail,
constitute one important factor (Figure 4-29). Another important factor is an additional
histone that is often present in a 1-to-1 ratio with nucleosome cores, known as
histone H1. This so-called linker histone is larger than the individual core histones
and it has been considerably less well conserved during evolution. A single
histone H1 molecule binds to each nucleosome, contacting both DNA and protein,
and changing the path of the DNA as it exits from the nucleosome. This change in
the exit path of DNA is thought to help compact nucleosomal DNA (Figure 4-30).


Figure 4-28 A zigzag model for the 30-nm chromatin fiber. (A) The conformation
of two of the four nucleosomes in a tetranucleosome, from a structure
determined by x-ray crystallography. (B) Schematic of the entire tetranucleosome;
the fourth nucleosome is not visible, being stacked on the bottom nucleosome and
behind it in this diagram. (C) Diagrammatic illustration of a possible zigzag
structure that could account for the 30-nm chromatin fiber. (A, PDB code: 1ZBB;
C, adapted from C.L. Woodcock, Nat. Struct. Mol. Biol. 12:639-640, 2005. With
permission from Macmillan Publishers Ltd.)

Most eukaryotic organisms make several histone H1 proteins of related but quite
distinct amino acid sequences. The presence of many other DNA-binding
proteins, as well as proteins that bind directly to histones, is certain to add important
additional features to any array of nucleosomes.

Summary 

A gene is a nucleotide sequence in a DNA molecule that acts as a functional unit
for the production of a protein, a structural RNA, or a catalytic or regulatory RNA
molecule. In eukaryotes, protein-coding genes are usually composed of a string of
alternating introns and exons associated with regulatory regions of DNA. A
chromosome is formed from a single, enormously long DNA molecule that contains a
linear array of many genes, bound to a large set of proteins. The human genome
contains 3.2 × 10^9 DNA nucleotide pairs, divided between 22 different autosomes
(present in two copies each) and 2 sex chromosomes. Only a small percentage of this
DNA codes for proteins or functional RNA molecules. A chromosomal DNA
molecule also contains three other types of important nucleotide sequences: replication
origins and telomeres allow the DNA molecule to be efficiently replicated, while a
centromere attaches the sister DNA molecules to the mitotic spindle, ensuring their
accurate segregation to daughter cells during the M phase of the cell cycle.

The DNA in eukaryotes is tightly bound to an equal mass of histones, which
form repeated arrays of DNA-protein particles called nucleosomes. The nucleosome
is composed of an octameric core of histone proteins around which the DNA
double helix is wrapped. Nucleosomes are spaced at intervals of about 200 nucleotide
pairs, and they are usually packed together (with the aid of histone H1 molecules)
into quasi-regular arrays to form a 30-nm chromatin fiber. Even though compact,
the structure of chromatin must be highly dynamic to allow access to the DNA.
There is some spontaneous DNA unwrapping and rewrapping in the nucleosome
itself; however, the general strategy for reversibly changing local chromatin
structure features ATP-driven chromatin remodeling complexes. Cells contain a large set
of such complexes, which are targeted to specific regions of chromatin at
appropriate times. The remodeling complexes collaborate with histone chaperones to allow
nucleosome cores to be repositioned, reconstituted with different histones, or
completely removed to expose the underlying DNA.


Figure 4-29 A model for the role played by histone tails in the compaction of
chromatin. (A) A schematic diagram shows the approximate exit points of
the eight histone tails, one from each histone protein, that extend from each
nucleosome. The actual structure is shown to its right. In the high-resolution
structure of the nucleosome, the tails are largely unstructured, suggesting that
they are highly flexible. (B) As indicated, the histone tails are thought to be
involved in interactions between nucleosomes that help to pack them together.
(A, PDB code: 1KX5.)



Figure 4-30 How the linker histone binds to the nucleosome. The position
and structure of histone H1 are shown. The H1 core region constrains an additional
20 nucleotide pairs of DNA where it exits from the nucleosome core and is important
for compacting chromatin. (A) Schematic, and (B) structure inferred for a single
nucleosome from a structure determined by high-resolution electron microscopy of
a reconstituted chromatin fiber (C). (B and C, adapted from F. Song et al., Science
344:376-380, 2014.)

CHROMATIN STRUCTURE AND FUNCTION

Having described how DNA is packaged into nucleosomes to create a chromatin
fiber, we now turn to the mechanisms that create different chromatin structures
in different regions of a cell's genome. Mechanisms of this type have a variety of
important functions in cells. Most strikingly, certain types of chromatin structure
can be inherited; that is, the structure can be directly passed down from a cell
to its descendants. Because the cell memory that results is based on an
inherited chromatin structure rather than on a change in DNA sequence, this is a form
of epigenetic inheritance. The prefix epi is Greek for "on"; this is appropriate,
because epigenetics represents a form of inheritance that is superimposed on the
genetic inheritance based on DNA.

In Chapter 7, we shall introduce the many different ways in which the
expression of genes is regulated. There we discuss epigenetic inheritance in detail and
present several different mechanisms that can produce it. Here, we are
concerned with only one, that based on chromatin structure. We begin this section by
reviewing the observations that first demonstrated that chromatin structures can
be inherited. We then describe some of the chemistry that makes this possible:
the covalent modification of histones in nucleosomes. These modifications have
many functions, inasmuch as they serve as recognition sites for protein domains
that link specific protein complexes to different regions of chromatin. Histones
thereby have effects on gene expression, as well as on many other DNA-linked
processes. Through such mechanisms, chromatin structure plays an important
role in the development, growth, and maintenance of all eukaryotic organisms,
including ourselves.

Heterochromatin Is Highly Organized and Restricts Gene 
Expression 

Light-microscope studies in the 1930s distinguished two types of chromatin in
the interphase nuclei of many higher eukaryotic cells: a highly condensed form,
called heterochromatin, and all the rest, which is less condensed, called
euchromatin. Heterochromatin represents an especially compact form of chromatin
(see Figure 4-9), and we are finally beginning to understand its molecular
properties. It is highly concentrated in certain specialized regions, most notably at the
centromeres and telomeres introduced previously (see Figure 4-19), but it is also
present at many other locations along chromosomes, at locations that can vary
according to the physiological state of the cell. In a typical mammalian cell, more
than 10% of the genome is packaged in this way.

The DNA in heterochromatin typically contains few genes, and when
euchromatic regions are converted to a heterochromatic state, their genes are generally
switched off as a result. However, we know now that the term heterochromatin
encompasses several distinct modes of chromatin compaction that have different
implications for gene expression. Thus, heterochromatin should not be thought
of as simply encapsulating "dead" DNA, but rather as a descriptor for compact
chromatin domains that share the common feature of being unusually resistant
to gene expression.

The Heterochromatic State Is Self-Propagating 

Through chromosome breakage and rejoining, whether brought about by a
natural genetic accident or by experimental artifice, a piece of chromosome that is
normally euchromatic can be translocated into the neighborhood of
heterochromatin. Remarkably, this often causes silencing (inactivation) of the normally
active genes. This phenomenon is referred to as a position effect. It reflects a
spreading of the heterochromatic state into the originally euchromatic region,
and it has provided important clues to the mechanisms that create and maintain
heterochromatin. First recognized in Drosophila, position effects have now been
observed in many eukaryotes, including yeasts, plants, and humans.

Figure 4-31 The cause of position effect variegation in Drosophila. (A) Heterochromatin (green) is normally prevented from
spreading into adjacent regions of euchromatin (red) by barrier DNA sequences, which we shall discuss shortly. In flies that
inherit certain chromosomal rearrangements, however, this barrier is no longer present. (B) During the early development of such
flies, heterochromatin can spread into neighboring chromosomal DNA, proceeding for different distances in different cells. This
spreading soon stops, but the established pattern of heterochromatin is subsequently inherited, so that large clones of progeny
cells are produced that have the same neighboring genes condensed into heterochromatin and thereby inactivated (hence the
"variegated" appearance of some of these flies; see Figure 4-32). Although "spreading" is used to describe the formation of
new heterochromatin close to previously existing heterochromatin, the term may not be wholly accurate. There is evidence that
during expansion, the condensation of DNA into heterochromatin can "skip over" some regions of chromatin, sparing the genes
that lie within them from repressive effects.


In chromosome breakage-and-rejoining events of the sort just described, the
zone of silencing, where euchromatin is converted to a heterochromatic state,
spreads for different distances in different early cells in the fly embryo.
Remarkably, these differences then are perpetuated for the rest of the animal's life: in
each cell, once the heterochromatic condition is established on a piece of
chromatin, it tends to be stably inherited by all of that cell's progeny (Figure 4-31). This
remarkable phenomenon, called position effect variegation, was first recognized
through a detailed genetic analysis of the mottled loss of red pigment in the fly eye
(Figure 4-32). It shares features with the extensive spread of heterochromatin that
inactivates one of the two X chromosomes in female mammals. There too, a
random process acts in each cell of the early embryo to dictate which X chromosome
will be inactivated, and that same X chromosome then remains inactive in all the
cell's progeny, creating a mosaic of different clones of cells in the adult body (see
Figure 7-50).

These observations, taken together, point to a fundamental strategy of
heterochromatin formation: heterochromatin begets more heterochromatin. This
positive feedback can operate both in space, causing the heterochromatic state to
spread along the chromosome, and in time, across cell generations, propagating
the heterochromatic state of the parent cell to its daughters. The challenge is to
explain the molecular mechanisms that underlie this remarkable behavior.
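
The two halves of this feedback, variable spreading in space followed by faithful
inheritance in time, can be illustrated with a toy simulation. The sketch below is
purely illustrative and makes no claim about real mechanisms or magnitudes: the
spreading distances, the position of the gene, the number of founder cells, and
the number of divisions are all arbitrary assumptions, and every name in it is
hypothetical.

```python
import random


def spread_distance(rng, max_distance):
    """Pick how far heterochromatin spreads in one early embryonic cell."""
    return rng.randint(0, max_distance)


def grow_clone(silenced, divisions):
    """Every descendant inherits the founder's chromatin state unchanged."""
    return [silenced] * (2 ** divisions)


def simulate_eye(n_founders=8, gene_distance=5, max_distance=10,
                 divisions=3, seed=0):
    """Return one clone (a list of silenced/active flags) per founder cell."""
    rng = random.Random(seed)
    patches = []
    for _ in range(n_founders):
        # The gene is silenced only if spreading reaches its position.
        silenced = spread_distance(rng, max_distance) >= gene_distance
        patches.append(grow_clone(silenced, divisions))
    return patches


patches = simulate_eye()
for clone in patches:
    # W = white patch (gene silenced), R = red patch (gene expressed)
    print("".join("W" if silenced else "R" for silenced in clone))
```

Each printed row is one clone. Because the state is set once in the founder and
then copied without change, every row is uniform, while different rows can differ;
this is the mottled pattern of patches that the model is meant to mimic.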
Figure 4-32 The discovery of position effects on gene expression. The White
gene in the fruit fly Drosophila controls eye pigment production and is named
after the mutation that first identified it. Wild-type flies with a normal White
gene (White+) have normal pigment production, which gives them red eyes, but if
the White gene is mutated and inactivated, the mutant flies (White−) make no
pigment and have white eyes. In flies in which a normal White gene has been moved
near a region of heterochromatin, the eyes are mottled, with both red and white
patches. The white patches represent cell lineages in which the White gene has
been silenced by the effects of the heterochromatin. In contrast, the red patches
represent cell lineages in which the White gene is expressed. Early in
development, when the heterochromatin is first formed, it spreads into neighboring
euchromatin to different extents in different embryonic cells (see Figure 4-31).
The presence of large patches of red and white cells reveals that the state of
transcriptional activity, as determined by the packaging of this gene into
chromatin in those ancestor cells, is inherited by all daughter cells.

Figure 4-33 Some prominent types of covalent amino acid side-chain
modifications found on nucleosomal histones. (A) Lysine acetylation and
methylation are competing reactions. Three different levels of lysine methylation
are shown; each can be recognized by a different binding protein and thus each can
have a different significance for the cell. Note that acetylation removes the plus
charge on lysine, and that, most importantly, an acetylated lysine cannot be
methylated, and vice versa. (B) Serine phosphorylation adds a negative charge to a
histone. Modifications of histones not shown here include the mono- or
dimethylation of an arginine, the phosphorylation of a threonine, the addition of
ADP-ribose to a glutamic acid, and the addition of a ubiquityl, sumoyl, or biotin
group to a lysine.


As a first step, one can carry out a search for the molecules that are involved.
This has been done by means of genetic screens, in which large numbers of
mutants are generated, after which one picks out those that show an
abnormality of the process in question. Extensive genetic screens in Drosophila, fungi, and
mice have identified more than 100 genes whose products either enhance or
suppress the spread of heterochromatin and its stable inheritance; in other words,
genes that serve as either enhancers or suppressors of position effect variegation.
Many of these genes turn out to code for non-histone chromosomal proteins that
interact with histones and are involved in modifying or maintaining chromatin
structure. We shall discuss how they work in the sections that follow.


The Core Histones Are Covalently Modified at Many Different Sites

The amino acid side chains of the four histones in the nucleosome core are
subjected to a remarkable variety of covalent modifications, including the acetylation
of lysines, the mono-, di-, and trimethylation of lysines, and the phosphorylation
of serines (Figure 4-33). A large number of these side-chain modifications occur
on the eight relatively unstructured N-terminal "histone tails" that protrude from
the nucleosome (Figure 4-34). However, there are also more than 20 specific
side-chain modifications on the nucleosome's globular core.

All of the above types of modifications are reversible, with one enzyme
serving to create a particular type of modification, and another to remove it. These
enzymes are highly specific. Thus, for example, acetyl groups are added to specific
lysines by a set of different histone acetyl transferases (HATs) and removed by a set
of histone deacetylase complexes (HDACs). Likewise, methyl groups are added to
lysine side chains by a set of different histone methyl transferases and removed
by a set of histone demethylases. Each enzyme is recruited to specific sites on
the chromatin at defined times in each cell's life history. For the most part, the
initial recruitment depends on transcription regulator proteins (sometimes called
"transcription factors"). As we shall explain in Chapter 7, these proteins recognize
and bind to specific DNA sequences in the chromosomes. They are produced at
Figure 4-34 The covalent modification of core histone tails. (A) The structure of the nucleosome highlighting the location of
the first 30 amino acids in each of its eight N-terminal histone tails (green). These tails are unstructured and highly mobile, and
thus will change their conformation depending on other bound proteins. (B) Well-documented modifications of the four histone
core proteins are indicated. Although only a single symbol is used here for methylation (M), each lysine (K) or arginine (R) can be
methylated in several different ways. Note also that some positions (e.g., lysine 9 of H3) can be modified either by methylation
or by acetylation, but not both. Most of the modifications shown add a relatively small molecule onto the histone tails; the
exception is ubiquitin, a 76-amino-acid protein also used for other cell processes (see Figure 3-69). Not shown are more than
20 possible modifications located in the globular core of the histones. (A, PDB: 1KX5; B, adapted from H. Santos-Rosa and
C. Caldas, Eur. J. Cancer 41:2381-2402, 2005. With permission from Elsevier.)


different times and places in the life of an organism, thereby determining where
and when the chromatin-modifying enzymes will act. In this way, the DNA
sequence ultimately determines how histones are modified. But in at least some
cases, the covalent modifications on nucleosomes can persist long after the
transcription regulator proteins that first induced them have disappeared, thereby
providing the cell with a memory of its developmental history. Most remarkably,
as in the related phenomenon of position effect variegation discussed above, this
memory can be transmitted from one cell generation to the next.

Very different patterns of covalent modification are found on different groups
of nucleosomes, depending both on their exact position in the genome and on
the history of the cell. The modifications of the histones are carefully controlled,
and they have important consequences. The acetylation of lysines on the
N-terminal tails loosens chromatin structure, in part because adding an acetyl group
to lysine removes its positive charge, thereby reducing the affinity of the tails for
adjacent nucleosomes. However, the most profound effects of the histone
modifications lie in their ability to recruit specific other proteins to the modified stretch
of chromatin. Trimethylation of one specific lysine on the histone H3 tail, for
instance, attracts the heterochromatin-specific protein HP1 and contributes to
the establishment and spread of heterochromatin. More generally, the recruited
proteins act with the modified histones to determine how and when genes will be
expressed, as well as other chromosome functions. In this way, the precise
structure of each domain of chromatin governs the readout of the genetic information
that it contains, and thereby the structure and function of the eukaryotic cell.

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Figure 4-35 The structure of some histone 
variants compared with the major histone 
that they replace. The histone variants 
are inserted into nucléosomes at spécifie 
sites on chromosomes by ATP-dependent 
chromatin remodeling enzymes that act in 
concert with histone chaperones (see Figure 
4-27). The CENP-A (Centromere Protein-A) 
variant of histone H3 is discussed later in 
this chapter (see Figure 4-42); other variants 
are discussed in Chapter 7. The sequences 
in each variant that are colored differently 
(compared to the major histone above it) 
dénoté régions with an amino acid sequence 
different from this major histone. (Adapted 
from K. Sarma and D. Reinberg, Nat. Rev. 
Mol. Cell Biol. 6:139-149, 2005. With 
permission from Macmillan Publishers Ltd.) 


Chromatin Acquires Additional Variety Through the Site-Specific 
Insertion of a Small Set of Histone Variants 

In addition to the four highly conserved standard core histones, eukaryotes
contain a few variant histones that can also assemble into nucleosomes. These
histones are present in much smaller amounts than the major histones, and they
have been less well conserved over long evolutionary times. Variants are known
for each of the core histones with the exception of H4; some examples are shown
in Figure 4-35.

The major histones are synthesized primarily during the S phase of the cell
cycle and assembled into nucleosomes on the daughter DNA helices just behind
the replication fork (see Figure 5-32). In contrast, most histone variants are
synthesized throughout interphase. They are often inserted into already-formed
chromatin, which requires a histone-exchange process catalyzed by the
ATP-dependent chromatin remodeling complexes discussed previously. These
remodeling complexes contain subunits that cause them to bind both to specific sites on
chromatin and to histone chaperones that carry a particular variant. As a result,
each histone variant is inserted into chromatin in a highly selective manner (see
Figure 4-27).

Covalent Modifications and Histone Variants Act in Concert to 
Control Chromosome Functions 

The number of possible distinct markings on an individual nucleosome is in
principle enormous, and this potential for diversity is still greater when we allow for
nucleosomes that contain histone variants. However, the histone modifications
are known to occur in coordinated sets, and more than 15 such sets can be identified
in mammalian cells. It is not yet clear, though, how many different types of
chromatin are functionally important in cells.
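
A rough count illustrates why the number of possible markings is so large. The sketch below assumes, purely for illustration, that one histone tail carries 10 modifiable lysines and that each lysine is independently in one of five states (unmodified, acetylated, or mono-, di-, or trimethylated). The site count, the function name, and the independence assumption are hypothetical; real tails also carry other kinds of modification, and the marks occur in coordinated sets rather than independently.

```python
# Illustrative count of possible lysine markings on one histone tail.
# ASSUMPTIONS (not from the text): 10 modifiable lysines, each independent.
LYSINE_STATES = ("unmodified", "acetyl", "monomethyl", "dimethyl", "trimethyl")


def count_markings(num_lysines: int, states_per_lysine: int = len(LYSINE_STATES)) -> int:
    """Return the number of distinct marking patterns if every lysine
    can independently adopt any one of the allowed states."""
    if num_lysines < 0:
        raise ValueError("num_lysines must be non-negative")
    return states_per_lysine ** num_lysines


if __name__ == "__main__":
    one_tail = count_markings(10)
    print(f"One tail, 10 lysines: {one_tail:,} patterns")
    # Two such tails marked independently of each other:
    print(f"Two such tails: {one_tail ** 2:,} patterns")
```

Even this deliberately small model gives nearly ten million patterns for a single tail, which is why the observation that modifications occur in a limited number of coordinated sets is significant.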

Some combinations are known to have a specific meaning for the cell in the
sense that they determine how and when the DNA packaged in the nucleosomes
is to be accessed or manipulated, a fact that led to the idea of a “histone code.”
For example, one type of marking signals that a stretch of chromatin has been
newly replicated, another signals that the DNA in that chromatin has been
damaged and needs repair, while others signal when and how gene expression should
take place. Various regulatory proteins contain small domains that bind to
specific marks, recognizing, for example, a trimethylated lysine 4 on histone H3
(Figure 4-36). These domains are often linked together as modules in a single large
protein or protein complex, which thereby recognizes a specific combination of
histone modifications (Figure 4-37). The result is a reader complex that allows
particular combinations of markings on chromatin to attract additional proteins,
so as to execute an appropriate biological function at the right time (Figure 4-38).

Figure 4-36 How a mark on a nucleosome is read. The figure shows the structure of a protein module (called an ING PHD
domain) that specifically recognizes histone H3 trimethylated on lysine 4. (A) A trimethyl group. (B) Space-filling model of an ING
PHD domain bound to a histone tail (green, with the trimethyl group highlighted in yellow). (C) A ribbon model showing how
the N-terminal six amino acids in the H3 tail are recognized. The red lines represent hydrogen bonds. This is one of a family of
PHD domains that recognize methylated lysines on histones; different members of the family bind tightly to lysines located at
different positions, and they can discriminate between a mono-, di-, and trimethylated lysine. In a similar way, other small protein
modules recognize specific histone side chains that have been marked with acetyl groups, phosphate groups, and so on.
(Adapted from P.V. Pena et al., Nature 442:100-103, 2006. With permission from Macmillan Publishers Ltd.)
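
The combinatorial logic of a reader complex can be sketched as a simple set comparison. In the toy model below, a complex binds only if enough of its modules find their mark on the same nucleosome. The mark labels (`H3K4me3`, `H4K16ac`, `H3K9me3`), the function name, and the all-or-nothing threshold are illustrative assumptions; real binding is graded and depends on affinities that this sketch ignores.

```python
def reader_complex_binds(nucleosome_marks, module_targets, required=None):
    """Return True if the reader complex would bind in this toy model.

    nucleosome_marks: iterable of mark labels present on the nucleosome.
    module_targets:   iterable of marks, one per reader module in the complex.
    required:         how many modules must find their mark (default: all).
    """
    targets = set(module_targets)
    if required is None:
        required = len(targets)
    matched = targets & set(nucleosome_marks)
    return len(matched) >= required


# Hypothetical two-module reader, loosely modeled on the PHD domain plus
# bromodomain pair of Figure 4-37 (the labels are shorthand, not from the text).
two_module_reader = ["H3K4me3", "H4K16ac"]

if __name__ == "__main__":
    print(reader_complex_binds({"H3K4me3", "H4K16ac"}, two_module_reader))  # both marks present
    print(reader_complex_binds({"H3K4me3"}, two_module_reader))             # only one mark present
    print(reader_complex_binds({"H3K9me3"}, two_module_reader))             # neither mark present
```

Requiring every module to match is the simplest way to capture the idea that only a specific combination of marks recruits the complex; lowering `required` would model a more permissive reader.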

The marks on nucleosomes due to covalent additions to histones are dynamic,
being constantly removed and added at rates that depend on their chromosomal
locations. Because the histone tails extend outward from the nucleosome core
and are likely to be accessible even when chromatin is condensed, they would
seem to provide an especially suitable format for creating marks that can be
readily altered as a cell's needs change. Although much remains to be learned
about the meaning of the different histone modifications, a few well-studied
examples of the information that can be encoded in the histone H3 tail are listed
in Figure 4-39.


A Complex of Reader and Writer Proteins Can Spread Specific
Chromatin Modifications Along a Chromosome

The phenomenon of position effect variegation described previously requires that
some modified forms of chromatin have the ability to spread for substantial
distances along a chromosomal DNA molecule (see Figure 4-31). How is this
possible?

Figure 4-37 Recognition of a specific combination of marks on a
nucleosome. In the example shown, two adjacent domains that are part of
the NURF (Nucleosome Remodeling Factor) chromatin remodeling complex
bind to the nucleosome, with the PHD domain (red) recognizing a methylated
H3 lysine 4 and another domain (a bromodomain, blue) recognizing an
acetylated H4 lysine 16. These two histone marks constitute a unique histone
modification pattern that occurs in subsets of nucleosomes in human cells.
Here the two histone tails are indicated by green dotted lines, and only half
of one nucleosome is shown. (Adapted from A.J. Ruthenburg et al., Cell
145:692-706, 2011. With permission from Elsevier.)

Figure 4-38 Schematic diagram showing how a particular combination of histone
modifications can be recognized by a reader complex. A large protein complex
that contains a series of protein modules, each of which recognizes a specific histone
mark, is schematically illustrated (green). This “reader complex” will bind tightly only
to a region of chromatin that contains several of the different histone marks that
it recognizes. Therefore, only a specific combination of marks will cause the
complex to bind to chromatin and attract the additional protein complexes (purple)
needed to catalyze a biological function.
[Figure label: attachment to other components in nucleus, leading to gene expression,
gene silencing, or other biological function]

Figure 4-39 Some specific meanings of histone modifications. (A) The
modifications on the histone H3 N-terminal tail are shown, repeated from Figure
4-34. (B) The H3 tail can be marked by different sets of modifications that act in
combination to convey a specific meaning. Only a small number of the meanings
are known, including the three examples shown. Not illustrated is the fact that, as
just implied (see Figure 4-38), reading a histone mark generally involves the joint
recognition of marks at other sites on the nucleosome along with the indicated H3
tail recognition. In addition, specific levels of methylation (mono-, di-, or trimethyl
groups) are generally required. Thus, for example, the trimethylation of lysine
9 attracts the heterochromatin-specific protein HP1, which induces a spreading
wave of further lysine 9 trimethylation followed by further HP1 binding, according
to the general scheme that will be illustrated shortly (see Figure 4-40). Also
important in this process, however, is a synergistic trimethylation of the histone H4
N-terminal tail on lysine 20.

Figure 4-40 How the recruitment of a reader-writer complex can
spread chromatin changes along a chromosome. The writer is an enzyme
that creates a specific modification on one or more of the four nucleosomal histones.
After its recruitment to a specific site on a chromosome by a transcription regulatory
protein, the writer collaborates with a reader protein to spread its mark from
nucleosome to nucleosome by means of the indicated reader-writer complex. For
this mechanism to work, the reader must recognize the same histone modification
mark that the writer produces; its binding to that mark can be shown to activate
the writer. In this schematic example, a spreading wave of chromatin condensation
is thereby induced. Not shown are the additional proteins involved, including an
ATP-dependent chromatin remodeling complex required to reposition the modified
nucleosomes.

The enzymes that add or remove modifications to histones in nucleosomes
are part of multisubunit complexes. They can initially be brought to a
particular region of chromatin by one of the sequence-specific DNA-binding proteins
(transcription regulators) discussed in Chapters 6 and 7 (for a specific example,
see Figure 7-20). But after a modifying enzyme “writes” its mark on one or a few
neighboring nucleosomes, events that resemble a chain reaction can ensue. In
such a case, the “writer enzyme” works in concert with a “reader protein” located
in the same protein complex. The reader protein contains a module that
recognizes the mark and binds tightly to the newly modified nucleosome (see Figure
4-36), activating an attached writer enzyme and positioning it near an adjacent
nucleosome. Through many such read-write cycles, the reader protein can carry
the writer enzyme along the DNA, spreading the mark in a hand-over-hand
manner along the chromosome (Figure 4-40).
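
The read-write cycle described above lends itself to a toy simulation. The sketch below treats a chromosome as a row of nucleosomes and assumes, as a deliberate simplification, that in each cycle every marked nucleosome causes its immediate neighbors to be marked. The function name and the one-neighbor-per-cycle rule are illustrative assumptions, not a measured mechanism.

```python
def spread_mark(marked, cycles):
    """Toy read-write spreading along a row of nucleosomes.

    marked: list of bools, True where a nucleosome already carries the mark.
    cycles: number of read-write cycles to simulate.

    In each cycle, a reader bound to a marked nucleosome positions its
    attached writer next to the adjacent nucleosomes, which then gain the mark.
    """
    state = list(marked)
    for _ in range(cycles):
        nxt = list(state)
        for i, has_mark in enumerate(state):
            if has_mark:
                if i > 0:
                    nxt[i - 1] = True
                if i + 1 < len(state):
                    nxt[i + 1] = True
        state = nxt
    return state


if __name__ == "__main__":
    # One nucleosome (index 5 of 11) is marked by a writer recruited there.
    start = [False] * 5 + [True] + [False] * 5
    for n in range(4):
        row = spread_mark(start, n)
        print(n, "".join("M" if m else "." for m in row))
```

Each printed row shows the marked domain growing by one nucleosome on each side per cycle, a cartoon of the spreading wave in Figure 4-40.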

In reality, the process is more complicated than the scheme just described.
Both readers and writers are part of a protein complex that is likely to contain
multiple readers and writers, and to require multiple marks on the nucleosome to
spread. Moreover, many of these reader-writer complexes also contain an
ATP-dependent chromatin remodeling protein (see Figure 4-26C), and the reader, writer,
and remodeling proteins can work in concert to either decondense or condense
long stretches of chromatin as the reader moves progressively along the
nucleosome-packaged DNA.

A similar process is used to remove histone modifications from specific regions
of the DNA; in this case, an “eraser enzyme,” such as a histone demethylase or
histone deacetylase, is recruited to the complex. As for the writer complex in Figure
4-40, sequence-specific DNA-binding proteins (transcription regulators) direct
where such modifications occur (discussed in Chapter 7).

Some idea of the complexity of the above processes can be derived from the
results of genetic screens for genes that either enhance or suppress the spreading
and stability of heterochromatin, as manifest in effects on position effect
variegation in Drosophila (see Figure 4-32). As pointed out previously, more than 100
such genes are known, and most of them are likely to code for subunits in one or
more reader-writer-remodeling protein complexes.

Barrier DNA Sequences Block the Spread of Reader-Writer
Complexes and thereby Separate Neighboring Chromatin
Domains

The above mechanism for spreading chromatin structures raises a potential
problem. Inasmuch as each chromosome contains one continuous, very long DNA
molecule, what prevents a cacophony of confusing cross-talk between adjacent
chromatin domains of different structure and function? Early studies of position
effect variegation had suggested an answer: certain DNA sequences mark the
boundaries of chromatin domains and separate one such domain from another
(see Figure 4-31). Several such barrier sequences have now been identified and
characterized through the use of genetic engineering techniques that allow
specific DNA segments to be deleted from, or inserted in, chromosomes.

For example, in cells that are destined to give rise to red blood cells, a sequence
called HS4 normally separates the active chromatin domain that contains the
human β-globin locus from an adjacent region of silenced, condensed chromatin.
If this sequence is deleted, the β-globin locus is invaded by condensed chromatin.
This chromatin silences the genes it covers, and it spreads to a different extent in
different cells, causing position effect variegation similar to that observed in
Drosophila. As described in Chapter 7, the consequences are dire: the globin genes
are poorly expressed, and individuals who carry such a deletion have a severe
form of anemia.

In genetic engineering experiments, the HS4 sequence is often added to both
ends of a gene that is to be inserted into a mammalian genome, in order to protect
that gene from the silencing caused by spreading heterochromatin. Analysis of
this barrier sequence reveals that it contains a cluster of binding sites for histone
acetylase enzymes. Since the acetylation of a lysine side chain is incompatible
with the methylation of the same side chain, and specific lysine methylations are
required to spread heterochromatin, histone acetylases are logical candidates for
the formation of DNA barriers to spreading (Figure 4-41). However, several other
types of chromatin modifications are known that can also protect genes from
silencing.
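
The effect of a barrier sequence can be illustrated with a toy model of spreading. In the sketch below, a mark spreads one nucleosome per cycle in both directions from a seed, but positions designated as barriers can never acquire the mark, as might happen where histone acetylases keep the relevant lysines acetylated. All names, positions, and the spreading rule are hypothetical simplifications.

```python
def spread_with_barriers(num_nucleosomes, seed, barriers, cycles):
    """Toy model: a heterochromatin mark spreads from `seed` to adjacent
    nucleosomes, one step per cycle, but never onto a barrier position.

    Returns a list of bools, True where the mark is present at the end.
    """
    barrier_set = set(barriers)
    if seed in barrier_set:
        raise ValueError("seed position cannot be a barrier")
    marked = [False] * num_nucleosomes
    marked[seed] = True
    for _ in range(cycles):
        nxt = list(marked)
        for i in range(num_nucleosomes):
            if marked[i]:
                for j in (i - 1, i + 1):
                    # A barrier nucleosome cannot be marked, so the wave stops there.
                    if 0 <= j < num_nucleosomes and j not in barrier_set:
                        nxt[j] = True
        marked = nxt
    return marked


if __name__ == "__main__":
    with_barrier = spread_with_barriers(10, seed=2, barriers=[5], cycles=10)
    without_barrier = spread_with_barriers(10, seed=2, barriers=[], cycles=10)
    print("barrier at 5:", "".join("M" if m else "." for m in with_barrier))
    print("no barrier:  ", "".join("M" if m else "." for m in without_barrier))
```

With the barrier in place, positions beyond it stay unmarked however many cycles run; removing it lets the mark cover the whole row, which parallels the invasion of the β-globin locus when HS4 is deleted.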



Figure 4-41 Some mechanisms of barrier action. These models are derived
from experimental analyses of barrier action, and a combination of several of
them may function at any one site. (A) The tethering of a region of chromatin to
a large fixed site, such as the nuclear pore complex illustrated here, can form a barrier
that stops the spread of heterochromatin. (B) The tight binding of barrier proteins to
a group of nucleosomes can make this chromatin resistant to heterochromatin
spreading. (C) By recruiting a group of highly active histone-modifying enzymes,
barriers can erase the histone marks that are required for heterochromatin to spread.
For example, a potent acetylation of lysine 9 on histone H3 will compete with lysine 9
methylation, thereby preventing the binding of the HP1 protein needed to form a major
form of heterochromatin. (Based on A.G. West and P. Fraser, Hum. Mol. Genet.
14:R101-R111, 2005. With permission from Oxford University Press.)

Figure 4-42 A model for the structure of a simple centromere. (A) In the yeast
Saccharomyces cerevisiae, a special centromeric DNA sequence assembles a
single nucleosome in which two copies of an H3 variant histone (called CENP-A in
most organisms) replace the normal H3. (B) How peptide sequences unique to
this variant histone (see Figure 4-35) help to assemble additional proteins, some
of which form a kinetochore. The yeast kinetochore is unusual in capturing only
a single microtubule; humans have much larger centromeres and form kinetochores
that can capture 20 or more microtubules (see Figure 4-43). The kinetochore is
discussed in detail in Chapter 17. (Adapted from A. Joglekar et al., Nat. Cell Biol.
8:581-585, 2006. With permission from Macmillan Publishers Ltd.)


The Chromatin in Centromeres Reveals How Histone Variants Can
Create Special Structures

Nucleosomes carrying histone variants have a distinctive character and are
thought to be able to produce marks in chromatin that are unusually long-lasting.
An important example is seen in the formation and inheritance of the specialized
chromatin structure at the centromere, the region of each chromosome required
for attachment to the mitotic spindle and orderly segregation of the duplicated
copies of the genome into daughter cells each time a cell divides. In many
complex organisms, including humans, each centromere is embedded in a stretch of
special centromeric chromatin that persists throughout interphase, even though
the centromere-mediated attachment to the spindle and movement of DNA occur
only during mitosis. This chromatin contains a centromere-specific variant H3
histone, known as CENP-A (Centromere Protein-A; see Figure 4-35), plus
additional proteins that pack the nucleosomes into particularly dense arrangements
and form the kinetochore, the special structure required for attachment of the
mitotic spindle (see Figure 4-19).

A specific DNA sequence of approximately 125 nucleotide pairs is sufficient to
serve as a centromere in the yeast S. cerevisiae. Despite its small size, more than
a dozen different proteins assemble on this DNA sequence; the proteins include
the CENP-A histone H3 variant, which, along with the three other core histones,
forms a centromere-specific nucleosome. The additional proteins at the yeast
centromere attach this nucleosome to a single microtubule from the yeast mitotic
spindle (Figure 4-42).

The centromeres in more complex organisms are considerably larger than
those in budding yeasts. For example, fly and human centromeres extend over
hundreds of thousands of nucleotide pairs and, while they contain CENP-A, they
do not seem to contain a centromere-specific DNA sequence. These centromeres
largely consist of short, repeated DNA sequences, known as alpha satellite DNA
in humans. But the same repeat sequences are also found at other
(non-centromeric) positions on chromosomes, indicating that they are not sufficient to direct
centromere formation. Most strikingly, in some unusual cases, new human
centromeres (called neocentromeres) have been observed to form spontaneously on
fragmented chromosomes. Some of these new positions were originally
euchromatic and lack alpha satellite DNA altogether (Figure 4-43). It seems that
centromeres in complex organisms are defined by an assembly of proteins, rather
than by a specific DNA sequence.

Figure 4-43 Evidence for the plasticity of human centromere formation. (A) A series of A-T-rich alpha satellite DNA
sequences is repeated many thousands of times at each human centromere (red), and is surrounded by pericentric
heterochromatin (brown). However, due to an ancient chromosome breakage-and-rejoining event, some human chromosomes
contain two blocks of alpha satellite DNA, each of which presumably functioned as a centromere in its original chromosome.
Usually, chromosomes with two functional centromeres are not stably propagated because they attach improperly to the
spindle and are broken apart during mitosis. In chromosomes that do survive, however, one of the centromeres has somehow
become inactivated, even though it contains all the necessary DNA sequences. This allows the chromosome to be stably
propagated. (B) In a small fraction (1/2000) of human births, extra chromosomes are observed in cells of the offspring. Some of
these extra chromosomes, which have formed from a breakage event, lack alpha satellite DNA altogether, yet new centromeres
(neocentromeres) have arisen from what was originally euchromatic DNA.

The complexity of centromeric chromatin is not illustrated in these diagrams. The alpha satellite DNA that forms centromeric
chromatin in humans is packaged into alternating blocks of chromatin. One block is formed from a long string of nucleosomes
containing the CENP-A H3 variant histone; the other block contains nucleosomes that are specially marked with dimethyl lysine
4 on the normal H3 histone. Each block is more than a thousand nucleosomes long. This centromeric chromatin is flanked by
pericentric heterochromatin, as shown. The pericentric chromatin contains methylated lysine 9 on its H3 histones, along with
HP1 protein, and it is an example of “classical” heterochromatin (see Figure 4-39).

Inactivation of some centromeres and genesis of others de novo seem to have
played an essential part in evolution. Different species, even when quite closely
related, often have different numbers of chromosomes; see Figure 4-14 for an
extreme example. As we shall discuss below, detailed genome comparisons show
that in many cases the changes in chromosome numbers have arisen through
chromosome breakage-and-rejoining events, creating novel chromosomes, some
of which must initially have contained abnormal numbers of centromeres, either
more than one, or none at all. Yet stable inheritance requires that each
chromosome should contain one centromere, and one only. It seems that surplus
centromeres must have been inactivated, and/or new centromeres created, so as to
allow the rearranged chromosome sets to be stably maintained.

Some Chromatin Structures Can Be Directly Inherited 

The changes in centromere activity just discussed, once established, need to be
perpetuated through subsequent cell generations. What could be the mechanism
of this type of epigenetic inheritance?

It has been proposed that de novo centromere formation requires an initial
seeding event, involving the formation of a specialized DNA-protein structure that
contains nucleosomes formed with the CENP-A variant of histone H3. In humans,
this seeding event happens more readily on arrays of alpha satellite DNA than
on other DNA sequences. The H3-H4 tetramers from each nucleosome on the
parental DNA helix are directly inherited by the sister DNA helices at a replication
fork (see Figure 5-32). Therefore, once a set of CENP-A-containing nucleosomes
has been assembled on a stretch of DNA, it is easy to understand how a new
centromere could be generated in the same place on both daughter chromosomes
following each round of cell division. One need only assume that the presence of
the CENP-A histone in an inherited nucleosome selectively recruits more CENP-A
histone to its newly formed neighbors.

Figure 4-44 How the packaging of DNA in chromatin can be inherited
following chromosome replication. In this model, some of the specialized
chromatin components are distributed to each sister chromosome after DNA
duplication, along with the specially marked nucleosomes that they bind. After DNA
replication, the inherited nucleosomes that are specially modified, acting in concert
with the inherited chromatin components, change the pattern of histone modification
on the newly formed nucleosomes nearby. This creates new binding sites for the
same chromatin components, which then assemble to complete the structure. The
latter process is likely to involve reader-writer-remodeling complexes operating in a
manner similar to that previously illustrated in Figure 4-40.

There are some striking similarities between the formation and maintenance
of centromeres and the formation and maintenance of some other regions of
heterochromatin. In particular, the entire centromere forms as an all-or-none
entity, suggesting that the creation of centromeric chromatin is a highly
cooperative process, spreading out from an initial seed in a manner reminiscent of
the phenomenon of position effect variegation that we discussed earlier. In both
cases, a particular chromatin structure, once formed, seems to be directly
inherited on the DNA following each round of chromosome replication. A cooperative
recruitment of proteins, along with the action of reader-writer complexes, can
thus account not only for the spreading of specific forms of chromatin in space
along the chromosome, but also for its propagation across cell generations, from
parent cell to daughter cell (Figure 4-44).

Experiments with Frog Embryos Suggest that both Activating and
Repressive Chromatin Structures Can Be Inherited Epigenetically

Epigenetic inheritance plays a central part in the creation of multicellular
organisms. Their differentiated cell types become established during development, and
persist thereafter even through repeated cell-division cycles. The daughters of a
liver cell persist as liver cells, those of an epidermal cell as epidermal cells, and so
on, even though they all contain the same genome; and this is because distinctive
patterns of gene expression are passed on faithfully from parent cell to daughter
cell. Chromatin structure has a role in this epigenetic transmission of information
from one cell generation to the next.

One type of evidence comes from studies in which the nucleus of a cell from
a frog or tadpole is transplanted into a frog egg whose own nucleus has been
removed (an enucleated egg). In a classic set of experiments performed in 1968,
it was shown that a nucleus taken from a differentiated donor cell can be
reprogrammed in this way to support development of a whole new tadpole (see Figure
7-2). But this reprogramming occurs only with difficulty, and it becomes less and
less efficient as nuclei from older animals are used. Thus, for example, less than
2% of the enucleated eggs injected with a nucleus from a tadpole epithelial cell
developed to the swimming tadpole stage, compared with 35% when the donor
nuclei were taken instead from an early (gastrula-stage) embryo. With new
experimental tools, the cause of this resistance to reprogramming can now be traced.
It arises, at least in part, because specific chromatin structures in the original
differentiated nucleus tend to persist and be transmitted through the many
cell-division cycles required for embryonic development. In experiments with Xenopus
embryos, specific forms of either repressive or active chromatin structures could
be demonstrated to persist through as many as 24 cell divisions, causing the
misplaced expression of genes. Figure 4-45 briefly describes one such experiment,
focused on chromatin containing the histone variant H3.3. We shall return to
these phenomena in the final section of Chapter 22, where we discuss stem cells
and the ways in which one cell type can be converted into another.


Figure 4-45 Evidence for the inheritance of a gene-activating chromatin state.
The well-characterized MyoD gene encodes a master transcription regulatory
protein for muscle, MyoD (see p. 399). This gene is normally turned on in the indicated
region of the young embryo where somites form. When a nucleus from this region is
injected into an enucleated egg as shown, many of the progeny cell nuclei abnormally
express the MyoD protein in non-muscle regions of the “nuclear transplant embryo”
that forms. This abnormal expression can be attributed to maintenance of the MyoD
promoter region in its active chromatin state through the many cycles of cell
division that produce the blastula-stage embryo, a so-called “epigenetic memory”
that persists in this case in the absence of transcription. The active chromatin
surrounding the MyoD promoter contains the variant histone H3.3 (see Figure 4-35)
in a Lys4 methylated form. As indicated, an overproduction of this histone caused
by injecting excess mRNA encoding the normal H3.3 protein increases both H3.3
occupancy on the MyoD promoter and the epigenetic MyoD production, whereas
injection of an mRNA producing a mutant form of H3.3 that cannot be methylated
at Lys4 reduces the epigenetic MyoD production. Such experiments provide
evidence that an inherited chromatin state underlies the epigenetic memory observed.
(Adapted from R.K. Ng and J.B. Gurdon, Nat. Cell Biol. 10:102-109, 2008. With
permission from Macmillan Publishers Ltd.)


Chromatin Structures Are Important for Eukaryotic Chromosome 
Function 

Although a great deal remains to be learned about the functions of different
chromatin structures, the packaging of DNA into nucleosomes was probably crucial
for the evolution of eukaryotes like ourselves. To form a complex multicellular
organism, the cells in different lineages must specialize by changing the
accessibility and activity of many hundreds of genes. As described in Chapter 21, this
process depends on cell memory: each cell holds a record of its past
developmental history in the regulatory circuits that control its many genes. That record, it
seems, is partly stored in the structure of the chromatin.

Although bacteria also have cell memory mechanisms, the complexity of the
memory circuits in higher eukaryotes is unparalleled. Strategies based on local
variations in chromatin structure, unique to eukaryotes, can enable individual
genes, once they are switched on or switched off, to stay in that state until some
new factor acts to reverse it. At one extreme are structures like centromeric
chromatin that, once established, are stably inherited from one cell generation to the
next. Likewise, the major “classical” type of heterochromatin, which contains long
arrays of the HP1 protein (see Figure 4-39), can persist stably throughout life. In
contrast, a form of condensed chromatin that is created by the Polycomb group of
proteins serves to silence genes that must be kept inactive in some conditions, but
are active in others. The latter mechanism governs the expression of a large
number of genes that encode transcription regulators important in early embryonic
development, as discussed in Chapter 21. There are many other variant forms of
chromatin, some with much shorter lifetimes, often less than the division time of
the cell. We shall say more about the variety of chromatin types in the next section.

Summary

In the chromosomes of eukaryotes, DNA is uniformly assembled into nucleosomes,
but a variety of different chromatin structures is possible. This variety is based on a
large set of reversible covalent modifications of the four histones in the nucleosome
core. These modifications include the mono-, di-, and trimethylation of many
different lysine side chains, an important reaction that is incompatible with the
acetylation that can occur on the same lysines. Specific combinations of the modifications
mark many nucleosomes, governing their interactions with other proteins. These
marks are read when protein modules that are part of a larger protein complex
bind to the modified nucleosomes in a region of chromatin. These reader proteins
then attract additional proteins that perform various functions.

Some reader protein complexes contain a histone-modifying enzyme, such as a
histone lysine methylase, that “writes” the same mark that the reader recognizes. A
reader-writer-remodeling complex of this type can spread a specific form of
chromatin along a chromosome. In particular, large regions of condensed
heterochromatin are thought to be formed in this way. Heterochromatin is commonly found
around centromeres and near telomeres, but it is also present at many other
positions in chromosomes. The tight packaging of DNA into heterochromatin usually
silences the genes within it.

The phenomenon of position effect variegation provides strong evidence for the
inheritance of condensed states of chromatin from one cell generation to the next. A
similar mechanism appears to be responsible for maintaining the specialized
chromatin at centromeres. More generally, the ability to propagate specific chromatin
structures across cell generations makes possible an epigenetic cell memory process
that plays a role in maintaining the set of different cell states required by complex
multicellular organisms.

THE GLOBAL STRUCTURE OF CHROMOSOMES 

Having discussed the DNA and protein molecules from which the chromatin fiber
is made, we now turn to the organization of the chromosome on a more global
scale and the way in which its various domains are arranged in space. As a 30-nm
fiber, a typical human chromosome would still be 0.1 cm in length and able to
span the nucleus more than 100 times. Clearly, there must be a still higher level
of folding, even in interphase chromosomes. Although the molecular details are
still largely a mystery, this higher-order packaging almost certainly involves the
folding of the chromatin into a series of loops and coils. This chromatin packing is
fluid, frequently changing in response to the needs of the cell.
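
The length estimate in this paragraph can be checked with simple arithmetic. The
sketch below is only illustrative: the nuclear diameter of 6 µm is an assumed value
that is not stated in this section, and the function name is hypothetical.

```python
# Rough check of how many times a chromosome packed as a 30-nm fiber
# would span the nucleus.

FIBER_LENGTH_CM = 0.1        # from the text: chromosome length as a 30-nm fiber
NUCLEUS_DIAMETER_UM = 6.0    # ASSUMPTION: illustrative nuclear diameter
UM_PER_CM = 10_000           # 1 cm = 10,000 micrometers


def nucleus_spans(fiber_length_cm: float, nucleus_diameter_um: float) -> float:
    """Return the fiber length expressed in nuclear diameters."""
    fiber_length_um = fiber_length_cm * UM_PER_CM
    return fiber_length_um / nucleus_diameter_um


spans = nucleus_spans(FIBER_LENGTH_CM, NUCLEUS_DIAMETER_UM)
print(f"The fiber would span the nucleus about {spans:.0f} times")
```

With any plausible nuclear diameter below 10 µm the result exceeds 100, which is
why a further level of folding must exist.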

We begin this section by describing some unusual interphase chromosomes
that can be easily visualized. Exceptional though they are, these special cases
reveal features that are thought to be representative of all interphase chromosomes.
Moreover, they provide ways to investigate some fundamental aspects of
chromatin structure that we have touched on in the previous section. Next, we
describe how a typical interphase chromosome is arranged in the mammalian
cell nucleus. Finally, we shall discuss the additional tenfold compaction that
chromosomes undergo in the passage from interphase to mitosis.

Chromosomes Are Folded into Large Loops of Chromatin 

Insight into the structure of the chromosomes in interphase cells has come from
studies of the stiff and enormously extended chromosomes in growing amphibian
oocytes (immature eggs). These very unusual lampbrush chromosomes (the
largest chromosomes known), paired in preparation for meiosis, are clearly visible
even in the light microscope, where they are seen to be organized into a series
of large chromatin loops emanating from a linear chromosomal axis (Figure 4-46
and Figure 4-47).

In these chromosomes, a given loop always contains the same DNA sequence
that remains extended in the same manner as the oocyte grows. These chromosomes
are producing large amounts of RNA for the oocyte, and most of the genes





Figure 4-46 A model for the chromatin 
domains in a lampbrush chromosome. 

Shown is a small portion of one pair of 
sister chromatids. Here, two identical DNA 
double helices are aligned side by side, 
packaged into different types of chromatin. 
The set of lampbrush chromosomes 
in many amphibians contains a total of 
about 10,000 loops resembling those 
shown here. The rest of the DNA in each 
chromosome (the great majority) remains 
highly condensed. Four copies of each 
loop are present in the cell, since each
lampbrush chromosome consists of two 
aligned sets of paired chromatids. This 
four-stranded structure is characteristic of 
this stage of development of the oocyte, 
which has arrested at the diplotene stage 
of meiosis; see Figure 17-56. 
present in the DNA loops are being actively expressed. The majority of the DNA,
however, is not in loops but remains highly condensed on the chromosome axis,
where genes are generally not expressed.

It is thought that the interphase chromosomes of all eukaryotes are similarly
arranged in loops. Although these loops are normally too small and fragile to be
easily observed in a light microscope, other methods can be used to infer their
presence. For example, modern DNA technologies have made it possible to assess
the frequency with which any two loci along an interphase chromosome are held
together, thus revealing likely candidates for the sites on chromatin that form the
bases of loop structures (Figure 4-48). These experiments and others suggest that
the DNA in human chromosomes is likely to be organized into loops of various
lengths. A typical loop might contain between 50,000 and 200,000 nucleotide
pairs of DNA, although loops of a million nucleotide pairs have also been suggested
(Figure 4-49).
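
The idea of assessing how often two loci are held together can be sketched as a
simple counting exercise. The example below is a minimal illustration, not the
analysis used in real experiments: the locus names, the list of ligated pairs,
and the threshold are all hypothetical, and real data would need normalization
for biases such as fragment length and genomic distance.

```python
from collections import Counter


def contact_frequencies(ligation_pairs):
    """Count how often each unordered pair of loci was found ligated together."""
    counts = Counter()
    for locus_a, locus_b in ligation_pairs:
        if locus_a == locus_b:
            continue  # a fragment joined to itself says nothing about contacts
        counts[tuple(sorted((locus_a, locus_b)))] += 1
    return counts


def candidate_loop_bases(counts, min_count):
    """Return locus pairs seen together at least min_count times."""
    return sorted(pair for pair, n in counts.items() if n >= min_count)


# HYPOTHETICAL observations: each tuple is one ligation product joining two loci.
pairs = [("A", "D"), ("D", "A"), ("A", "D"), ("B", "C"), ("A", "B"), ("C", "C")]

counts = contact_frequencies(pairs)
print(candidate_loop_bases(counts, min_count=3))
```

Pairs of loci that are far apart along the DNA yet are recovered together
unusually often are the candidates for sites at the base of a loop.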

Polytene Chromosomes Are Uniquely Useful for Visualizing 
Chromatin Structures 

Further insight has come from another unusual class of cells: the polytene cells of
flies, such as the fruit fly Drosophila. Some types of cells, in many organisms, grow
abnormally large through multiple cycles of DNA synthesis without cell division.
Such cells, containing increased numbers of standard chromosomes, are said to
be polyploid. In the salivary glands of fly larvae, this process is taken to an extreme
degree, creating huge cells that contain hundreds or thousands of copies of the




Figure 4-47 Lampbrush chromosomes. 

(A) A light micrograph of lampbrush 
chromosomes in an amphibian oocyte. 
Early in oocyte differentiation, each
chromosome replicates to begin 
meiosis, and the homologous replicated 
chromosomes pair to form this highly 
extended structure containing a total of 
four replicated DNA double helices, or 
chromatids. The lampbrush chromosome 
stage persists for months or years, while 
the oocyte builds up a supply of materials 
required for its ultimate development into 
a new individual. (B) An enlarged region
of a similar chromosome, stained with a 
fluorescent reagent that makes the loops 
active in RNA synthesis clearly visible. 
(Courtesy of Joseph G. Gall.) 
genome. Moreover, in this case, all the copies of each chromosome are aligned
side by side in exact register, like drinking straws in a box, to create giant polytene
chromosomes. These allow features to be detected that are thought to be shared
with ordinary interphase chromosomes, but are normally hard to see.

When polytene chromosomes from a fly's salivary glands are viewed in the
light microscope, distinct alternating dark bands and light interbands are visible
(Figure 4-50), each formed from a thousand identical DNA sequences arranged
side by side in register. About 95% of the DNA in polytene chromosomes is in
bands, and 5% is in interbands. A very thin band can contain 3000 nucleotide
pairs, while a thick band may contain 200,000 nucleotide pairs in each of its
chromatin strands. The chromatin in each band appears dark because the DNA is more
condensed than the DNA in interbands; it may also contain a higher concentration
of proteins (Figure 4-51). This banding pattern seems to reflect the same sort
of organization detected in the amphibian lampbrush chromosomes described
earlier.

There are approximately 3700 bands and 3700 interbands in the complete set
of Drosophila polytene chromosomes. The bands can be recognized by their different
thicknesses and spacings, and each one has been given a number to generate
a chromosome “map” that has been indexed to the finished genome sequence
of this fly.
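
The figures quoted above (about 95% of the DNA in bands, 5% in interbands, and
roughly equal numbers of each) imply that an average band holds far more DNA
than an average interband. The short calculation below makes that explicit; it
uses only the numbers in the text, and the function name is hypothetical.

```python
from fractions import Fraction

# Values taken from the text.
BAND_DNA_PERCENT = 95
INTERBAND_DNA_PERCENT = 5
N_BANDS = 3700
N_INTERBANDS = 3700


def mean_share(percent_of_dna: int, n_regions: int) -> Fraction:
    """Average percentage of total DNA held by one region of the given kind."""
    return Fraction(percent_of_dna, n_regions)


band_share = mean_share(BAND_DNA_PERCENT, N_BANDS)
interband_share = mean_share(INTERBAND_DNA_PERCENT, N_INTERBANDS)
ratio = band_share / interband_share

print(f"An average band holds {ratio} times the DNA of an average interband")
```

This is an average only: as noted above, individual bands range from about 3000
to 200,000 nucleotide pairs.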

The Drosophila polytene chromosomes provide a good starting point for examining
how chromatin is organized on a large scale. In the previous section, we
saw that there are many forms of chromatin, each of which contains nucleosomes
with a different combination of modified histones. Specific sets of non-histone
proteins assemble on these nucleosomes to affect biological function in different
ways. Recruitment of some of these non-histone proteins can spread for long
distances along the DNA, imparting a similar chromatin structure to broad tracts
Figure 4-48 A method for determining
the position of loops in interphase
chromosomes. In this technique, known
as the chromosome conformation
capture (3C) method, cells are treated
with formaldehyde to create the indicated
covalent DNA-protein and DNA-DNA
cross-links. The DNA is then treated with
an enzyme (a restriction nuclease) that
chops the DNA into many pieces, cutting
at strictly defined nucleotide sequences
and forming sets of identical “cohesive
ends” (see Figure 8-28). The cohesive
ends can be made to join through their
complementary base-pairing. Importantly,
prior to the ligation step shown, the DNA
is diluted so that the fragments that have
been kept in close proximity to each other
(through cross-linking) are the ones most
likely to join. Finally, the cross-links are
reversed and the newly ligated fragments
of DNA are identified and quantified by
PCR (the polymerase chain reaction,
described in Chapter 8). From the results,
combined with DNA sequence information,
one can derive models for the interphase
conformation of chromosomes.
Figure 4-49 A model for the organization of an interphase chromosome. A section of an interphase chromosome is shown folded into a series
of looped domains, each containing perhaps 50,000-200,000 or more nucleotide pairs of double-helical DNA condensed into a chromatin fiber.
The chromatin in each individual loop is further condensed through poorly understood folding processes that are reversed when the cell requires 
direct access to the DNA packaged in the loop. Neither the composition of the postulated chromosomal axis nor how the folded chromatin fiber is 
anchored to it is clear. However, in mitotic chromosomes, the bases of the chromosomal loops are enriched both in condensins (discussed below) 
and in DNA topoisomerase II enzymes (discussed in Chapter 5), two proteins that may form much of the axis at metaphase. 
of the genome (see Figure 4-40). Such regions, where all of the chromatin has
a similar structure, are separated from neighboring domains by barrier proteins
(see Figure 4-41). At low resolution, the interphase chromosome can therefore
be considered as a mosaic of chromatin structures, each containing particular
nucleosome modifications associated with a particular set of non-histone proteins.
Polytene chromosomes allow us to see details of this mosaic of domains in
the light microscope, as well as to observe some of the changes associated with
gene expression.


Figure 4-50 The entire set of polytene
chromosomes in one Drosophila salivary
cell. In this drawing of a light micrograph,
the giant chromosomes have been
spread out for viewing by squashing them
against a microscope slide. Drosophila
has four chromosomes, and there are four
different chromosome pairs present. But
each chromosome is tightly paired with
its homolog (so that each pair appears
as a single structure), which is not true
in most nuclei (except in meiosis). Each
chromosome has undergone multiple
rounds of replication, and the homologs
and all their duplicates have remained in
exact register with each other, resulting
in huge chromatin cables many DNA
strands thick.

The four polytene chromosomes
are normally linked together by
heterochromatic regions near their
centromeres that aggregate to create
a single large chromocenter (pink
region). In this preparation, however, the
chromocenter has been split into two
halves by the squashing procedure used.
(Adapted from T.S. Painter, J. Hered. 
25:465-476, 1934. With permission from 
Oxford University Press.) 


There Are Multiple Forms of Chromatin 

By staining Drosophila polytene chromosomes with antibodies, or by using a
more recent technique called ChIP (chromatin immunoprecipitation) analysis
(see Chapter 8), the locations of the histone and non-histone proteins in chromatin
can be mapped across the entire DNA sequence of an organism's genome.
Such an analysis in Drosophila has thus far localized more than 50 different chromatin
proteins and histone modifications. The results suggest that three major
types of repressive chromatin predominate in this organism, along with two major
types of chromatin on actively transcribed genes, and that each type is associated
with a different complex of non-histone proteins. Thus, classical heterochromatin
contains more than six such proteins, including heterochromatin protein 1 (HP1),
Figure 4-51 Micrographs of polytene
chromosomes from Drosophila salivary
glands. (A) Light micrograph of a portion of
a chromosome. The DNA has been stained
with a fluorescent dye, but a reverse image
is presented here that renders the DNA
black rather than white; the bands are
clearly seen to be regions of increased
DNA concentration. This chromosome
has been processed by a high-pressure
treatment so as to show its distinct pattern
of bands and interbands more clearly.
(B) An electron micrograph of a small
section of a Drosophila polytene 
chromosome seen in thin section. Bands 
of very different thickness can be readily 
distinguished, separated by interbands, 
which contain less condensed chromatin. 
(A, adapted from D.V. Novikov, I. Kireev 
and A.S. Belmont, Nat. Methods 4:483- 
485, 2007. With permission from 
Macmillan Publishers Ltd; B, courtesy 
of Veikko Sorsa.) 
Figure 4-52 RNA synthesis in polytene chromosome puffs.
An autoradiograph of a single puff in a polytene chromosome from the
salivary glands of the freshwater midge Chironomus tentans. As outlined
in Chapter 1 and described in detail in Chapter 6, the first step in gene
expression is the synthesis of an RNA molecule using the DNA as a template.
The decondensed portion of the chromosome is undergoing RNA synthesis
and has become labeled with ³H-uridine, an RNA precursor molecule that is
incorporated into growing RNA chains. (Courtesy of José Bonner.)


whereas the so-called Polycomb form of heterochromatin contains a similar
number of proteins of a different set (PcG proteins). In addition to the five major
chromatin types, other more minor forms of chromatin appear to be present, each
of which may be differently regulated and have distinct roles in the cell.

The set of proteins bound as part of the chromatin at a given locus varies
depending on the cell type and its stage of development. These variations make
the accessibility of specific genes different in different tissues, helping to generate
the cell diversification that accompanies embryonic development (described in
Chapter 21).

Chromatin Loops Decondense When the Genes Within Them Are
Expressed

When an insect progresses from one developmental stage to another, distinctive
chromosome puffs arise and old puffs recede in its polytene chromosomes
as new genes become expressed and old ones are turned off (Figure 4-52). From
inspection of each puff when it is relatively small and the banding pattern is still
discernible, it seems that most puffs arise from the decondensation of a single
chromosome band.

The individual chromatin fibers that make up a puff can be visualized with
an electron microscope. In favorable cases, loops are seen, much like those
observed in amphibian lampbrush chromosomes. When genes in the loop are
not expressed, the loop assumes a thickened structure, possibly that of a folded
30-nm fiber, but when gene expression is occurring, the loop becomes more
extended. In electron micrographs, the chromatin located on either side of the
decondensed loop appears considerably more compact, suggesting that a loop
constitutes a distinct functional domain of chromatin structure.

Observations in human cells also suggest that highly folded loops of chromatin
expand to occupy an increased volume when a gene within them is expressed. For
example, quiescent chromosome regions from 0.4 to 2 million nucleotide pairs in
length appear as compact dots in an interphase nucleus when visualized by
fluorescence microscopy. However, the same DNA is seen to occupy a larger territory
when its genes are expressed, with elongated, punctate structures replacing the
original dot.

New ways of visualizing individual chromosomes have shown that each of the
46 interphase chromosomes in a human cell tends to occupy its own discrete territory
within the nucleus: that is, the chromosomes are not extensively entangled
with one another (Figure 4-53). However, pictures such as these present only
an average view of the DNA in each chromosome. Experiments that specifically
localize the heterochromatic regions of a chromosome reveal that they are often

Figure 4-53 Simultaneous visualization of the chromosome territories
for all of the human chromosomes in a single interphase nucleus. Here,
a mixture of DNA probes for each chromosome has been labeled so as to
fluoresce with a different spectrum; this allows DNA-DNA hybridization to be
used to detect each chromosome, as in Figure 4-10. Three-dimensional
reconstructions were then produced. Below the micrograph, each
chromosome is identified in a schematic of the actual image. Note that
homologous chromosomes (e.g., the two copies of chromosome 9) are not
in general co-located. (From M.R. Speicher and N.P. Carter, Nat. Rev. Genet.
6:782-792, 2005. With permission from Macmillan Publishers Ltd.)
Figure 4-54 The distribution of gene-rich regions of the human genome
in an interphase nucleus. Gene-rich regions have been visualized with a
fluorescent probe that hybridizes to the Alu interspersed repeat, which is
present in more than a million copies in the human genome (see page 292).
For unknown reasons, these sequences cluster in chromosomal regions
rich in genes. In this representation, regions enriched for the Alu sequence
are green, regions depleted for these sequences are red, while the average
regions are yellow. The gene-rich regions are seen to be largely absent in
the DNA near the nuclear envelope. (From A. Bolzer et al., PLoS Biol.
3:826-842, 2005.)


closely associated with the nuclear lamina, regardless of the chromosome examined.
And DNA probes that preferentially stain gene-rich regions of human chromosomes
produce a striking picture of the interphase nucleus that presumably
reflects different average positions for active and inactive genes (Figure 4-54).

How is most of the chromatin in each interphase chromosome condensed
when its genes are not being expressed? A powerful extension of the chromosome
conformation capture method described previously (see Figure 4-48), which
exploits a high-throughput DNA sequencing technology called massively parallel
sequencing (see Panel 8-1, pp. 478-481), allows the connections between all of
the different one-megabase (1 Mb) segments of the human genome to be mapped
in human interphase chromosomes. The results reveal that most regions of our
chromosomes are folded into a conformation referred to as a fractal globule: a
knot-free arrangement that facilitates maximally dense packing while, at the same
time, preserving the ability of the chromatin fiber to unfold and fold (Figure 4-55).
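
Mapping connections between 1 Mb segments amounts to sorting each sequenced
ligation product into a pair of bins and counting. The sketch below illustrates
that bookkeeping under simplifying assumptions: the read positions are invented,
all coordinates lie on a single toy chromosome, and the function names are
hypothetical rather than taken from any published pipeline.

```python
BIN_SIZE = 1_000_000  # 1 Mb segments, as in the text


def bin_index(position_bp: int, bin_size: int = BIN_SIZE) -> int:
    """Map a nucleotide position to the index of its 1 Mb segment."""
    return position_bp // bin_size


def contact_matrix(read_pairs, n_bins: int, bin_size: int = BIN_SIZE):
    """Build a symmetric matrix counting ligation products between segments."""
    matrix = [[0] * n_bins for _ in range(n_bins)]
    for pos_a, pos_b in read_pairs:
        i, j = bin_index(pos_a, bin_size), bin_index(pos_b, bin_size)
        matrix[i][j] += 1
        if i != j:
            matrix[j][i] += 1  # keep the matrix symmetric
    return matrix


def n_segment_pairs(n_bins: int) -> int:
    """Number of distinct pairs of segments that could be in contact."""
    return n_bins * (n_bins - 1) // 2


# HYPOTHETICAL read pairs on a toy 5 Mb chromosome (positions in nucleotide pairs).
read_pairs = [(150_000, 1_200_000), (900_000, 1_050_000),
              (2_500_000, 2_700_000), (4_100_000, 300_000)]

matrix = contact_matrix(read_pairs, n_bins=5)
for row in matrix:
    print(row)
print(n_segment_pairs(3000))
```

For a genome divided into about three thousand 1 Mb segments there are several
million possible segment pairs, which is why a high-throughput sequencing
readout is needed in place of pair-by-pair PCR.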
Chromatin Can Move to Specific Sites Within the Nucleus to Alter
Gene Expression

A variety of different types of experiments has led to the conclusion that the
position of a gene in the interior of the nucleus changes when it becomes highly
expressed. Thus, a region that becomes very actively transcribed is sometimes
found to extend out of its chromosome territory, as if in an extended loop (Figure
4-56). We will see in Chapter 6 that the initiation of transcription (the first step in
gene expression) requires the assembly of over 100 proteins, and it makes sense
that this would be facilitated in regions of the nucleus enriched in these proteins.

More generally, it is clear that the nucleus is very heterogeneous, with functionally
different regions to which portions of chromosomes can move as they are
subjected to different biochemical processes, such as when their gene expression
changes. It is this issue that we discuss next.
Figure 4-55 A fractal globule model for
interphase chromatin. An extension of
the 3C method in Figure 4-48, called Hi-C,
was used to measure the extent to which
each of the three thousand 1 Mb segments
in the human genome was located adjacent
to any other of these segments. The
results support the type of model shown.

In the enlarged fractal globule illustrated,
a region of 5 million base pairs is seen to
fold in a way that keeps regions that are
neighbors along the one-dimensional DNA
helix as neighbors in three dimensions;
this gives rise to monochromatic blocks in
this representation that are obvious both
on the surface and in cross section. The 
fractal globule is a knot-free conformation 
of the DNA that permits dense packing, yet 
retains an ability to easily fold and unfold 
any genomic locus. (Adapted from 
E. Lieberman-Aiden et al., Science 
326:289-293, 2009. With permission 
from AAAS.) 
Figure 4-56 An effect of high levels of
gene expression on the intranuclear
location of chromatin. (A) Fluorescence
micrographs of human nuclei showing
how the position of a gene changes when
it becomes highly transcribed. The region
of the chromosome adjacent to the gene
(red) is seen to leave its chromosomal
territory (green) only when it is highly
active. (B) Schematic representation of
a large loop of chromatin that expands 
when the gene is on, and contracts when 
the gene is off. Other genes that are less 
actively expressed can be shown by the 
same methods to remain inside their 
chromosomal territory when transcribed. 
(From J.R. Chubb and W.A. Bickmore, Cell 
112:403-406, 2003. With permission from 
Elsevier.) 


Networks of Macromolecules Form a Set of Distinct Biochemical
Environments inside the Nucleus

In Chapter 6, we shall describe the function of a variety of subcompartments that
are present within the nucleus. The largest and most obvious of these is the nucleolus,
a structure well known to microscopists even in the nineteenth century (see
Figure 4-9). The nucleolus is the cell's site of ribosome subunit formation, as well
as the place where many other specialized reactions occur (see Figure 6-42): it
consists of a network of RNAs and proteins concentrated around ribosomal RNA
genes that are being actively transcribed. In eukaryotes, the genome contains
multiple copies of the ribosomal RNA genes, and although they are typically clustered
together in a single nucleolus, they are often located on several separate
chromosomes.

A variety of less obvious organelles are also present inside the nucleus. For
example, spherical structures called Cajal bodies and interchromatin granule
clusters are present in most plant and animal cells (Figure 4-57). Like the nucleolus,
these organelles are composed of selected protein and RNA molecules that
bind together to create networks that are highly permeable to other protein and
RNA molecules in the surrounding nucleoplasm.

Structures such as these can create distinct biochemical environments by
immobilizing select groups of macromolecules, as can other networks of proteins
and RNA molecules associated with nuclear pores and with the nuclear envelope.
In principle, this allows other molecules that enter these spaces to be processed
with great efficiency through complex reaction pathways. Highly permeable,
fibrous networks of this sort can thereby impart many of the kinetic advantages of
compartmentalization (see p. 164) to reactions that take place in subregions of the
nucleus (Figure 4-58A). However, unlike the membrane-bound compartments in
the cytoplasm (discussed in Chapter 12), these nuclear subcompartments, lacking
a lipid bilayer membrane, can neither concentrate nor exclude specific small
molecules.

The cell has a remarkable ability to construct distinct environments to perform
complex biochemical tasks efficiently. Those that we have mentioned in the
nucleus facilitate various aspects of gene expression, and will be further discussed
in Chapter 6. These subcompartments, including the nucleolus, appear to form
Figure 4-57 Electron micrograph
showing two very common fibrous
nuclear subcompartments. The large
sphere here is a Cajal body. The smaller
darker sphere is an interchromatin granule
cluster, also known as a speckle (see
also Figure 6-46). These “subnuclear
organelles” are from the nucleus of a
Xenopus oocyte. (From K.E. Handwerger
and J.G. Gall, Trends Cell Biol. 16:19-26, 
2006. With permission from Elsevier.) 
Figure 4-58 Effective compartmentalization
without a bilayer membrane. (A) Schematic
illustration of the organization of a spherical
subnuclear organelle (left) and of a postulated
similarly organized subcompartment just
beneath the nuclear envelope (right). In
both cases, RNAs and/or proteins (gray)
associate to form highly porous, gel-like
structures that contain binding sites for other
specific proteins and RNA molecules (colored
objects). (B) How the tethering of a selected
set of proteins and RNA molecules to long
flexible polymer chains, as in (A), can create
“staging areas” that greatly speed the rates of
reactions in subcompartments of the nucleus.
The reactions catalyzed will depend on the
particular macromolecules that are localized
by the tethering. The same strategy for 
accelerating complex sets of reactions is also 
employed in subcompartments elsewhere in 
the cell (see also Figure 3-78). 


only as needed, and they create a high local concentration of the many different
enzymes and RNA molecules needed for a particular process. In an analogous
way, when DNA is damaged by irradiation, the set of enzymes needed to carry out
DNA repair are observed to congregate in discrete foci inside the nucleus, creating
"repair factories" (see Figure 5-52). And nuclei often contain hundreds of discrete
foci representing factories for DNA or RNA synthesis (see Figure 6-47).

It seems likely that all of these entities make use of the type of tethering illustrated
in Figure 4-58B, where long flexible lengths of polypeptide chain and/or
long noncoding RNA molecules are interspersed with specific binding sites that
concentrate the multiple proteins and other molecules that are needed to catalyze
a particular process. Not surprisingly, tethers are similarly used to help to speed
biological processes in the cytoplasm, increasing specific reaction rates there (for
example, see Figure 16-18).
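
Why tethering speeds reactions can be made concrete with a back-of-the-envelope
calculation: a single molecule confined to a small volume has a high effective
concentration there. The sketch below assumes, purely for illustration, that a
tether restricts one molecule to a sphere of 10 nm radius; that radius is not a
value from the text, and the function name is hypothetical.

```python
import math

AVOGADRO = 6.02214076e23  # molecules per mole


def effective_concentration_molar(tether_radius_nm: float) -> float:
    """Molar concentration of one molecule confined to a sphere of this radius."""
    radius_m = tether_radius_nm * 1e-9
    volume_m3 = (4.0 / 3.0) * math.pi * radius_m ** 3
    volume_liters = volume_m3 * 1000.0  # 1 cubic meter = 1000 liters
    return 1.0 / (AVOGADRO * volume_liters)


# ASSUMPTION: a 10 nm confinement radius, chosen only as an example.
concentration = effective_concentration_molar(10.0)
print(f"Effective concentration: {concentration * 1e3:.2f} mM")
```

Under this assumption a single tethered molecule is present at a few tenths of
a millimolar, and because the volume scales with the cube of the radius, halving
the radius raises the effective concentration eightfold.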

Is there also an intranuclear framework, analogous to the cytoskeleton, on
which chromosomes and other components of the nucleus are organized? The
nuclear matrix, or scaffold, has been defined as the insoluble material left in the
nucleus after a series of biochemical extraction steps. Many of the proteins and
RNA molecules that form this insoluble material are likely to be derived from the
fibrous subcompartments of the nucleus just discussed, while others may be proteins
that help to form the base of chromosomal loops or to attach chromosomes
to other structures in the nucleus.

Mitotic Chromosomes Are Especially Highly Condensed 

Having discussed the dynamic structure of interphase chromosomes, we now 
turn to mitotic chromosomes. The chromosomes from nearly all eukaryotic cells
become readily visible by light microscopy during mitosis, when they coil up to 
form highly condensed structures. This condensation reduces the length of a 
typical interphase chromosome only about tenfold, but it produces a dramatic 
change in chromosome appearance. 

Figure 4-59 depicts a typical mitotic chromosome at the metaphase stage
of mitosis (for the stages of mitosis, see Figure 17-3). The two DNA molecules
produced by DNA replication during interphase of the cell-division cycle are
separately folded to produce two sister chromosomes, or sister chromatids, held
together at their centromeres, as mentioned earlier. These chromosomes are normally
covered with a variety of molecules, including large amounts of RNA-protein
Figure 4-59 A typical mitotic
chromosome at metaphase. Each sister
chromatid contains one of two identical
sister DNA molecules generated earlier in
the cell cycle by DNA replication (see also
Figure 17-21).
Figure 4-60 A scanning electron micrograph of a region near one end
of a typical mitotic chromosome. Each knoblike projection is believed to
represent the tip of a separate looped domain. Note that the two identical
paired chromatids (drawn in Figure 4-59) can be clearly distinguished.
(From M.P. Marsden and U.K. Laemmli, Cell 17:849-858, 1979. With 
permission from Elsevier.) 


complexes. Once this covering has been stripped away, each chromatid can be
seen in electron micrographs to be organized into loops of chromatin emanating
from a central scaffolding (Figure 4-60). Experiments using DNA hybridization
to detect specific DNA sequences demonstrate that the order of visible features
along a mitotic chromosome at least roughly reflects the order of genes along the
DNA molecule. Mitotic chromosome condensation can thus be thought of as the
final level in the hierarchy of chromosome packaging (Figure 4-61).

The compaction of chromosomes during mitosis is a highly organized and
dynamic process that serves at least two important purposes. First, when condensation
is complete (in metaphase), sister chromatids have been disentangled from
each other and lie side by side. Thus, the sister chromatids can easily separate
when the mitotic apparatus begins pulling them apart. Second, the compaction
of chromosomes protects the relatively fragile DNA molecules from being broken
as they are pulled to separate daughter cells.

The condensation of interphase chromosomes into mitotic chromosomes
begins in early M phase, and it is intimately connected with the progression of
the cell cycle. During M phase, gene expression shuts down, and specific modifications
are made to histones that help to reorganize the chromatin as it compacts.
Two classes of ring-shaped proteins, called cohesins and condensins, aid
this compaction. How they help to produce the two separately folded chromatids
of a mitotic chromosome will be discussed in Chapter 17, along with the details
of the cell cycle.
Figure 4-61 Chromatin packing. This
model shows some of the many levels 
of chromatin packing postulated to give 
rise to the highly condensed mitotic 
chromosome. 
Summary

Chromosomes are generally decondensed during interphase, so that the details
of their structure are difficult to visualize. Notable exceptions are the specialized
lampbrush chromosomes of vertebrate oocytes and the polytene chromosomes in
the giant secretory cells of insects. Studies of these two types of interphase chromosomes
suggest that each long DNA molecule in a chromosome is divided into a large
number of discrete domains organized as loops of chromatin that are compacted by
further folding. When genes contained in a loop are expressed, the loop unfolds and
allows the cell's machinery access to the DNA.

Interphase chromosomes occupy discrete territories in the cell nucleus; that is,
they are not extensively intertwined. Euchromatin makes up most of interphase
chromosomes and, when not being transcribed, it probably exists as tightly folded
fibers of compacted nucleosomes. However, euchromatin is interrupted by stretches
of heterochromatin, in which the nucleosomes are subjected to additional packing
that usually renders the DNA resistant to gene expression. Heterochromatin exists in
several forms, some of which are found in large blocks in and around centromeres
and near telomeres. But heterochromatin is also present at many other positions on
chromosomes, where it can serve to help regulate developmentally important genes.

The interior of the nucléus is highly dynamic, with heterochromatin often posi- 
tioned near the nuclear envelope and loops ofchromatin moving away from their 
chromosome territory when genes are very highly expressed. This reflects the exis¬ 
tence of nuclear subcompartments, where different sets of biochemical reactions 
are facilitated by an increased concentration of selected proteins and RNAs. The 
components involved informing a subcompartment can self-assemble into discrète 
organelles such as nucleoli or Cajal bodies; they can also be tethered to fixed struc¬ 
tures such as the nuclear envelope. 

During mitosis, gene expression shuts down and ail chromosomes adopt a 
highly condensed conformation in a process that begins early in M phase to pack¬ 
age the two DNA molécules ofeach replicated chromosome as two separately folded 
chromatids. The condensation is accompanied by histone modifications thatfacil- 
itate chromatin packing, but satisfactory completion of this orderly process, which 
reduces the end-to-end distance ofeach DNA molécule from its interphase length by 
an additional factor often, requires additional proteins. 

HOW GENOMES EVOLVE 

In this final section of the chapter, we provide an overview of some of the ways
that genes and genomes have evolved over time to produce the vast diversity of
modern-day life-forms on our planet. The sequencing of the genomes of
thousands of organisms is revolutionizing our view of the process of evolution,
uncovering an astonishing wealth of information about not only family relationships
among organisms, but also about the molecular mechanisms by which evolution
has proceeded.

It is perhaps not surprising that genes with similar functions can be found in
a diverse range of living things. But the great revelation of the past 30 years has
been the extent to which the actual nucleotide sequences of many genes have
been conserved. Homologous genes—that is, genes that are similar in both their
nucleotide sequence and function because of a common ancestry—can often be
recognized across vast phylogenetic distances. Unmistakable homologs of many
human genes are present in organisms as diverse as nematode worms, fruit flies,
yeasts, and even bacteria. In many cases, the resemblance is so close that, for
example, the protein-coding portion of a yeast gene can be substituted with its
human homolog—even though humans and yeast are separated by more than a
billion years of evolutionary history.

As emphasized in Chapter 3, the recognition of sequence similarity has
become a major tool for inferring gene and protein function. Although a sequence
match does not guarantee similarity in function, it has proved to be an excellent
clue. Thus, it is often possible to predict the function of genes in humans for which
no biochemical or genetic information is available simply by comparing their
nucleotide sequences with the sequences of genes that have been characterized
in other more readily studied organisms.

In general, the sequences of individual genes are much more tightly
conserved than is overall genome structure. Features of genome organization such
as genome size, number of chromosomes, order of genes along chromosomes,
abundance and size of introns, and amount of repetitive DNA are found to differ
greatly when comparing distant organisms, as does the number of genes that each
organism contains.

Genome Comparisons Reveal Functional DNA Sequences by their 
Conservation Throughout Evolution 

A first obstacle in interpreting the sequence of the 3.2 billion nucleotide pairs in
the human genome is the fact that much of it is probably functionally
unimportant. The regions of the genome that code for the amino acid sequences of
proteins (the exons) are typically found in short segments (average size about 145
nucleotide pairs), small islands in a sea of DNA whose exact nucleotide sequence is
thought to be mostly of little consequence. This arrangement makes it difficult
to identify all the exons in a stretch of DNA, and it is often hard too to determine
exactly where a gene begins and ends.

One very important approach to deciphering our genome is to search for DNA
sequences that are closely similar between different species, on the principle
that DNA sequences that have a function are much more likely to be conserved
than those without a function. For example, humans and mice are thought to
have diverged from a common mammalian ancestor about 80 × 10⁶ years ago,
which is long enough for the majority of nucleotides in their genomes to have
been changed by random mutational events. Consequently, the only regions that
will have remained closely similar in the two genomes are those in which
mutations would have impaired function and put the animals carrying them at a
disadvantage, resulting in their elimination from the population by natural selection.
Such closely similar pieces of DNA sequence are known as conserved regions. In
addition to revealing those DNA sequences that encode functionally important
exons and RNA molecules, these conserved regions will include regulatory DNA
sequences as well as DNA sequences with functions that are not yet known. In
contrast, most nonconserved regions will reflect DNA whose sequence is much
less likely to be critical for function.
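
As a rough illustration of the idea, the sketch below slides a fixed-size window
along two already-aligned sequences and reports the windows whose fraction of
identical positions meets a threshold. The function name `conserved_windows`, the
window size, the threshold, and the toy sequences are all assumptions made for
illustration; real comparative genomics relies on multispecies alignments and
statistical models of neutral change rather than a simple identity cutoff.

```python
def conserved_windows(seq_a, seq_b, window=10, min_identity=0.9):
    """Return start positions of windows in which two aligned sequences
    are at least `min_identity` identical. Gaps ('-') never count as matches."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    hits = []
    for start in range(len(seq_a) - window + 1):
        pairs = zip(seq_a[start:start + window], seq_b[start:start + window])
        matches = sum(1 for a, b in pairs if a == b and a != "-")
        if matches / window >= min_identity:
            hits.append(start)
    return hits


# Toy alignment (hypothetical): the first 10 positions are identical,
# the last 10 have diverged completely.
human_like = "ACGTACGTAC" + "GGGGGGGGGG"
mouse_like = "ACGTACGTAC" + "TTTTTTTTTT"
print(conserved_windows(human_like, mouse_like, window=10, min_identity=1.0))
```

With a threshold of 100% identity only the window starting at position 0 is
reported; lowering the threshold admits windows that overlap the diverged stretch.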

The power of this method can be increased by including in such comparisons
the genomes of large numbers of species whose genomes have been sequenced,
such as rat, chicken, fish, dog, and chimpanzee, as well as mouse and human.
By revealing in this way the results of a very long natural “experiment,” lasting
for hundreds of millions of years, such comparative DNA sequencing studies
have highlighted the most interesting regions in our genome. The comparisons
reveal that roughly 5% of the human genome consists of “multispecies conserved
sequences.” To our great surprise, only about one-third of these sequences code
for proteins (see Table 4-1, p. 184). Many of the remaining conserved sequences
consist of DNA containing clusters of protein-binding sites that are involved in
gene regulation, while others produce RNA molecules that are not translated
into protein but are important for other known purposes. But, even in the most
intensively studied species, the function of the majority of these highly conserved
sequences remains unknown. This remarkable discovery has led scientists to
conclude that we understand much less about the cell biology of vertebrates than we
had thought. Certainly, there are enormous opportunities for new discoveries,
and we should expect many more surprises ahead.

Genome Alterations Are Caused by Failures of the Normal 
Mechanisms for Copying and Maintaining DNA, as well as by 
Transposable DNA Elements

Evolution depends on accidents and mistakes followed by nonrandom survival.
Most of the genetic changes that occur result simply from failures in the normal
mechanisms by which genomes are copied or repaired when damaged, although
the movement of transposable DNA elements (discussed below) also plays an
important part. As we will explain in Chapter 5, the mechanisms that maintain
DNA sequences are remarkably precise—but they are not perfect. DNA sequences
are inherited with such extraordinary fidelity that typically, along a given line of
descent, only about one nucleotide pair in a thousand is randomly changed in the
germ line every million years. Even so, in a population of 10,000 diploid
individuals, every possible nucleotide substitution will have been “tried out” on about
20 occasions in the course of a million years—a short span of time in relation to the
evolution of species.
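
The arithmetic behind this estimate can be checked with a short calculation.
This is a back-of-the-envelope sketch, not a population-genetics model: it
assumes the rate quoted above (one change per thousand nucleotide pairs per
million years along a line of descent), counts two genome copies per diploid
individual, and treats the population size as constant. The variable names are
chosen here for illustration.

```python
# Rate quoted in the text: about 1 nucleotide pair in 1000 changes per million years.
changes_per_site_per_myr = 1 / 1000

population_size = 10_000             # diploid individuals (from the text)
genome_copies = 2 * population_size  # each diploid individual carries two copies

# Expected number of times a given nucleotide position is changed somewhere
# in the population over one million years.
changes_per_site_in_population = genome_copies * changes_per_site_per_myr
print(changes_per_site_in_population)
```

The result is about 20 changes per nucleotide position per million years, in line
with the figure given above. Because the estimate ignores which of the three
alternative nucleotides appears at each change, it is best read as an
order-of-magnitude figure.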

Errors in DNA replication, DNA recombination, or DNA repair can lead either
to simple local changes in DNA sequence—so-called point mutations such as the
substitution of one base pair for another—or to large-scale genome
rearrangements such as deletions, duplications, inversions, and translocations of DNA
from one chromosome to another. In addition to these failures of the genetic
machinery, genomes contain mobile DNA elements that are an important source of
genomic change (see Table 5-3, p. 267). These transposable DNA elements
(transposons) are parasitic DNA sequences that can spread within the genomes they
colonize. In the process, they often disrupt the function or alter the regulation
of existing genes. On occasion, they have created altogether novel genes through
fusions between transposon sequences and segments of existing genes. Over long
periods of evolutionary time, DNA transposition events have profoundly affected
genomes, so much so that nearly half of the DNA in the human genome consists
of recognizable relics of past transposition events (Figure 4-62). Even more of our
genome is thought to have been derived from transpositions that occurred so long
ago (>10⁸ years) that the sequences can no longer be traced to transposons.


The Genome Sequences of Two Species Differ in Proportion to the 
Length of Time Since They Have Separately Evolved

The differences between the genomes of species alive today have accumulated
over more than 3 billion years. Although we lack a direct record of changes over
time, scientists can reconstruct the process of genome evolution from detailed
comparisons of the genomes of contemporary organisms.

The basic organizing framework for comparative genomics is the
phylogenetic tree. A simple example is the tree describing the divergence of humans from
the great apes (Figure 4-63). The primary support for this tree comes from
comparisons of gene or protein sequences. For example, comparisons between the
sequences of human genes or proteins and those of the great apes typically reveal
the fewest differences between human and chimpanzee and the most between
human and orangutan.

Figure 4-62 A representation of the nucleotide sequence content of the
sequenced human genome. The LINEs (long interspersed nuclear elements), SINEs
(short interspersed nuclear elements), retroviral-like elements, and DNA-only
transposons are mobile genetic elements that have multiplied in our genome by
replicating themselves and inserting the new copies in different positions. These
mobile genetic elements are discussed in Chapter 5 (see Table 5-3, p. 267). Simple
sequence repeats are short nucleotide sequences (less than 14 nucleotide pairs)
that are repeated again and again for long stretches. Segmental duplications are
large blocks of DNA sequence (1000-200,000 nucleotide pairs) that are present at
two or more locations in the genome. The most highly repeated blocks of DNA
in heterochromatin have not yet been completely sequenced; therefore about
10% of human DNA sequences are not represented in this diagram. (Data courtesy
of E. Margulies.)

For closely related organisms such as humans and chimpanzees, it is relatively
easy to reconstruct the gene sequences of the extinct, last common ancestor of the
two species (Figure 4-64). The close similarity between human and chimpanzee
genes is mainly due to the short time that has been available for the accumulation
of mutations in the two diverging lineages, rather than to functional constraints
that have kept the sequences the same. Evidence for this view comes from the
observation that the human and chimpanzee genomes are nearly identical even
where there is no functional constraint on the nucleotide sequence—such as in
the third position of “synonymous” codons (codons specifying the same amino
acid but differing in their third nucleotide).
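
The tie-breaking logic described for Figure 4-64 can be sketched in a few lines.
This is a minimal illustration of the parsimony argument, assuming three
pre-aligned sequences of equal length; the function name `infer_ancestor` and the
short example sequences are hypothetical and are not taken from the leptin gene.

```python
def infer_ancestor(human, chimp, gorilla):
    """Infer the human-chimpanzee ancestral sequence by parsimony,
    using gorilla as the outgroup that breaks ties."""
    if not (len(human) == len(chimp) == len(gorilla)):
        raise ValueError("sequences must be aligned to the same length")
    ancestor = []
    for h, c, g in zip(human, chimp, gorilla):
        if h == c:
            ancestor.append(h)    # no change needed on either lineage
        elif g in (h, c):
            ancestor.append(g)    # a single mutation explains the data
        else:
            ancestor.append("N")  # all three differ: left unresolved
    return "".join(ancestor)


# Hypothetical 9-nucleotide example with two variant positions.
print(infer_ancestor("ATGCCAGTA", "ATGCTAGTC", "ATGCTAGTA"))
```

At the first variant position the gorilla agrees with the chimpanzee, and at the
second it agrees with the human, so the inferred ancestor takes one nucleotide
from each present-day sequence.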

For much less closely related organisms, such as humans and chickens (which
have evolved separately for about 300 million years), the sequence conservation
found in genes is almost entirely due to purifying selection (that is, selection that
eliminates individuals carrying mutations that interfere with important genetic
functions), rather than to an inadequate time for mutations to occur.

Phylogenetic Trees Constructed from a Comparison of DNA 
Sequences Trace the Relationships of All Organisms

Phylogenetic trees based on molecular sequence data can be compared with
the fossil record, and we get our best view of evolution by integrating the two
approaches. The fossil record remains essential as a source of absolute dates,
based on radioisotope decay in the rock formations in which fossils are found.
Because the fossil record has many gaps, however, precise divergence times
between species are difficult to establish, even for species that leave good fossils
with distinctive morphology.

Figure 4-63 A phylogenetic tree showing the relationship between humans and
the great apes based on nucleotide sequence data. As indicated, the sequences of
the genomes of all four species are estimated to differ from the sequence of the
genome of their last common ancestor by a little over 1.5%. Because changes occur
independently on both diverging lineages, pairwise comparisons reveal twice the
sequence divergence from the last common ancestor. For example, human-orangutan
comparisons typically show sequence divergences of a little over 3%, while
human-chimpanzee comparisons show divergences of approximately 1.2%.
(Modified from F.C. Chen and W.H. Li, Am. J. Hum. Genet. 68:444-456, 2001.)

Figure 4-64 Tracing the ancestral sequence from a sequence comparison of the
coding regions of human and chimpanzee leptin genes. Reading left to right and
top to bottom, a continuous 300-nucleotide segment of a leptin-coding gene is
illustrated. Leptin is a hormone that regulates food intake and energy utilization
in response to the adequacy of fat reserves. As indicated by the codons boxed in
green, only 5 nucleotides (of 441 total) differ between the two species. Moreover,
in only one of the five positions does the difference in nucleotide lead to a
difference in the encoded amino acid. For each of the five variant nucleotide
positions, the corresponding sequence in the gorilla is also indicated. In two
cases, the gorilla sequence agrees with the human sequence, while in three cases
it agrees with the chimpanzee sequence. What was the sequence of the leptin gene
in the last common ancestor? The most economical assumption is that evolution
has followed a pathway requiring the minimum number of mutations consistent
with the data. Thus, it seems likely that the leptin sequence of the last common
ancestor was the same as the human and chimpanzee sequences when they agree;
when they disagree, the gorilla sequence would be used as a tiebreaker. For
convenience, only the first 300 nucleotides of the leptin-coding sequences are
given. The remaining 141 are identical between humans and chimpanzees.

Figure 4-65 The very different rates of evolution of exons and introns, as
illustrated by comparing a portion of the mouse and human leptin genes. Positions
where the sequences differ by a single nucleotide substitution are boxed in green,
and positions that differ by the addition or deletion of nucleotides are boxed in
yellow. Note that, thanks to purifying selection, the coding sequence of the exon
is much more conserved than is the adjacent intron sequence.

Phylogenetic trees whose timing has been calibrated according to the
fossil record suggest that changes in the sequences of particular genes or proteins
tend to occur at a nearly constant rate, although rates that differ from the norm
by as much as twofold are observed in particular lineages. This provides us with a
molecular clock for evolution—or rather a set of molecular clocks corresponding
to different categories of DNA sequence. As in the example in Figure 4-65, the
clock runs most rapidly and regularly in sequences that are not subject to purifying
selection. These include portions of introns that lack splicing or regulatory signals,
the third position in synonymous codons, and genes that have been irreversibly
inactivated by mutation (the so-called pseudogenes). The clock runs most slowly
for sequences that are subject to strong functional constraints—for example, the
amino acid sequences of proteins that engage in specific interactions with large
numbers of other proteins and whose structure is therefore highly constrained,
or the nucleotide sequences that encode the RNA subunits of the ribosome, on
which all protein synthesis depends.
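
A worked example may help to show how such a clock is used. The sketch below
assumes a strictly constant rate and uses only numbers quoted in this chapter: a
human-chimpanzee pairwise divergence of about 1.2% (Figure 4-63) and a separation
time of about 6 million years. Because changes accumulate independently on both
lineages, pairwise divergence equals twice the rate multiplied by the time. The
function names are illustrative, and the resulting date for the human-orangutan
split is only as good as the constant-rate assumption, which can be off by as
much as twofold in particular lineages.

```python
def calibrate_rate(pairwise_divergence, years_since_split):
    """Substitutions per site per year along one lineage."""
    return pairwise_divergence / (2 * years_since_split)


def divergence_time(pairwise_divergence, rate_per_year):
    """Years since two lineages split, assuming a constant rate."""
    return pairwise_divergence / (2 * rate_per_year)


# Calibrate on human-chimpanzee: about 1.2% divergence, about 6 million years.
rate = calibrate_rate(0.012, 6e6)

# Apply to human-orangutan: "a little over 3%" divergence; 0.03 is used here.
t_orangutan = divergence_time(0.03, rate)
print(f"rate = {rate:.1e} per site per year")
print(f"human-orangutan split = {t_orangutan / 1e6:.0f} million years ago")
```

The calibrated rate, roughly one change per billion sites per year, is equivalent
to the figure of one nucleotide pair in a thousand per million years quoted
earlier in this section.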

Occasionally, rapid change is seen in a previously highly conserved sequence.
As discussed later in this chapter, such episodes are especially interesting because
they are thought to reflect periods of strong positive selection for mutations that
have conferred a selective advantage in the particular lineage where the rapid
change occurred.

The pace at which molecular clocks run during evolution is determined not
only by the degree of purifying selection, but also by the mutation rate. Most
notably, in animals, although not in plants, clocks based on functionally
unconstrained mitochondrial DNA sequences run much faster than clocks based on
functionally unconstrained nuclear sequences, because the mutation rate in
animal mitochondria is exceptionally high.

Categories of DNA for which the clock runs fast are most informative for recent
evolutionary events; the mitochondrial DNA clock has been used, for example, to
chronicle the divergence of the Neanderthal lineage from that of modern Homo
sapiens. To study more ancient evolutionary events, one must examine DNA for
which the clock runs more slowly; thus the divergence of the major branches of
the tree of life—bacteria, archaea, and eukaryotes—has been deduced from study
of the sequences specifying ribosomal RNA.

In general, molecular clocks, appropriately chosen, have a finer time
resolution than the fossil record, and they are a more reliable guide to the detailed
structure of phylogenetic trees than are classical methods of tree construction, which
are based on family resemblances in anatomy and embryonic development. For
example, the precise family tree of great apes and humans was not settled until
sufficient molecular sequence data accumulated in the 1980s to produce the
pedigree shown previously in Figure 4-63. And with huge amounts of DNA sequence
now determined from a wide variety of mammals, much better estimates of our
relationship to them are being obtained (Figure 4-66).

Figure 4-66 A phylogenetic tree showing the evolutionary relationships of some
present-day mammals. The length of each line is proportional to the number of
“neutral substitutions”—that is, nucleotide changes at sites where there is assumed
to be no purifying selection. (Adapted from G.M. Cooper et al., Genome Res.
15:901-913, 2005. With permission from Cold Spring Harbor Laboratory Press.)


A Comparison of Human and Mouse Chromosomes Shows How 
the Structures of Genomes Diverge 

As would be expected, the human and chimpanzee genomes are much more
alike than are the human and mouse genomes, even though all three genomes
are roughly the same size and contain nearly identical sets of genes. Mouse and
human lineages have had approximately 80 million years to diverge through
accumulated mutations, versus 6 million years for humans and chimpanzees. In
addition, as indicated in Figure 4-66, rodent lineages (represented by the rat and the
mouse) have unusually fast molecular clocks, and have diverged from the human
lineage more rapidly than otherwise expected.

While the way that the genome is organized into chromosomes is almost
identical between humans and chimpanzees, this organization has diverged greatly
between humans and mice. According to rough estimates, a total of about 180
breakage-and-rejoining events have occurred in the human and mouse lineages
since these two species last shared a common ancestor. In the process, although
the number of chromosomes is similar in the two species (23 per haploid genome
in the human versus 20 in the mouse), their overall structures differ greatly.
Nonetheless, even after the extensive genomic shuffling, there are many large blocks
of DNA in which the gene order is the same in the human and the mouse. These
stretches of conserved gene order in chromosomes are referred to as regions of
synteny. Figure 4-67 illustrates how segments of the different mouse
chromosomes map onto the human chromosome set. For much more distantly related
vertebrates, such as chicken and human, the number of breakage-and-rejoining
events has been much greater and the regions of synteny are much shorter; in
addition, they are often hard to discern because of the divergence of the DNA
sequences that they contain.
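
To make the idea of conserved gene order concrete, the following sketch finds
runs of genes that are adjacent, in the same or in reversed order, in two gene
lists. It is a toy under stated assumptions: each genome is reduced to a single
ordered list of gene names, corresponding genes share the same name, and any
interruption ends a block. The function name `synteny_blocks` and the gene labels
are hypothetical; real synteny maps are built from whole-genome alignments across
many chromosomes.

```python
def synteny_blocks(order_a, order_b, min_genes=2):
    """Return maximal runs of genes that are consecutive in both lists,
    either in the same orientation or inverted."""
    pos_b = {gene: i for i, gene in enumerate(order_b)}
    shared = [gene for gene in order_a if gene in pos_b]
    blocks, current, direction = [], [], 0
    for gene in shared:
        if current:
            step = pos_b[gene] - pos_b[current[-1]]
            # Extend the block only if the gene is the immediate neighbor
            # in list b and the orientation has not flipped.
            if step in (1, -1) and direction in (0, step):
                direction = step
                current.append(gene)
                continue
            if len(current) >= min_genes:
                blocks.append(current)
        current, direction = [gene], 0
    if len(current) >= min_genes:
        blocks.append(current)
    return blocks


# Hypothetical gene orders: A-B-C is inverted and E-F-G has moved.
human_order = ["A", "B", "C", "D", "E", "F", "G"]
mouse_order = ["E", "F", "G", "C", "B", "A", "D"]
for block in synteny_blocks(human_order, mouse_order):
    print(block)
```

In this example two blocks survive the rearrangements, one of them inverted, while
the isolated gene D belongs to neither.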

Figure 4-67 Synteny between human and mouse chromosomes. In this diagram,
the human chromosome set is shown above, with each part of each chromosome
colored according to the mouse chromosome with which it is syntenic. The color
coding used for each mouse chromosome is shown below. Heterochromatic highly
repetitive regions (such as centromeres) that are difficult to sequence cannot be
mapped in this way; these are colored black. (Adapted from E.E. Eichler and
D. Sankoff, Science 301:793-797, 2003. With permission from AAAS.)

An unexpected conclusion from a detailed comparison of the complete mouse
and human genome sequences, confirmed by subsequent comparisons between
the genomes of other vertebrates, is that small blocks of DNA sequence are being
deleted from and added to genomes at a surprisingly rapid rate. Thus, if we
assume that our common ancestor had a genome of human size (about 3.2 billion
nucleotide pairs), mice would have lost a total of about 45% of that genome from
accumulated deletions during the past 80 million years, while humans would
have lost about 25%. However, substantial sequence gains from many small
chromosome duplications and from the multiplication of transposons have
compensated for these deletions. As a result, our genome size is thought to be practically
unchanged from that of the last common ancestor of humans and mice, while the
mouse genome is smaller by only about 0.3 billion nucleotides.
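
The bookkeeping behind the deletion estimates for mouse and human is simple
enough to check directly. The sketch below assumes, as the text does, an
ancestral genome of 3.2 billion nucleotide pairs, and takes the present-day mouse
genome to be 0.3 billion nucleotide pairs smaller than the human one; the function
name is illustrative. It solves final = ancestral - lost + gained for the amount
gained.

```python
def implied_gain(ancestral_size, fraction_lost, final_size):
    """DNA gained (same units as the sizes) so that
    final_size = ancestral_size - lost + gained."""
    lost = ancestral_size * fraction_lost
    return final_size - ancestral_size + lost


ANCESTOR = 3.2  # billion nucleotide pairs (assumed equal to the human genome)

human_gain = implied_gain(ANCESTOR, 0.25, final_size=3.2)
mouse_gain = implied_gain(ANCESTOR, 0.45, final_size=3.2 - 0.3)
print(f"human: gained about {human_gain:.2f} billion nucleotide pairs")
print(f"mouse: gained about {mouse_gain:.2f} billion nucleotide pairs")
```

On these assumptions, both lineages gained on the order of a billion nucleotide
pairs over roughly 80 million years, which is why the net change in genome size is
small despite the large losses.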


Good evidence for the loss of DNA sequences in small blocks during evolution
can be obtained from a detailed comparison of regions of synteny in the human
and mouse genomes. The comparative shrinkage of the mouse genome can be
clearly seen from such comparisons, with the net loss of sequences scattered
throughout the long stretches of DNA that are otherwise homologous
(Figure 4-68).

Figure 4-68 Comparison of a syntenic portion of mouse and human genomes.
About 90% of the two genomes can be aligned in this way. Note that while there
is an identical order of the matched index sequences (red marks), there has been a
net loss of DNA in the mouse lineage that is interspersed throughout the entire
region. This type of net loss is typical for all such regions, and it accounts for the
fact that the mouse genome contains 14% less DNA than does the human genome.
(Adapted from Mouse Genome Sequencing Consortium, Nature 420:520-562, 2002.
With permission from Macmillan Publishers Ltd.)

DNA is added to genomes both by the spontaneous duplication of
chromosomal segments that are typically tens of thousands of nucleotide pairs long
(as will be discussed shortly) and by insertion of new copies of active transposons.
Most transposition events are duplicative, because the original copy of the
transposon stays where it was when a copy inserts at the new site; see, for
example, Figure 5-63. Comparison of the DNA sequences derived from transposons
in the human and the mouse readily reveals some of the sequence additions
(Figure 4-69).

It remains a mystery why all mammals have maintained genome sizes of
roughly 3 billion nucleotide pairs that contain nearly identical sets of genes,
even though only approximately 150 million nucleotide pairs appear to be under
sequence-specific functional constraints.


The Size of a Vertebrate Genome Reflects the Relative Rates of
DNA Addition and DNA Loss in a Lineage

In more distantly related vertebrates, genome size can vary considerably,
apparently without a drastic effect on the organism or its number of genes. Thus, the
chicken genome, at one billion nucleotide pairs, is only about one-third the size
of the mammalian genome. An extreme example is the puffer fish, Fugu rubripes
(Figure 4-70), which has a tiny genome for a vertebrate (0.4 billion nucleotide
pairs compared to 1 billion or more for many other fish). The small size of the Fugu
genome is largely due to the small size of its introns. Specifically, Fugu introns, as
well as other noncoding segments of the Fugu genome, lack the repetitive DNA
that makes up a large portion of the genomes of most well-studied vertebrates.
Nevertheless, the positions of the Fugu introns between the exons of each gene
are almost the same as in mammalian genomes (Figure 4-71).

While initially a mystery, we now have a simple explanation for such large
differences in genome size between similar organisms: because all vertebrates
experience a continuous process of DNA loss and DNA addition, the size of a genome
merely depends on the balance between these opposing processes acting over
millions of years. Suppose, for example, that in the lineage leading to Fugu, the
rate of DNA addition happened to slow greatly. Over long periods of time, this
would result in a major “cleansing” from this fish genome of those DNA sequences
whose loss could be tolerated. The result is an unusually compact genome,
relatively free of junk and clutter, but retaining through purifying selection the
vertebrate DNA sequences that are functionally important. This makes Fugu, with
its 400 million nucleotide pairs of DNA, a valuable resource for genome research
aimed at understanding humans.
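
The balance argument can be illustrated with a toy model. Everything in it is an
assumption made for illustration, not a measured quantity: DNA is added at a
constant rate, dispensable DNA is lost at a rate proportional to how much of it
there is, and a fixed core of functionally important DNA is never lost. Under
these assumptions the genome settles at a size equal to the core plus the ratio of
the addition rate to the loss rate, so slowing addition shrinks the genome toward
its core. The parameter values are hypothetical and were chosen only so that the
two equilibria resemble a mammal-sized and a Fugu-sized genome.

```python
def simulate_genome_size(start, core, add_rate, loss_rate, steps):
    """Iterate size -> size + add_rate - loss_rate * (size - core).

    Sizes are in millions of nucleotide pairs; each step is an arbitrary
    unit of evolutionary time. All parameter values are hypothetical.
    """
    size = start
    for _ in range(steps):
        size += add_rate - loss_rate * (size - core)
    return size


def equilibrium_size(core, add_rate, loss_rate):
    """Size at which additions exactly balance losses."""
    return core + add_rate / loss_rate


# Two hypothetical lineages that differ only in their rate of DNA addition.
fast_adding = simulate_genome_size(1000, core=300, add_rate=27, loss_rate=0.01, steps=2000)
slow_adding = simulate_genome_size(1000, core=300, add_rate=1, loss_rate=0.01, steps=2000)
print(round(fast_adding), round(slow_adding))
```

Both lineages start from the same size and keep the same core; only the addition
rate differs, yet the simulated genomes end up several-fold apart in size.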

We Can Infer the Sequence of Some Ancient Genomes 

The genomes of ancestral organisms can be inferred, but most can never be
directly observed. DNA is very stable compared with most organic molecules,
but it is not perfectly stable, and its progressive degradation, even under the best
circumstances, means that it is virtually impossible to extract sequence
information from fossils that are more than a million years old. Although a modern
organism such as the horseshoe crab looks remarkably similar to fossil ancestors
that lived 200 million years ago, there is every reason to believe that the
horseshoe-crab genome has been changing during all that time in much the same way
as in other evolutionary lineages, and at a similar rate. Selection must have
maintained key functional properties of the horseshoe-crab genome to account for the
morphological stability of the lineage. However, comparisons between different
present-day organisms show that the fraction of the genome subject to purifying
selection is small; hence, it is fair to assume that the genome of the modern
horseshoe crab, while preserving features critical for function, must differ greatly from
that of its extinct ancestors, known to us only through the fossil record.

It is possible to get direct sequence information by examining DNA samples
from ancient materials if these are not too old. In recent years, technical advances
have allowed DNA sequencing from exceptionally well-preserved bone fragments
that date from more than 100,000 years ago. Although any DNA this old will be
imperfectly preserved, a sequence of the Neanderthal genome has been
reconstructed from many millions of short DNA sequences, revealing—among other
things—that our human ancestors interbred with Neanderthals in Europe and
that modern humans have inherited specific genes from them (Figure 4-72). The
average difference in DNA sequence between humans and Neanderthals shows
that our two lineages diverged somewhere between 270,000 and 440,000 years
ago, well before the time that humans are believed to have migrated out of Africa.

Figure 4-69 A comparison of the β-globin gene cluster in the human and mouse
genomes, showing the locations of transposable elements. This stretch of the
human genome contains five functional β-globin-like genes (orange); the
comparable region from the mouse genome has only four. The positions of the human
Alu sequences are indicated by green circles, and the human L1 sequences by red
circles. The mouse genome contains different but related transposable elements:
the positions of B1 elements (which are related to the human Alu sequences) are
indicated by blue triangles, and the positions of the mouse L1 elements (which
are related to the human L1 sequences) are indicated by orange triangles. The
absence of transposable elements from the globin structural genes can be
attributed to purifying selection, which would have eliminated any insertion that
compromised gene function. (Courtesy of Ross Hardison and Webb Miller.)

Figure 4-70 The puffer fish, Fugu rubripes. (Courtesy of Byrappa Venkatesh.)

But what about deciphering the genomes of much older ancestors, those for
which no useful DNA samples can be isolated? For organisms that are as closely
related as human and chimpanzee, we saw that this may not be difficult: reference
to the gorilla sequence can be used to sort out which of the few sequence
differences between human and chimpanzee are inherited from our common ancestor
some 6 million years ago (see Figure 4-64). And for an ancestor that has produced
a large number of different organisms alive today, the DNA sequences of many
species can be compared simultaneously to unscramble much of the ancestral
sequence, allowing scientists to derive DNA sequences much farther back in time.
For example, from the genome sequences currently being obtained for dozens of
modern placental mammals, it should be possible to infer much of the genome
sequence of their 100 million-year-old common ancestor—the precursor of
species as diverse as dog, mouse, rabbit, armadillo, and human (see Figure 4-66).

Multispecies Sequence Comparisons Identify Conserved DNA 
Sequences of Unknown Function 

The mass of DNA sequence now in databases (hundreds of billions of nucleotide
pairs) provides a rich resource that scientists can mine for many purposes. This
information can be used not only to unscramble the evolutionary pathways that
have led to modern organisms, but also to provide insights into how cells and
organisms function. Perhaps the most remarkable discovery in this realm comes
from the observation that a striking amount of DNA sequence that does not code
for protein has been conserved during mammalian evolution (see Table 4-1,
p. 184). This is most clearly revealed when we align and compare DNA synteny


Figure 4-71 Comparison of the
genomic sequences of the human
and Fugu genes encoding the protein
huntingtin. Both genes (indicated in red)
contain 67 short exons that align in 1:1
correspondence to one another; these
exons are connected by curved lines.
The human gene is 7.5 times larger than
the Fugu gene (180,000 versus 24,000
nucleotide pairs). The size difference is
entirely due to larger introns in the human
gene. The larger size of the human
introns is due in part to the presence of
retrotransposons (discussed in Chapter
5), whose positions are represented by
green vertical lines; the Fugu introns lack
retrotransposons. In humans, mutation of
the huntingtin gene causes Huntington’s
disease, an inherited neurodegenerative
disorder. (Adapted from S. Baxendale et
al., Nat. Genet. 10:67-76, 1995. With
permission from Macmillan Publishers Ltd.)


Figure 4-72 The Neanderthals. (A) Map
of Europe showing the location of the
cave in Croatia where most of the bones
used to isolate the DNA used to derive
the Neanderthal genome sequence were
discovered. (B) Photograph of the Vindija
cave. (C) Photograph of the 38,000-year-
old bones from Vindija. More recent
studies have succeeded in extracting
DNA sequence information from hominid
remains that are considerably older (see
Movie 8.3). (B, courtesy of Johannes
Krause; C, from R.E. Green et al., Science
328:710-722, 2010. Reprinted with
permission from AAAS.)
blocks from many different species, thereby identifying large numbers of so-called
multispecies conserved sequences: some of these code for protein, but most of
them do not (Figure 4-73).
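
As a rough illustration of this kind of comparison, the following Python sketch computes the percent identity between a reference sequence and another aligned sequence in fixed-size blocks, in the spirit of the 25-nucleotide blocks plotted in Figure 4-73. The function name, the toy sequences, and the choice to count gap characters as mismatches are assumptions made for this example only; real multispecies analyses rely on dedicated alignment and conservation-scoring software.

```python
def block_percent_identity(reference, other, block_size=25):
    """Percent identity of `other` to `reference`, block by block.

    Both strings must come from the same alignment and therefore have
    equal length. A gap character ('-') in either sequence is counted
    as a mismatch (an assumption of this sketch).
    """
    if len(reference) != len(other):
        raise ValueError("aligned sequences must have equal length")
    identities = []
    for start in range(0, len(reference), block_size):
        ref_block = reference[start:start + block_size]
        other_block = other[start:start + block_size]
        matches = sum(
            1 for a, b in zip(ref_block, other_block)
            if a == b and a != "-"
        )
        identities.append(100.0 * matches / len(ref_block))
    return identities


# Toy aligned sequences (hypothetical, not real CFTR data).
human = "ACGTACGTAC" * 5          # 50 aligned positions
mouse = human[:25] + "T" * 25     # first block identical, second diverged
print(block_percent_identity(human, mouse))
```

Blocks whose identity stays high across many species at once would be candidates for multispecies conserved sequences; deciding how high is high enough is a statistical question that this sketch does not address.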

Most of the noncoding conserved sequences discovered in this way turn out
to be relatively short, containing between 50 and 200 nucleotide pairs. Among the
most mysterious are the so-called "ultraconserved" noncoding sequences,
exemplified by more than 5000 DNA segments over 100 nucleotides long that are exactly
the same in human, mouse, and rat. Most have undergone little or no change
since mammalian and bird ancestors diverged about 300 million years ago. The
strict conservation implies that even though the sequences do not encode
proteins, each nevertheless has an important function maintained by purifying
selection. The puzzle is to unravel what those functions are.

Many of the conserved sequences that do not code for protein are now known
to produce untranslated RNA molecules, such as the thousands of long noncoding
RNAs (lncRNAs) that are thought to have important functions in regulating gene
transcription. As we shall also see in Chapter 7, others are short regions of DNA
scattered throughout the genome that directly bind proteins involved in gene
regulation. But it is uncertain how much of the conserved noncoding DNA can be
accounted for in these ways, and the function of most of it remains a mystery. This
enigma highlights how much more we need to learn about the fundamental
biological mechanisms that operate in animals and other complex organisms, and its
solution is certain to have profound consequences for medicine.

How can cell biologists tackle the mystery of noncoding conserved DNA?
Traditionally, attempts to determine the function of a puzzling DNA sequence begin
by looking at the consequences of its experimental disruption. But many DNA
sequences that are crucial for an organism in the wild can be expected to have no
noticeable effect on its phenotype under laboratory conditions: what is required
for a mouse to survive in a laboratory cage is very much less than what is required
Figure 4-73 The detection of
multispecies conserved sequences.
In this example, genome sequences
for each of the organisms shown have
been compared with the indicated region
of the human CFTR (cystic fibrosis
transmembrane conductance regulator)
gene; this region contains one exon plus
a large amount of intronic DNA. For each
organism, the percent identity with human
for each 25-nucleotide block is plotted
in green. In addition, a computational
algorithm has been used to detect the
sequences within this region that are most
highly conserved when the sequences from
all of the organisms are taken into account.
Besides the exon (dark blue on the line at
the top of the figure), the positions of three
other blocks of multispecies conserved
sequences are indicated (pale blue). The
function of most such sequences in the
human genome is not known. (Courtesy of
Eric D. Green.)
for it to succeed in nature. Moreover, calculations based on population genetics
reveal that just a tiny selective advantage—less than a 0.1% difference in
survival—can be enough to strongly favor retaining a particular DNA sequence over
evolutionary time spans. One should therefore not be surprised to find that many
DNA sequences that are ultraconserved can be deleted from the mouse genome
without any noticeable effect on that mouse in a laboratory.
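
The text does not say which population-genetic calculation it has in mind. One standard result that makes the same point is the diffusion approximation for the fixation probability of a single new mutation, P = (1 - e^(-2s)) / (1 - e^(-4Ns)), where s is the selective advantage and N the population size. The sketch below evaluates it; the formula, the population size of 10,000, and the function name are assumptions brought in for illustration, not values taken from this passage.

```python
import math


def fixation_probability(s, n_individuals):
    """Approximate fixation probability of a single new mutation.

    Uses the standard diffusion approximation for a diploid population
    of constant size N in which each copy of the allele adds a selective
    advantage s. For s == 0 it returns the neutral value 1/(2N).
    """
    if s == 0:
        return 1.0 / (2 * n_individuals)
    # expm1(x) = exp(x) - 1, which stays accurate for very small x.
    return math.expm1(-2.0 * s) / math.expm1(-4.0 * n_individuals * s)


N = 10_000                                  # assumed population size
neutral = fixation_probability(0.0, N)
favored = fixation_probability(0.001, N)    # a 0.1% advantage
print(f"neutral:  {neutral:.6f}")
print(f"s = 0.1%: {favored:.6f} ({favored / neutral:.0f}-fold higher)")
```

Under these assumptions an advantage far too small to measure in a laboratory cage still changes the long-term fate of a sequence by more than an order of magnitude relative to a neutral one.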

A second important approach for discovering the function of a mysterious
noncoding DNA sequence uses biochemical techniques to identify proteins or
RNA molecules that bind to it—and/or to any RNA molecules that it produces.
Most of this task still lies before us, but a start has been made (see p. 435).

Changes in Previously Conserved Sequences Can Help Decipher 
Critical Steps in Evolution 

Given genome sequence information, we can tackle another intriguing question:
What alterations in our DNA have made humans so different from other
animals—or for that matter, what makes any individual species so different from its
relatives? For example, as soon as both the human and the chimpanzee genome
sequences became available, scientists began searching for DNA sequence
changes that might account for the striking differences between us and
chimpanzees. With 3.2 billion nucleotide pairs to compare in the two species, this
might seem an impossible task. But the job was made much easier by confining
the search to 35,000 clearly defined multispecies conserved sequences (a total
of about 5 million nucleotide pairs), representing parts of the genome that are
most likely to be functionally important. Though these sequences are conserved
strongly, they are not conserved perfectly, and when the version in one species is
compared with that in another they are generally found to have drifted apart by
a small amount corresponding simply to the time elapsed since the last common
ancestor. In a small proportion of cases, however, one sees signs of a sudden
evolutionary spurt. For example, some DNA sequences that have been highly
conserved in other mammalian species are found to have accumulated nucleotide
changes exceptionally rapidly during the 6 million years of human evolution since
we diverged from the chimpanzees. These human accelerated regions (HARs) are
thought to reflect functions that have been especially important in making us
different in some useful way.

About 50 such sites were identified in one study, one-fourth of which were
located near genes associated with neural development. The sequence exhibiting
the most rapid change (18 changes between human and chimpanzee, compared
to only two changes between chimpanzee and chicken) was examined further
and found to encode a 118-nucleotide noncoding RNA molecule, HAR1F (human
accelerated region 1F), that is produced in the human cerebral cortex at a critical
time during brain development. The function of this HAR1F RNA is not yet known,
but findings of this type are stimulating research studies that may shed light on
crucial features of the human brain.
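
To give a feeling for how extreme this acceleration is, the sketch below turns the counts quoted above into crude rates of change. It assumes that the chimpanzee and chicken lineages separated about 300 million years ago (the mammal-bird divergence time used elsewhere in this chapter) and ignores multiple changes at the same site and the question of which lineage each change occurred on. The function name is hypothetical, and published analyses use formal statistical tests rather than this simple ratio.

```python
def changes_per_site_per_myr(changes, length_nt, divergence_myr):
    """Crude rate: observed changes per nucleotide per million years
    since the two lineages separated (no correction for multiple hits
    or for which lineage the changes occurred on)."""
    return changes / (length_nt * divergence_myr)


LENGTH_NT = 118                      # length of the HAR1F sequence
human_chimp = changes_per_site_per_myr(18, LENGTH_NT, 6)
chimp_chicken = changes_per_site_per_myr(2, LENGTH_NT, 300)   # 300 Myr assumed
acceleration = human_chimp / chimp_chicken

print(f"human vs chimpanzee:   {human_chimp:.2e} changes/site/Myr")
print(f"chimpanzee vs chicken: {chimp_chicken:.2e} changes/site/Myr")
print(f"apparent acceleration: about {acceleration:.0f}-fold")
```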

A related approach in the search for the important mutations that contributed
to human evolution likewise begins with DNA sequences that have been
conserved during mammalian evolution, but rather than screening for accelerated
changes in individual nucleotides, it focuses instead on chromosome sites that
have experienced deletions in the 6 million years since our lineage diverged from
that of chimpanzees. More than 500 such sequences—conserved among other
species but deleted in humans—have been discovered. Each deletion removes an
average of 95 nucleotides of DNA sequence. Only one of these deletions affects a
protein-coding region: the rest are thought to alter regions that affect how nearby
genes are expressed, an expectation that has been experimentally confirmed
in a few cases. A large proportion of the presumed regulatory regions identified
in this way lie near genes that affect neural function and/or near genes involved in
steroid signaling, suggesting that changes in the nervous system and in immune
or reproductive functions have played an especially important role in human
evolution.
Mutations in the DNA Sequences That Control Gene Expression
Have Driven Many of the Evolutionary Changes in Vertebrates

The vast hoard of genomic sequence data now being accumulated can be explored
in many other ways to reveal events that happened even hundreds of millions of
years ago. For example, one can attempt to trace the origins of the regulatory
elements in DNA that have played critical parts in vertebrate evolution. One such
study began with the identification of nearly 3 million noncoding sequences,
averaging 28 base pairs in length, that have been conserved in recent vertebrate
evolution while being absent in more ancient ancestors. Each of these special
noncoding sequences is likely to represent a functional innovation peculiar to a particular
branch of the vertebrate family tree, and most of them are thought to consist of
regulatory DNA that governs the expression of a neighboring gene. Given full
genome sequences, one can identify the genes that lie closest and thus appear
most likely to have fallen under the sway of these novel regulatory elements. By
comparing many different species, with known divergence times, one can also
estimate when each such regulatory element came into existence as a conserved
feature. The findings suggest remarkable evolutionary differences between the
various functional classes of genes (Figure 4-74). Conserved regulatory elements
that originated early in vertebrate evolution—that is, more than about 300 million
years ago, which is when the mammalian lineage split from the lineage leading to
birds and reptiles—seem to be mostly associated with genes that code for
transcription regulator proteins and for proteins with roles in organizing embryonic
development. Then came an era when the regulatory DNA innovations arose next
to genes coding for receptors for extracellular signals. Finally, over the course of
the past 100 million years, the regulatory innovations seem to have been
concentrated in the neighborhood of genes coding for proteins (such as protein kinases)
that function to modify other proteins post-translationally.

Many questions remain to be answered about these phenomena and what
they mean. One possible interpretation is that the logic—the circuit diagram—of
the gene regulatory network in vertebrates was established early, and that more
recent evolutionary change has mainly occurred through the tuning of
quantitative parameters. This could help to explain why, among the mammals, for
example, the basic body plan—the topology of the tissues and organs—has been largely
conserved.

Gene Duplication Also Provides an Important Source of Genetic 
Novelty During Evolution 

Evolution depends on the creation of new genes, as well as on the modification
of those that already exist. How does this occur? When we compare organisms
that seem very different—a primate with a rodent, for example, or a mouse with
a fish—we rarely encounter genes in the one species that have no homolog in the
Figure 4-74 The types of changes
in gene regulation inferred to have
predominated during the evolution of
our vertebrate ancestors. To produce
the information summarized in this plot,
wherever possible the type of gene
regulated by each conserved noncoding
sequence was inferred from the identity of
its closest protein-coding gene. The fixation
time for each conserved sequence was
then used to derive the conclusions shown.
(Based on C.B. Lowe et al., Science
333:1019-1024, 2011. With permission
from AAAS.)
other. Genes without homologous counterparts are relatively scarce even when
we compare such divergent organisms as a mammal and a worm. On the other
hand, we frequently find gene families that have different numbers of members in
different species. To create such families, genes have been repeatedly duplicated,
and the copies have then diverged to take on new functions that often vary from
one species to another.

Gene duplication occurs at high rates in all evolutionary lineages, contributing
to the vigorous process of DNA addition discussed previously. In a detailed study
of spontaneous duplications in yeast, duplications of 50,000 to 250,000 nucleotide
pairs were commonly observed, most of which were tandemly repeated. These
appeared to result from DNA replication errors that led to the inexact repair of
double-strand chromosome breaks. A comparison of the human and chimpanzee
genomes reveals that, since the time that these two organisms diverged, such
segmental duplications have added about 5 million nucleotide pairs to each genome
every million years, with the average duplication size being about 50,000
nucleotide pairs (although there are some duplications five times larger). In fact, if one
counts nucleotides, duplication events have created more differences between
our two species than have single-nucleotide substitutions.
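
The last claim can be checked with a back-of-the-envelope calculation. The sketch below combines the duplication figures from this paragraph with three numbers used elsewhere in this chapter: a divergence time of about 6 million years, a genome of about 3.2 billion nucleotide pairs, and a typical human-chimpanzee sequence difference of about 1%. It assumes a constant duplication rate and ignores later loss of duplicated DNA, so it is only an order-of-magnitude estimate.

```python
GENOME_SIZE_NT = 3.2e9            # nucleotide pairs per genome
DIVERGENCE_MYR = 6                # million years since the lineages split
DUP_NT_PER_MYR = 5e6              # added to each genome per million years
MEAN_DUP_SIZE_NT = 50_000         # average size of one duplication
SUBSTITUTION_DIVERGENCE = 0.01    # ~1% single-nucleotide difference

# Nucleotide pairs added by duplication, per genome and in both lineages.
dup_nt_per_genome = DUP_NT_PER_MYR * DIVERGENCE_MYR
dup_nt_both = 2 * dup_nt_per_genome

# Implied number of duplication events per genome per million years.
dup_events_per_myr = DUP_NT_PER_MYR / MEAN_DUP_SIZE_NT

# Nucleotide pairs that differ because of single-nucleotide substitutions.
substituted_nt = SUBSTITUTION_DIVERGENCE * GENOME_SIZE_NT

print(f"duplicated DNA, both lineages: {dup_nt_both:,.0f} nucleotide pairs")
print(f"substituted positions:         {substituted_nt:,.0f} nucleotide pairs")
print(f"duplication events per Myr:    {dup_events_per_myr:.0f}")
```

On these rough numbers, duplications account for about twice as many differing nucleotide pairs as substitutions do.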

Duplicated Genes Diverge 

What is the fate of newly duplicated genes? In most cases, there is presumed to
be little or no selection—at least initially—to maintain the duplicated state since
either copy can provide an equivalent function. Hence, many duplication events
are likely to be followed by loss-of-function mutations in one or the other gene.
This cycle would functionally restore the one-gene state that preceded the
duplication. Indeed, there are many examples in contemporary genomes where one copy
of a duplicated gene can be seen to have become irreversibly inactivated by
multiple mutations. Over time, the sequence similarity between such a pseudogene
and the functional gene whose duplication produced it would be expected to be
eroded by the accumulation of many mutations in the pseudogene—the
homologous relationship eventually becoming undetectable.

An alternative fate for gene duplications is for both copies to remain
functional, while diverging in their sequence and pattern of expression, thus taking
on different roles. This process of "duplication and divergence" almost certainly
explains the presence of large families of genes with related functions in
biologically complex organisms, and it is thought to play a critical role in the evolution
of increased biological complexity. An examination of many different eukaryotic
genomes suggests that the probability that any particular gene will undergo a
duplication event that spreads to most or all individuals in a species is
approximately 1 percent every million years.

Whole-genome duplications offer particularly dramatic examples of the
duplication-divergence cycle. A whole-genome duplication can occur quite simply: all
that is required is one round of genome replication in a germ-line cell lineage
without a corresponding cell division. Initially, the chromosome number simply
doubles. Such abrupt increases in the ploidy of an organism are common,
particularly in fungi and plants. After a whole-genome duplication, all genes exist
as duplicate copies. However, unless the duplication event occurred so recently
that there has been little time for subsequent alterations in genome structure,
the results of a series of segmental duplications—occurring at different times—
are hard to distinguish from the end product of a whole-genome duplication. In
mammals, for example, the role of whole-genome duplications versus a series of
piecemeal duplications of DNA segments is quite uncertain. Nevertheless, it is
clear that a great deal of gene duplication has occurred in the distant past.

Analysis of the genome of the zebrafish, in which at least one whole-genome
duplication is thought to have occurred hundreds of millions of years ago, has cast
some light on the process of gene duplication and divergence. Although many
duplicates of zebrafish genes appear to have been lost by mutation, a significant
fraction—perhaps as many as 30-50%—have diverged functionally while both
Figure 4-75 A comparison of the structure of one-chain and four-chain
globins. The four-chain globin shown is hemoglobin, which is a complex of
two α-globin and two β-globin chains. The one-chain globin present in some
primitive vertebrates represents an intermediate in the evolution of the four-chain
globin. With oxygen bound it exists as a monomer; without oxygen it dimerizes.


copies have remained active. In many cases, the most obvious functional
difference between the duplicated genes is that they are expressed in different tissues
or at different stages of development. One attractive theory to explain such an end
result imagines that different, mildly deleterious mutations occur quickly in both
copies of a duplicated gene set. For example, one copy might lose expression in
a particular tissue as a result of a regulatory mutation, while the other copy loses
expression in a second tissue. Following such an occurrence, both gene copies
would be required to provide the full range of functions that were once supplied
by a single gene; hence, both copies would now be protected from loss through
inactivating mutations. Over a longer period, each copy could then undergo
further changes through which it could acquire new, specialized features.

The Evolution of the Globin Gene Family Shows How DNA 
Duplications Contribute to the Evolution of Organisms 

The globin gene family provides an especially good example of how DNA
duplication generates new proteins, because its evolutionary history has been worked
out particularly well. The unmistakable similarities in amino acid sequence and
structure among the present-day globins indicate that they all must derive from a
common ancestral gene, even though some are now encoded by widely separated
genes in the mammalian genome.

We can reconstruct some of the past events that produced the various types
of oxygen-carrying hemoglobin molecules by considering the different forms of
the protein in organisms at different positions on the tree of life. A molecule like
hemoglobin was necessary to allow multicellular animals to grow to a large size,
since large animals cannot simply rely on the diffusion of oxygen through the
body surface to oxygenate their tissues adequately. But oxygen plays a vital part in
the life of nearly all living organisms, and oxygen-binding proteins homologous to
hemoglobin can be recognized even in plants, fungi, and bacteria. In animals, the
most primitive oxygen-carrying molecule is a globin polypeptide chain of about
150 amino acids that is found in many marine worms, insects, and primitive fish.
The hemoglobin molecule in more complex vertebrates, however, is composed of
two kinds of globin chains. It appears that about 500 million years ago, during the
continuing evolution of fish, a series of gene mutations and duplications occurred.
These events established two slightly different globin genes in the genome of each
individual, coding for α- and β-globin chains that associate to form a hemoglobin
molecule consisting of two α chains and two β chains (Figure 4-75). The four
oxygen-binding sites in the α₂β₂ molecule interact, allowing a cooperative allosteric
change in the molecule as it binds and releases oxygen, which enables
hemoglobin to take up and release oxygen more efficiently than the single-chain version.

Still later, during the evolution of mammals, the β-chain gene apparently
underwent duplication and mutation to give rise to a second β-like chain that
is synthesized specifically in the fetus. The resulting hemoglobin molecule has a
higher affinity for oxygen than adult hemoglobin and thus helps in the transfer
of oxygen from the mother to the fetus. The gene for the new β-like chain
subsequently duplicated and mutated again to produce two new genes, ε and γ, the ε
chain being produced earlier in development (to form α₂ε₂) than the fetal γ chain,
which forms α₂γ₂. A duplication of the adult β-chain gene occurred still later,
during primate evolution, to give rise to a δ-globin gene and thus to a minor form
of hemoglobin (α₂δ₂) that is found only in adult primates (Figure 4-76).

Each of these duplicated genes has been modified by point mutations that
affect the properties of the final hemoglobin molecule, as well as by changes in
regulatory regions that determine the timing and level of expression of the gene.
Figure 4-76 An evolutionary scheme
for the globin chains that carry oxygen
in the blood of animals. The scheme
emphasizes the β-like globin gene family.
A relatively recent gene duplication of the
γ-chain gene produced γG and γA, which
are fetal β-like chains of identical function.
The location of the globin genes in the
human genome is shown at the top of
the figure.
As a result, each globin is made in different amounts at different times of human
development.

The history of these gene duplications is reflected in the arrangement of
hemoglobin genes in the genome. In the human genome, the genes that arose from the
original β gene are arranged as a series of homologous DNA sequences located
within 50,000 nucleotide pairs of one another on a single chromosome. A similar
cluster of human α-globin genes is located on a separate chromosome. Not only
other mammals, but birds too have their α- and β-globin gene clusters on
separate chromosomes. In the frog Xenopus, however, they are together, suggesting
that a chromosome translocation event in the lineage of birds and mammals
separated the two gene clusters about 300 million years ago, soon after our ancestors
diverged from amphibians (see Figure 4-76).

There are several duplicated globin DNA sequences in the α- and β-globin
gene clusters that are not functional genes but pseudogenes. These have a close
sequence similarity to the functional genes but have been disabled by
mutations that prevent their expression as functional proteins. The existence of such
pseudogenes makes it clear that, as expected, not every DNA duplication leads to
a new functional gene.


Genes Encoding New Proteins Can Be Created by the 
Recombination of Exons 


The role of DNA duplication in evolution is not confined to the expansion of
gene families. It can also act on a smaller scale to create single genes by
stringing together short duplicated segments of DNA. The proteins encoded by genes
generated in this way can be recognized by the presence of repeating similar
protein domains, which are covalently linked to one another in series. The
immunoglobulins (Figure 4-77), for example, as well as most fibrous proteins (such as
collagens) are encoded by genes that have evolved by repeated duplications of a
primordial DNA sequence.

In genes that have evolved in this way, as well as in many other genes, each
separate exon often encodes an individual protein folding unit, or domain. It is
believed that the organization of DNA coding sequences as a series of such exons
separated by long introns has greatly facilitated the evolution of new proteins. The
duplications necessary to form a single gene coding for a protein with repeating
domains, for example, can easily occur by breaking and rejoining the DNA
anywhere in the long introns on either side of an exon; without introns there would be
only a few sites in the original gene at which a recombinational exchange between
DNA molecules could duplicate the domain and not disrupt it. By enabling the
duplication to occur by recombination at many potential sites rather than just a
few, introns increase the probability of a favorable duplication event.

More generally, we know from genome sequences that the various parts of
genes—both their individual exons and their regulatory elements—have served
as modular elements that have been duplicated and moved about the genome
to create the great diversity of living things. Thus, for example, many present-day
proteins are formed as a patchwork of domains from different origins, reflecting
their complex evolutionary history (see Figure 3-17).

Neutral Mutations Often Spread to Become Fixed in a Population,
with a Probability That Depends on Population Size

In comparisons between two species that have diverged from one another by
millions of years, it makes little difference which individuals from each species are

Figure 4-77 Schematic view of an antibody (immunoglobulin) molecule.
This molecule is a complex of two identical heavy chains and two identical
light chains. Each heavy chain contains four similar, covalently linked
domains, while each light chain contains two such domains. Each domain
is encoded by a separate exon, and all of the exons are thought to have
evolved by the serial duplication of a single ancestral exon.
compared. For example, typical human and chimpanzee DNA sequences differ
from one another by about 1%. In contrast, when the same region of the genome
is sampled from two randomly chosen humans, the differences are typically about
0.1%. For more distantly related organisms, the interspecies differences outshine
intraspecies variation even more dramatically. However, each "fixed difference"
between the human and the chimpanzee (in other words, each difference that is
now characteristic of all or nearly all individuals of each species) started out as a
new mutation in a single individual. If the size of the interbreeding population in
which the mutation occurred is N, the initial allele frequency for a new mutation
would be 1/(2N) for a diploid organism. How does such a rare mutation become
fixed in the population, and hence become a characteristic of the species rather
than of a few scattered individuals?

The answer to this question depends on the functional consequences of the
mutation. If the mutation has a significantly deleterious effect, it will simply be
eliminated by purifying selection and will not become fixed. (In the most extreme
case, the individual carrying the mutation will die without producing progeny.)
Conversely, the rare mutations that confer a major reproductive advantage on
individuals who inherit them can spread rapidly in the population. Because
humans reproduce sexually and genetic recombination occurs each time a
gamete is formed (discussed in Chapter 5), the genome of each individual who has
inherited the mutation will be a unique recombinational mosaic of segments
inherited from a large number of ancestors. The selected mutation along with a
modest amount of neighboring sequence—ultimately inherited from the
individual in which the mutation occurred—will simply be one piece of this huge mosaic.

The great majority of mutations that are not harmful are not beneficial either.
These selectively neutral mutations can also spread and become fixed in a
population, and they make a large contribution to evolutionary change in genomes.
For example, as we saw earlier, they account for most of the DNA sequence
differences between apes and humans. The spread of neutral mutations is not as
rapid as the spread of the rare strongly advantageous mutations. It depends on
a random variation in the number of mutation-bearing progeny produced by
each mutation-bearing individual, causing changes in the relative frequency of
the mutant allele in the population. Through a sort of "random walk" process, the
mutant allele may eventually become extinct, or it may become commonplace.
This can be modeled mathematically for an idealized interbreeding population,
on the assumption of constant population size and random mating, as well as
selective neutrality for the mutations. While neither of the first two assumptions
is a good description of human population history, study of this idealized case
reveals the general principles in a clear and simple way.
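
The random-walk behavior of such an idealized population can be simulated directly. The sketch below follows a single new neutral mutation in a population of constant size N with random mating and non-overlapping generations, and reports how often it ends up fixed rather than lost. The specific resampling scheme (each gene copy in the next generation drawn at random from the current one), the tiny population size, and the function names are assumptions chosen to keep the example short and fast.

```python
import random


def neutral_allele_fixes(n_individuals, rng):
    """Follow one new neutral mutation until it is lost or fixed.

    Idealized model: constant size, random mating, non-overlapping
    generations. Each of the 2N gene copies in the next generation is
    drawn at random from the current generation.
    """
    total_copies = 2 * n_individuals
    mutant_copies = 1                      # the mutation starts as one copy
    while 0 < mutant_copies < total_copies:
        frequency = mutant_copies / total_copies
        mutant_copies = sum(
            1 for _ in range(total_copies) if rng.random() < frequency
        )
    return mutant_copies == total_copies


def fixed_fraction(n_individuals, trials, seed=0):
    """Fraction of independent new mutations that eventually become fixed."""
    rng = random.Random(seed)
    fixed = sum(neutral_allele_fixes(n_individuals, rng) for _ in range(trials))
    return fixed / trials


N = 10                                     # deliberately tiny, for speed
print(fixed_fraction(N, trials=5000), "vs starting frequency", 1 / (2 * N))
```

Most runs end with the mutation lost within a few generations; the fraction that reach fixation should come out close to the starting allele frequency of 1/(2N).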

When a new neutral mutation occurs in a population of constant size N that
is undergoing random mating, the probability that it will ultimately become fixed
is approximately 1/(2N). This is because there are 2N copies of the gene in the
diploid population, and each of them has an equal chance of becoming the
predominant version in the long run. For those mutations that do become fixed, the
mathematics shows that the average time to fixation is approximately 4N
generations. Detailed analyses of data on human genetic variation have suggested an
ancestral population size of approximately 10,000 at the time when the current
pattern of genetic variation was largely established. With a population that has
reached this size, the probability that a new, selectively neutral mutation would
become fixed is small (1/20,000), while the average time to fixation would be on
the order of 800,000 years (assuming a 20-year generation time). Thus, while we
know that the human population has grown enormously since the development
of agriculture approximately 15,000 years ago, most of the present-day set of
common human genetic variants reflects the mixture of variants that was already
present long before this time, when the human population was still small.
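
The arithmetic behind these figures is short enough to spell out. The sketch below simply applies the two approximations quoted above, 1/(2N) for the fixation probability and 4N generations for the mean time to fixation, to a population of 10,000 with a 20-year generation time; the function name is hypothetical.

```python
def neutral_fixation_summary(n_individuals, generation_years):
    """Fixation probability and mean fixation time for a new neutral
    mutation in an idealized diploid population of constant size N."""
    probability = 1 / (2 * n_individuals)      # one of 2N gene copies
    generations = 4 * n_individuals            # mean time, if it does fix
    return probability, generations, generations * generation_years


prob, gens, years = neutral_fixation_summary(10_000, 20)
print(f"P(fixation) = 1/{round(1 / prob):,}")
print(f"mean time to fixation = {gens:,} generations = {years:,} years")
```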

Similar arguments explain another phenomenon with important practical
implications for genetic counseling. In an isolated community descended from
a small group of founders, such as the people of Iceland or the Jews of Eastern
Figure 4-78 How founder effects
determine the set of genetic variants in
a population of individuals belonging to
the same species. This example illustrates
how a rare allele (red) can become
established in an isolated population,
even though the mutation that produced
it has no selective advantage—or is mildly
deleterious.


Europe, genetic variants that are rare in the human population as a whole can
often be present at a high frequency, even if those variants are mildly deleterious
(Figure 4-78).


A Great Deal Can Be Learned from Analyses of the Variation 
Among Humans 

Even though the common variant gene alleles among modern humans originate
from variants present in a comparatively tiny group of ancestors, the total number
of variants now encountered, including those that are individually rare, is very
large. New neutral mutations are constantly occurring and accumulating, even
though no single one of them has had enough time to become fixed in the vast
modern human population.

From detailed comparisons of the DNA sequences of a large number of
modern humans located around the globe, scientists can estimate how many
generations have elapsed since the origin of a particular neutral mutation. From such
data, it has been possible to map the routes of ancient human migrations. For
example, by combining this type of genetic analysis with archaeological findings,
scientists have been able to deduce the most probable routes that our ancestors
took when they left Africa 60,000 to 80,000 years ago (Figure 4-79).

We have been focusing on mutations that affect a single gene, but these are not 
the only source of variation. Another source, perhaps even more important but 
missed for many years, lies in the many duplications and deletions of large blocks 
of human DNA. When one compares any individual human with the standard 
reference genome in the database, one will generally find roughly 100 differences 
involving gain or loss of long sequence blocks, totaling perhaps 3 million nucleotide 
pairs. Some of these copy number variations (CNVs) will be very common, 
presumably reflecting relatively ancient origins, while others will be present in 
only a small minority of people (Figure 4-80). On average, nearly half of the CNVs 
contain known genes. CNVs have been implicated in many human traits, including 
color blindness, infertility, hypertension, and a wide variety of disease 
susceptibilities. In retrospect, this type of variation is not surprising, given the 
prominent role of DNA addition and DNA loss in vertebrate evolution. 

Figure 4-79 Tracing the course of 
human history by analyses of genome 
sequences. The map shows the 
routes of the earliest successful human 
migrations. Dotted lines indicate two 
alternative routes that our ancestors are 
thought to have taken out of Africa. DNA 
sequence comparisons suggest that 
modern Europeans descended from a 
small ancestral population that existed 
about 30,000 to 50,000 years ago. 
In agreement, archaeological findings 
suggest that the ancestors of modern 
native Australians (solid red arrows), and 
of modern European and Middle Eastern 
populations, reached their destinations 
about 45,000 years ago. Even more recent 
studies, comparing the genome sequences 
of living humans with those of Neanderthals 
and another extinct population from 
Southern Siberia (the Denisovans), suggest 
that our exit from Africa was a bit more 
convoluted, while also revealing that a 
number of our ancestors interbred with 
these hominid neighbors as they made 
their way across the globe. (Modified from 
P. Forster and S. Matsumura, Science 
308:965-966, 2005.) 

Figure 4-80 Detection of copy number 
variations on human chromosome 17. 
When 100 individuals were tested by a 
DNA microarray analysis that detects 
the copy number of DNA sequences 
throughout the entire length of this 
chromosome, the indicated distributions 
of DNA additions (green bars) and DNA 
losses (red bars) were observed compared 
with an arbitrary human reference 
sequence. The shortest red and green bars 
represent a single occurrence among the 
200 chromosomes examined, whereas the 
longer bars indicate that the addition or 
loss was correspondingly more frequent. 
The results show preferred regions where 
the variations occur, and these tend to 
be in or near regions that already contain 
blocks of segmental duplications. Many 
of the changes include known genes. 
(Adapted from J.L. Freeman et al., Genome 
Res. 16:949-961, 2006. With permission 
from Cold Spring Harbor Laboratory Press.) 

The intraspecies variations that have been most extensively characterized, 
however, are single-nucleotide polymorphisms (SNPs). These are simply points 
in the genome sequence where one large fraction of the human population has 
one nucleotide, while another substantial fraction has another. To qualify as 
a polymorphism, the variants must be common enough to give a reasonably 
high probability that the genomes of two randomly chosen individuals will differ 
at the given site; a probability of 1% is commonly chosen as the cutoff. Two 
human genomes sampled from the modern world population at random will differ 
at approximately 2.5 × 10^6 such sites (1 per 1300 nucleotide pairs). As will be 
described in the overview of genetics in Chapter 8, SNPs in the human genome 
can be extremely useful for genetic mapping analyses, in which one attempts to 
associate specific traits (phenotypes) with specific DNA sequences for medical or 
scientific purposes (see p. 493). But while these SNPs are useful as genetic markers, 
there is good evidence that most of them have little or no effect on human fitness. This 
is as expected, since deleterious variants will have been selected against during 
human evolution and, unlike SNPs, should therefore be rare. 
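
The SNP density quoted in this paragraph can be checked with one line of arithmetic. The sketch below assumes a haploid genome of about 3.2 × 10^9 nucleotide pairs, a round figure that is not stated in this paragraph, so the result should be read as approximate.

```python
HAPLOID_GENOME_BP = 3.2e9   # assumption: approximate haploid human genome size
BP_PER_DIFFERENCE = 1300    # one differing site per ~1300 nucleotide pairs (from the text)

# Expected number of sites at which two randomly chosen genomes differ
expected_differences = HAPLOID_GENOME_BP / BP_PER_DIFFERENCE
print(f"Expected differing sites: {expected_differences:.2e}")
```

The result is close to 2.5 million sites, in line with the value given above.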

Against the background of ordinary SNPs inherited from our prehistoric 
ancestors, certain sequences with exceptionally high mutation rates stand out. A 
dramatic example is provided by CA repeats, which are ubiquitous in the human 
genome and in the genomes of other eukaryotes. Sequences with the motif (CA)n 
are replicated with relatively low fidelity because of a slippage that occurs between 
the template and the newly synthesized strands during DNA replication; hence, 
the precise value of n can vary over a considerable range from one genome to the 
next. These repeats make ideal DNA-based genetic markers, since most humans 
are heterozygous, having inherited one repeat length (n) from their mother and a 
different repeat length from their father. While the value of n changes sufficiently 
rarely that most parent-child transmissions propagate CA repeats faithfully, the 
changes are sufficiently frequent to maintain high levels of heterozygosity in the 
human population. These and some other simple repeats that display exceptionally 
high variability therefore provide the basis for identifying individuals by DNA 
analysis in crime investigations, paternity suits, and other forensic applications 
(see Figure 8-39). 

While most of the SNPs and CNVs in the human genome sequence are thought 
to have little or no effect on phenotype, a subset of the genome sequence variations 
must be responsible for the heritable aspects of human individuality. We 
know that even a single nucleotide change that alters one amino acid in a protein 
can cause a serious disease, as for example in sickle-cell anemia, which is caused 
by such a mutation in hemoglobin (Movie 4.3). We also know that gene dosage (a 
doubling or halving of the copy number of some genes) can have a profound 
effect on development by altering the level of gene product, as can changes in 
regulatory DNA sequences. There is therefore every reason to suppose that some 
of the many differences between any two human beings will have substantial 
effects on human health, physiology, behavior, and physique. A major challenge 
in human genetics is to recognize those relatively few variations that are functionally 
important against a large background of variation that is neutral and of no 
consequence. 

Summary 

Comparisons of the nucleotide sequences of present-day genomes have revolutionized 
our understanding of gene and genome evolution. Because of the extremely 
high fidelity of DNA replication and DNA repair processes, random errors in maintaining 
the nucleotide sequences in genomes occur so rarely that only about one 
nucleotide in a thousand is altered in every million years in any particular eukaryotic 
line of descent. Not surprisingly, therefore, a comparison of human and chimpanzee 
chromosomes, which are separated by about 6 million years of evolution, 
reveals very few changes. Not only are our genes essentially the same, but their order 
on each chromosome is almost identical. Although a substantial number of segmental 
duplications and segmental deletions have occurred in the past 6 million 
years, even the positions of the transposable elements that make up a major portion 
of our noncoding DNA are mostly unchanged. 

When one compares the genomes of two more distantly related organisms, such 
as a human and a mouse, separated by about 80 million years, one finds many 
more changes. Now the effects of natural selection can be clearly seen: through purifying 
selection, essential nucleotide sequences, both in regulatory regions and in 
coding sequences (exons), have been highly conserved. In contrast, nonessential 
sequences (for example, much of the DNA in introns) have been altered to such an 
extent that one can no longer see any family resemblance. 

Because of purifying selection, the comparison of the genome sequences of 
multiple related species is an especially powerful way to find DNA sequences with 
important functions. Although about 5% of the human genome has been conserved 
as a result of purifying selection, the function of the majority of this DNA (tens of 
thousands of multispecies conserved sequences) remains mysterious. Future experiments 
characterizing its functions should teach us many new lessons about vertebrate 
biology. 

Other sequence comparisons show that a great deal of the genetic complexity of 
present-day organisms is due to the expansion of ancient gene families. DNA duplication 
followed by sequence divergence has clearly been a major source of genetic 
novelty during evolution. On a more recent time scale, the genomes of any two 
humans will differ from each other both because of nucleotide substitutions 
(single-nucleotide polymorphisms, or SNPs) and because of inherited DNA gains and 
DNA losses that cause copy number variations (CNVs). Understanding the effects 
of these differences will improve both medicine and our understanding of human 
biology. 


WHAT WE DON'T KNOW 


• How many different types of 
chromatin structure are important for 
cells? How is each of these structures 
established and maintained, and 
which ones tend to be inherited 
following DNA replication? 

• Why are there so many different 
chromatin remodeling complexes in 
cells? What are their essential roles, 
and how do they get loaded onto 
chromatin at specific places and at 
specific times? 

• How do chromosomal loops form 
during interphase, and what happens 
to these loops in condensed mitotic 
chromosomes? 

• What genetic changes made 
us uniquely human? What further 
aspects of our recent evolutionary 
development can be reconstructed 
by sequencing DNA from remains of 
ancient hominids? 

• How much of the enormous 
complexity that we find in cell biology 
is unnecessary, having evolved by 
random drift? 


PROBLEMS 

Which statements are true? Explain why or why not. 

4-1 Human females have 23 different chromosomes, 
whereas human males have 24. 

4-2 The four core histones are relatively small proteins 
with a very high proportion of positively charged amino 
acids; the positive charge helps the histones bind tightly to 
DNA, regardless of its nucleotide sequence. 

4-3 Nucleosomes bind DNA so tightly that they cannot 
move from the positions where they are first assembled. 

4-4 In a comparison between the DNAs of related 
organisms such as humans and mice, identifying the conserved 
DNA sequences facilitates the search for functionally 
important regions. 

4-5 Gene duplication and divergence is thought to 
have played a critical role in the evolution of increased 
biological complexity. 

Discuss the following problems. 

4-6 DNA isolated from the bacterial virus M13 contains 
25% A, 33% T, 22% C, and 20% G. Do these results 
strike you as peculiar? Why or why not? How might you 
explain these values? 

Figure Q4-1 Three nucleotides from the interior 
of a single strand of DNA (Problem 4-7). Arrows 
at the ends of the DNA strand indicate that the 
structure continues in both directions. 

4-7 A segment of DNA from the 
interior of a single strand is shown in 
Figure Q4-1. What is the polarity of this 
DNA from top to bottom? 

4-8 Human DNA contains 20% C 
on a molar basis. What are the mole 
percents of A, G, and T? 

4-9 Chromosome 3 in orangutans 
differs from chromosome 3 in humans 
by two inversion events that occurred 
in the human lineage (Figure Q4-2). 
Draw the intermediate chromosome 
that resulted from the first inversion 
and explicitly indicate the segments 
included in each inversion. 

Figure Q4-2 Chromosome 
3 in orangutans and humans 
(Problem 4-9). Differently colored 
blocks indicate segments of the 
chromosomes that are homologous 
in DNA sequence. 


4-10 Assuming that the 30-nm chromatin fiber contains 
about 20 nucleosomes (200 bp/nucleosome) per 50 
nm of length, calculate the degree of compaction of DNA 
associated with this level of chromatin structure. What 
fraction of the 10,000-fold condensation that occurs at 
mitosis does this level of DNA packing represent? 

4-11 In contrast to histone acetylation, which always 
correlates with gene activation, histone methylation can 
lead to either transcriptional activation or repression. How 
do you suppose that the same modification (methylation) 
can mediate different biological outcomes? 

4-12 Why is a chromosome with two centromeres (a 
dicentric chromosome) unstable? Would a backup centromere 
not be a good thing for a chromosome, giving it 
two chances to form a kinetochore and attach to microtubules 
during mitosis? Would that not help to ensure that 
the chromosome did not get left behind at mitosis? 

4-13 Look at the two yeast colonies in Figure Q4-3. Each 
of these colonies contains about 100,000 cells descended 
from a single yeast cell, originally somewhere in the middle 
of the clump. A white colony arises when the Ade2 gene 
is expressed from its normal chromosomal location. When 
the Ade2 gene is moved to a location near a telomere, it 
is packed into heterochromatin and inactivated in most 
cells, giving rise to colonies that are mostly red. In these 
largely red colonies, white sectors fan out from the middle 
of the colony. In both the red and white sectors, the Ade2 
gene is still located near telomeres. Explain why white sectors 
have formed near the rim of the red colony. Based on 
the patterns observed, what can you conclude about the 
propagation of the transcriptional state of the Ade2 gene 
from mother to daughter cells in this experiment? 

Figure Q4-3 Position effect on expression of the yeast Ade2 gene 
(Problem 4-13). The Ade2 gene codes for one of the enzymes of 
adenosine biosynthesis, and the absence of the Ade2 gene product 
leads to the accumulation of a red pigment. Therefore a colony of cells 
that express Ade2 is white, and one composed of cells in which the 
Ade2 gene is not expressed is red. 

4-14 Mobile pieces of DNA (transposable elements) 
that insert themselves into chromosomes and accumulate 
during evolution make up more than 40% of the human 
genome. Transposable elements of four types, namely long 
interspersed nuclear elements (LINEs), short interspersed 
nuclear elements (SINEs), long terminal repeat (LTR) 
retrotransposons, and DNA transposons, are inserted 
more-or-less randomly throughout the human genome. 
These elements are conspicuously rare at the four homeobox 
gene clusters, HoxA, HoxB, HoxC, and HoxD, as illustrated 
for HoxD in Figure Q4-4, along with an equivalent 
region of chromosome 22, which lacks a Hox cluster. Each 
Hox cluster is about 100 kb in length and contains 9 to 11 
genes, whose differential expression along the anteroposterior 
axis of the developing embryo establishes the basic 
body plan for humans (and for other animals). Why do you 
suppose that transposable elements are so rare in the Hox 
clusters? 


chromosome 22 



100 kb HoxD cluster 


Figure Q4-4 Transposable elements and genes in 1-Mb regions of 
chromosomes 2 and 22 (Problem 4-14). Blue lines that project upward 
indicate exons of known genes. Red lines that project downward 
indicate transposable elements; they are so numerous (constituting more 
than 40% of the human genome) that they merge into nearly a solid 
block outside the Hox clusters. (Adapted from E. Lander et al., Nature 
409:860-921, 2001. With permission from Macmillan Publishers Ltd.) 


CHAPTER 5 

DNA Replication, Repair, 
and Recombination 


The ability of cells to maintain a high degree of order in a chaotic universe depends 
upon the accurate duplication of vast quantities of genetic information carried in 
chemical form as DNA. This process, called DNA replication, must occur before 
a cell can produce two genetically identical daughter cells. Maintaining order 
also requires the continued surveillance and repair of this genetic information, 
because DNA inside cells is repeatedly damaged by chemicals and radiation from 
the environment, as well as by thermal accidents and reactive molecules generated 
inside the cell. In this chapter, we describe the protein machines that replicate 
and repair the cell's DNA. These machines catalyze some of the most rapid 
and accurate processes that take place within cells, and their mechanisms illustrate 
the elegance and efficiency of cell chemistry. 

While the short-term survival of a cell can depend on preventing changes 
in its DNA, the long-term survival of a species requires that DNA sequences be 
changeable over many generations to permit evolutionary adaptation to changing 
circumstances. We shall see that despite the great efforts that cells make to protect 
their DNA, occasional changes in DNA sequences do occur. Over time, these 
changes provide the genetic variation upon which selection pressures act during 
the evolution of organisms. 

We begin this chapter with a brief discussion of the changes that occur in 
DNA as it is passed down from generation to generation. Next, we discuss the cell 
mechanisms (DNA replication and DNA repair) that are responsible for minimizing 
these changes. Finally, we consider some of the most intriguing pathways 
that alter DNA sequences, in particular those of DNA recombination, including 
the movement within chromosomes of special DNA sequences called transposable 
elements. 


IN THIS CHAPTER 

THE MAINTENANCE OF DNA SEQUENCES 

DNA REPLICATION MECHANISMS 

THE INITIATION AND COMPLETION OF DNA REPLICATION IN CHROMOSOMES 

DNA REPAIR 

HOMOLOGOUS RECOMBINATION 

TRANSPOSITION AND CONSERVATIVE SITE-SPECIFIC RECOMBINATION 


THE MAINTENANCE OF DNA SEQUENCES 

Although, as just pointed out, occasional genetic changes enhance the long-term 
survival of a species through evolution, the survival of the individual demands a 
high degree of genetic stability. Only rarely do the cell's DNA-maintenance processes 
fail, resulting in permanent change in the DNA. Such a change is called a 
mutation, and it can destroy an organism if it occurs in a vital position in the DNA 
sequence. 

Mutation Rates Are Extremely Low 

The mutation rate, the rate at which changes occur in DNA sequences, can be 
determined directly from experiments carried out with a bacterium such as Escherichia 
coli, a resident of our intestinal tract and a commonly used laboratory 
organism (see Figure 1-24). Under laboratory conditions, E. coli divides about 
once every 30 minutes, and a single cell can generate a very large population 
(several billion) in less than a day. In such a population, it is possible to detect the 
small fraction of bacteria that have suffered a damaging mutation in a particular 
gene, if that gene is not required for the bacterium's survival. For example, the 
mutation rate of a gene specifically required for cells to use the sugar lactose as an 
energy source can be determined by growing the cells in the presence of a different 
sugar, such as glucose, and testing them subsequently to see how many have lost 
the ability to survive on a lactose diet. The fraction of damaged genes underestimates 
the actual mutation rate because many mutations are silent (for example, 
those that change a codon but not the amino acid it specifies, or those that 
change an amino acid without affecting the activity of the protein coded for by the 
gene). After correcting for these silent mutations, one finds that a single gene that 
encodes an average-sized protein (~10^3 coding nucleotide pairs) accumulates a 
mutation (not necessarily one that would inactivate the protein) approximately 
once in 10^6 bacterial cell generations. Stated differently, bacteria display 
a mutation rate of about three nucleotide changes per 10^10 nucleotides per cell 
generation. 

Recently, it has become possible to measure the germ-line mutation rate 
directly in more complex, sexually reproducing organisms such as humans. In 
this case, the complete genomes from a family (parents and offspring) were 
directly sequenced, and a careful comparison revealed that approximately 70 new 
single-nucleotide mutations arose in the germ lines of each offspring. Normalized 
to the size of the human genome, the mutation rate is one nucleotide change 
per 10^8 nucleotides per human generation. This is a slight underestimate because 
some mutations will be lethal and will therefore be absent from progeny; however, 
because relatively little of the human genome carries critical information, this 
consideration has only a small effect on the true mutation rate. It is estimated that 
approximately 100 cell divisions occur in the germ line from the time of conception 
to the time of production of the eggs and sperm that go on to make the next 
generation. Thus, the human mutation rate, expressed in terms of cell divisions 
(instead of human generations), is approximately 1 mutation/10^10 nucleotides/ 
cell division. 
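
The normalization described above can be followed step by step. This is a rough sketch: it assumes that the roughly 70 new mutations are spread over a diploid genome of about 2 × 3.2 × 10^9 nucleotide pairs (the diploid size is an assumption, not a figure given in this paragraph), and the constant names are illustrative.

```python
DIPLOID_GENOME_BP = 2 * 3.2e9   # assumption: two copies of a ~3.2e9-bp haploid genome
NEW_MUTATIONS_PER_CHILD = 70    # from the family-sequencing result quoted above
GERMLINE_DIVISIONS = 100        # approximate germ-line cell divisions per generation

# Mutations per nucleotide per human generation
per_generation = NEW_MUTATIONS_PER_CHILD / DIPLOID_GENOME_BP
# Mutations per nucleotide per germ-line cell division
per_division = per_generation / GERMLINE_DIVISIONS

print(f"per generation:    {per_generation:.1e}")
print(f"per cell division: {per_division:.1e}")
```

Both results land at the orders of magnitude given in the text: about 10^-8 per nucleotide per generation and about 10^-10 per nucleotide per cell division.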

Although E. coli and humans differ greatly in their modes of reproduction and 
in their generation times, when the mutation rates of each are normalized to a 
single round of DNA replication, they are both extremely low and within a factor 
of three of each other. We shall see later in the chapter that the basic mechanisms 
that ensure these low rates of mutation have been conserved since the very early 
history of cells on Earth. 

Low Mutation Rates Are Necessary for Life as We Know It 

Since many mutations are deleterious, no species can afford to allow them to 
accumulate at a high rate in its germ cells. Although the observed mutation frequency 
is low, it is nevertheless thought to limit the number of essential proteins 
that any organism can depend upon to perhaps 30,000. More than this, and the 
probability that at least one critical component will suffer a damaging mutation 
becomes catastrophically high. By an extension of the same argument, a mutation 
frequency tenfold higher would limit an organism to about 3000 essential genes. 
In this case, evolution would have been limited to organisms considerably less 
complex than a fruit fly. 

The cells of a sexually reproducing animal or plant are of two types: germ cells 
and somatic cells. The germ cells transmit genetic information from parent to offspring; 
the somatic cells form the body of the organism (Figure 5-1). We have 
seen that germ cells must be protected against high rates of mutation to maintain 
the species. However, the somatic cells of multicellular organisms must also be 
protected from genetic change to properly maintain the organized structure of the 
body. Nucleotide changes in somatic cells can give rise to variant cells, some of 
which, through "local" natural selection, proliferate rapidly at the expense of the 
rest of the organism. In an extreme case, the result is the uncontrolled cell proliferation 
that we know as cancer, a disease that causes (in Europe and North America) 
more than 20% of human deaths each year. These deaths are due largely to 
an accumulation of changes in the DNA sequences of somatic cells, as discussed 
in Chapter 20. A significant increase in the mutation frequency would presumably 
cause a disastrous increase in the incidence of cancer by accelerating the rate 
at which somatic-cell variants arise. Thus, both for the perpetuation of a species 
with a large number of genes (germ-cell stability) and for the prevention of cancer 
resulting from mutations in somatic cells (somatic-cell stability), multicellular 
organisms like ourselves depend on the remarkably high fidelity with which their 
DNA sequences are replicated and maintained. 

Figure 5-1 Germ-line cells and somatic 
cells carry out fundamentally different 
functions. In sexually reproducing 
organisms, the germ-line cells (red) 
propagate genetic information into the next 
generation. Somatic cells (blue), which form 
the body of the organism, are necessary 
for the survival of germ-line cells but do not 
themselves leave any progeny. 

Summary 

In all cells, DNA sequences are maintained and replicated with high fidelity. The 
mutation rate, approximately one nucleotide change per 10^10 nucleotides each time 
the DNA is replicated, is roughly the same for organisms as different as bacteria and 
humans. Because of this remarkable accuracy, the sequence of the human genome 
(approximately 3.2 × 10^9 nucleotide pairs) is unchanged or changed by only a few 
nucleotides each time a typical human cell divides. This allows most humans to 
pass accurate genetic instructions from one generation to the next, and also to avoid 
the changes in somatic cells that lead to cancer. 

DNA REPLICATION MECHANISMS 

All organisms duplicate their DNA with extraordinary accuracy before each cell 
division. In this section, we explore how an elaborate "replication machine" 
achieves this accuracy, while duplicating DNA at rates as high as 1000 nucleotides 
per second. 

Base-Pairing Underlies DNA Replication and DNA Repair 

As introduced in Chapter 1, DNA templating is the mechanism the cell uses to copy 
the nucleotide sequence of one DNA strand into a complementary DNA sequence 
(Figure 5-2). This process requires the separation of the DNA helix into two template 
strands, and entails the recognition of each nucleotide in the DNA template 
strands by a free (unpolymerized) complementary nucleotide. The separation of 
the DNA helix exposes the hydrogen-bond donor and acceptor groups on each 
DNA base for base-pairing with the appropriate incoming free nucleotide, aligning 
it for its enzyme-catalyzed polymerization into a new DNA chain. 

Figure 5-2 The DNA double helix acts 
as a template for its own duplication. 
Because the nucleotide A will pair 
successfully only with T, and G only with 
C, each strand of DNA can serve as 
a template to specify the sequence of 
nucleotides in its complementary strand by 
DNA base-pairing. In this way, a 
double-helical DNA molecule can be copied 
precisely. 
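
The templating rule (A pairs with T, G pairs with C) is simple enough to express as a lookup table. The following is a minimal sketch, not a model of the enzymatic reaction; the function name and the example sequence are hypothetical. Because the two strands of a double helix are antiparallel, the new strand is reversed so that both sequences are written 5'-to-3'.

```python
# Watson-Crick pairing rule: A pairs with T, and G pairs with C
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def complementary_strand(template_5to3: str) -> str:
    """Return the strand specified by the template, written 5'-to-3'."""
    paired = [COMPLEMENT[base] for base in template_5to3.upper()]
    # The strands are antiparallel, so reverse to read the new strand 5'-to-3'.
    return "".join(reversed(paired))

template = "ATGGCA"  # hypothetical example sequence
new_strand = complementary_strand(template)
print(new_strand)                        # TGCCAT
print(complementary_strand(new_strand))  # ATGGCA
```

Applying the function twice returns the original sequence, which mirrors the point that each strand carries enough information to specify the other.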

The first nucleotide-polymerizing enzyme, DNA polymerase, was discovered 
in 1957. The free nucleotides that serve as substrates for this enzyme were found 
to be deoxyribonucleoside triphosphates, and their polymerization into DNA 
required a single-strand DNA template. Figure 5-3 and Figure 5-4 illustrate the 
stepwise mechanism of this reaction. 

The DNA Replication Fork Is Asymmetrical 

During DNA replication inside a cell, each of the two original DNA strands serves 
as a template for the formation of an entire new strand. Because each of the two 
daughters of a dividing cell inherits a new DNA double helix containing one original 
and one new strand (Figure 5-5), the DNA double helix is said to be replicated 
"semiconservatively." How is this feat accomplished? 

Figure 5-3 The chemistry of DNA synthesis. The addition of a deoxyribonucleotide to the 
3' end of a polynucleotide chain (the primer strand) is the fundamental reaction by which DNA is 
synthesized. As shown, base-pairing between an incoming deoxyribonucleoside triphosphate and 
an existing strand of DNA (the template strand) guides the formation of the new strand of DNA 
and causes it to have a complementary nucleotide sequence. The way in which complementary 
nucleotides base-pair is shown in Figure 4-4. 

Figure 5-4 DNA synthesis catalyzed by DNA polymerase. (A) DNA polymerase catalyzes the stepwise addition of a 
deoxyribonucleotide to the 3'-OH end of a polynucleotide chain, the growing primer strand that is paired to an existing template 
strand. The newly synthesized DNA strand therefore polymerizes in the 5'-to-3' direction as shown also in the previous figure. 
Because each incoming deoxyribonucleoside triphosphate must pair with the template strand to be recognized by the DNA 
polymerase, this strand determines which of the four possible deoxyribonucleotides (A, C, G, or T) will be added. The reaction 
is driven by a large, favorable free-energy change, caused by the release of pyrophosphate and its subsequent hydrolysis 
to two molecules of inorganic phosphate. (B) Structure of DNA polymerase complexed with DNA (orange), as determined 
by x-ray crystallography (Movie 5.1). The template DNA strand is the longer strand and the newly synthesized DNA is the 
shorter. (C) Schematic diagram of DNA polymerase, based on the structure in (B). The proper base-pair geometry of a correct 
incoming deoxyribonucleoside triphosphate causes the polymerase to tighten around the base pair, thereby initiating the 
nucleotide addition reaction (middle diagram in (C)). Dissociation of pyrophosphate relaxes the polymerase, allowing translocation 
of the DNA by one nucleotide so the active site of the polymerase is ready to receive the next deoxyribonucleoside triphosphate. 


Analyses carried out in the early 1960s on whole replicating chromosomes 
revealed a localized region of replication that moves progressively along the 
parental DNA double helix. Because of its Y-shaped structure, this active region 
is called a replication fork (Figure 5-6). At the replication fork, a multienzyme 
complex that contains the DNA polymerase synthesizes the DNA of both new 
daughter strands. 

Initially, the simplest mechanism of DNA replication seemed to be the continuous 
growth of both new strands, nucleotide by nucleotide, at the replication 
fork as it moves from one end of a DNA molecule to the other. But because of 
the antiparallel orientation of the two DNA strands in the DNA double helix (see 
Figure 5-2), this mechanism would require one daughter strand to polymerize 
in the 5'-to-3' direction and the other in the 3'-to-5' direction. Such a replication 
fork would require two distinct types of DNA polymerase enzymes. However, as 
attractive as this model might be, the DNA polymerases at replication forks can 
synthesize only in the 5'-to-3' direction. 

Figure 5-5 The semiconservative nature of DNA replication. In a round 
of replication, each of the two strands of DNA is used as a template for the 
formation of a complementary DNA strand. The original strands therefore 
remain intact through many cell generations. 

How, then, can a DNA strand grow in the 3'-to-5' direction? The answer 
was first suggested by the results of an experiment performed in the late 1960s. 
Researchers added highly radioactive 3H-thymidine to dividing bacteria for a 
few seconds, so that only the most recently replicated DNA (that just behind the 
replication fork) became radiolabeled. This experiment revealed the transient 
existence of pieces of DNA that were 1000-2000 nucleotides long, now commonly 
known as Okazaki fragments, at the growing replication fork. (Similar replication 
intermediates were later found in eukaryotes, where they are only 100-200 nucleotides 
long.) The Okazaki fragments were shown to be polymerized only in the 
5'-to-3' chain direction and to be joined together after their synthesis to create 
long DNA chains. 

A replication fork therefore has an asymmetric structure (Figure 5-7). 
The DNA daughter strand that is synthesized continuously is known as the 
leading strand. Its synthesis slightly precedes the synthesis of the daughter strand 
that is synthesized discontinuously, known as the lagging strand. For the lagging 
strand, the direction of nucleotide polymerization is opposite to the overall 
direction of DNA chain growth. The synthesis of this strand by a discontinuous 
“backstitching” mechanism means that DNA replication requires only the 5'-to-3' type 
of DNA polymerase. 
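
The backstitching idea can be illustrated with a toy model. The sketch below is not taken from the experiments described here: the function names and the fixed fragment length are assumptions made for illustration, and real Okazaki fragments vary in length and begin with RNA primers. It copies a lagging-strand template in short pieces, each written in the 5'-to-3' direction, and then joins the pieces into one chain.

```python
# Toy model of discontinuous ("backstitching") synthesis on the lagging strand.
# All names and the fixed fragment length are illustrative assumptions.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5'-to-3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def lagging_fragments(template_5to3: str, fragment_len: int) -> list[str]:
    """Copy the template in windows, in the order the fork exposes them.

    The template is written 5'-to-3' in the direction of fork movement,
    so each window must be copied "backwards" relative to the fork.
    Every returned fragment is written 5'-to-3'.
    """
    fragments = []
    for start in range(0, len(template_5to3), fragment_len):
        window = template_5to3[start:start + fragment_len]
        fragments.append(reverse_complement(window))
    return fragments


def ligate(fragments: list[str]) -> str:
    """Join the 3' end of each newer fragment to the 5' end of the previous one."""
    return "".join(reversed(fragments))


if __name__ == "__main__":
    template = "AAGCTTGG"
    pieces = lagging_fragments(template, 4)
    print(pieces)          # ['GCTT', 'CCAA'] (oldest fragment first)
    print(ligate(pieces))  # CCAAGCTT
```

In this model the joined product equals the reverse complement of the whole template, even though no piece was ever extended in the 3'-to-5' direction.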

The High Fidelity of DNA Replication Requires Several 
Proofreading Mechanisms 

As discussed above, the fidelity of copying DNA during replication is such that only 
about one mistake occurs for every 10^10 nucleotides copied. This fidelity is much 
higher than one would expect from the accuracy of complementary base-pairing. 
The standard complementary base pairs (see Figure 4-4) are not the only ones 
possible. For example, with small changes in helix geometry, two hydrogen bonds 
can form between G and T in DNA. In addition, rare tautomeric forms of the four 
DNA bases occur transiently in ratios of 1 part to 10^4 or 10^5. These forms mispair 
without a change in helix geometry: the rare tautomeric form of C pairs with A 
instead of G, for example. 

If the DNA polymerase did nothing special when a mispairing occurred 
between an incoming deoxyribonucleoside triphosphate and the DNA template, 
the wrong nucleotide would often be incorporated into the new DNA chain, 
producing frequent mutations. The high fidelity of DNA replication, however, 
depends not only on the initial base-pairing but also on several “proofreading” 
mechanisms that act sequentially to correct any initial mispairings that might 
have occurred. 

Figure 5-6 Two replication forks moving in opposite directions on a circular 
chromosome. An active zone of DNA replication moves progressively along 
a replicating DNA molecule, creating a Y-shaped DNA structure known as a 
replication fork: the two arms of each Y are the two daughter DNA molecules, 
and the stem of the Y is the parental DNA helix. In this diagram, parental strands are 
orange; newly synthesized strands are red. (Micrograph courtesy of Jerome Vinograd.) 

Figure 5-7 The structure of a DNA replication fork. Left, replication fork with newly synthesized 
DNA in red and arrows indicating the 5'-to-3' direction of DNA synthesis. Because both daughter 
DNA strands are polymerized in the 5'-to-3' direction, the DNA synthesized on the lagging strand 
must be made initially as a series of short DNA molecules, called Okazaki fragments, named after 
the scientist who discovered them. Right, the same fork a short time later. On the lagging strand, 
the Okazaki fragments are synthesized sequentially, with those nearest the fork being the most 
recently made. 


DNA polymerase performs the first proofreading step just before a new 
nucleotide is covalently added to the growing chain. Our knowledge of this mechanism 
comes from studies of several different DNA polymerases, including one 
produced by a bacterial virus, T7, that replicates inside E. coli. The correct nucleotide 
has a higher affinity for the moving polymerase than does the incorrect 
nucleotide, because the correct pairing is more energetically favorable. Moreover, after 
nucleotide binding, but before the nucleotide is covalently added to the 
growing chain, the enzyme must undergo a conformational change in which its “grip” 
tightens around the active site (see Figure 5-4). Because this change occurs more 
readily with correct than incorrect base-pairing, it allows the polymerase to 
“double-check” the exact base-pair geometry before it catalyzes the addition of the 
nucleotide. Incorrectly paired nucleotides are harder to add and therefore more 
likely to diffuse away before the polymerase can mistakenly add them. 

The next error-correcting reaction, known as exonucleolytic proofreading, 
takes place immediately after those rare instances in which an incorrect 
nucleotide is covalently added to the growing chain. DNA polymerase enzymes are 
highly discriminating in the types of DNA chains they will elongate: they require 
a previously formed, base-paired 3'-OH end of a primer strand (see Figure 5-4). 
Those DNA molecules with a mismatched (improperly base-paired) nucleotide 
at the 3'-OH end of the primer strand are not effective as templates because the 
polymerase has difficulty extending such a strand. DNA polymerase molecules 
correct such a mismatched primer strand by means of a separate catalytic site 
(either in a separate subunit or in a separate domain of the polymerase molecule, 
depending on the polymerase). This 3'-to-5' proofreading exonuclease clips off any 
unpaired or mispaired residues at the primer terminus, continuing until enough 
nucleotides have been removed to regenerate a correctly base-paired 3'-OH 
terminus that can prime DNA synthesis. In this way, DNA polymerase functions as a 
“self-correcting” enzyme that removes its own polymerization errors as it moves 
along the DNA (Figure 5-8 and Figure 5-9). 
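
The clipping rule just described can be written as a short loop. The following is a minimal sketch under stated assumptions: the function name `proofread` and the string representation of the strands are ours, and the model ignores the kinetics that decide when the enzyme switches between its polymerizing and editing sites. It removes residues from the 3' end of the primer until the terminal residue is correctly paired with the template.

```python
# Toy model of 3'-to-5' exonucleolytic proofreading (illustrative names only).

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def proofread(template_3to5: str, primer_5to3: str) -> str:
    """Trim mispaired residues from the 3' end of the primer strand.

    The two strands are written in register: primer position i pairs with
    template position i. Residues are removed one at a time from the
    3' end until the terminal residue is correctly base-paired.
    """
    primer = list(primer_5to3)
    while primer and COMPLEMENT[template_3to5[len(primer) - 1]] != primer[-1]:
        primer.pop()  # clip the mispaired 3'-terminal nucleotide
    return "".join(primer)


if __name__ == "__main__":
    # The last primer residue (A) is opposite a template G, a mismatch.
    print(proofread("TACGG", "ATGA"))  # ATG
```

Only the primer terminus is examined, which mirrors the requirement for a correctly base-paired 3'-OH end; a mismatch buried farther back in the chain is left for other mechanisms.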

The self-correcting properties of the DNA polymerase depend on its 
requirement for a perfectly base-paired primer terminus, and it is apparently not 
possible for such an enzyme to start synthesis de novo, without an existing primer. 
By contrast, the RNA polymerase enzymes involved in gene transcription do not 
need such an efficient exonucleolytic proofreading mechanism: errors in making 
RNA are not passed on to the next generation, and the occasional defective RNA 
molecule that is produced has no long-term significance. RNA polymerases are 
thus able to start new polynucleotide chains without a primer. 

Figure 5-8 Exonucleolytic proofreading by DNA polymerase during DNA 
replication. In this example, a C is accidentally incorporated at the growing 
3'-OH end of a DNA chain. The part of DNA polymerase that removes the 
misincorporated nucleotide is a specialized member of a large class of 
enzymes, known as exonucleases, that cleave nucleotides one at a time from 
the ends of polynucleotides. 

Figure 5-9 Editing by DNA polymerase. DNA polymerase complexed with the DNA template in 
the polymerizing mode (left) and the editing mode (right). The catalytic sites for the exonucleolytic (E) 
and the polymerization (P) reactions are indicated. In the editing mode, the newly synthesized DNA 
transiently unpairs from the template and enters the editing site where the most recently added 
nucleotide is catalytically removed. 


There is an error frequency of about one mistake for every 10^4 polymerization 
events both in RNA synthesis and in the separate process of translating mRNA 
sequences into protein sequences. This error rate is over 100,000 times greater 
than that in DNA replication, where, as we have seen, a series of proofreading 
processes makes the process unusually accurate (Table 5-1). 
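
The arithmetic behind these figures is easy to check. The sketch below multiplies the three per-step values listed in Table 5-1 and compares the product with the 1-in-10^4 error frequency quoted for RNA synthesis. Treating the steps as independent factors that simply multiply is the simplifying assumption of the table itself; the variable and function names are ours.

```python
# Checking the error-rate arithmetic with exact fractions.
from fractions import Fraction

# Per-step values from Table 5-1. The first is the chance of adding a wrong
# nucleotide; the other two are the chance that an error goes uncorrected.
STEP_ERROR = {
    "5'-to-3' polymerization": Fraction(1, 10**5),
    "3'-to-5' exonucleolytic proofreading": Fraction(1, 10**2),
    "strand-directed mismatch repair": Fraction(1, 10**3),
}

RNA_SYNTHESIS_ERROR = Fraction(1, 10**4)  # quoted in the text


def combined_error(steps: dict[str, Fraction]) -> Fraction:
    """Multiply the per-step factors, assuming the steps act independently."""
    total = Fraction(1)
    for factor in steps.values():
        total *= factor
    return total


if __name__ == "__main__":
    dna_error = combined_error(STEP_ERROR)
    print(dna_error)                        # 1/10000000000
    print(RNA_SYNTHESIS_ERROR / dna_error)  # 1000000
```

The ratio of 10^6 is consistent with the statement that the RNA error rate is over 100,000 times greater than that of DNA replication.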

Only DNA Replication in the 5'-to-3' Direction Allows Efficient Error 
Correction 

The need for accuracy probably explains why DNA replication occurs only in the 
5'-to-3' direction. If there were a DNA polymerase that added deoxyribonucleoside 
triphosphates in the 3'-to-5' direction, the growing 5' end of the chain, rather 
than the incoming mononucleotide, would have to provide the activating 
triphosphate needed for the covalent linkage. In this case, the mistakes in 
polymerization could not be simply hydrolyzed away, because the bare 5' end of the chain 
thus created would immediately terminate DNA synthesis (see Figure 5-3). It is 
therefore possible to correct a mismatched base only if it has been added to the 
3' end of a DNA chain. Although the backstitching mechanism for DNA 
replication seems complex, it preserves the 5'-to-3' direction of polymerization that is 
required for exonucleolytic proofreading. 

Despite these safeguards against DNA replication errors, DNA polymerases 
occasionally make mistakes. However, as we shall see later, cells have yet another 
chance to correct these errors by a process called strand-directed mismatch repair. 
Before discussing this mechanism, however, we describe the other types of 
proteins that function at the replication fork. 

Table 5-1 The Three Steps That Give Rise to High-Fidelity DNA Synthesis 

Replication step                          Errors per nucleotide added 
5'-to-3' polymerization                   1 in 10^5 
3'-to-5' exonucleolytic proofreading      1 in 10^2 
Strand-directed mismatch repair           1 in 10^3 
Combined                                  1 in 10^10 

The third step, strand-directed mismatch repair, is described later in this chapter. For the 
polymerization step, “errors per nucleotide added” describes the probability that an incorrect 
nucleotide will be added to the growing chain. For the other two steps, “errors per nucleotide 
added” describes the probability that an error will not be corrected. Each step therefore reduces 
the chance of a final error by the factor shown. 

Figure 5-10 RNA primer synthesis. A schematic view of the reaction 
catalyzed by DNA primase, the enzyme that synthesizes the short RNA 
primers made on the lagging strand using DNA as a template. Unlike DNA 
polymerase, this enzyme can start a new polynucleotide chain by joining 
two nucleoside triphosphates together. The primase synthesizes a short 
polynucleotide in the 5'-to-3' direction and then stops, making the 
3' end of this primer available for the DNA polymerase. 

A Special Nucleotide-Polymerizing Enzyme Synthesizes Short 
RNA Primer Molecules on the Lagging Strand 

For the leading strand, a primer is needed only at the start of replication: once 
a replication fork is established, the DNA polymerase is continuously presented 
with a base-paired chain end on which to add new nucleotides. On the lagging 
side of the fork, however, each time the DNA polymerase completes a short DNA 
Okazaki fragment (which takes a few seconds), it must start synthesizing a 
completely new fragment at a site further along the template strand (see Figure 5-7). 
A special mechanism produces the base-paired primer strand required by the 
DNA polymerase molecules. The mechanism depends on an enzyme called 
DNA primase, which uses ribonucleoside triphosphates to synthesize short 
RNA primers on the lagging strand (Figure 5-10). In eukaryotes, these primers 
are about 10 nucleotides long and are made at intervals of 100-200 nucleotides on 
the lagging strand. 

The chemical structure of RNA was introduced in Chapter 1 and is described 
in detail in Chapter 6. Here, we note only that RNA is very similar in structure to 
DNA. A strand of RNA can form base pairs with a strand of DNA, generating a 
DNA-RNA hybrid double helix if the two nucleotide sequences are 
complementary. Thus, the same templating principle used for DNA synthesis guides the 
synthesis of RNA primers. Because an RNA primer contains a properly base-paired 
nucleotide with a 3'-OH group at one end, it can be elongated by the DNA 
polymerase at this end to begin an Okazaki fragment. The synthesis of each Okazaki 
fragment ends when this DNA polymerase runs into the RNA primer attached to 
the 5' end of the previous fragment. To produce a continuous DNA chain from the 
many DNA fragments made on the lagging strand, a special DNA repair system 
acts quickly to erase the old RNA primer and replace it with DNA. An enzyme 
called DNA ligase then joins the 3' end of the new DNA fragment to the 5' end of 
the previous one to complete the process (Figure 5-11 and Figure 5-12). 

Why might an erasable RNA primer be preferred to a DNA primer that would 
not need to be erased? The argument that a self-correcting polymerase cannot 
start chains de novo also implies the converse: an enzyme that starts chains anew 
cannot be efficient at self-correction. Thus, any enzyme that primes the synthesis 
of Okazaki fragments will of necessity make a relatively inaccurate copy (at least 
one error in 10^5). Even if the copies retained in the final product constituted as 
little as 5% of the total genome (for example, 10 nucleotides per 200-nucleotide 
DNA fragment), the resulting increase in the overall mutation rate would be 
enormous. It therefore seems likely that the use of RNA rather than DNA for priming 
brings a powerful advantage to the cell: the ribonucleotides in the primer 
automatically mark these sequences as “suspect copy” to be efficiently removed and 
replaced. 
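
A rough calculation shows how large this increase would be. The sketch below uses the numbers quoted in this section (10 primer nucleotides per 200-nucleotide fragment, a primer error rate of 1 in 10^5, and an overall replication error rate of 1 in 10^10). It rests on an assumption of ours: that errors in a retained primer would escape all later correction. It should therefore be read as an upper-bound illustration, not a measured value.

```python
# Estimated cost of keeping primer-made copies in the final DNA.
# Assumption (ours): errors made by the priming enzyme are never corrected.
from fractions import Fraction

PRIMER_LEN = 10                       # nucleotides per primer
FRAGMENT_LEN = 200                    # nucleotides per lagging-strand fragment
PRIMER_ERROR = Fraction(1, 10**5)     # "at least one error in 10^5"
POLYMERASE_ERROR = Fraction(1, 10**10)  # overall fidelity of normal replication


def retained_primer_error_rate() -> Fraction:
    """Average per-nucleotide error rate if primer copies were retained."""
    primer_fraction = Fraction(PRIMER_LEN, FRAGMENT_LEN)  # 5% of the strand
    return (primer_fraction * PRIMER_ERROR
            + (1 - primer_fraction) * POLYMERASE_ERROR)


if __name__ == "__main__":
    rate = retained_primer_error_rate()
    fold_increase = rate / POLYMERASE_ERROR
    print(float(rate))           # about 5e-07 errors per nucleotide
    print(float(fold_increase))  # about 5000-fold higher than 1 in 10^10
```

Under these assumptions, the 5% of the sequence made by the priming enzyme would dominate the total error rate, raising it several thousandfold.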


Figure 5-11 The synthesis of one of many DNA fragments on the 
lagging strand. In eukaryotes, RNA primers are made at intervals spaced 
by about 200 nucleotides on the lagging strand, and each RNA primer is 
approximately 10 nucleotides long. This primer is erased by a special DNA 
repair enzyme (an RNase H) that recognizes an RNA strand in an RNA/DNA 
helix and fragments it; this leaves gaps that are filled in by DNA polymerase 
and DNA ligase. 

Special Proteins Help to Open Up the DNA Double Helix in Front 
of the Replication Fork 

For DNA synthesis to proceed, the DNA double helix must be opened up (“melted”) 
ahead of the replication fork so that the incoming deoxyribonucleoside 
triphosphates can form base pairs with the template strands. However, the DNA double 
helix is very stable under physiological conditions; the base pairs are locked in 
place so strongly that it requires temperatures approaching that of boiling water to 
separate the two strands in a test tube. For this reason, two additional types of 
replication proteins (DNA helicases and single-strand DNA-binding proteins) are 
needed to open the double helix and provide the appropriate single-strand DNA 
templates for the DNA polymerase to copy. 

DNA helicases were first isolated as proteins that hydrolyze ATP when they are 
bound to single strands of DNA. As described in Chapter 3, the hydrolysis of ATP 
can change the shape of a protein molecule in a cyclical manner that allows the 
protein to perform mechanical work. DNA helicases use this principle to propel 
themselves rapidly along a DNA single strand. When they encounter a region of 
double helix, they continue to move along their strand, thereby prying apart the 
helix at rates of up to 1000 nucleotide pairs per second (Figure 5-13 and 
Figure 5-14). 

The two strands of DNA have opposite polarities, and, in principle, a helicase 
could unwind the DNA double helix by moving in the 5'-to-3' direction along one 
strand or in the 3'-to-5' direction along the other. In fact, both types of DNA 
helicase exist. In the best-understood replication systems in bacteria, a helicase 
moving 5' to 3' along the lagging-strand template appears to have the predominant 
role, for reasons that will become clear shortly. 

Single-strand DNA-binding (SSB) proteins, also called helix-destabilizing 
proteins, bind tightly and cooperatively to exposed single-strand DNA without 
covering the bases, which therefore remain available as templates. These proteins 
are unable to open a long DNA helix directly, but they aid helicases by 
stabilizing the unwound, single-strand conformation. In addition, through cooperative 
binding, they coat and straighten out the regions of single-strand DNA, which 
occur routinely in the lagging-strand template, thereby preventing the formation 
of the short hairpin helices that readily form in single-strand DNA (Figure 5-15 
and Figure 5-16). If not removed, these hairpin helices can impede the DNA 
synthesis catalyzed by DNA polymerase. 

A Sliding Ring Holds a Moving DNA Polymerase Onto the DNA 

On their own, most DNA polymerase molecules will synthesize only a short string 
of nucleotides before falling off the DNA template. The tendency to dissociate 
quickly from a DNA molecule allows a DNA polymerase molecule that has just 
finished synthesizing one Okazaki fragment on the lagging strand to be recycled 
quickly, so as to begin the synthesis of the next Okazaki fragment on the same 
strand. This rapid dissociation, however, would make it difficult for the 
polymerase to synthesize the long DNA strands produced at a replication fork were it 
not for an accessory protein (called PCNA in eukaryotes) that functions as a 
regulated sliding clamp. This clamp keeps the polymerase firmly on the DNA when 
it is moving, but releases it as soon as the polymerase runs into a double-strand 
region of DNA. 

Figure 5-12 The reaction catalyzed by DNA ligase. This enzyme seals a broken 
phosphodiester bond. As shown, DNA ligase uses a molecule of ATP to activate 
the 5' end at the nick (step 1) before forming the new bond (step 2). In this way, 
the energetically unfavorable nick-sealing reaction is driven by being coupled to the 
energetically favorable process of ATP hydrolysis. 

Figure 5-13 An assay for DNA helicase enzymes. A short DNA fragment 
is annealed to a long DNA single strand to form a region of DNA double helix. 
The double helix is melted as the helicase runs along the DNA single strand, 
releasing the short DNA fragment in a reaction that requires the presence 
of both the helicase protein and ATP. The rapid stepwise movement of the 
helicase is powered by its ATP hydrolysis (shown schematically in Figure 
3-75A). As indicated, many DNA helicases are composed of six subunits. 

Figure 5-14 The structure of a DNA helicase. (A) Diagram of the protein 
as a hexameric ring drawn to scale with a replication fork. (B) Detailed 
structure of the bacteriophage T7 replicative helicase, as determined by 
x-ray diffraction. Six identical subunits bind and hydrolyze ATP in an ordered 
fashion to propel this molecule, like a rotary engine, along a DNA single 
strand that passes through the central hole. Red indicates bound ATP 
molecules in the structure (Movie 5.2). (PDB code: 1E0J.) 

How can a sliding clamp prevent the polymerase from dissociating without at 
the same time impeding the polymerase's rapid movement along the DNA 
molecule? The three-dimensional structure of the clamp protein, determined by x-ray 
diffraction, revealed it to be a large ring around the DNA double helix. One face 
of the ring binds to the back of the DNA polymerase, and the whole ring slides 
freely along the DNA as the polymerase moves. The assembly of the clamp around 
the DNA requires ATP hydrolysis by a special protein complex, the clamp loader, 
which hydrolyzes ATP as it loads the clamp onto a primer-template junction 
(Figure 5-17). 

On the leading-strand template, the moving DNA polymerase is tightly bound 
to the clamp, and the two remain associated for a very long time. The DNA 
polymerase on the lagging-strand template also makes use of the clamp, but each 
time the polymerase reaches the 5' end of the preceding Okazaki fragment, the 
polymerase releases itself from the clamp and dissociates from the template. This 
polymerase molecule then associates with a new clamp that is assembled on the 
RNA primer of the next Okazaki fragment. 








Figure 5-15 The effect of single-strand DNA-binding proteins (SSB proteins) on the structure 
of single-strand DNA. Because each protein molecule prefers to bind next to a previously 
bound molecule, long rows of this protein form on a DNA single strand. This cooperative binding 
straightens out the DNA template and facilitates the DNA polymerization process. The “hairpin 
helices” shown in the bare, single-strand DNA result from a chance matching of short regions of 
complementary nucleotide sequence; they are similar to the short helices that typically form in RNA 
molecules (see Figure 1-6). 

Figure 5-16 Human single-strand binding protein bound to DNA. (A) Front view of the two 
DNA-binding domains of the protein (called RPA), which cover a total of eight nucleotides. Note 
that the DNA bases remain exposed in this protein-DNA complex. (B) Diagram showing the 
three-dimensional structure, with the DNA strand (orange) viewed end-on. (PDB code: 1JMC.) 


Figure 5-17 The regulated sliding clamp that holds DNA polymerase on the DNA. (A) The 
structure of the clamp protein from E. coli, as determined by x-ray crystallography, with a DNA helix 
added to indicate how the protein fits around DNA (Movie 5.3). (B) Schematic illustration showing 
how the clamp (with red and yellow subunits) is loaded onto DNA to serve as a tether for a moving 
DNA polymerase molecule. The structure of the clamp loader (dark green) resembles a screw nut, 
with its threads matching the grooves of double-stranded DNA. The loader binds to a free clamp 
molecule, forcing a gap in its ring of subunits so that this ring is able to slip around DNA. The clamp 
loader, thanks to its screw-nut structure, recognizes the region of DNA that is double-stranded 
and latches onto it, tightening around the complex of a template strand with a freshly synthesized 
elongating (primer) strand. It carries the clamp along this double-stranded region until it encounters 
the 3' end of the primer, at which point the loader hydrolyzes ATP and releases the clamp, allowing 
it to close around the DNA and bind to DNA polymerase. In the simplified reaction shown here, the 
clamp loader dissociates into solution once the clamp has been assembled. At a true replication 
fork, the clamp loader remains close to the polymerase so that, on the lagging strand, it is ready to 
assemble a new clamp at the start of each new Okazaki fragment (see Figure 5-18). (A, from 
X.P. Kong et al., Cell 69:425-437, 1992. With permission from Elsevier; B, adapted from B.A. Kelch 
et al., Science 334:1675-1680, 2011. With permission from AAAS. PDB code: 3BEP.) 




[Figure 5-17B panel labels, left to right: ATP binding to clamp loader opens sliding clamp; 
DNA engaged in clamp; ATP hydrolysis locks sliding clamp around DNA and releases 
clamp loader; DNA polymerase binds to sliding clamp.] 













The Proteins at a Replication Fork Cooperate to Form a 
Replication Machine 

Although we have discussed DNA replication as though it were performed by a 
mix of proteins all acting independently, in reality most of the proteins are held 
together in a large and orderly multienzyme complex that rapidly synthesizes 
DNA. This complex can be likened to a tiny sewing machine composed of protein 
parts and powered by nucleoside triphosphate hydrolysis. Like a sewing machine, 
the replication complex probably remains stationary with respect to its 
immediate surroundings; the DNA can be thought of as a long strip of cloth being rapidly 
threaded through it. Although the replication complex has been most intensively 
studied in E. coli and several of its viruses, a very similar complex also operates in 
eukaryotes, as we see below. 

Figure 5-18 summarizes the functions of the subunits of the replication 
machine. At the front of the replication fork, DNA helicase opens the DNA helix. 
Two DNA polymerase molecules work at the fork, one on the leading strand and 
one on the lagging strand. Whereas the DNA polymerase molecule on the leading 
strand can operate in a continuous fashion, the DNA polymerase molecule on the 
lagging strand must restart at short intervals, using a short RNA primer made by 
a DNA primase molecule. The close association of all these protein components 
increases the efficiency of replication and is made possible by a folding back of 
the lagging strand as shown in Figure 5-18A. This arrangement also facilitates the 
loading of the polymerase clamp each time that an Okazaki fragment is 
synthesized: the clamp loader and the lagging-strand DNA polymerase molecule are 
kept in place as a part of the protein machine even when they detach from their 
DNA template. The replication proteins are thus linked together into a single large 
unit (total molecular mass >10^6 daltons), enabling DNA to be synthesized on both 
sides of the replication fork in a coordinated and efficient manner. 

Figure 5-18 A bacterial replication fork. (A) This schematic diagram shows a current view of the arrangement of replication proteins at a replication 
fork when DNA is being synthesized. The lagging-strand DNA is folded to bring the lagging-strand DNA polymerase molecule into a complex with the 
leading-strand DNA polymerase molecule. This folding also brings the 3' end of each completed Okazaki fragment close to the start site for the next 
Okazaki fragment. Because the lagging-strand DNA polymerase molecule remains bound to the rest of the replication proteins, it can be reused to 
synthesize successive Okazaki fragments. In this diagram, it is about to let go of its completed DNA fragment and move to the RNA primer that is just 
being synthesized. Additional proteins (not shown) help to hold the different protein components of the fork together, enabling them to function as a 
well-coordinated protein machine (Movie 5.4 and Movie 5.5). (B) An electron micrograph showing the replication machine from the bacteriophage 
T4 as it moves along a template synthesizing DNA behind it. (C) An interpretation of the micrograph is given in the sketch: note especially the 
DNA loop on the lagging strand. Apparently, the replication proteins became partly detached from the very front of the replication fork during the 
preparation of this sample for electron microscopy. (B, courtesy of Jack Griffith; see P.D. Chastain et al., J. Biol. Chem. 278:21276-21285, 2003.) 

On the lagging strand, the DNA replication machine leaves behind a series of 
unsealed Okazaki fragments, which still contain the RNA that primed their 
synthesis at their 5' ends. As discussed earlier, this RNA is removed and the resulting 
gap is filled in by DNA repair enzymes that operate behind the replication fork 
(see Figure 5-11). 

A Strand-Directed Mismatch Repair System Removes Replication 
Errors That Escape from the Replication Machine 

As stated previously, bacteria such as E. coli are capable of dividing once every 30 
minutes, making it relatively easy to screen large populations to find a rare mutant 
cell that is altered in a specific process. One interesting class of mutants consists of 
those with alterations in so-called mutator genes, which greatly increase the rate 
of spontaneous mutation. Not surprisingly, one such mutant makes a defective 
form of the 3'-to-5' proofreading exonuclease that is a part of the DNA polymerase 
enzyme (see Figures 5-8 and 5-9). The mutant DNA polymerase no longer 
proofreads effectively, and many replication errors that would otherwise have been 
removed accumulate in the DNA. 

The study of other E. coli mutants exhibiting abnormally high mutation rates 
has uncovered a proofreading system that removes replication errors made by the 
polymerase that have been missed by the proofreading exonuclease. This 
strand-directed mismatch repair system detects the potential for distortion in the DNA 
helix from the misfit between noncomplementary base pairs. 

If the proofreading system simply recognized a mismatch in newly replicated 
DNA and randomly corrected one of the two mismatched nucleotides, it would 
mistakenly “correct” the original template strand to match the error exactly half 
the time, thereby failing to lower the overall error rate. To be effective, such a 
proofreading system must be able to distinguish and remove the mismatched 
nucleotide only on the newly synthesized strand, where the replication error occurred. 

The strand-distinction mechanism used by the mismatch proofreading system 
in E. coli depends on the methylation of selected A residues in the DNA. Methyl 
groups are added to all A residues in the sequence GATC, but not until some 
time after the A has been incorporated into a newly synthesized DNA chain. As 
a result, the only GATC sequences that have not yet been methylated are in the 
new strands just behind a replication fork. The recognition of these unmethylated 
GATCs allows the new DNA strands to be transiently distinguished from old ones, 
as required if their mismatches are to be selectively removed. The three-step 
process involves recognition of a newly synthesized strand, excision of the portion 
containing the mismatch, and resynthesis of the excised segment using the old 
strand as a template. This strand-directed mismatch repair system reduces the 
number of errors made during DNA replication by an additional factor of 100 to 
1000 (see Table 5-1, p. 244). 
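
The logic of strand discrimination can be made concrete with a small model. The sketch below is a deliberate simplification and its names (`Strand`, `mismatch_repair`) are ours: methylation is reduced to a single flag per strand, and excision and resynthesis are collapsed into rewriting each mismatched position. It shows only the decision rule described above: the unmethylated strand is treated as new, and it alone is corrected, with the old strand serving as template.

```python
# Toy model of strand-directed mismatch repair as described for E. coli.
# Simplifications (ours): one methylation flag per strand; excision and
# resynthesis are modeled as rewriting mismatched positions in place.
from dataclasses import dataclass

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


@dataclass
class Strand:
    seq: str               # written in register: position i pairs with position i
    gatc_methylated: bool  # True for an old strand, False for a newly made one


def mismatch_repair(a: Strand, b: Strand) -> Strand:
    """Return the repaired new strand, using the methylated strand as template."""
    if a.gatc_methylated == b.gatc_methylated:
        raise ValueError("strands cannot be distinguished by methylation")
    old, new = (a, b) if a.gatc_methylated else (b, a)
    repaired = "".join(
        n if COMPLEMENT[o] == n else COMPLEMENT[o]  # fix only mispaired positions
        for o, n in zip(old.seq, new.seq)
    )
    return Strand(repaired, gatc_methylated=False)


if __name__ == "__main__":
    old = Strand("GATCA", gatc_methylated=True)
    new = Strand("CTAGC", gatc_methylated=False)  # final C is mispaired with A
    print(mismatch_repair(old, new).seq)          # CTAGT
```

Once both strands are methylated the function refuses to act, which reflects the point that the new strand can be distinguished from the old only transiently.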

A similar mismatch proofreading system functions in eukaryotic cells but uses 
a different strategy to distinguish the new strand from the old (Figure 5-19). Newly 
synthesized lagging-strand DNA transiently contains nicks (before they are sealed 
by DNA ligase) and such nicks (also called single-strand breaks) provide the 
signal that directs the mismatch proofreading system to the appropriate strand. This 
strategy also requires that the newly synthesized DNA on the leading strand be 
transiently nicked; how this occurs is uncertain. 

The importance of mismatch proofreading in humans is seen in individuals 
who inherit one defective copy of a mismatch repair gene (along with a functional 
gene on the other copy of the chromosome). These people have a marked 
predisposition for certain types of cancers. For example, in a type of colon cancer called 
hereditary nonpolyposis colon cancer (HNPCC), spontaneous mutation of the one 
functional gene produces a clone of somatic cells that, because they are deficient 
in mismatch proofreading, accumulate mutations unusually rapidly. Most 
cancers arise in cells that have accumulated multiple mutations (see pp. 1096-1097), 
and cells deficient in mismatch proofreading therefore have a greatly enhanced 
chance of becoming cancerous. Fortunately, most of us inherit two good 
copies of each gene that encodes a mismatch proofreading protein; this protects us, 
because it is highly unlikely for both copies to become mutated in the same cell. 

DNA Topoisomerases Prevent DNA Tangling During Replication 

As a replication fork moves along double-strand DNA, it creates what has been 
called the “winding problem.” The two parental strands, which are wound around 
each other, must be unwound and separated for replication to occur. For every 10 
nucleotide pairs replicated at the fork, one complete turn of the parental double 
helix must be unwound. In principle, this unwinding can be achieved by rapidly 
rotating the entire chromosome ahead of a moving fork; however, this is 
energetically highly unfavorable (particularly for long chromosomes) and, instead, the 
DNA in front of a replication fork becomes overwound (Figure 5-20). The 
overwinding, in turn, is continually relieved by proteins known as DNA 
topoisomerases. 
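
The scale of the winding problem follows directly from the figure of 10 nucleotide pairs per helical turn. The sketch below converts a fork speed into turns of helix that must be removed per second. The example speed of 1000 nucleotide pairs per second is the upper helicase rate quoted earlier in this chapter, used here as an assumed fork speed; the function names are ours.

```python
# How much helix must be unwound as a fork advances (illustrative names).

NUCLEOTIDE_PAIRS_PER_TURN = 10  # one complete turn per 10 nucleotide pairs


def turns_per_second(fork_rate_pairs_per_s: float) -> float:
    """Helical turns that must be removed each second at a moving fork."""
    return fork_rate_pairs_per_s / NUCLEOTIDE_PAIRS_PER_TURN


def total_turns(length_in_pairs: int) -> float:
    """Helical turns that must be removed to replicate a stretch of DNA."""
    return length_in_pairs / NUCLEOTIDE_PAIRS_PER_TURN


if __name__ == "__main__":
    # Assumed fork speed: the upper helicase rate of 1000 pairs per second.
    print(turns_per_second(1000))  # 100.0 turns per second
    print(total_turns(1_000_000))  # 100000.0 turns for a million pairs
```

At such a speed, relieving the overwinding by rotating everything ahead of the fork would require about 100 revolutions per second, which makes clear why a local swivel is needed.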

A DNA topoisomerase can be viewed as a reversible nuclease that adds itself 
covalently to a DNA backbone phosphate, thereby breaking a phosphodiester 
bond in a DNA strand. This reaction is reversible, and the phosphodiester bond 
re-forms as the protein leaves. 

One type of topoisomerase, called topoisomerase I, produces a transient single-strand
break; this break in the phosphodiester backbone allows the two sections
of DNA helix on either side of the nick to rotate freely relative to each other,
using the phosphodiester bond in the strand opposite the nick as a swivel point
(Figure 5-21). Any tension in the DNA helix will drive this rotation in the direction
that relieves the tension. As a result, DNA replication can occur with the rotation
of only a short length of helix: the part just ahead of the fork. Because the covalent
linkage that joins the DNA topoisomerase protein to a DNA phosphate retains
the energy of the cleaved phosphodiester bond, resealing is rapid and does not
require additional energy input. In this respect, the rejoining mechanism differs
from that catalyzed by the enzyme DNA ligase, discussed previously (see Figure
5-12).

Figure 5-19 Strand-directed mismatch repair. (A) The two proteins shown are
present in both bacteria and eukaryotic cells: MutS binds specifically to a
mismatched base pair, while MutL scans the nearby DNA for a nick. Once MutL
finds a nick, it triggers the degradation of the nicked strand all the way back
through the mismatch. Because nicks are largely confined to newly replicated
strands in eukaryotes, replication errors are selectively removed. In bacteria, an
additional protein in the complex (MutH) nicks unmethylated (and therefore newly
replicated) GATC sequences, thereby beginning the process illustrated here. In
eukaryotes, MutL contains a DNA nicking activity that aids in the removal of the
damaged strand. (B) The structure of the MutS protein bound to a DNA mismatch.
This protein is a dimer, which grips the DNA double helix as shown, kinking the
DNA at the mismatched base pair. It seems that the MutS protein scans the DNA
for mismatches by testing for sites that can be readily kinked, which are those
with an abnormal base pair. (PDB code: 1EWQ.)

Figure 5-20 The “winding problem” that arises during DNA replication.
(A) For a bacterial replication fork moving at 500 nucleotides per second, the
parental DNA helix ahead of the fork must rotate at 50 revolutions per second.
(B) If the ends of the DNA double helix remain fixed (or difficult to rotate),
tension builds up in front of the replication fork as it becomes overwound. Some
of this tension can be taken up by supercoiling, whereby the DNA double helix
twists around itself (see Figure 6-19). However, if the tension continues to build
up, the replication fork will eventually stop because further unwinding requires
more energy than the helicase can provide. Note that in (A), the dotted line
represents about 20 turns of DNA.

Figure 5-21 The reversible DNA nicking reaction catalyzed by a eukaryotic DNA
topoisomerase I enzyme. As indicated, these enzymes transiently form a single
covalent bond with DNA; this allows free rotation of the DNA around the covalent
backbone bonds linked to the blue phosphate. The labels in the figure read, in
order: DNA topoisomerase covalently attaches to a DNA phosphate, thereby breaking
a phosphodiester linkage in one DNA strand; the original phosphodiester bond
energy is stored in the phosphotyrosine linkage, making the reaction reversible;
spontaneous re-formation of the phosphodiester bond regenerates both the DNA
helix and the DNA topoisomerase.

A second type of DNA topoisomerase, topoisomerase II, forms a covalent
linkage to both strands of the DNA helix at the same time, making a transient
double-strand break in the helix. These enzymes are activated by sites on chromosomes
where two double helices cross over each other, such as those generated by
supercoiling in front of a replication fork (see Figure 5-20). Once a topoisomerase
II molecule binds to such a crossing site, the protein uses ATP hydrolysis to perform
the following set of reactions efficiently: (1) it breaks one double helix reversibly
to create a DNA “gate”; (2) it causes the second, nearby double helix to pass
through this opening; and (3) it then reseals the break and dissociates from the
DNA. At crossover points generated by supercoiling, passage of the double helix
through the gate occurs in the direction that will reduce supercoiling. In this way,
type II topoisomerases can relieve the overwinding tension generated in front of
a replication fork. Their reaction mechanism also allows type II DNA topoisomerases
to efficiently separate two interlocked DNA circles (Figure 5-22).

Figure 5-22 The DNA-helix-passing reaction catalyzed by DNA
topoisomerase II. Unlike type I topoisomerases, type II enzymes hydrolyze
ATP (red), which is needed to release and reset the enzyme after each cycle.
Type II topoisomerases are largely confined to proliferating cells in eukaryotes;
partly for that reason, they have been effective targets for anticancer drugs.
Some of these drugs inhibit topoisomerase II at the third step in the figure
and thereby produce high levels of double-strand breaks that kill rapidly
dividing cells. The small yellow circles represent the phosphates in the DNA
backbone that become covalently bonded to the topoisomerase (see
Figure 5-21).

Topoisomerase II also prevents the severe DNA tangling problems that would
otherwise arise during DNA replication. This role is nicely illustrated by mutant
yeast cells that produce, in place of the normal topoisomerase II, a version that is
inactive above 37°C. When the mutant cells are warmed to this temperature, their
daughter chromosomes remain intertwined after DNA replication and are unable
to separate. The enormous usefulness of topoisomerase II for untangling chromosomes
can readily be appreciated by anyone who has struggled to remove a tangle
from a fishing line without the aid of scissors.

DNA Replication Is Fundamentally Similar in Eukaryotes and
Bacteria

Much of what we know about DNA replication was first derived from studies
of purified bacterial and bacteriophage multienzyme systems capable of DNA
replication in vitro. The development of these systems in the 1970s was greatly
facilitated by the prior isolation of mutants in a variety of replication genes; these
mutants were exploited to identify and purify the corresponding replication proteins.
The first mammalian replication system that accurately replicated DNA in
vitro was described in the mid-1980s, and mutations in genes encoding nearly all
of the replication components have now been isolated and analyzed in the yeast
Saccharomyces cerevisiae. As a result, much is known about the detailed enzymology
of DNA replication in eukaryotes, and it is clear that the fundamental features
of DNA replication (including replication-fork geometry and the use of a multiprotein
replication machine) have been conserved during the long evolutionary
process that separated bacteria from eukaryotes.

There are more protein components in eukaryotic replication machines than
there are in the bacterial analogs, even though the basic functions are the same.
Thus, for example, the eukaryotic single-strand binding (SSB) protein is formed
from three subunits, whereas only a single subunit is found in bacteria. Similarly,
the eukaryotic DNA primase is incorporated into a multisubunit enzyme that also
contains a polymerase called DNA polymerase α-primase. This protein complex
begins each Okazaki fragment on the lagging strand with RNA and then extends
the RNA primer with a short length of DNA. At this point, the two main eukaryotic
replicative DNA polymerases, Polδ and Polε, come into play: Polδ completes
each Okazaki fragment on the lagging strand and Polε extends the leading strand.
The increased complexity of eukaryotic replication machinery probably reflects
more elaborate controls. For example, the orderly maintenance of different cell
types and tissues in animals and plants requires that DNA replication be tightly
regulated. Moreover, eukaryotic DNA replication must be coordinated with the
elaborate process of mitosis, as we discuss in Chapter 17.

As we see in the next section, the eukaryotic replication machinery has the
added complication of having to replicate through nucleosomes, the repeating
structural unit of chromosomes discussed in Chapter 4. Nucleosomes are spaced
at intervals of about 200 nucleotide pairs along the DNA, which, as we will see,
explains why new Okazaki fragments are synthesized on the lagging strand at
intervals of 100-200 nucleotides in eukaryotes, instead of 1000-2000 nucleotides
as in bacteria. Nucleosomes may also act as barriers that slow down the movement
of DNA polymerase molecules, which may be why eukaryotic replication
forks move only about one-tenth as fast as bacterial replication forks.

Summary 

DNA replication takes place at a Y-shaped structure called a replication fork. A
self-correcting DNA polymerase enzyme catalyzes nucleotide polymerization in a
5'-to-3' direction, copying a DNA template strand with remarkable fidelity. Since
the two strands of a DNA double helix are antiparallel, this 5'-to-3' DNA synthesis
can take place continuously on only one of the strands at a replication fork (the
leading strand). On the lagging strand, short DNA fragments must be made by a
“backstitching” process. Because the self-correcting DNA polymerase cannot start
a new chain, these lagging-strand DNA fragments are primed by short RNA primer
molecules that are subsequently erased and replaced with DNA.

DNA replication requires the cooperation of many proteins. These include (1)
DNA polymerase and DNA primase to catalyze nucleoside triphosphate polymerization;
(2) DNA helicases and single-strand DNA-binding (SSB) proteins to help in
opening up the DNA helix so that it can be copied; (3) DNA ligase and an enzyme
that degrades RNA primers to seal together the discontinuously synthesized
lagging-strand DNA fragments; and (4) DNA topoisomerases to help to relieve helical
winding and DNA tangling problems. Many of these proteins associate with each other
at a replication fork to form a highly efficient “replication machine,” through which
the activities and spatial movements of the individual components are coordinated.

THE INITIATION AND COMPLETION OF DNA 
REPLICATION IN CHROMOSOMES 

We have seen how a set of replication proteins rapidly and accurately generates
two daughter DNA double helices behind a replication fork. But how is this replication
machinery assembled in the first place, and how are replication forks
created on an intact, double-strand DNA molecule? In this section, we discuss
how cells initiate DNA replication and how they carefully regulate this process to
ensure that it takes place not only at the proper positions on the chromosome but
also at the appropriate time in the life of the cell. We also discuss a few of the special
problems that the replication machinery in eukaryotic cells must overcome.
These include the need to replicate the enormously long DNA molecules found in
eukaryotic chromosomes, as well as the difficulty of copying DNA molecules that
are tightly complexed with histones in nucleosomes.

DNA Synthesis Begins at Replication Origins

As discussed previously, the DNA double helix is normally very stable: the two
DNA strands are locked together firmly by many hydrogen bonds formed between
the bases on each strand. To begin DNA replication, the double helix must first
be opened up and the two strands separated to expose unpaired bases. As we
shall see, the process of DNA replication is begun by special initiator proteins that
bind to double-strand DNA and pry the two strands apart, breaking the hydrogen
bonds between the bases.

Figure 5-23 A replication bubble formed by replication-fork initiation. This diagram
outlines the major steps in the initiation of replication forks at replication origins.
The structure formed at the last step, in which both strands of the parental DNA
helix have been separated from each other and serve as templates for DNA
synthesis, is called a replication bubble.

Figure 5-24 DNA replication of a bacterial genome. It takes E. coli
about 30 minutes to duplicate its genome of 4.6 × 10^6 nucleotide pairs. For
simplicity, no Okazaki fragments are shown on the lagging strand. What
happens as the two replication forks approach each other and collide at the
end of the replication cycle is not well understood, although the replication
machines are disassembled as part of the process.


The positions at which the DNA helix is first opened are called replication origins
(Figure 5-23). In simple cells like those of bacteria or yeast, origins are specified
by DNA sequences several hundred nucleotide pairs in length. This DNA
contains both short sequences that attract initiator proteins and stretches of DNA
that are especially easy to open. We saw in Figure 4-4 that an A-T base pair is held
together by fewer hydrogen bonds than a G-C base pair. Therefore, DNA rich in
A-T base pairs is relatively easy to pull apart, and regions of DNA enriched in A-T
base pairs are typically found at replication origins.

Although the basic process of replication-fork initiation depicted in Figure 
5-23 is fundamentally the same for bacteria and eukaryotes, the detailed way in 
which this process is performed and regulated differs between these two groups 
of organisms. We first consider the simpler and better-understood case in bacteria
and then turn to the more complex situation found in yeasts, mammals, and
other eukaryotes.

Bacterial Chromosomes Typically Have a Single Origin of DNA
Replication

The genome of E. coli is contained in a single circular DNA molecule of 4.6 × 10^6
nucleotide pairs. DNA replication begins at a single origin of replication, and the
two replication forks assembled there proceed (at approximately 1000 nucleotides
per second) in opposite directions until they meet up roughly halfway around the
chromosome (Figure 5-24). The only point at which E. coli can control DNA replication
is initiation: once the forks have been assembled at the origin, they synthesize
DNA at relatively constant speed until replication is finished. Therefore,
it is not surprising that the initiation of DNA replication is highly regulated. The
process begins when initiator proteins (in their ATP-bound state) bind in multiple
copies to specific DNA sites located at the replication origin, wrapping the DNA
around the proteins to form a large protein-DNA complex that destabilizes the
adjacent double helix. This complex then attracts two DNA helicases, each bound
to a helicase loader, and these are placed around adjacent DNA single strands
whose bases have been exposed by the assembly of the initiator protein-DNA
complex. The helicase loader is analogous to the clamp loader we encountered
above; it has the additional job of keeping the helicase in an inactive form until it
is properly loaded onto a nascent replication fork. Once the helicases are loaded,
the loaders dissociate and the helicases begin to unwind DNA, exposing enough
single-strand DNA for DNA primase to synthesize the first RNA primers (Figure
5-25). This quickly leads to the assembly of remaining proteins to create two replication
forks, with replication machines that move, with respect to the replication
origin, in opposite directions. They continue to synthesize DNA until all of the
DNA template downstream of each fork has been replicated.

In E. coli, the interaction of the initiator protein with the replication origin is
carefully regulated, with initiation occurring only when sufficient nutrients are
available for the bacterium to complete an entire round of replication. Initiation is
also controlled to ensure that only one round of DNA replication occurs for each
cell division. After replication is initiated, the initiator protein is inactivated by
hydrolysis of its bound ATP molecule, and the origin of replication experiences a
“refractory period.” The refractory period is caused by a delay in the methylation
of newly incorporated A nucleotides in the origin (Figure 5-26). Initiation cannot
occur again until the As are methylated and the initiator protein is restored to its
ATP-bound state.

Figure 5-25 The proteins that initiate DNA replication in bacteria. The
mechanism shown was established by studies in vitro with mixtures of highly
purified proteins. For E. coli DNA replication, the major initiator protein, the
helicase, and the primase are the dnaA, dnaB, and dnaG proteins, respectively.
In the first step, several molecules of the initiator protein bind to specific DNA
sequences at the replication origin and destabilize the double helix by forming a
compact structure in which the DNA is tightly wrapped around the protein. Next,
two helicases are brought in by helicase-loading proteins (the dnaC proteins),
which inhibit the helicases until they are properly loaded at the replication origin.
Helicase-loading proteins prevent the replicative DNA helicases from inappropriately
entering other single-strand stretches of DNA in the bacterial genome. Aided by
single-strand binding protein (not shown), the loaded helicases open up the DNA,
thereby enabling primases to enter and synthesize initial primers. In subsequent
steps, two complete replication forks are assembled at the origin and move off in
opposite directions. The initiator proteins are displaced as the left-hand fork moves
through them (not shown).


Eukaryotic Chromosomes Contain Multiple Origins of Replication

We have seen how two replication forks begin at a single replication origin in bacteria
and proceed in opposite directions, moving away from the origin until all of
the DNA in the single circular chromosome is replicated. The bacterial genome is
sufficiently small for these two replication forks to duplicate the genome in about
30 minutes. Because of the much greater size of most eukaryotic chromosomes, a
different strategy is required to allow their replication in a timely manner.

A method for determining the general pattern of eukaryotic chromosome
replication was developed in the early 1960s. Human cells growing in culture are
labeled for a short time with ³H-thymidine so that the DNA synthesized during
this period becomes highly radioactive. The cells are then gently lysed, and the
DNA is stretched on the surface of a glass slide coated with a photographic emulsion.
Development of the emulsion reveals the pattern of labeled DNA through
a technique known as autoradiography. The time allotted for radioactive labeling
is chosen to allow each replication fork to move several micrometers along
the DNA, so that the replicated DNA can be detected in the light microscope as
lines of silver grains, even though the DNA molecule itself is too thin to be visible.
In this way, both the rate and the direction of replication-fork movement can
be determined (Figure 5-27). From the rate at which tracks of replicated DNA
increase in length with increasing labeling time, the eukaryotic replication forks
are estimated to travel at about 50 nucleotides per second. This is approximately
twentyfold slower than the rate at which bacterial replication forks move, possibly
reflecting the increased difficulty of replicating DNA that is packaged tightly in
chromatin.

An average-size human chromosome contains a single linear DNA molecule
of about 150 million nucleotide pairs. It would take 0.02 seconds/nucleotide × 150
× 10^6 nucleotides = 3.0 × 10^6 seconds (about 35 days) to replicate such a DNA molecule
from end to end with a single replication fork moving at a rate of 50 nucleotides
per second. As expected, therefore, the autoradiographic experiments just
described reveal that many forks, belonging to separate replication bubbles, are
moving simultaneously on each eukaryotic chromosome.
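
The arithmetic behind the 35-day estimate can be checked with a short calculation. This is only a sketch using the two numbers quoted above (150 million nucleotide pairs and 50 nucleotides per second); the function and variable names are illustrative.

```python
SECONDS_PER_DAY = 24 * 60 * 60

def single_fork_replication_time_s(length_nt: float, fork_speed_nt_per_s: float) -> float:
    """Time for one fork to copy a DNA molecule from end to end, in seconds."""
    return length_nt / fork_speed_nt_per_s

chromosome_length = 150e6  # nucleotide pairs, average-size human chromosome (from the text)
fork_speed = 50            # nucleotides per second (from the text)

seconds = single_fork_replication_time_s(chromosome_length, fork_speed)
print(f"{seconds:.1e} s, about {seconds / SECONDS_PER_DAY:.0f} days")
# 3.0e+06 s, about 35 days
```

Dividing by the fork speed is the same as multiplying by 0.02 seconds per nucleotide, the form used in the text.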

Much faster and more sophisticated methods now exist for monitoring DNA
replication initiation and tracking the movement of DNA replication forks across
whole genomes. One approach uses DNA microarrays, which are grids the size of a
postage stamp studded with hundreds of thousands of fragments of known DNA
sequence. As we will see in detail in Chapter 8, each different DNA fragment is
placed at a unique position on the microarray, and whole genomes can thereby
be represented in an orderly manner. If a DNA sample from a group of replicating
cells is broken up and hybridized to a microarray representing that organism's
genome, the amount of each DNA sequence can be determined. Because a segment
of a genome that has been replicated will contain twice as much DNA as
an unreplicated segment, replication-fork initiation and fork movement can be
accurately monitored across an entire genome (Figure 5-28).
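
The idea of reading replication status from the relative amount of DNA can be sketched in a few lines. The example below is a toy illustration only: the signal values, the threshold of 1.5, and the function name are assumptions made here, and real microarray analysis involves normalization and noise handling that are not shown.

```python
from typing import List, Tuple

def replicated_segments(signal: List[float], threshold: float = 1.5) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) index ranges where the signal exceeds the threshold.

    The signal is assumed to be normalized so that unreplicated DNA gives about 1.0
    and replicated DNA about 2.0; the threshold of 1.5 is an arbitrary midpoint.
    """
    segments = []
    start = None
    for i, value in enumerate(signal):
        if value > threshold and start is None:
            start = i                        # entering a replicated region
        elif value <= threshold and start is not None:
            segments.append((start, i - 1))  # leaving a replicated region
            start = None
    if start is not None:
        segments.append((start, len(signal) - 1))
    return segments

# Hypothetical normalized signals for consecutive positions along a chromosome segment
early = [1.0, 1.0, 1.1, 2.0, 1.9, 1.0, 1.0, 1.0]
later = [1.0, 2.0, 2.1, 2.0, 1.9, 2.0, 1.9, 1.0]
print(replicated_segments(early))  # [(3, 4)]
print(replicated_segments(later))  # [(1, 6)]
```

In this made-up data, the replicated region widens in both directions between the two time points, which is the signature expected of two forks moving away from a single origin.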

Experiments of this type have shown the following: (1) Approximately 30,000-50,000
origins of replication are used each time a human cell divides. (2) The
human genome has many more (perhaps tenfold more) potential origins than
this, and different cell types use different sets of origins. This may allow a cell to
coordinate its active origins with other features of its chromosomes such as which
genes are being expressed. The excess origins also provide “backups” in case a primary
origin fails. (3) As in bacteria, replication forks are formed in pairs and create
a replication bubble as they move in opposite directions away from a common
point of origin, stopping only when they collide head-on with a replication fork
moving in the opposite direction or when they reach a chromosome end. In this
way, many replication forks operate independently on each chromosome and yet
form two complete daughter DNA helices.

Figure 5-26 Methylation of the E. coli replication origin creates a refractory
period for DNA initiation. DNA methylation occurs at GATC sequences, 11 of which
are found in the origin of replication (spanning approximately 250 nucleotide
pairs). In its hemimethylated state, the origin of replication is bound by an
inhibitor protein (Seq A, not shown), which blocks the ability of the initiator
proteins to unwind the origin DNA. Eventually (about 15 minutes after replication
is initiated), the hemimethylated origins become fully methylated by a DNA
methylase enzyme; Seq A then dissociates. A single enzyme, the Dam methylase,
is responsible for methylating all E. coli GATC sequences. A lag in methylation
after the replication of GATC sequences is also used by the E. coli mismatch
proofreading system to distinguish the newly synthesized DNA strand from the
parental DNA strand; in that case, the relevant GATC sequences are scattered
throughout the chromosome, and they are not bound by Seq A.

Figure 5-27 The experiments that demonstrated the pattern in which
replication forks are formed and move on eukaryotic chromosomes. The new
DNA made in human cells in culture was labeled briefly with a pulse of highly
radioactive thymidine (³H-thymidine). (A) In this experiment, the cells were
lysed, and the DNA was stretched out on a glass slide that was subsequently
covered with a photographic emulsion. After several months, the emulsion was
developed, revealing a line of silver grains over the radioactive DNA. The brown
DNA in this figure is shown only to help with the interpretation of the
autoradiograph; the unlabeled DNA is invisible in such experiments. (B) This
experiment was the same except that a further incubation in unlabeled medium
allowed additional DNA, with a lower level of radioactivity, to be replicated. The
pairs of dark tracks in (B) were found to have silver grains tapering off in
opposite directions, demonstrating bidirectional fork movement from a central
replication origin where a replication bubble forms (see Figure 5-23). A
replication fork is thought to stop only when it encounters a replication fork
moving in the opposite direction or when it reaches the end of the chromosome;
in this way, all the DNA is eventually replicated.

In Eukaryotes, DNA Replication Takes Place During Only One Part
of the Cell Cycle

When growing rapidly, bacteria replicate their DNA nearly continuously. In contrast,
DNA replication in most eukaryotic cells occurs only during a specific part of
the cell-division cycle, called the DNA synthesis phase or S phase (Figure 5-29). In
a mammalian cell, the S phase typically lasts for about 8 hours; in simpler eukaryotic
cells such as yeasts, the S phase can be as short as 40 minutes. By its end, each
chromosome has been replicated to produce two complete copies, which remain
joined together at their centromeres until the M phase (M for mitosis), which soon
follows. In Chapter 17, we describe the control system that runs the cell cycle, and
we explain why entry into each phase of the cycle requires the cell to have successfully
completed the previous phase.

In the following sections, we explore how chromosome replication is coordinated
within the S phase of the cell cycle.

Different Regions on the Same Chromosome Replicate at Distinct
Times in S Phase

In mammalian cells, the replication of DNA in the region between one replication
origin and the next should normally require only about an hour to complete,
given the rate at which a replication fork moves and the largest distances measured
between replication origins. Yet S phase usually lasts for about 8 hours in
a mammalian cell. This implies that the replication origins are not all activated
simultaneously; indeed, replication origins are activated in clusters of about 50
adjacent replication origins, each of which is replicated during only a small part of
the total S-phase interval.


Figure 5-28 Use of DNA microarrays to monitor the formation and progress
of replication forks. For this experiment, a population of cells is synchronized so
that they all begin replication at the same time. DNA is collected and hybridized
to the microarray; DNA that has been replicated once gives a hybridization
signal (dark green squares) twice as high as that of unreplicated DNA (light green
squares). The spots on these microarrays represent consecutive sequences along a
segment of a chromosome arranged left to right, top to bottom. Only 81 spots are
shown here, but the actual arrays contain hundreds of thousands of sequences
that span an entire genome. As can be seen, replication begins at an origin and
proceeds bidirectionally. For simplicity, only one origin is shown here. In human
cells, replication begins at 30,000-50,000 origins located throughout the genome.
Using this approach it is possible to observe the formation and progress of every
replication fork across a genome.

Figure 5-29 The four successive phases of a standard eukaryotic cell cycle.
During the G1, S, and G2 phases, the cell grows continuously. During M phase
growth stops, the nucleus divides, and the cell divides in two. DNA replication is
confined to the part of the cell cycle known as S phase. G1 is the gap between
M phase and S phase; G2 is the gap between S phase and M phase.

It seems that the order in which replication origins are activated depends, in
part, on the chromatin structure in which the origins reside. We saw in Chapter 4
that heterochromatin is a particularly condensed state of chromatin, while
euchromatin, where most transcription occurs, has a less condensed conformation.
Heterochromatin tends to be replicated very late in S phase, suggesting that
the timing of replication is related to the packing of the DNA in chromatin.

Once initiated, however, replication forks seem to move at comparable rates
throughout S phase, so the extent of chromosome condensation seems to influence
the time at which replication forks are initiated, rather than their speed once
formed.


A Large Multisubunit Complex Binds to Eukaryotic Origins of
Replication

Having seen that a eukaryotic chromosome is replicated using many origins of
replication, each of which “fires” at a characteristic time in S phase of the cell
cycle, we turn to the nature of these origins of replication. We saw earlier in this
chapter that replication origins have been precisely defined in bacteria as specific
DNA sequences that attract initiator proteins, which then assemble the DNA replication
machinery. We shall see that this is the case for the single-cell budding
yeast S. cerevisiae, but it appears not to be strictly true for most other eukaryotes.

For budding yeast, the location of every origin of replication on each chromosome
has been determined. The particular chromosome shown in Figure 5-30
(chromosome III from S. cerevisiae) is one of the smallest chromosomes known,
with a length less than 1/100 that of a typical human chromosome. Its major origins
are spaced an average of 30,000 nucleotide pairs apart, but only a subset of
these origins is used by a given cell. Nonetheless, this chromosome can be replicated
in about 15 minutes.

The minimal DNA sequence required for directing DNA replication initiation
in S. cerevisiae has been determined by taking a segment of DNA that spans an
origin of replication and testing smaller and smaller DNA fragments for their ability
to function as origins. Most DNA sequences that can serve as an origin of replication
are found to contain (1) a binding site for a large, multisubunit initiator
protein called ORC, for origin recognition complex; (2) a stretch of DNA that is
rich in As and Ts and therefore easy to melt; and (3) at least one binding site for
proteins that facilitate ORC binding, probably by adjusting chromatin structure.

In bacteria, once the initiator protein is properly bound to the single origin
of replication, the assembly of the replication forks seems to follow more or less
automatically. In eukaryotes, the situation is significantly different because of a
profound problem eukaryotes have in replicating chromosomes: with so many
places to begin replication, how is the process regulated to ensure that all the DNA
is copied once and only once?

The answer lies in the sequential manner in which the replicative helicase is
first loaded onto origins and is then activated to initiate DNA replication. This
matter is discussed in detail in Chapter 17, where we consider the machinery that
underlies the cell-division cycle. In brief, during G1 phase, the replicative helicases
are loaded onto DNA next to ORC to create a prereplicative complex. Then,
upon passage from G1 phase to S phase, specialized protein kinases come into
play to activate the helicases. The resulting opening of the double helix allows the
loading of the remaining replication proteins, including the DNA polymerases.

Figure 5-30 The origins of DNA replication on chromosome III of the
yeast S. cerevisiae. This chromosome, one of the smallest eukaryotic
chromosomes known, carries a total of 180 genes. As indicated, it contains 18
replication origins, although they are used with different frequencies. Those in red
are typically used in less than 10% of cell divisions, while those in green are used
about 90% of the time.

Figure 5-31 DNA replication initiation in eukaryotes. This mechanism ensures that
each origin of replication is activated only once per cell cycle. An origin of
replication can be used only if a prereplicative complex forms there in G1 phase.
At the beginning of S phase, specialized kinases phosphorylate Mcm and ORC,
activating the former and inactivating the latter. A new prereplicative complex
cannot form at the origin until the cell progresses to the next G1 phase, when the
bound ORC has been dephosphorylated. Note that the eukaryotic Mcm helicase
moves along the leading-strand template, whereas the bacterial helicase moves
along the lagging-strand template (see Figure 5-25). As the forks begin to move,
ORC is displaced, and new ORCs rapidly bind to the newly replicated origins.


The protein kinases that trigger DNA replication simultaneously prevent
assembly of new prereplicative complexes until the next M phase resets the entire
cycle (for details, see pp. 974-975). They do this, in part, by phosphorylating ORC,
rendering it unable to accept new helicases. This strategy provides a single
window of opportunity for prereplicative complexes to form (G1 phase, when kinase
activity is low) and a second window for them to be activated and subsequently
disassembled (S phase, when kinase activity is high). Because these two phases of
the cell cycle are mutually exclusive and occur in a prescribed order, each origin
of replication can fire once and only once during each cell cycle.
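
The two-window logic described above can be sketched as a small state model. This is only an illustrative toy, not a description of the real biochemistry: the class name, field names, and the number of firing attempts are assumptions made for the example, and the reset is folded into the G1 step for simplicity.

```python
from dataclasses import dataclass


@dataclass
class Origin:
    """Toy model of one replication origin (all names are illustrative)."""
    helicase_loaded: bool = False      # a prereplicative complex is present
    orc_phosphorylated: bool = False   # phosphorylated ORC cannot load helicase
    times_fired: int = 0


def g1_phase(origin):
    """Low kinase activity: ORC is dephosphorylated and loads a helicase."""
    origin.orc_phosphorylated = False
    origin.helicase_loaded = True


def s_phase(origin, firing_attempts=3):
    """High kinase activity: a loaded helicase fires; ORC cannot reload."""
    origin.orc_phosphorylated = True
    for _ in range(firing_attempts):
        if origin.helicase_loaded:
            origin.times_fired += 1
            origin.helicase_loaded = False   # complex is disassembled
        # Reloading needs unphosphorylated ORC, so this never runs in S phase.
        if not origin.orc_phosphorylated:
            origin.helicase_loaded = True


def run_cycles(n_cycles):
    """Return how many times one origin fires over n_cycles cell cycles."""
    origin = Origin()
    for _ in range(n_cycles):
        g1_phase(origin)
        s_phase(origin)
    return origin.times_fired


if __name__ == "__main__":
    print(run_cycles(4))
```

However many firing attempts are made within one S phase, the origin in this sketch fires once per cycle, because loading and activation happen in separate, ordered windows.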

Features of the Human Genome That Specify Origins of
Replication Remain to Be Discovered

Compared with the situation in budding yeast, the determinants of replication
origins in other eukaryotes have been difficult to discover. It has been possible to
identify specific human DNA sequences, each several thousand nucleotide pairs
in length, that are sufficient to serve as replication origins. These origins continue
to function when moved to a different chromosomal region by recombinant DNA
methods, as long as they are placed in a region where the chromatin is relatively
uncondensed. However, comparisons of such DNA sequences have not revealed
specific DNA sequences that mark origins of replication.

Despite this, a human ORC that is very similar to the yeast ORC binds to
origins of replication and initiates DNA replication in humans. Many of the other
proteins that function in the initiation process in yeast likewise have central roles
in humans. It therefore seems likely that the yeast and human initiation
mechanisms are similar in outline, but chromatin structure, transcriptional activity, or
some property of the genome other than a specific DNA sequence has the
central role in attracting ORC and specifying mammalian origins of replication. These
ideas could also help to explain how a given mammalian cell chooses which of the
many possible origins to use when it replicates its genome and how this choice
could differ from cell to cell. Clearly, we have a great deal to discover about the
fundamental process of DNA replication initiation.

New Nucleosomes Are Assembled Behind the Replication Fork

Several additional aspects of DNA replication are specific to eukaryotes. As
discussed in Chapter 4, eukaryotic chromosomes are composed of roughly equal
mixtures of DNA and protein. Chromosome duplication therefore requires not
only the replication of DNA, but also the synthesis and assembly of new
chromosomal proteins onto the DNA behind each replication fork. Although we are
far from understanding this process in detail, we are beginning to learn how the
fundamental unit of chromatin packaging, the nucleosome, is duplicated. The cell
requires a large amount of new histone protein, approximately equal in mass to
the newly synthesized DNA, to make the new nucleosomes in each cell cycle. For
this reason, most eukaryotic organisms possess multiple copies of the gene for
each histone. Vertebrate cells, for example, have about 20 repeated gene sets, most
containing the genes that encode all five histones (H1, H2A, H2B, H3, and H4).

Unlike most proteins, which are made continuously, histones are synthesized
mainly in S phase, when the level of histone mRNA increases about fiftyfold as
a result of both increased transcription and decreased mRNA degradation. The
major histone mRNAs are degraded within minutes when DNA synthesis stops at
the end of S phase. The mechanism depends on special properties of the 3' ends
of these mRNAs, as discussed in Chapter 7. In contrast, the histone proteins
themselves are remarkably stable and may survive for the entire life of a cell. The tight
linkage between DNA synthesis and histone synthesis appears to reflect a
feedback mechanism that monitors the level of free histone to ensure that the amount
of histone made exactly matches the amount of new DNA synthesized.

As a replication fork advances, it must pass through the parental nucleosomes.
In the cell, efficient replication requires chromatin remodeling complexes
(discussed in Chapter 4) to destabilize the DNA-histone interfaces. Aided by such
complexes, replication forks can transit even highly condensed chromatin
efficiently.

As a replication fork passes through chromatin, the histones are transiently
displaced, leaving about 600 nucleotide pairs of non-nucleosomal DNA in its
wake. The reestablishment of nucleosomes behind a moving fork occurs in an
intriguing way. When a nucleosome is traversed by a replication fork, the histone
octamer appears to be broken into an H3-H4 tetramer and two H2A-H2B dimers
(discussed in Chapter 4). The H3-H4 tetramer remains loosely associated with
DNA and is distributed at random to one or the other daughter duplex, but the
H2A-H2B dimers are released completely from DNA. Freshly made H3-H4
tetramers are added to the newly synthesized DNA to fill in the “spaces,” and H2A-H2B
dimers (half of which are old and half new) are then added at random to
complete the nucleosomes (Figure 5-32). The formation of new nucleosomes behind
a replication fork has an important consequence for the process of DNA
replication itself. As DNA polymerase δ discontinuously synthesizes the lagging strand
(see pp. 253-254), the length of each Okazaki fragment is determined by the point
at which DNA polymerase δ is blocked by a newly formed nucleosome. This tight
coupling between nucleosome duplication and DNA replication explains why the
length of Okazaki fragments in eukaryotes (~200 nucleotides) is approximately
the same as the nucleosome repeat length.

The orderly and rapid addition of new H3-H4 tetramers and H2A-H2B dimers
behind a replication fork requires histone chaperones (also called chromatin
assembly factors). These multisubunit complexes bind the highly basic histones
and release them for assembly only in the appropriate context. The histone
chaperones, along with their cargoes, are directed to newly replicated DNA through
a specific interaction with the eukaryotic sliding clamp called PCNA (see Figure
5-32B). These clamps are left behind moving replication forks and remain on the
DNA long enough for the histone chaperones to complete their tasks.
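
The random inheritance of parental H3-H4 tetramers can be illustrated with a short simulation. This is a sketch under a strong simplifying assumption that is not stated in the text: at each nucleosome position, the parental tetramer goes to one daughter duplex with probability 0.5, and the other daughter receives a newly made tetramer at that position. The function name and labels are hypothetical.

```python
import random


def segregate_tetramers(n_positions, rng):
    """Distribute parental H3-H4 tetramers between two daughter duplexes.

    Assumption for this sketch: each position contributes its parental
    ("old") tetramer to one daughter at random, and the matching position
    on the other daughter is filled with a newly made ("new") tetramer.
    """
    daughter_a, daughter_b = [], []
    for _ in range(n_positions):
        if rng.random() < 0.5:
            daughter_a.append("old")
            daughter_b.append("new")
        else:
            daughter_a.append("new")
            daughter_b.append("old")
    return daughter_a, daughter_b


if __name__ == "__main__":
    a, b = segregate_tetramers(1000, random.Random(0))
    # Each daughter should inherit roughly half of the parental tetramers.
    print(a.count("old"), b.count("old"))
```

With many positions, each daughter ends up with roughly equal numbers of old and new tetramers, which is the outcome the text describes.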


Figure 5-32 Formation of nucleosomes
behind a replication fork. Parental
H3-H4 tetramers are distributed at random
to the daughter DNA molecules, with
roughly equal numbers inherited by each
daughter. In contrast, H2A-H2B dimers
are released from the DNA as the
replication fork passes. This release
begins just in front of the replication fork
and is facilitated by chromatin remodeling
complexes that move with the fork.

Histone chaperones (NAP1 and
CAF1) restore the full complement of
histones to daughter molecules using both
parental and newly synthesized histones.
Although some daughter nucleosomes
contain only parental histones or only newly
synthesized histones, most are hybrids of
old and new. For simplicity, the DNA double
helix is shown as a single red line. (Adapted
from J.D. Watson et al., Molecular Biology
of the Gene, 5th ed. Cold Spring Harbor:
Cold Spring Harbor Laboratory Press,
2004.)


Telomerase Replicates the Ends of Chromosomes 

We saw earlier that synthesis of the lagging strand at a replication fork must occur
discontinuously through a backstitching mechanism that produces short DNA
fragments. This mechanism encounters a special problem when the replication
fork reaches an end of a linear chromosome. The final RNA primer synthesized
on the lagging-strand template cannot be replaced by DNA because there is no
3'-OH end available for the repair polymerase. Without a mechanism to deal with
this problem, DNA would be lost from the ends of all chromosomes each time a
cell divides.

Bacteria solve this "end-replication" problem by having circular DNA
molecules as chromosomes (see Figure 5-24). Eukaryotes solve it in a different way:
they have specialized nucleotide sequences at the ends of their chromosomes
that are incorporated into structures called telomeres (discussed in Chapter 4).
Telomeres contain many tandem repeats of a short sequence that is similar in
organisms as diverse as protozoa, fungi, plants, and mammals. In humans, the
sequence of the repeat unit is GGGTTA, and it is repeated roughly a thousand
times at each telomere.

Telomere DNA sequences are recognized by sequence-specific DNA-binding
proteins that attract an enzyme, called telomerase, that replenishes these
sequences each time a cell divides. Telomerase recognizes the tip of an existing
telomere DNA repeat sequence and elongates it in the 5'-to-3' direction, using
an RNA template that is a component of the enzyme itself to synthesize new
copies of the repeat (Figure 5-33). The enzymatic portion of telomerase resembles
other reverse transcriptases, proteins that synthesize DNA using an RNA template,
although, in this case, the telomerase RNA also contributes functional groups to
make the catalysis more efficient. After extension of the parental DNA strand by
telomerase, replication of the lagging strand at the chromosome end can be
completed by the conventional DNA polymerases, using these extensions as a
template to synthesize the complementary strand (Figure 5-34).

Figure 5-33 Structure of a portion of
telomerase. Telomerase is a large protein-
RNA complex. The RNA (blue) contains
a templating sequence for synthesizing
new DNA telomere repeats. The synthesis
reaction itself is carried out by the reverse
transcriptase domain of the protein, shown
in green. A reverse transcriptase is a
special form of polymerase enzyme that
uses an RNA template to make a DNA
strand; telomerase is unique in carrying
its own RNA template with it. Telomerase
also has several additional protein domains
(not shown) that are needed to assemble
the enzyme at the ends of chromosomes.
(Modified from J. Lingner and T.R. Cech,
Curr. Opin. Genet. Dev. 8:226-232, 1998.
With permission from Elsevier.)
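
The idea of RNA-templated extension of the 3' end can be sketched in a few lines. The template string below is hypothetical: it is simply the RNA complement of the human repeat GGGTTA and is not meant to be the actual telomerase RNA sequence. The function names are also made up for the example.

```python
# Hypothetical RNA template, written 5'->3'. It was chosen so that copying
# it yields the human repeat GGGTTA; it is not the real telomerase RNA.
RNA_TEMPLATE = "UAACCC"

# Base-pairing rules for copying an RNA template into DNA.
RNA_TO_DNA = {"A": "T", "U": "A", "C": "G", "G": "C"}


def reverse_transcribe(rna_5_to_3):
    """Return the DNA strand (5'->3') templated by an RNA given 5'->3'."""
    return "".join(RNA_TO_DNA[base] for base in reversed(rna_5_to_3))


def extend_telomere(dna_5_to_3, rounds):
    """Append `rounds` copies of the templated repeat to the 3' end."""
    repeat = reverse_transcribe(RNA_TEMPLATE)
    return dna_5_to_3 + repeat * rounds


if __name__ == "__main__":
    print(reverse_transcribe(RNA_TEMPLATE))
    print(extend_telomere("ACGTGGGTTA", 2))
```

The DNA strand grows only at its 3' end, matching the 5'-to-3' direction of synthesis described in the text; the complementary strand would then be filled in by the conventional polymerases.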


Telomeres Are Packaged Into Specialized Structures That Protect 
the Ends of Chromosomes 

The ends of chromosomes present cells with an additional problem. As we will
see in the next part of this chapter, when a chromosome is accidentally broken,
the break is rapidly repaired (see Figure 5-45). Telomeres must clearly be
distinguished from these accidental breaks; otherwise the cell will attempt to "repair"
telomeres, causing chromosome fusions and other genetic abnormalities.
Telomeres have several features to prevent this from happening.

A specialized nuclease chews back the 5' end of a telomere, leaving a
protruding single-strand end. This protruding end, in combination with the GGGTTA
repeats in telomeres, attracts a group of proteins that form a protective
chromosome cap known as shelterin. In particular, shelterin “hides” telomeres from the
cell's damage detectors that continually monitor DNA. When human telomeres
are artificially cross-linked and viewed by electron microscopy, structures known
as “t-loops” are observed in which the protruding end of the telomere loops back
and tucks itself into the duplex DNA of the telomere repeat sequence (Figure
5-35). It is believed that t-loops are regulated by shelterin and provide additional
protection for the ends of chromosomes.

Figure 5-34 Telomere replication. Shown
here are the reactions that synthesize
the repeating sequences that form the
ends of the chromosomes (telomeres) of
diverse eukaryotic organisms. The 3' end
of the parental DNA strand is extended
by RNA-templated DNA synthesis; this
allows the incomplete daughter DNA
strand that is paired with it to be extended
in its 5' direction. This incomplete, lagging
strand is presumed to be completed by
DNA polymerase α, which carries a DNA
primase as one of its subunits (Movie 5.6).
The telomere sequence illustrated is that
of the ciliate Tetrahymena, in which these
reactions were first discovered.

Telomere Length Is Regulated by Cells and Organisms

Because the processes that grow and shrink each telomere sequence are only
approximately balanced, a chromosome end contains a variable number of
telomeric repeats. Not surprisingly, many cells have homeostatic mechanisms that
maintain the number of these repeats within a limited range (Figure 5-36).

In most of the dividing somatic cells of humans, however, telomeres gradually
shorten, and it has been proposed that this provides a counting mechanism that
helps prevent the unlimited proliferation of wayward cells in adult tissues. In its
simplest form, this idea holds that our somatic cells start off in the embryo with a
full complement of telomeric repeats. These are then eroded to different extents in
different cell types. Some stem cells, notably those in tissues that must be
replenished at a high rate throughout life (bone marrow or gut lining, for example),
retain full telomerase activity. However, in many other types of cells, the level of
telomerase is turned down so that the enzyme cannot quite keep up with
chromosome duplication. Such cells lose 100-200 nucleotides from each telomere every
time they divide. After many cell generations, the descendant cells will inherit
chromosomes that lack telomere function, and, as a result of this defect, activate
a DNA-damage response causing them to withdraw permanently from the cell
cycle and cease dividing, a process called replicative cell senescence (discussed
in Chapter 17). In theory, such a mechanism could provide a safeguard against
the uncontrolled cell proliferation of abnormal cells in somatic tissues, thereby
helping to protect us from cancer.

Figure 5-35 A t-loop at the end of a
mammalian chromosome. (A) Electron
micrograph of the DNA at the end of an
interphase human chromosome. The
chromosome was fixed, deproteinated, and
artificially thickened before viewing. The
loop seen here is approximately 15,000
nucleotide pairs in length. (B) Structure
of a t-loop. The insertion of the single-
strand 3' end into the duplex repeats is
carried out, and the structure maintained,
by specialized proteins. (From J.D. Griffith
et al., Cell 97:503-514, 1999. With
permission from Elsevier.)


Figure 5-36 A demonstration that
yeast cells control the length of their
telomeres. In this experiment, the telomere
at one end of a particular chromosome
is artificially made either longer (left) or
shorter (right) than average. After many
cell divisions, the chromosome recovers,
showing an average telomere length and
a length distribution that is typical of the
other chromosomes in the yeast cell. A
similar feedback mechanism for controlling
telomere length has been proposed for the
germ-line cells of animals.

The idea that telomere length acts as a “measuring stick” to count cell divisions
and thereby regulate the lifetime of the cell lineage has been tested in several
ways. For certain types of human cells grown in tissue culture, the experimental
results support such a theory. Human fibroblasts normally proliferate for about 60
cell divisions in culture before undergoing replicative cell senescence. Like most
other somatic cells in humans, fibroblasts produce only low levels of telomerase,
and their telomeres gradually shorten each time they divide. When telomerase is
provided to the fibroblasts by inserting an active telomerase gene, telomere length
is maintained and many of the cells now continue to proliferate indefinitely.
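
The "measuring stick" arithmetic can be made concrete with the numbers quoted in this section: a six-nucleotide repeat present roughly a thousand times per telomere, and a loss of 100-200 nucleotides per division. The sketch below is a back-of-the-envelope estimate only. It assumes a constant loss per division and, by default, that division stops only when the telomere is fully eroded (critical_nt = 0); cells are expected to senesce before that point, so the threshold is an illustrative parameter, not a measured value.

```python
import math

REPEAT_UNIT = "GGGTTA"         # human repeat unit, from the text
REPEATS_PER_TELOMERE = 1000    # "roughly a thousand", from the text


def divisions_until(initial_nt, loss_per_division, critical_nt=0):
    """Divisions needed for a telomere to shrink to critical_nt or below.

    Assumes a constant loss per division and no telomerase activity.
    """
    if loss_per_division <= 0:
        raise ValueError("loss_per_division must be positive")
    return max(0, math.ceil((initial_nt - critical_nt) / loss_per_division))


if __name__ == "__main__":
    initial = len(REPEAT_UNIT) * REPEATS_PER_TELOMERE   # about 6000 nucleotides
    for loss in (100, 200):
        print(loss, divisions_until(initial, loss))
```

Under these assumptions the estimate falls between 30 and 60 divisions, the same order of magnitude as the roughly 60 divisions observed for cultured human fibroblasts.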

It has been proposed that this type of control on cell proliferation may
contribute to the aging of animals like ourselves. These ideas have been tested by
producing transgenic mice that lack telomerase entirely. The telomeres in mouse
chromosomes are about five times longer than human telomeres, and the mice
must therefore be bred through three or more generations before their telomeres
have shrunk to the normal human length. It is therefore perhaps not surprising
that the first generations of mice develop normally. However, the mice in later
generations develop progressively more defects in some of their highly
proliferative tissues. In addition, these mice show signs of premature aging and have a
pronounced tendency to develop tumors. In these and other respects these mice
resemble humans with the genetic disease dyskeratosis congenita. Individuals
afflicted with this disease carry one functional and one nonfunctional copy of the
telomerase RNA gene; they have prematurely shortened telomeres and typically
die of progressive bone marrow failure. They also develop lung scarring and liver
cirrhosis and show abnormalities in various epidermal structures including skin,
hair follicles, and nails.

The above observations demonstrate that controlling cell proliferation by
telomere shortening poses a risk to an organism, because not all of the cells that begin
losing the ends of their chromosomes will stop dividing. Some apparently become
genetically unstable, but continue to divide, giving rise to variant cells that can
lead to cancer. Clearly, the use of telomere shortening as a regulating mechanism
is not foolproof and, like many mechanisms in the cell, seems to strike a balance
between benefit and risk.


Summary 

The proteins that initiate DNA replication bind to DNA sequences at a replication
origin to catalyze the formation of a replication bubble with two outward-moving
replication forks. The process begins when an initiator protein-DNA complex is
formed that subsequently loads a DNA helicase onto the DNA template. Other
proteins are then added to form the multienzyme “replication machine” that catalyzes
DNA synthesis at each replication fork.

In bacteria and some simple eukaryotes, replication origins are specified by
specific DNA sequences that are only several hundred nucleotide pairs long. In other
eukaryotes, such as humans, the sequences needed to specify an origin of DNA
replication seem to be less well defined, and the origin can span several thousand
nucleotide pairs.

Bacteria typically have a single origin of replication in a circular chromosome.
With fork speeds of up to 1000 nucleotides per second, they can replicate their
genome in less than an hour. Eukaryotic DNA replication takes place in only one
part of the cell cycle, the S phase. The replication fork in eukaryotes moves about 10
times more slowly than the bacterial replication fork, and the much longer
eukaryotic chromosomes each require many replication origins to complete their
replication in an S phase, which typically lasts for 8 hours in human cells. The different
replication origins in these eukaryotic chromosomes are activated in a sequence,
determined in part by the structure of the chromatin, with the most condensed
regions of chromatin typically beginning their replication last. After the replication
fork has passed, chromatin structure is re-formed by the addition of new histones to
the old histones that are directly inherited by each daughter DNA molecule.
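
The "less than an hour" figure for bacteria can be checked with simple arithmetic. The sketch below assumes a genome of about 4.6 million nucleotide pairs (roughly the size of the E. coli chromosome; this value is not given in the text above), two forks per origin moving in opposite directions, and a constant fork speed. The function name is hypothetical.

```python
def replication_time_seconds(genome_bp, fork_speed_nt_per_s, n_origins=1):
    """Estimate replication time for evenly spaced, simultaneously fired origins.

    Each origin launches two forks that move in opposite directions, so
    the genome is copied by 2 * n_origins forks working in parallel.
    """
    forks = 2 * n_origins
    return genome_bp / (forks * fork_speed_nt_per_s)


if __name__ == "__main__":
    # Assumed genome size: about 4.6 million nucleotide pairs.
    seconds = replication_time_seconds(4_600_000, 1000)
    print(seconds / 60, "minutes")
```

With one origin and forks moving at 1000 nucleotides per second, the estimate is a little under 40 minutes, in line with the statement in the summary.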

Eukaryotes solve the problem of replicating the ends of their linear chromosomes
with a specialized end structure, the telomere, maintained by a special nucleotide
polymerizing enzyme called telomerase. Telomerase extends one of the DNA strands
at the end of a chromosome by using an RNA template that is an integral part of the
enzyme itself, producing a highly repeated DNA sequence that typically extends for
thousands of nucleotide pairs at each chromosome end. Telomeres have specialized
structures that distinguish them from broken ends of chromosomes, ensuring that
they are not mistakenly repaired.

DNA REPAIR 

Maintaining the genetic stability that an organism needs for its survival requires
not only an extremely accurate mechanism for replicating DNA, but also
mechanisms for repairing the many accidental lesions that DNA continually suffers. Most
such spontaneous changes in DNA are temporary because they are immediately
corrected by a set of processes that are collectively called DNA repair. Of the tens
of thousands of random changes created every day in the DNA of a human cell by
heat, metabolic accidents, radiation of various sorts, and exposure to substances
in the environment, only a few (less than 0.02%) accumulate as permanent
mutations in the DNA sequence. The rest are eliminated with remarkable efficiency by
DNA repair.

The importance of DNA repair is evident from the large investment that cells
make in the enzymes that carry it out: several percent of the coding capacity of
most genomes is devoted solely to DNA repair functions. The importance of DNA
repair is also demonstrated by the increased rate of mutation that follows the
inactivation of a DNA repair gene. Many DNA repair proteins and the genes that
encode them (which we now know operate in a wide range of organisms,
including humans) were originally identified in bacteria by the isolation and
characterization of mutants that displayed an increased mutation rate or an increased
sensitivity to DNA-damaging agents.

Recent studies of the consequences of a diminished capacity for DNA repair
in humans have linked many human diseases with decreased repair (Table 5-2).
Thus, we saw previously that defects in a human gene whose product normally
functions to repair the mismatched base pairs resulting from DNA replication
errors can lead to an inherited predisposition to cancers of the colon and some
other organs, reflecting an increased mutation rate. In another human disease,
xeroderma pigmentosum (XP), the afflicted individuals have an extreme
sensitivity to ultraviolet radiation because they are unable to repair certain DNA
photoproducts. This repair defect results in an increased mutation rate that leads to
serious skin lesions and an increased susceptibility to skin cancers. Finally, mutations
in the Brca1 and Brca2 genes compromise a type of DNA repair known as
homologous recombination and are a cause of hereditary breast and ovarian cancer.


TABLE 5-2

Some Inherited Human Syndromes with Defects in DNA Repair

Name | Phenotype | Enzyme or process affected
MSH2, 3, 6, MLH1, PMS2 | Colon cancer | Mismatch repair
Xeroderma pigmentosum (XP) groups A-G | Skin cancer, UV sensitivity, neurological abnormalities | Nucleotide excision repair
Cockayne syndrome | UV sensitivity; developmental abnormalities | Coupling of nucleotide excision repair to transcription
XP variant | UV sensitivity, skin cancer | Translesion synthesis by DNA polymerase η
Ataxia telangiectasia (AT) | Leukemia, lymphoma, γ-ray sensitivity, genome instability | ATM protein, a protein kinase activated by double-strand breaks
BRCA1 | Breast and ovarian cancer | Repair by homologous recombination
BRCA2 | Breast, ovarian, and prostate cancer | Repair by homologous recombination
Werner syndrome | Premature aging, cancer at several sites, genome instability | Accessory 3'-exonuclease and DNA helicase used in repair
Bloom syndrome | Cancer at several sites, stunted growth, genome instability | DNA helicase needed for recombination
Fanconi anemia groups A-G | Congenital abnormalities, leukemia, genome instability | DNA interstrand cross-link repair
46 BR patient | Hypersensitivity to DNA-damaging agents, genome instability | DNA ligase I

Without DNA Repair, Spontaneous DNA Damage Would Rapidly 
Change DNA Sequences 

Although DNA is a highly stable material, as required for the storage of genetic
information, it is a complex organic molecule that is susceptible, even under
normal cell conditions, to spontaneous changes that would lead to mutations if
left unrepaired (Figure 5-37 and see Table 5-3). For example, the DNA of each
human cell loses about 18,000 purine bases (adenine and guanine) every day
because their N-glycosyl linkages to deoxyribose hydrolyze, a spontaneous
reaction called depurination. Similarly, a spontaneous deamination of cytosine to
uracil in DNA occurs at a rate of about 100 bases per cell per day (Figure 5-38).
DNA bases are also occasionally damaged by an encounter with reactive
metabolites produced in the cell, including reactive forms of oxygen and the
high-energy methyl donor S-adenosylmethionine, or by exposure to chemicals in the
environment. Likewise, ultraviolet radiation from the sun can produce a covalent
linkage between two adjacent pyrimidine bases in DNA to form, for example,
thymine dimers (Figure 5-39). If left uncorrected when the DNA is replicated,
most of these changes would be expected to lead either to the deletion of one or
more base pairs or to a base-pair substitution in the daughter DNA chain (Figure
5-40). The mutations would then be propagated throughout subsequent cell
generations. Such a high rate of random changes in the DNA sequence would have
disastrous consequences.


Figure 5-37 A summary of spontaneous
alterations that require DNA repair.
The sites on each nucleotide modified
by spontaneous oxidative damage (red
arrows), hydrolytic attack (blue arrows),
and methylation (green arrows) are shown,
with the width of each arrow indicating the
relative frequency of each event (see
Table 5-3). (After T. Lindahl, Nature
362:709-715, 1993. With permission from
Macmillan Publishers Ltd.)


TABLE 5-3

Endogenous DNA Lesions Arising and Repaired in a Diploid
Mammalian Cell in 24 Hours

DNA lesion | Number repaired in 24 h

Hydrolysis
Depurination | 18,000
Depyrimidination | 600
Cytosine deamination | 100
5-Methylcytosine deamination | 10

Oxidation
8-oxo G | 1500
Ring-saturated pyrimidines (thymine glycol, cytosine hydrates) | 2000
Lipid peroxidation products (M1G, etheno-A, etheno-C) | 1000

Nonenzymatic methylation by S-adenosylmethionine
7-Methylguanine | 6000
3-Methyladenine | 1200

Nonenzymatic methylation by nitrosated polyamines and peptides
O6-Methylguanine | 20-100

The DNA lesions listed in the table are the result of the normal chemical reactions that take
place in cells. Cells that are exposed to external chemicals and radiation suffer greater and more
diverse forms of DNA damage. (From T. Lindahl and D.E. Barnes, Cold Spring Harb. Symp.
Quant. Biol. 65:127-133, 2000.)
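
To get a feel for the scale of this damage, the per-day counts listed in Table 5-3 can be added up. The sketch below simply transcribes the table; entries with a single value are stored as a (low, high) pair with equal ends. Applying the "less than 0.02%" figure quoted earlier for permanent mutations to this total is an illustrative assumption, since that figure refers to all sources of damage, not only the endogenous lesions in the table.

```python
# Lesions repaired per diploid mammalian cell in 24 h, as (low, high) counts,
# transcribed from Table 5-3.
LESIONS_PER_DAY = {
    "Depurination": (18000, 18000),
    "Depyrimidination": (600, 600),
    "Cytosine deamination": (100, 100),
    "5-Methylcytosine deamination": (10, 10),
    "8-oxo G": (1500, 1500),
    "Ring-saturated pyrimidines": (2000, 2000),
    "Lipid peroxidation products": (1000, 1000),
    "7-Methylguanine": (6000, 6000),
    "3-Methyladenine": (1200, 1200),
    "O6-Methylguanine": (20, 100),
}


def total_range(table):
    """Return the (low, high) total number of lesions per day."""
    low = sum(lo for lo, _ in table.values())
    high = sum(hi for _, hi in table.values())
    return low, high


if __name__ == "__main__":
    low, high = total_range(LESIONS_PER_DAY)
    print(low, high)
    # Illustrative upper bound on lesions escaping repair, using 0.02%.
    print(high * 0.0002)
```

The total comes to a little over 30,000 lesions per cell per day, consistent with "tens of thousands", and 0.02% of that is only about six events.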

The DNA Double Helix Is Readily Repaired

The double-helical structure of DNA is ideally suited for repair because it carries
two separate copies of all the genetic information, one in each of its two strands.
Thus, when one strand is damaged, the complementary strand retains an intact
copy of the same information, and this copy is generally used to restore the correct
nucleotide sequences to the damaged strand.

An indication of the importance of a double-strand helix to the safe storage of
genetic information is that all cells use it; only a few small viruses use single-strand
DNA or RNA as their genetic material. The types of repair processes described in
this section cannot operate on such nucleic acids, and once damaged, the chance
of a permanent nucleotide change occurring in these single-strand genomes of
viruses is thus very high. It seems that only organisms with tiny genomes (and
therefore tiny targets for DNA damage) can afford to encode their genetic
information in any molecule other than a DNA double helix.




Figure 5-38 Depurination and deamination. These reactions are two of the most frequent spontaneous chemical reactions that create serious
DNA damage in cells. Depurination can release guanine (shown here), as well as adenine, from DNA. The major type of deamination reaction
converts cytosine to an altered DNA base, uracil (shown here), but deamination occurs on other bases as well. These reactions normally take place
in double-helical DNA; for convenience, only one strand is shown.

Figure 5-39 The most common type of thymine dimer. This type of
damage occurs in the DNA of cells exposed to ultraviolet irradiation (as in
sunlight). A similar dimer will form between any two neighboring pyrimidine
bases (C or T residues) in DNA.


DNA Damage Can Be Removed by More Than One Pathway 

Cells have multiple pathways to repair their DNA using different enzymes that act
upon different kinds of lesions. Figure 5-41 shows two of the most common
pathways. In both, the damage is excised, the original DNA sequence is restored by a
DNA polymerase that uses the undamaged strand as its template, and a
remaining break in the double helix is sealed by DNA ligase (see Figure 5-12).

The two pathways differ in the way in which they remove the damage from
DNA. The first pathway, called base excision repair, involves a battery of enzymes
called DNA glycosylases, each of which can recognize a specific type of altered
base in DNA and catalyze its hydrolytic removal. There are at least six types of
these enzymes, including those that remove deaminated Cs, deaminated As,
different types of alkylated or oxidized bases, bases with opened rings, and bases in
which a carbon-carbon double bond has been accidentally converted to a
carbon-carbon single bond. How is an altered base detected within the context of
the double helix? A key step is an enzyme-mediated “flipping-out” of the altered
nucleotide from the helix, which allows the DNA glycosylase to probe all faces of
the base for damage (Figure 5-42). It is thought that these enzymes travel along
DNA using base-flipping to evaluate the status of each base. Once an enzyme
finds the damaged base that it recognizes, it removes that base from its sugar.

The “missing tooth” created by DNA glycosylase action is recognized by an
enzyme called AP endonuclease (AP for apurinic or apyrimidinic, endo to signify
that the nuclease cleaves within the polynucleotide chain), which cuts the
phosphodiester backbone, after which the resulting gap is repaired (see Figure 5-41A).
Depurination, which is by far the most frequent type of damage suffered by DNA,
also leaves a deoxyribose sugar with a missing base. Depurinations are directly
repaired beginning with AP endonuclease, following the bottom half of the pathway
in Figure 5-41A.
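
The base excision repair steps described above can be sketched as a toy string model. This is only an illustration under simplifying assumptions, not a biochemical simulation: the two strands are written position by position so that index i on one strand pairs with index i on the other, `U` stands for a deaminated C, `_` stands for a sugar with a missing base, and the function name is hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def base_excision_repair(damaged, template):
    """Toy model of base excision repair on one strand.

    damaged[i] is assumed to pair with template[i].
    'U' marks a deaminated C; '_' marks an abasic (AP) site.
    """
    strand = list(damaged)

    # Step 1: a DNA glycosylase (here, uracil DNA glycosylase) removes the
    # altered base, leaving a sugar with a missing base.
    for i, base in enumerate(strand):
        if base == "U":
            strand[i] = "_"

    # Step 2: AP endonuclease and a phosphodiesterase cut out the sugar
    # phosphate. Depurinated sites ('_' in the input) enter the pathway here.
    # Step 3: DNA polymerase fills the one-nucleotide gap by reading the
    # undamaged strand, and DNA ligase seals the break.
    for i, base in enumerate(strand):
        if base == "_":
            strand[i] = COMPLEMENT[template[i]]

    return "".join(strand)


# A deaminated C (now U) opposite a G is restored to C.
print(base_excision_repair("ATGUA", "TACGT"))
# A depurinated site is repaired by the bottom half of the same pathway.
print(base_excision_repair("AT_CA", "TACGT"))
```

In this sketch both kinds of damage converge on the same gap-filling step, which mirrors the point made in the text that depurinations join the pathway at AP endonuclease.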

Figure 5-40 How chemical modifications of nucleotides produce mutations. (A) Deamination of cytosine, if uncorrected, results in the
substitution of one base for another when the DNA is replicated. As shown in Figure 5-38, deamination of cytosine produces uracil. Uracil differs
from cytosine in its base-pairing properties and preferentially base-pairs with adenine. The DNA replication machinery therefore adds an adenine
when it encounters a uracil on the template strand. (B) Depurination can lead to the loss of a nucleotide pair. When the replication machinery
encounters a missing purine on the template strand, it may skip to the next complete nucleotide as illustrated here, thus producing a nucleotide
deletion in the newly synthesized strand. Many other types of DNA damage (see Figure 5-37), if left uncorrected, also produce mutations when the
DNA is replicated.

Figure 5-41 A comparison of two major DNA repair pathways. (A) Base excision repair. This pathway starts with a DNA
glycosylase. Here, the enzyme uracil DNA glycosylase removes an accidentally deaminated cytosine in DNA. After the action
of this glycosylase (or another DNA glycosylase that recognizes a different kind of damage), the sugar phosphate with the
missing base is cut out by the sequential action of AP endonuclease and a phosphodiesterase. (These same enzymes begin
the repair of depurinated sites directly.) The gap of a single nucleotide is then filled by DNA polymerase and DNA ligase. The
net result is that the U that was created by accidental deamination is restored to a C. AP endonuclease is so-named because
it recognizes any site in the DNA helix that contains a deoxyribose sugar with a missing base; such sites can arise either by
the loss of a purine (apurinic sites) or by the loss of a pyrimidine (apyrimidinic sites). (B) Nucleotide excision repair. In bacteria,
after a multienzyme complex has recognized a lesion such as a pyrimidine dimer (see Figure 5-39), one cut is made on each
side of the lesion, and an associated DNA helicase then removes the entire portion of the damaged strand. The excision repair
machinery in bacteria leaves the gap of 12 nucleotides shown. In humans, once the damaged DNA is recognized, a helicase is
recruited to unwind the DNA duplex locally. Next, the excision nuclease enters and cleaves on either side of the damage, leaving
a gap of about 30 nucleotides. The nucleotide excision repair machinery in both bacteria and humans can recognize and repair
many different types of DNA damage.


The second major repair pathway is called nucleotide excision repair. This
mechanism can repair the damage caused by almost any large change in the
structure of the DNA double helix. Such “bulky lesions” include those created by
the covalent reaction of DNA bases with large hydrocarbons (such as the carcinogen
benzopyrene, found in tobacco smoke, coal tar, and diesel exhaust), as well as
the various pyrimidine dimers (T-T, T-C, and C-C) caused by sunlight. In this pathway,
a large multienzyme complex scans the DNA for a distortion in the double
helix, rather than for a specific base change. Once it finds a lesion, it cleaves the
phosphodiester backbone of the abnormal strand on both sides of the distortion,
and a DNA helicase peels away the single-strand oligonucleotide containing the
lesion. The large gap produced in the DNA helix is then repaired by DNA polymerase
and DNA ligase (see Figure 5-41B).
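
The cut, peel, and refill logic of nucleotide excision repair can be illustrated with a short sketch. The assumptions are the same toy ones as before and are not taken from the text: strands are strings paired position by position, lowercase letters mark the bases involved in a bulky lesion, and the `flank` parameter (a hypothetical name) sets how far each cut lies from the lesion. A flank of 5 around a two-base dimer happens to give a 12-nucleotide excised fragment, matching the bacterial gap size quoted in Figure 5-41B.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def nucleotide_excision_repair(damaged, template, lesion_start, lesion_end, flank=5):
    """Toy model of nucleotide excision repair.

    damaged[i] is assumed to pair with template[i]. The lesion occupies
    damaged[lesion_start:lesion_end]. Returns (repaired_strand, excised_oligo).
    """
    # The repair complex cuts the damaged strand on both sides of the distortion.
    cut_left = max(0, lesion_start - flank)
    cut_right = min(len(damaged), lesion_end + flank)

    # A helicase peels away the single-strand oligonucleotide carrying the lesion.
    excised = damaged[cut_left:cut_right]

    # DNA polymerase refills the gap from the undamaged strand; ligase seals it.
    patch = "".join(COMPLEMENT[base] for base in template[cut_left:cut_right])
    repaired = damaged[:cut_left] + patch + damaged[cut_right:]
    return repaired, excised


original = "GATTACAGGCTTAACGGATC"
template = "".join(COMPLEMENT[base] for base in original)
damaged = "GATTACAGGCttAACGGATC"  # 'tt' marks a thymine dimer at positions 10-11

repaired, excised = nucleotide_excision_repair(damaged, template, 10, 12)
print(repaired == original, len(excised))
```

Unlike the base excision sketch, the function never inspects which base is damaged; it only needs to know where the distortion lies, which is the distinction the text draws between the two pathways.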

An alternative to base and nucleotide excision repair processes is direct chemical
reversal of DNA damage, and this strategy is selectively employed for the rapid
removal of certain highly mutagenic or cytotoxic lesions. For example, the alkylation
lesion O6-methylguanine has its methyl group removed by direct transfer to
a cysteine residue in the repair protein itself, which is destroyed in the reaction.
In another example, methyl groups in the alkylation lesions 1-methyladenine and
3-methylcytosine are “burnt off” by an iron-dependent demethylase, with release
of formaldehyde from the methylated DNA and regeneration of the native base.

Coupling Nucleotide Excision Repair to Transcription Ensures That
the Cell’s Most Important DNA Is Efficiently Repaired 

All of a cell's DNA is under constant surveillance for damage, and the repair mechanisms
we have described act on all parts of the genome. However, cells have a
way of directing DNA repair to the DNA sequences that are most urgently needed.
They do this by linking RNA polymerase, the enzyme that transcribes DNA into
RNA as the first step in gene expression, to the nucleotide excision repair pathway.
As discussed above, this repair system can correct many different types of DNA
damage. RNA polymerase stalls at DNA lesions and, through the use of coupling
proteins, directs the excision repair machinery to these sites. In bacteria, where
genes are relatively short, the stalled RNA polymerase can be dissociated from the
DNA; the DNA is repaired, and the gene is transcribed again from the beginning.
In eukaryotes, where genes can be enormously long, a more complex reaction is
used to “back up” the RNA polymerase, repair the damage, and then restart the
polymerase.

The importance of transcription-coupled excision repair is seen in people
with Cockayne syndrome, which is caused by a defect in this coupling. These individuals
suffer from growth retardation, skeletal abnormalities, progressive neural
retardation, and severe sensitivity to sunlight. Most of these problems are thought
to arise from RNA polymerase molecules that become permanently stalled at sites
of DNA damage that lie in important genes.

The Chemistry of the DNA Bases Facilitates Damage Detection

The DNA double helix seems optimal for repair. As noted above, it contains a
backup copy of all genetic information. Equally importantly, the nature of the
four bases in DNA makes the distinction between undamaged and damaged
bases very clear. For example, every possible deamination event in DNA yields
an “unnatural” base, which can be directly recognized and removed by a specific
DNA glycosylase. Hypoxanthine, for example, is the simplest purine base capable
of pairing specifically with C, but hypoxanthine is the direct deamination product
of A (Figure 5-43A). The addition of a second amino group to hypoxanthine


Figure 5-42 The recognition of an unusual nucleotide in DNA by base-flipping.
The DNA glycosylase family of enzymes recognizes specific inappropriate
bases in the conformation shown. Each of these enzymes cleaves the glycosyl bond
that connects a particular recognized base (yellow) to the backbone sugar, removing it
from the DNA. (A) Stick model; (B) space-filling model.

Figure 5-43 The deamination of DNA nucleotides. In each case, the oxygen atom that is
added in this reaction with water is colored red. (A) The spontaneous deamination products of
A and G are recognizable as unnatural when they occur in DNA and thus are readily found and
repaired. The deamination of C to U was also illustrated in Figure 5-38; T has no amino group to
remove. (B) About 3% of the C nucleotides in vertebrate DNAs are methylated to help in controlling
gene expression (discussed in Chapter 7). When these 5-methyl C nucleotides are accidentally
deaminated, they form the natural nucleotide T. However, this T will be paired with a G on the
opposite strand, forming a mismatched base pair.

produces G, which cannot be formed from A by spontaneous deamination, and
whose deamination product (xanthine) is likewise unique.

As discussed in Chapter 6, RNA is thought, on an evolutionary time scale,
to have served as the genetic material before DNA, and it seems likely that the
genetic code was initially carried in the four nucleotides A, C, G, and U. This raises
the question of why the U in RNA was replaced in DNA by T (which is 5-methyl U).
We have seen that the spontaneous deamination of C converts it to U, but that this
event is rendered relatively harmless by uracil DNA glycosylase. However, if DNA
contained U as a natural base, the repair system would not be able to distinguish
a deaminated C from a naturally occurring U.
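
The argument about U versus T can be made concrete with a small sketch. It assumes a deliberately naive damage scanner that flags any base outside the genome's natural alphabet; the function names are hypothetical and the model ignores base-pairing context entirely.

```python
def deaminate_c(strand, position):
    """Convert the C at the given position to U, as spontaneous deamination does."""
    if strand[position] != "C":
        raise ValueError("only a C can be deaminated to U in this sketch")
    return strand[:position] + "U" + strand[position + 1:]


def find_unnatural_bases(strand, natural_bases):
    """Return positions whose base does not belong to the natural alphabet."""
    return [i for i, base in enumerate(strand) if base not in natural_bases]


# Real DNA uses T, so a U produced by deamination stands out as damage.
dna_with_t = deaminate_c("ATCGT", 2)
print(find_unnatural_bases(dna_with_t, "ACGT"))

# A hypothetical DNA that used U as a natural base could not flag the same event.
dna_with_u = deaminate_c("AUCGU", 2)
print(find_unnatural_bases(dna_with_u, "ACGU"))
```

The first scan reports the damaged position, whereas the second reports nothing because the deaminated C is indistinguishable from the surrounding natural U residues.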

A special situation occurs in vertebrate DNA, in which selected C nucleotides
are methylated at specific CG sequences that are associated with inactive
genes (discussed in Chapter 7). The accidental deamination of these methylated
C nucleotides produces the natural nucleotide T (Figure 5-43B) in a mismatched
base pair with a G on the opposite DNA strand. To help in repairing deaminated
methylated C nucleotides, a special DNA glycosylase recognizes a mismatched
base pair involving T in the sequence T-G and removes the T. This DNA repair
mechanism must be relatively ineffective, however, because methylated C nucleotides
are exceptionally common sites for mutations in vertebrate DNA. It is striking
that, even though only about 3% of the C nucleotides in human DNA are methylated,
mutations in these methylated nucleotides account for about one-third of
the single-base mutations that have been observed in inherited human diseases.
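
A rough calculation shows how striking these numbers are. The 3% and one-third figures come from the text; the fraction of all bases that are C (set to 0.2 here) is an assumed round number introduced only for illustration, and mutations catalogued in inherited diseases are not an unbiased sample, so the result should be read as an order-of-magnitude estimate at best.

```python
def per_site_mutation_enrichment(methylated_fraction_of_c, mutation_share, c_fraction_of_bases):
    """Ratio of per-site mutation frequency at methylated C to that at all other sites."""
    # Fraction of all nucleotides that are methylated C.
    methylated_sites = methylated_fraction_of_c * c_fraction_of_bases
    other_sites = 1.0 - methylated_sites

    # Mutations per unit of sequence in each class of site.
    rate_methylated = mutation_share / methylated_sites
    rate_other = (1.0 - mutation_share) / other_sites
    return rate_methylated / rate_other


# 0.03 and 1/3 are from the text; 0.2 is an assumption, not a value from the text.
enrichment = per_site_mutation_enrichment(0.03, 1 / 3, 0.2)
print(f"methylated C sites are roughly {enrichment:.0f}-fold more mutation-prone")
```

Under these assumptions a methylated C is many tens of times more likely to appear as a mutated site than an average nucleotide elsewhere, which is consistent with the text's conclusion that repair of deaminated methyl-C is relatively ineffective.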

Special Translesion DNA Polymerases Are Used in Emergencies

If a cell's DNA suffers heavy damage, the repair mechanisms that we have discussed
are often insufficient to cope with it. In these cases, a different strategy is
called into play, one that entails some risk to the cell. The highly accurate replicative
DNA polymerases stall when they encounter damaged DNA, and in emergencies
cells employ versatile, but less accurate, backup polymerases, known as
translesion polymerases, to replicate through the DNA damage.

Human cells have seven translesion polymerases, some of which can recognize
a specific type of DNA damage and correctly add the nucleotide required to
restore the initial sequence. Others make only “good guesses,” especially when the
template base has been extensively damaged. These enzymes are not as accurate
as the normal replicative polymerases when they copy a normal DNA sequence.
For one thing, the translesion polymerases lack exonucleolytic proofreading
activity; in addition, many are much less discriminating than the replicative polymerase
in choosing which nucleotide to incorporate initially. Presumably for this
reason, each such translesion polymerase is given a chance to add only one or a
few nucleotides before the highly accurate replicative polymerase resumes DNA
synthesis.

Despite their usefulness in allowing heavily damaged DNA to be replicated,
these translesion polymerases do, as noted above, pose risks to the cell. They are
probably responsible for most of the base-substitution and single-nucleotide
deletion mutations that accumulate in genomes; although they generally produce
mutations when copying damaged DNA (see Figure 5-40), they probably also create
mutations, at a low level, on undamaged DNA. Clearly, it is important for
the cell to tightly regulate these polymerases, releasing them only at sites of DNA
damage. Exactly how this happens for each translesion polymerase remains to
be discovered, but a conceptual model is given in Figure 5-44. The principle of
this model applies to many of the DNA repair processes discussed in this chapter:
because the enzymes that carry out these reactions are potentially dangerous to
the genome, they must be brought into play only at sites of damage.
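
The polymerase hand-off described above can be sketched as a loop in which an accurate copier stalls at a damaged base and a guessing copier takes over for exactly one position. Everything specific here is an assumption made for illustration: `X` marks an unreadable template base, the translesion step is modeled as a uniformly random guess, the new strand is written aligned to the template rather than antiparallel, and the function name is hypothetical.

```python
import random

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def replicate_with_translesion(template, rng=None):
    """Copy a template, switching to a translesion step only at damaged bases.

    Returns (new_strand, positions_filled_by_translesion_polymerase).
    """
    rng = rng or random.Random(0)
    new_strand = []
    translesion_positions = []

    for i, base in enumerate(template):
        if base in COMPLEMENT:
            # Replicative polymerase: accurate copying of an undamaged base.
            new_strand.append(COMPLEMENT[base])
        else:
            # The replicative polymerase stalls; a translesion polymerase adds
            # a single nucleotide (a "good guess"), then hands control back.
            new_strand.append(rng.choice("ACGT"))
            translesion_positions.append(i)

    return "".join(new_strand), translesion_positions


new_strand, bypassed = replicate_with_translesion("ATGXCA")
print(new_strand, bypassed)
```

Restricting the guessing step to the damaged position is the point of the sketch: the potentially mutagenic enzyme is used only where the accurate one cannot proceed.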

Double-Strand Breaks Are Efficiently Repaired 

An especially dangerous type of DNA damage occurs when both strands of the
double helix are broken, leaving no intact template strand to enable accurate

Figure 5-44 Translesion DNA polymerases can use damaged
templates. According to this model, a replicative polymerase stalled at a site of
DNA damage is recognized by the cell as needing rescue. Specialized enzymes
covalently modify the sliding clamp (typically, it is ubiquitylated; see Figure
3-69), which releases the replicative DNA polymerase and, together with damaged
DNA, attracts a translesion polymerase specific to that type of damage. Once the
damaged DNA is bypassed, the covalent modification of the clamp is removed, the
translesion polymerase dissociates, and the replicative polymerase is brought back
into play.


repair. Ionizing radiation, replication errors, oxidizing agents, and other metabolites
produced in the cell cause breaks of this type. If these lesions were left unrepaired,
they would quickly lead to the breakdown of chromosomes into smaller
fragments and to loss of genes when the cell divides. However, two distinct mechanisms
have evolved to deal with this type of damage (Figure 5-45). The simplest to
understand is nonhomologous end joining, in which the broken ends are simply
brought together and rejoined by DNA ligation, generally with the loss of nucleotides
at the site of joining (Figure 5-46). This end-joining mechanism, which can
be seen as a “quick and dirty” solution to the repair of double-strand breaks, is
common in mammalian somatic cells. Although a change in the DNA sequence
(a mutation) results at the site of breakage, so little of the mammalian genome is
essential for life that this mechanism is apparently an acceptable solution to the
problem of rejoining broken chromosomes. By the time a human reaches the age
of 70, the typical somatic cell contains over 2000 such “scars,” distributed throughout
its genome, representing places where DNA has been inaccurately repaired by
nonhomologous end joining. But nonhomologous end joining presents another
danger: because there seems to be no mechanism to ensure that two ends being
joined were originally next to each other in the genome, nonhomologous end
joining can occasionally generate rearrangements in which one broken chromosome
becomes covalently attached to another. This can result in chromosomes
with two centromeres and chromosomes lacking centromeres altogether; both

Figure 5-45 Two ways to repair double-strand breaks. (A) Nonhomologous end
joining alters the original DNA sequence when repairing a broken chromosome. The
initial degradation of the broken DNA ends is important because the nucleotides at the
site of the initial break are often damaged and cannot be ligated. Nonhomologous
end joining usually takes place when cells have not yet duplicated their DNA.
(B) Repairing double-strand breaks by homologous recombination is more difficult
to accomplish but restores the original DNA sequence. It typically takes place after the
DNA has been duplicated (when a duplex template is available) but before the cell
has divided. Details of the homologous recombination pathway are presented in
the following section (see Figure 5-48).
[Panel labels: (A) deletion of DNA sequence; (B) damage repaired accurately using
sister chromatid as the template.]

types of aberrant chromosomes are missegregated during cell division. As previously
discussed, the specialized structure of telomeres prevents the natural ends
of chromosomes from being mistaken for broken DNA and “repaired” in this way.
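
The “quick and dirty” character of nonhomologous end joining described above can be seen in a minimal sketch. The model is an assumption-laden toy: a chromosome is a single string, a double-strand break splits it in two, and the hypothetical `resect_left` and `resect_right` parameters stand for the damaged nucleotides trimmed from each end before ligation.

```python
def nonhomologous_end_joining(left_end, right_end, resect_left=2, resect_right=2):
    """Trim damaged nucleotides from two broken ends and ligate them directly."""
    # Degrade the damaged nucleotides at each broken end.
    left = left_end[:len(left_end) - resect_left]
    right = right_end[resect_right:]
    # Ligate without consulting any template, so the trimmed bases are lost.
    return left + right


chromosome = "ATGGCCTTAAGGCCTA"
left_end, right_end = chromosome[:8], chromosome[8:]  # double-strand break

joined = nonhomologous_end_joining(left_end, right_end)
print(joined, len(chromosome) - len(joined))
```

Note that the function joins whatever two ends it is given. Nothing checks that they came from the same chromosome, which is the source of the rearrangements mentioned in the text.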

A much more accurate type of double-strand break repair occurs in newly replicated
DNA (Figure 5-45B). Here, the DNA is repaired using the sister chromatid
as a template. This reaction is an example of homologous recombination, and we
consider its mechanism later in this chapter. Most organisms employ both nonhomologous
end joining and homologous recombination to repair double-strand
breaks in DNA. Nonhomologous end joining predominates in humans; homologous
recombination is used only during and shortly after DNA replication (in
S and G2 phases), when sister chromatids are available to serve as templates.

Figure 5-46 Nonhomologous end joining. (A) A central role is played by the
Ku protein, a heterodimer that grasps the broken chromosome ends. The
additional proteins shown are needed to hold the broken ends together while
they are processed and eventually joined covalently. (B) Three-dimensional structure
of a Ku heterodimer bound to the end of a duplex DNA fragment. The Ku protein is
also essential for V(D)J joining, a specific recombination process through which
antibody and T cell receptor diversity is generated in developing B and T cells
(discussed in Chapter 24). V(D)J joining and nonhomologous end joining show many
similarities in mechanism but the former relies on specific double-strand breaks
produced deliberately by the cell.
(B, from J.R. Walker, R.A. Corpina, and J. Goldberg, Nature 412:607-614, 2001.
With permission from Macmillan Publishers Ltd.)

DNA Damage Delays Progression of the Cell Cycle

We have just seen that cells contain multiple enzyme systems that can recognize
and repair many types of DNA damage (Movie 5.7). Because of the importance
of maintaining intact, undamaged DNA from generation to generation, eukaryotic
cells have an additional mechanism that maximizes the effectiveness of their
DNA repair enzymes: they delay progression of the cell cycle until DNA repair is
complete. As discussed in detail in Chapter 17, the orderly progression of the cell
cycle is stopped if damaged DNA is detected, and it restarts when the damage has
been repaired. Thus, in mammalian cells, the presence of DNA damage can block
entry from G1 into S phase, it can slow S phase once it has begun, and it can block
the transition from G2 phase to M phase. These delays facilitate DNA repair by
providing the time needed for the repair to reach completion.

DNA damage also results in an increased synthesis of some DNA repair
enzymes. This response depends on special signaling proteins that sense DNA
damage and up-regulate the appropriate DNA repair enzymes. The importance
of this mechanism is revealed by the phenotype of humans who are born with
defects in the gene that encodes the ATM protein. These individuals have the
disease ataxia telangiectasia (AT), the symptoms of which include neurodegeneration,
a predisposition to cancer, and genome instability. The ATM protein is
a large kinase needed to generate the intracellular signals that sound the alarm
in response to many types of spontaneous DNA damage (see Figure 17-62), and
individuals with defects in this protein therefore suffer from the effects of unrepaired
DNA lesions.

Summary 

Genetic information can be stored stably in DNA sequences only because a large
set of DNA repair enzymes continuously scan the DNA and replace any damaged
nucleotides. Most types of DNA repair depend on the presence of a separate copy
of the genetic information in each of the two strands of the DNA double helix. An
accidental lesion on one strand can therefore be cut out by a repair enzyme and a
corrected strand resynthesized by reference to the information in the undamaged
strand.

Most of the damage to DNA bases is excised by one of two major DNA repair
pathways. In base excision repair, the altered base is removed by a DNA glycosylase
enzyme, followed by excision of the resulting sugar phosphate. In nucleotide excision
repair, a small section of the DNA strand surrounding the damage is removed
from the DNA double helix as an oligonucleotide. In both cases, the gap left in the
DNA helix is filled in by the sequential action of DNA polymerase and DNA ligase,
using the undamaged DNA strand as the template. Some types of DNA damage can
be repaired by a different strategy, the direct chemical reversal of the damage,
which is carried out by specialized repair proteins. When DNA damage is excessive,
a special class of inaccurate DNA polymerases, called translesion polymerases, is
used to bypass the damage, allowing the cell to survive but sometimes creating
permanent mutations at the sites of damage.

Other critical repair systems, based on either nonhomologous end joining or
homologous recombination, reseal the accidental double-strand breaks that occur
in the DNA helix. In most cells, an elevated level of DNA damage causes a delay in
the cell cycle, which ensures that DNA damage is repaired before a cell divides.

HOMOLOGOUS RECOMBINATION 

In the two preceding sections, we discussed the mechanisms that allow the DNA
sequences in cells to be maintained from generation to generation with very little
change. In this section, we further explore one of the DNA repair mechanisms,
a diverse set of reactions known collectively as homologous recombination. The
key feature of homologous recombination (also known as general recombination)
is an exchange of DNA strands between a pair of homologous duplex DNA
sequences, that is, segments of double helix that are very similar or identical in
nucleotide sequence. This exchange allows one stretch of duplex DNA to act as
a template to restore lost or damaged information on a second stretch of duplex
DNA. Because the template for repair is not limited to the strand complementary
to that containing the damage, homologous recombination can repair many types
of DNA damage. It is, for example, the main way to accurately repair double-strand
breaks, as introduced in the previous section (see Figure 5-45B). Double-strand
breaks can result from radiation and reactive chemicals, but most of the time they
arise from DNA replication forks that become stalled or broken independently
of any such external cause. Homologous recombination accurately corrects these
accidents and, because they occur during nearly every round of DNA replication,
this repair mechanism is essential for every proliferating cell. Homologous
recombination is perhaps the most versatile DNA repair mechanism available to
the cell; the “all-purpose” nature of recombinational repair probably explains why
its mechanism and the proteins that carry it out have been conserved in virtually
all cells on Earth.

Additionally, we shall see that homologous recombination plays a special role
in sexually reproducing organisms. During meiosis, a key step in gamete (sperm
and egg) production, it catalyzes the orderly exchange of bits of genetic information
between corresponding (homologous) maternal and paternal chromosomes
to create new combinations of DNA sequences in the chromosomes passed to the
offspring.

Homologous Recombination Has Common Features in All Cells

The current view of homologous recombination as a critical DNA repair mechanism
in all cells evolved slowly from its original discovery as a key component in
the specialized process of meiosis in plants and animals. The subsequent recognition
that homologous recombination also occurs in unicellular organisms made
it much more amenable to molecular analyses. Thus, most of what we know about
the biochemistry of genetic recombination was originally derived from studies of
bacteria, especially of E. coli and its viruses, as well as from experiments with simple
eukaryotes such as yeasts. For these organisms with short generation times
and relatively small genomes, it was possible to isolate a large set of mutants with
defects in their recombination processes. The protein altered in each mutant was
then identified and, ultimately, studied biochemically. Close relatives of these
proteins have been found in more complex eukaryotes including flies, mice, and
humans, and more recently, it has been possible to directly analyze homologous
recombination in these species as well. These studies reveal that the fundamental
processes that catalyze homologous recombination are common to all cells.

DNA Base-Pairing Guides Homologous Recombination 

The hallmark of homologous recombination is that it takes place only between
DNA duplexes that have extensive regions of sequence similarity (homology). Not
surprisingly, base-pairing underlies this requirement, and two DNA duplexes that
are undergoing homologous recombination “sample” each other's DNA sequence
by engaging in extensive base-pairing between a single strand from one DNA
duplex and the complementary single strand from the other. The match need not
be perfect, but it must be very close for homologous recombination to succeed.

In its simplest form, this type of base-pairing interaction can be mimicked in
a test tube by allowing a DNA double helix to re-form from its separated single
strands. This process, called DNA renaturation or hybridization, occurs when a
rare random collision juxtaposes complementary nucleotide sequences on two
matching DNA single strands, allowing the formation of a short stretch of double
helix between them. This relatively slow helix-nucleation step is followed by a very
rapid “zippering” step, as the region of double helix is extended to maximize the
number of base-pairing interactions (Figure 5-47).
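
The two-stage process just described, a rare nucleation followed by rapid zippering, can be sketched in code. This is a simplified illustration and not a kinetic model: both strands are written 5' to 3', only the fully aligned register is examined, and the `seed_len` parameter (a hypothetical name and value) sets how many consecutive base pairs count as a nucleation event.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the sequence that pairs with seq, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def hybridize(strand_a, strand_b, seed_len=4):
    """Return the (start, end) span of strand_a that ends up double-helical.

    Returns None if no nucleation site is found in the aligned register.
    """
    # strand_a pairs with strand_b wherever it equals strand_b's reverse complement.
    target = reverse_complement(strand_b)
    n = min(len(strand_a), len(target))

    for start in range(n - seed_len + 1):
        # Helix nucleation: a short stretch of perfect complementarity.
        if strand_a[start:start + seed_len] == target[start:start + seed_len]:
            left, right = start, start + seed_len
            # Zippering: extend the helix in both directions while bases pair.
            while left > 0 and strand_a[left - 1] == target[left - 1]:
                left -= 1
            while right < n and strand_a[right] == target[right]:
                right += 1
            return left, right
    return None


strand = "ATGCGTAC"
print(hybridize(strand, reverse_complement(strand)))
print(hybridize("AAAAAAAA", "AAAAAAAA"))
```

The search for a seed is the slow, trial-and-error part of the sketch, while extension from an accepted seed is a simple deterministic walk, mirroring the contrast between nucleation and zippering in the text.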

DNA hybridization can create a region of DNA double helix consisting of
strands that originate from two different duplex DNA molecules as long as they
are complementary, or nearly so. As we will see shortly, the formation of such a
hybrid molecule, known as a heteroduplex, is an essential feature of homologous
recombination. DNA hybridization and heteroduplex formation are also the basis
for many of the methods used to study cells, and we will discuss these uses in
Chapter 8.

The DNA in a living cell is almost all in the stable double-helical form, so the
reaction depicted in Figure 5-47 rarely occurs in vivo. Instead, as we shall see, 
homologous recombination is brought about through a carefully controlled set of 
reactions that allow two DNA duplexes to sample each other's sequences without 
fully dissociating into single strands. 

Homologous Recombination Can Flawlessly Repair Double-Strand 
Breaks in DNA 

We saw in the previous section that nonhomologous end-joining occurs without a
template and usually leaves a mutation at the site at which a double-strand break
is repaired. In contrast, homologous recombination can repair double-strand
breaks accurately, without any loss or alteration of nucleotides at the site of repair.
For homologous recombination to do this repair job, the broken DNA has to be
brought into proximity with homologous but unbroken DNA, which can serve as a
template for repair. For this reason, homologous recombination often occurs just
after DNA replication, when the two daughter DNA molecules lie close together
and one can serve as a template for repair of the other. As we shall see, the process
of DNA replication itself creates a special risk of accidents requiring this sort of
repair.

The simplest pathway through which homologous recombination can repair
double-strand breaks is shown in Figure 5-48. In essence, the broken DNA duplex
and the template duplex carry out a “strand dance” so that one of the damaged
strands can use the complementary strand of the intact DNA duplex as a template
for repair. First, the ends of the broken DNA are chewed back, or “resected,”
by specialized nucleases to produce overhanging, single-strand 3' ends. The next
step is strand exchange (also called strand invasion), during which one of the
single-strand 3' ends from the damaged DNA molecule worms its way into the
template duplex and searches it for homologous sequences through base-pairing.
We describe this remarkable reaction in detail in the next section. Once stable
base-pairing is established (which completes the strand exchange step), an
accurate DNA polymerase extends the invading strand by using the information
provided by the undamaged template molecule, thus restoring the damaged
DNA. The last steps (strand displacement, further repair synthesis, and ligation)
restore the two original DNA double helices and complete the repair process.
Homologous recombination resembles other DNA repair reactions in that a


Figure 5-47 DNA hybridization. DNA double helices can re-form from their
separated strands in a reaction that depends on the random collision of
two complementary DNA strands. The vast majority of such collisions are not
productive, as shown on the left, but a few result in a short region where
complementary base pairs have formed (helix nucleation). A rapid zippering then
leads to the formation of a complete double helix. Through this trial-and-error process,
a DNA strand will find its complementary partner even in the midst of millions of
nonmatching DNA strands.

Figure 5-48 Mechanism of double-strand break repair by homologous
recombination. This is the preferred method for repairing DNA double-strand
breaks that arise shortly after the DNA has been replicated, while the daughter DNA
molecules are still held close together. In general, homologous recombination can be
regarded as a flexible series of reactions, with the exact pathway differing from one
case to the next. For example, the length of the repair “patch” can vary considerably
depending on the extent of 5' processing and new DNA synthesis, indicated in green.


DNA polymerase utilizes a pristine template to restore damaged DNA. However,
instead of using the partner complementary strand as a template, as occurs in
most DNA repair pathways, homologous recombination exploits a complementary
strand from a separate DNA duplex.
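
The repair pathway outlined above (resection, homology search, templated synthesis, ligation) can be sketched with the same toy string model used for the other pathways. The simplifications are considerable and are assumptions of this sketch: each duplex is reduced to a single string, resection is modeled as simple loss of nucleotides from both broken ends, the `homology` window is a hypothetical parameter, and the sister chromatid is assumed to contain each search window exactly once.

```python
def repair_by_homologous_recombination(left, right, sister, resect=2, homology=6):
    """Restore a broken sequence by copying the missing stretch from a sister copy."""
    # Resection: nucleases chew back the broken ends.
    left = left[:len(left) - resect]
    right = right[resect:]

    # Strand invasion: the end of the left fragment searches the sister
    # chromatid for a matching (homologous) stretch.
    probe = left[-homology:]
    match = sister.find(probe)
    if match < 0:
        raise ValueError("no homology found for the invading end")
    synthesis_start = match + homology

    # Repair synthesis: copy from the sister until the sequence at the start
    # of the right fragment is reached, then ligate.
    anchor = right[:homology]
    synthesis_end = sister.find(anchor, synthesis_start)
    if synthesis_end < 0:
        raise ValueError("no homology found for the second end")

    return left + sister[synthesis_start:synthesis_end] + right


sister = "ACGTTGCAAGCTTAGGCATCGATCCGTA"
broken_left, broken_right = sister[:14], sister[14:]  # double-strand break

repaired = repair_by_homologous_recombination(broken_left, broken_right, sister)
print(repaired == sister)
```

Although nucleotides are lost at both ends during resection, the repaired sequence is identical to the original because the missing stretch is recopied from the intact sister copy rather than simply discarded.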

Strand Exchange Is Carried Out by the RecA/Rad51 Protein 

Of all the steps of homologous recombination, strand exchange is the most difficult
to imagine. How does the invading single strand rapidly sample a DNA duplex
for homology? Once the homology is found, how does the exchange occur? How
is the inherent stability of the template double helix overcome?

The answers to these questions came from biochemical and structural studies
of the protein that carries out these feats, called RecA in E. coli and Rad51 in
virtually all eukaryotic organisms. To catalyze strand exchange, RecA first binds
cooperatively to the invading single strand, forming a protein-DNA filament
that forces the DNA into an unusual configuration: groups of three consecutive
nucleotides are held as though they were in a conventional DNA double helix
but, between adjacent triplets, the DNA backbone is untwisted and stretched out
(Figure 5-49). This unusual protein-DNA filament then binds to duplex DNA 
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single-strand DNA in 
RecA-bound form 



Figure 5-49 Strand invasion catalyzed 
by the RecA protein. Our understanding 
of this reaction is based in part on 
structures determined by x-ray diffraction 
studies of RecA bound to single- and 
double-strand DNA. These DNA structures 
(shown without the RecA protein) are 
on the left side of the diagram. Starting 
at the top, ATP-bound RecA associâtes 
with single-strand DNA, holding it in an 
elongated form where groups of three 
bases are separated from each other by 
a stretched and twisted backbone. In the 
next step, the RecA-bound single strand 
then binds to duplex DNA, destabilizing it 
and allowing the single strand to sample 
its sequence through base-pairing, three 
bases at a time. If no match is found, the 
RecA-bound single strand of DNA rapidly 
dissociates and begins a new search. If 
an extensive match is found, the structure 
is disassembled through ATP hydrolysis, 
resulting in the dissociation of RecA 
and the exchange of one single strand 
of DNA for another, thereby forming a 
heteroduplex. (PDB code: 3CMX.) 


in a way that stretches the duplex, destabilizing it and making it easy to pull the 
strands apart. The invading single strand then can sample the sequence of the 
duplex by conventional base-pairing. This sampling occurs in triplet nucleotide 
blocks: if a triplet match is found, the adjacent triplet is sampled, and so on. In 
this way, mismatches quickly lead to dissociation and only an extended stretch of 
base-pairing (at least 15 nucleotides) stabilizes the invading strand and leads to 
strand exchange. 
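
The triplet-by-triplet search described above can be illustrated with a short Python sketch. It is a toy model under stated assumptions, not a description of the biochemistry: the function name find_stable_pairing and its parameters are hypothetical, the duplex is represented by the strand whose sequence is identical to the invading strand (so a correct base pair appears as simple string equality), and "dissociation" is modeled as moving on to the next position in the duplex.

```python
from typing import Optional


def find_stable_pairing(invading: str, duplex_strand: str,
                        min_match: int = 15) -> Optional[int]:
    """Return the first offset in duplex_strand where pairing is stable, else None.

    duplex_strand is written as the strand whose sequence is identical to the
    invading strand, so a correct base pair shows up as string equality.
    """
    for offset in range(len(duplex_strand) - min_match + 1):
        matched = 0
        # Sample the duplex three nucleotides at a time.
        for i in range(0, len(invading) - 2, 3):
            if invading[i:i + 3] != duplex_strand[offset + i:offset + i + 3]:
                break  # a mismatched triplet: dissociate and try a new position
            matched += 3
            if matched >= min_match:
                return offset  # extended pairing stabilizes the invading strand
    return None


# Illustrative sequences only (not taken from the text).
duplex = "ACGTTGCA" + "GGCTTAACGTAGCTAGGA" + "TTCCA"
print(find_stable_pairing("GGCTTAACGTAGCTAGGA", duplex))  # perfect 18-nucleotide match
print(find_stable_pairing("GGCTTATCGTAGCTAGGA", duplex))  # mismatch in the third triplet
```

In this sketch a single mismatched triplet ends the attempt at that position, which mirrors the point made in the text: only an extended run of matching triplets (here at least 15 nucleotides) counts as a stable pairing.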

RecA hydrolyzes ATP, and the steps described above require that each RecA 
monomer along the filament be in the ATP-bound state. However, the searching 
itself does not require ATP hydrolysis; instead, the process occurs by simple 
molecular collision, allowing many potential sequences to be rapidly sampled. 
Once the strand-exchange reaction is completed, however, ATP hydrolysis is necessary 
to disassemble RecA from the complex of DNA molecules. At this point, 
repair DNA polymerases and DNA ligase can complete the repair process, as 
shown in Figure 5-48. 

Homologous Recombination Can Rescue Broken DNA Replication 
Forks 

Although accurately repairing double-strand breaks, which can arise from radiation 
or chemical reactions, is a crucial function of homologous recombination, 
perhaps its most important role is in rescuing stalled or broken DNA replication 
forks. Many types of events can cause a replication fork to break, and here we consider 
just one example: a single-strand nick or gap in the parental DNA helix just 
ahead of a replication fork. When the fork reaches this lesion, it falls apart, resulting 
in one broken and one intact daughter chromosome. The broken fork can be 
flawlessly repaired (Figure 5-50) using the same basic homologous recombination 
reactions we discussed above for the repair of double-strand breaks. With 
slight modifications, the set of reactions depicted in Figures 5-48 and 5-50, 
known collectively as homologous recombination, can accurately repair many 
different types of DNA damage. 

Cells Carefully Regulate the Use of Homologous Recombination in 
DNA Repair 

Although homologous recombination neatly solves the problem of accurately 
repairing double-strand breaks and other types of DNA damage, it does present 







Figure 5-50 Repair of a broken replication fork by homologous 
recombination. When a moving replication fork encounters a single-strand 
break, it will collapse, but can be repaired by homologous recombination. 
The process uses many of the same reactions shown in Figure 5-48 and 
proceeds through the same basic steps. Green strands represent the new 
DNA synthesis that takes place after the replication fork has broken. This 
pathway allows the fork to move past the site that was nicked on the original 
template by using the undamaged duplex as a template to synthesize DNA. 
(Adapted from M.M. Cox, Proc. Natl Acad. Sci. USA 98:8173-8180, 2001. 
With permission from National Academy of Sciences.) 


some dangers to the cell as it sometimes “repairs” damage using the wrong bit 
of the genome as the template. For example, sometimes a broken human chromosome 
is “repaired” using the homolog from the other parent instead of the 
sister chromatid as the template. Because maternal and paternal chromosomes 
differ in DNA sequence at many positions along their lengths, this type of repair 
can convert the sequence of the repaired DNA from the maternal to the paternal 
sequence or vice versa. The result of this type of errant recombination is known as 
loss of heterozygosity. It can have severe consequences if the homolog used for 
repair contains a deleterious mutation, because the recombination event destroys 
the “good” copy. Loss of heterozygosity, although rare, is a critical step in the formation 
of many cancers (discussed in Chapter 20). 

Cells go to great lengths to minimize the risk of mishaps of these types; indeed, 
nearly every step of homologous recombination is carefully regulated. For example, 
the first step, processing of the broken ends, is coordinated with the cell cycle: 
the nuclease enzymes that carry out this process are activated (in part, by phosphorylation) 
only in the S and G2 phases of the cell cycle, when a daughter duplex 
(either as a partially replicated chromosome or a fully replicated sister chromatid) 
can serve as a template for repair (see Figure 5-50). The close proximity of the two 
daughter chromosomes disfavors the use of other genome sequences in the repair 
process. 

The loading of RecA or Rad51 onto the processed DNA ends and the subsequent 
strand-exchange reaction are also tightly controlled. Although these proteins 
alone can carry out these steps in vitro, a series of accessory proteins, including 
Rad52, is needed in eukaryotic cells to ensure that homologous recombination 
is efficient and accurate (Figure 5-51). There are many such accessory proteins, 
and exactly how they coordinate and control homologous recombination remains 
a mystery. We do know that the enzymes that catalyze recombinational repair are 
made at relatively high levels in eukaryotes and are dispersed throughout the 
nucleus in an inactive form. In response to DNA damage, they rapidly converge 
on the sites of DNA damage, become activated, and form “repair factories” where 
many lesions are apparently brought together and repaired (Figure 5-52). 

In Chapter 20, we shall see that both too much and too little homologous 
recombination can lead to cancer in humans, the former through repair using the 
“wrong” template (as described above) and the latter through an increased mutation 
rate caused by inefficient DNA repair. Clearly, a delicate balance has evolved 
that keeps this process in check on undamaged DNA, while still allowing it to act 
efficiently and rapidly on DNA lesions as soon as they arise. 

Not surprisingly, mutations in the components that carry out and regulate 
homologous recombination are responsible for several inherited forms of cancer. 
Two of these, the Brca1 and Brca2 proteins, were first discovered because 


Figure 5-51 Structure of a portion of the Rad52 protein. This doughnut-shaped 
structure is composed of 11 subunits. Single-strand DNA has been 
modeled into the deep groove running along the protein surface. Rad52 
helps load Rad51 onto single-strand DNA to form the nucleoprotein filament 
that carries out strand exchange. Rad52 also acts later to re-form the double 
helix and complete the homologous recombination reaction. (From 
M.R. Singleton et al., Proc. Natl Acad. Sci. USA 99:13492-13497, 2002. 
With permission from National Academy of Sciences.) 

mutations in their genes lead to a greatly increased frequency of breast cancer. 
Because these mutations cause inefficient repair by homologous recombination, 
accumulation of DNA damage can, in a small proportion of cells, give rise to a 
cancer. Brca1 regulates an early step in broken-end processing; without it, such 
ends are not processed correctly for homologous recombination and instead are 
repaired inaccurately by the nonhomologous end-joining pathway (see Figure 
5-45). Brca2 binds to the Rad51 protein, preventing its polymerization on DNA, 
and thereby maintaining it in an inactive form until it is needed. Normally, upon 
DNA damage, Brca2 helps to bring Rad51 protein rapidly to sites of damage and, 
once in place, to release it in its active form onto single-strand DNA. 

Homologous Recombination Is Crucial for Meiosis 

We have seen that homologous recombination comprises a group of reactions 
(including broken-end processing, strand exchange, limited DNA synthesis, and 
ligation) to exchange DNA sequences between two double helices of similar 
nucleotide sequence. Having discussed its role in accurately repairing damaged 
DNA, we now turn to homologous recombination as a means to generate DNA 
molecules that carry novel combinations of genes as a result of the deliberate 
exchange of material between different chromosomes. Although this occasionally 
occurs by accident in mitotic cells (and is often detrimental), it is a frequent and 
necessary part of meiosis, which occurs in sexually reproducing organisms such 
as fungi, plants, and animals. 

Here, homologous recombination occurs as an integral part of the process 
whereby chromosomes are parceled out to germ cells (sperm and eggs in animals). 
We discuss the process of meiosis in detail in Chapter 17; in the following 
sections, we discuss how homologous recombination during meiosis produces 
chromosome crossing-over and gene conversion, resulting in hybrid chromosomes 
that contain genetic information from both the maternal and paternal homologs 
(Figure 5-53). Crossing-over and gene conversion are both generated by homologous 
recombination mechanisms that, at their core, resemble those used to repair 
double-strand breaks. 

Meiotic Recombination Begins with a Programmed Double-Strand 
Break 

Homologous recombination in meiosis starts with a bold stroke: a specialized 
protein (called Spo11 in budding yeast) breaks both strands of the DNA double 
helix in one of the recombining chromosomes (Figure 5-54). Like a topoisomerase, 
Spo11, after catalyzing this reaction, remains covalently bound to the broken 

Figure 5-52 Experiment demonstrating 
the rapid localization of repair proteins 
to DNA double-strand breaks. Human 
fibroblasts were x-irradiated to produce DNA 
double-strand breaks. Before the x-rays 
struck the cells, they were passed through 
a microscopic grid with x-ray-absorbing 
“bars” spaced 1 µm apart. This produced a 
striped pattern of DNA damage, allowing a 
comparison of damaged and undamaged 
DNA in the same nucleus. (A) Total DNA in 
a fibroblast nucleus stained with the dye 
DAPI. (B) Sites of new DNA synthesis due 
to repair of DNA damage, indicated by 
incorporation of BudR (a thymidine analog) 
and subsequent staining with fluorescently 
labeled antibodies to BudR (green). 
(C) Localization of the Mre11 complex to 
damaged DNA as visualized by antibodies 
against the Mre11 subunit (red). Mre11 is a 
nuclease that processes damaged DNA in 
preparation for homologous recombination 
(see Figure 5-48). (A), (B), and (C) were 
processed 30 minutes after x-irradiation. 
(From B.E. Nelms et al., Science 280:590- 
592, 1998. With permission from AAAS.) 


Figure 5-53 Chromosome crossing-over 
occurs in meiosis. Meiosis is the process 
by which a diploid cell gives rise to four 
haploid germ cells, as described in detail 
in Chapter 17. Meiosis produces germ 
cells in which the paternal and maternal 
genetic information (red and blue) has 
been reassorted through chromosome 
crossovers. In addition, many short regions 
of gene conversion occur, as indicated. 

Figure 5-54 Homologous recombination 
during meiosis can generate 
chromosome crossovers. Once the 
meiosis-specific protein Spo11 and the 
Mre11 complex break the duplex DNA 
and process the ends, homologous 
recombination can proceed along 
alternative pathways. One (right side of 
figure) closely resembles the double-strand 
break repair reaction shown in Figure 5-48 
and results in chromosomes that have been 
“repaired” but have not crossed over. The 
other (left side with strand breaks as shown 
by the blue arrows) proceeds through a 
double Holliday junction and produces 
two chromosomes that have crossed over. 
During meiosis, homologous recombination 
takes place between maternal and paternal 
chromosome homologs when they are held 
tightly together (see Figure 17-54). 

DNA (see Figure 5-21). A specialized nuclease then rapidly degrades the ends 
bound by Spo11, removing the protein along with the DNA and leaving protruding 
3' single-strand ends. 

At this point, many of the recombination reactions resemble those described 
above for the repair of double-strand breaks; indeed, some of the same proteins are 
used for both processes. However, several meiosis-specific proteins direct them 
to perform their tasks somewhat differently, resulting in the distinctive outcomes 
observed for meiosis. Another important difference is that, in meiosis, recombination 
occurs preferentially between maternal and paternal chromosomal homologs 
rather than between the newly replicated, identical DNA duplexes that pair 
in double-strand break repair. In the sections that follow, we describe in more 
detail those aspects of homologous recombination that are especially important 
for meiosis. 


Figure 5-55 A Holliday junction. The initially 
formed structure (A) is usually drawn with 
two strands crossing, as in Figure 5-54. 
An isomerization of the Holliday junction 
(B) produces an open, symmetrical structure 
that is bound by specialized proteins. 
(C) These proteins “move” the Holliday 
junctions by a coordinated set of branch-migration 
reactions (see Figure 5-57 and 
Movie 5.8). (D) Structure of the Holliday 
junction in the open form depicted in (B). 
The Holliday junction is named for the 
scientist who first proposed its formation. 
(PDB code: 1DCW.) 


Holliday Junctions Are Formed During Meiosis 

Of special importance in meiosis is an intermediate known as a Holliday junction 
or cross-strand exchange (Figure 5-55). Each Holliday junction can adopt 
multiple conformations and a special set of recombination proteins binds to, and 
thereby stabilizes, the open, symmetric isomer. 

Specialized proteins that bind to Holliday junctions can catalyze a reaction 
known as branch migration (Figure 5-56), whereby DNA is spooled through 
the Holliday junction by continually breaking and re-forming base pairs (Figure 
5-57). In this way, the Holliday junction proteins use ATP hydrolysis to expand the 
region of heteroduplex DNA initially created by the strand-exchange reaction. In 
meiosis, heteroduplex regions often “migrate” thousands of nucleotides from the 
original site of the double-strand break. As shown in Figure 5-54, Holliday junctions 
usually occur in pairs, known as double Holliday junctions. 

Homologous Recombination Produces Both Crossovers and 
Non-Crossovers During Meiosis 

As shown in Figure 5-54, there are two basic outcomes of homologous recombination 
during meiosis. In humans, approximately 90% of the double-strand 
breaks produced during meiosis are resolved as non-crossovers (see right side of 
Figure 5-54). Here, the two original DNA duplexes separate from each other in a 
form unaltered except for a region of heteroduplex that formed near the site of the 
original double-strand break. This set of reactions resembles that described above 
for the repair of double-strand breaks (see Figure 5-48). 

The other outcome is more profound: a double Holliday junction is formed 
and is cleaved by specialized enzymes to create a crossover (see left side of Figure 
5-54). The two original portions of each chromosome upstream and downstream 




Figure 5-56 Simplified view of branch 
migration. In branch migration, base pairs 
are continually broken and formed as the 
branch point moves. Although branch 
migration can happen spontaneously on 
naked DNA molecules, the process is 
inefficient and the branch moves back and 
forth at random. In the cell, branch migration 
is carried out using specialized proteins and 
ATP hydrolysis to ensure that, as shown, the 
branch moves rapidly and in one direction. 
As shown in Figure 5-57, branch migrations 
often occur at Holliday junctions, where two 
branch-migration reactions are coupled. 

Figure 5-57 Enzyme-catalyzed branch 
movement at a Holliday junction by 
branch migration. In E. coli, a tetramer of 
the RuvA protein (green) and two hexamers 
of the RuvB protein (yellow) bind to the 
open form of the junction. The RuvB 
protein, which resembles the hexameric 
helicases used in DNA replication (Figure 
5-14), uses the energy of ATP hydrolysis 
to spool DNA rapidly through the Holliday 
junction, extending the heteroduplex region 
as shown. The RuvA protein coordinates 
this movement, threading the DNA strands 
to avoid tangling. (PDB codes: 1IXR, 1C7Y.) 


from the two Holliday junctions are thereby swapped, creating two chromosomes 
that have crossed over. 
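
The difference between the two outcomes can be made concrete with a minimal Python sketch. It rests on simplifying assumptions that are not part of the text: each homolog is reduced to a string of marker letters (uppercase for maternal, lowercase for paternal), the recombination site is a single index, and the heteroduplex region itself is ignored. The function name resolve_event is hypothetical.

```python
from typing import Tuple


def resolve_event(maternal: str, paternal: str, site: int,
                  crossover: bool) -> Tuple[str, str]:
    """Return the two chromatids after a recombination event at index `site`."""
    if len(maternal) != len(paternal) or not 0 <= site <= len(maternal):
        raise ValueError("homologs must be equal length and site must lie within them")
    if crossover:
        # The arms on one side of the junctions are swapped between the homologs.
        return (maternal[:site] + paternal[site:],
                paternal[:site] + maternal[site:])
    # Non-crossover: the flanking markers keep their original linkage.
    return maternal, paternal


print(resolve_event("ABCDEF", "abcdef", 3, crossover=True))
print(resolve_event("ABCDEF", "abcdef", 3, crossover=False))
```

In the crossover case the markers on either side of the site now derive from different homologs, whereas in the non-crossover case the flanking markers are unchanged.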

How does the cell decide which Spo11-induced double-strand breaks to 
resolve as crossovers? The answer is not yet known, but we know the decision 
is an important one. The relatively few crossovers that do form are distributed 
along chromosomes in such a way that a crossover in one position inhibits crossing-over 
in neighboring regions. Termed crossover control, this fascinating but 
poorly understood regulatory mechanism ensures the roughly even distribution 
of crossover points along chromosomes. It also ensures that each chromosome, 
no matter how small, undergoes at least one crossover every meiosis. For many 
organisms, roughly two crossovers per chromosome occur during each meiosis, 
one on each arm. As discussed in detail in Chapter 17, these crossovers play an 
important mechanical role in the proper segregation of chromosomes during 
meiosis. 

Whether a meiotic recombination event is resolved as a crossover or a 
non-crossover, the recombination machinery leaves behind a heteroduplex region 
where a strand with the DNA sequence of the paternal homolog is base-paired 
with a strand from the maternal homolog (Figure 5-58). These heteroduplex 
regions can tolerate a small percentage of mismatched base pairs, and because of 
branch migration, they often extend for thousands of nucleotide pairs. The many 
non-crossover events that occur in meiosis thereby produce scattered sites in the 
germ cells where short DNA sequences from one homolog have been pasted into 
the other homolog. Heteroduplex regions mark sites of potential gene conversion, 
where the four haploid chromosomes produced by meiosis contain three 
copies of a DNA sequence from one homolog and only one copy of this sequence 
from the other homolog (see Figure 5-53), as explained next. 

Figure 5-58 Heteroduplexes formed during meiosis. Heteroduplex DNA is present at sites 
of recombination that are resolved either as crossovers or non-crossovers. Because the DNA 
sequences of maternal and paternal chromosomes differ at many positions along their lengths, 
heteroduplexes often contain a small number of base-pair mismatches. 

Figure 5-59 Gene conversion caused by mismatch correction. In 
this process, heteroduplex DNA is formed at the sites of homologous 
recombination between maternal and paternal chromosomes. If the maternal 
and paternal DNA sequences are slightly different, the heteroduplex region 
will include some mismatched base pairs, which may then be corrected 
by the DNA mismatch repair machinery (see Figure 5-19). Such repair can 
“erase” nucleotide sequences on either the paternal or the maternal strand. 
The consequence of this mismatch repair is gene conversion, detected as 
a deviation from the segregation of equal copies of maternal and paternal 
alleles that normally occurs in meiosis. 


Homologous Recombination Often Results in Gene Conversion 

In sexually reproducing organisms, it is a fundamental law of genetics that (aside 
from mitochondrial DNA, which is inherited only through the mother) each 
parent makes an equal genetic contribution to an offspring. One complete set of 
nuclear genes is inherited from the father and one complete set is inherited from 
the mother. Underlying this law is the accurate parceling out of chromosomes to 
the germ cells (eggs and sperm) that takes place during meiosis. Thus, when a diploid 
cell in a parent undergoes meiosis to produce four haploid germ cells, exactly 
half of the genes distributed among these four cells should be maternal (genes 
inherited from the mother of this parent) and the other half paternal (genes inherited 
from the father of this parent). In some organisms (fungi, for example), it is 
possible to recover and analyze all four of the haploid gametes produced from a 
single cell by meiosis. Studies in such organisms have revealed rare cases in which 
the parceling out of genes violates the standard genetic rules. Occasionally, for 
example, meiosis yields three copies of the maternal version of a gene and only 
one copy of the paternal allele. Alternative versions of the same gene are called 
alleles, and it is the divergence from their expected distribution during meiosis 
that is known as gene conversion. Genetic studies show that only small sections 
of DNA typically undergo gene conversion, and in many cases only a part of a 
gene is changed. 

Several pathways in the cell can lead to gene conversion, but one of the most 
important arises from a particular consequence of recombination during meiosis. 
We have seen that both crossovers and non-crossovers produce heteroduplex 
regions of DNA. If the two strands that make up a heteroduplex region do not have 
identical nucleotide sequences, mismatched base pairs are formed, and these are 
often repaired by the cell's mismatch repair system (see Figure 5-19). However, the 
mismatch repair system cannot distinguish between the paternal and maternal 
strands and will randomly choose the strand to be used as a template. As a consequence, 
one allele will be lost and the other duplicated (Figure 5-59), resulting 
in net “conversion” of one allele to the other. Thus, gene conversion, originally 
regarded as a mysterious deviation from the rules of genetics, can be seen as a 
straightforward consequence of the mechanisms of homologous recombination. 
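
As a rough illustration of how random strand choice turns a 2:2 tetrad into a 3:1 tetrad, consider the following Python sketch. The model is deliberately simplified and its assumptions are ours rather than the text's: a heteroduplex is assumed to have formed at the gene of interest on exactly one paternal chromatid, alleles are written as "M" (maternal) and "P" (paternal), and the function names tetrad_after_repair and conversion_fraction are hypothetical.

```python
import random
from collections import Counter


def tetrad_after_repair(repair_template: str) -> Counter:
    """Allele counts in the four meiotic products after mismatch repair.

    Assumes a single heteroduplex on one paternal chromatid: one strand carries
    the maternal allele "M" and the other the paternal allele "P".
    """
    if repair_template not in ("M", "P"):
        raise ValueError("repair_template must be 'M' or 'P'")
    # Two maternal chromatids, one untouched paternal chromatid, and the
    # repaired chromatid, which ends up matching the chosen template strand.
    return Counter(["M", "M", "P", repair_template])


def conversion_fraction(n_meioses: int, seed: int = 0) -> float:
    """Fraction of simulated meioses that depart from 2:2 segregation."""
    rng = random.Random(seed)
    converted = sum(
        tetrad_after_repair(rng.choice("MP"))["M"] != 2 for _ in range(n_meioses)
    )
    return converted / n_meioses


print(tetrad_after_repair("M"))   # repair copies the maternal strand
print(tetrad_after_repair("P"))   # repair restores the paternal allele
print(conversion_fraction(10_000))
```

Because the template strand is chosen at random in this model, about half of the heteroduplexes are repaired back to the original allele and leave no trace, while the other half produce a 3:1 tetrad. The simulation is conditional on a heteroduplex having formed at the gene, which is itself a rare event.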
Summary 

Homologous recombination describes a flexible set of reactions resulting in the 
exchange of DNA sequences between a pair of identical or nearly identical duplex 
DNA molecules. In all cells, this process is essential for the error-free repair of chromosome 
damage, particularly double-strand breaks and broken or stalled replication 
forks. Homologous recombination is also responsible for the crossing-over of 
chromosomes that occurs during meiosis. Homologous recombination takes place 
through a variety of pathways, but they have in common a strand-exchange step 
whereby a single strand from one DNA duplex invades a second duplex and base-pairs 
with one strand while displacing the other. This reaction, catalyzed by the 
RecA/Rad51 family of proteins, can only occur if the invading strand can form a 
short stretch of consecutive nucleotide pairs with one of the strands of the duplex. 
This requirement ensures that homologous recombination occurs only between 
identical or very similar DNA sequences. When used as a repair mechanism, 
homologous recombination occurs between a damaged DNA molecule and its 
recently duplicated sister molecule, with the undamaged duplex acting as a 
template to repair the damaged copy flawlessly. 

In meiosis, homologous recombination is initiated by deliberate, carefully regulated 
double-strand breaks and occurs preferentially between the homologous chromosomes 
rather than the newly replicated sister chromatids. The outcome can be 
either two chromosomes that have crossed over (that is, chromosomes in which the 
DNA on either side of the site of DNA pairing originates from two different homologs) 
or two non-crossover chromosomes. In the latter case, the two chromosomes 
that result are identical to the original two homologs, except for relatively minor 
DNA sequence changes at the site of recombination. 

TRANSPOSITION AND CONSERVATIVE SITE-SPECIFIC 
RECOMBINATION 

We have seen that homologous recombination can result in the exchange of DNA 
sequences between chromosomes. However, the order of genes on the interacting 
chromosomes typically remains the same following homologous recombination, 
inasmuch as the recombining sequences must be very similar for the 
process to occur. In this section, we describe two very different types of recombination, 
transposition (also called transpositional recombination) and conservative 
site-specific recombination, that do not require substantial regions of DNA 
homology. These two types of recombination reactions can alter gene order along 
a chromosome and can cause unusual types of mutations that introduce whole 
blocks of DNA sequence into the genome. 

Transposition and conservative site-specific recombination are largely dedicated 
to moving a wide variety of specialized segments of DNA, collectively 
termed mobile genetic elements, from one position in a genome to another. We 
will see that mobile genetic elements can range in size from a few hundred to tens 
of thousands of nucleotide pairs, and each typically carries a unique set of genes. 
Often, one of these genes encodes a specialized enzyme that catalyzes the movement 
of only that element, thereby making this type of recombination possible. 

Virtually all cells contain mobile genetic elements (known informally as 
“jumping genes”). As explained in Chapter 4, over evolutionary time scales, they 
have had a profound effect on the shaping of modern genomes. For example, 
nearly half of the human genome can be traced to these elements (see Figure 
4-62). Over time, random mutation has altered their nucleotide sequences, and, 
as a result, only a few of the many copies of these elements in our DNA are still 
active and capable of movement. The remainder are molecular fossils whose existence 
provides striking clues to our evolutionary history. 

Mobile genetic elements are often considered to be molecular parasites (they 
are also termed “selfish DNA”) that persist because cells cannot get rid of them; 
they certainly have come close to overrunning our own genome. However, mobile 
DNA elements can provide benefits to the cell. For example, the genes they carry 
are sometimes advantageous, as in the case of antibiotic resistance in bacterial 
cells, discussed below. The movement of mobile genetic elements also produces 
many of the genetic variants upon which evolution depends, because, in addition 
to moving themselves, mobile genetic elements occasionally rearrange neighboring 
sequences of the host genome. Thus, spontaneous mutations observed in Drosophila, 
humans, and other organisms are often due to the movement of mobile 
genetic elements. While many of these mutations will be deleterious to the organism, 
some will be advantageous and may spread throughout the population. It is 
almost certain that much of the variety of life we see around us originally arose 
from the movement of mobile genetic elements. 

In this section, we introduce mobile genetic elements and describe the mechanisms 
that enable them to move around a genome. We shall see that some of 
these elements move through transposition mechanisms and others through conservative 
site-specific recombination. We begin with transposition, as there are 
many more known examples of this type of movement. 

Through Transposition, Mobile Genetic Elements Can Insert Into 
Any DNA Sequence 

Mobile elements that move by way of transposition are called transposons, or 
transposable elements. In transposition, a specific enzyme, usually encoded 
by the transposon itself and typically called a transposase, acts on specific DNA 
sequences at each end of the transposon, causing it to insert into a new target 
DNA site. Most transposons are only modestly selective in choosing their target 
site, and they can therefore insert themselves into many different locations in a 
genome. In particular, there is no general requirement for sequence similarity 
between the ends of the element and the target sequence. Most transposons move 
only rarely. In bacteria, where it is possible to measure the frequency accurately, 
transposons typically move once every 10^5 cell divisions. More frequent movement 
would probably destroy the host cell's genome. 
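
To get a feel for what a rate of about one move per 10^5 cell divisions implies, the following Python sketch computes the chance that a given transposon has moved at least once along a lineage of n divisions, using P = 1 - (1 - p)^n. This is a back-of-the-envelope calculation under our own assumptions: each division is treated as an independent trial with the same probability p = 10^-5, and the function name prob_at_least_one_move is hypothetical.

```python
def prob_at_least_one_move(n_divisions: int, rate_per_division: float = 1e-5) -> float:
    """P(at least one transposition in n_divisions), assuming independent divisions."""
    if n_divisions < 0 or not 0.0 <= rate_per_division <= 1.0:
        raise ValueError("need n_divisions >= 0 and a rate between 0 and 1")
    return 1.0 - (1.0 - rate_per_division) ** n_divisions


for n in (1_000, 100_000, 1_000_000):
    print(n, round(prob_at_least_one_move(n), 4))
```

Under these assumptions a move is very unlikely over a thousand divisions, but becomes more likely than not once the number of divisions approaches the reciprocal of the rate.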

On the basis of their structure and transposition mechanism, transposons 
can be grouped into three large classes: DNA-only transposons, retroviral-like 
retrotransposons, and nonretroviral retrotransposons. The differences among them 
are briefly outlined in Table 5-4, and each class will be discussed in turn. 

DNA-Only Transposons Can Move by a Cut-and-Paste 
Mechanism 

DNA-only transposons, so named because they exist only as DNA during their 
movement, predominate in bacteria, and they are largely responsible for the 
spread of antibiotic resistance in bacterial strains. When antibiotics like penicillin 
and streptomycin first became widely available in the 1950s, most bacteria that 
caused human disease were susceptible to them. Now, the situation is different: 
antibiotics such as penicillin (and its modern derivatives) are no longer effective 
against many modern bacterial strains, including those causing gonorrhea and 
bacterial pneumonia. The spread of antibiotic resistance is due largely to genes 

These elements range in length from 1000 to about 12,000 nucleotide pairs. Each family contains many members, only a few of which are listed 
here. Some viruses can also move in and out of host-cell chromosomes by transpositional mechanisms. These viruses are related to the first two 
classes of transposons. 

































that encode antibiotic-inactivating enzymes that are carried on transposons (Figure 
5-60). Although these mobile elements can transpose only within cells that 
already carry them, they can be moved from one cell to another through other 
mechanisms known collectively as horizontal gene transfer (see Figure 1-19). 
Once introduced into a new cell, a transposon can insert itself into the genome 
and be faithfully passed on to all progeny cells through the normal processes of 
DNA replication and cell division. 

DNA-only transposons can relocate from a donor site to a target site by cut-and-paste 
transposition (Figure 5-61). Here, the transposon is literally excised 
from one spot on a genome and inserted into another. This reaction produces a 
short duplication of the target DNA sequence at the insertion site; these direct 
repeat sequences that flank the transposon serve as convenient records of prior 
transposition events. Such “signatures” often provide valuable clues in identifying 
transposons in genome sequences. 
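
The way an insertion leaves direct repeats behind, and how those repeats can later serve as a signature, can be sketched in Python. This is a string-level illustration only, under assumptions of our own: the sequences are invented, the length of the duplicated target segment (dup_len) is a free parameter, excision from the donor site is not modeled, and the function names insert_transposon and has_flanking_direct_repeat are hypothetical.

```python
def insert_transposon(target: str, transposon: str, pos: int, dup_len: int) -> str:
    """Insert `transposon` at `pos`, duplicating dup_len target bases on each side."""
    if pos < 0 or dup_len < 0 or pos + dup_len > len(target):
        raise ValueError("insertion site must lie within the target sequence")
    site = target[pos:pos + dup_len]
    # The short target segment ends up as a direct repeat flanking the element.
    return target[:pos] + site + transposon + site + target[pos + dup_len:]


def has_flanking_direct_repeat(genome: str, start: int, end: int, dup_len: int) -> bool:
    """True if the dup_len bases before `start` equal the dup_len bases after `end`."""
    if dup_len <= 0 or start < dup_len or end + dup_len > len(genome):
        return False
    return genome[start - dup_len:start] == genome[end:end + dup_len]


# Target in uppercase, transposon in lowercase so the junctions are easy to see.
genome = insert_transposon("AAACCGGTTT", "tntntn", pos=3, dup_len=4)
print(genome)
print(has_flanking_direct_repeat(genome, start=7, end=13, dup_len=4))
```

Scanning a genome sequence for such matching flanks is one simple way in which the "signatures" mentioned above could be recognized, although real transposon annotation relies on additional features such as the element's own terminal sequences.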

When a cut-and-paste DNA-only transposon is excised from its original location, 
it leaves behind a “hole” in the chromosome. This lesion can be perfectly 
healed by recombinational double-strand break repair (see Figure 5-48), provided 
that the chromosome has just been replicated and an identical copy of the 
damaged host sequence is available. Alternatively, a nonhomologous end-joining 
reaction can reseal the break; in this case, the DNA sequence that originally 
flanked the transposon is altered, producing a mutation at the chromosomal site 
from which the transposon was excised (see Figure 5-45). 

Remarkably, the same mechanism used to excise cut-and-paste transposons
from DNA has been found to operate in developing immune systems of

[Figure 5-61 labels: transposon in donor chromosome A; transposase monomers; short inverted repeat sequences; transpososome; target chromosome B; integrated transposon; short direct repeats of target DNA sequences in chromosome B]

Figure 5-60 Three of the many DNA-only
transposons found in bacteria. Each
of these mobile DNA elements contains
a gene that encodes a transposase, an
enzyme that carries out the DNA breakage
and joining reactions needed for the
element to move. Each transposon also
carries short DNA sequences (indicated
in red) that are recognized only by the
transposase encoded by that element
and are necessary for movement of the
element. In addition, two of the three
mobile elements shown carry genes
that encode enzymes that inactivate the
antibiotics ampicillin (AmpR)—a penicillin
derivative—and tetracycline (TetR). The
transposable element Tn10, shown in the
bottom diagram, is thought to have evolved
from the chance landing of two much
shorter mobile elements on either side of a
tetracycline-resistance gene.


Figure 5-61 Cut-and-paste
transposition. DNA-only transposons can
be recognized in chromosomes by the
"inverted repeat DNA sequences" (red)
present at their ends. These sequences,
which can be as short as 20 nucleotides,
are all that is necessary for the DNA
between them to be transposed by the
particular transposase enzyme associated
with the element. The cut-and-paste
movement of a DNA-only transposable
element from one chromosomal site to
another begins when the transposase
brings the two inverted DNA sequences
together, forming a DNA loop. Insertion
into the target chromosome, also catalyzed
by the transposase, occurs at a random
site through the creation of staggered
breaks in the target chromosome (purple
arrowheads). Following the transposition
reaction, the single-strand gaps created by
the staggered breaks are repaired by DNA
polymerase and ligase (black). As a result,
the insertion site is marked by a short direct
repeat of the target DNA sequence, as
shown. Although the break in the donor
chromosome (green) is repaired, this
process often alters the DNA sequence,
causing a mutation at the original site of the
excised transposable element (not shown).
vertebrates, catalyzing the DNA rearrangements that produce antibody and T cell
receptor diversity. Known as V(D)J recombination, this process will be discussed
in Chapter 24. Found only in vertebrates, V(D)J recombination is a relatively
recent evolutionary novelty, but it is believed to be derived from the much more
ancient cut-and-paste transposons.

Some Viruses Use a Transposition Mechanism to Move 
Themselves Into Host-Cell Chromosomes 

Certain viruses are considered mobile genetic elements because they use
transposition mechanisms to integrate their genomes into that of their host cell.
However, unlike transposons, these viruses encode proteins that package their genetic
information into virus particles that can infect other cells. Many of the viruses that
insert themselves into a host chromosome do so by employing one of the first two
mechanisms listed in Table 5-4; namely, by behaving like DNA-only transposons
or like retroviral-like retrotransposons. Indeed, much of our knowledge of these
mechanisms has come from studies of particular viruses that employ them.

Transposition has a key role in the life cycle of many viruses. Most notable are
the retroviruses, which include the human AIDS virus, HIV. Outside the cell, a
retrovirus exists as a single-strand RNA genome packed into a protein shell or capsid
along with a virus-encoded reverse transcriptase enzyme. During the infection
process, the viral RNA enters a cell and is converted to a double-strand DNA
molecule by the action of this crucial enzyme, which is able to polymerize DNA on
either an RNA or a DNA template (Figure 5-62). The term retrovirus refers to the
virus's ability to reverse the usual flow of genetic information, which normally is
from DNA to RNA (see Figure 1-4).
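
As a rough illustration of the two copying steps just described, the sketch below treats sequences as plain strings and applies base-pairing rules only. It is a deliberate simplification: priming, the RNA-DNA hybrid intermediate, and the terminal sequences of real retroviral genomes are all ignored, and the function name is hypothetical.

```python
RNA_TO_DNA = {"A": "T", "U": "A", "G": "C", "C": "G"}
DNA_TO_DNA = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_transcribe(rna: str) -> tuple[str, str]:
    """Return (first_strand, second_strand) of DNA, each written 5'->3'.

    The first strand is copied from the RNA template; the second strand
    is copied from the first, so it matches the RNA with T in place of U.
    """
    first_strand = "".join(RNA_TO_DNA[base] for base in reversed(rna))
    second_strand = "".join(DNA_TO_DNA[base] for base in reversed(first_strand))
    return first_strand, second_strand


first, second = reverse_transcribe("AUGGCU")
print(first)   # DNA strand complementary to the RNA
print(second)  # DNA strand with the same sequence as the RNA (T for U)
```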

Once the reverse transcriptase has produced a double-strand DNA molecule,
specific sequences near its two ends are recognized by a virus-encoded
Figure 5-62 The life cycle of a retrovirus. The retrovirus genome consists of an RNA molecule (blue) that is typically between 7000 and 12,000
nucleotides in length. It is packaged inside a protein capsid, which is surrounded by a lipid-based envelope that contains virus-encoded envelope
proteins (green). Inside an infected cell, the enzyme reverse transcriptase (red circle) first makes a DNA copy of the viral RNA molecule and then a
second DNA strand, generating a double-strand DNA copy of the RNA genome. The integration of this DNA double helix into the host chromosome
is then catalyzed by a virus-encoded integrase enzyme. This integration is required for the synthesis of new viral RNA molecules by the host-cell RNA
polymerase, the enzyme that transcribes DNA into RNA (discussed in Chapter 6).
transposase called integrase. Integrase then inserts the viral DNA into the
chromosome by a mechanism similar to that used by the cut-and-paste DNA-only
transposons (see Figure 5-61).


Retroviral-like Retrotransposons Resemble Retroviruses, but Lack 
a Protein Coat 

A large family of transposons called retroviral-like retrotransposons (see Table
5-4) move themselves in and out of chromosomes by a mechanism that is similar
to that used by retroviruses. These elements are present in organisms as diverse as
yeasts, flies, and mammals; unlike viruses, they have no intrinsic ability to leave
their resident cell but are passed along to all descendants of that cell through the
normal processes of DNA replication and cell division. The first step in their
transposition is the transcription of the entire transposon, producing an RNA copy of
the element that is typically several thousand nucleotides long. This transcript,
which is translated as a messenger RNA by the host cell, encodes a reverse
transcriptase enzyme. This enzyme makes a double-strand DNA copy of the RNA
molecule via an RNA-DNA hybrid intermediate, precisely mirroring the early stages
of infection by a retrovirus (see Figure 5-62). Like a retrovirus, the linear,
double-strand DNA molecule then integrates into a site on the chromosome using an
integrase enzyme that is also encoded by the element. The structure and
mechanisms of these integrases closely resemble those of the transposases of DNA-only
transposons.


A Large Fraction of the Human Genome Is Composed of 
Nonretroviral Retrotransposons 

A significant fraction of many vertebrate chromosomes is made up of repeated
DNA sequences. In human chromosomes, these repeats are mostly mutated and
truncated versions of nonretroviral retrotransposons, the third major type of
transposon (see Table 5-4). Although most of these transposons in the human
genome are immobile, a few retain the ability to move. Relatively recent
movements of the L1 element (sometimes referred to as a LINE or long interspersed
nuclear element) have been identified, some of which result in human disease;
for example, a particular type of hemophilia results from an L1 insertion into the
gene encoding the blood-clotting protein Factor VIII (see Figure 6-24).

Nonretroviral retrotransposons are found in many organisms and move via a
distinct mechanism that requires a complex of an endonuclease and a reverse
transcriptase. As illustrated in Figure 5-63, the RNA and reverse transcriptase
have a much more direct role in the recombination event than they do in the
retroviral-like retrotransposons described above.

Inspection of the human genome sequence reveals that the bulk of nonretroviral
retrotransposons—for example, the many copies of the Alu element, a member
of the SINE (short interspersed nuclear element) family—do not carry their
own endonuclease or reverse transcriptase genes. Nonetheless, they have
successfully amplified themselves to become major constituents of our genome,
presumably by pirating enzymes encoded by other transposons. Together the LINEs
and SINEs make up over 30% of the human genome (see Figure 4-62); there are
500,000 copies of the former and over a million of the latter.


Figure 5-63 Transposition by a nonretroviral retrotransposon.
Transposition of the L1 element (red) begins when an endonuclease attached
to the L1 reverse transcriptase (green) and the L1 RNA (blue) nick the target
DNA at the point at which insertion will occur. This cleavage releases a 3'-OH
DNA end in the target DNA, which is then used as a primer for the reverse
transcription step shown. This generates a single-strand DNA copy of the
element that is directly linked to the target DNA. In subsequent reactions,
further processing of the single-strand DNA copy results in the generation of
a new double-strand DNA copy of the L1 element that is inserted at the site
of the initial nick.
Different Transposable Elements Predominate in Different
Organisms

We have described several types of transposable elements: (1) DNA-only
transposons, the movement of which is based on DNA breaking and joining reactions; (2)
retroviral-like retrotransposons, which also move via DNA breakage and joining,
but where RNA has a key role as a template to generate the DNA recombination
substrate; and (3) nonretroviral retrotransposons, in which an RNA copy of the
element is central to the incorporation of the element into the target DNA, acting
as a direct template for a DNA target-primed reverse transcription event.

Intriguingly, different types of transposons predominate in different
organisms. For example, the vast majority of bacterial transposons are DNA-only types,
with a few related to the nonretroviral retrotransposons also present. In yeasts, the
main mobile elements are retroviral-like retrotransposons. In Drosophila,
DNA-based, retroviral, and nonretroviral transposons are all found. Finally, the human
genome contains all three types of transposon, but as discussed below, their
evolutionary histories are strikingly different.

Genome Sequences Reveal the Approximate Times at Which
Transposable Elements Have Moved

The nucleotide sequence of the human genome provides a rich fossil record of
the activity of transposons over evolutionary time spans. By carefully comparing
the nucleotide sequences of the approximately 3 million transposable element
remnants in the human genome, it has been possible to broadly reconstruct the
movements of transposons in our ancestors' genomes over the past several
hundred million years. For example, the DNA-only transposons appear to have been
very active well before the divergence of humans and Old World monkeys (25-35
million years ago), but because they gradually accumulated inactivating
mutations, they have been dormant in the human lineage since that time. Likewise,
although our genome is littered with relics of retroviral-like retrotransposons,
none appear to be active today. Only a single family of retroviral-like
retrotransposons is believed to have transposed in the human genome since the divergence
of human and chimpanzee approximately 6 million years ago. The nonretroviral
retrotransposons are also ancient, but in contrast to other types, some are still
moving in our genome, as mentioned previously. For example, it is estimated
that de novo movement of an Alu element is seen once in every 100-200 human
births. The movement of nonretroviral retrotransposons is responsible for a small
but significant fraction of new human mutations—perhaps two mutations out of
every thousand.

The situation in mice is significantly different. Although the mouse and human 
genomes contain roughly the same density of the three types of transposons, both 
types of retrotransposons are still actively transposing in the mouse genome, 
being responsible for approximately 10% of new mutations. 

Although we are only beginning to understand how the movements of
transposons have shaped the genomes of present-day mammals, it has been proposed
that bursts in transposition activity could have been responsible for critical
speciation events during the radiation of the mammalian lineages from a common
ancestor, a process that began approximately 170 million years ago. At present, we
can only wonder how many of our uniquely human qualities arose from the past
activity of the many mobile genetic elements whose remnants are found today
scattered throughout our chromosomes.

Conservative Site-Specific Recombination Can Reversibly 
Rearrange DNA 

A different kind of recombination mechanism, known as conservative site-specific
recombination, rearranges other types of mobile DNA elements. In this pathway,
breakage and joining occur at two special sites, one on each participating DNA
Figure 5-64 Two types of DNA rearrangement produced by conservative site-specific
recombination. The only difference between the reactions in (A) and (B) is the relative orientation of
the two short DNA sites (indicated by arrows) at which a site-specific recombination event occurs.
(A) Through an integration reaction, a circular DNA molecule can become incorporated into a
second DNA molecule; by the reverse reaction (excision), it can exit to re-form the original DNA
circle. Many bacterial viruses move in and out of their host chromosomes in this way.
(B) Conservative site-specific recombination can also invert a specific segment of DNA in a
chromosome. A well-studied example of DNA inversion through site-specific recombination occurs
in the bacterium Salmonella typhimurium, an organism that is a major cause of food poisoning in
humans; as described in the following section, the inversion of a DNA segment changes the type of
flagellum that is produced by the bacterium.


molecule. Depending on the positions and relative orientations of the two
recombination sites, DNA integration, DNA excision, or DNA inversion can occur
(Figure 5-64). Conservative site-specific recombination is carried out by specialized
enzymes that break and rejoin two DNA double helices at specific sequences on
each DNA molecule. The same enzyme system that joins two DNA molecules can
often take them apart again, precisely restoring the sequence of the two original
DNA molecules (see Figure 5-64A).
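
How the relative orientation of the two sites decides the outcome can be made concrete with a toy model. The sketch below is an illustration under simplifying assumptions, not a description of any particular recombinase: DNA is a list of labeled segments, each with an orientation of +1 or -1, and the two recombination sites are treated as indivisible tokens that keep their own orientation. The names `Segment`, `recombine`, and the label "site" are hypothetical.

```python
from typing import List, Optional, Tuple

Segment = Tuple[str, int]  # (label, orientation): +1 forward, -1 reverse


def recombine(
    dna: List[Segment], site: str = "site"
) -> Tuple[List[Segment], Optional[List[Segment]]]:
    """Recombine the two copies of `site` found on one linear DNA.

    Returns (linear_product, excised_circle). The circle is None when the
    reaction is an inversion rather than an excision.
    """
    positions = [i for i, (label, _) in enumerate(dna) if label == site]
    if len(positions) != 2:
        raise ValueError("expected exactly two recombination sites")
    i, j = positions
    between = dna[i + 1:j]
    if dna[i][1] == dna[j][1]:
        # Sites in the same orientation: the DNA between them leaves as a
        # circle that carries one site; the other site stays behind.
        return dna[:i + 1] + dna[j + 1:], between + [dna[j]]
    # Sites in opposite orientation: the DNA between them is inverted,
    # so segment order is reversed and each segment's orientation flips.
    flipped = [(label, -orient) for label, orient in reversed(between)]
    return dna[:i + 1] + flipped + dna[j:], None


direct = [("a", 1), ("site", 1), ("gene", 1), ("site", 1), ("b", 1)]
inverted = [("a", 1), ("site", 1), ("x", 1), ("y", 1), ("site", -1), ("b", 1)]
print(recombine(direct))    # excision: shortened chromosome plus a circle
print(recombine(inverted))  # inversion: x and y reversed and flipped
```

Applying `recombine` a second time to the inverted product restores the starting arrangement, which mirrors the reversibility noted in the text.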

Conservative site-specific recombination is often used by DNA viruses to move
their genomes in and out of the genomes of their host cells. When integrated into
its host genome, the viral DNA is replicated along with the host DNA and is
faithfully passed on to all descendant cells. If the host cell suffers damage (for example,
by UV irradiation), the virus can reverse the site-specific recombination reaction,
excise its genome, and package it into a virus particle. In this way, many viruses
can replicate themselves passively as a component of the host genome, but can
also "leave the sinking ship" by excising their genomes and packaging them in a
protective coat until a new, healthy host cell is encountered.

Several features distinguish conservative site-specific recombination from
transposition. First, conservative site-specific recombination requires specialized
DNA sequences on both the donor and recipient DNA (hence the term
site-specific). These sequences contain recognition sites for the particular recombinase
that will catalyze the rearrangement. In contrast, transposition requires only that
the transposon have a specialized sequence; for most transposons, the recipient
DNA can be of any sequence. Second, the reaction mechanisms are
fundamentally different. The recombinases that catalyze conservative site-specific
recombination resemble topoisomerases in the sense that they form transient high-energy
covalent bonds with the DNA and use this energy to complete the DNA
rearrangements (see Figure 5-21). Thus, all the phosphate bonds that are broken during a
recombination event are restored upon its completion (hence the term
conservative). Transposition, in contrast, does not proceed through a covalently joined
protein-DNA intermediate, and this process leaves gaps in the DNA that must be
repaired by DNA polymerases.
Conservative Site-Specific Recombination Can Be Used to Turn
Genes On or Off


Figure 5-65 Switching gene expression
by DNA inversion in bacteria.
Alternating transcription of two flagellin
genes in a Salmonella bacterium is
caused by a conservative site-specific
recombination event that inverts a small
DNA segment containing a promoter.
(A) In one orientation, the promoter
activates transcription of the H2 flagellin
gene as well as that of a repressor protein
that blocks the expression of the H1
flagellin gene. Promoters and repressors
are described in detail in Chapter 7; here
we note simply that a promoter is needed
to express a gene into protein and that
a repressor blocks this from happening.
(B) When the promoter is inverted, it no
longer turns on H2 or the repressor, and
the H1 gene, which is thereby released
from repression, is expressed instead.
The inversion reaction requires specific
DNA sequences (red) and a recombinase
enzyme that is encoded in the invertible
DNA segment. This site-specific
recombination mechanism is activated
only rarely (about once in every 10^5 cell
divisions). Therefore, the production of one
or the other flagellin tends to be faithfully
inherited in each clone of cells.


Many bacteria use conservative site-specific recombination to control the
expression of particular genes. A well-studied example occurs in Salmonella bacteria
and is known as phase variation. The switch in gene expression results from the
occasional inversion of a specific 1000-nucleotide-pair piece of DNA, brought
about by a conservative site-specific recombinase encoded in the Salmonella
genome. This change alters the expression of the cell-surface protein flagellin, for
which the bacterium has two different genes (Figure 5-65). The DNA inversion
changes the orientation of a promoter (a DNA sequence that directs transcription
of a gene) that is located within the inverted DNA segment. With the promoter in
one orientation, the bacteria synthesize one type of flagellin; with the promoter
in the other orientation, they synthesize the other type. The recombination
reaction is reversible, allowing bacterial populations to switch back and forth between
the two types of flagellin. Inversions occur only rarely, and because such changes
in the genome will be copied faithfully during all subsequent replication cycles,
entire clones of bacteria will have one type of flagellin or the other.
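
The switch logic can be written down as a small model. The sketch below makes two assumptions that go beyond the text: the promoter orientation is reduced to a single Boolean, and inversion is taken to be equally likely in both directions at a fixed probability per cell division. The H1/H2 assignment and the figure of about one inversion per 10^5 divisions follow Figure 5-65; the function names are hypothetical.

```python
def expressed_flagellin(promoter_forward: bool) -> str:
    """Return which flagellin gene is expressed for a promoter orientation.

    Forward: the promoter drives H2 and a repressor of H1, so H2 is made.
    Inverted: neither is driven, H1 is released from repression, so H1 is made.
    """
    return "H2" if promoter_forward else "H1"


def fraction_switched(p_invert: float, divisions: int) -> float:
    """Expected fraction of a lineage in the opposite orientation.

    Assumes a symmetric two-state switch: each division inverts the
    segment with probability p_invert, regardless of current orientation.
    """
    return (1.0 - (1.0 - 2.0 * p_invert) ** divisions) / 2.0


print(expressed_flagellin(True), expressed_flagellin(False))
# With roughly one inversion per 10^5 divisions, a clone grown for
# 20 divisions is still almost entirely of one flagellin type.
print(fraction_switched(1e-5, 20))
```

Because the switched fraction stays tiny over the lifetime of a clone, nearly every cell in a clone expresses the same flagellin, yet a large population always contains a few cells of the other type.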

Phase variation helps protect the bacterial population against the immune
response of its vertebrate host. If the host makes antibodies against one type of
flagellin, a few bacteria whose flagellin has been altered by gene inversion will still
be able to survive and multiply.


Bacterial Conservative Site-Specific Recombinases Have Become
Powerful Tools for Cell and Developmental Biologists

Like many of the mechanisms used by cells and viruses, site-specific
recombination has been put to work by scientists to study a wide variety of problems. To
decipher the roles of specific genes and proteins in complex multicellular
organisms, genetic engineering techniques are used to produce worms, flies, and mice
carrying a gene encoding a site-specific recombination enzyme plus a carefully
designed target DNA with the DNA sites that this enzyme recognizes. At an
appropriate time, the gene encoding the enzyme can be activated to rearrange the target
DNA sequence. Such a rearrangement is widely used to delete a specific gene in a
particular tissue of a multicellular organism (Figure 5-66). It is particularly useful
when the gene of interest plays a key role in the early development of many
tissues, and a complete deletion of the gene from the germ line would cause death
Figure 5-66 How a conservative site-specific recombination enzyme from bacteria is used
to delete specific genes from particular mouse tissues. This approach requires the insertion
of two specially engineered DNA molecules into the animal's germ line. The first contains the gene
for a recombinase (in this case, the Cre recombinase from the bacteriophage P1) under the control
of a tissue-specific promoter, which ensures that the recombinase is expressed only in that tissue.
The second DNA molecule contains the gene of interest flanked by recognition sites (in this case,
LoxP sites) for the recombinase. The mouse is engineered so that this is the only copy of this gene.
Therefore, if the recombinase is expressed only in the liver, the gene of interest will be deleted there,
and only there. The reaction that excises the gene is the same as that shown in Figure 5-64A.

As described in Chapter 7, many tissue-specific promoters are known; moreover, many of these
promoters are active only at specific times in development. Thus, it is possible to study the effect
of deleting specific genes at different times during the development of each tissue.


very early in development. The same strategy can also be used to inappropriately
express any specific gene in a tissue of interest; here, the triggered deletion joins
a strong transcriptional promoter to the gene of interest. With this tool one can in
principle determine the influence of any protein in any desired tissue of an intact
animal.

Summary 

The genomes of nearly all organisms contain mobile genetic elements that can move
from one position in the genome to another by either transpositional or
conservative site-specific recombination processes. In most cases, this movement is random
and happens at a very low frequency. Mobile genetic elements include transposons,
which move within a single cell (and its descendants), plus those viruses whose
genomes can integrate into the genomes of their host cells.

There are three classes of transposons: the DNA-only transposons, the
retroviral-like retrotransposons, and the nonretroviral retrotransposons. All but the last
have close relatives among the viruses. Although viruses and transposable elements
can be viewed as parasites, many of the new arrangements of DNA sequences that
their site-specific recombination events produce have played an important part in
creating the genetic variation crucial for the evolution of cells and organisms.


WHAT WE DON'T KNOW


• How does DNA replication contend
with all the other processes that occur
simultaneously on chromosomes,
including DNA repair and gene
transcription?

• What is the basis for the low
frequency of errors in DNA replication
observed in all cells? Is this the best
that cells can do given the speed of
replication and the limits of molecular
diffusion? Was this mutation rate
selected in evolution to provide genetic
variation?

• Cells have only one fundamental way
of replicating DNA but many different
ways of repairing it. Are there still
other, undiscovered ways that cells
have for repairing DNA?

• Do the many "dead" transposons
in the human genome provide any
benefits to humans?
PROBLEMS

Which statements are true? Explain why or why not. 

5-1 The different cells in your body rarely have
genomes with the identical nucleotide sequence.

5-2 In E. coli, where the replication fork travels at 500
nucleotide pairs per second, the DNA ahead of the fork—
in the absence of topoisomerase—would have to rotate at
nearly 3000 revolutions per minute.

5-3 In a replication bubble, the same parental DNA
strand serves as the template strand for leading-strand
synthesis in one replication fork and as the template for
lagging-strand synthesis in the other fork.

5-4 When bidirectional replication forks from adjacent
origins meet, a leading strand always runs into a lagging
strand.

5-5 DNA repair mechanisms all depend on the existence
of two copies of the genetic information, one in each
of the two homologous chromosomes.

Discuss the following problems. 

5-6 To determine the reproducibility of mutation frequency
measurements, you do the following experiment.
You inoculate each of 10 cultures with a single E. coli bacterium,
allow the cultures to grow until each contains 10^6
cells, and then measure the number of cells in each culture
that carry a mutation in your gene of interest. You were so
surprised by the initial results that you repeated the experiment
to confirm them. Both sets of results display the same
extreme variability, as shown in Table Q5-1. Assuming that
the rate of mutation is constant, why do you suppose there
is so much variation in the frequencies of mutant cells in
different cultures?


Table Q5-1 Frequencies of mutant cells in multiple
cultures (Problem 5-6)

              Culture (mutant cells/10^6 cells)
Experiment      1     2     3     4     5     6     7     8     9    10
    1           4     0   257     1     2    32     0     0     2     1
    2         128     0     1     4     0     0    66     5     0
5-7 DNA repair enzymes preferentially repair mismatched
bases on the newly synthesized DNA strand,
using the old DNA strand as a template. If mismatches
were instead repaired without regard for which strand
served as template, would mismatch repair reduce replication
errors? Would such a mismatch repair system result
in fewer mutations, more mutations, or the same number
of mutations as there would have been without any repair
at all? Explain your answers.

5-8 Discuss the following statement: "Primase is a
sloppy enzyme that makes many mistakes. Eventually, the
RNA primers it makes are replaced with DNA made by a
polymerase with higher fidelity. This is wasteful. It would
be more energy-efficient if a DNA polymerase made an
accurate copy in the first place."

5-9 If DNA polymerase requires a perfectly paired
primer in order to add the next nucleotide, how is it that
any mismatched nucleotides "escape" this requirement
and become substrates for mismatch repair enzymes?

5-10 The laboratory you joined is studying the life
cycle of an animal virus that uses circular, double-strand
DNA as its genome. Your project is to define the location
of the origin(s) of replication and to determine whether
replication proceeds in one or both directions away from
an origin (unidirectional or bidirectional replication). To
accomplish your goal, you broke open cells infected with
the virus, isolated replicating viral genomes, cleaved them
with a restriction nuclease that cuts the genome at only
one site to produce a linear molecule from the circle, and
examined the resulting molecules in the electron microscope.
Some of the molecules you observed are illustrated
schematically in Figure Q5-1. (Note that it is impossible to
distinguish the orientation of one DNA molecule relative
to another in the electron microscope.)

You must present your conclusions to the rest of
the lab tomorrow. How will you answer the two questions
your advisor posed for you? Is there a single, unique origin
of replication or several origins? Is replication unidirectional
or bidirectional?
Figure Q5-1 Parental
and replicating forms of
an animal virus (Problem
5-10).


5-11 You are investigating DNA synthesis in tissue-culture
cells, using ³H-thymidine to radioactively label the
replication forks. By breaking open the cells in a way that
allows some of the DNA strands to be stretched out, very
long DNA strands can be isolated intact and examined.
You overlay the DNA with a photographic emulsion, and
expose it for 3 to 6 months, a procedure known as
autoradiography. Because the emulsion is sensitive to radioactive
emissions, the ³H-labeled DNA shows up as tracks of
silver grains. Because the stretching collapses replication
Figure Q5-2 Autoradiographic investigation of DNA replication in cultured
cells (Problem 5-11). (A) Addition of ³H-labeled thymidine immediately
after release from the synchronizing block. (B) Addition of ³H-labeled
thymidine 30 minutes after release from the synchronizing block.


bubbles, the daughter duplexes lie side by side and cannot 
be distinguished from each other. 

You pretreat the cells to synchronize them at the
beginning of S phase. In the first experiment, you release
the synchronizing block and add ³H-thymidine immediately.
After 30 minutes, you wash the cells and change the
medium so that the total concentration of thymidine is the
same as it was, but only one-third of it is radioactive. After
an additional 15 minutes, you prepare DNA for
autoradiography. The results of this experiment are shown in
Figure Q5-2A. In the second experiment, you release the
synchronizing block and then wait 30 minutes before adding
³H-thymidine. After 30 minutes in the presence of
³H-thymidine, you once again change the medium to reduce the
concentration of radioactive thymidine and incubate the
cells for an additional 15 minutes. The results of the second
experiment are shown in Figure Q5-2B.

A. Explain why, in both experiments, some regions of
the tracks are dense with silver grains (dark), whereas
others are less dense (light).

B. In the first experiment, each track has a central
dark section with light sections at each end. In the second
experiment, the dark section of each track has a light
section at only one end. Explain the reason for this difference.

C. Estimate the rate of fork movement (µm/min) in
these experiments. Do the estimates from the two
experiments agree? Can you use this information to gauge how
long it would take to replicate the entire genome?

5-12 If you compare the frequency of the sixteen possible
dinucleotide sequences in the E. coli and human
genomes, there are no striking differences except for one
dinucleotide, 5'-CG-3'. The frequency of CG dinucleotides
in the human genome is significantly lower than in E. coli
and significantly lower than expected by chance. Why do
you suppose that CG dinucleotides are underrepresented
in the human genome?

5-13 With age, somatic cells are thought to accumulate
genomic "scars" as a result of the inaccurate repair of
double-strand breaks by nonhomologous end joining (NHEJ).
Estimates based on the frequency of breaks in primary
human fibroblasts suggest that by age 70, each human
somatic cell may carry some 2000 NHEJ-induced mutations
due to inaccurate repair. If these mutations were
distributed randomly around the genome, how many
protein-coding genes would you expect to be affected? Would
you expect cell function to be compromised? Why or why
not? (Assume that 2% of the genome—1.5% protein-coding
and 0.5% regulatory—is crucial information.)

5-14 Draw the structure of the double Holliday junction
that would result from strand invasion by both ends of the
broken duplex into the intact homologous duplex shown
in Figure Q5-3. Label the left end of each strand in the
Holliday junction 5' or 3' so that the relationship to the
parental and recombinant duplexes is clear. Indicate how DNA
synthesis would be used to fill in any single-strand gaps in
your double Holliday junction.

Figure Q5-3 A broken duplex with single-strand tails ready to invade
an intact homologous duplex (Problem 5-14).


5-15 In addition to correcting DNA mismatches, the
mismatch repair system functions to prevent homologous
recombination from taking place between similar but not
identical sequences. Why would recombination between
similar but nonidentical sequences pose a problem for
human cells?

5-16 Cre recombinase is a site-specific enzyme that
catalyzes recombination between two LoxP DNA sites.
Cre recombinase pairs two LoxP sites in the same orientation,
breaks both duplexes at the same point in each LoxP
site, and joins the ends with new partners so that each
LoxP site is regenerated, as shown schematically in Figure
Q5-4A. Based on this mechanism, predict the arrangement
of sequences that will be generated by Cre-mediated
site-specific recombination for each of the two DNAs
shown in Figure Q5-4B.

Figure Q5-4 Cre recombinase-mediated site-specific recombination
(Problem 5-16). (A) Schematic representation of Cre/LoxP site-specific
recombination (break, then rejoin). The LoxP sequences in the DNA are
represented by triangles that are colored so that the site-specific
recombination event can be followed more readily. In reality their DNA
sequences are identical. (B) DNA substrates containing two arrangements of
LoxP sites.


CHAPTER 6

How Cells Read the Genome: From DNA to Protein


Since the structure of DNA was discovered in the early 1950s, progress in cell and
molecular biology has been astounding. We now know the complete genome
sequences for thousands of different organisms, revealing fascinating details of
their biochemistry as well as important clues as to how these organisms evolved.
Complete genome sequences have also been obtained for thousands of individual
humans, as well as for a few of our now-extinct relatives, such as the
Neanderthals. Knowing the maximum amount of information that is required to produce
a complex organism like ourselves puts constraints on the biochemical and
structural features of cells and makes it clear that biology is not infinitely complex.

As discussed in Chapter 1, the DNA in genomes does not direct protein synthesis
itself, but instead uses RNA as an intermediary. When the cell needs a particular
protein, the nucleotide sequence of the appropriate portion of the immensely
long DNA molecule in a chromosome is first copied into RNA (a process called
transcription). It is these RNA copies of segments of the DNA that are used directly
as templates to direct the synthesis of the protein (a process called translation).
The flow of genetic information in cells is therefore from DNA to RNA to protein
(Figure 6-1). All cells, from bacteria to humans, express their genetic information
in this way, a principle so fundamental that it is termed the central dogma
of molecular biology. Despite the universality of the central dogma, there are
important variations between organisms in the way in which
information flows from DNA to protein. Principal among these is that RNA
transcripts in eukaryotic cells are subject to a series of processing steps in the nucleus,
including RNA splicing, before they are permitted to exit from the nucleus and be
translated into protein. As we discuss in this chapter, these processing steps can
critically change the "meaning" of an RNA molecule and are therefore crucial for
understanding how eukaryotic cells read their genome.

Although we focus in this chapter on the production of the proteins encoded by
the genome, we see that for many genes, RNA is the final product. Like proteins,
some of these RNAs fold into precise three-dimensional structures that have
structural and catalytic roles in the cell. Other RNAs, as we discuss in the next
chapter, act primarily as regulators of gene expression. But the roles of many
noncoding RNAs are not yet known.

One might have predicted that the information present in genomes would
be arranged in an orderly fashion, resembling a dictionary or a telephone
directory. But it turns out that the genomes of most multicellular organisms are
surprisingly disorderly, reflecting their chaotic evolutionary histories. The genes in
these organisms largely consist of a long string of alternating short exons and long
introns, as discussed in Chapter 4 (see Figure 4-15D). Moreover, small bits of DNA
sequence that code for protein are interspersed with large blocks of seemingly
meaningless DNA. Some sections of the genome contain many genes and others
lack genes altogether. Proteins that work closely with one another in the cell
often have their genes located on different chromosomes, and adjacent genes
typically encode proteins that have little to do with each other in the cell. Decoding
genomes is therefore no simple matter. Even with the aid of powerful computers,
it is difficult for researchers to locate definitively the beginning and end of genes,
much less to decipher when and where each gene is expressed in the life of the
organism. Yet the cells in our body do this automatically, thousands of times a
second.

Figure 6-2 Schematic depiction of a small portion of the human X chromosome
(the whole X chromosome is 155 million nucleotide base pairs, ~5% of the genome).
As summarized in the key, the known protein-coding genes (starting with Abcd1
and ending with F8) are shown in dark gray, with coding regions (exons) indicated by
bars that extend above and below the central line. Noncoding RNAs with known
functions are indicated by purple diamonds. Yellow triangles indicate positions
within protein-coding regions where the Neanderthal genome sequence codes for a
different amino acid than the human genome. The stretch of yellow triangles in the
Tktl1 gene appears to have been positively selected for since the divergence of
Homo sapiens from Neanderthals some 200,000 years ago. Note that most of the
proteins are identical between us and our extinct relative. The blue histogram
indicates the extent to which portions of the human genome are conserved with
other vertebrate species. It is likely that additional genes, currently
unrecognized, also lie within this portion of the human genome.

Genes whose mutation causes an inherited human condition are indicated by red
brackets. The Abcd1 gene codes for a protein that imports fatty acids into the
peroxisome; mutations in the gene cause demyelination of nerves, which can result in
cognition and movement disorders. Incontinentia pigmenti is a disease of the skin,
hair, nails, teeth, and eyes. Hemophilia A is a bleeding disorder caused by mutations
in the Factor VIII gene, which codes for a blood-clotting protein. Because males have
only a single copy of the X chromosome, most of the conditions shown here affect
only males; females that inherit one of these defective genes are often asymptomatic
because a functional protein is made from their other X chromosome. (Courtesy of Alex
Williams, obtained from the University of California, Genome Browser,
http://genome.ucsc.edu)

The problems that cells face in decoding genomes can be appreciated by
considering a tiny portion of the human genome (Figure 6-2). The region illustrated
represents less than 1/2000th of our genome and includes at least 48 genes that
encode proteins and 6 genes for noncoding RNAs. When we consider the entire
human genome, we can only marvel at the capacity of our cells to rapidly and
accurately handle such large amounts of information.

In this chapter, we explain how cells decode and use the information in their
genomes. Much has been learned about how the genetic instructions written in
an alphabet of just four "letters" (the four different nucleotides in DNA) direct
the formation of a bacterium, a fruit fly, or a human. Nevertheless, we still have a
great deal to discover about how the information stored in an organism's genome
produces even the simplest unicellular bacterium with 500 genes, let alone how
it directs the development of a human with approximately 30,000 genes. An
enormous amount of ignorance remains; many fascinating challenges therefore await
the next generation of cell biologists.


FROM DNA TO RNA 


Transcription and translation are the means by which cells read out, or express,
the genetic instructions in their genes. Because many identical RNA copies can
be made from the same gene, and each RNA molecule can direct the synthesis of
many identical protein molecules, cells can synthesize a large amount of protein
from a gene when necessary. But genes can be transcribed and translated with
different efficiencies, allowing the cell to make vast quantities of some proteins
and tiny amounts of others (Figure 6-3). Moreover, as we see in the next chapter,
a cell can change (or regulate) the expression of each of its genes according to its
needs, most commonly by controlling the production of its RNA.

Figure 6-3 Genes can be expressed with different efficiencies. In this example,
gene A is transcribed much more efficiently than gene B and each RNA molecule that it
produces is also translated more frequently. This causes the amount of protein A in
the cell to be much greater than that of protein B.
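
The multiplicative effect described here (many RNA copies per gene, many protein molecules per RNA copy) can be made concrete with a small calculation. The sketch below is only illustrative: the function name `protein_yield` and all of the numbers are assumptions chosen to mimic the contrast between gene A and gene B, not measured values.

```python
def protein_yield(transcripts_made, proteins_per_transcript):
    """Total protein molecules = RNA copies x proteins translated from each copy."""
    return transcripts_made * proteins_per_transcript


# Hypothetical numbers, chosen only to illustrate the contrast between an
# efficiently expressed gene (A) and a weakly expressed one (B).
gene_a = protein_yield(transcripts_made=50, proteins_per_transcript=20)
gene_b = protein_yield(transcripts_made=2, proteins_per_transcript=5)

if __name__ == "__main__":
    print("protein A molecules:", gene_a)
    print("protein B molecules:", gene_b)
    print("fold difference:", gene_a / gene_b)
```

Because the two efficiencies multiply, modest differences at each step can compound into a large difference in final protein amount.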


RNA Molecules Are Single-Stranded

The first step a cell takes in reading out a needed part of its genetic instructions
is to copy a particular portion of its DNA nucleotide sequence (a gene) into an
RNA nucleotide sequence (Figure 6-4). The information in RNA, although copied
into another chemical form, is still written in essentially the same language as it
is in DNA: the language of a nucleotide sequence. Hence the name given to
producing RNA molecules on DNA is transcription.

Like DNA, RNA is a linear polymer made of four different types of nucleotide
subunits linked together by phosphodiester bonds (see Figure 6-4). It differs from
DNA chemically in two respects: (1) the nucleotides in RNA are ribonucleotides,
that is, they contain the sugar ribose (hence the name ribonucleic acid) rather
than deoxyribose; (2) although, like DNA, RNA contains the bases adenine (A),
guanine (G), and cytosine (C), it contains the base uracil (U) instead of the
thymine (T) in DNA (Figure 6-5). Since U, like T, can base-pair by hydrogen-bonding
with A (Figure 6-6), the complementary base-pairing properties described for
DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with C, and A pairs
with U). We also find other types of base pairs in RNA: for example, G occasionally
pairs with U.
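
The pairing rules just described can be written down as a small lookup. The sketch below treats G-C and A-U as the conventional pairs and G-U as an optional extra pair. The names `can_pair` and `strands_pair` are invented for this illustration, and the check ignores everything about real RNA structure except base identity.

```python
# Conventional pairs in RNA, plus the occasional G-U pair mentioned in the text.
CONVENTIONAL = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
G_U_PAIR = {("G", "U"), ("U", "G")}


def can_pair(x, y, allow_gu=True):
    """True if bases x and y can form a base pair under these simple rules."""
    pair = (x.upper(), y.upper())
    return pair in CONVENTIONAL or (allow_gu and pair in G_U_PAIR)


def strands_pair(strand_5to3, other_5to3, allow_gu=True):
    """True if two equal-length RNA segments can pair along their whole length.

    Both segments are written 5'-to-3'; because paired strands run
    antiparallel, the second one is read in reverse.
    """
    if len(strand_5to3) != len(other_5to3):
        return False
    return all(
        can_pair(a, b, allow_gu)
        for a, b in zip(strand_5to3, reversed(other_5to3))
    )


if __name__ == "__main__":
    print(strands_pair("GGAU", "AUCC"))                  # conventional pairs only
    print(strands_pair("GGAU", "AUUC"))                  # needs one G-U pair
    print(strands_pair("GGAU", "AUUC", allow_gu=False))  # fails without G-U
```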

Although these chemical differences are slight, DNA and RNA differ quite
dramatically in overall structure. Whereas DNA always occurs in cells as a
double-stranded helix, RNA is single-stranded. An RNA chain can therefore fold up
into a particular shape, just as a polypeptide chain folds up to form the final shape
of a protein (Figure 6-7). As we see later in this chapter, the ability to fold into
complex three-dimensional shapes allows some RNA molecules to have precise
structural and catalytic functions.

Figure 6-4 A short length of RNA. The phosphodiester chemical linkage between
nucleotides in RNA is the same as that in DNA.

Figure 6-5 The chemical structure of RNA. (A) RNA contains the sugar ribose,
which differs from deoxyribose, the sugar used in DNA, by the presence of an
additional -OH group. (B) RNA contains the base uracil, which differs from thymine,
the equivalent base in DNA, by the absence of a -CH3 group.

Figure 6-6 Uracil forms base pairs with adenine. The absence of a
methyl group in U has no effect on base-pairing; thus, U-A base pairs closely
resemble T-A base pairs (see Figure 4-4).

Transcription Produces RNA Complementary to One Strand
of DNA

The RNA in a cell is made by DNA transcription, a process that has certain
similarities to the process of DNA replication discussed in Chapter 5. Transcription
begins with the opening and unwinding of a small portion of the DNA double
helix to expose the bases on each DNA strand. One of the two strands of the DNA
double helix then acts as a template for the synthesis of an RNA molecule. As in
DNA replication, the nucleotide sequence of the RNA chain is determined by
the complementary base-pairing between incoming nucleotides and the DNA
template. When a good match is made (A with T, U with A, G with C, and C with
G), the incoming ribonucleotide is covalently linked to the growing RNA chain in
an enzymatically catalyzed reaction. The RNA chain produced by transcription
(the transcript) is therefore elongated one nucleotide at a time, and it has a
nucleotide sequence that is exactly complementary to the strand of DNA used as the
template (Figure 6-8).
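
The copying rule described above can be sketched as a simple base-by-base lookup. The code below is a minimal illustration, assuming the template strand is supplied written 3'-to-5' so that the RNA comes out written 5'-to-3'. The function names and the example sequence are made up for this sketch.

```python
# Incoming ribonucleotide chosen for each template base:
# A pairs with template T, U with template A, G with C, and C with G.
TEMPLATE_TO_RNA = {"T": "A", "A": "U", "C": "G", "G": "C"}
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def transcribe(template_3to5):
    """Return the RNA (5'-to-3') complementary to a template written 3'-to-5'."""
    return "".join(TEMPLATE_TO_RNA[base] for base in template_3to5.upper())


def non_template_strand(template_3to5):
    """Return the other DNA strand (5'-to-3') of the same duplex."""
    return "".join(DNA_COMPLEMENT[base] for base in template_3to5.upper())


if __name__ == "__main__":
    template = "TACGGTCA"  # hypothetical template strand, written 3'-to-5'
    rna = transcribe(template)
    print("RNA:          ", rna)
    print("non-template: ", non_template_strand(template))
    # The RNA matches the non-template strand with U in place of T.
    print(rna == non_template_strand(template).replace("T", "U"))
```

The final comparison reflects the point made in the legend to Figure 6-8: the transcript has the same base sequence as the non-template DNA strand, except that U replaces T.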

Transcription, however, differs from DNA replication in several crucial ways.
Unlike a newly formed DNA strand, the RNA strand does not remain
hydrogen-bonded to the DNA template strand. Instead, just behind the region where
the ribonucleotides are being added, the RNA chain is displaced and the DNA
helix re-forms. Thus, the RNA molecules produced by transcription are released
from the DNA template as single strands. In addition, because they are copied
from only a limited region of the DNA, RNA molecules are much shorter than
DNA molecules. A DNA molecule in a human chromosome can be up to 250 million
nucleotide-pairs long; in contrast, most RNAs are no more than a few
thousand nucleotides long, and many are considerably shorter.

RNA Polymerases Carry Out Transcription

The enzymes that perform transcription are called RNA polymerases. Like the
DNA polymerase that catalyzes DNA replication (discussed in Chapter 5), RNA
polymerases catalyze the formation of the phosphodiester bonds that link the
nucleotides together to form a linear chain. The RNA polymerase moves stepwise
along the DNA, unwinding the DNA helix just ahead of the active site for
polymerization to expose a new region of the template strand for complementary
base-pairing. In this way, the growing RNA chain is extended by one nucleotide
at a time in the 5'-to-3' direction (Figure 6-9). The substrates are ribonucleoside
triphosphates (ATP, CTP, UTP, and GTP); as in DNA replication, the hydrolysis of
high-energy bonds provides the energy needed to drive the reaction forward (see
Figure 5-4 and Movie 6.2).

Figure 6-7 RNA can fold into specific structures. RNA is largely single-stranded,
but it often contains short stretches of nucleotides that can form conventional
base pairs with complementary sequences found elsewhere on the same molecule.
These interactions, along with additional "nonconventional" base-pair interactions,
allow an RNA molecule to fold into a three-dimensional structure that is determined
by its sequence of nucleotides (Movie 6.1). (A) Diagram of a folded RNA structure
showing only conventional base-pair interactions. (B) Structure with both
conventional (red) and nonconventional (green) base-pair interactions.
(C) Structure of an actual RNA, one that catalyzes its own splicing (see p. 324).
Each conventional base-pair interaction is indicated by a "rung" in the double helix.
Bases in other configurations are indicated by broken rungs.

The almost immediate release of the RNA strand from the DNA as it is
synthesized means that many RNA copies can be made from the same gene in a relatively
short time, with the synthesis of additional RNA molecules being started before
the previous RNA molecules are completed (Figure 6-10). When RNA polymerase
molecules follow hard on each other's heels in this way, each moving at about
50 nucleotides per second, over a thousand transcripts can be synthesized in an
hour from a single gene.
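
A back-of-the-envelope calculation shows how these figures fit together. Only the rate of about 50 nucleotides per second comes from the text; the gene length and the interval between successive initiations used below are assumptions picked for illustration, and the estimate ignores the short lag before the first polymerase finishes.

```python
ELONGATION_RATE = 50  # nucleotides per second (figure quoted in the text)


def seconds_per_transcript(gene_length_nt, rate=ELONGATION_RATE):
    """Time for one polymerase to traverse the gene."""
    return gene_length_nt / rate


def transcripts_per_hour(initiation_interval_s):
    """Steady-state output when a new polymerase starts every interval."""
    return 3600 / initiation_interval_s


def polymerase_spacing_nt(initiation_interval_s, rate=ELONGATION_RATE):
    """Distance along the DNA between consecutive polymerases."""
    return rate * initiation_interval_s


if __name__ == "__main__":
    gene_length = 1500  # assumed gene length in nucleotides
    interval = 3.0      # assumed seconds between successive initiations
    print("seconds per transcript:", seconds_per_transcript(gene_length))
    print("transcripts per hour:  ", transcripts_per_hour(interval))
    print("polymerase spacing, nt:", polymerase_spacing_nt(interval))
```

Under these assumptions, output per hour is set by how often a new polymerase starts rather than by how long each transcript takes, which is why closely spaced polymerases are needed to exceed a thousand transcripts per hour.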

Although RNA polymerase catalyzes essentially the same chemical reaction as
DNA polymerase, there are some important differences between the activities of
the two enzymes. First, and most obviously, RNA polymerase catalyzes the linkage
of ribonucleotides, not deoxyribonucleotides. Second, unlike the DNA
polymerases involved in DNA replication, RNA polymerases can start an RNA chain
without a primer. This difference is thought possible because transcription need
not be as accurate as DNA replication (see Table 5-1, p. 244). RNA polymerases
make about one mistake for every 10^4 nucleotides copied into RNA (compared
with an error rate for direct copying by DNA polymerase of about one in 10^7
nucleotides); and the consequences of an error in RNA transcription are much less
significant, as RNA does not permanently store genetic information in cells. Finally,
unlike DNA polymerases, which make their products in segments that are later
stitched together, RNA polymerases are absolutely processive; that is, the same
RNA polymerase that begins an RNA molecule must finish it without dissociating
from the DNA template.
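
The practical meaning of these two error rates can be seen by asking how many mistakes to expect in a copy of a given length. The sketch below assumes that errors occur independently at each position, which is a simplification, and the 2000-nucleotide length is an arbitrary example rather than a value from the text.

```python
RNA_POLYMERASE_ERROR = 1e-4  # about one mistake per 10^4 nucleotides
DNA_POLYMERASE_ERROR = 1e-7  # about one mistake per 10^7 nucleotides


def expected_errors(length_nt, error_rate):
    """Average number of mistakes in a copy of the given length."""
    return length_nt * error_rate


def prob_error_free(length_nt, error_rate):
    """Chance that a copy contains no mistakes, assuming independent errors."""
    return (1 - error_rate) ** length_nt


if __name__ == "__main__":
    length = 2000  # hypothetical transcript length in nucleotides
    for name, rate in [("RNA polymerase", RNA_POLYMERASE_ERROR),
                       ("DNA polymerase", DNA_POLYMERASE_ERROR)]:
        print(name,
              "expected errors:", expected_errors(length, rate),
              "fraction error-free:", round(prob_error_free(length, rate), 4))
```

Even at the higher RNA polymerase error rate, most copies of a transcript this size would be error-free under these assumptions, and any faulty copy is one of many short-lived molecules.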

Although not nearly as accurate as the DNA polymerases that replicate DNA,
RNA polymerases nonetheless have a modest proofreading mechanism. If an
incorrect ribonucleotide is added to the growing RNA chain, the polymerase can
back up, and the active site of the enzyme can perform an excision reaction that
resembles the reverse of the polymerization reaction, except that a water
molecule replaces the pyrophosphate and a nucleoside monophosphate is released.

Given that DNA and RNA polymerases both carry out template-dependent
nucleotide polymerization, it might be expected that the two types of enzymes
would be structurally related. However, x-ray crystallographic studies reveal that,
other than containing a critical Mg2+ ion at the catalytic site, the two enzymes are
quite different. Template-dependent nucleotide-polymerizing enzymes seem to
have arisen at least twice during the early evolution of cells. One lineage led to the
modern DNA polymerases and reverse transcriptases discussed in Chapter 5, as
well as to a few RNA polymerases from viruses. The other lineage formed all of the
modern RNA polymerases that we discuss in this chapter.

Figure 6-8 DNA transcription produces a single-stranded RNA molecule that
is complementary to one strand of the DNA double helix. Note that the sequence
of bases in the RNA molecule produced is the same as the sequence of bases in the
non-template DNA strand, except that a U replaces every T base in the DNA.

Figure 6-9 DNA is transcribed by the enzyme RNA polymerase. The RNA
polymerase (pale blue) moves stepwise along the DNA, unwinding the DNA helix
at its active site, indicated by the Mg2+ (red), which is required for catalysis.
As it progresses, the polymerase adds nucleotides one by one to the RNA chain at
the polymerization site, using an exposed DNA strand as a template. The RNA
transcript is thus a complementary copy of one of the two DNA strands. A short region
of DNA/RNA helix (approximately nine nucleotide pairs in length) is formed only
transiently, and a "window" of DNA/RNA helix therefore moves along the DNA with
the polymerase as the DNA double helix re-forms behind it. The incoming nucleotides
are in the form of ribonucleoside triphosphates (ATP, UTP, CTP, and GTP),
and the energy stored in their phosphate-phosphate bonds provides the driving
force for the polymerization reaction (see Figure 5-4). The figure, based on an x-ray
crystallographic structure, shows a cutaway view of the polymerase: the part
facing the viewer has been sliced away to reveal the interior (Movie 6.3).
(Adapted from P. Cramer et al., Science 288:640-649, 2000; PDB code: 1HQM.)

Cells Produce Different Categories of RNA Molecules

The majority of genes carried in a cell's DNA specify the amino acid sequence of
proteins; the RNA molecules that are copied from these genes (which ultimately
direct the synthesis of proteins) are called messenger RNA (mRNA) molecules. The
final product of other genes, however, is the RNA molecule itself. These RNAs are
known as noncoding RNAs because they do not code for protein. In a well-studied,
single-celled eukaryote, the yeast Saccharomyces cerevisiae, over 1200 genes
(more than 15% of the total) produce RNA as their final product. Humans may
produce on the order of ten thousand noncoding RNAs. These RNAs, like proteins,
serve as enzymatic, structural, and regulatory components for a wide variety of
processes in the cell. In Chapter 5, we encountered one of them as the template
carried by the enzyme telomerase. Although many of the noncoding RNAs are still
mysterious, we shall see in this chapter that small nuclear RNA (snRNA)
molecules direct the splicing of pre-mRNA to form mRNA, that ribosomal RNA (rRNA)
molecules form the core of ribosomes, and that transfer RNA (tRNA) molecules
form the adaptors that select amino acids and hold them in place on a ribosome
for incorporation into protein. In Chapter 7, we shall see that microRNA (miRNA)
molecules and small interfering RNA (siRNA) molecules serve as key regulators
of eukaryotic gene expression, and that piwi-interacting RNAs (piRNAs)
protect animal germ lines from transposons; we also discuss the long noncoding
RNAs (lncRNAs), a diverse set of RNAs whose functions are just being discovered
(Table 6-1).


Figure 6-10 Transcription of two genes as observed under the electron
microscope. The micrograph shows many molecules of RNA polymerase
simultaneously transcribing each of two adjacent genes. Molecules of RNA
polymerase are visible as a series of dots along the DNA with the newly synthesized
transcripts (fine threads) attached to them. The RNA molecules (ribosomal RNAs)
shown in this example are not translated into protein but are instead used directly as
components of ribosomes, the machines on which translation takes place. The
particles at the 5' end (the free end) of each rRNA transcript are believed to reflect the
beginnings of ribosome assembly. From the relative lengths of the newly synthesized
transcripts, it can be deduced that the RNA polymerase molecules are transcribing from
left to right. (Courtesy of Ulrich Scheer.)


TABLE 6-1 Principal Types of RNAs Produced in Cells

Type of RNA   Function
mRNAs         Messenger RNAs, code for proteins
rRNAs         Ribosomal RNAs, form the basic structure of the ribosome and catalyze protein synthesis
tRNAs         Transfer RNAs, central to protein synthesis as adaptors between mRNA and amino acids
snRNAs        Small nuclear RNAs, function in a variety of nuclear processes, including the splicing of pre-mRNA
snoRNAs       Small nucleolar RNAs, help to process and chemically modify rRNAs
miRNAs        MicroRNAs, regulate gene expression by blocking translation of specific mRNAs and causing their degradation
siRNAs        Small interfering RNAs, turn off gene expression by directing the degradation of selective mRNAs and the establishment of compact chromatin structures
piRNAs        Piwi-interacting RNAs, bind to piwi proteins and protect the germ line from transposable elements
lncRNAs       Long noncoding RNAs, many of which serve as scaffolds; they regulate diverse cell processes, including X-chromosome inactivation

Each transcribed segment of DNA is called a transcription unit. In eukaryotes,
a transcription unit typically carries the information of just one gene, and
therefore codes for either a single RNA molecule or a single protein (or group of related
proteins if the initial RNA transcript is spliced in more than one way to produce
different mRNAs). In bacteria, a set of adjacent genes is often transcribed as a
unit; the resulting mRNA molecule therefore carries the information for several
distinct proteins.

Overall, RNA makes up a few percent of a cell's dry weight, whereas proteins
comprise about 50%. Most of the RNA in cells is rRNA; mRNA comprises only
3-5% of the total RNA in a typical mammalian cell. The mRNA population is made
up of tens of thousands of different species, and there are on average only 10-15
molecules of each species of mRNA present in each cell.

Signals Encoded in DNA Tell RNA Polymerase Where to Start and
Stop

To transcribe a gene accurately, RNA polymerase must recognize where on the
genome to start and where to finish. The way in which RNA polymerases perform
these tasks differs somewhat between bacteria and eukaryotes. Because the
processes in bacteria are simpler, we discuss them first.

The initiation of transcription is an especially important step in gene
expression because it is the main point at which the cell regulates which proteins are to
be produced and at what rate. The bacterial RNA polymerase core enzyme is a
multisubunit complex that synthesizes RNA using the DNA template as a guide.
An additional subunit called sigma (σ) factor associates with the core enzyme and
assists it in reading the signals in the DNA that tell it where to begin transcribing
(Figure 6-11). Together, σ factor and core enzyme are known as the RNA
polymerase holoenzyme; this complex adheres only weakly to bacterial DNA when the
two collide, and a holoenzyme typically slides rapidly along the long DNA
molecule and then dissociates. However, when the polymerase holoenzyme slides into
a promoter, a special sequence of nucleotides indicating the starting point for RNA
synthesis, the polymerase binds tightly, because its σ factor makes specific
contacts with the edges of bases exposed on the outside of the DNA double
helix (step 1 in Figure 6-11A).

The tightly bound RNA polymerase holoenzyme at a promoter opens up the
double helix to expose a short stretch of nucleotides on each strand (step 2 in
Figure 6-11A). The region of unpaired DNA (about 10 nucleotides) is called the
transcription bubble, and it is stabilized by the binding of σ factor to the unpaired
bases on one of the exposed strands. The other exposed DNA strand then acts as
a template for complementary base-pairing with incoming ribonucleotides, two
of which are joined together by the polymerase to begin an RNA chain (step 3
in Figure 6-11A). The first ten or so nucleotides of RNA are synthesized using a
"scrunching" mechanism, in which RNA polymerase remains bound to the
promoter and pulls the upstream DNA into its active site, thereby expanding the
transcription bubble. This process creates considerable stress and the short RNAs
are often released, thereby relieving the stress and forcing the polymerase, which
remains in place, to begin synthesis over again. Eventually this process of
abortive initiation is overcome and the stress generated by scrunching helps the core
enzyme to break free of its interactions with the promoter DNA (step 4 in Figure
6-11A) and discard the σ factor (step 5 in Figure 6-11A). At this point, the
polymerase begins to move down the DNA, synthesizing RNA, in a stepwise fashion:
the polymerase moves forward one base pair for every nucleotide added. During
this process, the transcription bubble continually expands at the front of the
polymerase and contracts at its rear. Chain elongation continues (at a speed of
approximately 50 nucleotides/sec for bacterial RNA polymerases) until the enzyme
encounters a second signal, the terminator (step 6 in Figure 6-11A), where the
polymerase halts and releases both the newly made RNA molecule and the DNA
template (step 7 in Figure 6-11A). The free polymerase core enzyme then
reassociates with a free σ factor to form a holoenzyme that can begin the process of
transcription again (step 8 in Figure 6-11A).


The process of transcription initiation is complicated and requires that the
RNA polymerase holoenzyme and the DNA undergo a series of conformational
changes. We can view these changes as opening up and positioning the DNA in
the active site followed by a successive tightening of the enzyme around the DNA
and RNA to ensure that it does not dissociate before it has finished transcribing a
gene. If an RNA polymerase does dissociate prematurely, it must start over again
at the promoter.

How do the termination signals in the DNA stop the elongating polymerase?
For most bacterial genes, a termination signal consists of a string of A-T nucleotide
pairs preceded by a twofold symmetric DNA sequence, which, when transcribed
into RNA, folds into a "hairpin" structure through Watson-Crick base-pairing (see
Figure 6-11A). As the polymerase transcribes across a terminator, the formation
of the hairpin helps to disengage the RNA transcript from the active site (step 7
in Figure 6-11A). The process of termination provides an example of a common
theme in this chapter: the folding of RNA into specific structures affects many
steps in decoding the genome.
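
The termination signal described above has a simple sequence signature that can be searched for in a transcript: a stretch that can fold back on itself, followed by a run of U's (the RNA copy of the A-T pairs). The sketch below is a toy scan under stated assumptions. The stem length, loop sizes, and U-run length are arbitrary parameters chosen here, the stem must pair perfectly, and the example sequence is invented; real terminator prediction is considerably more involved.

```python
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Sequence that would pair with rna in an antiparallel stem."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(rna))


def find_terminator_candidates(rna, stem=5, min_loop=3, max_loop=8, u_run=4):
    """Return (start, stem_seq, loop_seq) for each hairpin followed by U's.

    A candidate is: stem, loop, reverse complement of the stem, then at
    least u_run consecutive U's. Input is assumed to contain only A, C, G, U.
    """
    rna = rna.upper()
    hits = []
    for i in range(len(rna) - 2 * stem - min_loop - u_run + 1):
        left = rna[i:i + stem]
        for loop in range(min_loop, max_loop + 1):
            j = i + stem + loop   # start of the right arm of the stem
            end = j + stem        # first position after the hairpin
            if end + u_run > len(rna):
                break
            if (rna[j:end] == reverse_complement(left)
                    and rna[end:end + u_run] == "U" * u_run):
                hits.append((i, left, rna[i + stem:j]))
    return hits


if __name__ == "__main__":
    # Invented transcript: a GC-rich stem, a short loop, then a run of U's.
    transcript = "AAGCCGCUUCGGCGGCUUUUUUGA"
    for start, stem_seq, loop_seq in find_terminator_candidates(transcript):
        print(start, stem_seq, loop_seq)
```

A single hairpin can be reported more than once when shorter or shifted stems also satisfy the rule, so the output is best read as a list of candidates rather than distinct terminators.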

Transcription Start and Stop Signais Are Heterogeneous in 
Nucléotide Sequence 

As we hâve just seen, the processes of transcription initiation and termination 
involve a complicated sériés of structural transitions in protein, DNA, and RNA 
molécules. The signais encoded in DNA that specify these transitions are often dif- 
ficult for researchers to recognize. Indeed, a comparison of many different bacte¬ 
rial promoters reveals a surprising degree of variation. Nevertheless, they ail con- 
tain related sequences, reflecting aspects of the DNA that are recognized directly 



Figure 6-11 The transcription cycle of bacterial RNA polymerase. (A) In
step 1, the RNA polymerase holoenzyme (polymerase core enzyme plus σ factor)
assembles and then locates a promoter DNA sequence (see Figure 6-12). The
polymerase opens (unwinds) the DNA at the position at which transcription is to
begin (step 2) and begins transcribing (step 3). This initial RNA synthesis
(abortive initiation) is relatively inefficient as short, unproductive transcripts are
often released. However, once RNA polymerase has managed to synthesize
about 10 nucleotides of RNA, it breaks its interactions with the promoter DNA (step
4) and eventually releases σ factor, as the polymerase tightens around the DNA
and shifts to the elongation mode of RNA synthesis, moving along the DNA (step 5).
During the elongation mode, transcription is highly processive, with the polymerase
leaving the DNA template and releasing the newly transcribed RNA only when it
encounters a termination signal (steps 6 and 7). Termination signals are typically
encoded in DNA, and many function by forming an RNA hairpin-like structure that
destabilizes the polymerase's hold on the RNA.

In bacteria, all RNA molecules are synthesized by a single type of RNA
polymerase, and the cycle depicted in the figure therefore applies to the production of
mRNAs as well as structural and catalytic RNAs. (B) Two-dimensional image of an
elongating bacterial RNA polymerase, as determined by atomic force microscopy
(see Figure 9-33). (C) Interpretation of the image in (B). (Adapted from K.M. Herbert
et al., Annu. Rev. Biochem. 77:149-176, 2008.)
by the σ factor. These common features are often summarized in the form of a
consensus sequence (Figure 6-12). A consensus nucleotide sequence is derived
by comparing many sequences with the same basic function and tallying up the
most common nucleotides found at each position. It therefore serves as a
summary or "average" of a large number of individual nucleotide sequences. A more
accurate way of displaying the range of DNA sequences recognized by a protein is
through the use of a sequence logo, which reveals the relative frequencies of each
nucleotide at each position (Figure 6-12C).
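
The tallying behind a consensus sequence and a sequence logo can be sketched in a few lines of Python. This is only an illustration, not the procedure used to build Figure 6-12: the six aligned hexamers are invented for the example, the function names are hypothetical, and the information content is computed with the commonly used formula of 2 minus the Shannon entropy (in bits), with no small-sample correction.

```python
from collections import Counter
from math import log2

def position_counts(seqs):
    """Tally nucleotide counts at each position of equal-length aligned sequences."""
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    return [Counter(s[i] for s in seqs) for i in range(length)]

def consensus(seqs):
    """Most common nucleotide at each position (ties broken alphabetically)."""
    result = []
    for counts in position_counts(seqs):
        best = max(sorted(counts), key=lambda nt: counts[nt])
        result.append(best)
    return "".join(result)

def information_bits(seqs):
    """Per-position information content: 2 bits minus the Shannon entropy."""
    bits = []
    for counts in position_counts(seqs):
        total = sum(counts.values())
        entropy = -sum((n / total) * log2(n / total) for n in counts.values())
        bits.append(2.0 - entropy)
    return bits

# Hypothetical aligned hexamers, invented purely for illustration.
EXAMPLE_SITES = ["TATAAT", "TATGAT", "TACAAT", "GATAAT", "TATATT", "TATAAT"]

if __name__ == "__main__":
    print(consensus(EXAMPLE_SITES))
    print([round(b, 2) for b in information_bits(EXAMPLE_SITES)])
```

A position where every sequence carries the same base scores the full 2 bits, whereas a position that tolerates several bases scores less. The consensus string alone hides this difference, which is why the logo is the more informative display.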

The DNA sequences of individual bacterial promoters differ in ways that determine
their strength (the number of initiation events per unit time of the promoter).
Evolutionary processes have fine-tuned each to initiate as often as necessary and
have thereby created a wide spectrum of promoter strengths. Promoters for genes
that code for abundant proteins are much stronger than those associated with
genes that encode rare proteins, and the nucleotide sequences of their promoters
are responsible for these differences.

Like bacterial promoters, transcription terminators also have a wide range of
sequences, with the potential to form a simple hairpin RNA structure being the
most important common feature. Since an almost unlimited number of nucleotide
sequences have this potential, terminator sequences are even more heterogeneous
than promoter sequences.
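
To see why so many different sequences share the hairpin potential, it helps to note that a hairpin needs only a short stretch followed, after a small loop, by its own reverse complement. The sketch below searches an RNA string for such perfect inverted repeats. It is a deliberately simplified assumption-laden model: it considers only Watson-Crick pairs (no G-U wobble, no mismatches or bulges), and the stem and loop sizes, the function names, and the example sequence are all hypothetical choices made for illustration.

```python
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

def reverse_complement(rna):
    """Reverse complement of an RNA string (Watson-Crick pairs only)."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(rna))

def find_hairpins(rna, stem=5, min_loop=3, max_loop=8):
    """Return (start, stem_sequence, loop_length) for each perfect inverted repeat.

    A hit means rna[start:start+stem] can base-pair with a downstream
    segment separated from it by a loop of the reported length.
    """
    hits = []
    for i in range(len(rna) - 2 * stem - min_loop + 1):
        left = rna[i:i + stem]
        partner = reverse_complement(left)
        for loop in range(min_loop, max_loop + 1):
            j = i + stem + loop
            if rna[j:j + stem] == partner:
                hits.append((i, left, loop))
    return hits

# Invented example: a 6-nucleotide stem, a 4-nucleotide loop, then a run of U.
EXAMPLE_RNA = "AA" + "GCCCGC" + "UUUU" + "GCGGGC" + "UUUU"

if __name__ == "__main__":
    print(find_hairpins(EXAMPLE_RNA, stem=6))
```

Because any stem sequence works as long as its partner is present downstream, the search constrains the pattern of the sequence rather than its identity, which is consistent with terminators being more heterogeneous than promoters.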

We have discussed bacterial promoters and terminators in some detail to illustrate
an important point regarding the analysis of genome sequences. Although
we know a great deal about bacterial promoters and terminators and can construct
consensus sequences that summarize their most salient features, their variation
in nucleotide sequence makes it difficult to definitively locate them simply
Figure 6-12 Consensus nucleotide sequence and sequence logo for the
major class of E. coli promoters. (A) On the basis of a comparison of 300
promoters, the frequencies of each of the four nucleotides at each position in
the promoter are given. The consensus sequence, shown below the graph, reflects
the most common nucleotide found at each position in the collection of promoters.
These promoters are characterized by two hexameric DNA sequences, the -35
sequence and the -10 sequence, named for their approximate location relative to
the start point of transcription (designated +1). The sequence of nucleotides between
the -35 and -10 hexamers shows no significant similarities among promoters.
For convenience, the nucleotide sequence of a single strand of DNA is shown; in
reality, promoters are double-stranded DNA. The nucleotides shown in the figure
are recognized by σ factor, a subunit of the RNA polymerase holoenzyme.
(B) The distribution of spacing between the -35 and -10 hexamers found in E. coli
promoters. (C) A sequence logo displaying the same information as in panel (A). Here,
the height of each letter is proportional to the frequency at which that base occurs
at that position across a wide variety of promoter sequences. The total height of all
the letters at each position is proportional to the information content (expressed in
bits) at that position. For example, the total information content of a position that can
tolerate several different bases is small (see the last three bases of the -35 sequences),
but statistically greater than random.
by analysis of the nucleotide sequence of a genome. It is even more difficult to
locate analogous sequences in eukaryotic genomes, due in part to the excess DNA
carried in these genomes. Often we need additional information, some of it from
direct experimentation, to locate and accurately interpret the short DNA signals
in genomes.
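
The difficulty of locating promoters from sequence alone can be made concrete with a naive scanner that looks for two hexamers separated by a variable spacer and tolerates a few mismatches. The sketch below is an illustration under stated assumptions, not a real promoter-prediction method: the hexamers TTGACA and TATAAT and the 15 to 19 nucleotide spacer window are commonly quoted values for the major class of E. coli promoters and are entered here as assumptions (the text above shows them only in the figure), and the mismatch threshold, function names, and test sequence are hypothetical.

```python
def mismatches(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))

def scan_promoters(dna, box35="TTGACA", box10="TATAAT",
                   spacers=range(15, 20), max_mismatch=2):
    """Return (position_of_-35_box, spacer_length, total_mismatches) candidates.

    Each hexamer may differ from its consensus at up to max_mismatch positions.
    """
    n = len(box35)
    hits = []
    for i in range(len(dna) - n + 1):
        m35 = mismatches(dna[i:i + n], box35)
        if m35 > max_mismatch:
            continue
        for gap in spacers:
            j = i + n + gap
            window = dna[j:j + len(box10)]
            if len(window) < len(box10):
                break  # ran off the end of the sequence
            m10 = mismatches(window, box10)
            if m10 <= max_mismatch:
                hits.append((i, gap, m35 + m10))
    return hits

# Invented test sequence: perfect hexamers separated by a 17-nucleotide spacer.
EXAMPLE_DNA = "GG" + "TTGACA" + "C" * 17 + "TATAAT" + "GG"

if __name__ == "__main__":
    print(scan_promoters(EXAMPLE_DNA))
```

Raising `max_mismatch` to capture weak but genuine promoters quickly multiplies the number of chance matches in a long genome, while lowering it misses real sites. That trade-off is one way to picture why additional experimental information is often needed.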

As shown in Figure 6-11, promoter sequences are asymmetric, ensuring that 
RNA polymerase can bind in only one orientation. Because the polymerase can 
synthesize RNA only in the 5'-to-3' direction, the promoter orientation specifies
the strand to be used as a template. Genome sequences reveal that the DNA strand 
that is used as the template for RNA synthesis varies from gene to gene, depending 
on the orientation of the promoter (Figure 6-13). 
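
The relationship between promoter orientation and template strand can be sketched as follows. The example assumes the usual convention that the top strand is written 5'-to-3' from left to right; the function names and the short sequence are hypothetical. A gene transcribed from left to right uses the bottom strand as template, so its RNA matches the top strand, whereas a gene transcribed from right to left uses the top strand as template, so its RNA matches the reverse complement of the top strand.

```python
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(dna):
    """Sequence of the opposite strand, read 5'-to-3'."""
    return "".join(DNA_COMPLEMENT[base] for base in reversed(dna))

def transcribe(top_strand, start, end, direction):
    """Return the RNA (written 5'-to-3') produced from top_strand[start:end].

    direction == "right": bottom strand is the template; RNA matches the top strand.
    direction == "left":  top strand is the template; RNA matches the bottom strand.
    """
    region = top_strand[start:end]
    if direction == "right":
        rna_like = region
    elif direction == "left":
        rna_like = reverse_complement(region)
    else:
        raise ValueError("direction must be 'right' or 'left'")
    return rna_like.replace("T", "U")

if __name__ == "__main__":
    top = "ATGGCCTAA"  # invented sequence
    print(transcribe(top, 0, len(top), "right"))
    print(transcribe(top, 0, len(top), "left"))
```

In both cases the RNA is synthesized 5'-to-3'; only the choice of template strand changes with the orientation of the promoter.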

Having considered transcription in bacteria, we now turn to the situation in
eukaryotes, where the synthesis of RNA molecules is a much more elaborate affair.

Transcription Initiation in Eukaryotes Requires Many Proteins 

In contrast to bacteria, which contain a single type of RNA polymerase, eukaryotic
nuclei have three: RNA polymerase I, RNA polymerase II, and RNA polymerase
III. The three polymerases are structurally similar to one another and share some
common subunits, but they transcribe different categories of genes (Table 6-2).
RNA polymerases I and III transcribe the genes encoding transfer RNA, ribosomal
RNA, and various small RNAs. RNA polymerase II transcribes most genes,
including all those that encode proteins, and our subsequent discussion therefore
focuses on this enzyme.

Eukaryotic RNA polymerase II has many structural similarities to bacterial
RNA polymerase (Figure 6-14). But there are several important differences in the
way in which the bacterial and eukaryotic enzymes function, two of which concern
us immediately.

1. While bacterial RNA polymerase requires only a single transcription-initiation
factor (σ) to begin transcription, eukaryotic RNA polymerases require
many such factors, collectively called the general transcription factors.

2. Eukaryotic transcription initiation must take place on DNA that is packaged
into nucleosomes and higher-order forms of chromatin structure
(described in Chapter 4), features that are absent from bacterial
chromosomes.


Table 6-2 The Three RNA Polymerases in Eukaryotic Cells

Type of polymerase     Genes transcribed
RNA polymerase I       5.8S, 18S, and 28S rRNA genes
RNA polymerase II      All protein-coding genes, plus snoRNA genes, miRNA genes,
                       siRNA genes, lncRNA genes, and most snRNA genes
RNA polymerase III     tRNA genes, 5S rRNA genes, some snRNA genes, and
                       genes for other small RNAs

The rRNAs were named according to their "S" values, which refer to their rate of sedimentation
in an ultracentrifuge. The larger the S value, the larger the rRNA.


Figure 6-13 Directions of transcription 
along a short portion of a bacterial 
chromosome. Some genes are 
transcribed using one DNA strand as a 
template, while others are transcribed 
using the other DNA strand. The direction 
of transcription is determined by the 
promoter at the beginning of each gene 
(green arrowheads). This diagram shows 
approximately 0.2% (9000 base pairs) 
of the E. coli chromosome. The genes
transcribed from left to right use the 
bottom DNA strand as the template; those 
transcribed from right to left use the top 
strand as the template. 
Figure 6-14 Structural similarity between a bacterial RNA polymerase and a
eukaryotic RNA polymerase II. Regions of the two RNA polymerases that have
similar structures are indicated in green. The eukaryotic polymerase is larger than
the bacterial enzyme (12 subunits instead of 5), and some of the additional regions
are shown in gray. The blue spheres represent Zn atoms that serve as structural
components of the polymerases, and the red sphere represents the Mg atom present
at the active site, where polymerization takes place. The RNA polymerases in all
modern-day cells (bacteria, archaea, and eukaryotes) are closely related, indicating
that the basic features of the enzyme were in place before the divergence of the three
major branches of life. (Courtesy of P. Cramer and R. Kornberg.)


RNA Polymerase II Requires a Set of General Transcription Factors 

The general transcription factors help to position eukaryotic RNA polymerase
correctly at the promoter, aid in pulling apart the two strands of DNA to allow
transcription to begin, and release RNA polymerase from the promoter to start its
elongation mode. The proteins are "general" because they are needed at nearly all
promoters used by RNA polymerase II. They consist of a set of interacting proteins
denoted arbitrarily as TFIIA, TFIIB, TFIIC, TFIID, and so on (TFII standing for
"transcription factor for polymerase II"). In a broad sense, the eukaryotic general
transcription factors carry out functions equivalent to those of the σ factor in
bacteria; indeed, portions of TFIIF have the same three-dimensional structure as the
equivalent portions of σ.

Figure 6-15 illustrates how the general transcription factors assemble at
promoters used by RNA polymerase II, and Table 6-3 summarizes their
activities. The assembly process begins when TFIID binds to a short double-helical
DNA sequence primarily composed of T and A nucleotides. For this reason, this
sequence is known as the TATA sequence, or TATA box, and the subunit of TFIID
that recognizes it is called TBP (for TATA-binding protein). The TATA box is
typically located 25 nucleotides upstream from the transcription start site. It is not
the only DNA sequence that signals the start of transcription (Figure 6-16), but
for most polymerase II promoters it is the most important. The binding of TFIID


Figure 6-15 Initiation of transcription of a eukaryotic gene by RNA
polymerase II. To begin transcription, RNA polymerase requires several
general transcription factors. (A) The promoter contains a DNA sequence
called the TATA box, which is located 25 nucleotides away from the site at
which transcription is initiated. (B) Through its subunit TBP, TFIID recognizes
and binds the TATA box, which then enables the adjacent binding of TFIIB
(C). For simplicity the DNA distortion produced by the binding of TFIID (see
Figure 6-17) is not shown. (D) The rest of the general transcription factors,
as well as the RNA polymerase itself, assemble at the promoter. (E) TFIIH
then uses energy from ATP hydrolysis to pry apart the DNA double helix at
the transcription start point, locally exposing the template strand. TFIIH also
phosphorylates RNA polymerase II, changing its conformation so that the
polymerase is released from the general factors and can begin the elongation
phase of transcription. As shown, the site of phosphorylation is a long
C-terminal polypeptide tail, also called the C-terminal domain (CTD), that
extends from the polymerase molecule. The assembly scheme shown in the
figure was deduced from experiments performed in vitro, and the exact order
in which the general transcription factors assemble on promoters probably
varies from gene to gene in vivo. The general transcription factors are highly
conserved; some of those from human cells can be replaced in biochemical
experiments by the corresponding factors from simple yeasts.
Table 6-3 The General Transcription Factors Needed for Transcription Initiation
by Eukaryotic RNA Polymerase II

Name             Number of subunits   Roles in transcription initiation
TFIID
  TBP subunit    1                    Recognizes TATA box
  TAF subunits   ~11                  Recognizes other DNA sequences near the
                                      transcription start point; regulates
                                      DNA-binding by TBP
TFIIB            1                    Recognizes BRE element in promoters; accurately
                                      positions RNA polymerase at the start site of
                                      transcription
TFIIF            3                    Stabilizes RNA polymerase interaction with TBP
                                      and TFIIB; helps attract TFIIE and TFIIH
TFIIE            2                    Attracts and regulates TFIIH
TFIIH            9                    Unwinds DNA at the transcription start point,
                                      phosphorylates Ser5 of the RNA polymerase CTD;
                                      releases RNA polymerase from the promoter

TFIID is composed of TBP and ~11 additional subunits called TAFs (TBP-associated factors);
CTD, C-terminal domain.


causes a large distortion in the DNA of the TATA box (Figure 6-17). This distortion
is thought to serve as a physical landmark for the location of an active promoter
in the midst of a very large genome, and it brings DNA sequences on both sides
of the distortion closer together to allow for subsequent protein assembly steps.
Other factors then assemble, along with RNA polymerase II, to form a complete
transcription initiation complex (see Figure 6-15). The most complicated of the
general transcription factors is TFIIH. Consisting of nine subunits, it is nearly as
large as RNA polymerase II itself and, as we shall see shortly, performs several
enzymatic steps needed for the initiation of transcription.

After forming a transcription initiation complex on the promoter DNA, RNA
polymerase II must gain access to the template strand at the transcription start
point. TFIIH, which contains a DNA helicase as one of its subunits, makes this
step possible by hydrolyzing ATP and unwinding the DNA, thereby exposing the
template strand. Next, RNA polymerase II, like the bacterial polymerase, remains
at the promoter synthesizing short lengths of RNA until it undergoes a series of
conformational changes that allow it to move away from the promoter and enter
the elongation phase of transcription. A key step in this transition is the addition
of phosphate groups to the "tail" of the RNA polymerase (known as the CTD
or C-terminal domain). In humans, the CTD consists of 52 tandem repeats of a
Figure 6-16 Consensus sequences found in the vicinity of eukaryotic RNA
polymerase II start points. The name given to each consensus sequence (first column)
and the general transcription factor that recognizes it (last column) are indicated.
N indicates any nucleotide, and two nucleotides separated by a slash indicate an
equal probability of either nucleotide at the indicated position. In reality, each consensus
sequence is a shorthand representation of a histogram similar to that of Figure 6-12.
For most RNA polymerase II transcription start points, only two or three of the four
sequences are present. For example, many polymerase II promoters have a TATA box
sequence, but those that do not typically have a "strong" INR sequence. Although
most of the DNA sequences that influence transcription initiation are located upstream
of the transcription start point, a few, such as the DPE shown in the figure, are located
in the transcribed region.
Figure 6-17 Three-dimensional structure of TBP (TATA-binding protein) bound to
DNA. The TBP is the subunit of the general transcription factor TFIID that is responsible
for recognizing and binding to the TATA box sequence in the DNA (red). The unique
DNA bending caused by TBP (kinks in the double helix separated by partly unwound
DNA) is thought to serve as a landmark that helps to attract the other general
transcription factors (Movie 6.4). TBP is a single polypeptide chain that is folded into
two very similar domains (blue and green). (Adapted from J.L. Kim et al., Nature
365:520-527, 1993. With permission from Macmillan Publishers Ltd.)


seven-amino-acid sequence, which extend from the RNA polymerase core structure.
During transcription initiation, the serine located at the fifth position in the
repeat sequence (Ser5) is phosphorylated by TFIIH, which contains a protein
kinase in one of its subunits (see Figure 6-15D and E). The polymerase can then
disengage from the cluster of general transcription factors. During this process,
it undergoes a series of conformational changes that tighten its interaction with
DNA, and it acquires new proteins that allow it to transcribe for long distances, in
some cases for many hours, without dissociating from DNA.

Once the polymerase II has begun elongating the RNA transcript, most of the 
general transcription factors are released from the DNA so that they are available 
to initiate another round of transcription with a new RNA polymerase molecule.
As we see shortly, the phosphorylation of the tail of RNA polymerase II has an 
additional function: it causes components of the RNA-processing machinery to 
load onto the polymerase and thus be positioned to modify the newly transcribed 
RNA as it emerges from the polymerase. 

Polymerase II Also Requires Activator, Mediator, and
Chromatin-Modifying Proteins

Studies of RNA polymerase II and its general transcription factors acting on DNA
templates in purified in vitro systems established the model for transcription
initiation just described. However, as discussed in Chapter 4, DNA in eukaryotic
cells is packaged into nucleosomes, which are further arranged in higher-order
chromatin structures. As a result, transcription initiation in a eukaryotic cell is
more complex and requires more proteins than it does on purified DNA. First,
gene regulatory proteins known as transcriptional activators must bind to specific
sequences in DNA (called enhancers) and help to attract RNA polymerase
II to the start point of transcription (Figure 6-18). We discuss the role of these
activators in Chapter 7, because they are one of the main ways in which cells
regulate expression of their genes. Here we simply note that their presence on DNA
is required for transcription initiation in a eukaryotic cell. Second, eukaryotic
transcription initiation in vivo requires the presence of a large protein complex
known as Mediator, which allows the activator proteins to communicate properly
with the polymerase II and with the general transcription factors. Finally,
transcription initiation in a eukaryotic cell typically requires the recruitment of
chromatin-modifying enzymes, including chromatin remodeling complexes and
histone-modifying enzymes. As discussed in Chapter 4, both types of enzymes
can increase access to the DNA in chromatin, and by doing so they facilitate the 
assembly of the transcription initiation machinery onto DNA. 

As illustrated in Figure 6-18, many proteins (well over 100 individual subunits)
must assemble at the start point of transcription to initiate transcription in
a eukaryotic cell. The order of assembly of these proteins does not seem to follow
a prescribed pathway; rather, the order differs from gene to gene. Indeed, some of
these different protein complexes may be brought to DNA as preformed
subassemblies.

To begin transcribing, RNA polymerase II must be released from this large
complex of proteins. In addition to the steps described in Figure 6-15, this release
often requires the in situ proteolysis of the activator protein. We shall return to
some of these issues, including the role of chromatin remodeling complexes and
histone-modifying enzymes, in Chapter 7, where we discuss how eukaryotic cells
regulate the process of transcription initiation.

Transcription Elongation in Eukaryotes Requires Accessory 
Proteins 

Once RNA polymerase has initiated transcription, it moves jerkily, pausing at
some DNA sequences and rapidly transcribing through others. Elongating RNA
polymerases, both bacterial and eukaryotic, are associated with a series of elongation
factors, proteins that decrease the likelihood that RNA polymerase will dissociate
before it reaches the end of a gene. These factors typically associate with RNA
polymerase shortly after initiation and help the polymerase move through the
wide variety of different DNA sequences that are found in genes. Eukaryotic RNA
polymerases must also contend with chromatin structure as they move along a
DNA template, and they are typically aided by ATP-dependent chromatin remodeling
complexes that either move with the polymerase or may simply seek out and
rescue the occasional stalled polymerase. In addition, histone chaperones help
by partially disassembling nucleosomes in front of a moving RNA polymerase and
assembling them behind.

As RNA polymerase moves along a gene, some of the enzymes bound to it 
modify the histones, leaving behind a record of where the polymerase has been. 
Although it is not clear exactly how the cell uses this information, it may aid in 


Figure 6-18 Transcription initiation by RNA polymerase II in a eukaryotic cell.
Transcription initiation in vivo requires the presence of transcription activator
proteins. As described in Chapter 7, these proteins bind to specific short sequences
in DNA. Although only one is shown here, a typical eukaryotic gene utilizes
many transcription activator proteins, which in combination determine its rate
and pattern of transcription. Sometimes acting from a distance of several thousand
nucleotide pairs (indicated by the dashed DNA molecule), these proteins help RNA
polymerase, the general transcription factors, and Mediator all to assemble at
the promoter. In addition, activators attract ATP-dependent chromatin remodeling
complexes and histone-modifying enzymes. One of the main roles of
Mediator is to coordinate the assembly of all these proteins at the promoter so that
transcription can begin. As discussed in Chapter 4, the "default" state of chromatin
is a condensed fiber (see Figure 4-28), and this is likely to be the form of DNA upon
which most transcription is initiated. For simplicity, the chromatin is not shown in
this figure.
transcribing a gene over and over again once it has become active for the first
time. It may also be used to coordinate transcription elongation with the processing
of RNA as it emerges from RNA polymerase, a topic we discuss later in this
chapter.

Transcription Creates Superhelical Tension

There is yet another barrier to elongating RNA polymerases, both bacterial and
eukaryotic, one that also applies to DNA polymerases, as discussed in Chapter 5
(see Figure 5-20). To describe this issue in more detail, we need first to consider a
subtle property inherent in the DNA double helix called DNA supercoiling. DNA
supercoiling is the name given to a conformation that DNA adopts in response
to superhelical tension; alternatively, creating loops or coils in a double-helical
DNA molecule can create such tension. Figure 6-19 illustrates why. There are
approximately 10 nucleotide pairs for every helical turn in a DNA double helix. If
we imagine a helix whose two ends are fixed with respect to each other (as they
are in a DNA circle, such as a bacterial chromosome, or in a tightly clamped loop,
as is thought to exist in eukaryotic chromosomes), one large DNA supercoil will
form to compensate for each 10 nucleotide pairs that are opened (unwound). The
formation of this supercoil is energetically favorable because it restores a normal
helical twist to the base-paired regions that remain, which would otherwise need
to be overwound because of the fixed ends.
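
The bookkeeping in this argument can be written as a small calculation. The sketch below uses the standard topological relation that the linking number of a DNA with fixed ends equals twist plus writhe and cannot change without breaking a strand. This relation and its terminology are brought in as an assumption, since the text above describes the idea only in words. The function name is hypothetical, and the model idealizes the helix as exactly 10 nucleotide pairs per turn and assumes that the base-paired regions keep their normal twist, so that all of the strain appears as supercoils.

```python
BP_PER_TURN = 10.0  # approximate helical repeat quoted in the text

def supercoils_after_opening(total_bp, opened_bp, bp_per_turn=BP_PER_TURN):
    """Supercoils (writhe) needed when opened_bp pairs are unwound at fixed ends.

    Linking number is set by the original, fully paired helix and stays
    constant. The opened region contributes no twist, and the remaining
    paired DNA is assumed to keep its normal twist, so writhe = Lk - Tw.
    """
    if not 0 <= opened_bp <= total_bp:
        raise ValueError("opened_bp must lie between 0 and total_bp")
    linking_number = total_bp / bp_per_turn
    twist = (total_bp - opened_bp) / bp_per_turn
    return linking_number - twist

if __name__ == "__main__":
    # A helix of 10 turns (100 nucleotide pairs) with one turn opened.
    print(supercoils_after_opening(100, 10))
```

Under these assumptions the result depends only on the number of pairs opened, in line with the statement that one supercoil forms for each 10 nucleotide pairs unwound.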

RNA polymerase creates superhelical tension as it moves along a stretch of
DNA that is anchored at its ends (see Figure 6-19C). As long as the polymerase
is not free to rotate rapidly (and such rotation is unlikely given the size of RNA
polymerases and their attached transcripts), a moving polymerase generates positive
superhelical tension in the DNA in front of it and negative helical tension
behind it. For eukaryotes, this situation is thought to provide a bonus: although
the positive superhelical tension ahead of the polymerase makes the DNA helix


Figure 6-19 panel labels: (A) DNA with free end: when 10 DNA base pairs (one
helical turn) are unwound, the DNA helix must rotate one turn. (B) DNA with fixed
ends: when 10 DNA base pairs (one helical turn) are unwound, the DNA helix forms
one supercoil. (C) DNA with a protein molecule tracking along it: negative
supercoiling (helix opening facilitated) and positive supercoiling (helix opening
hindered).


Figure 6-19 Superhelical tension in DNA causes DNA supercoiling. (A) For
a DNA molecule with one free end (or a nick in one strand that serves as a swivel),
the DNA double helix rotates by one turn for every 10 nucleotide pairs opened.
(B) If rotation is prevented, superhelical tension is introduced into the DNA by helix
opening. In the example shown, the DNA helix contains 10 helical turns, one of which
is opened. One way of accommodating the tension created would be to increase
the helical twist from 10 to 11 nucleotide pairs per turn in the double helix that
remains. The DNA helix, however, resists such a deformation in a springlike fashion,
preferring to relieve the superhelical tension by bending into supercoiled loops. As a
result, one DNA supercoil forms in the DNA double helix for every 10 nucleotide pairs
opened. The supercoil formed in this case is a positive supercoil. (C) Supercoiling
of DNA is induced by a protein tracking through the DNA double helix. The two
ends of the DNA shown here are unable to rotate freely relative to each other, and
the protein molecule is assumed also to be prevented from rotating freely as it moves.
Under these conditions, the movement of the protein causes an excess of helical turns
to accumulate in the DNA helix ahead of the protein and a deficit of helical turns to arise
in the DNA behind the protein, as shown.
more difficult to open, the tension should facilitate the partial unwrapping of the
DNA in nucleosomes, inasmuch as the release of DNA from the histone core helps
to relax this tension.

Any protein that propels itself alone along a DNA strand of a double helix, such
as a DNA helicase or an RNA polymerase, tends to generate superhelical tension.
In eukaryotes, DNA topoisomerase enzymes rapidly remove this superhelical tension
(see pp. 251-253). But in bacteria a specialized topoisomerase called DNA
gyrase uses the energy of ATP hydrolysis to pump supercoils continuously into the
DNA, thereby maintaining the DNA under constant tension. These are negative
supercoils, having the opposite handedness from the positive supercoils that form
when a region of DNA helix opens (see Figure 6-19B). Whenever a region of helix
opens, it removes these negative supercoils from bacterial DNA, reducing the
superhelical tension. DNA gyrase therefore makes the opening of the DNA helix
in bacteria energetically favorable compared with helix opening in DNA that is
not supercoiled. For this reason, it facilitates those genetic processes in bacteria,
such as the initiation of transcription by bacterial RNA polymerase, that require
helix opening (see Figure 6-11).


Figure 6-20 Comparison of the steps leading from gene to protein in
eukaryotes and bacteria. The final level of a protein in the cell depends on the
efficiency of each step and on the rates of degradation of the RNA and protein
molecules. (A) In eukaryotic cells, the mRNA molecule resulting from transcription
contains both coding (exon) and noncoding (intron) sequences. Before it can be
translated into protein, the two ends of the RNA are modified, the introns are removed
by an enzymatically catalyzed RNA splicing reaction, and the resulting mRNA
is transported from the nucleus to the cytoplasm. For convenience, the steps in
this figure are depicted as occurring one at a time; in reality, many occur concurrently.
For example, the RNA cap is added and splicing begins before transcription has
been completed. Because of the coupling between transcription and RNA processing,
intact primary transcripts (the full-length RNAs that would, in theory, be produced
if no processing had occurred) are found only rarely. (B) In prokaryotes, the
production of mRNA is much simpler. The 5' end of an mRNA molecule is produced
by the initiation of transcription, and the 3' end is produced by the termination of
transcription. Since prokaryotic cells lack a nucleus, transcription and translation
take place in a common compartment, and the translation of a bacterial mRNA
often begins before its synthesis has been completed.


Transcription Elongation in Eukaryotes Is Tightly Coupled to RNA 
Processing 

We have seen that bacterial mRNAs are synthesized by the RNA polymerase starting
and stopping at specific spots on the genome. The situation in eukaryotes is
substantially different. In particular, transcription is only the first of several steps
needed to produce a mature mRNA molecule. Other critical steps are the covalent
modification of the ends of the RNA and the removal of intron sequences that are
discarded from the middle of the RNA transcript by the process of RNA splicing
(Figure 6-20).

Both ends of eukaryotic mRNAs are modified: by capping on the 5' end and by
polyadenylation of the 3' end (Figure 6-21). These special ends allow the cell to
assess whether both ends of an mRNA molecule are present (and if the message
is therefore intact) before it exports the RNA from the nucleus and translates it
Figure 6-21 A comparison of the structures of prokaryotic and eukaryotic mRNA molecules. (A) The 5' and 3'
ends of a bacterial mRNA are the unmodified ends of the chain synthesized by the RNA polymerase, which initiates
and terminates transcription at those points, respectively. The corresponding ends of a eukaryotic mRNA are formed by
adding a 5' cap and by cleavage of the pre-mRNA transcript near the 3' end and the addition of a poly-A tail, respectively.
The figure also illustrates another difference between the prokaryotic and eukaryotic mRNAs: bacterial mRNAs can
contain the instructions for several different proteins, whereas eukaryotic mRNAs nearly always contain the information
for only a single protein. (B) The structure of the cap at the 5' end of eukaryotic mRNA molecules. Note the unusual
5'-to-5' linkage of the 7-methyl G to the remainder of the RNA. Many eukaryotic mRNAs carry an additional modification:
methylation of the 2'-hydroxyl group of the ribose sugar at the 5' end of the primary transcript (see Figure 6-23).



into protein. RNA splicing joins together the different portions of a protein-coding
sequence, and it provides eukaryotes with the ability to synthesize several different
proteins from the same gene.

A simple strategy has evolved to couple all of the above RNA processing steps
to transcription elongation. As discussed previously, a key step in transcription
initiation by RNA polymerase II is the phosphorylation of the RNA polymerase II
tail, also called the CTD (C-terminal domain). This phosphorylation, which proceeds
gradually as the RNA polymerase initiates transcription and moves along
the DNA, not only helps dissociate the RNA polymerase II from other proteins
present at the start point of transcription, but also allows a new set of proteins
to associate with the RNA polymerase tail that function in transcription elongation
and RNA processing. As discussed next, some of these processing proteins
are thought to "hop" from the polymerase tail onto the nascent RNA molecule
to begin processing it as it emerges from the RNA polymerase. Thus, we can view
RNA polymerase II in its elongation mode as an RNA factory that not only moves
along the DNA synthesizing an RNA molecule, but also processes the RNA that it
produces (Figure 6-22). Fully extended, the CTD is nearly 10 times longer than
the remainder of RNA polymerase. As a flexible protein domain, it serves as a scaffold
or tether, holding a variety of proteins close by so that they can rapidly act
when needed. This strategy, which greatly speeds up the overall rate of a series
of consecutive reactions, is one that is commonly utilized in the cell (see Figures
4-58 and 16-18).

RNA Capping Is the First Modification of Eukaryotic Pre-mRNAs 

As soon as RNA polymerase II has produced about 25 nucleotides of RNA, the 5' 
end of the new RNA molecule is modified by addition of a cap that consists of a 
modified guanine nucleotide (see Figure 6-21B). Three enzymes, acting in succession, 
perform the capping reaction: one (a phosphatase) removes a phosphate 
from the 5' end of the nascent RNA, another (a guanyl transferase) adds a GMP in 

Figure 6-22 Eukaryotic RNA polymerase II as an “RNA factory.” As the 
polymerase transcribes DNA into RNA, it carries RNA-processing proteins on 
its tail that are transferred to the nascent RNA at the appropriate time. The 
tail contains 52 tandem repeats of a seven-amino-acid sequence, and there 
are two serines in each repeat. The capping proteins first bind to the RNA 
polymerase tail when it is phosphorylated on Ser5 of the heptad repeat late in 
the process of transcription initiation (see Figure 6-15). This strategy ensures 
that the RNA molecule is efficiently capped as soon as its 5' end emerges 
from the RNA polymerase. As the polymerase continues transcribing, its tail 
is extensively phosphorylated on the Ser2 positions by a kinase associated 
with the elongating polymerase and is eventually dephosphorylated at Ser5 
positions. These further modifications attract splicing and 3'-end processing 
proteins to the moving polymerase, positioning them to act on the newly 
synthesized RNA as it emerges from the RNA polymerase. There are many 
RNA-processing enzymes, and not all travel with the polymerase. For RNA 
splicing, for example, the tail carries only a few critical components; once 
transferred to an RNA molecule, they serve as a nucleation site for the 
remaining components. 

When RNA polymerase II finishes transcribing a gene, it is released 
from DNA, soluble phosphatases remove the phosphates on its tail, and 
it can reinitiate transcription. Only the fully dephosphorylated form of RNA 
polymerase II is competent to begin RNA synthesis at a promoter. 


a reverse linkage (5' to 5' instead of 5' to 3'), and a third (a methyl transferase) adds 
a methyl group to the guanosine (Figure 6-23). Because all three enzymes bind to 
the RNA polymerase tail phosphorylated at the Ser5 position (the modification 
added by TFIIH during transcription initiation), they are poised to modify the 5' 
end of the nascent transcript as soon as it emerges from the polymerase. 
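
The order of the three capping reactions can be followed with a small bookkeeping sketch. The Python below is only an illustration: the function names and the string notation for the 5' end ("ppp" for the triphosphate, "G" for the added guanosine, "m7" for the methyl group) are assumptions made for this example and do not come from the text.

```python
# Toy bookkeeping of the 5'-end chemistry during capping.
# All names and the string notation are hypothetical, chosen for illustration.

def remove_phosphate(five_prime_end: str) -> str:
    """Phosphatase step: the 5' triphosphate (ppp) loses one phosphate."""
    if not five_prime_end.startswith("ppp"):
        raise ValueError("expected a 5' triphosphate end")
    return five_prime_end[1:]


def add_gmp(five_prime_end: str) -> str:
    """Guanyl transferase step: a GMP is joined 5'-to-5' to the diphosphate end."""
    if not five_prime_end.startswith("pp") or five_prime_end.startswith("ppp"):
        raise ValueError("expected a 5' diphosphate end")
    return "Gp" + five_prime_end


def methylate_guanosine(five_prime_end: str) -> str:
    """Methyl transferase step: the added guanosine receives a methyl group."""
    if not five_prime_end.startswith("Gppp"):
        raise ValueError("expected an unmethylated cap")
    return "m7" + five_prime_end


def cap(nascent_end: str) -> str:
    """Apply the three enzymes in the order described above."""
    for step in (remove_phosphate, add_gmp, methylate_guanosine):
        nascent_end = step(nascent_end)
    return nascent_end


print(cap("pppN"))  # m7GpppN
```

Each function refuses an end that the previous enzyme has not yet prepared, which mirrors the strictly successive action of the three enzymes.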

The 5'-methyl cap signifies the 5' end of eukaryotic mRNAs, and this landmark 
helps the cell to distinguish mRNAs from the other types of RNA molecules 
present in the cell. For example, RNA polymerases I and III produce uncapped 
RNAs during transcription, in part because these polymerases lack a CTD. In the 
nucleus, the cap binds a protein complex called CBC (cap-binding complex), 
which, as we discuss in subsequent sections, helps a future mRNA be further processed 
and exported. The 5'-methyl cap also has an important role in the translation 
of mRNAs in the cytosol, as we discuss later in the chapter. 

RNA Splicing Removes Intron Sequences from Newly Transcribed 
Pre-mRNAs 

As discussed in Chapter 4, the protein-coding sequences of eukaryotic genes are 
typically interrupted by noncoding intervening sequences (introns). Discovered 
in 1977, this feature of eukaryotic genes came as a surprise to scientists, who had 
been, until that time, familiar only with bacterial genes, which typically consist 
of a continuous stretch of coding DNA that is directly transcribed into mRNA. In 
marked contrast, eukaryotic genes were found to be broken up into small pieces 
of coding sequence (expressed sequences or exons) interspersed with much longer 
intervening sequences or introns; thus, the coding portion of a eukaryotic gene is 
often only a small fraction of the length of the gene (Figure 6-24). 

Both intron and exon sequences are transcribed into RNA. The intron 
sequences are removed from the newly synthesized RNA through the process 
of RNA splicing. The vast majority of RNA splicing that takes place in cells functions 
in the production of mRNA, and our discussion of splicing focuses on this 
so-called precursor-mRNA (or pre-mRNA) splicing. Only after 5'- and 3'-end processing 
and splicing have taken place is such RNA termed mRNA. 


Figure 6-23 The reactions that cap the 5' end of each RNA molecule 
synthesized by RNA polymerase II. The final cap contains a novel 5'-to-5' 
linkage between the positively charged 7-methyl G residue and the 5' end 
of the RNA transcript (see Figure 6-21B). The letter N represents any one of 
the four ribonucleotides, although the nucleotide that starts an RNA chain is 
usually a purine (an A or a G). (After A.J. Shatkin, BioEssays 7:275-277, 1987. 
With permission from Wiley-Liss, Inc., a subsidiary of John Wiley & Sons, Inc.) 

Each splicing event removes one intron, proceeding through two sequential 
phosphoryl-transfer reactions known as transesterifications; these join two exons 
together while removing the intron between them as a “lariat” (Figure 6-25). The 
machinery that catalyzes pre-mRNA splicing is complex, consisting of five additional 
RNA molecules and several hundred proteins, and it hydrolyzes many ATP 
molecules per splicing event. This complexity ensures that splicing is accurate, 
while at the same time being flexible enough to deal with the enormous variety of 
introns found in a typical eukaryotic cell. 
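
The two transesterification steps can be traced on a sequence string. The following Python is a minimal sketch under stated assumptions: the function name, the index conventions, and the short example sequence are all made up for illustration, and the lariat is represented simply as the excised intron string rather than as a branched structure.

```python
def splice_once(pre_mrna, five_ss, branch, three_ss):
    """Remove one intron from pre_mrna (hypothetical helper for illustration).

    five_ss  : index of the first intron nucleotide (the 5' splice site)
    branch   : index of the branch-point adenine inside the intron
    three_ss : index of the first nucleotide of the downstream exon
    """
    if not five_ss < branch < three_ss:
        raise ValueError("branch point must lie inside the intron")
    if pre_mrna[branch] != "A":
        raise ValueError("branch point must be an adenine")

    # Step 1: the branch-point A attacks the 5' splice site. Exon 1 is
    # released with a free 3'-OH; the intron's 5' end is now joined to the A.
    exon1 = pre_mrna[:five_ss]
    intron_plus_exon2 = pre_mrna[five_ss:]

    # Step 2: the 3'-OH of exon 1 attacks the 3' splice site, joining the
    # two exons and releasing the intron as a lariat.
    intron_length = three_ss - five_ss
    lariat = intron_plus_exon2[:intron_length]
    exon2 = intron_plus_exon2[intron_length:]
    return exon1 + exon2, lariat


# Made-up example: two short exons flanking a 20-nucleotide intron.
exon1, intron, exon2 = "AUGGCC", "GUAAGUUACUAACUUUUCAG", "GGAUAA"
pre = exon1 + intron + exon2
branch = pre.index("UACUAAC") + 5  # last A of the made-up branch motif
mrna, lariat = splice_once(pre, len(exon1), branch, len(exon1) + len(intron))
print(mrna)    # AUGGCCGGAUAA
print(lariat)  # GUAAGUUACUAACUUUUCAG
```

Note that no nucleotides are gained or lost overall: the spliced mRNA plus the lariat account for every position of the precursor.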

It may seem wasteful to remove large numbers of introns by RNA splicing. In 
attempting to explain why it occurs, scientists have pointed out that the exon-intron 
arrangement would seem to facilitate the emergence of new and useful 
proteins over evolutionary time scales. Thus, the presence of numerous introns 
in DNA allows genetic recombination to readily combine the exons of different 
genes, enabling genes for new proteins to evolve more easily by the combination 
of parts of preexisting genes. The observation, described in Chapter 3, that many 
proteins in present-day cells resemble patchworks composed from a common set 
of protein domains supports this idea (see pp. 121-122). 

RNA splicing also has a present-day advantage. The transcripts of many 
eukaryotic genes (estimated at 95% of genes in humans) are spliced in more than 
one way, thereby allowing the same gene to produce a corresponding set of different 
proteins (Figure 6-26). Rather than being the wasteful process it may have 
seemed at first sight, RNA splicing enables eukaryotes to increase the coding 
potential of their genomes. We shall return to this idea again in this chapter and 
the next, but we first need to describe the cellular machinery that performs this 
remarkable task. 
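
To see how quickly optional exons multiply the coding potential of a single gene, the short Python sketch below enumerates every mRNA that results from including or skipping each optional exon. The exon names and sequences are invented for the example, and the enumeration is an upper bound: real genes typically use only some of the possible combinations.

```python
from itertools import combinations


def isoforms(exons, optional):
    """Yield each mRNA formed by keeping or skipping every optional exon.

    exons    : dict mapping exon name -> sequence, in 5'-to-3' order
    optional : set of exon names that may be skipped (an assumption of
               this toy model; all other exons are always included)
    """
    optional_in_order = [name for name in exons if name in optional]
    for k in range(len(optional_in_order) + 1):
        for skipped in combinations(optional_in_order, k):
            yield "".join(seq for name, seq in exons.items()
                          if name not in skipped)


# Hypothetical four-exon gene with two optional internal exons.
exons = {"E1": "AUGGCU", "E2": "GGAUCC", "E3": "UUUAAA", "E4": "CGAUAA"}
variants = list(isoforms(exons, optional={"E2", "E3"}))
for v in variants:
    print(v)
```

With n independently optional exons the model yields 2^n distinct mRNAs, here 4 from 2 optional exons.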


Figure 6-24 Structure of two human 
genes showing the arrangement of 
exons and introns. (A) The relatively small 
β-globin gene, which encodes a subunit of 
the oxygen-carrying protein hemoglobin, 
contains 3 exons (see also Figure 4-7). 
(B) The much larger Factor VIII gene 
contains 26 exons; it codes for a protein 
(Factor VIII) that functions in the blood-clotting 
pathway. The most prevalent form 
of hemophilia results from mutations in 
this gene. 

Figure 6-25 The pre-mRNA splicing 
reaction. (A) In the first step, a specific 
adenine nucleotide in the intron sequence 
(indicated in red) attacks the 5' splice site 
and cuts the sugar-phosphate backbone 
of the RNA at this point. The cut 5' end 
of the intron becomes covalently linked to 
the adenine nucleotide, as shown in detail 
in (B), thereby creating a loop in the RNA 
molecule. The released free 3'-OH end of 
the exon sequence then reacts with the 
start of the next exon sequence, joining 
the two exons together and releasing the 
intron sequence in the shape of a lariat. 
The two exon sequences thereby become 
joined into a continuous coding sequence. 
The released intron sequence is eventually 
broken down into single nucleotides, which 
are recycled. 

Figure 6-26 Alternative splicing of 
the α-tropomyosin gene from rat. 
α-Tropomyosin is a coiled-coil protein (see 
Figure 3-9) that carries out several tasks, 
most notably the regulation of contraction 
in muscle cells. The primary transcript can 
be spliced in different ways, as indicated 
in the figure, to produce distinct mRNAs, 
which then give rise to variant proteins. 
Some of the splicing patterns are specific 
for certain types of cells. For example, the 
α-tropomyosin made in striated muscle 
is different from that made from the same 
gene in smooth muscle. The arrowheads 
in the top part of the figure mark the sites 
where cleavage and poly-A addition form 
the 3' ends of the mature mRNAs. 


Nucleotide Sequences Signal Where Splicing Occurs 

The mechanism of pre-mRNA splicing shown in Figure 6-25 requires that the 
splicing machinery recognize three portions of the precursor RNA molecule: the 
5' splice site, the 3' splice site, and the branch point in the intron sequence that 
forms the base of the excised lariat. Not surprisingly, each site has a consensus 
nucleotide sequence that is similar from intron to intron and provides the cell 
with cues for where splicing is to take place (Figure 6-27). However, these consensus 
sequences are relatively short and can accommodate extensive sequence 
variability; as we shall see shortly, the cell incorporates additional types of information 
to ultimately choose exactly where, on each RNA molecule, splicing is to 
take place. 

The high variability of the splicing consensus sequences presents a special 
challenge for scientists attempting to decipher genome sequences. Introns range 
in size from about 10 nucleotides to over 100,000 nucleotides, and choosing the 
precise borders of each intron is a difficult task even with the aid of powerful computers. 
The possibility of alternative splicing compounds the problem of predicting 
protein sequences solely from a genome sequence. This difficulty is one of the 
main barriers to identifying all of the genes in a complete genome sequence, and 
it is one of the primary reasons why we know only the approximate number of 
different proteins produced by the human genome. 
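
A naive search shows why the short consensus sequences are not enough on their own. The Python sketch below lists every stretch of an RNA that begins with GU and ends with AG, the only invariant nucleotides of the consensus. It is a deliberately crude illustration: the function name, the example sequence, and the use of a 10-nucleotide minimum length (taken from the lower end of the size range quoted above) are assumptions of this example, and real gene-finding programs use far more information.

```python
def candidate_introns(rna, min_len=10):
    """Return (start, end) pairs such that rna[start:end] begins with GU,
    ends with AG, and is at least min_len nucleotides long."""
    donors = [i for i in range(len(rna) - 1) if rna[i:i + 2] == "GU"]
    acceptor_ends = [j + 2 for j in range(len(rna) - 1) if rna[j:j + 2] == "AG"]
    return [(i, end) for i in donors for end in acceptor_ends
            if end - i >= min_len]


# Made-up transcript: exon, 20-nucleotide intron, exon.
exon1, intron, exon2 = "AUGGCC", "GUAAGUUACUAACUUUUCAG", "GGAUAA"
rna = exon1 + intron + exon2
candidates = candidate_introns(rna)
true_intron = (len(exon1), len(exon1) + len(intron))
print(candidates)
print(true_intron in candidates)
```

Even in this 32-nucleotide toy transcript the search returns more than one candidate, only one of which is the intended intron; in a gene with introns of up to 100,000 nucleotides the number of spurious GU...AG pairs becomes enormous.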


RNA Splicing Is Performed by the Spliceosome 

Unlike the other steps of mRNA production we have discussed, key steps in RNA 
splicing are performed by RNA molecules rather than proteins. Specialized RNA 
molecules recognize the nucleotide sequences that specify where splicing is to 
occur and also catalyze the chemistry of splicing. These RNA molecules are relatively 
short (less than 200 nucleotides each), and there are five of them: U1, U2, U4, 
U5, and U6. Known as snRNAs (small nuclear RNAs), each is complexed with at 
least seven protein subunits to form an snRNP (small nuclear ribonucleoprotein). 

Figure 6-27 The consensus nucleotide 
sequences in an RNA molecule that 
signal the beginning and the end of 
most introns in humans. The three 
blocks of nucleotide sequences shown are 
required to remove an intron sequence. 
Here A, G, U, and C are the standard RNA 
nucleotides; R stands for purines (A or 
G); and Y stands for pyrimidines (C or U). 
The A highlighted in red forms the branch 
point of the lariat produced by splicing (see 
Figure 6-25). Only the GU at the start of the 
intron and the AG at its end are invariant 
nucleotides in the splicing consensus 
sequences. Several different nucleotides 
can occupy the remaining positions, 
although the indicated nucleotides are 
preferred. The distances along the RNA 
between the three splicing consensus 
sequences are highly variable; however, 
the distance between the branch point and 
3' splice junction is typically much shorter 
than that between the 5' splice junction 
and the branch point. 

These snRNPs form the core of the spliceosome, the large assembly of RNA and 
protein molecules that performs pre-mRNA splicing in the cell. During the splicing 
reaction, recognition of the 5' splice junction, the branch-point site, and the 
3' splice junction is performed largely through base-pairing between the snRNAs 
and the consensus RNA sequences in the pre-mRNA substrate. 

The spliceosome is a complex and dynamic machine. When studied in vitro, a 
few components of the spliceosome assemble on pre-mRNA and, as the splicing 
reaction proceeds, new components enter and those that have already performed 
their tasks are jettisoned (Figure 6-28). However, many scientists believe that, 
inside the cell, the spliceosome is a preexisting, loose assembly of all the components, 
capturing, splicing, and releasing RNA as a coordinated unit, and undergoing 
extensive rearrangements each time a splice is made. 

Figure 6-28 panel labels: 

The U2 snRNP displaces BBP and U2AF and forms base pairs with the branch-point 
site consensus sequence. 

The U4/U6•U5 “triple” snRNP enters the reaction. In this triple snRNP, the U4 and 
U6 snRNAs are held firmly together by base-pair interactions. Subsequent 
rearrangements break apart the U4/U6 base pairs, allowing U6 to displace U1 at 
the 5' splice junction. This creates the active site that catalyzes the first 
phosphoryl-transferase reaction. 

Additional RNA-RNA rearrangements create the active site for the second 
phosphoryl-transferase reaction, which then completes the splice (see Figure 6-25A). 


Figure 6-28 The pre-mRNA splicing 
mechanism. RNA splicing is catalyzed by 
an assembly of snRNPs (shown as colored 
circles) plus other proteins (most of which are 
not shown), which together constitute the 
spliceosome. The spliceosome recognizes the 
splicing signals on a pre-mRNA molecule, brings 
the two ends of the intron together, and provides 
the enzymatic activity for the two reaction steps 
required (see Figure 6-25A and Movie 6.5). 
As indicated, a set of proteins called the exon 
junction complex (EJC) remains on the spliced 
mRNA molecule; its subsequent role will be 
discussed shortly. 

The Spliceosome Uses ATP Hydrolysis to Produce a Complex 
Series of RNA-RNA Rearrangements 

ATP hydrolysis is not required for the chemistry of RNA splicing per se since the 
two transesterification reactions preserve the high-energy phosphate bonds. 
However, extensive ATP hydrolysis is required for the assembly and rearrangements 
of the spliceosome. Some of the additional proteins that make up the spliceosome 
use the energy of ATP hydrolysis to break existing RNA-RNA interactions 
to allow the formation of new ones. Each successful splice requires approximately 
200 proteins, if we include those that form the snRNPs. 

What is the purpose of these rearrangements? First, they allow the splicing signals 
on the pre-mRNA to be examined by snRNPs several times during the course of 
splicing. For example, the U1 snRNP initially recognizes the 5' splice site through 
conventional base-pairing; as splicing proceeds, these base pairs are broken 
(using the energy of ATP hydrolysis) and U1 is replaced by U6 (Figure 6-29). This 
type of RNA-RNA rearrangement (in which the formation of one RNA-RNA interaction 
requires the disruption of another) occurs several times during splicing 
and allows the spliceosome to check and recheck the splicing signals, thereby 
increasing the overall accuracy of splicing. Second, the rearrangements that take 
place in the spliceosome create the active sites for the two transesterification 
reactions. These two active sites are created, one after the other, and only after the 
splicing signals on the pre-mRNA have been checked several times. This orderly 
progression ensures that splicing accidents occur only rarely. 

One of the most surprising features of the spliceosome is the nature of the catalytic 
sites: they are formed by both protein and RNA molecules, although the 
RNA molecules catalyze the actual chemistry of splicing. In the last section of this 
chapter, we discuss in general terms the structural and chemical properties of 
RNA molecules that allow them to act as catalysts. 

Once the splicing chemistry is completed, the snRNPs remain bound to the 
lariat. The disassembly of these snRNPs from the lariat (and from each other) 
requires another series of RNA-RNA rearrangements that require ATP hydrolysis, 
thereby returning the snRNAs to their original configuration so that they can 
be used again in a new reaction. At the completion of a splice, the spliceosome 
directs a set of proteins to bind to the mRNA near the position formerly occupied 
by the intron. Called the exon junction complex (EJC), these proteins mark the site 
of a successful splicing event and, as we shall see later in this chapter, influence 
the subsequent fate of the mRNA. 


Other Properties of Pre-mRNA and Its Synthesis Help to Explain 
the Choice of Proper Splice Sites 

As we have seen, intron sequences vary enormously in size, with some being in 
excess of 100,000 nucleotides. If splice-site selection were determined solely by 
the snRNPs acting on a preformed, protein-free RNA molecule, we would expect 
frequent splicing mistakes, such as exon skipping and the use of “cryptic” splice 
sites (Figure 6-30). The fidelity mechanisms built into the spliceosome to suppress 
errors, however, are supplemented by two additional strategies that further 
increase the accuracy of splicing. The first is a simple consequence of splicing 
being coupled to transcription. As transcription proceeds, the phosphorylated 
tail of RNA polymerase carries several components of the spliceosome (see Figure 

Figure 6-29 One of the many 
rearrangements that take place in the 
spliceosome during pre-mRNA splicing. 
This example comes from the yeast 
Saccharomyces cerevisiae, in which the 
nucleotide sequences involved are slightly 
different from those in human cells. The 
exchange of U1 snRNP for U6 snRNP 
occurs just before the first phosphoryl-transfer 
reaction (see Figure 6-28). This 
exchange requires the 5' splice site to be 
read by two different snRNPs, thereby 
increasing the accuracy of 5' splice-site 
selection by the spliceosome. 

Figure 6-30 Two types of splicing errors. 
(A) Exon skipping. (B) Cryptic splice-site 
selection. Cryptic splicing signals 
are nucleotide sequences of RNA that 
closely resemble true splicing signals and 
are sometimes mistakenly used by the 
spliceosome. 


6-22), and these components are transferred directly from the polymerase to the 
RNA as the RNA emerges from the polymerase. This strategy helps the cell keep 
track of introns and exons: for example, the snRNPs that assemble at a 5' splice 
site are initially presented only with the single 3' splice site that emerges next from 
the polymerase; the potential sites further downstream have not yet been synthesized. 
The coordination of transcription with splicing is especially important in 
preventing inappropriate exon skipping. 
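
The bookkeeping advantage of coupling splicing to transcription can be sketched as a pairing rule: each 5' splice site is matched with the first 3' splice site to emerge after it. The Python below is a simplified illustration under that single assumption; the function name, the site labels, and the positions are hypothetical, and the real process is probabilistic rather than strictly first-come, first-served.

```python
def pair_cotranscriptionally(sites):
    """Pair each 5' splice site with the next 3' splice site to be synthesized.

    sites : list of (position, kind) tuples in order of synthesis,
            where kind is "5ss" or "3ss" (labels invented for this sketch).
    """
    pairs = []
    pending_donor = None
    for position, kind in sites:
        if kind == "5ss":
            # snRNPs assemble on the newly emerged 5' splice site.
            pending_donor = position
        elif kind == "3ss" and pending_donor is not None:
            # The first 3' site to emerge is the only partner on offer.
            pairs.append((pending_donor, position))
            pending_donor = None
    return pairs


# Hypothetical transcript with two introns of very different length.
sites = [(100, "5ss"), (900, "3ss"), (1050, "5ss"), (5000, "3ss")]
print(pair_cotranscriptionally(sites))  # [(100, 900), (1050, 5000)]
```

Under this rule the 5' site at position 100 is never paired with the distant 3' site at 5000, which is the pairing that would skip the intervening exon.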

A strategy called “exon definition” also helps cells choose the appropriate 
splice sites. Exon size tends to be much more uniform than intron size, averaging 
about 150 nucleotide pairs across a wide variety of eukaryotic organisms (Figure 
6-31). Through exon definition, the splicing machinery can seek out the relatively 
homogeneously sized exon sequences. As RNA synthesis proceeds, a group of 
additional components (most notably SR proteins, so named because they contain 
a domain rich in serines and arginines) assemble on exon sequences and 
help to mark off each 3' and 5' splice site, starting at the 5' end of the RNA (Figure 
6-32). These proteins, in turn, recruit U1 snRNA, which marks the downstream 
exon boundary, and U2 snRNA, which specifies the upstream one. By specifically 
marking the exons in this way and thereby taking advantage of the relatively uniform 
size of exons, the cell increases the accuracy with which it deposits the initial 
splicing components on the nascent RNA and thereby avoids “near miss” splice 
sites. How the SR proteins discriminate exon sequences from intron sequences is 
not understood in detail; however, it is known that some of the SR proteins bind 
preferentially to specific RNA sequences in exons, termed splicing enhancers. 
In principle, since any one of several different codons can be used to code for a 
given amino acid, there is freedom to evolve the exon nucleotide sequence so as 
to form a binding site for an SR protein, without necessarily affecting the amino 
acid sequence that the exon specifies. 
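
The freedom provided by synonymous codons can be made concrete by enumerating every RNA sequence that encodes a short peptide and asking which of them contain a given motif. In the Python sketch below, the codon assignments for glutamate and lysine are from the standard genetic code, but the peptide and the motif "GAAGAA" are placeholders chosen for illustration; the motif is not presented here as a documented SR-protein binding site.

```python
from itertools import product

# Subset of the standard genetic code (RNA codons) needed for the example.
SYNONYMOUS_CODONS = {
    "Glu": ["GAA", "GAG"],
    "Lys": ["AAA", "AAG"],
}


def codings(peptide):
    """Return every RNA sequence that encodes the given peptide."""
    choices = (SYNONYMOUS_CODONS[amino_acid] for amino_acid in peptide)
    return ["".join(codons) for codons in product(*choices)]


def codings_with_motif(peptide, motif):
    """Return the codings that also contain the motif somewhere in the RNA."""
    return [rna for rna in codings(peptide) if motif in rna]


peptide = ["Glu", "Glu", "Lys"]
HYPOTHETICAL_ENHANCER = "GAAGAA"  # placeholder motif, an assumption

all_codings = codings(peptide)
with_site = codings_with_motif(peptide, HYPOTHETICAL_ENHANCER)
print(len(all_codings), len(with_site))  # 8 2
print(with_site)
```

All eight sequences specify the same three amino acids, yet only two of them carry the motif, so selection can create or remove such a site without altering the protein.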

Both the marking of exon and intron boundaries and the assembly of the spliceosome 
begin on an RNA molecule while it is still being elongated by RNA polymerase 
at its 3' end. However, the actual chemistry of splicing can take place later. 
This delay means that intron sequences are not necessarily removed from a 
pre-mRNA molecule in the order in which they occur along the RNA chain. 


Figure 6-31 Variation in intron and exon 
lengths in the human, worm, and fly 
genomes. (A) Size distribution of exons. 
(B) Size distribution of introns. Note that 
exon length is much more uniform than 
intron length. (Adapted from International 
Human Genome Sequencing Consortium, 
Nature 409:860-921, 2001. With 
permission from Macmillan Publishers Ltd.) 

Chromatin Structure Affects RNA Splicing 

Although it may seem at first counterintuitive, the way a gene is packaged into 
chromatin can affect how the RNA transcript of that gene is ultimately spliced. 
Nucleosomes tend to be positioned over exons (which are, on average, close to the 
length of DNA in a nucleosome), and it has been proposed that these act as “speed 
bumps,” allowing the proteins responsible for exon definition to assemble on the 
RNA as it emerges from the polymerase. In addition, changes in chromatin structure 
are used to alter splicing patterns. There are two ways this can happen. First, 
because splicing and transcription are coupled, the rate at which RNA polymerase 
moves along DNA can affect RNA splicing. For example, if polymerase is moving 
slowly, exon skipping (see Figure 6-30A) is minimized: assembly of the initial 
spliceosome may be complete before an alternative choice of splice site even 
emerges from the RNA polymerase. The nucleosomes in condensed chromatin 
can cause polymerase to pause; the pattern of pauses in turn affects the extent of 
RNA exposed at any given time to the splicing machinery. 

There is a second and more direct way that chromatin structure can affect RNA 
splicing. Although the details are not yet understood, specific histone modifications 
attract components of the spliceosome, and, because the chromatin being 
transcribed is in close association with the nascent RNA, these splicing components 
can easily be transferred to the emerging RNA. In this way, certain types of 
histone modifications can affect the final pattern of splicing. 

RNA Splicing Shows Remarkable Plasticity 

We have seen that the choice of splice sites depends on such features of the 
pre-mRNA transcript as the strength of the three signals on the RNA (the 5' and 3' splice 
junctions and the branch point) for the splicing machinery, the co-transcriptional 
assembly of the spliceosome, chromatin structure, and the “bookkeeping” that 
underlies exon definition. We do not know exactly how accurate splicing normally 
is because, as we see later, there are several quality control systems that rapidly 
destroy mRNAs whose splicing goes awry. However, we do know that, compared 
with other steps in gene expression, splicing is unusually flexible. 

Thus, for example, a mutation in a nucleotide sequence critical for splicing of 
a particular intron does not necessarily prevent splicing of that intron altogether. 
Instead, the mutation typically creates a new pattern of splicing (Figure 6-33). 
Most commonly, an exon is simply skipped (Figure 6-33B). In other cases, the 
mutation causes a cryptic splice junction to be efficiently used (Figure 6-33C). 
Apparently, the splicing machinery has evolved to pick out the best possible pattern 
of splice junctions, and if the optimal one is damaged by mutation, it will 
seek out the next best pattern, and so on. This inherent plasticity in the process of 
RNA splicing suggests that changes in splicing patterns caused by random mutations 
have been important in the evolution of genes and organisms. It also means 
that mutations that affect splicing can be severely detrimental to the organism: 
in addition to the β thalassemia example presented in Figure 6-33, aberrant 


Figure 6-32 The exon definition 
hypothesis. According to this idea, 
SR proteins bind to each exon sequence 
in the pre-mRNA and thereby help to 
guide the snRNPs to the proper intron/exon 
boundaries. This demarcation of 
exons by the SR proteins occurs 
co-transcriptionally, beginning at the CBC 
(cap-binding complex) at the 5' end. 
It has been proposed that a group of 
proteins known as the heterogeneous 
nuclear ribonucleoproteins (hnRNPs) 
may preferentially associate with intron 
sequences, further helping the spliceosome 
distinguish introns from exons. (Adapted 
from R. Reed, Curr. Opin. Cell Biol. 
12:340-345, 2000. With permission from 
Elsevier.) 

splicing plays important roles in the development of cystic fibrosis, frontotemporal 
dementia, Parkinson's disease, retinitis pigmentosa, spinal muscular atrophy, 
myotonic dystrophy, premature aging, and cancer. It has been estimated that 
of the many point mutations that cause inherited human diseases, 10% produce 
aberrant splicing of the gene containing the mutation. 

The plasticity of RNA splicing also means that the cell can easily regulate the 
pattern of RNA splicing. Earlier in this section we saw that alternative splicing can 
give rise to different proteins from the same gene and that this is a common strategy 
to enhance the coding potential of genomes. Some examples of alternative 
splicing are constitutive; that is, the alternatively spliced mRNAs are produced 
continuously by cells of an organism. However, in many cases, the cell regulates 
the splicing patterns so that different forms of the protein are produced at different 
times and in different tissues (see Figure 6-26). In Chapter 7, we return to this 
issue to discuss some specific examples of regulated RNA splicing. 

Spliceosome-Catalyzed RNA Splicing Probably Evolved from 
Self-splicing Mechanisms 

When the spliceosome was first discovered, it puzzled molecular biologists. Why 
do RNA molecules instead of proteins perform important roles in splice-site 
recognition and in the chemistry of splicing? Why is a lariat intermediate used 
rather than the apparently simpler alternative of bringing the 5' and 3' splice sites 
together in a single step, followed by their direct cleavage and rejoining? The 
answers to these questions reflect the way in which the spliceosome has evolved. 

As discussed briefly in Chapter 1 (and in more detail in the final section of this 
chapter), it is likely that early cells used RNA molecules rather than proteins as 
their major catalysts and that they stored their genetic information in RNA rather 
than in DNA sequences. RNA-catalyzed splicing reactions presumably had critical 
roles in these early cells. As evidence, some self-splicing RNA introns (that is, 
intron sequences in RNA whose splicing out can occur in the absence of proteins 
or any other RNA molecules) remain today, for example, in the nuclear rRNA 
genes of the ciliate Tetrahymena, in a few bacteriophage T4 genes, and in some 
mitochondrial and chloroplast genes. In these cases, the RNA molecule folds 
into a specific three-dimensional structure that brings the intron/exon junctions 
together and catalyzes the two transesterification reactions. A self-splicing intron 
sequence can be identified in a test tube by incubating a pure RNA molecule that 
contains the intron sequence and observing the splicing reaction. Because the 
basic chemistry of some self-splicing reactions is so similar to pre-mRNA splicing, 
it has been proposed that the much more involved process of pre-mRNA splicing 
evolved from a simpler, ancestral form of RNA self-splicing. 

RNA-Processing Enzymes Generate the 3' End of Eukaryotic 
mRNAs 

We have seen that the 5' end of the pre-mRNA produced by RNA polymerase II 
is capped almost as soon as it emerges from the RNA polymerase. Then, as the 
polymerase continues its movement along a gene, the spliceosome assembles 
on the RNA and delineates the intron and exon boundaries. The long C-terminal 
tail of the RNA polymerase coordinates these processes by transferring capping 
and splicing components directly to the RNA as it emerges from the enzyme. In 
this section, we shall see that, as RNA polymerase II reaches the end of a gene, 
a similar mechanism ensures that the 3' end of the pre-mRNA is appropriately 
processed. 

The position of the 3' end of each mRNA molecule is specified by signals 
encoded in the genome (Figure 6-34). These signals are transcribed into RNA as 
the RNA polymerase II moves through them, and they are then recognized (as 
RNA) by a series of RNA-binding proteins and RNA-processing enzymes (Figure 
6-35). Two multisubunit proteins, called CstF (cleavage stimulation factor) and 
CPSF (cleavage and polyadenylation specificity factor), are of special importance. 

Figure 6-33 Abnormal processing of 
the β-globin primary RNA transcript in 
humans with the disease β thalassemia. 
In the examples shown, the disease (a 
severe anemia due to aberrant hemoglobin 
synthesis) is caused by splice-site 
mutations found in the genomes of affected 
patients. The dark blue boxes represent 
the three normal exon sequences; the red 
lines connect the 5' and 3' splice sites that 
are used. In (B), (C), and (D), the light blue 
boxes depict new nucleotide sequences 
included in the final mRNA molecule as a 
result of the mutation denoted by the 
black arrowhead. Note that when a 
mutation leaves a normal splice site 
without a partner, an exon is skipped (B) 
or one or more abnormal cryptic splice 
sites nearby is used as the partner site (C). 
[Adapted in part from S.H. Orkin, in The 
Molecular Basis of Blood Diseases 
(G. Stamatoyannopoulos et al., eds.), 
pp. 106-126. Philadelphia: Saunders, 
1987.] 

Both of these proteins travel with the RNA polymerase tail and are transferred to 
the 3'-end processing sequence on an RNA molecule as it emerges from the RNA 
polymerase. 

Once CstF and CPSF bind to their recognition sequences on the emerging 
RNA molecule, additional proteins assemble with them to create the 3' end of the 
mRNA. First, the RNA is cleaved from the polymerase (see Figure 6-35). Next an 
enzyme called poly-A polymerase (PAP) adds, one at a time, approximately 200 A 
nucleotides to the 3' end produced by the cleavage. The nucleotide precursor for 
these additions is ATP, and the same type of 5'-to-3' bonds are formed as in conventional 
RNA synthesis. But unlike other RNA polymerases, poly-A polymerase 
does not require a template; hence the poly-A tail of eukaryotic mRNAs is not 
directly encoded in the genome. As the poly-A tail is synthesized, proteins called 
poly-A-binding proteins assemble onto it and, by a poorly understood mechanism, 
help determine the final length of the tail. 

After the 3' end of a eukaryotic pre-mRNA molecule has been cleaved, the RNA 
polymerase II continues to transcribe, in some cases for hundreds of nucleotides. 
Once 3'-end cleavage has occurred, the newly synthesized RNA that emerges from 
the polymerase lacks a 5' cap; this unprotected RNA is rapidly degraded by a 
5'-to-3' exonuclease carried along on the polymerase tail. Apparently, it is this 
continued RNA degradation that eventually causes the RNA polymerase to release its 
grip on the template and terminate transcription. 

Mature Eukaryotic mRNAs Are Selectively Exported from the 
Nucleus 

Eukaryotic pre-mRNA synthesis and processing take place in an orderly fashion 
within the cell nucleus. But of the pre-mRNA that is synthesized, only a small 
fraction, the mature mRNA, is of further use to the cell. Most of the rest (excised 
introns, broken RNAs, and aberrantly processed pre-mRNAs) is not only useless 
but potentially dangerous. How does the cell distinguish between the relatively 
rare mature mRNA molecules it wishes to keep and the overwhelming amount of 
debris created by RNA processing? 

The answer is that, as an RNA molecule is processed, it loses certain proteins 
and acquires others. For example, we have seen that acquisition of cap-binding 
complexes, exon junction complexes, and poly-A-binding proteins marks the 
completion of capping, splicing, and poly-A addition, respectively. A properly 
completed mRNA molecule is also distinguished by the proteins it lacks. For example, 
the presence of an snRNP protein would signify incomplete or aberrant splicing. 
Only when the proteins present on an mRNA molecule collectively signify that 
processing was successfully completed is the mRNA exported from the nucleus into 
the cytosol, where it can be translated into protein. Improperly processed mRNAs 
and other RNA debris (excised intron sequences, for example) are retained in the 
nucleus, where they are eventually degraded by the nuclear exosome, a large 
protein complex whose interior is rich in 3'-to-5' RNA exonucleases (Figure 6-36). 
Eukaryotic cells thus export only useful RNA molecules to the cytoplasm, while 
debris is disposed of in the nucleus. 

Figure 6-34 Consensus nucleotide sequences that direct cleavage and 
polyadenylation to form the 3' end of a eukaryotic mRNA. These sequences 
are encoded in the genome, and specific proteins recognize them (as RNA) after 
they are transcribed. As shown in Figure 6-35, the hexamer AAUAAA is bound by 
CPSF and the GU-rich element beyond the cleavage site is bound by CstF; the 
CA sequence is bound by a third protein factor required for the cleavage step. Like 
other consensus nucleotide sequences discussed in this chapter (see Figure 
6-12), the sequences shown in the figure represent a variety of individual cleavage 
and polyadenylation signals. 

Figure 6-35 Some of the major steps in generating the 3' end of a 
eukaryotic mRNA. This process is much more complicated than the 
analogous process in bacteria, where the RNA polymerase simply stops at a 
termination signal and releases both the 3' end of its transcript and the DNA 
template (see Figure 6-11). 

Of all the proteins that assemble on pre-mRNA molecules as they emerge from 
transcribing RNA polymerases, the most abundant are the hnRNPs (heterogeneous 
nuclear ribonuclear proteins). Some of these proteins (there are approximately 
30 different ones in humans) unwind the hairpin helices in the RNA so 
that splicing and other signals on the RNA can be read more easily. Others 
preferentially package the RNA contained in the very long intron sequences typical 
in complex organisms (see Figure 6-31), and these may play an important role in 
distinguishing mature mRNA from the debris left over from RNA processing. 

Successfully processed mRNAs are guided through the nuclear pore 
complexes (NPCs), aqueous channels in the nuclear membrane that directly 
connect the nucleoplasm and cytosol (Figure 6-37). Small molecules (less than 
60,000 daltons) can diffuse freely through these channels. However, most of the 
macromolecules in cells, including mRNAs complexed with proteins, are far too 
large to pass through the channels without a special process. The cell uses energy 
to actively transport such macromolecules in both directions through the nuclear 
pore complexes. 

As explained in detail in Chapter 12, macromolecules are moved through 
nuclear pore complexes by nuclear transport receptors, which, depending on the 
identity of the macromolecule, escort it from the nucleus to the cytoplasm or vice 
versa. For mRNA export to occur, a specific nuclear transport receptor must be 
loaded onto the mRNA, a step that, in many organisms, takes place in concert 
with 3' cleavage and polyadenylation. Once it has helped to move an RNA molecule 
through the nuclear pore complex, the transport receptor dissociates from the 
mRNA, re-enters the nucleus, and is then used to export a new mRNA molecule. 

The export of mRNA-protein complexes from the nucleus can be readily 
observed with the electron microscope for the unusually abundant mRNA of the 
insect Balbiani Ring genes. As these genes are transcribed, the newly formed RNA 
is seen to be packaged by proteins, including hnRNPs, SR proteins, and 
components of the spliceosome. This protein-RNA complex undergoes a series of 
structural transitions, probably reflecting RNA processing events, culminating in a 
curved fiber (see Figure 6-37). This curved fiber moves through the nucleoplasm 
and enters the nuclear pore complex (with its 5' cap proceeding first), and it then 
undergoes another series of structural transitions as it moves through the pore. 
These and other observations reveal that the pre-mRNA-protein and mRNA-protein 
complexes are dynamic structures that gain and lose numerous specific 
proteins during RNA synthesis, processing, and export (Figure 6-38). 

The analysis just described has been complemented by new methods that 
allow researchers to track the fate of more typical mRNA molecules, which can 
be fluorescently labeled and observed individually. A typical RNA molecule is 
released from its site of transcription and spends several minutes diffusing to a 
nuclear pore complex. During this time it is likely that RNA processing events 
continue and that the RNA sheds previously bound proteins and acquires new ones. 
Once it arrives at the entrance to the pore, the “export-ready” mRNA hovers for 
several seconds, during which time the completion of processing may occur, and 
then is transported through the pore very rapidly, in tens of milliseconds. Some 
mRNA-protein complexes are very large, and how they move through the nuclear 
pore complexes so rapidly remains a mystery. 

Figure 6-36 Structure of the core of human RNA exosome. RNA is fed 
into one end of the central pore and is degraded by RNAses that associate 
with the other end. Nine different protein subunits (each represented by a different 
color) make up this large ring structure. Eukaryotic cells have both a nuclear 
exosome and a cytoplasmic exosome; both forms include the core exosome shown 
here and additional subunits (including specialized RNAses) that differentiate the 
two forms. The nuclear exosome degrades aberrant RNAs before they are exported to 
the cytosol. It also processes certain types of RNA (for example, the ribosomal RNAs) 
to produce their final form. The cytoplasmic form of the exosome is responsible for 
degrading mRNAs in the cytosol, and is thus crucial in determining the lifetime of 
each mRNA molecule. (PDB code: 2NN6.) 

Figure 6-37 Transport of a large mRNA molecule through the nuclear pore complex. (A) The maturation of an mRNA molecule as it is 
synthesized by RNA polymerase and packaged by a variety of nuclear proteins. This drawing of an unusually large and abundant insect RNA, called 
the Balbiani Ring mRNA, is based on electron micrographs such as that shown in (B). (A, adapted from B. Daneholt, Cell 88:585-588, 
1997. With permission from Elsevier; B, from B.J. Stevens and H. Swift, J. Cell Biol. 31:55-77, 1966. With permission from The Rockefeller 
University Press.) 

Some of the proteins deposited on the mRNA while it is still in the nucleus can 
affect the fate of the RNA after it is transported to the cytosol. Thus, the stability of 
an mRNA in the cytosol, the efficiency with which it is translated into protein, and 
its ultimate destination in the cytosol can all be determined by proteins acquired 
in the nucleus that remain bound to the RNA after it leaves the nucleus. 

But before discussing what happens to mRNAs in the cytosol, we briefly 
consider how the synthesis and processing of some noncoding RNA molecules 
occur. There are many types of noncoding RNAs produced by cells (see Table 
6-1, p. 305), but here we focus on the rRNAs, which are critically important for the 
translation of mRNAs into protein. 


Noncoding RNAs Are Also Synthesized and Processed in the 
Nucleus 

Only a few percent of the dry weight of a mammalian cell is RNA; of that, only 
about 3-5% is mRNA. The bulk of the RNA in cells performs structural and 
catalytic functions (see Table 6-1). The most abundant RNAs in cells are the ribosomal 
RNAs (rRNAs), constituting approximately 80% of the RNA in rapidly dividing 
cells. As discussed later in this chapter, these RNAs form the core of the ribosome. 
Unlike bacteria, in which a single RNA polymerase synthesizes all RNAs in the 
cell, eukaryotes have a separate, specialized polymerase, RNA polymerase I, that 
is dedicated to producing rRNAs. RNA polymerase I is similar structurally to the 
RNA polymerase II discussed previously; however, the absence of a C-terminal 
tail in polymerase I helps to explain why its transcripts are neither capped nor 
polyadenylated. 

Figure 6-38 Schematic illustration of an export-ready mRNA molecule and 
its transport through the nuclear pore. As indicated, some proteins travel with 
the mRNA as it moves through the pore, whereas others remain in the nucleus. The 
nuclear export receptor for mRNAs is a complex of proteins that binds to an mRNA 
molecule once it has been correctly spliced and polyadenylated. After the mRNA has 
been exported to the cytosol, this export receptor dissociates from the mRNA and is 
re-imported into the nucleus, where it can be used again. The final check indicated 
here, called nonsense-mediated decay, will be described later in the chapter. 

Figure 6-39 Transcription from tandemly arranged rRNA genes, as seen in 
the electron microscope. The pattern of alternating transcribed gene and 
nontranscribed spacer is readily seen. A higher-magnification view of rRNA genes 
is shown in Figure 6-10. (From V.E. Foe, Cold Spring Harb. Symp. Quant. Biol. 
42:723-740, 1978. With permission from Cold Spring Harbor Laboratory Press.) 

Because multiple rounds of translation of each mRNA molecule can provide 
an enormous amplification in the production of protein molecules, many of the 
proteins that are very abundant in a cell can be synthesized from genes that are 
present in a single copy per haploid genome (see Figure 6-3). In contrast, the RNA 
components of the ribosome are final gene products, and a growing mammalian 
cell must synthesize approximately 10 million copies of each type of ribosomal 
RNA in each cell generation to construct its 10 million ribosomes. The cell can 
produce adequate quantities of ribosomal RNAs only because it contains multiple 
copies of the rRNA genes that code for ribosomal RNAs (rRNAs). Even 
E. coli needs seven copies of its rRNA genes to meet the cell's need for ribosomes. 
Human cells contain about 200 rRNA gene copies per haploid genome, spread 
out in small clusters on five different chromosomes (see Figure 4-11), while cells 
of the frog Xenopus contain about 600 rRNA gene copies per haploid genome in a 
single cluster on one chromosome (Figure 6-39). 
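
The copy numbers quoted in this passage can be turned into a rough per-gene workload. The following is a back-of-the-envelope sketch in Python, not a measurement: the 10 million rRNA copies per generation and the roughly 200 gene copies per haploid genome come from the text, whereas the assumptions that the cell is diploid, that every gene copy is equally active, and that a cell generation lasts 24 hours are made here only for illustration.

```python
# Figures taken from the text.
RRNA_COPIES_PER_GENERATION = 10_000_000   # copies of each rRNA per cell generation
GENE_COPIES_PER_HAPLOID_GENOME = 200      # human rRNA gene copies

# Illustrative assumptions (not from the text).
PLOIDY = 2                                # diploid cell, all copies equally active
GENERATION_TIME_S = 24 * 60 * 60          # 24-hour cell generation, in seconds

gene_copies = GENE_COPIES_PER_HAPLOID_GENOME * PLOIDY
transcripts_per_gene = RRNA_COPIES_PER_GENERATION / gene_copies
seconds_per_transcript = GENERATION_TIME_S / transcripts_per_gene

print(f"rRNA gene copies per cell: {gene_copies}")
print(f"transcripts needed per gene copy: {transcripts_per_gene:.0f}")
print(f"one transcript per gene copy every {seconds_per_transcript:.2f} s")
```

Under these assumptions each gene copy would have to yield a finished transcript every few seconds. With a single gene copy the required rate would be several hundred times higher, which is consistent with the text's point that multiple gene copies are needed.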

There are four types of eukaryotic rRNAs, each present in one copy per 
ribosome. Three of the four rRNAs (18S, 5.8S, and 28S) are made by chemically 
modifying and cleaving a single large precursor rRNA (Figure 6-40); the fourth (5S 
RNA) is synthesized from a separate cluster of genes by a different polymerase, 
RNA polymerase III, and does not require chemical modification. 

Extensive chemical modifications occur in the 13,000-nucleotide-long precursor 
rRNA before the rRNAs are cleaved out of it and assembled into ribosomes. 
These include about 100 methylations of the 2'-OH positions on nucleotide sugars 
and 100 isomerizations of uridine nucleotides to pseudouridine (Figure 6-41A). 
The functions of these modifications are not understood in detail, but they 
probably aid in the folding and assembly of the final rRNAs, or subtly alter the function 
of ribosomes. Each modification is made at a specific position in the precursor 
rRNA, specified by "guide RNAs," which position themselves on the precursor 
rRNA through base-pairing and thereby bring an RNA-modifying enzyme to the 
appropriate position (Figure 6-41B). Other guide RNAs promote cleavage of the 
precursor rRNAs into the mature rRNAs, probably by causing conformational 
changes in the precursor rRNA that expose these sites to nucleases. All of these 
guide RNAs are members of a large class of RNAs called small nucleolar RNAs 
(or snoRNAs), so named because these RNAs perform their functions in a 
subcompartment of the nucleus called the nucleolus. Many snoRNAs are encoded in 
the introns of other genes, especially those encoding ribosomal proteins. They are 
synthesized by RNA polymerase II and processed from excised intron sequences. 

Figure 6-40 The chemical modification and nucleolytic processing of a 
eukaryotic 45S precursor rRNA molecule into three separate ribosomal RNAs. 
Two types of chemical modifications (color-coded as indicated in Figure 6-41) 
are made to the precursor rRNA before it is cleaved. Nearly half of the nucleotide 
sequences in this precursor rRNA are discarded and degraded in the nucleus 
by the exosome. The rRNAs are named according to their “S” values, which 
refer to their rate of sedimentation in an ultracentrifuge. The larger the S value, the 
larger the rRNA. 

Figure 6-41 Modifications of the precursor rRNA by guide RNAs. (A) Two 
prominent covalent modifications made to rRNA; the differences from the initially 
incorporated nucleotide are indicated by red atoms. Pseudouridine is an isomer of 
uridine; the base has been “rotated,” and is attached to the red C rather than to the red 
N of the sugar (compare to Figure 6-5B). (B) As indicated, snoRNAs determine the 
sites of modification by base-pairing to complementary sequences on the precursor 
rRNA. The snoRNAs are bound to proteins, and the complexes are called snoRNPs 
(small nucleolar ribonucleoproteins). snoRNPs contain both the guide sequences 
and the enzymes that modify the rRNA. 

The Nucleolus Is a Ribosome-Producing Factory 

The nucleolus is the most obvious structure seen in the nucleus of a eukaryotic 
cell when viewed in the light microscope. It was so closely scrutinized by early 
cytologists that an 1898 review could list some 700 references. We now know 
that the nucleolus is the site for the processing of rRNAs and their assembly into 
ribosome subunits. Unlike many of the major organelles in the cell, the nucleolus 
is not bound by a membrane (Figure 6-42); instead, it is a huge aggregate 
of macromolecules, including the rRNA genes themselves, precursor rRNAs, 
mature rRNAs, rRNA-processing enzymes, snoRNPs, a large set of assembly 
factors (including ATPases, GTPases, protein kinases, and RNA helicases), ribosomal 
proteins, and partly assembled ribosomes. The close association of all these 
components allows the assembly of ribosomes to occur rapidly and smoothly. 

Figure 6-42 Electron micrograph of a thin section of a nucleolus in a human 
fibroblast, showing its three distinct zones. (A) View of entire nucleus. 
(B) Higher-power view of the nucleolus. It is believed that transcription of the rRNA 
genes takes place between the fibrillar center and the dense fibrillar component and that 
processing of the rRNAs and their assembly into the two subunits of the ribosome 
proceeds outward from the dense fibrillar component to the surrounding granular 
components. (Courtesy of E.G. Jordan and J. McGovern.) 

Figure 6-43 Changes in the appearance of the nucleolus in a human cell 
during the cell cycle. Only the cell nucleus is represented in this diagram. In 
most eukaryotic cells, the nuclear envelope breaks down during mitosis, as 
indicated by the dashed circles. 

Various types of RNA molecules play a central part in the chemistry and 
structure of the nucleolus, suggesting that it may have evolved from an ancient 
structure present in cells dominated by RNA catalysis. In present-day cells, the rRNA 
genes have an important role in forming the nucleolus. In a diploid human cell, 
the rRNA genes are distributed into 10 clusters, located near the tips of five 
different chromosome pairs (see Figure 4-11). During interphase, these 10 
chromosomes contribute DNA loops (containing the rRNA genes) to the nucleolus; in 
M phase, when the chromosomes condense, the nucleolus fragments and then 
disappears. Then, in the telophase part of mitosis, as chromosomes return to 
their semi-dispersed state, the tips of the 10 chromosomes reform small 
nucleoli, which progressively coalesce into a single nucleolus (Figure 6-43 and Figure 
6-44). As might be expected, the size of the nucleolus reflects the number of 
ribosomes that the cell is producing. Its size therefore varies greatly in different cells 
and can change in a single cell, occupying 25% of the total nuclear volume in cells 
that are making unusually large amounts of protein. 

Ribosome assembly is a complex process, the most important features of which 
are outlined in Figure 6-45. In addition to its central role in ribosome biogenesis, 
the nucleolus is the site where other noncoding RNAs are produced and other 
RNA-protein complexes are assembled. For example, the U6 snRNP, which 
functions in pre-mRNA splicing (see Figure 6-28), is composed of one RNA molecule 
and at least seven proteins. The U6 snRNA is chemically modified by snoRNAs in 
the nucleolus before its final assembly there into the U6 snRNP. Other important 
RNA-protein complexes, including telomerase (encountered in Chapter 5) and the 
signal-recognition particle (which we discuss in Chapter 12), are assembled at the 
nucleolus. Finally, the tRNAs (transfer RNAs) that carry the amino acids for 
protein synthesis are processed there as well; like the rRNA genes, the genes encoding 
tRNAs are clustered in the nucleolus. Thus, the nucleolus can be thought of as a 
large factory at which different noncoding RNAs are transcribed, processed, and 
assembled with proteins to form a large variety of ribonucleoprotein complexes. 





Figure 6-44 Nucleolar fusion. These light micrographs of human fibroblasts grown in culture show 
various stages of nucleolar fusion. After mitosis, each of the 10 human chromosomes that carry a 
cluster of rRNA genes begins to form a tiny nucleolus, but these rapidly coalesce as they grow to 
form the single large nucleolus typical of many interphase cells. (Courtesy of E.G. Jordan and 
J. McGovern.) 


[Figure 6-43 diagram labels: nuclear envelope; nucleolus; nucleolar association; cell-cycle stages G2 (preparation for mitosis), prophase, metaphase, anaphase, telophase, preparation for DNA replication, and DNA replication.] 

Figure 6-45 The function of the 
nucleolus in ribosome and other 
ribonucleoprotein synthesis. The 45S 

precursor rRNA is packaged in a large 
ribonucleoprotein particle containing 
many ribosomal proteins imported from 
the cytoplasm. While this particle remains 
at the nucleolus, selected components 
are added and others discarded as it is 
processed into immature large and small 
ribosomal subunits. The two ribosomal 
subunits attain their final functional form 
only after each is individually transported 
through the nuclear pores into the 
cytoplasm. Other ribonucleoprotein 
complexes, including telomerase shown 
here, are also assembled in the nucleolus. 


The Nucleus Contains a Variety of Subnuclear Aggregates 

Although the nucleolus is the most prominent structure in the nucleus, 
several other nuclear bodies have been observed and studied (Figure 6-46). These 
include Cajal bodies (named for the scientist who first described them in 1906) 
and interchromatin granule clusters (also called “speckles”). Like the nucleolus, 
these other nuclear structures lack membranes and are highly dynamic, 
depending on the needs of the cell. Their assembly is likely mediated by the association of 
low-complexity protein domains, as described in Chapter 3 (see Figure 3-36). Their 
appearance is the result of the tight association of protein and RNA components 
involved in the synthesis, assembly, and storage of macromolecules involved in 
gene expression. Cajal bodies are sites where the snRNPs and snoRNPs undergo 
their final maturation steps, and where the snRNPs are recycled and their RNAs 
are “reset” after the rearrangements that occur during splicing (see p. 321). In 
contrast, the interchromatin granule clusters have been proposed to be stockpiles 
of fully mature snRNPs and other RNA processing components that are ready to 
be used in the production of mRNA. 

Scientists have had difficulties in working out the function of these small 
subnuclear structures, in part because their appearances can change dramatically as 
cells traverse the cell cycle or respond to changes in their environment. Moreover, 
disrupting a particular type of nuclear body often has little effect on cell 
viability. It seems that the main function of these aggregates is to bring components 
together at high concentration in order to speed up their assembly. For example, 
it is estimated that assembly of the U4/U6 snRNP (see Figure 6-28) occurs ten 
times more rapidly in Cajal bodies than would be the case if the same number of 
components were dispersed throughout the nucleus. Consequently, Cajal bodies 
appear dispensable in many types of cells but are absolutely required in situations 
where cells must proliferate rapidly, such as in early vertebrate development. 
Here, protein synthesis (which depends on RNA splicing) must be especially 
rapid, and delays can be lethal. 

Figure 6-46 Visualization of some prominent nuclear bodies. The 
protein fibrillarin (red), a component of several snoRNPs, is present at both 
nucleoli and Cajal bodies; the latter are indicated by the arrows. The Cajal 
bodies (but not the nucleoli) are also highlighted by staining one of their 
main components, the protein coilin; the superposition of the snoRNP and 
coilin stains appears pink. Interchromatin granule clusters (green) have been 
revealed by using antibodies against a protein involved in pre-mRNA splicing. 
DNA is stained blue by the dye DAPI. (From J.R. Swedlow and A.I. Lamond, 
Gen. Biol. 2:1-7, 2001. With permission from BioMed Central. Micrograph 
courtesy of Judith Sleeman.) 

Given the prominence of nuclear bodies in RNA processing, it might be 
expected that pre-mRNA splicing would occur in a particular location in the 
nucleus, as it requires numerous RNA and protein components. However, as we 
have seen, the assembly of splicing components on pre-mRNA is 
co-transcriptional; thus, splicing must occur at many locations along chromosomes. Although 
a typical mammalian cell may be expressing on the order of 15,000 genes, 
transcription and RNA splicing take place in only several thousand sites in the 
nucleus. These sites are highly dynamic and probably result from the association 
of transcription and splicing components to create small factories, the name given 
to specific aggregates containing a high local concentration of selected 
components that create biochemical assembly lines (Figure 6-47). Interchromatin 
granule clusters, which contain stockpiles of RNA processing components, are 
often observed next to these sites of transcription, as though poised to 
replenish supplies. We can thus view the nucleus as organized into subdomains, with 
snRNPs, snoRNPs, and other nuclear components moving among them in an 
orderly fashion according to the needs of the cell. 

Figure 6-47 A model for an mRNA production factory. mRNA production is made more efficient in the nucleus by an aggregation of the many 
components needed for transcription and pre-mRNA processing, thereby producing a specialized biochemical factory. In (A), a postulated scaffold 
protein holds various components in the proximity of a transcribing RNA polymerase. Other key components are bound directly to the RNA 
polymerase tail, which likewise serves as a scaffold (see Figure 6-22), but for simplicity these are not shown here. In (B), a large number of such 
scaffolds have been brought together to form an aggregate that is highly enriched in the many components needed for the synthesis and processing 
of pre-mRNAs. Such a scaffold model can account for the several thousand sites of active RNA transcription and processing typically observed 
in the nucleus of a mammalian cell, each of which has a diameter of roughly 100 nm and is estimated to contain, on average, about 10 RNA 
polymerase II molecules in addition to many other proteins. (C) Here, mRNA production factories and DNA replication factories have been visualized 
in the same mammalian cell by briefly incorporating differently modified nucleotides into each nucleic acid and detecting the RNA and DNA produced 
using antibodies, one (green) detecting the newly synthesized DNA and the other (red) detecting the newly synthesized RNA. (C, from D.G. Wansink 
et al., J. Cell Sci. 107:1449-1456, 1994. With permission from The Company of Biologists.) 

Summary 

Before the synthesis of a particular protein can begin, the corresponding mRNA 
molecule must be produced by transcription. Bacteria contain a single type of RNA 
polymerase (the enzyme that carries out the transcription of DNA into RNA). An 
mRNA molecule is produced after this enzyme initiates transcription at a promoter, 
synthesizes the RNA by chain elongation, stops transcription at a terminator, and 
releases both the DNA template and the completed mRNA molecule. In eukaryotic 
cells, the process of transcription is much more complex, and there are three 
RNA polymerases (polymerase I, II, and III) that are related evolutionarily to one 
another and to the bacterial polymerase. 

RNA polymerase II synthesizes eukaryotic mRNA. This enzyme requires a set of 
additional proteins, both the general transcription factors and specific 
transcriptional activator proteins, to initiate transcription on a DNA template. It requires 
still more proteins (including chromatin remodeling complexes and 
histone-modifying enzymes) to initiate transcription on its chromatin templates inside the cell. 

During the elongation phase of transcription, the nascent RNA undergoes three 
types of processing events: a special nucleotide is added to its 5' end (capping), 
intron sequences are removed from the middle of the RNA molecule (splicing), and 
the 3' end of the RNA is generated (cleavage and polyadenylation). Each of these 
processes is initiated by proteins that travel along with RNA polymerase II by 
binding to sites on its long, extended C-terminal tail. Splicing is unusual in that many 
of its key steps are carried out by specialized RNA molecules rather than proteins. 
Only properly processed mRNAs are passed through nuclear pore complexes into 
the cytosol, where they are translated into protein. 

For many genes, RNA, rather than protein, is the final product. In eukaryotes, 
these genes are usually transcribed by either RNA polymerase I or RNA polymerase 
III. RNA polymerase I makes the ribosomal RNAs. After their synthesis as a large 
precursor, the rRNAs are chemically modified, cleaved, and assembled into the two 
ribosomal subunits in the nucleolus, a distinct subnuclear structure that also helps 
to process some smaller RNA-protein complexes in the cell. Additional subnuclear 
structures (including Cajal bodies and interchromatin granule clusters) are sites 
where components involved in RNA processing are assembled, stored, and recycled. 
The high concentration of components in such “factories” ensures that the processes 
being catalyzed are rapid and efficient. 

FROM RNA TO PROTEIN 

In the preceding section, we have seen that the final product of some genes is an 
RNA molecule itself, such as the RNAs present in the snRNPs and in ribosomes. 
However, most genes in a cell produce mRNA molecules that serve as 
intermediaries on the pathway to proteins. In this section, we examine how the cell converts 
the information carried in an mRNA molecule into a protein molecule. This feat 
of translation was a strong focus of attention for biologists in the late 1950s, when 
it was posed as the "coding problem": how is the information in a linear sequence 
of nucleotides in RNA translated into the linear sequence of a chemically quite 
different set of units, the amino acids in proteins? This fascinating question 
stimulated great excitement. Here was a cryptogram set up by nature that, after more 
than 3 billion years of evolution, could finally be solved by one of the products 
of evolution: human beings. And indeed, not only was the code cracked step by 
step, but in the year 2000 the structure of the elaborate machinery by which cells 
read this code, the ribosome, was finally revealed in atomic detail. 

An mRNA Sequence Is Decoded in Sets of Three Nucleotides 

Once an mRNA has been produced by transcription and processing, the 
information present in its nucleotide sequence is used to synthesize a protein. 
Transcription is simple to understand as a means of information transfer: since DNA and 
RNA are chemically and structurally similar, the DNA can act as a direct template 
for the synthesis of RNA by complementary base-pairing. As the term 
transcription signifies, it is as if a message written out by hand is being converted, say, into 
a typewritten text. The language itself and the form of the message do not change, 
and the symbols used are closely related. 

In contrast, the conversion of the information in RNA into protein represents 
a translation of the information into another language that uses quite different 
symbols. Moreover, since there are only 4 different nucleotides in mRNA and 20 
different types of amino acids in a protein, this translation cannot be accounted 
for by a direct one-to-one correspondence between a nucleotide in RNA and an 
amino acid in protein. The nucleotide sequence of a gene, through the 
intermediary of mRNA, is instead translated into the amino acid sequence of a protein by 
rules that are known as the genetic code. This code was deciphered in the early 
1960s. 

The sequence of nucleotides in the mRNA molecule is read in consecutive
groups of three. RNA is a linear polymer of four different nucleotides, so there are
4 × 4 × 4 = 64 possible combinations of three nucleotides: the triplets AAA, AUA,
AUG, and so on. However, only 20 different amino acids are commonly found in
proteins. Either some nucleotide triplets are never used, or the code is redundant
and some amino acids are specified by more than one triplet. The second possibility
is, in fact, the correct one, as shown by the completely deciphered genetic
code in Figure 6-48. Each group of three consecutive nucleotides in RNA is called
a codon, and each codon specifies either one amino acid or a stop to the translation
process.
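
The counting argument above can be checked with a short Python sketch. It
assumes the standard genetic code, written here as a 64-character string of
one-letter amino acid codes in U, C, A, G order ('*' marks a stop codon); the
variable names are our own and are chosen only for illustration.

```python
from collections import Counter
from itertools import product

BASES = "UCAG"
# One-letter amino acid codes for all 64 codons, ordered UUU, UUC, UUA, UUG, UCU, ...
# '*' marks a stop codon. This is the standard code, assumed here.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each triplet (e.g. "AUG") to the residue it specifies.
CODON_TABLE = {
    "".join(triplet): aa
    for triplet, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

codons_per_residue = Counter(CODON_TABLE.values())
stop_codons = sorted(c for c, aa in CODON_TABLE.items() if aa == "*")
sense_codons = len(CODON_TABLE) - len(stop_codons)
amino_acids = [aa for aa in codons_per_residue if aa != "*"]

print("possible triplets:", len(CODON_TABLE))
print("codons that specify an amino acid:", sense_codons)
print("stop codons:", stop_codons)
print("amino acids specified:", len(amino_acids))
print("codons for Leu, Met, Trp:",
      codons_per_residue["L"], codons_per_residue["M"], codons_per_residue["W"])
```

Counting the table entries this way makes the redundancy explicit: most amino
acids appear more than once among the 64 triplets, whereas methionine and
tryptophan each appear only once.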

This genetic code is used universally in all present-day organisms. Although a
few slight differences in the code have been found, these are chiefly in the DNA of
mitochondria. Mitochondria have their own transcription and protein-synthesis
systems that operate quite independently from those of the rest of the cell, and it
is understandable that their tiny genomes have been able to accommodate minor
changes to the code (discussed in Chapter 14).

In principle, an RNA sequence can be translated in any one of three different 
reading frames, depending on where the decoding process begins (Figure 6-49). 
However, only one of the three possible reading frames in an mRNA encodes the 
required protein. We see later how a special punctuation signal at the beginning of
each RNA message sets the correct reading frame at the start of protein synthesis. 
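
To make the idea of reading frames concrete, the following Python sketch
translates one short RNA in each of its three frames. The sequence
CUCAGCGUUACCAU is an illustrative choice of ours, the codon table is the
standard code, and the helper name translate_frame is hypothetical.

```python
from itertools import product

BASES = "UCAG"
# Standard genetic code in UCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(triplet): aa
    for triplet, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_frame(rna, frame):
    """Translate rna (written 5' to 3') starting at offset frame (0, 1, or 2).

    Bases left over at the 3' end that do not fill a codon are ignored.
    """
    codons = (rna[i:i + 3] for i in range(frame, len(rna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


rna = "CUCAGCGUUACCAU"  # illustrative sequence, not taken from a real gene
for frame in range(3):
    print("frame", frame + 1, "->", translate_frame(rna, frame))
```

Each frame produces a completely different peptide from the same nucleotides,
which is why the starting point of decoding has to be fixed precisely.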

tRNA Molecules Match Amino Acids to Codons in mRNA

The codons in an mRNA molecule do not directly recognize the amino acids they
specify: the group of three nucleotides does not, for example, bind directly to the
amino acid. Rather, the translation of mRNA into protein depends on adaptor
molecules that can recognize and bind both to the codon and, at another site on
their surface, to the amino acid. These adaptors consist of a set of small RNA
molecules known as transfer RNAs (tRNAs), each about 80 nucleotides in length.


Figure 6-48 The genetic code. The standard one-letter abbreviation for each
amino acid is presented below its three-letter abbreviation (see Panel 3-1,
pp. 112-113, for the full name of each amino acid and its structure). By
convention, codons are always written with the 5'-terminal nucleotide to the
left. Note that most amino acids are represented by more than one codon, and
that there are some regularities in the set of codons that specifies each amino
acid: codons for the same amino acid tend to contain the same nucleotides at the
first and second positions, and vary at the third position. Three codons do not
specify any amino acid but act as termination sites (stop codons), signaling the
end of the protein-coding sequence. One codon, AUG, acts both as an initiation
codon, signaling the start of a protein-coding message, and also as the codon
that specifies methionine.


Figure 6-49 The three possible reading frames in protein synthesis. In the
process of translating a nucleotide sequence (blue) into an amino acid sequence
(red), the sequence of nucleotides in an mRNA molecule is read from the 5' end
to the 3' end in consecutive sets of three nucleotides. In principle, therefore,
the same RNA sequence can specify three completely different amino acid
sequences, depending on the reading frame. In reality, however, only one of
these reading frames contains the actual message. [Diagram: one mRNA, drawn 5'
to 3', read in three frames that yield Leu-Ser-Val-Thr, Ser-Ala-Leu-Pro, and
Gln-Arg-Tyr-His.]

Figure 6-50 A tRNA molecule. A tRNA specific for the amino acid phenylalanine
(Phe) is depicted in various ways. (A) The cloverleaf structure showing the
complementary base-pairing (red lines) that creates the double-helical regions
of the molecule. The anticodon is the sequence of three nucleotides that
base-pairs with a codon in mRNA. The amino acid matching the codon/anticodon
pair is attached at the 3' end of the tRNA. tRNAs contain some unusual bases,
which are produced by chemical modification after the tRNA has been synthesized.
For example, the bases denoted Ψ (pseudouridine; see Figure 6-41) and D
(dihydrouridine; see Figure 6-53) are derived from uracil. (B and C) Views of
the L-shaped molecule, based on x-ray diffraction analysis. Although this
diagram shows the tRNA for the amino acid phenylalanine, all other tRNAs have
similar structures. (D) The tRNA icon we use in this book. (E) The linear
nucleotide sequence of the molecule, color-coded to match (A), (B), and (C).


We saw earlier in this chapter that RNA molecules can fold into precise
three-dimensional structures, and the tRNA molecules provide a striking example.
Four short segments of the folded tRNA are double-helical, producing a molecule
that looks like a cloverleaf when drawn schematically (Figure 6-50). For
example, a 5'-GCUC-3' sequence in one part of a polynucleotide chain can form
a relatively strong association with a 5'-GAGC-3' sequence in another region of
the same molecule. The cloverleaf undergoes further folding to form a compact
L-shaped structure that is held together by additional hydrogen bonds between
different regions of the molecule (see Figure 6-50B and C).
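
The pairing of 5'-GCUC-3' with 5'-GAGC-3' follows from the fact that the two
strands of an RNA helix run antiparallel. A minimal Python sketch, assuming
only conventional A-U and G-C pairs (the function names are ours), confirms
that one segment is the reverse complement of the other:

```python
# Conventional Watson-Crick partners in RNA; G-U wobble pairs are ignored here.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the strand, written 5' to 3', that pairs antiparallel with rna."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def can_form_helix(segment_a, segment_b):
    """True if two segments (both written 5' to 3') can pair along their full length."""
    return reverse_complement(segment_a) == segment_b


print(reverse_complement("GCUC"))
print(can_form_helix("GCUC", "GAGC"))
```

Real tRNA helices also tolerate some nonstandard pairs, so this test is
stricter than the molecule itself.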

Two regions of unpaired nucleotides situated at either end of the L-shaped
molecule are crucial to the function of tRNA in protein synthesis. One of these
regions forms the anticodon, a set of three consecutive nucleotides that pairs
with the complementary codon in an mRNA molecule. The other is a short
single-stranded region at the 3' end of the molecule; this is the site where the
amino acid that matches the codon is attached to the tRNA.

We saw above that the genetic code is redundant; that is, several different
codons can specify a single amino acid. This redundancy implies either that there
is more than one tRNA for many of the amino acids or that some tRNA molecules
can base-pair with more than one codon. In fact, both situations occur. Some
amino acids have more than one tRNA and some tRNAs are constructed so that
they require accurate base-pairing only at the first two positions of the codon
and can tolerate a mismatch (or wobble) at the third position (Figure 6-51). This
wobble base-pairing explains why so many of the alternative codons for an amino
acid differ only in their third nucleotide (see Figure 6-48). In bacteria, wobble
base-pairings make it possible to fit the 20 amino acids to their 61 codons with as
few as 31 kinds of tRNA molecules. The exact number of different kinds of tRNAs,
however, differs from one species to the next. For example, humans have nearly
500 tRNA genes, and among them 48 different anticodons are represented.

Figure 6-51 Wobble base-pairing between codons and anticodons. If the
nucleotide listed in the first column is present at the third, or wobble, position
of the codon, it can base-pair with any of the nucleotides listed in the second
column. Thus, for example, when inosine (I) is present in the wobble position
of the tRNA anticodon, the tRNA can recognize any one of three different
codons in bacteria and either of two codons in eukaryotes. The inosine in
tRNAs is formed from the deamination of adenosine (see Figure 6-53), a
chemical modification that takes place after the tRNA has been synthesized.
The nonstandard base pairs, including those made with inosine, are generally
weaker than conventional base pairs. Codon-anticodon base-pairing is
more stringent at positions 1 and 2 of the codon, where only conventional
base pairs are permitted. The differences in wobble base-pairing interactions
between bacteria and eukaryotes presumably result from subtle structural
differences between bacterial and eukaryotic ribosomes, the molecular
machines that perform protein synthesis. (Adapted from C. Guthrie and
J. Abelson, in The Molecular Biology of the Yeast Saccharomyces:
Metabolism and Gene Expression, pp. 487-528. Cold Spring Harbor, New
York: Cold Spring Harbor Laboratory Press, 1982.)
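
How a single tRNA can read several codons is easy to explore in a few lines of
Python. The sketch below is not a transcription of Figure 6-51; it assumes the
classic wobble rules as they are usually stated for bacteria (a wobble-position
C in the anticodon reads G; A reads U; U reads A or G; G reads C or U; inosine
reads U, C, or A). The function name codons_read and the example anticodons are
illustrative choices of ours.

```python
# Conventional pairs, required at codon positions 1 and 2.
WATSON_CRICK = {"A": "U", "U": "A", "G": "C", "C": "G"}

# Assumed wobble rules: anticodon wobble base -> codon third bases it can read.
WOBBLE = {"C": "G", "A": "U", "U": "AG", "G": "CU", "I": "UCA"}


def codons_read(anticodon):
    """List the codons (5' to 3') read by an anticodon written 5' to 3'.

    Pairing is antiparallel, so the first (5') base of the anticodon is the
    wobble base and faces the third base of the codon.
    """
    wobble_base, middle_base, last_base = anticodon
    first_two = WATSON_CRICK[last_base] + WATSON_CRICK[middle_base]
    return [first_two + third for third in WOBBLE[wobble_base]]


print(codons_read("IGC"))  # an alanine anticodon carrying inosine
print(codons_read("GAA"))  # a phenylalanine anticodon
print(codons_read("CAU"))  # the methionine anticodon
```

Under these assumed rules an inosine-containing anticodon covers three codons
that differ only at the third position, whereas an anticodon beginning with C
reads a single codon.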

tRNAs Are Covalently Modified Before They Exit from the Nucleus

Like most other eukaryotic RNAs, tRNAs are covalently modified before they are
allowed to exit from the nucleus. Eukaryotic tRNAs are synthesized by RNA
polymerase III. Both bacterial and eukaryotic tRNAs are typically synthesized as
larger precursor tRNAs, which are then trimmed to produce the mature tRNA. In
addition, some tRNA precursors (from both bacteria and eukaryotes) contain introns
that must be spliced out. This splicing reaction differs chemically from pre-mRNA
splicing; rather than generating a lariat intermediate, tRNA splicing uses a
cut-and-paste mechanism that is catalyzed by proteins (Figure 6-52). Trimming and
splicing both require the precursor tRNA to be correctly folded in its cloverleaf
configuration. Because misfolded tRNA precursors will not be processed properly,
the trimming and splicing reactions serve as quality-control steps in the
generation of tRNAs.

All tRNAs are modified chemically: nearly 1 in 10 nucleotides in each mature
tRNA molecule is an altered version of a standard G, U, C, or A ribonucleotide.
Over 50 different types of tRNA modifications are known; a few are shown in
Figure 6-53. Some of the modified nucleotides (most notably inosine, produced by
the deamination of adenosine) affect the conformation and base-pairing of the
anticodon and thereby facilitate the recognition of the appropriate mRNA codon
by the tRNA molecule (see Figure 6-51). Others affect the accuracy with which the
tRNA is attached to the correct amino acid.

Specific Enzymes Couple Each Amino Acid to Its Appropriate
tRNA Molecule

We have seen that, to read the genetic code in DNA, cells make a series of
different tRNAs. We now consider how each tRNA molecule becomes linked to the one
amino acid in 20 that is its appropriate partner. Recognition and attachment of the
correct amino acid depends on enzymes called aminoacyl-tRNA synthetases,
which covalently couple each amino acid to its appropriate set of tRNA molecules
(Figure 6-54 and Figure 6-55). Most cells have a different synthetase enzyme for
each amino acid (that is, 20 synthetases in all); one attaches glycine to all tRNAs
that recognize codons for glycine, another attaches alanine to all tRNAs that
recognize codons for alanine, and so on. Many bacteria, however, have fewer than 20
synthetases, and the same synthetase enzyme is responsible for coupling more
than one amino acid to the appropriate tRNAs. In these cases, a single synthetase
places the identical amino acid on two different types of tRNAs, only one of which
has an anticodon that matches the amino acid. A second enzyme then chemically
modifies each "incorrectly" attached amino acid so that it now corresponds to the
anticodon displayed by its covalently linked tRNA.

Figure 6-52 Structure of a tRNA-splicing endonuclease docked to a precursor
tRNA. The endonuclease (a four-subunit enzyme) removes the tRNA intron (dark
blue, bottom). A second enzyme, a multifunctional tRNA ligase (not shown),
then joins the two tRNA halves together. (Courtesy of Hong Li, Christopher Trotta,
and John Abelson; PDB code: 2A9L.)

Figure 6-53 A few of the unusual nucleotides found in tRNA molecules. These
nucleotides are produced by covalent modification of a normal nucleotide after
it has been incorporated into an RNA chain. Two other types of modified
nucleotides are shown in Figure 6-41. In most tRNA molecules, about 10% of the
nucleotides are modified (see Figure 6-50). As shown in Figure 6-51, inosine is
sometimes present at the wobble position in the tRNA anticodon.

The synthetase-catalyzed reaction that attaches the amino acid to the 3' end of 
the tRNA is one of many reactions coupled to the energy-releasing hydrolysis of 
ATP (see pp. 64-65), and it produces a high-energy bond between the tRNA and 
the amino acid. The energy of this bond is used at a later stage in protein synthesis 
to link the amino acid covalently to the growing polypeptide chain.

The aminoacyl-tRNA synthetase enzymes and the tRNAs are equally important
in the decoding process (Figure 6-56). This was established by an experiment in
which one amino acid (cysteine) was chemically converted into a different amino
acid (alanine) after it already had been attached to its specific tRNA. When such
"hybrid" aminoacyl-tRNA molecules were used for protein synthesis in a cell-free
system, the wrong amino acid was inserted at every point in the protein chain
where that tRNA was used. Although, as we shall see, cells have several quality
control mechanisms to avoid this type of mishap, the experiment did establish
that the genetic code is translated by two sets of adaptors that act sequentially.
Each matches one molecular surface to another with great specificity, and it is
their combined action that associates each sequence of three nucleotides in the
mRNA molecule (that is, each codon) with its particular amino acid.

Figure 6-54 Amino acid activation by synthetase enzymes. An amino acid is
activated for protein synthesis by an aminoacyl-tRNA synthetase enzyme in two
steps. As indicated, the energy of ATP hydrolysis is used to attach each amino
acid to its tRNA molecule in a high-energy linkage. The amino acid is first
activated through the linkage of its carboxyl group directly to AMP, forming
an adenylated amino acid; the linkage of the AMP, normally an unfavorable
reaction, is driven by the hydrolysis of the ATP molecule that donates the AMP.
Without leaving the synthetase enzyme, the AMP-linked carboxyl group on the
amino acid is then transferred to a hydroxyl group on the sugar at the 3' end of
the tRNA molecule. This transfer joins the amino acid by an activated ester
linkage to the tRNA and forms the final aminoacyl-tRNA molecule. The synthetase
enzyme is not shown in this diagram.
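
The logic of the cysteine-to-alanine experiment can be captured by treating the
two adaptors as two separate lookup steps. In the Python sketch below, the
dictionary names, the tRNA labels, and the four-codon message are hypothetical;
only the codon assignments come from the standard genetic code.

```python
# Adaptor 1 (synthetase): which amino acid each tRNA carries after charging.
CHARGING = {"tRNA-Cys": "Cys", "tRNA-Ala": "Ala", "tRNA-Trp": "Trp"}

# Adaptor 2 (tRNA anticodon): which tRNA pairs with each codon.
DECODING = {"UGU": "tRNA-Cys", "UGC": "tRNA-Cys",
            "GCU": "tRNA-Ala", "UGG": "tRNA-Trp"}


def translate(codons, charging):
    """Decode each codon to a tRNA, then report the amino acid that tRNA carries."""
    return [charging[DECODING[codon]] for codon in codons]


message = ["UGG", "UGU", "GCU", "UGC"]

normal = translate(message, CHARGING)

# Chemically convert the cysteine already attached to tRNA-Cys into alanine.
hybrid_charging = {**CHARGING, "tRNA-Cys": "Ala"}
hybrid = translate(message, hybrid_charging)

print(normal)
print(hybrid)
```

Because the second lookup never inspects the amino acid, every cysteine codon
in the message receives alanine once the tRNA has been mischarged. The ribosome
trusts the tRNA, not its cargo.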


Figure 6-55 The structure of the aminoacyl-tRNA linkage. The carboxyl end of
the amino acid forms an ester bond to ribose. Because the hydrolysis of this
ester bond is associated with a large favorable change in free energy, an amino
acid held in this way is said to be activated. (A) Schematic drawing of the
structure. The amino acid is linked to the nucleotide at the 3' end of the tRNA
(see Figure 6-50). (B) Actual structure corresponding to the boxed region in
(A). There are two major classes of synthetase enzymes: one links the amino
acid directly to the 3'-OH group of the ribose, and the other links it initially
to the 2'-OH group. In the latter case, a subsequent transesterification
reaction shifts the amino acid to the 3' position. As in Figure 6-54, the
"R group" indicates the side chain of the amino acid.


Editing by tRNA Synthetases Ensures Accuracy

Several mechanisms working together ensure that an aminoacyl-tRNA synthetase
links the correct amino acid to each tRNA. Most synthetase enzymes select
the correct amino acid by a two-step mechanism. The correct amino acid has
the highest affinity for the active-site pocket of its synthetase and is therefore
favored over the other 19; in particular, amino acids larger than the correct one
are excluded from the active site. However, accurate discrimination between two
similar amino acids, such as isoleucine and valine (which differ by only a methyl
group), is very difficult to achieve in a single step. A second discrimination step
occurs after the amino acid has been covalently linked to AMP (see Figure 6-54):
when tRNA binds, the synthetase tries to force the adenylated amino acid into
a second editing pocket in the enzyme. The precise dimensions of this pocket
exclude the correct amino acid, while allowing access by closely related amino
acids. In the editing pocket, an amino acid is removed from the AMP (or from the
tRNA itself if the aminoacyl-tRNA bond has already formed) by hydrolysis. This
hydrolytic editing, which is analogous to the exonucleolytic proofreading by DNA
polymerases, increases the overall accuracy of tRNA charging to approximately
one mistake in 40,000 couplings (Figure 6-57).

Figure 6-56 The genetic code is translated by means of two adaptors that act
one after another. The first adaptor is the aminoacyl-tRNA synthetase, which
couples a particular amino acid to its corresponding tRNA; the second adaptor
is the tRNA molecule itself, whose anticodon forms base pairs with the
appropriate codon on the mRNA. An error in either step would cause the wrong
amino acid to be incorporated into a protein chain (Movie 6.6). In the sequence
of events shown, the amino acid tryptophan (Trp) is selected by the codon UGG
on the mRNA.

Figure 6-57 Hydrolytic editing. (A) Aminoacyl-tRNA synthetases correct their
own coupling errors through hydrolytic editing of incorrectly attached amino
acids. As described in the text, the correct amino acid is rejected by the
editing site. (B) The error-correction process performed by DNA polymerase has
similarities; however, it differs because the removal process depends strongly
on a mispairing with the template (see Figure 5-8). (P, polymerization site;
E, editing site.)
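
The two-step selection described above behaves like a pair of sieves, and a
deliberately crude Python cartoon can show the logic. Here the number of
side-chain carbon atoms stands in for size, which is our simplifying
assumption; real synthetases discriminate by shape and chemistry as well. The
function name double_sieve is hypothetical, and the isoleucine enzyme is used
as the example.

```python
# Crude size proxy (an assumption): number of carbon atoms in the side chain.
SIDE_CHAIN_CARBONS = {"Val": 3, "Ile": 4, "Phe": 7}


def double_sieve(candidate, cognate="Ile"):
    """Report the fate of a candidate amino acid offered to the cognate's synthetase."""
    size = SIDE_CHAIN_CARBONS[candidate]
    cognate_size = SIDE_CHAIN_CARBONS[cognate]
    if size > cognate_size:
        # First sieve: the active-site pocket excludes larger amino acids.
        return "excluded from the active site"
    if size < cognate_size:
        # Second sieve: the editing pocket admits smaller amino acids and
        # removes them by hydrolysis.
        return "activated, then hydrolyzed in the editing pocket"
    # The correct amino acid fits the active site but not the editing pocket.
    return "charged onto tRNA"


for amino_acid in ("Phe", "Val", "Ile"):
    print(amino_acid, "->", double_sieve(amino_acid))
```

Only the amino acid that passes the first sieve and fails to enter the second
remains attached to the tRNA.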

The tRNA synthetase must also recognize the correct set of tRNAs, and extensive
structural and chemical complementarity between the synthetase and the
tRNA allows the synthetase to probe various features of the tRNA (Figure 6-58).
Most tRNA synthetases directly recognize the matching tRNA anticodon; these
synthetases contain three adjacent nucleotide-binding pockets, each of which is
complementary in shape and charge to a nucleotide in the anticodon. For other
synthetases, the nucleotide sequence of the amino acid-accepting arm (acceptor
stem) is the key recognition determinant. In most cases, however, the synthetase
"reads" the nucleotides at several different positions on the tRNA.

Amino Acids Are Added to the C-terminal End of a Growing
Polypeptide Chain

Having seen that each amino acid is first coupled to specific tRNA molecules, we
now turn to the mechanism that joins these amino acids together to form proteins.
The fundamental reaction of protein synthesis is the formation of a peptide bond
between the carboxyl group at the end of a growing polypeptide chain and a free
amino group on an incoming amino acid. Consequently, a protein is synthesized
stepwise from its N-terminal end to its C-terminal end. Throughout the entire
process, the growing carboxyl end of the polypeptide chain remains activated
by its covalent attachment to a tRNA molecule (forming a peptidyl-tRNA). Each
addition disrupts this high-energy covalent linkage, but immediately replaces it
with an identical linkage on the most recently added amino acid (Figure 6-59).
In this way, each amino acid added carries with it the activation energy for the
addition of the next amino acid rather than the energy for its own addition, an
example of the "head growth" type of polymerization described in Figure 2-44.

Figure 6-58 The recognition of a tRNA molecule by its aminoacyl-tRNA
synthetase. For this tRNA (tRNA^Gln), specific nucleotides in both the anticodon
(dark blue) and the amino acid-accepting arm (green) allow the correct tRNA to be
recognized by the synthetase enzyme (yellow-green). A bound ATP molecule is
yellow. (Courtesy of Tom Steitz; PDB code: 1QRS.)

Figure 6-59 The incorporation of an amino acid into a protein. A polypeptide
chain grows by the stepwise addition of amino acids to its C-terminal end. The
formation of each peptide bond is energetically favorable because the growing
C-terminus has been activated by the covalent attachment of a tRNA molecule. The
peptidyl-tRNA linkage that activates the growing end is regenerated during each
addition. The amino acid side chains have been abbreviated as R1, R2, R3, and
R4; as a reference point, all of the atoms in the second amino acid in the
polypeptide chain are shaded gray. The figure shows the addition of the fourth
amino acid (red) to the growing chain.

The RNA Message Is Decoded in Ribosomes

The synthesis of proteins is guided by information carried by mRNA molecules. To
maintain the correct reading frame and to ensure accuracy (about 1 mistake every
10,000 amino acids), protein synthesis is performed in the ribosome, a complex
catalytic machine made from more than 50 different proteins (the ribosomal
proteins) and several RNA molecules, the ribosomal RNAs (rRNAs). A typical
eukaryotic cell contains millions of ribosomes in its cytoplasm (Figure 6-60). The
large and small ribosome subunits are assembled at the nucleolus, where newly
transcribed and modified rRNAs associate with the ribosomal proteins that have
been transported into the nucleus after their synthesis in the cytoplasm. These two
ribosomal subunits are then exported to the cytoplasm, where they join together
to synthesize proteins.
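
To get a feel for what an error rate of about 1 in 10,000 means for a whole
protein, the short Python sketch below estimates the fraction of chains made
without a single mistake. It assumes that errors at different positions are
independent, which is our simplification; the function name and the chain
lengths are illustrative.

```python
def fraction_error_free(length, error_rate=1 / 10_000):
    """Probability that a chain of `length` residues contains no misincorporation,
    assuming each position fails independently with probability error_rate."""
    return (1 - error_rate) ** length


for length in (100, 400, 1000, 5000):
    print(length, "amino acids:", round(fraction_error_free(length), 3))
```

Under this assumption, a protein of a few hundred amino acids is made without
error in the large majority of cases, while very long chains are noticeably
more likely to contain at least one mistake.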



Figure 6-60 Ribosomes in the cytoplasm of a eukaryotic cell. This electron
micrograph shows a thin section of a small region of cytoplasm. The ribosomes
appear as black dots (red arrows). Some are free in the cytosol; others are
attached to membranes of the endoplasmic reticulum. Scale bar, 400 nm.
(Courtesy of Daniel S. Friend.)

Figure 6-61 A comparison of bacterial and eukaryotic ribosomes. Despite
differences in the number and size of their rRNA and protein components, both
bacterial and eukaryotic ribosomes have nearly the same structure and they
function similarly. Although the 18S and 28S rRNAs of the eukaryotic ribosome
contain many nucleotides not present in their bacterial counterparts, these
nucleotides are present as multiple insertions that form extra domains and leave
the basic structure of the rRNA largely unchanged.

Eukaryotic and bacterial ribosomes have similar structures and functions,
being composed of one large and one small subunit that fit together to form a
complete ribosome with a mass of several million daltons (Figure 6-61). The small
subunit provides the framework on which the tRNAs are accurately matched to
the codons of the mRNA, while the large subunit catalyzes the formation of the
peptide bonds that link the amino acids together into a polypeptide chain (see
Figure 6-58).

When not actively synthesizing proteins, the two subunits of the ribosome
are separate. They join together on an mRNA molecule, usually near its 5' end,
to initiate the synthesis of a protein. The mRNA is then pulled through the
ribosome, three nucleotides at a time. As its codons enter the core of the ribosome, the
mRNA nucleotide sequence is translated into an amino acid sequence using the
tRNAs as adaptors to add each amino acid in the correct sequence to the growing
end of the polypeptide chain. When a stop codon is encountered, the ribosome
releases the finished protein, and its two subunits separate again. These subunits
can then be used to start the synthesis of another protein on another mRNA
molecule. Ribosomes operate with remarkable efficiency: in one second, a eukaryotic
ribosome adds 2 amino acids to a polypeptide chain; the ribosomes of bacterial
cells operate even faster, at a rate of about 20 amino acids per second.
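
These rates translate directly into synthesis times. The Python sketch below
uses the two elongation rates quoted above and ignores initiation and
termination, which is our simplification; the protein length of 400 amino acids
is an arbitrary example.

```python
# Elongation rates quoted in the text, in amino acids per second.
RATES_AA_PER_SECOND = {"eukaryotic": 2, "bacterial": 20}


def synthesis_time_seconds(length_aa, ribosome):
    """Time for one ribosome to elongate a chain of length_aa residues."""
    return length_aa / RATES_AA_PER_SECOND[ribosome]


for kind in ("eukaryotic", "bacterial"):
    print(kind, synthesis_time_seconds(400, kind), "seconds")
```

On these figures, a single eukaryotic ribosome needs a few minutes to complete
a protein of this size, and a bacterial ribosome needs well under a minute.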

To choreograph the many coordinated movements required for efficient
translation, a ribosome contains four binding sites for RNA molecules: one is for the
mRNA and three (called the A site, the P site, and the E site) are for tRNAs (Figure
6-62). A tRNA molecule is held tightly at the A and P sites only if its anticodon
forms base pairs with a complementary codon (allowing for wobble) on the
mRNA molecule that is threaded through the ribosome (Figure 6-63). The A and
P sites are close enough together for their two tRNA molecules to be forced to
form base pairs with adjacent codons on the mRNA molecule. This feature of the
ribosome maintains the correct reading frame on the mRNA.

Figure 6-62 The RNA-binding sites in the ribosome. Each ribosome has one
binding site for mRNA and three binding sites for tRNA: the A, P, and E sites
(short for aminoacyl-tRNA, peptidyl-tRNA, and exit, respectively). (A) A
bacterial ribosome viewed with the small subunit in the front (dark green) and
the large subunit in the back (light green). Both the rRNAs and the ribosomal
proteins are illustrated. tRNAs are shown bound in the E site (red), the P site
(orange), and the A site (yellow). Although all three tRNA sites are shown
occupied here, during the process of protein synthesis not more than two of
these sites are thought to contain tRNA molecules at any one time (see Figure
6-64). (B) Large and small ribosomal subunits arranged as though the ribosome
in (A) were opened like a book. (C) The ribosome in (A) rotated through 90° and
viewed with the large subunit on top and small subunit on the bottom.
(D) Schematic representation of a ribosome [in the same orientation as (C)],
which will be used in subsequent figures. (A, B, and C, adapted from M.M.
Yusupov et al., Science 292:883-896, 2001. With permission from AAAS; courtesy
of Albion Baucom and Harry Noller.)

Once protein synthesis has been initiated, each new amino acid is added to the
elongating chain in a cycle of reactions containing four major steps: tRNA binding
(step 1), peptide bond formation (step 2), large subunit translocation (step 3), and
small subunit translocation (step 4). As a result of the two translocation steps, the
entire ribosome moves three nucleotides along the mRNA and is positioned to
start the next cycle. Figure 6-64 illustrates this four-step process, beginning at a
point at which three amino acids have already been linked together and there is a
tRNA molecule in the P site on the ribosome, covalently joined to the C-terminal
end of the short polypeptide. In step 1, a tRNA carrying the next amino acid in the
chain binds to the ribosomal A site by forming base pairs with the mRNA codon
positioned there, so that the P site and the A site contain adjacent bound tRNAs. In
step 2, the carboxyl end of the polypeptide chain is released from the tRNA at the
P site (by breakage of the high-energy bond between the tRNA and its amino acid)
and joined to the free amino group of the amino acid linked to the tRNA at the A
site, forming a new peptide bond. This central reaction of protein synthesis is
catalyzed by a peptidyl transferase contained in the large ribosomal subunit. In step
3, the large subunit moves relative to the mRNA held by the small subunit, thereby
shifting the acceptor stems of the two tRNAs to the E and P sites of the large
subunit. In step 4, another series of conformational changes moves the small subunit
and its bound mRNA exactly three nucleotides, ejecting the spent tRNA from the E
site and resetting the ribosome so it is ready to receive the next aminoacyl-tRNA.
Step 1 is then repeated with a new incoming aminoacyl-tRNA, and so on.

This four-step cycle is repeated each time an amino acid is added to the
polypeptide chain, as the chain grows from its amino to its carboxyl end.
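
The bookkeeping of the four-step cycle can be followed in a small Python
simulation. This is a sketch under several assumptions of ours: the first codon
is taken to sit in the P site already, the hybrid tRNA states of step 3 are not
modeled, elongation factors are ignored, and the example message is invented.
The function name elongate is hypothetical; the codon table is the standard
code.

```python
from itertools import product

BASES = "UCAG"
# Standard genetic code in UCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(triplet): aa
    for triplet, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def elongate(mrna):
    """Return (peptide, trace) for an mRNA whose first codon occupies the P site."""
    p_site_start = 0                   # index of the first base of the P-site codon
    peptide = CODON_TABLE[mrna[:3]]    # chain carried by the P-site tRNA
    trace = []
    while True:
        a_codon = mrna[p_site_start + 3:p_site_start + 6]
        if len(a_codon) < 3 or CODON_TABLE[a_codon] == "*":
            break                      # stop codon or end of message
        trace.append(("step 1: tRNA binding", a_codon, peptide))
        peptide += CODON_TABLE[a_codon]  # chain is transferred to the A-site tRNA
        trace.append(("step 2: peptide bond formation", a_codon, peptide))
        trace.append(("step 3: large subunit translocation", a_codon, peptide))
        p_site_start += 3              # mRNA advances by exactly one codon
        trace.append(("step 4: small subunit translocation", a_codon, peptide))
    return peptide, trace


peptide, trace = elongate("AUGUGGGCUUGCUAA")
for step, codon, chain in trace:
    print(step, codon, chain)
print("finished chain:", peptide)
```

Each pass through the loop adds one residue to the C-terminal end and moves the
reading position by three nucleotides, so the reading frame set at the start is
preserved throughout.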


Figure 6-63 The path of mRNA (blue) through the small ribosomal subunit. The
orientation is the same as that in the right-hand panel of Figure 6-62B.
(Courtesy of Harry F. Noller, based on data in G.Z. Yusupova et al., Cell
106:233-241, 2001. With permission from Elsevier.)

Elongation Factors Drive Translation Forward and Improve Its
Accuracy

The basic cycle of polypeptide elongation shown in outline in Figure 6-64 has an
additional feature that makes translation especially efficient and accurate. Two
elongation factors enter and leave the ribosome during each cycle, each
hydrolyzing GTP to GDP and undergoing conformational changes in the process. These
factors are called EF-Tu and EF-G in bacteria, and EF1 and EF2 in eukaryotes.
Under some conditions in vitro, ribosomes can be forced to synthesize proteins
without the aid of these elongation factors and GTP hydrolysis, but this
synthesis is very slow, inefficient, and inaccurate. Coupling the GTP hydrolysis-driven
changes in the elongation factors to transitions between different states of the
ribosome speeds up protein synthesis enormously. The cycles of elongation
factor association, GTP hydrolysis, and dissociation also ensure that all such changes
occur in the "forward" direction, helping translation to proceed efficiently (Figure
6-65).

Figure 6-64 Translating an mRNA molecule. Each amino acid added to
the growing end of a polypeptide chain is selected by complementary
base-pairing between the anticodon on its attached tRNA molecule and the next
codon on the mRNA chain. Because only one of the many types of tRNA
molecules in a cell can base-pair with each codon, the codon determines
the specific amino acid to be added to the growing polypeptide chain. The
four-step cycle shown is repeated over and over during the synthesis of a
protein. In step 1, an aminoacyl-tRNA molecule binds to a vacant A site on
the ribosome. In step 2, a new peptide bond is formed. In step 3, the large
subunit translocates relative to the small subunit, leaving the two tRNAs in
hybrid sites: P on the large subunit and A on the small, for one; E on the
large subunit and P on the small, for the other. In step 4, the small subunit
translocates carrying its mRNA a distance of three nucleotides through the
ribosome. This "resets" the ribosome with a fully empty A site, ready for the
next aminoacyl-tRNA molecule to bind. As indicated, the mRNA is translated
in the 5'-to-3' direction, and the N-terminal end of a protein is made first, with
each cycle adding one amino acid to the C-terminus of the polypeptide chain
(Movie 6.7 and Movie 6.8).

Figure 6-65 Detailed view of the translation cycle. The outline of
translation presented in Figure 6-64 has been expanded to show the roles
of the two elongation factors EF-Tu and EF-G, which drive translation in the
forward direction. As explained in the text, EF-Tu provides opportunities for
proofreading of the codon-anticodon match. In this way, incorrectly paired
tRNAs are selectively rejected, and the accuracy of translation is improved.
The binding of a molecule of EF-G to the ribosome and the subsequent
hydrolysis of GTP lead to a rearrangement of the ribosome structure, moving
the mRNA being decoded exactly three nucleotides through it (Movie 6.9).

In addition to moving translation forward, EF-Tu increases its accuracy. As we
discussed in Chapter 3, EF-Tu can simultaneously bind GTP and aminoacyl-tRNAs
(see Figures 3-72 and 3-73), and it is in this form that the initial
codon-anticodon interaction occurs in the A site of the ribosome. Because of the free-energy
change associated with base-pair formation, a correct codon-anticodon match
will bind more tightly than an incorrect interaction. However, this difference in
affinity is relatively modest and cannot by itself account for the high accuracy of
translation.

To increase the accuracy of this binding reaction, the ribosome and EF-Tu 
work together in the following ways. First, the 16S rRNA in the small subunit of 
the ribosome assesses the "correctness" of the codon-anticodon match by folding 
around it and probing its molecular details (Figure 6-66). When a correct match 
is found, the rRNA closes tightly around the codon-anticodon pair, causing a 
conformational change in the ribosome that triggers GTP hydrolysis by EF-Tu. 
Only when GTP is hydrolyzed does EF-Tu release its grip on the aminoacyl-tRNA 
and allow it to be used in protein synthesis. Incorrect codon-anticodon matches 
do not readily trigger this conformational change, and these errant tRNAs mostly 
fall off the ribosome before they can be used in protein synthesis. 
Proofreading, however, does not end here. 

After GTP is hydrolyzed and EF-Tu dissociates from the ribosome, there is a 
second opportunity for the ribosome to prevent an incorrect amino acid from 
being added to the growing chain. There is a short time delay as the amino acid 
carried by the tRNA moves into position on the ribosome. This time delay is 
shorter for correct than incorrect codon-anticodon pairs. Moreover, incorrectly 
matched tRNAs dissociate more rapidly than those correctly bound because their 
interaction with the codon is weaker. Thus, most incorrectly bound tRNA 
molecules (as well as a significant number of correctly bound molecules) will 
leave the ribosome without being used for protein synthesis. The two 
proofreading steps, acting in series, are largely responsible for the 99.99% 
accuracy of the ribosome in translating RNA into protein. 

Even if the wrong amino acid slips through the proofreading steps just 
described and is incorporated onto the growing polypeptide chain, there is still 
one more opportunity for the ribosome to detect the error and provide a 
solution, albeit one that is not, strictly speaking, proofreading. An incorrect 
codon-anticodon interaction in the P site of the ribosome (which would occur 
after the misincorporation) causes an increased rate of misreading in the A 
site. Successive rounds of amino acid misincorporation eventually lead to 
premature termination of the protein by release factors, which are described 
below. Normally, these release factors act when translation of a protein is 
complete; here, they act early. Although this mechanism does not correct the 
original error, it releases the flawed protein for degradation, ensuring that no 
additional peptide synthesis is wasted on it. 

Figure 6-66 Recognition of correct codon-anticodon matches by 
the small-subunit rRNA of the ribosome. Shown here is the interaction 
between a nucleotide of the small-subunit rRNA and the first nucleotide pair 
of a correctly paired codon-anticodon. Similar interactions form between 
other nucleotides of the rRNA and the second and third positions of the 
codon-anticodon pair. The small-subunit rRNA can form this network of hydrogen 
bonds only when an anticodon is correctly matched to a codon. As explained 
in the text, this codon-anticodon monitoring by the small-subunit rRNA 
increases the accuracy of protein synthesis. (From J.M. Ogle et al., Science 
292:897-902, 2001. With permission from AAAS.) 


Many Biological Processes Overcome the Inherent Limitations of 
Complementary Base-Pairing 

We have seen in this and the previous chapter that DNA replication, repair, 
transcription, and translation all rely on complementary base-pairing: G with C, 
and A with T (or U). However, if only the difference in hydrogen bonding is 
considered, a correct versus incorrect match should differ in affinity only by a 
factor of 10- to 100-fold. These processes have an accuracy much higher than can 
be accounted for by this difference. Although the mechanisms used to "squeeze 
out" additional specificity from complementary base-pairing differ from one 
process to the next, two principles exemplified by the ribosome appear to be 
general. 

The first is induced fit. We have seen that, before an amino acid is added to 
a growing polypeptide chain, the ribosome folds around the codon-anticodon 
interaction, and only when the match is correct is this folding completed and the 
reaction allowed to proceed. The codon-anticodon interaction is thereby 
checked twice: once by the initial complementary base-pairing and a second 
time by the folding of the ribosome, which depends on the correctness of the 
match. This same principle of induced fit is seen in transcription by RNA 
polymerase; here, an incoming nucleoside triphosphate initially forms a base 
pair with the template; at this point the enzyme folds around the base pair 
(thereby assessing its correctness) and, in doing so, creates the active site of 
the enzyme. The enzyme then covalently adds the nucleotide to the growing chain. 
Because their geometry is "wrong," incorrect base pairs block this induced fit, 
and they are therefore likely to dissociate before being incorporated into the 
growing chain. 

A second principle used to increase the specificity of complementary 
base-pairing is called kinetic proofreading. We have seen that after the initial 
codon-anticodon pairing and conformational change of the ribosome, GTP is 
hydrolyzed. This creates an irreversible step and starts the clock on a time 
delay during which the aminoacyl-tRNA moves into the proper position for 
catalysis. During this delay, those incorrect codon-anticodon pairs that have 
somehow slipped through the induced-fit scrutiny have a higher likelihood of 
dissociating than correct pairs. There are two reasons for this: (1) the 
interaction of the wrong tRNA with the codon is weaker, and (2) the delay is 
longer for incorrect than correct matches. 

In its most general form, kinetic proofreading refers to a time delay that begins 
with an irreversible step such as ATP or GTP hydrolysis, during which an incorrect 
substrate is more likely to dissociate than a correct one. In this case, kinetic 
proofreading increases the specificity of complementary base-pairing above what 
is possible from simple thermodynamic associations alone. The increase in 
specificity produced by kinetic proofreading comes at an energetic cost in the 
form of ATP or GTP hydrolysis. Kinetic proofreading is believed to operate in 
many biological processes, but its role is understood particularly well for 
translation. 
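
The arithmetic behind checkpoints acting in series can be sketched in a few lines of Python. This is an idealized illustration, not a model of measured ribosome kinetics: it assumes that each checkpoint independently lets an incorrect substrate through with a relative probability of 1/f, where the discrimination factor f is a hypothetical value taken from the 10- to 100-fold range quoted above, and the function name is ours.

```python
def error_rate(discrimination_factors):
    """Relative chance that a wrong substrate survives every checkpoint.

    A factor f means a wrong substrate is f-fold less likely than a correct
    one to pass that checkpoint (an idealized, independent-step assumption).
    """
    rate = 1.0
    for f in discrimination_factors:
        rate /= f
    return rate


# One checkpoint: base-pairing affinity alone (hypothetical f = 100).
one_step = error_rate([100])
# Two checkpoints: a second chance to reject after the irreversible step.
two_steps = error_rate([100, 100])

print(f"one checkpoint:  about 1 error in {1 / one_step:,.0f}")
print(f"two checkpoints: about 1 error in {1 / two_steps:,.0f}")
```

Under these assumptions a second checkpoint multiplies the discrimination rather than adding to it, which is why a modest difference in affinity can be compounded into an error rate on the order of 1 in 10^4.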

Accuracy in Translation Requires an Expenditure of Free Energy 

Translation by the ribosome is a compromise between the opposing constraints 
of accuracy and speed. We have seen, for example, that the accuracy of 
translation (1 mistake per 10^4 amino acids joined) requires time delays each 
time a new amino acid is added to a growing polypeptide chain, producing an 
overall speed of translation of 20 amino acids incorporated per second in 
bacteria. Mutant bacteria with a specific alteration in the small ribosomal 
subunit have longer delays and translate mRNA into protein with an accuracy 
considerably higher than this; however, protein synthesis is so slow in these 
mutants that the bacteria are barely able to survive. 
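
To give a feel for these two numbers, the short Python sketch below estimates how long a bacterial ribosome takes to make a protein of a given length and what fraction of the finished chains contain no mistake. The rate and error frequency come from the text; the assumption that errors at different positions are independent, and the example lengths, are ours.

```python
ELONGATION_RATE = 20  # amino acids per second in bacteria (from the text)
ERROR_RATE = 1e-4     # about 1 mistake per 10^4 amino acids (from the text)


def synthesis_time_seconds(length):
    """Time to polymerize `length` amino acids at a constant rate."""
    return length / ELONGATION_RATE


def fraction_error_free(length):
    """Fraction of chains with no misincorporation, assuming independent errors."""
    return (1 - ERROR_RATE) ** length


for n_residues in (100, 400, 1000):  # illustrative protein lengths
    print(
        n_residues,
        "residues:",
        synthesis_time_seconds(n_residues),
        "s,",
        round(100 * fraction_error_free(n_residues), 1),
        "% error-free",
    )
```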

We have also seen that attaining the observed accuracy of protein synthesis 
requires the expenditure of a great deal of free energy; this is expected, since, as 
discussed in Chapter 2, there is a price to be paid for any increase in order in the 
cell. In most cells, protein synthesis consumes more energy than any other 
biosynthetic process. At least four high-energy phosphate bonds are split to make 
each new peptide bond: two are consumed in charging a tRNA molecule with an amino 
acid (see Figure 6-54), and two more drive steps in the cycle of reactions 
occurring on the ribosome during protein synthesis itself (see Figure 6-65). In 
addition, extra energy is consumed each time that an incorrect amino acid linkage 
is hydrolyzed by a tRNA synthetase (see Figure 6-57) and each time that an 
incorrect tRNA enters the ribosome, triggers GTP hydrolysis, and is rejected (see 
Figure 6-65). To be effective, any proofreading mechanism must also allow an 
appreciable fraction of correct interactions to be removed; for this reason, 
proofreading is even more costly in energy than it might at first seem. 
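
The bookkeeping in this paragraph can be written out as a small calculation. The sketch below is only a lower-bound estimate: the four bonds per peptide bond come from the text, whereas the parameter `rejected_trnas_per_bond` and the assumption that each rejected tRNA costs exactly one extra GTP are hypothetical simplifications introduced for illustration.

```python
BONDS_FOR_CHARGING = 2  # charging a tRNA with its amino acid (from the text)
BONDS_ON_RIBOSOME = 2   # steps of the elongation cycle (from the text)


def min_phosphate_bonds(n_residues, rejected_trnas_per_bond=0.0):
    """Rough lower bound on high-energy phosphate bonds split per protein.

    rejected_trnas_per_bond is a hypothetical average number of tRNAs that
    trigger GTP hydrolysis and are then rejected, per peptide bond formed;
    each is assumed to cost one extra GTP.
    """
    peptide_bonds = n_residues - 1
    baseline = peptide_bonds * (BONDS_FOR_CHARGING + BONDS_ON_RIBOSOME)
    proofreading = peptide_bonds * rejected_trnas_per_bond
    return baseline + proofreading


print(min_phosphate_bonds(301))       # baseline cost only
print(min_phosphate_bonds(301, 0.5))  # with a made-up proofreading overhead
```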

The Ribosome Is a Ribozyme 

The ribosome is a large complex composed of two-thirds RNA and one-third 
protein. The determination, in 2000, of the entire three-dimensional conformation 
of its large and small subunits is a major triumph of modern structural biology. 
The findings confirm earlier evidence that rRNAs, and not proteins, are 
responsible for the ribosome's overall structure, its ability to position tRNAs 
on the mRNA, and its catalytic activity in forming covalent peptide bonds. The 
ribosomal RNAs are folded into highly compact, precise three-dimensional 
structures that form the compact core of the ribosome and determine its overall 
shape (Figure 6-67). 

In marked contrast to the central positions of the rRNAs, the ribosomal 
proteins are generally located on the surface and fill in the gaps and crevices 
of the folded RNA (Figure 6-68). Some of these proteins send out extended regions 
of polypeptide chain that penetrate short distances into holes in the RNA core 
(Figure 6-69). The main role of the ribosomal proteins seems to be to stabilize the 


Figure 6-67 Structure of the rRNAs in the large subunit of a bacterial 
ribosome, as determined by x-ray crystallography. (A) Three-dimensional 
conformations of the large-subunit rRNAs (5S and 23S) as they appear in the 
ribosome. One of the protein subunits of the ribosome (L1) is also shown as a 
reference point, since it forms a characteristic protrusion on the ribosome. 
(B) Schematic diagram of the secondary structure of the 23S rRNA, showing the 
extensive network of base-pairing. The structure has been divided into six 
"domains" whose colors correspond to those in (A). The secondary-structure 
diagram is highly schematized to represent as much of the structure as possible 
in two dimensions. To do this, several discontinuities in the RNA chain have 
been introduced, although in reality the 23S rRNA is a single RNA molecule. For 
example, the base of Domain III is continuous with the base of Domain IV even 
though a gap appears in the diagram. (Adapted from N. Ban et al., Science 
289:905-920, 2000. With permission from AAAS.) 

RNA core, while permitting the changes in rRNA conformation that are necessary 
for this RNA to catalyze efficient protein synthesis. The proteins also aid in the 
initial assembly of the rRNAs that make up the core of the ribosome. 

Not only are the A, P, and E binding sites for tRNAs formed primarily by 
ribosomal RNAs, but the catalytic site for peptide bond formation is also formed 
by RNA, as the nearest amino acid is located more than 1.8 nm away. This discovery 
came as a surprise to biologists because, unlike proteins, RNA does not contain 
easily ionizable functional groups that can be used to catalyze sophisticated 
reactions like peptide bond formation. Moreover, metal ions, which are often used 
by RNA molecules to catalyze chemical reactions (as discussed later in the 
chapter), were not observed at the active site of the ribosome. Instead, it is 
believed that the 23S rRNA forms a highly structured pocket that, through a 
network of hydrogen bonds, precisely orients the two reactants (the growing 
peptide chain and an aminoacyl-tRNA) and thereby greatly accelerates their 
covalent joining. An additional surprise came from the discovery that the tRNA in 
the P site contributes an important OH group to the active site and participates 
directly in the catalysis. This mechanism may ensure that catalysis occurs only 
when the P site tRNA is properly positioned in the ribosome. 

RNA molecules that possess catalytic activity are known as ribozymes. We saw 
earlier in this chapter that some ribozymes function in self-splicing reactions. In 
the final section of this chapter, we consider what the ability of RNA molecules to 
function as catalysts might mean for the early evolution of living cells. For now, 
we merely note that there is good reason to suspect that RNA rather than protein 
molecules served as the first catalysts for living cells. If so, the ribosome, with its 
RNA core, may be a relic of an earlier time in life's history, when protein 
synthesis evolved in cells that were run almost entirely by ribozymes. 

Nucleotide Sequences in mRNA Signal Where to Start Protein 
Synthesis 

The initiation and termination of translation share features of the translation 
elongation cycle described above. The site at which protein synthesis begins on 
the mRNA is especially crucial, since it sets the reading frame for the whole length 
of the message. An error of one nucleotide either way at this stage would cause 
every subsequent codon in the message to be misread, resulting in a nonfunctional 
protein with a garbled sequence of amino acids. The initiation step is also 
important because for most genes it is the last point at which the cell can decide 
whether the mRNA is to be translated to produce a protein. The rate of this step is 
thus one determinant of the rate at which any particular protein will be 
synthesized. We shall see in Chapter 7 how regulation of this step occurs. 
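
The effect of starting one nucleotide too early or too late can be seen by translating the same message in each of its three reading frames. The Python sketch below builds the standard genetic code table and applies it to a short, made-up mRNA; the sequence and the function name `translate` are illustrative only.

```python
BASES = "UCAG"
# Standard genetic code, codons ordered UUU, UUC, UUA, UUG, UCU, ...; '*' = stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(mrna, start):
    """Read codons from index `start` until a stop codon or the end of the mRNA."""
    protein = []
    for pos in range(start, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


mrna = "GCAUGGCUGAAUUCAAGUAAGC"  # made-up message; the AUG begins at index 2

print(translate(mrna, 2))  # correct frame         -> MAEFK
print(translate(mrna, 3))  # one nucleotide late   -> WLNSSK
print(translate(mrna, 1))  # one nucleotide early  -> HG
```

A shift of a single nucleotide leaves none of the original amino acids in place, and it can also expose an early stop codon, as the third line shows.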

The translation of an mRNA begins with the codon AUG, and a special tRNA 
is required to start translation. This initiator tRNA always carries the amino acid 
methionine (in bacteria, a modified form of methionine, formylmethionine, is 
used), with the result that all newly made proteins have methionine as the first 
amino acid at their N-terminus, the end of a protein that is synthesized first. (This 
methionine is usually removed later by a specific protease.) The initiator tRNA 
is specially recognized by initiation factors because it has a nucleotide sequence 
distinct from that of the tRNA that normally carries methionine. 

In eukaryotes, the initiator tRNA-methionine complex (Met-tRNAi) is first 
loaded into the small ribosomal subunit along with additional proteins called 
eukaryotic initiation factors, or eIFs. Of all the aminoacyl-tRNAs in the cell, only 
the methionine-charged initiator tRNA is capable of tightly binding the small 
ribosome subunit without the complete ribosome being present, and unlike other 
tRNAs it binds directly to the P site (Figure 6-70). Next, the small ribosomal 
subunit binds to the 5' end of an mRNA molecule, which is recognized by virtue of 
its 5' cap that has previously bound two initiation factors, eIF4E and eIF4G (see 
Figure 6-38). The small ribosomal subunit then moves forward (5' to 3') along the 
mRNA, searching for the first AUG; additional initiation factors that act as 
ATP-powered 



Figure 6-68 Location of the protein components of the bacterial large 
ribosomal subunit. The rRNAs (5S and 23S) are shown in blue and the proteins 
of the large subunit in green. This view is toward the outside of the ribosome; 
the interface with the small subunit is on the opposite face. (PDB code: 1FFK.) 



Figure 6-69 Structure of the L15 protein in the large subunit of the bacterial 
ribosome. The globular domain of the protein lies on the surface of the ribosome 
and an extended region penetrates deeply into the RNA core of the ribosome. The 
L15 protein is shown in green and a portion of the ribosomal RNA core is shown in 
blue. (From D. Klein, P.B. Moore and T.A. Steitz, J. Mol. Biol. 340:141-177, 
2004. With permission from Academic Press. PDB code: 1S72.) 

Figure 6-70 The initiation of protein synthesis in eukaryotes. Only three 
of the many translation initiation factors required for this process are shown. 
Efficient translation initiation also requires the poly-A tail of the mRNA bound 
by poly-A-binding proteins, which, in turn, interact with eIF4G (see Figure 
6-38). In this way, the translation apparatus ascertains that both ends of 
the mRNA are intact before initiating protein synthesis. Although only one 
GTP-hydrolysis event is shown in the figure, a second is known to occur 
just before the large and small ribosomal subunits join. In the last two steps 
shown in the figure, the ribosome has begun the standard elongation cycle, 
depicted in Figure 6-64. 


helicases facilitate this movement. In 90% of mRNAs, translation begins at the first 
AUG encountered by the small subunit. At this point, the initiation factors 
dissociate, allowing the large ribosomal subunit to assemble with the complex and 
complete the ribosome. The initiator tRNA remains at the P site, leaving the A site 
vacant. Protein synthesis is therefore ready to begin (see Figure 6-70). 

The nucleotides immediately surrounding the start site in eukaryotic mRNAs 
influence the efficiency of AUG recognition during the above scanning process. If 
this recognition site differs substantially from the consensus recognition sequence 
(5'-ACCAUGG-3'), scanning ribosomal subunits will sometimes ignore the first 
AUG codon in the mRNA and skip to the second or third AUG codon instead. Cells 
frequently use this phenomenon, known as "leaky scanning," to produce two or 
more proteins, differing in their N-termini, from the same mRNA molecule. This 
mechanism allows some genes to produce the same protein with and without 
a signal sequence attached at its N-terminus, for example, so that the protein is 
directed to two different compartments in the cell. 
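
One way to picture the scanning process is to walk along an mRNA from its 5' end, note each AUG, and count how many of the flanking nucleotides agree with the consensus. The Python sketch below does this for a made-up sequence. Counting matches is a deliberately crude stand-in for recognition efficiency: the scoring scheme, the function name, and the example mRNA are assumptions for illustration, not a description of how the initiation factors actually weigh each position.

```python
CONSENSUS = "ACCAUGG"  # recognition sequence quoted in the text
AUG_OFFSET = 3         # index of the A of AUG within CONSENSUS


def aug_contexts(mrna):
    """Return (position, matches) for every AUG, in 5'-to-3' order.

    `matches` counts the four flanking bases (three upstream, one downstream)
    that agree with CONSENSUS; this score is a hypothetical simplification.
    """
    hits = []
    for pos in range(len(mrna) - 2):
        if mrna[pos:pos + 3] != "AUG":
            continue
        matches = 0
        for offset, base in enumerate(CONSENSUS):
            if AUG_OFFSET <= offset < AUG_OFFSET + 3:
                continue  # skip the AUG itself
            i = pos - AUG_OFFSET + offset
            if 0 <= i < len(mrna) and mrna[i] == base:
                matches += 1
        hits.append((pos, matches))
    return hits


mrna = "GGUUUAUGCCCACCAUGGCC"  # made-up 5' region containing two AUG codons
print(aug_contexts(mrna))      # -> [(5, 0), (14, 4)]
```

In this example the first AUG sits in a context that matches the consensus at none of the flanking positions, so it is the kind of site a scanning subunit might sometimes pass over in favor of the second AUG, whose context matches at all four.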

The mechanism for selecting a start codon in bacteria is different. Bacterial 
mRNAs have no 5' caps to signal the ribosome where to begin searching for 
the start of translation. Instead, each bacterial mRNA contains a specific 
ribosome-binding site (called the Shine-Dalgarno sequence, named after its 
discoverers) that is located a few nucleotides upstream of the AUG at which 
translation is to begin. This nucleotide sequence, with the consensus 
5'-AGGAGGU-3', forms base pairs with the 16S rRNA of the small ribosomal subunit 
to position the initiating AUG codon in the ribosome. A set of translation 
initiation factors orchestrates this interaction, as well as the subsequent 
assembly of the large ribosomal subunit to complete the ribosome. 

Unlike a eukaryotic ribosome, a bacterial ribosome can readily assemble 
directly on a start codon that lies in the interior of an mRNA molecule, so long 
as a ribosome-binding site precedes it by several nucleotides. As a result, 
bacterial mRNAs are often polycistronic; that is, they encode several different 
proteins, each of which is translated from the same mRNA molecule (Figure 6-71). 
In contrast, a eukaryotic mRNA generally encodes only a single protein, or more 
accurately, a single set of closely related proteins. 
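
The bacterial rule can be sketched as a search for AUG codons that have a ribosome-binding site a short distance upstream. In the Python example below, the exact-match requirement for the core AGGAGG and the spacing window of 3 to 10 nucleotides are hypothetical simplifications chosen for illustration (real sites vary around the consensus), and the polycistronic mRNA is made up.

```python
SD_CORE = "AGGAGG"        # core of the consensus 5'-AGGAGGU-3' from the text
MIN_GAP, MAX_GAP = 3, 10  # hypothetical spacing window, in nucleotides


def bacterial_start_sites(mrna):
    """Positions of AUGs preceded by SD_CORE within the allowed spacing."""
    starts = []
    for aug in range(len(mrna) - 2):
        if mrna[aug:aug + 3] != "AUG":
            continue
        for gap in range(MIN_GAP, MAX_GAP + 1):
            sd_start = aug - gap - len(SD_CORE)
            if sd_start >= 0 and mrna[sd_start:sd_start + len(SD_CORE)] == SD_CORE:
                starts.append(aug)
                break
    return starts


mrna = (
    "UUAGGAGGUAACC"  # leader containing a ribosome-binding site
    "AUGGCUUAA"      # first coding region
    "CCAUGCC"        # an AUG with no ribosome-binding site upstream
    "AGGAGGUUUUCC"   # second, internal ribosome-binding site
    "AUGAAAUAG"      # second coding region
)
print(bacterial_start_sites(mrna))  # -> [13, 41]
```

Both coding regions are found, including the one in the interior of the message, while the AUG that lacks an upstream ribosome-binding site is ignored.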

Stop Codons Mark the End of Translation 

The end of the protein-coding message is signaled by the presence of one of three 
stop codons (UAA, UAG, or UGA) (see Figure 6-48). These are not recognized by 
a tRNA and do not specify an amino acid, but instead signal to the ribosome to 
stop translation. Proteins known as release factors bind to any ribosome with a 
stop codon positioned in the A site, forcing the peptidyl transferase in the 
ribosome to catalyze the addition of a water molecule instead of an amino acid to 
the peptidyl-tRNA (Figure 6-72). This reaction frees the carboxyl end of the 
growing polypeptide chain from its attachment to a tRNA molecule, and since only 
this attachment normally holds the growing polypeptide to the ribosome, the 
completed protein chain is immediately released into the cytoplasm. The ribosome 
then releases its bound mRNA molecule and separates into the large and small 
subunits. These subunits can then assemble on this or another mRNA molecule 
to begin a new round of protein synthesis. 

Figure 6-71 Structure of a typical bacterial mRNA molecule. Unlike eukaryotic ribosomes, 
which typically require a capped 5' end on the mRNA, prokaryotic ribosomes initiate translation 
at ribosome-binding sites (Shine-Dalgarno sequences), which can be located anywhere along an 
mRNA molecule. This property of their ribosomes permits bacteria to synthesize more than one 
type of protein from a single mRNA molecule. 


During translation, the nascent polypeptide moves through a large, water-filled 
tunnel (approximately 10 nm × 1.5 nm) in the large subunit of the ribosome. The 
walls of this tunnel, made primarily of 23S rRNA, are a patchwork of tiny 
hydrophobic surfaces embedded in a more extensive hydrophilic surface. This 
structure is not complementary to any peptide, and thus provides a "Teflon" 
coating through which a polypeptide chain can easily slide. The dimensions of the 
tunnel suggest that nascent proteins are largely unstructured as they pass 
through the ribosome, although some α-helical regions of the protein can form 
before leaving the ribosome tunnel. As it leaves the ribosome, a newly 
synthesized protein must fold into its proper three-dimensional conformation to 
be useful to the cell. Later in this chapter we discuss how this folding occurs. 
First, however, we describe several additional aspects of the translation process 
itself. 

Proteins Are Made on Polyribosomes 

The synthesis of most protein molecules takes between 20 seconds and several 
minutes. During this very short period, however, it is usual for multiple initiations 
to take place on each mRNA molecule being translated. As soon as the preceding 
ribosome has translated enough of the nucleotide sequence to move out of the 
way, the 5' end of the mRNA is threaded into a new ribosome. The mRNA molecules 
being translated are therefore usually found in the form of polyribosomes (or 
polysomes): large cytoplasmic assemblies made up of several ribosomes spaced 
as close as 80 nucleotides apart along a single mRNA molecule (Figure 6-73). 
These multiple initiations allow the cell to make many more protein molecules in 
a given time than would be possible if each protein had to be completed before 
the next could start. 
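
The gain from multiple initiations can be estimated with simple arithmetic. The Python sketch below takes the 80-nucleotide minimum spacing from the text and, as an assumption made only for illustration, a constant ribosome speed of 60 nucleotides per second (the bacterial rate of 20 amino acids per second quoted earlier); the coding length is made up.

```python
MIN_SPACING_NT = 80  # closest ribosome spacing along the mRNA (from the text)


def max_ribosomes(coding_length_nt):
    """Upper limit on ribosomes that fit on the coding region at once."""
    return max(1, coding_length_nt // MIN_SPACING_NT)


def proteins_per_minute(coding_length_nt, nt_per_second, ribosomes):
    """Steady-state output when `ribosomes` translate the same mRNA at once."""
    seconds_per_protein = coding_length_nt / nt_per_second
    return 60 * ribosomes / seconds_per_protein


coding_length = 1200  # nucleotides, i.e. a 400-amino-acid protein (made up)
speed = 60            # nucleotides per second (assumed, see above)

print(proteins_per_minute(coding_length, speed, 1))
print(proteins_per_minute(coding_length, speed, max_ribosomes(coding_length)))
```

With these numbers a fully loaded polysome turns out protein roughly fifteen times faster than a single ribosome would, because a completed chain is released every time a ribosome advances by one spacing interval.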

Both bacteria and eukaryotes use polysomes, and both employ additional 
strategies to speed up the overall rate of protein synthesis. Because bacterial 
mRNA does not need to be processed and is accessible to ribosomes while it is 
being made, ribosomes can attach to the free end of a bacterial mRNA molecule 
and start translating it even before the transcription of that RNA is complete, 
following closely behind the RNA polymerase as it moves along DNA. In eukaryotes, 
as we have seen, the 5' and 3' ends of the mRNA interact (see Figure 6-73A); 
therefore, as soon as a ribosome dissociates, its two subunits are in an optimal 
position to reinitiate translation on the same mRNA molecule. 

There Are Minor Variations in the Standard Genetic Code 

As discussed in Chapter 1, the genetic code (shown in Figure 6-48) applies to 
all three major branches of life, providing important evidence for the common 
ancestry of all life on Earth. Although rare, there are exceptions to this code. For 
example, Candida albicans, the most prevalent human fungal pathogen, 
translates the codon CUG as serine, whereas nearly all other organisms translate 
it as leucine. Mitochondria (which have their own genomes and encode much of 
their translational apparatus) often deviate from the standard code. For example, 
in mammalian mitochondria AUA is translated as methionine, whereas in the 






Figure 6-72 The final phase of protein synthesis. The binding of a release 
factor to an A site bearing a stop codon terminates translation. The completed 
polypeptide is released and, in a series of reactions that requires additional 
proteins and GTP hydrolysis (not shown), the ribosome dissociates into its two 
separate subunits. 

Figure 6-73 A polyribosome. (A) Schematic drawing showing how a series of 
ribosomes can simultaneously translate the same eukaryotic mRNA molecule. 
(B) Electron micrograph of a polyribosome from a eukaryotic cell (Movie 6.10). 
(B, courtesy of John Heuser.) 


cytosol of the cell it is translated as isoleucine (see Table 14-3, p. 805). This type of 
deviation in the genetic code is "hardwired" into the organisms or the organelles 
in which it occurs. 

A different type of variation, sometimes called translation recoding, occurs 
in many cells. In this case, other nucleotide sequence information present in an 
mRNA can change the meaning of the genetic code at a particular site in the mRNA 
molecule. The standard code allows cells to manufacture proteins using only 20 
amino acids. However, bacteria, archaea, and eukaryotes have available to them a 
twenty-first amino acid that can be incorporated directly into a growing 
polypeptide chain through translation recoding. Selenocysteine, which is 
essential for the efficient function of a variety of enzymes, contains a selenium 
atom in place of the sulfur atom of cysteine. Selenocysteine is enzymatically 
produced from a serine attached to a special tRNA molecule that base-pairs with 
the UGA codon, a codon normally used to signal a translation stop. The mRNAs for 
proteins in which selenocysteine is to be inserted at a UGA codon carry an 
additional nearby nucleotide sequence in the mRNA that triggers this recoding 
event (Figure 6-74). 
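
Both kinds of variation, the "hardwired" deviations and site-specific recoding, can be illustrated with a small translation function in Python. The overrides below contain only the two reassignments mentioned in the text and are not complete code tables for those organisms or organelles. The recoding signal is represented simply as a set of UGA positions supplied by the caller, a hypothetical stand-in for the RNA structure that a real mRNA carries; the example message is made up, and "U" is the one-letter symbol for selenocysteine.

```python
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD_CODE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Only the reassignments named in the text; real variant tables contain more.
CANDIDA_ALBICANS = {"CUG": "S"}
MAMMALIAN_MITOCHONDRIA = {"AUA": "M"}


def translate(mrna, overrides=None, sec_positions=()):
    """Translate from index 0; UGA codons at `sec_positions` are read as Sec."""
    table = {**STANDARD_CODE, **(overrides or {})}
    protein = []
    for pos in range(0, len(mrna) - 2, 3):
        codon = mrna[pos:pos + 3]
        if codon == "UGA" and pos in sec_positions:
            protein.append("U")  # selenocysteine inserted by recoding
            continue
        amino_acid = table[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


mrna = "AUGCUGAUAUGAGGCUAA"  # made-up message: AUG CUG AUA UGA GGC UAA

print(translate(mrna))                                    # -> MLI
print(translate(mrna, overrides=CANDIDA_ALBICANS))        # -> MSI
print(translate(mrna, overrides=MAMMALIAN_MITOCHONDRIA))  # -> MLM
print(translate(mrna, sec_positions={9}))                 # -> MLIUG
```

In the last call the UGA at position 9 no longer terminates the chain, and translation continues until the conventional stop codon UAA.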

Figure 6-74 Incorporation of selenocysteine into a growing polypeptide chain. A specialized tRNA is charged with serine by the normal seryl- 
tRNA synthetase, and the serine is subsequently converted enzymatically to selenocysteine. A specific RNA structure in the mRNA (a stem and loop 
structure with a particular nucleotide sequence) signals that selenocysteine is to be inserted at the neighboring UGA codon. As indicated, this event 
requires the participation of a selenocysteine-specific translation factor. After the addition of selenocysteine, translation continues until a conventional 
stop codon is encountered. 

Inhibitors of Prokaryotic Protein Synthesis Are Useful as Antibiotics 

Many of the most effective antibiotics used in modern medicine are compounds 
made by fungi that inhibit bacterial protein synthesis. Fungi and bacteria 
compete for many of the same environmental niches, and millions of years of 
coevolution have resulted in fungi producing potent bacterial inhibitors. Some of 
these drugs exploit the structural and functional differences between bacterial 
and eukaryotic ribosomes so as to interfere preferentially with the function of 
bacterial ribosomes. Thus, humans can take high dosages of some of these 
compounds without undue toxicity. Many antibiotics lodge in pockets in the 
ribosomal RNAs and simply interfere with the smooth operation of the ribosome; 
others block specific parts of the ribosome such as the exit channel (Figure 
6-75). Table 6-4 lists some common antibiotics of this kind along with several 
other inhibitors of protein synthesis, some of which act on eukaryotic cells and 
therefore cannot be used as antibiotics. 

Because they block specific steps in the processes that lead from DNA to 
protein, many of the compounds listed in Table 6-4 are useful for cell biological 
studies. Among the most commonly used drugs in such investigations are 
chloramphenicol, cycloheximide, and puromycin, all of which specifically inhibit 
protein synthesis. In a eukaryotic cell, for example, chloramphenicol inhibits 
protein synthesis on ribosomes only in mitochondria (and in chloroplasts in 
plants), presumably reflecting the prokaryotic origins of these organelles 
(discussed in Chapter 14). Cycloheximide, in contrast, affects only ribosomes in 
the cytosol. Puromycin is especially interesting because it is a structural 
analog of a tRNA molecule linked to an amino acid and is therefore another 
example of molecular mimicry; the ribosome mistakes it for an authentic amino 
acid and covalently incorporates it at the C-terminus of the growing peptide 
chain, thereby causing the premature termination and release of the polypeptide. 
As might be expected, puromycin inhibits protein synthesis in both prokaryotes 
and eukaryotes. 

Quality Control Mechanisms Act to Prevent Translation of 
Damaged mRNAs 

In eukaryotes, mRNA production involves both transcription and a series of 
elaborate RNA processing steps; as we have seen, these take place in the nucleus, 
segregated from ribosomes, and only when the processing is complete are the 
mRNAs transported to the cytosol to be translated (see Figure 6-38). However, 
this scheme is not foolproof, and some incorrectly processed mRNAs are 
inadvertently sent to the cytosol. In addition, mRNAs that were flawless when they 
left the nucleus can become broken or otherwise damaged in the cytosol. The 
danger of 

Figure 6-75 Binding sites for antibiotics on the bacterial ribosome. The small 
(left) and large (right) subunits of the ribosome are arranged as though the 
ribosome has been opened like a book. Antibiotic binding sites are marked with 
colored spheres, and the bound tRNA molecules are shown in purple (see Figure 
6-62). Most of the antibiotics shown bind directly to pockets formed by the 
ribosomal RNA molecules. Hygromycin B induces errors in translation, 
spectinomycin blocks the translocation of the peptidyl-tRNA from the A site to 
the P site, and streptogramin B prevents elongation of nascent peptides. Table 
6-4 lists the inhibitory mechanisms of the other antibiotics shown in the 
figure. (Adapted from J. Poehlsgaard and S. Douthwaite, Nat. Rev. Microbiol. 
3:870-881, 2005. With permission from Macmillan Publishers Ltd.) 

Table 6-4 Inhibitors of Protein or RNA Synthesis 

Inhibitor         Specific effect 

Acting only on bacteria 
Tetracycline      Blocks binding of aminoacyl-tRNA to the A site of ribosome 
Streptomycin      Prevents the transition from translation initiation to chain elongation and also causes miscoding 
Chloramphenicol   Blocks the peptidyl transferase reaction on ribosomes (step 2 in Figure 6-64) 
Erythromycin      Binds in the exit channel of the ribosome and thereby inhibits elongation of the peptide chain 
Rifamycin         Blocks initiation of RNA chains by binding to RNA polymerase (prevents RNA synthesis) 

Acting on bacteria and eukaryotes 
Puromycin         Causes the premature release of nascent polypeptide chains by its addition to the growing chain end 
Actinomycin D     Binds to DNA and blocks the movement of RNA polymerase (prevents RNA synthesis) 

Acting on eukaryotes but not bacteria 
Cycloheximide     Blocks the translocation reaction on ribosomes (step 3 in Figure 6-64) 
Anisomycin        Blocks the peptidyl transferase reaction on ribosomes (step 2 in Figure 6-64) 
α-Amanitin        Blocks mRNA synthesis by binding preferentially to RNA polymerase II 

The ribosomes of eukaryotic mitochondria (and chloroplasts) often resemble those of bacteria in their sensitivity to inhibitors. Therefore, some of 
these antibiotics can have a deleterious effect on human mitochondria. 


translating damaged or incompletely processed mRNAs (which would produce 
truncated or otherwise aberrant proteins) is apparently so great that the cell has 
several backup measures to prevent this from happening. To avoid translating
broken mRNAs, for example, the 5' cap and the poly-A tail are both recognized by
the translation-initiation machinery before translation begins (see Figure 6-70). 

The most powerful mRNA surveillance system, called nonsense-mediated
mRNA decay, eliminates defective mRNAs before they move away from the
nucleus. This mechanism is brought into play when the cell determines that an
mRNA molecule has a nonsense (stop) codon (UAA, UAG, or UGA) in the "wrong"
place. This situation is likely to arise in an mRNA molecule that has been
improperly spliced, because aberrant splicing will usually result in the random
introduction of a nonsense codon into the reading frame of the mRNA, especially
in organisms, such as humans, that have a large average intron size (see Figure
6-31B).

The nonsense-mediated mRNA decay mechanism begins as an mRNA molecule
is being transported from the nucleus to the cytosol. As its 5' end emerges
from a nuclear pore, the mRNA is met by a ribosome, which begins to translate
it. As translation proceeds, the exon junction complexes (EJCs) that are bound
to the mRNA at each splice site are displaced by the moving ribosome. The normal
stop codon will lie within the last exon, so by the time the ribosome reaches
it and stalls, no more EJCs will be bound to the mRNA. In this case, the mRNA
"passes inspection" and is released to the cytosol where it can be translated in
earnest (Figure 6-76). However, if the ribosome reaches a stop codon earlier, when
EJCs remain bound, the mRNA molecule is rapidly degraded. In this way, the first
round of translation allows the cell to test the fitness of each mRNA molecule as it
exits the nucleus.
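
The inspection rule described in this paragraph can be written as a small Python
sketch. It is a toy model under stated assumptions, not a description of the molecular
machinery: the function names, the representation of each EJC as a nucleotide index on
the mRNA, and the example sequences are all hypothetical and were introduced here only
for illustration.

```python
# Toy model of the "test round" of translation in nonsense-mediated decay.
# Assumption: an EJC is represented by the index of the nucleotide it sits on.

STOP_CODONS = {"UAA", "UAG", "UGA"}


def first_in_frame_stop(mrna, start):
    """Return the index of the first in-frame stop codon at or after `start`.

    The sequence is read three nucleotides at a time from `start`, so stop
    triplets that lie in another reading frame are never seen.
    Returns None if no in-frame stop codon is found.
    """
    for i in range(start, len(mrna) - 2, 3):
        if mrna[i:i + 3] in STOP_CODONS:
            return i
    return None


def triggers_nmd(mrna, start, ejc_positions):
    """Return True if at least one EJC is still bound when the ribosome stalls.

    The ribosome strips every EJC it passes; an EJC downstream of the first
    in-frame stop codon therefore remains bound and marks the mRNA for decay.
    """
    stop = first_in_frame_stop(mrna, start)
    if stop is None:
        return False  # no stop codon at all: outside the scope of this sketch
    return any(pos > stop + 2 for pos in ejc_positions)


# Hypothetical sequences, each with a single EJC at index 6.
normal = "AUGGCUGGGCCCUGA"        # stop codon in the last position
aberrant = "AUGUAAGGGCCCUGA"      # premature in-frame stop at index 3
out_of_frame = "AUGGUAAGGCCCUGA"  # UAA at index 4 is not in frame

print(triggers_nmd(normal, 0, [6]))        # False
print(triggers_nmd(aberrant, 0, [6]))      # True
print(triggers_nmd(out_of_frame, 0, [6]))  # False
```

The third example shows why only stop codons in the protein's own reading frame
matter in this model: the ribosome never reads the out-of-frame UAA as a codon.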

Nonsense-mediated decay may have been especially important in evolution,
allowing eukaryotic cells to more easily explore new genes formed by DNA
rearrangements, mutations, or alternative patterns of splicing, by selecting only those
mRNAs for translation that can produce a full-length protein. Nonsense-mediated
decay is also important in cells of the developing immune system, where
the extensive DNA rearrangements that occur (see Figure 24-28) often generate
premature termination codons. The surveillance system degrades the mRNAs
produced from such rearranged genes, thereby avoiding the potential toxic effects
of truncated proteins.

Figure 6-76 Nonsense-mediated mRNA decay. As shown on the right, the failure to correctly splice a pre-mRNA often
introduces a premature stop codon into the reading frame for the protein. These abnormal mRNAs are destroyed by the
nonsense-mediated decay mechanism. To activate this mechanism, an mRNA molecule, bearing exon junction complexes
(EJCs) to mark successfully completed splices, is first met by a ribosome that performs a "test" round of translation. As the
mRNA passes through the tight channel of the ribosome, the EJCs are stripped off, and successful mRNAs are released to
undergo multiple rounds of translation (left side). However, if an in-frame stop codon is encountered before the final EJC is
reached (right side), the mRNA undergoes nonsense-mediated decay, which is triggered by the Upf proteins (green) that bind to
each EJC. Note that this mechanism ensures that nonsense-mediated decay is triggered only when the premature stop codon
is in the same reading frame as that of the normal protein. (Adapted from J. Lykke-Andersen et al., Cell 103:1121-1131, 2000.
With permission from Elsevier.)

The nonsense-mediated surveillance pathway also plays an important role in
mitigating the symptoms of many inherited human diseases. As we have seen,
inherited diseases are usually caused by mutations that spoil the function of a
key protein, such as hemoglobin or one of the blood-clotting factors.
Approximately one-third of all genetic disorders in humans result from nonsense
mutations or mutations (such as frameshift mutations or splice-site mutations) that
place nonsense mutations into the gene's reading frame. In individuals that carry
one mutant and one functional gene, nonsense-mediated decay eliminates the
aberrant mRNA and thereby prevents a potentially toxic protein from being made.
Without this safeguard, individuals with one functional and one mutant "disease
gene" would likely suffer much more severe symptoms.

Some Proteins Begin to Fold While Still Being Synthesized 

The process of gene expression is not over when the genetic code has been used to 
create the sequence of amino acids that constitutes a protein. To be useful to the 
cell, this new polypeptide chain must fold up into its unique three-dimensional 
conformation, bind any small-molecule cofactors required for its activity, be 
appropriately modified by protein kinases or other protein-modifying enzymes, 
and assemble correctly with the other protein subunits with which it functions 
(Figure 6-77). 

The information needed for all of the steps listed above is ultimately contained
in the sequence of amino acids that the ribosome produces when it translates
an mRNA molecule into a polypeptide chain. As discussed in Chapter 3, when a
protein folds into a compact structure, it buries most of its hydrophobic residues
in an interior core. In addition, large numbers of noncovalent interactions form
between various parts of the molecule. It is the sum of all of these energetically
favorable arrangements that determines the final folding pattern of the polypeptide
chain, as the conformation of lowest free energy (see pp. 114-115).

Figure 6-77 Steps in the creation of a functional protein. As indicated,
translation of an mRNA sequence into an amino acid sequence on the
ribosome is not the end of the process of forming a protein. To function, the
completed polypeptide chain must fold correctly into its three-dimensional
conformation, bind any cofactors required, and assemble with its partner
protein chains, if any. Noncovalent bond formation drives these changes.
As indicated, many proteins also require covalent modifications of selected
amino acids. Although the most frequent modifications are protein
glycosylation and protein phosphorylation, over 200 different types of
covalent modifications are known (see pp. 165-166).

Through many millions of years of evolution, the amino acid sequence of each
protein has been selected not only for the conformation that it adopts but also
for an ability to fold rapidly. For some proteins, this folding begins immediately,
as the protein chain emerges from the ribosome, starting from the N-terminal
end. In these cases, as each protein domain emerges from the ribosome, within
a few seconds it forms a compact structure that contains most of the final
secondary features (α helices and β sheets) aligned in roughly the right conformation
(Figure 6-78). For some protein domains, this unusually dynamic and flexible
state, called a molten globule, is the starting point for a relatively slow process in
which many side-chain adjustments occur that eventually form the correct
tertiary structure. It takes several minutes to synthesize a protein of average size, and
for some proteins much of the folding process is complete by the time the
ribosome releases the C-terminal end of a protein (Figure 6-79).

Molecular Chaperones Help Guide the Folding of Most Proteins

Most proteins probably do not fold correctly during their synthesis and require a
special class of proteins called molecular chaperones to do so. Molecular
chaperones are useful for cells because there are many different folding paths
available to an unfolded or partially folded protein. Without chaperones, some of these
pathways would not lead to the correctly folded (and most stable) form because
the protein would become "kinetically trapped" in structures that are off-pathway.
Some of these off-pathway configurations would aggregate and be left as
irreversible dead ends of nonfunctional (and potentially dangerous) structures.



Figure 6-78 The structure of a molten
globule. (A) A molten globule form of
cytochrome b562 is more open and less
highly ordered than the final folded form
of the protein, shown in (B). Note that
the molten globule contains most of the
secondary structure of the final form,
although the ends of the α helices are
unraveled and one of the helices is only
partly formed. (Courtesy of Joshua Wand,
from Y. Feng et al., Nat. Struct. Biol. 1:30-35,
1994. With permission from Macmillan
Publishers Ltd.)

Figure 6-79 Co-translational protein folding. A growing polypeptide chain is shown acquiring
its secondary and tertiary structure as it emerges from a ribosome. The N-terminal domain folds
first, while the C-terminal domain is still being synthesized. This protein has not achieved its final
conformation at the time it is released from the ribosome. (Modified from A.N. Fedorov and
T.O. Baldwin, J. Biol. Chem. 272:32715-32718, 1997.)


Molecular chaperones specifically recognize incorrect, off-pathway
configurations by their exposure of hydrophobic surfaces, which in correctly folded
proteins are typically buried in the interior. The binding of these exposed hydrophobic
surfaces to each other is what causes off-pathway conformations to irreversibly
aggregate. We saw in Chapter 3 that in some cases of inherited human diseases,
aggregates do form and can cause severe symptoms and even death. Chaperones
prevent this from happening in normal proteins by binding to the exposed
hydrophobic surfaces using hydrophobic surfaces of their own. As we shall see shortly,
there are several types of chaperones; once bound to an incorrectly folded
protein, they ultimately release it in a way that gives the protein another chance to
fold correctly.

Cells Utilize Several Types of Chaperones 

Many molecular chaperones are called heat-shock proteins (designated hsp),
because they are synthesized in dramatically increased amounts after a brief
exposure of cells to an elevated temperature (for example, 42°C for cells that
normally live at 37°C). This reflects the operation of a feedback system that responds
to an increase in misfolded proteins (such as those produced by elevated
temperatures) by boosting the synthesis of the chaperones that help these proteins
refold.


There are several major families of molecular chaperones, including the hsp60
and hsp70 proteins. Different members of these families function in different
organelles. Thus, as discussed in Chapter 12, mitochondria contain their own
hsp60 and hsp70 molecules that are distinct from those that function in the
cytosol; and a special hsp70 (called BIP) helps to fold proteins in the endoplasmic
reticulum.

The hsp60 and hsp70 proteins each work with their own small set of
associated proteins when they help other proteins to fold. These hsps share an affinity
for the exposed hydrophobic patches on incompletely folded proteins, and they
hydrolyze ATP, often binding and releasing their protein substrate with each cycle
of ATP hydrolysis. In other respects, the two types of hsp proteins function
differently. The hsp70 machinery acts early in the life of many proteins (often before
the protein leaves the ribosome), with each monomer of hsp70 binding to a string
of about four or five hydrophobic amino acids (Figure 6-80). On binding ATP,
hsp70 releases the protein into solution, allowing it a chance to refold. In contrast,
hsp60-like proteins form a large barrel-shaped structure that acts after a protein
has been fully synthesized. This type of chaperone, sometimes called a
chaperonin, forms an "isolation chamber" for the folding process (Figure 6-81).

To enter a chamber, a substrate protein is first captured via the hydrophobic
entrance to the chamber. The protein is then released into the interior of the
chamber, which is lined with hydrophilic surfaces, and the chamber is sealed
with a lid, a step requiring ATP. Here, the substrate is allowed to fold into its final
conformation in isolation, where there are no other proteins with which to
aggregate. When ATP is hydrolyzed, the lid pops off, and the substrate protein, whether
folded or not, is released from the chamber.

The chaperones shown in Figures 6-80 and 6-81 often need many cycles of
ATP hydrolysis to fold a single polypeptide chain correctly. This energy is used
to perform mechanical movements of the hsp60 and hsp70 "machines,"
converting them from binding forms to releasing forms. Just as we saw for transcription,
splicing, and translation, the expenditure of free energy can be used by cells to
improve the accuracy of a biological process. In the case of protein folding, ATP
hydrolysis allows chaperones to recognize a wide variety of misfolded structures,
to halt any further misfolding, and to recommence the folding of a protein in an
orderly way.


Figure 6-80 The hsp70 family of
molecular chaperones. These proteins
act early, recognizing a small stretch of
hydrophobic amino acids on a protein's
surface. Aided by a set of smaller hsp40
proteins (not shown), ATP-bound hsp70
molecules grasp their target protein and
then hydrolyze ATP to ADP, undergoing
conformational changes that cause the
hsp70 molecules to associate even more
tightly with the target. After the hsp40
dissociates, the rapid rebinding of ATP
induces the dissociation of the hsp70
protein after ADP release. Repeated cycles
of hsp binding and release help the target
protein to refold.

Figure 6-81 The structure and function of the hsp60 family of molecular chaperones. (A) A
misfolded protein is initially captured by hydrophobic interactions with the exposed surface of the
opening. The initial binding often helps to unfold a misfolded protein. The subsequent binding of ATP
and a cap releases the substrate protein into an enclosed space, where it has a new opportunity to
fold. After about 10 seconds, ATP hydrolysis occurs, weakening the binding of the cap. Subsequent
binding of additional ATP molecules ejects the cap, and the protein is released. As indicated, only
half of the symmetric barrel operates on a client protein at any one time. This type of molecular
chaperone is also known as a chaperonin; it is designated as hsp60 in mitochondria, TCP1 in the
cytosol of vertebrate cells, and GroEL in bacteria. (B) The structure of GroEL bound to its GroES cap,
as determined by x-ray crystallography. On the left is shown the outside of the barrel-like structure,
and on the right a cross section through its center. (B, adapted from B. Bukau and A.L. Horwich, Cell
92:351-366, 1998. With permission from Elsevier.)

Although our discussion focuses on only two types of chaperones, the cell has
a variety of others. The enormous diversity of proteins in cells presumably requires
a wide range of chaperones with versatile surveillance and correction capabilities.

Exposed Hydrophobic Regions Provide Critical Signals for Protein Quality Control

If radioactive amino acids are added to cells for a brief period, the newly
synthesized proteins can be followed as they mature into their final functional forms.
This type of experiment demonstrates that the hsp70 proteins act first, beginning
when a protein is still being synthesized on a ribosome, and the hsp60-like
proteins act only later to help fold completed proteins. We have seen that the cell
distinguishes misfolded proteins, which require additional rounds of ATP-catalyzed
refolding, from those with correct structures through the recognition of
hydrophobic surfaces.

Usually, if a protein has a sizable exposed patch of hydrophobic amino acids
on its surface, it is abnormal: it has either failed to fold correctly after leaving the 
ribosome, suffered an accident that partly unfolded it at a later time, or failed to 
find its normal partner subunit in a larger protein complex. Such a protein is not 
merely useless to the cell, it can be dangerous. 

Proteins that rapidly fold correctly on their own do not display such patterns 
and generally bypass the chaperones. For the others, the chaperones can carry out 
"protein repair" by giving them additional chances to fold while, at the same time, 
preventing their aggregation. 

Figure 6-82 outlines all of the quality-control choices that a cell makes for a
difficult-to-fold, newly synthesized protein. As indicated, when attempts to refold
a protein fail, an additional mechanism is called into play that completely destroys
the protein by proteolysis. This proteolytic pathway begins with the recognition of
an abnormal hydrophobic patch on a protein's surface, and it ends with the
delivery of the entire protein to a protein-destruction machine, a complex protease
known as the proteasome. As described next, this process depends on an
elaborate protein-marking system that also carries out other central functions in the
cell by destroying selected normal proteins.


The Proteasome Is a Compartmentalized Protease with
Sequestered Active Sites

The proteolytic machinery and the chaperones compete with one another to
recognize a misfolded protein. If a newly synthesized protein folds rapidly, at most
only a small fraction of it is degraded. In contrast, a slowly folding protein is
vulnerable to the proteolytic machinery for a longer time, and many more of its
molecules may be destroyed before the remainder attain the proper folded state. Due
to mutations or to errors in transcription, RNA splicing, and translation, some
proteins never fold properly, and it is particularly important that the cell destroy
these potentially harmful proteins.
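
One simple way to picture this competition is as two first-order processes drawing on
the same pool of unfolded molecules. The Python sketch below is only an illustration
under that assumption; the function name and the rate constants `k_fold` and
`k_degrade` are hypothetical parameters introduced here, not values given in the text.

```python
def fraction_degraded(k_fold, k_degrade):
    """Fraction of unfolded molecules destroyed before they manage to fold.

    Assumes folding and degradation are independent first-order processes
    with rate constants k_fold and k_degrade (same units, e.g. per minute).
    Under that assumption each molecule is degraded with probability
    k_degrade / (k_fold + k_degrade).
    """
    if k_fold < 0 or k_degrade < 0:
        raise ValueError("rate constants must be non-negative")
    total = k_fold + k_degrade
    if total == 0:
        raise ValueError("at least one rate constant must be positive")
    return k_degrade / total


# Hypothetical numbers: the same degradation rate, two folding speeds.
fast_folder = fraction_degraded(k_fold=9.0, k_degrade=1.0)
slow_folder = fraction_degraded(k_fold=1.0, k_degrade=1.0)
print(f"fast folder: {fast_folder:.2f} of molecules degraded")
print(f"slow folder: {slow_folder:.2f} of molecules degraded")
```

In this toy model a protein that never folds (`k_fold = 0`) is degraded completely,
which matches the qualitative point made above about proteins that cannot fold properly.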

The apparatus that deliberately destroys aberrant proteins is the proteasome,
an abundant ATP-dependent protease that constitutes nearly 1% of cell protein.
Present in many copies dispersed throughout the cytosol and the nucleus, the
proteasome also destroys aberrant proteins that have entered the endoplasmic
reticulum (ER). In the latter case, an ER-based surveillance system detects
proteins that have failed either to fold or to be assembled properly after they enter the
ER, and retrotranslocates them back to the cytosol for degradation by the
proteasome (discussed in Chapter 12).

Figure 6-82 The processes that monitor
protein quality following protein
synthesis. A newly synthesized protein
sometimes folds correctly and assembles
on its own with its partner proteins, in
which case the quality control mechanisms
leave it alone. Incompletely folded proteins
are helped to properly fold by molecular
chaperones: first by a family of hsp70
proteins, and then, in some cases, by
hsp60-like proteins. For both types of
chaperones, the substrate proteins are
recognized by an abnormally exposed
patch of hydrophobic amino acids on their
surface. These "protein-rescue" processes
compete with another mechanism that,
upon recognizing an abnormally exposed
hydrophobic patch, marks the protein
for destruction by the proteasome. The
combined activity of all of these processes
is needed to prevent massive protein
aggregation in a cell, which can occur
when many hydrophobic regions on
proteins clump together nonspecifically.

Figure 6-83 The proteasome. (A) A cutaway
view of the structure of the central
20S cylinder, as determined by x-ray
crystallography, with the active sites of the
proteases indicated by red dots. (B) The
entire proteasome, in which the central
cylinder (yellow) is supplemented by a
19S cap (blue) at each end. The complex
cap (also called the regulatory particle)
selectively binds proteins that have been
marked by ubiquitin for destruction; it
then uses ATP hydrolysis to unfold their
polypeptide chains and feed them through
a narrow channel (see Figure 6-85) into
the inner chamber of the 20S cylinder for
digestion to short peptides. (B, from
W. Baumeister et al., Cell 92:367-380,
1998. With permission from Elsevier.)

Each proteasome consists of a central hollow cylinder (the 20S core
proteasome) formed from multiple protein subunits that assemble as a stack of four
heptameric rings (Figure 6-83). Some of the subunits are proteases whose active sites
face the cylinder's inner chamber, thus preventing them from running rampant
through the cell. Each end of the cylinder is normally associated with a large
protein complex (the 19S cap) that contains a six-subunit protein ring through which
target proteins are threaded into the proteasome core, where they are degraded
(Figure 6-84). The threading reaction, driven by ATP hydrolysis, unfolds the target
proteins as they move through the cap, exposing them to the proteases lining the
proteasome core (Figure 6-85). The proteins that make up the ring structure in
the proteasome cap belong to a large class of protein "unfoldases" known as AAA
proteins. Many of them function as hexamers, and they share mechanistic
features with the ATP-dependent DNA helicases that unwind DNA (see Figure 5-14).

Figure 6-84 Processive protein digestion by the proteasome. (A) The proteasome cap recognizes proteins marked by a
polyubiquitin chain (see Figure 3-70), and subsequently translocates them into the proteasome core, where they are digested. At
an early stage, the ubiquitin is cleaved from the substrate protein and is recycled. Translocation into the core of the proteasome
is mediated by a ring of ATPases that unfold the substrate protein as it is threaded through the ring and into the proteasome
core. This unfoldase ring is depicted in Figure 6-85. (B) Detailed structure of the proteasome cap. The cap includes a ubiquitin
receptor, which holds a ubiquitylated protein in place while it begins to be pulled into the proteasome core, and a ubiquitin
hydrolase, which cleaves ubiquitin from the doomed protein. (A, from S. Prakash and A. Matouschek, Trends Biochem. Sci.
29:593-600, 2004. With permission from Elsevier. B, adapted from G.C. Lander et al., Nature 482:186-191, 2012.)

A crucial property of the proteasome, and one reason for the complexity of
its design, is the processivity of its mechanism: in contrast to a "simple"
protease that cleaves a substrate's polypeptide chain just once before dissociating, the
proteasome keeps the entire substrate bound until all of it is converted into short
peptides.

One would expect that a machine as efficient as the proteasome would be
tightly regulated; in particular, it must be able to distinguish abnormal proteins
from those that are properly folded. The 19S cap of the proteasome acts as a gate
at the entrance to the inner proteolytic core, and only those proteins marked for
destruction are threaded through the cap. The destruction "mark" is the covalent
attachment of the small protein ubiquitin. As we saw in Chapter 3, ubiquitin
modification of proteins is used for many purposes in the cell. The particular type of
ubiquitin linkage that concerns us here is a chain of ubiquitin molecules linked
together at lysine 48 (see Figure 3-69); this is the distinguishing feature of the
ubiquitin tag that marks a protein for destruction in the proteasome.

A special set of E3 molecules (see Figure 3-70B) is responsible for the
ubiquitylation of denatured or otherwise misfolded proteins, as well as proteins
containing oxidized or other abnormal amino acids. Abnormal proteins tend to
display on their surface hydrophobic amino acid sequences or conformational
motifs that are recognized as degradation signals by these E3 molecules; these
sequences are buried and therefore inaccessible in the normal, properly folded
version. However, a proteolytic pathway that recognizes and destroys abnormal
proteins must be able to distinguish completed proteins that have "wrong"
conformations from the many growing polypeptides on ribosomes (as well as
polypeptides just released from ribosomes) that have not yet achieved their normal folded
conformation. This is not a trivial problem; in the course of carrying out its main
job, the ubiquitin-proteasome system probably destroys many nascent and newly
formed protein molecules, not because these proteins are abnormal as such, but
because they have transiently exposed degradation signals that are buried in their
mature (folded) state.

Many Proteins Are Controlled by Regulated Destruction 

One function of intracellular proteolytic mechanisms is to recognize and
eliminate misfolded or otherwise abnormal proteins, as just described. Indeed, every
protein in the cell eventually accumulates damage and is probably degraded by
the proteasome. Yet another function of these proteolytic pathways is to confer
short lifetimes on specific normal proteins whose concentrations must change
promptly with alterations in the state of a cell. Some of these short-lived proteins
are degraded rapidly at all times, while many others are conditionally short-lived;
that is, they are metabolically stable under some conditions, but become
unstable upon a change in the cell's state. For example, mitotic cyclins are long-lived
throughout the cell cycle until their sudden degradation at the end of mitosis, as
explained in Chapter 17.


Figure 6-85 A hexameric protein
unfoldase. (A) The proteasome cap
includes proteins (orange) that recognize
and hydrolyze ubiquitin and a hexameric
ring (blue) through which ubiquitylated
proteins are threaded. The hexameric ring
is formed from six subunits, each belonging
to the AAA family of proteins. (B) Model
for the ATP-dependent unfoldase activity
of AAA proteins. The ATP-bound form of
a hexameric ring of AAA proteins binds
a folded substrate protein that is held in
place by its ubiquitin tag. A conformational
change, driven by ATP hydrolysis, pulls
the substrate into the central core and
strains the ring structure. At this point, the
substrate protein, which is being tugged
upon, can partially unfold and enter further
into the pore or it can maintain its structure
and partially withdraw. Very stable protein
substrates may require hundreds of cycles
of ATP hydrolysis and dissociation before
they are successfully pulled through the
AAA protein ring. Once unfolded (and
de-ubiquitylated), the substrate protein
moves relatively quickly through the pore
by successive rounds of ATP hydrolysis.
(A, adapted from G.C. Lander et al., Nature
482:186-191, 2012; B, adapted from
R.T. Sauer et al., Cell 119:9-18, 2004.
With permission from Elsevier.)












[Figure 6-86, panel (A) ACTIVATION OF A UBIQUITIN LIGASE. Labels: phosphorylation by protein kinase; allosteric transition caused by ligand binding; allosteric transition caused by protein subunit addition.]

Figure 6-86 Two general ways of
inducing the degradation of a specific
protein. (A) Activation of a specific
E3 molecule creates a new ubiquitin
ligase. Eukaryotic cells have many
different E3 molecules, each activated
by a different signal. (B) Creation of an
exposed degradation signal in the protein
to be degraded. This signal binds a
ubiquitin ligase, causing the addition of a
polyubiquitin chain to a nearby lysine on
the target protein. All six pathways shown
are known to be used by cells to induce
the movement of selected proteins into the
proteasome.

How is such a regulated destruction of a protein controlled? Several general
mechanisms are illustrated in Figure 6-86. Specific examples of each mechanism
are discussed in later chapters. In one general class of mechanism (Figure 6-86A),
the activity of a ubiquitin ligase is turned on either by E3 phosphorylation or by
an allosteric transition in an E3 protein caused by its binding to a specific small or
large molecule. For example, the anaphase-promoting complex (APC) is a
multisubunit ubiquitin ligase that is activated by a cell-cycle-timed subunit addition at
mitosis. The activated APC then causes the degradation of mitotic cyclins and
several other regulators of the metaphase-anaphase transition (see Figure 17-15A).

Alternatively, in response either to intracellular signals or to signals from the
environment, a degradation signal can be created in a protein, causing its rapid
ubiquitylation and destruction by the proteasome (Figure 6-86B). One common
way to create such a signal is to phosphorylate a specific site on a protein that
unmasks a normally hidden degradation signal. Another way to unmask such a
signal is by the regulated dissociation of a protein subunit. Finally, powerful
degradation signals can be created by cleaving a single peptide bond, provided that
this cleavage creates a new N-terminus that is recognized by a specific E3 protein
as a "destabilizing" N-terminal residue. This E3 protein recognizes only certain
amino acids at the N-terminus of a protein; thus not all protein-cleavage events
will lead to degradation of the C-terminal fragment produced.

In humans, nearly 80% of proteins are acetylated on their N-terminal residue,
and we now know that this modification is recognized by a specific E3 enzyme,
which directs the ubiquitylation of the protein and sends it to the proteasome
for degradation. Thus, the majority of human proteins carry their own signals for
destruction. It has been proposed that when a protein is properly folded (and,
before that, when it is in contact with a chaperone), this acetylated N-terminus is
buried and therefore inaccessible to the E3 enzyme. According to this idea, as a
protein ages and becomes damaged (or if it fails to fold correctly from the start),
this destruction signal becomes exposed, and the protein is destroyed.


There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reactions
are required to produce a properly folded protein from the information contained
in a gene (Figure 6-87). The final level of a properly folded protein in a cell
therefore depends upon the efficiency with which each of the many steps is performed.
We also now know that the cell devotes enormous resources to selectively
degrading proteins, particularly those that fail to fold properly or accumulate damage
as they age. It is the balance between the rates of synthesis and degradation that
determines the final amount of every protein in the cell.
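
This balance can be made concrete with a worked equation. The model below is the
simplest one that captures it, and it is an assumption rather than something stated in
the chapter: a protein P is made at a constant rate k_s and degraded in a first-order
process with rate constant k_d. The symbols P, k_s, k_d, and P_ss are introduced here
for illustration only.

```latex
\frac{dP}{dt} = k_s - k_d\,P
\qquad\Longrightarrow\qquad
P_{\mathrm{ss}} = \frac{k_s}{k_d}
\quad \text{(obtained by setting } \tfrac{dP}{dt} = 0\text{)}
```

Under these assumptions, doubling the synthesis rate or halving the degradation rate
constant has the same effect: each doubles the steady-state amount of the protein.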

In the following chapter, we shall see that cells have the ability to change the
levels of their proteins according to their needs. In principle, any or all of the steps
in Figure 6-87 could be regulated for each individual protein. As we shall see in
Chapter 7, there are examples of regulation at each step from gene to protein.

Figure 6-87 The production of a protein
by a eukaryotic cell. The final level of each
protein in a eukaryotic cell depends upon
the efficiency of each step depicted.

Summary

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytosol on a large ribonucleoprotein assembly called a ribosome. Each 
amino acid used for protein synthesis is first attached to a tRNA molecule that 
recognizes, by complementary base-pair interactions, a particular set of three 
nucleotides (codons) in the mRNA. As an mRNA is threaded through a ribosome, its 
sequence of nucleotides is then read from one end to the other in sets of three 
according to the genetic code. 

To initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit then binds to complete the ribosome and begin protein 
synthesis. During this phase, aminoacyl-tRNAs, each bearing a specific amino 
acid, bind sequentially to the appropriate codons in mRNA through complementary 
base-pairing between tRNA anticodons and mRNA codons. Each amino acid 
is added to the C-terminal end of the growing polypeptide in four sequential steps: 
aminoacyl-tRNA binding, followed by peptide bond formation, followed by two 
ribosome translocation steps. Elongation factors use GTP hydrolysis both to drive 
these reactions forward and to improve the accuracy of amino acid selection. The 
mRNA molecule progresses codon by codon through the ribosome in the 5'-to-3' 
direction until it reaches one of three stop codons. A release factor then binds to the 
ribosome, terminating translation and releasing the completed polypeptide. 
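
The reading of an mRNA in sets of three, from a start codon to a stop codon, can be sketched in a few lines. This is only an illustration of the decoding logic, not of the ribosome: it assumes translation begins at the first AUG in the sequence, uses the standard genetic code, and writes amino acids in one-letter code. The function and variable names are hypothetical.

```python
BASES = "UCAG"
# Standard genetic code, listed in UCAG order for codon positions 1, 2, 3; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(mrna):
    """Read codons 5'-to-3' from the first AUG until a stop codon or the end of the mRNA."""
    start = mrna.find("AUG")
    if start == -1:
        return ""  # no start codon, so nothing is made
    protein = []
    for pos in range(start, len(mrna) - 2, 3):
        residue = CODON_TABLE[mrna[pos:pos + 3]]
        if residue == "*":
            break  # a stop codon ends the polypeptide
        protein.append(residue)
    return "".join(protein)


print(translate("GGAUGGCCUGGUAAGG"))  # codons read: AUG GCC UGG UAA
```

The first residue in the returned string is the N-terminus, and each later residue is appended at the C-terminal end, mirroring the direction of chain growth described above.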

Eukaryotic and bacterial ribosomes are closely related, despite differences in the 
number and size of their rRNA and protein components. The rRNA has the dominant 
role in translation, determining the overall structure of the ribosome, forming 
the binding sites for the tRNAs, matching the tRNAs to codons in the mRNA, and 
creating the active site of the peptidyl transferase enzyme that links amino acids 
together during translation. 

In the final steps of protein synthesis, two distinct types of molecular chaperones 
guide the folding of polypeptide chains. These chaperones, known as hsp60 
and hsp70, recognize exposed hydrophobic patches on proteins and serve to prevent 
the protein aggregation that would otherwise compete with the folding of newly 
synthesized proteins into their correct three-dimensional conformations. This 
protein-folding process must also compete with an elaborate quality control 
mechanism that destroys proteins with abnormally exposed hydrophobic patches. In this 
case, ubiquitin is covalently added to a misfolded protein by a ubiquitin ligase, and 
the resulting polyubiquitin chain is recognized by the cap on a proteasome that 
unfolds the protein and threads it into the interior of the proteasome for proteolytic 
degradation. A closely related proteolytic mechanism, based on special degradation 
signals recognized by ubiquitin ligases, is used to determine the lifetimes of 
many normally folded proteins as well as to remove selected proteins from the cell 
in response to specific signals. 

THE RNA WORLD AND THE ORIGINS OF LIFE 

We have seen that the expression of hereditary information requires extraordinarily 
complex machinery and proceeds from DNA to protein through an RNA 
intermediate. This machinery presents a central paradox: if nucleic acids are 
required to synthesize proteins and proteins are required, in turn, to synthesize 
nucleic acids, how did such a system of interdependent components ever arise? 
One view is that an RNA world existed on Earth before modern cells arose 
(Figure 6-88). According to this hypothesis, RNA both stored genetic information and 
catalyzed the chemical reactions in primitive cells. Only later in evolutionary time 
did DNA take over as the genetic material and proteins become the major catalysts 
and structural components of cells. If this idea is correct, then the transition 
out of the RNA world was never complete; as we have seen in this chapter, RNA 
still catalyzes several fundamental reactions in modern-day cells, which can be 
viewed as molecular fossils from an earlier world. 

Figure 6-88 Time line for the universe, suggesting the early existence of an RNA world of 
living systems. 



The RNA world hypothesis relies on the fact that, among present-day biological 
molecules, RNA is unique in being able to act both as a carrier of genetic 
information and as a ribozyme to catalyze chemical reactions. In this section, we 
discuss these properties of RNA and how they may have been especially important 
in early cells. 

Single-Stranded RNA Molecules Can Fold into Highly Elaborate 
Structures 

We have seen in this chapter that RNA can carry genetic information in mRNAs, 
and we saw in Chapter 5 that the genomes of some viruses are composed solely 
of RNA. We have also seen that complementary base-pairing and other types of 
hydrogen bonds can occur between nucleotides in the same chain of RNA, causing 
an RNA molecule to fold up in a unique way determined by its nucleotide 
sequence (see, for example, Figures 6-50 and 6-67). Comparisons of many RNA 
structures have revealed conserved motifs, short structural elements that are used 
over and over again as parts of larger structures (Figure 6-89). 

Protein catalysts require a surface with unique contours and chemical properties 
on which a given set of substrates can react (discussed in Chapter 3). In 
exactly the same way, an RNA molecule with an appropriately folded shape can 
serve as a catalyst (Figure 6-90). Like some proteins, many of these ribozymes 
work by positioning metal ions at their active sites. This feature gives them a wider 
range of catalytic activities than provided by the limited chemical groups of a 
polynucleotide chain. 

Much of our inference about the RNA world has come from experiments in 
which large pools of RNA molecules of random nucleotide sequences are generated 
in the laboratory. Those rare RNA molecules with a property specified by the 
experimenter are then selected out and studied (Figure 6-91). Such experiments 
have created RNAs that can catalyze a wide variety of biochemical reactions (Table 
6-5), with reaction rate enhancements only a few orders of magnitude lower than 
those of the “fastest” protein enzymes. Given these findings, it is not clear why 
protein catalysts greatly outnumber ribozymes in modern cells. Experiments have 
shown, however, that RNA molecules may have more difficulty than proteins in 
binding to flexible, hydrophobic substrates. In any case, the availability of 20 types 
of amino acids presumably provides proteins with a greater number of catalytic 
strategies. 

Figure 6-89 Some common elements of RNA structure. Conventional, 
complementary base-pairing interactions are indicated by red “rungs” in double-helical 
portions of the RNA. 

Figure 6-90 A ribozyme. This simple RNA molecule catalyzes the cleavage of a second RNA at a specific site. This ribozyme is found embedded 
in larger RNA genomes, called viroids, which infect plants. The cleavage, which occurs in nature at a distant location on the same RNA molecule 
that contains the ribozyme, is a step in the replication of the viroid genome. Although not shown in the figure, the reaction requires a magnesium ion 
positioned at the active site. (Adapted from T.R. Cech and O.C. Uhlenbeck, Nature 372:39-40, 1994. With permission from Macmillan Publishers Ltd.) 

Figure 6-91 In vitro selection of a synthetic ribozyme. Beginning with a 
large pool of nucleic acid molecules synthesized in the laboratory, those rare 
RNA molecules that possess a specified catalytic activity can be isolated 
and studied. Although a specific example (that of an autophosphorylating 
ribozyme) is shown, variations of this procedure have been used to generate 
many of the ribozymes listed in Table 6-5. During the autophosphorylation 
step, the RNA molecules are kept sufficiently dilute to prevent the 
“cross”-phosphorylation of additional RNA molecules. In reality, several repetitions 
of this procedure are necessary to select the very rare RNA molecules with 
this catalytic activity. Thus, the material initially eluted from the column is 
converted back into DNA, amplified many fold (using reverse transcriptase 
and PCR, as explained in Chapter 8), transcribed back into RNA, and 
subjected to repeated rounds of selection. (Adapted from J.R. Lorsch and 
J.W. Szostak, Nature 371:31-36, 1994. With permission from Macmillan 
Publishers Ltd.) 
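
Why repeated rounds of selection are needed to recover very rare active RNAs can be seen with a toy calculation. The sketch below assumes that each round retains a fixed fraction of active molecules and, through imperfect separation, a much smaller fraction of inactive ones, and that the amplification step preserves their proportions. All names and numbers are hypothetical and chosen only for illustration; they are not measurements from any experiment.

```python
def enrich(active_fraction, rounds, keep_active=0.5, keep_inactive=0.001):
    """Return the fraction of active molecules in the pool after each round of selection."""
    history = [active_fraction]
    for _ in range(rounds):
        retained_active = active_fraction * keep_active
        retained_inactive = (1.0 - active_fraction) * keep_inactive
        # After amplification, the pool keeps the proportions of what was retained.
        active_fraction = retained_active / (retained_active + retained_inactive)
        history.append(active_fraction)
    return history


# Hypothetical starting pool: one active molecule in ten billion.
for round_number, fraction in enumerate(enrich(1e-10, rounds=5)):
    print(round_number, fraction)
```

With these assumed retention values a single round leaves the active molecules vastly outnumbered, while a handful of rounds makes them the majority of the pool.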

RNA Can Both Store Information and Catalyze Chemical 
Reactions 

RNA molecules have one property that contrasts with those of polypeptides: they 
can directly guide the formation of copies of their own sequence. This capacity 
depends on complementary base-pairing of their nucleotide subunits, which 
enables one RNA to act as a template for the formation of another. As we have seen 
in this and the preceding chapter, these complementary templating mechanisms 
lie at the heart of DNA replication and transcription in modern-day cells. 
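
Complementary templating can be illustrated with a short sketch. It assumes only standard Watson-Crick pairing (A with U, G with C) and antiparallel strands, so the strand made on a template is its reverse complement; copying that product once more regenerates the original sequence. The function name and example sequence are hypothetical.

```python
PAIRS = {"A": "U", "U": "A", "G": "C", "C": "G"}


def template_copy(rna):
    """Return the strand made on `rna` as a template, written 5'-to-3'."""
    # The new strand is antiparallel to its template, hence the reversal.
    return "".join(PAIRS[base] for base in reversed(rna))


original = "GGAUC"
complement = template_copy(original)       # first round: the complementary strand
regenerated = template_copy(complement)    # second round: the original sequence again
print(original, complement, regenerated)
```

Two rounds of templating are therefore needed to produce a true copy, because the first round yields the complementary sequence and not the original.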


Table 6-5 Some Biochemical Reactions That Can Be Catalyzed by Ribozymes 

Activity                                       Ribozymes 
Peptide bond formation in protein synthesis    Ribosomal RNA 
RNA cleavage, RNA ligation                     Self-splicing RNAs; RNase P; also in vitro selected RNA 
DNA cleavage                                   Self-splicing RNAs 
RNA splicing                                   Self-splicing RNAs, RNAs of the spliceosome 
RNA polymerization                             In vitro selected RNA 
RNA and DNA phosphorylation                    In vitro selected RNA 
RNA aminoacylation                             In vitro selected RNA 
RNA alkylation                                 In vitro selected RNA 
Amide bond formation                           In vitro selected RNA 
Glycosidic bond formation                      In vitro selected RNA 
Oxidation/reduction reactions                  In vitro selected RNA 
Carbon-carbon bond formation                   In vitro selected RNA 
Phosphoamide bond formation                    In vitro selected RNA 
Disulfide exchange                             In vitro selected RNA 



[Figure 6-91 panel labels: large pool of double-stranded DNA molecules, each with a different, randomly generated nucleotide sequence; transcription by RNA polymerase and folding of RNA molecules; large pool of single-stranded RNA molecules, each with a different, randomly generated nucleotide sequence.] 

Figure 6-92 An RNA molecule that can catalyze its own synthesis. This 
hypothetical process would require catalysis both of the production of a 
second RNA strand of complementary nucleotide sequence (not shown) and the 
use of this second RNA molecule as a template to form many molecules of RNA 
with the original sequence. The red rays represent the active site of this hypothetical 
RNA enzyme. 


But the efficient synthesis of RNA by such complementary templating mechanisms 
requires catalysts to promote the polymerization reaction: without catalysts, 
polymer formation is slow, error-prone, and inefficient. 

Because RNA has all the properties required of a molecule that could catalyze 
a variety of chemical reactions, including those that lead to its own synthesis 
(Figure 6-92), it has been proposed that RNAs served long ago as the catalysts for 
template-dependent RNA synthesis. Although self-replicating systems of RNA 
molecules have not been found in nature, scientists have made significant progress 
toward constructing them in the laboratory. While such demonstrations would 
not prove that self-replicating RNA molecules were central to the origin of life on 
Earth, they would establish that such a scenario is plausible. 

How Did Protein Synthesis Evolve? 

The molecular processes underlying protein synthesis in present-day cells seem 
inextricably complex. Although we understand most of them, they do not make 
conceptual sense in the way that DNA transcription, DNA repair, and DNA 
replication do. It is especially difficult to imagine how protein synthesis evolved 
because it is now performed by a complex interlocking system of protein and RNA 
molecules; obviously the proteins could not have existed until an early version of 
the translation apparatus was already in place. As attractive as the RNA world idea 
is for envisioning early life, it does not explain how the modern-day system of 
protein synthesis arose. Although we can only speculate on the origins of the genetic 
code, several experimental observations have provided plausible scenarios. 

In modern cells, some short peptides (such as antibiotics) are synthesized 
without the ribosome; peptide synthetase enzymes assemble these peptides, with 
their proper sequence of amino acids, without mRNAs to guide their synthesis. It 
is plausible that this noncoded, primitive version of protein synthesis first developed 
in the RNA world, where it would have been catalyzed by RNA molecules. 
This idea presents no conceptual difficulties because, as we have seen, rRNA catalyzes 
peptide bond formation in present-day cells. However, it leaves unexplained 
how the genetic code, which lies at the core of protein synthesis in today's cells, 
might have arisen. We know that ribozymes created in the laboratory can perform 
specific aminoacylation reactions; that is, they can match specific amino acids to 
specific tRNAs. It is therefore possible that tRNA-like adaptors, each matched to a 
specific amino acid, could have arisen in the RNA world, marking the beginnings 
of a genetic code. 

Once coded protein synthesis evolved, the transition to a protein-dominated 
world could proceed, with proteins eventually taking over the majority of catalytic 
and structural tasks because of their greater versatility, with 20 rather than 4 
different subunits. Although these ideas are highly speculative, they are consistent 
with the known properties of RNA and protein molecules. 

All Present-Day Cells Use DNA as Their Hereditary Material 

If the evolutionary speculations embodied in the RNA world hypothesis are 
correct, early cells would have differed fundamentally from the cells we know 
today in having their hereditary information stored in RNA rather than in DNA 
(Figure 6-93). Evidence that RNA arose before DNA in evolution can be found 
in the chemical differences between them. Ribose, like glucose and other simple 
carbohydrates, can be formed from formaldehyde (HCHO), a simple chemical 
that is readily produced in laboratory experiments that attempt to simulate 
conditions on the primitive Earth. The sugar deoxyribose is harder to make, and 
in present-day cells it is produced from ribose in a reaction catalyzed by a protein 
enzyme, suggesting that ribose pre-dates deoxyribose in cells. Presumably, 
DNA appeared on the scene later, but then proved more suitable than RNA as 
a permanent repository of genetic information. In particular, the deoxyribose in 
its sugar-phosphate backbone makes chains of DNA chemically more stable than 
chains of RNA, so that much greater lengths of DNA can be maintained without 
breakage. 

The other differences between RNA and DNA (the double-helical structure of 
DNA and the use of thymine rather than uracil) further enhance DNA stability by 
making the many unavoidable accidents that occur to the molecule much easier 
to repair, as discussed in detail in Chapter 5 (pp. 271-273). 

Figure 6-93 The hypothesis that RNA preceded DNA and proteins in evolution. 
In the earliest cells, RNA molecules (or their close analogs) would have had combined 
genetic, structural, and catalytic functions. In present-day cells, DNA is the repository 
of genetic information, and proteins perform the vast majority of catalytic 
functions in cells. RNA primarily functions today as a go-between in protein synthesis, 
although it remains a catalyst for a small number of crucial reactions. 
[Figure 6-93 panel labels: evolution of RNAs that can direct protein synthesis; evolution of new enzymes that replicate DNA and make RNA copies from it.] 


Summary 

From our knowledge of present-day organisms and the molecules they contain, it 
seems likely that the development of the distinctive autocatalytic mechanisms 
fundamental to living systems began with the evolution of families of RNA molecules 
that could catalyze their own replication. DNA is likely to have been a late addition: 
as the accumulation of protein catalysts allowed more efficient and complex cells to 
evolve, the DNA double helix replaced RNA as a more stable molecule for storing the 
increased amounts of genetic information required by such cells. 


WHAT WE DON'T KNOW 


• How did the present relationships between nucleic acids and proteins 
evolve? How did the genetic code originate? 

• The information carried in genomes specifies the sequences of all proteins 
and RNA molecules in the cell, and it determines when and where 
these molecules are synthesized. Do genomes carry other types of 
information that we have not yet discovered? 

• Cells go to great lengths to correct mistakes in the processes of DNA 
replication, transcription, splicing, and translation. Are there analogous 
strategies to correct mistakes in the selection of which genes are to be 
expressed in a given cell type? Could the great complexity of transcription 
initiation in animals and plants reflect such a strategy? 


PROBLEMS 


Which statements are true? Explain why or why not. 

6-1 The consequences of errors in transcription are 
less severe than those of errors in DNA replication. 

6-2 Since introns are largely genetic “junk,” they do not 
have to be removed precisely from the primary transcript 
during RNA splicing. 

6-3 Wobble pairing occurs between the first position 
in the codon and the third position in the anticodon. 

6-4 During protein synthesis, the thermodynamics of 
base-pairing between tRNAs and mRNAs sets the upper 
limit for the accuracy with which protein molecules are 
made. 



6-5 Protein enzymes are thought to greatly outnumber 
ribozymes in modern cells because they can catalyze 
a much greater variety of reactions and all of them have 
faster rates than any ribozyme. 

Discuss the following problems. 

6-6 In which direction along the template must the 
RNA polymerase in Figure Q6-1 be moving to have generated 
the supercoiled structures that are shown? Would 
you expect supercoils to be generated if the RNA polymerase 
were free to rotate about the axis of the DNA as it 
progressed along the template? 

Figure Q6-1 Supercoils around a moving RNA polymerase (Problem 6-6). 

6-7 You have attached an RNA polymerase molecule 
to a glass slide and have allowed it to initiate transcription 
on a template DNA that is tethered to a magnetic bead as 
shown in Figure Q6-2. If the DNA with its attached magnetic 
bead moves relative to the RNA polymerase as indicated 
in the figure, in which direction will the bead rotate? 

Figure Q6-2 System for measuring the rotation of DNA 
caused by RNA polymerase (Problem 6-7). The magnet 
holds the bead upright (but doesn't interfere with its 
rotation), and the attached tiny fluorescent beads allow 
the direction of motion to be visualized under the 
microscope. RNA polymerase is held in place by attachment to 
the glass slide. 

Figure Q6-3 Alternatively spliced mRNAs from the human 
α-tropomyosin gene (Problem 6-8). (A) Exons in the human 
α-tropomyosin gene. The locations and relative sizes of exons are 
shown by the blue and red rectangles, with alternative exons in red. 
(B) Splicing patterns for four α-tropomyosin mRNAs. Splicing is 
indicated by lines connecting the exons that are included in the mRNA. 
[Figure Q6-3 panel titles: (A) Human α-tropomyosin gene; (B) Four different splice variants.] 

6-8 The human α-tropomyosin gene is alternatively 
spliced to produce different forms of α-tropomyosin 
mRNA in different cell types (Figure Q6-3). For all forms of 
the mRNA, the protein sequences encoded by exon 1 are 
the same, as are the protein sequences encoded by exon 
10. Exons 2 and 3 are alternative exons used in different 
mRNAs, as are exons 7 and 8. Which of the following statements 
about exons 2 and 3 is the most accurate? Is that 
statement also the most accurate one for exons 7 and 8? 
Explain your answers. 

A. Exons 2 and 3 must have the same number of 
nucleotides. 

B. Exons 2 and 3 must each contain an integral number 
of codons (that is, the number of nucleotides divided 
by 3 must be an integer). 

C. Exons 2 and 3 must each contain a number of 
nucleotides that when divided by 3 leaves the same 
remainder (that is, 0, 1, or 2). 

6-9 After treating cells with a chemical mutagen, you 
isolate two mutants. One carries alanine and the other 
carries methionine at a site in the protein that normally 
contains valine (Figure Q6-4). After treating these two 
mutants again with the mutagen, you isolate mutants from 
each that now carry threonine at the site of the original 
valine (Figure Q6-4). Assuming that all mutations involve 
single-nucleotide changes, deduce the codons that are 
used for valine, methionine, threonine, and alanine at the 
affected site. Would you expect to be able to isolate 
valine-to-threonine mutants in one step? 

Figure Q6-4 Two rounds of mutagenesis and the altered 
amino acids at a single position in a protein (Problem 6-9). 
[Diagram: the first treatment converts Val to Ala or to Met; the second treatment converts each of these to Thr.] 

6-10 Which of the following mutational changes would 
you predict to be the most deleterious to gene function? 
Explain your answers. 

1. Insertion of a single nucleotide near the end of the 
coding sequence. 

2. Removal of a single nucleotide near the beginning 
of the coding sequence. 

3. Deletion of three consecutive nucleotides in the 
middle of the coding sequence. 

4. Substitution of one nucleotide for another in the 
middle of the coding sequence. 

6-11 Prokaryotes and eukaryotes both protect against 
the dangers of translating broken mRNAs. What dangers 
do partial mRNAs pose for the cell? 

6-12 Both hsp60-like and hsp70 molecular chaperones 
share an affinity for exposed hydrophobic patches on proteins, 
using them as indicators of incomplete folding. Why 
do you suppose hydrophobic patches serve as critical signals 
for the folding status of a protein? 

6-13 Most proteins require molecular chaperones to 
assist in their correct folding. How do you suppose the 
chaperones themselves manage to fold correctly? 

6-14 What is so special about RNA that it is hypothesized 
to be an evolutionary precursor to DNA and protein? 
What is it about DNA that makes it a better material than 
RNA for storage of genetic information? 

6-15 If an RNA molecule could form a hairpin with a 
symmetric internal loop, as shown in Figure Q6-5, could 
the complement of this RNA form a similar structure? If so, 
would there be any regions of the two structures that are 
identical? Which ones? 


Figure Q6-5 An RNA hairpin with a symmetric internal loop (Problem 6-15). 


6-16 Imagine a warm pond on the primordial Earth. 
Chance processes have just assembled a single copy of an 
RNA molecule with a catalytic site that can carry out RNA 
replication. This RNA molecule folds into a structure that 
is capable of linking nucleotides according to instructions 
in an RNA template. Given an adequate supply of nucleotides, 
will this single RNA molecule be able to use itself as a 
template to catalyze its own replication? Why or why not? 


CHAPTER 7 

Control of Gene Expression 


An organism's DNA encodes all of the RNA and protein molecules required to 
construct its cells. Yet a complete description of the DNA sequence of an organism, 
be it the few million nucleotides of a bacterium or the few billion nucleotides 
of a human, no more enables us to reconstruct the organism than a list of 
English words enables us to reconstruct a play by Shakespeare. In both cases, the 
problem is to know how the elements in the DNA sequence or the words on the 
list are used. Under what conditions is each gene product made, and, once made, 
what does it do? 

In this chapter, we focus on the first half of this problem: the rules and mechanisms 
that enable a subset of genes to be selectively expressed in each cell. These 
mechanisms operate at many levels, and we shall discuss each level in turn. But 
first we present some of the basic principles involved. 

AN OVERVIEW OF GENE CONTROL 

The different cell types in a multicellular organism differ dramatically in both 
structure and function. If we compare a mammalian neuron with a liver cell, for 
example, the differences are so extreme that it is difficult to imagine that the two 
cells contain the same genome (Figure 7-1). For this reason, and because cell 
differentiation often seemed irreversible, biologists originally suspected that genes 
might be selectively lost when a cell differentiates. We now know, however, that 
cell differentiation generally occurs without changes in the nucleotide sequence 
of a cell's genome. 


IN THIS CHAPTER 

AN OVERVIEW OF GENE CONTROL 

CONTROL OF TRANSCRIPTION BY SEQUENCE-SPECIFIC DNA-BINDING PROTEINS 

TRANSCRIPTION REGULATORS SWITCH GENES ON AND OFF 

MOLECULAR GENETIC MECHANISMS THAT CREATE AND MAINTAIN SPECIALIZED CELL TYPES 

MECHANISMS THAT REINFORCE CELL MEMORY IN PLANTS AND ANIMALS 

POST-TRANSCRIPTIONAL CONTROLS 


The Different Cell Types of a Multicellular Organism Contain the 
Same DNA 

The cell types in a multicellular organism become different from one another 
because they synthesize and accumulate different sets of RNA and protein molecules. 
The initial evidence that they do this without altering the sequence of 
their DNA came from a classic set of experiments in frogs. When the nucleus of 
a fully differentiated frog cell is injected into a frog egg whose nucleus has been 
removed, the injected donor nucleus is capable of directing the recipient egg to 
produce a normal tadpole (Figure 7-2A). The tadpole contains a full range of 
differentiated cells that derived their DNA sequences from the nucleus of the original 
donor cell. Thus, the differentiated donor cell cannot have lost any important 
DNA sequences. A similar conclusion came from experiments performed with 
plants. When differentiated pieces of plant tissue are placed in culture and then 
dissociated into single cells, often one of these individual cells can regenerate an 
entire adult plant (Figure 7-2B). And the same principle has been more recently 
demonstrated in mammals that include sheep, cattle, pigs, goats, dogs, and mice 
(Figure 7-2C). 

Most recently, detailed DNA sequencing has confirmed the conclusion that 
the changes in gene expression that underlie the development of multicellular 
organisms do not generally involve changes in the DNA sequence of the genome. 

Figure 7-1 A neuron and a liver cell share the same genome. The long branches 
of this neuron from the retina enable it to receive electrical signals from many other 
neurons and convey them to neighboring neurons. The liver cell, which is drawn 
to the same scale, is involved in many metabolic processes, including digestion 
and the detoxification of alcohol and other drugs. Both of these mammalian cells 
contain the same genome, but they express different sets of RNAs and proteins. (Neuron 
adapted from S. Ramón y Cajal, Histologie du Système Nerveux de l'Homme et des 
Vertébrés, 1909-1911. Paris: Maloine; reprinted, Madrid: C.S.I.C., 1972.) 

Figure 7-2 Differentiated cells contain all the genetic instructions necessary 
to direct the formation of a complete organism. (A) The nucleus of a skin cell 
from an adult frog transplanted into an enucleated egg can give rise to an entire 
tadpole. The broken arrow indicates that, to give the transplanted genome time to 
adjust to an embryonic environment, a further transfer step is required in which one 
of the nuclei is taken from an early embryo that begins to develop and is put back 
into a second enucleated egg. (B) In many types of plants, differentiated cells retain 
the ability to “de-differentiate,” so that a single cell can form a clone of progeny cells 
that later give rise to an entire plant. (C) A nucleus removed from a differentiated cell 
from an adult cow and introduced into an enucleated egg from a different cow can 
give rise to a calf. Different calves produced from the same differentiated cell donor are 
all clones of the donor and are therefore genetically identical. (A, modified from 
J.B. Gurdon, Sci. Am. 219:24-35, 1968.) 
[Figure 7-2 panel labels: (A) tadpole; (B) young carrot plant; (C) epithelial cells from oviduct; unfertilized egg cell; meiotic spindle; meiotic spindle and associated chromosomes removed; donor cell placed next to enucleated egg; electric pulse causes donor cell to fuse with enucleated egg cell; reconstructed zygote; embryo; embryo placed in foster mother; calf.] 

[Figure 7-3 panel labels: (A) β-actin gene; (B) tyrosine aminotransferase gene; start of transcription; exons; introns; no. of reads; cell lines: embryonic stem cell, liver cell, muscle cell, blood vessel cell, blood cell precursor, skin cell, lung cell.] 

Different Cell Types Synthesize Different Sets of RNAs and 
Proteins 

As a first step in understanding cell differentiation, we would like to know how 
many differences there are between any one cell type and another. Although we 
still do not have an exact answer to this fundamental question, we can make 
several general statements. 

1. Many processes are common to ail cells, and any two cells in a single 
organism therefore hâve many gene products in common. These include 
the structural proteins of chromosomes, RNA and DNA polymerases, DNA 
repair enzymes, ribosomal proteins and RNAs, the enzymes that catalyze 
the central reactions of metabolism, and many of the proteins that form the 
cytoskeleton such as actin (Figure 7-3A). 

2. Some RNAs and proteins are abundant in the specialized cells in which 
they function and cannot be detected elsewhere, even by sensitive tests. 
Hemoglobin, for example, is expressed specifically in red blood cells, 
where it carries oxygen, and the enzyme tyrosine aminotransferase (which 
breaks down tyrosine in food) is expressed in liver but not in most other 
tissues (Figure 7-3B). 

3. Studies of the number of different RNAs suggest that, at any one time, a
typical human cell expresses 30-60% of its approximately 30,000 genes at
some level. There are about 21,000 protein-coding genes and a roughly
estimated 9000 noncoding RNA genes in humans. When the patterns of RNA
expression in different human cell lines are compared, the level of
expression of almost every gene is found to vary from one cell type to another. A
few of these differences are striking, like those of hemoglobin and tyrosine
aminotransferase noted above, but most are much more subtle. But even
those genes that are expressed in all cell types usually vary in their level of
expression from one cell type to the next.

4. Although there are striking differences in coding RNAs (mRNAs) in
specialized cell types, they underestimate the full range of differences in the final
pattern of protein production. As we discuss in this chapter, there are many
steps after RNA production at which gene expression can be regulated.
And, as we saw in Chapter 3, proteins are often covalently modified after
they are synthesized. The radical differences in gene expression between
cell types are therefore most fully revealed through methods that directly
display the levels of proteins along with their post-translational
modifications (Figure 7-4).
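
The figures in statement 3 can be turned into rough gene counts. The short sketch below only restates the numbers quoted in the text (about 21,000 protein-coding genes, a roughly estimated 9000 noncoding RNA genes, and 30-60% expressed at some level); the function and variable names are illustrative and not taken from the text.

```python
# Gene counts quoted in the text; the noncoding figure is a rough estimate.
PROTEIN_CODING_GENES = 21_000
NONCODING_RNA_GENES = 9_000
TOTAL_GENES = PROTEIN_CODING_GENES + NONCODING_RNA_GENES  # about 30,000


def expressed_gene_range(total_genes, low_fraction=0.30, high_fraction=0.60):
    """Return (low, high) numbers of genes expressed at some level."""
    return round(total_genes * low_fraction), round(total_genes * high_fraction)


low, high = expressed_gene_range(TOTAL_GENES)
print(f"{TOTAL_GENES} genes: roughly {low} to {high} expressed in a typical cell")
```

On these figures, a typical human cell would be expressing somewhere between about 9000 and 18,000 genes at any one time.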


Figure 7-3 Differences in RNA levels
for two human genes in seven different
tissues. To obtain the RNA data by the
technique known as RNA-seq (see
p. 447), RNA was collected from human
cell lines grown in culture, derived from
each of the seven indicated tissues. Millions
of "sequence reads" were obtained and
mapped across the human genome by
matching RNA sequences to the DNA
sequence of the genome. At each position
along the genome, the height of the colored
trace is proportional to the number of
sequence reads that match the genome
sequence at that point. As seen in the
figure, the exon sequences in transcribed
genes are present at high levels, reflecting
their presence in mature mRNAs. Intron
sequences are present at much lower
levels and reflect pre-mRNA molecules
that have not yet been spliced plus intron
sequences that have been spliced out but
not yet degraded. (A) The gene coding for
"all-purpose" actin, a major component
of the cytoskeleton. Note that the left-hand
end of the mature β-actin mRNA is
not translated into protein. As explained
later in this chapter, many mRNAs have
5' untranslated regions that regulate their
translation into protein. (B) The same type
of data displayed for the enzyme tyrosine
aminotransferase, which is highly expressed
in liver cells but not in the other cell types
tested. (Information for both panels from the
University of California, Santa Cruz, Genome
Browser (http://genome.ucsc.edu), which
provides this type of information for every
human gene. See also S. Djebali et al.,
Nature 489:101-108, 2012.)

[Figure 7-4 labels: (A) human brain; (B) human liver; horizontal axis: isoelectric point (acidic to basic)]


External Signals Can Cause a Cell to Change the Expression of Its
Genes

Although the specialized cells in a multicellular organism have characteristic
patterns of gene expression, each cell is capable of altering its pattern of gene
expression in response to extracellular cues. If a liver cell is exposed to a
glucocorticoid hormone, for example, the production of a set of proteins is dramatically
increased. Released in the body during periods of starvation or intense exercise,
glucocorticoids signal the liver to increase the production of energy from amino
acids and other small molecules; the set of proteins whose production is induced
includes the enzyme tyrosine aminotransferase, mentioned above. When the
hormone is no longer present, the production of these proteins drops to its normal,
unstimulated level in liver cells.

Other cell types respond to glucocorticoids differently. Fat cells, for example,
reduce the production of tyrosine aminotransferase, while some other cell types
do not respond to glucocorticoids at all. These examples illustrate a general
feature of cell specialization: different cell types often respond very differently to the
same extracellular signal. Other features of the gene expression pattern do not
change and give each cell type its permanently distinctive character.


Figure 7-4 Differences in the proteins
expressed by two human tissues,
(A) brain and (B) liver. In each panel,
the proteins are displayed using
two-dimensional polyacrylamide-gel
electrophoresis (see pp. 452-454). The
proteins have been separated by molecular
weight (top to bottom) and isoelectric
point, the pH at which the protein has
no net charge (right to left). The protein
spots artificially colored red are common
to both samples; those in blue are specific
to that tissue. The differences between
the two tissue samples vastly outweigh
their similarities: even for proteins that
are shared between the two tissues, their
relative abundances are usually different.
Note that this technique separates proteins
by both size and charge; therefore a protein
that has several different phosphorylation
states will appear as a series of horizontal
spots (see upper right-hand portion of
right panel). Only a small portion of the
complete protein spectrum is shown for
each sample.

Methods based on mass spectrometry
(see pp. 455-457) provide much more
detailed information, including the
identity of each protein, the position of
each modification, and the nature of the
modification. (Courtesy of Tim Myers and
Leigh Anderson, Large Scale Biology
Corporation.)


Gene Expression Can Be Regulated at Many of the Steps in the 
Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the
particular genes that the cells express, at what level is the control of gene expression
exercised? As we saw in the previous chapter, there are many steps in the pathway
leading from DNA to protein. We now know that all of them can in principle be
regulated. Thus a cell can control the proteins it makes by (1) controlling when
and how often a given gene is transcribed (transcriptional control),
(2) controlling the splicing and processing of RNA transcripts (RNA processing control),
(3) selecting which completed mRNAs are exported from the nucleus to the
cytosol and determining where in the cytosol they are localized (RNA transport and
localization control), (4) selecting which mRNAs in the cytoplasm are translated
by ribosomes (translational control), (5) selectively destabilizing certain mRNA
molecules in the cytoplasm (mRNA degradation control), or (6) selectively
activating, inactivating, degrading, or localizing specific protein molecules after they
have been made (protein activity control) (Figure 7-5).

For most genes, transcriptional controls are paramount. This makes sense
because, of all the possible control points illustrated in Figure 7-5, only
transcriptional control ensures that the cell will not synthesize superfluous intermediates.
In the following sections, we discuss the DNA and protein components that
perform this function by regulating the initiation of gene transcription. We shall then
return to the additional ways of regulating gene expression.

Figure 7-5 Six steps at which eukaryotic
gene expression can be controlled.
Controls that operate at steps 1 through
5 are discussed in this chapter. Step 6,
the regulation of protein activity, occurs
largely through covalent post-translational
modifications including phosphorylation,
acetylation, and ubiquitylation (see
Table 3-3, p. 165). Step 6 was introduced
in Chapter 3 and is subsequently discussed
in many chapters throughout the book.


Summary 

The genome of a cell contains in its DNA sequence the information to make many
thousands of different protein and RNA molecules. A cell typically expresses only a
fraction of its genes, and the different types of cells in multicellular organisms arise
because different sets of genes are expressed. Moreover, cells can change the pattern
of genes they express in response to changes in their environment, such as signals
from other cells. Although all of the steps involved in expressing a gene can in
principle be regulated, for most genes the initiation of RNA transcription provides the
most important point of control.


CONTROL OF TRANSCRIPTION BY SEQUENCE- 
SPECIFIC DNA-BINDING PROTEINS 


How does a cell determine which of its thousands of genes to transcribe? Perhaps
the most important concept, one that applies to all species on Earth, is based on
a group of proteins known as transcription regulators. These proteins recognize
specific sequences of DNA (typically 5-10 nucleotide pairs in length) that are
often called cis-regulatory sequences, because they must be on the same
chromosome (that is, in cis) as the genes they control. Transcription regulators bind
to these sequences, which are dispersed throughout genomes, and this binding
puts into motion a series of reactions that ultimately specify which genes are to be
transcribed and at what rate. Approximately 10% of the protein-coding genes of
most organisms are devoted to transcription regulators, making them one of the
largest classes of proteins in the cell. In most cases, a given transcription regulator
recognizes its own cis-regulatory sequence, which is different from those
recognized by all the other regulators in the cell.

Transcription of each gene is, in turn, controlled by its own collection of
cis-regulatory sequences. These typically lie near the gene, often in the intergenic
region directly upstream from the transcription start point of the gene. Although
a few genes are controlled by a single cis-regulatory sequence that is recognized
by a single transcription regulator, the majority have complex arrangements of
cis-regulatory sequences, each of which is recognized by a different transcription
regulator. It is therefore the positions, identity, and arrangement of cis-regulatory
sequences (which are an important part of the information embedded in the
genome) that ultimately determine the time and place that each gene is
transcribed.

We begin our discussion by describing how transcription regulators recognize
cis-regulatory sequences.

The Sequence of Nucleotides in the DNA Double Helix Can Be
Read by Proteins

As discussed in Chapter 4, the DNA in a chromosome consists of a very long
double helix that has both a major and a minor groove (Figure 7-6).
Transcription regulators must recognize short, specific cis-regulatory sequences within

Figure 7-6 Double-helical structure of
DNA. A space-filling model of DNA showing
the major and minor grooves on the outside
of the double helix (see Movie 4.1). The
atoms are colored as follows: carbon, dark
blue; nitrogen, light blue; hydrogen, white;
oxygen, red; phosphorus, yellow.

this structure. When first discovered in the 1960s, it was thought that these
proteins might require direct access to the interior of the double helix to distinguish
between one DNA sequence and another. It is now clear, however, that the outside
of the double helix is studded with DNA sequence information that transcription
regulators recognize: the edge of each base pair presents a distinctive pattern of
hydrogen-bond donors, hydrogen-bond acceptors, and hydrophobic patches
in both the major and minor grooves (Figure 7-7). Because the major groove is
wider and displays more molecular features than does the minor groove, nearly
all transcription regulators make the majority of their contacts with the major
groove, as we shall see.
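
The point that the major groove carries more sequence information than the minor groove can be made concrete with a small sketch. The one-letter encoding below is an illustrative convention and is not taken from the text: A is a hydrogen-bond acceptor, D a hydrogen-bond donor, H a nonpolar hydrogen, and M a methyl group, listed in order across the exposed edge of each base pair as commonly summarized in structural biology texts.

```python
# Illustrative encoding (an assumption, not from the text):
# A = hydrogen-bond acceptor, D = hydrogen-bond donor,
# H = nonpolar hydrogen, M = methyl group.
MAJOR_GROOVE = {
    "A-T": "ADAM",
    "T-A": "MADA",
    "G-C": "AADH",
    "C-G": "HDAA",
}
MINOR_GROOVE = {
    "A-T": "AHA",
    "T-A": "AHA",
    "G-C": "ADA",
    "C-G": "ADA",
}


def distinguishable_classes(groove):
    """Group base pairs that present the same pattern in this groove."""
    classes = {}
    for base_pair, pattern in groove.items():
        classes.setdefault(pattern, []).append(base_pair)
    return list(classes.values())


print("major groove:", distinguishable_classes(MAJOR_GROOVE))
print("minor groove:", distinguishable_classes(MINOR_GROOVE))
```

In this encoding the major groove yields four distinct patterns, one per base-pair configuration, whereas the minor groove yields only two, so a protein reading the minor groove alone could not tell A-T from T-A or G-C from C-G.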

Transcription Regulators Contain Structural Motifs That Can Read 
DNA Sequences 

Molecular recognition in biology generally relies on an exact fit between the
surfaces of two molecules, and the study of transcription regulators has provided
some of the clearest examples of this principle. A transcription regulator
recognizes a specific cis-regulatory sequence because the surface of the protein is
extensively complementary to the special surface features of the double helix that
displays that sequence. Each transcription regulator makes a series of contacts
with the DNA, involving hydrogen bonds, ionic bonds, and hydrophobic
interactions. Although each individual contact is weak, the 20 or so contacts that are
typically formed at the protein-DNA interface add together to ensure that the
interaction is both highly specific and very strong (Figure 7-8). In fact, DNA-protein


Figure 7-7 How the different base
pairs in DNA can be recognized from
their edges without the need to open
the double helix. The four possible
configurations of base pairs are shown,
with potential hydrogen-bond donors
indicated in blue, potential hydrogen-bond
acceptors in red, and hydrogen bonds of
the base pairs themselves as a series of
short, parallel red lines. Methyl groups,
which form hydrophobic protuberances,
are shown in yellow, and hydrogen atoms
that are attached to carbons, and are
therefore unavailable for hydrogen-bonding,
are white. From the major groove, each of
the four base-pair configurations projects
a unique pattern of features. (From
C. Branden and J. Tooze, Introduction
to Protein Structure, 2nd ed. New York:
Garland Publishing, 1999.)

interactions include some of the tightest and most specific molecular interactions
known in biology.

Although each example of protein-DNA recognition is unique in detail, x-ray
crystallographic and nuclear magnetic resonance (NMR) spectroscopic studies
of hundreds of transcription regulators have revealed that many of them contain
one or another of a small set of DNA-binding structural motifs (Panel 7-1). These
motifs generally use either α helices or β sheets to bind to the major groove of
DNA. The amino acid side chains that extend from these protein motifs make the
specific contacts with the DNA. Thus, a given structural motif can be used to
recognize many different cis-regulatory sequences depending on the specific side
chains present.

Dimerization of Transcription Regulators Increases Their Affinity 
and Specificity for DNA 

A monomer of a typical transcription regulator recognizes about 6-8 nucleotide
pairs of DNA. However, sequence-specific DNA-binding proteins do not bind
tightly to a single DNA sequence and reject all others; rather, they recognize a
range of closely related sequences, with the affinity of the protein for the DNA
varying according to how closely the DNA matches the optimal sequence. Hence,
cis-regulatory sequences are often depicted as "logos," which display the range of
sequences recognized by a particular transcription regulator (Figure 7-9A and
B). In Chapter 6, we saw this same representation at work for the binding of RNA
polymerase to promoters (see Figure 6-12).

The DNA sequence recognized by a monomer does not contain sufficient
information to be picked out from the background of such sequences that would
occur at random all over the genome. For example, an exact six-nucleotide DNA
sequence would be expected to occur by chance approximately once every 4096
nucleotides (4^6), and the range of six-nucleotide sequences described by a
typical logo would be expected to occur by chance much more often, perhaps every
1000 nucleotides. Clearly, for a bacterial genome of 4.6 × 10^6 nucleotide pairs,
not to mention a mammalian genome of 3 × 10^9 nucleotide pairs, this is
insufficient information to accurately control the transcription of individual genes.
Additional contributions to DNA-binding specificity must therefore be present.
Many transcription regulators form dimers, with both monomers making nearly
identical contacts with DNA (Figure 7-9C). This arrangement doubles the length
of the cis-regulatory sequence recognized and greatly increases both the affinity
and the specificity of transcription regulator binding. Because the DNA sequence


Figure 7-8 The binding of a transcription
regulator to a specific DNA sequence. On
the left, a single contact is shown between
a transcription regulator and DNA; such
contacts allow the protein to "read" the DNA
sequence. On the right, the complete set of
contacts between a transcription regulator
(a member of the homeodomain family; see
Panel 7-1) and its cis-regulatory sequence
is shown. The DNA-binding portion of the
protein is 60 amino acids long. Although
the interactions in the major groove are
the most important, the protein is also
seen to contact both the minor groove and
phosphates in the sugar-phosphate DNA
backbone. (See C. Wolberger et al., Cell
67:517-528, 1991.)











PANEL 7-1: Common Structural Motifs in Transcription Regulators

Originally identified in bacterial transcription regulators, the helix-turn-helix
motif has since been found in many hundreds of DNA-binding proteins from both
eukaryotes and prokaryotes. It is constructed from two α helices (blue
and red) connected by a short extended chain of amino acids, which
constitutes the "turn." The two helices are held at a fixed angle, primarily
through interactions between the two helices. The more C-terminal helix
(in red) is called the recognition helix because it fits into the major
groove of DNA; its amino acid side chains, which differ from protein to
protein, play an important part in recognizing the specific DNA sequence
to which the protein binds. All of the proteins shown here bind DNA as
dimers in which the two copies of the recognition helix (in red) are
separated by exactly one turn of the DNA helix (3.4 nm); thus both
recognition helices of the dimer can fit into the major groove of DNA.

Not long after the first transcription regulators were discovered in
bacteria, genetic analyses of the fruit fly Drosophila led to the
characterization of an important class of genes, the homeotic selector
genes, that play a critical part in orchestrating fly development (discussed
in Chapter 21). It was later shown that these genes coded for
transcription regulators that bound DNA through a structural motif
named the homeodomain. Two different views of the same structure are
shown. (A) The homeodomain is folded into three α helices, which are
packed tightly together by hydrophobic interactions. The part containing
helices 2 and 3 closely resembles the helix-turn-helix motif. (B) The
recognition helix (helix 3, red) forms important contacts with the major
groove of DNA. The asparagine (Asn) of helix 3, for example, contacts an
adenine, as shown in Figure 7-8. A flexible arm attached to helix 1 forms
contacts with nucleotide pairs in the minor groove.

The leucine zipper motif is so named
because of the way the two α
helices, one from each monomer, are
joined together to form a short
coiled-coil. These proteins bind DNA
as dimers where the two long α
helices are held together by
interactions between hydrophobic
amino acid side chains (often on
leucines) that extend from one side
of each helix. Just beyond the
dimerization interface, the two α
helices separate from each other to
form a Y-shaped structure, which
allows their side chains to contact
the major groove of DNA. The dimer
thus grips the double helix like a
clothespin on a clothesline.

β SHEET DNA RECOGNITION PROTEINS

In the other DNA-binding motifs displayed in this panel, α helices are the primary mechanism used to recognize specific DNA
sequences. In one large group of transcription regulators, however, a two-stranded β sheet, with amino acid side chains
extending from the sheet toward the DNA, reads the information on the surface of the major groove. As in the case of a
recognition α helix, this β-sheet motif can be used to recognize many different DNA sequences; the exact DNA sequence
recognized depends on the sequence of amino acids that make up the β sheet. Shown is a transcription regulator that binds
two molecules of S-adenosyl methionine (red). On the left is a dimer of the protein; on the right is a simplified diagram
showing just the two-stranded β sheet bound to the major groove of DNA.


ZINC FINGER PROTEINS 

This group of DNA-binding motifs includes one
or more zinc atoms as structural components.
All such zinc-coordinated DNA-binding motifs
are called zinc fingers, referring to their
appearance in early schematic drawings (left).
They fall into several distinct structural groups,
only one of which we consider here. It has a
simple structure, in which the zinc atom holds
an α helix and a β sheet together (middle). This
type of zinc finger is often found in clusters
with the α helix of each finger contacting the
major groove of the DNA, forming a nearly
continuous stretch of α helices along that
groove. In this way, a strong and specific
DNA-protein interaction is built up through a
repeating basic structural unit. Three such
fingers are shown on the right.

HELIX-LOOP-HELIX PROTEINS

Related to the leucine zipper, the helix-loop-helix
motif consists of a short α helix connected by a loop
(red) to a second, longer α helix. The flexibility of the
loop allows one helix to fold back and pack against
the other, thereby forming the dimerization surface.

As shown, this two-helix structure binds both to DNA
and to the two-helix structure of a second protein to
create either a homodimer or a heterodimer. Two α
helices that extend from the dimerization interface
make specific contacts with the major groove of DNA.











Chapter 7: Control of Gene Expression



[Figure 7-9C labels: dimer; dimer; heterodimer]


Figure 7-9 Transcription regulators and
cis-regulatory sequences. (A) Depiction
of the cis-regulatory sequence for Nanog,
a homeodomain family member that is
a key regulator in embryonic stem cells.
This "logo" form (see Figure 6-12) shows
that the protein can recognize a collection
of closely related DNA sequences and
gives the preferred nucleotide pair at each
position. Cis-regulatory sequences are
"read" as double-stranded DNA, but only
one strand typically is shown in a logo.
(B) Representation of the cis-regulatory
sequence as a colored box. (C) Many
transcription regulators form dimers
(homodimers) and heterodimers. In the
example shown, three different DNA-binding
specificities are formed from two
transcription regulators.


recognized by the protein has increased from approximately 6 nucleotide pairs
to 12 nucleotide pairs, there are many fewer random occurrences of matching
sequences.
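
The arithmetic behind this argument can be checked with a short sketch. It assumes, as a simplification, that the four nucleotides are equally likely and independent at every position and that only one strand is counted; real genomes are not random, so the numbers are order-of-magnitude guides only. The function name is illustrative.

```python
def expected_random_matches(site_length, genome_size):
    """Expected chance occurrences of one exact sequence of the given length.

    Assumes equally likely, independent nucleotides (a simplification)
    and counts a single strand.
    """
    return genome_size / 4 ** site_length


GENOMES = {"bacterial (4.6e6 bp)": 4.6e6, "mammalian (3e9 bp)": 3e9}

for site_length in (6, 12):
    print(f"{site_length} nucleotides: one match expected every {4 ** site_length} nucleotides")
    for name, size in GENOMES.items():
        print(f"  {name}: about {expected_random_matches(site_length, size):.3g} chance matches")
```

Under these assumptions a six-nucleotide site appears by chance roughly a thousand times in the bacterial genome, whereas a twelve-nucleotide site is expected less than once there, which is why doubling the recognized length through dimerization matters so much.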

Heterodimers are often formed from two different transcription regulators.
Transcription regulators may form heterodimers with more than one partner
protein; in this way, the same transcription regulator can be "reused" to create several
distinct DNA-binding specificities (see Figure 7-9C).


Transcription Regulators Bind Cooperatively to DNA 

In the simplest case, the collection of noncovalent bonds that holds the above
dimers or heterodimers together is so extensive that these structures form
obligatorily, and never fall apart. In this case, the unit of binding is the dimer or
heterodimer, and the binding curve for the transcription regulator (the fraction of DNA
bound as a function of protein concentration) has a standard exponential shape
(Figure 7-10A).

In many cases, however, the dimers and heterodimers are held together very
weakly; they exist predominantly as monomers in solution, and yet dimers are
observed on the appropriate DNA sequence. Here, the proteins are said to bind to
DNA cooperatively, and the curve describing their binding is sigmoidal in shape
(Figure 7-10B). Cooperative binding means that, over a range of concentrations of
the transcription regulator, binding is more of an all-or-none phenomenon than

Figure 7-10 Occupancy of a cis-regulatory
sequence by a transcription
regulator. (A) Noncooperative binding by a
stable heterodimer. (B) Cooperative binding
by components of a heterodimer that are
predominantly monomers in solution. The
shape of the curve differs from that of (A)
because the fraction of protein in a form
competent to bind DNA (the heterodimer)
increases with increasing protein
concentration.

for noncooperative binding; that is, at most protein concentrations, the
cis-regulatory sequence is either nearly empty or nearly fully occupied and rarely is
somewhere in between. A discussion of the mathematics behind cooperative binding is
given in Chapter 8 (see Figure 8-79A).
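
The "all-or-none" character of cooperative binding can be illustrated numerically. The sketch below is not the treatment given in Chapter 8; it assumes a simple Hill-type occupancy function, with a Hill coefficient of 1 standing in for a stable dimer that binds noncooperatively and a coefficient of 2 standing in for two monomers that bind cooperatively. The function names and the 10%-to-90% measure of steepness are illustrative choices.

```python
def occupancy(concentration, k_half, hill_n=1.0):
    """Fraction of sites occupied under an assumed Hill-type model."""
    x = (concentration / k_half) ** hill_n
    return x / (1.0 + x)


def concentration_for(fraction, k_half, hill_n=1.0):
    """Invert the model: concentration that gives the requested occupancy."""
    return k_half * (fraction / (1.0 - fraction)) ** (1.0 / hill_n)


def switch_width(hill_n):
    """Fold change in concentration needed to go from 10% to 90% occupancy."""
    return concentration_for(0.9, 1.0, hill_n) / concentration_for(0.1, 1.0, hill_n)


print(f"noncooperative (n = 1): {switch_width(1):.0f}-fold concentration change")
print(f"cooperative    (n = 2): {switch_width(2):.0f}-fold concentration change")
```

In this model the noncooperative site needs an 81-fold rise in regulator concentration to go from mostly empty to mostly occupied, while the cooperative site needs only a 9-fold rise, so over most concentrations it sits near one extreme or the other.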

Nucleosome Structure Promotes Cooperative Binding of
Transcription Regulators

As we have just seen, cooperative binding of transcription regulators to DNA often
occurs because the monomers have only a weak affinity for each other. However,
there is a second, indirect mechanism for cooperative binding, one that arises
from the nucleosome structure of eukaryotic chromosomes.

In general, transcription regulators bind to DNA in nucleosomes with lower
affinity than they do to naked DNA. There are two reasons for this difference.
First, the surface of the cis-regulatory sequence recognized by the transcription
regulator may be facing inward on the nucleosome, toward the histone core, and
therefore not be readily available to the regulatory protein. Second, even if the
face of the cis-regulatory sequence is exposed on the outside of the nucleosome,
many transcription regulators subtly alter the conformation of the DNA when
they bind, and these changes are generally opposed by the tight wrapping of the
DNA around the histone core. For example, many transcription regulators induce
a bend or kink in the DNA when they bind.

We saw in Chapter 4 that nucleosome remodeling can alter the structure of
the nucleosome, allowing transcription regulators access to the DNA. Even
without remodeling, however, transcription regulators can still gain limited access to
DNA in a nucleosome. The DNA at the end of a nucleosome "breathes," transiently
exposing the DNA and allowing regulators to bind. This breathing happens at a
much lower rate in the middle of the nucleosome; therefore, the positions where
the DNA exits the nucleosome are much easier to occupy (Figure 7-11).

These properties of the nucleosome promote cooperative DNA binding by
transcription regulators. If a regulatory protein enters the DNA of a nucleosome
and prevents the DNA from tightly rewrapping around the nucleosome core, it
will increase the affinity of a second transcription regulator for a nearby
cis-regulatory sequence. If the two transcription regulators also interact with each other
(as described above), the cooperative effect is even greater. In some cases, the
combined action of the regulatory proteins can eventually displace the histone
core of the nucleosome altogether.

Figure 7-11 How nucleosomes affect the binding of transcription regulators.



(A) cis-regulatory sequence: this open form occurs about 1/20th of the time.
(B) Compared to its affinity for naked DNA, a typical transcription regulator
will bind with 20 times lower affinity if its cis-regulatory sequence is
located near the end of a nucleosome.
(C) [panel text only partly recoverable] ... in the middle of a nucleosome.
(D) One transcription regulator can destabilize the nucleosome, facilitating binding of another.

The cooperation among transcription regulators can become much greater
when nucleosome remodeling complexes are involved. If one transcription
regulator binds its cis-regulatory sequence and attracts a chromatin remodeling
complex, the localized action of the remodeling complex can allow a second
transcription regulator to efficiently bind nearby. Moreover, we have discussed
how transcription regulators can work together in pairs; in reality, larger
numbers often cooperate by repeated use of the same principles. A highly cooperative
binding of transcription regulators to DNA probably explains why many sites in
eukaryotic genomes that are bound by transcription regulators are "nucleosome
free."

Summary 

Transcription regulators recognize short stretches of double-helical DNA of defined
sequence called cis-regulatory sequences, and thereby determine which of the
thousands of genes in a cell will be transcribed. Approximately 10% of the
protein-coding genes in most organisms produce transcription regulators, and they control
many features of cells. Although each of these transcription regulators has unique
features, most bind to DNA as homodimers or heterodimers and recognize DNA
through one of a small number of structural motifs. Transcription regulators
typically work in groups and bind DNA cooperatively, a feature that has several
underlying mechanisms, some of which exploit the packaging of DNA in nucleosomes.


TRANSCRIPTION REGULATORS SWITCH GENES ON 
AND OFF 

Having seen how transcription regulators bind to cis-regulatory sequences
embedded in the genome, we can now discuss how, once bound, these proteins influence
the transcription of genes. The situation in bacteria is simpler than in eukaryotes
(for one thing, chromatin structure is not an issue), and we therefore discuss it first.
Following this, we turn to the more complex situation in eukaryotes.


The Tryptophan Repressor Switches Genes Off 

The genome of the bacterium E. coli consists of a single, circular DNA molecule
of about 4.6 × 10^6 nucleotide pairs. This DNA encodes approximately 4300
proteins, although only a fraction of these are made at any one time. Bacteria
regulate the expression of many of their genes according to the food sources that are
available in the environment. For example, in E. coli, five genes code for enzymes
that manufacture the amino acid tryptophan. These genes are arranged in a
cluster on the chromosome and are transcribed from a single promoter as one long
mRNA molecule; such coordinately transcribed clusters are called operons
(Figure 7-12). Although operons are common in bacteria, they are rare in eukaryotes,
where genes are typically transcribed and regulated individually (see Figure 7-3).

When tryptophan concentrations are low, the operon is transcribed; the
resulting mRNA is translated to produce a full set of biosynthetic enzymes, which work
in tandem to synthesize tryptophan from much simpler molecules. When
tryptophan is abundant, however (for example, when the bacterium is in the gut of a
mammal that has just eaten a protein-rich meal), the amino acid is imported into
the cell and shuts down production of the enzymes, which are no longer needed.

Figure 7-12 A cluster of bacterial
genes can be transcribed from a single
promoter. Each of these five genes
encodes a different enzyme, and all of
these enzymes are needed to synthesize
the amino acid tryptophan from simpler
molecules. The genes are transcribed as
a single mRNA molecule, a feature that
allows their expression to be coordinated.
Clusters of genes transcribed as a single
mRNA molecule are common in bacteria.
Each of these clusters is called an operon
because its expression is controlled by a
cis-regulatory sequence called the operator
(green), situated within the promoter. (In
this and subsequent figures, the yellow
blocks in the promoter represent DNA
sequences that bind RNA polymerase; see
Figure 6-12).

We now understand exactly how this repression of the tryptophan operon
comes about. Within the operon's promoter is a cis-regulatory sequence that
is recognized by a transcription regulator. When this regulator binds to this
sequence, it blocks access of RNA polymerase to the promoter, thereby
preventing transcription of the operon (and thus production of the tryptophan-producing
enzymes). The transcription regulator is known as the tryptophan repressor and its
cis-regulatory sequence is called the tryptophan operator. These components are
controlled in a simple way: the repressor can bind to DNA only if it has also bound
several molecules of tryptophan (Figure 7-13).

The tryptophan repressor is an allosteric protein, and the binding of tryptophan
causes a subtle change in its three-dimensional structure so that the protein
can bind to the operator sequence. Whenever the concentration of free tryptophan
in the bacterium drops, tryptophan dissociates from the repressor, the repressor
no longer binds to DNA, and the tryptophan operon is transcribed. The repressor
is thus a simple device that switches production of a set of biosynthetic
enzymes on and off according to the availability of the end product of the
pathway that the enzymes catalyze.

The tryptophan repressor protein itself is always present in the cell. The gene
that encodes it is continuously transcribed at a low level, so that a small
amount of the repressor protein is always being made. Thus the bacterium can
respond very rapidly to a rise or fall in tryptophan concentration.
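
The on/off behavior of the tryptophan operon described above can be summarized
in a few lines of code. The sketch below is a deliberately simplified,
hypothetical model: the function names and the single concentration threshold
are assumptions made for illustration, not measured values, and real repressor
binding is graded rather than all-or-none.

```python
def repressor_is_active(tryptophan_conc: float, threshold: float = 1.0) -> bool:
    """Return True when the repressor has bound tryptophan and can bind the operator.

    `threshold` is a hypothetical concentration in arbitrary units.
    """
    return tryptophan_conc >= threshold


def trp_operon_transcribed(tryptophan_conc: float, threshold: float = 1.0) -> bool:
    """The operon is transcribed only when the repressor is NOT on the operator."""
    return not repressor_is_active(tryptophan_conc, threshold)


if __name__ == "__main__":
    # Low tryptophan: operon on. High tryptophan: operon off.
    for conc in (0.1, 0.5, 2.0, 10.0):
        state = "ON" if trp_operon_transcribed(conc) else "OFF"
        print(f"tryptophan = {conc} (arbitrary units): operon {state}")
```

Because the repressor itself is always present in this model, the output depends
only on the current tryptophan concentration, which mirrors the rapid response
described in the text.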


Repressors Turn Genes Off and Activators Turn Them On 


The tryptophan repressor, as its name suggests, is a transcriptional repressor
protein: in its active form, it switches genes off, or represses them. Some
bacterial transcription regulators do the opposite: they switch genes on, or
activate them. These transcriptional activator proteins work on promoters that,
in contrast to the promoter for the tryptophan operon, are only marginally able
to bind and position RNA polymerase on their own. However, these poorly
functioning promoters can be made fully functional by activator proteins that
bind to nearby cis-regulatory sequences and contact the RNA polymerase to help
it initiate transcription (Figure 7-14).

Figure 7-13 Genes can be switched off by repressor proteins. If the
concentration of tryptophan inside a bacterium is low (left), RNA polymerase
(blue) binds to the promoter and transcribes the five genes of the tryptophan
operon. However, if the concentration of tryptophan is high (right), the
repressor protein (dark green) becomes active and binds to the operator (light
green), where it blocks the binding of RNA polymerase to the promoter. Whenever
the concentration of intracellular tryptophan drops, the repressor falls off the
DNA, allowing the polymerase to again transcribe the operon. Although not shown
in the figure, the repressor is a stable dimer.

Figure 7-14 Genes can be switched on by activator proteins. An activator protein
binds to its cis-regulatory sequence on the DNA and interacts with the RNA
polymerase to help it initiate transcription. Without the activator, the
promoter fails to initiate transcription efficiently. In bacteria, the binding
of the activator to DNA is often controlled by the interaction of a metabolite
or other small molecule (red triangle) with the activator protein. The Lac
operon works in this manner, as we discuss shortly.

DNA-bound activator proteins can increase the rate of transcription initiation
as much as 1000-fold, a value consistent with a relatively weak and nonspecific
interaction between the transcription regulator and RNA polymerase. For example,
a 1000-fold change in the affinity of RNA polymerase for its promoter
corresponds to a change in ΔG of ~18 kJ/mole, which could be accounted for by
just a few weak, noncovalent bonds. Thus, many activator proteins work simply by
providing a few favorable interactions that help to attract RNA polymerase to
the promoter. To provide this assistance, however, the activator protein must be
bound to its cis-regulatory sequence, and this sequence must be positioned, with
respect to the promoter, so that the favorable interactions can occur.
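
The figure of roughly 18 kJ/mole can be checked with the standard relation
between a fold change in an equilibrium (affinity) constant and the associated
free-energy change, ΔG = RT ln(fold change). The sketch below assumes a
temperature of 37°C (310.15 K), which is not stated in the text; the function
name is illustrative only.

```python
import math

R = 8.314462618  # molar gas constant, J/(mol*K)


def free_energy_change_kj(fold_change: float, temp_kelvin: float = 310.15) -> float:
    """Magnitude of the free-energy change (kJ/mol) for a fold change in affinity.

    Uses delta_G = R * T * ln(fold_change); the default temperature (37 C)
    is an assumption, not a value given in the text.
    """
    return R * temp_kelvin * math.log(fold_change) / 1000.0


if __name__ == "__main__":
    dg = free_energy_change_kj(1000.0)
    print(f"1000-fold change in affinity: about {dg:.1f} kJ/mol at 37 C")
```

Under this assumption the result is about 17.8 kJ/mol, consistent with the
approximate value quoted above and comparable to the energy of a few weak
noncovalent bonds.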

Like the tryptophan repressor, activator proteins often have to interact with a
second molecule to be able to bind DNA. For example, the bacterial activator
protein CAP has to bind cyclic AMP (cAMP) before it can bind to DNA. Genes
activated by CAP are switched on in response to an increase in intracellular
cAMP concentration, which rises when glucose, the bacterium's preferred carbon
source, is no longer available; as a result, CAP drives the production of
enzymes that allow the bacterium to digest other sugars.

An Activator and a Repressor Control the Lac Operon 

In many instances, the activity of a single promoter is controlled by several
different transcription regulators. The Lac operon in E. coli, for example, is
controlled by both the Lac repressor and the CAP activator that we just
discussed. The Lac operon encodes proteins required to import and digest the
disaccharide lactose. In the absence of glucose, the bacterium makes cAMP, which
activates CAP to switch on genes that allow the cell to utilize alternative
sources of carbon, including lactose. It would be wasteful, however, for CAP to
induce expression of the Lac operon if lactose itself were not present. Thus the
Lac repressor shuts off the operon in the absence of lactose. This arrangement
enables the control region of the Lac operon to integrate two different signals,
so that the operon is highly expressed only when two conditions are met: glucose
must be absent and lactose must be present (Figure 7-15). This genetic circuit
thus behaves much like a switch that carries out a logic operation in a
computer. When lactose is present AND glucose is absent, the cell executes the
appropriate program: in this case, transcription of the genes that permit the
uptake and utilization of lactose.

Figure 7-15 The Lac operon is controlled by two transcription regulators, the
Lac repressor and CAP. LacZ, the first gene of the operon, encodes the enzyme
β-galactosidase, which breaks down lactose to galactose and glucose. When
lactose is absent, the Lac repressor binds to a cis-regulatory sequence, called
the Lac operator, and shuts off expression of the operon (Movie 7.4). Addition
of lactose increases the intracellular concentration of a related compound,
allolactose; allolactose binds to the Lac repressor, causing it to undergo a
conformational change that releases its grip on the operator DNA (not shown).
When glucose is absent, cyclic AMP (red triangle) is produced by the cell, and
CAP binds to DNA.
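
The logic of the Lac operon can be written out as a small truth table. The
sketch below is a minimal Boolean idealization under stated assumptions: the
function name is hypothetical, and each signal is treated as simply present or
absent, whereas real expression levels vary continuously.

```python
def lac_operon_highly_expressed(glucose_present: bool, lactose_present: bool) -> bool:
    """Idealized Lac operon logic: high expression needs CAP bound and repressor off."""
    # No glucose -> cAMP is made -> CAP binds DNA and activates.
    cap_bound = not glucose_present
    # No lactose -> no allolactose -> Lac repressor stays on the operator.
    repressor_bound = not lactose_present
    return cap_bound and not repressor_bound


if __name__ == "__main__":
    for glucose in (True, False):
        for lactose in (True, False):
            state = "HIGH" if lac_operon_highly_expressed(glucose, lactose) else "low/off"
            print(f"glucose={glucose!s:5}  lactose={lactose!s:5}  ->  expression {state}")
```

Only one of the four input combinations (glucose absent, lactose present) gives
high expression, which is the AND-like behavior described in the text.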

All transcription regulators, whether they are repressors or activators, must be
bound to DNA to exert their effects. In this way, each regulatory protein acts
selectively, controlling only those genes that bear a cis-regulatory sequence
recognized by it. The logic of the Lac operon first attracted the attention of
biologists more than 50 years ago. The way it works was uncovered by a
combination of genetics and biochemistry, providing some of the first insights
into how transcription is controlled in any organism.

DNA Looping Can Occur During Bacterial Gene Regulation

We have seen that transcription activators help RNA polymerase initiate
transcription and repressors hinder it. However, the two types of proteins are
very similar to one another. For example, to occupy their cis-regulatory
sequences, both the tryptophan repressor and the CAP activator protein must bind
a small molecule; moreover, they both recognize their cis-regulatory sequences
using the same structural motif (the helix-turn-helix shown in Panel 7-1).
Indeed, some proteins (for example, the CAP protein) can act as both a repressor
and an activator, depending on the exact placement of their cis-regulatory
sequence relative to the promoter: for some genes, the CAP cis-regulatory
sequence overlaps the promoter, and CAP binding thereby prevents the assembly of
RNA polymerase at the promoter.

Most bacteria have small, compact genomes, and the cis-regulatory sequences that
control the transcription of a gene are typically located very near to the start
point of transcription. But there are some exceptions to this generalization:
cis-regulatory sequences can be located hundreds and even thousands of
nucleotide pairs from the bacterial genes they control (Figure 7-16). In these
cases, the intervening DNA is looped out, allowing a protein bound at a distant
site along the DNA to contact RNA polymerase. Here, the DNA acts as a tether,
enormously increasing the probability that the proteins will collide, compared
with the situation where one protein is bound to DNA and the other is free in
solution. We will see shortly that, although it is the exception in bacteria,
DNA looping occurs in the regulation of nearly every eukaryotic gene.

A possible explanation for this difference is based on evolutionary
considerations. It has been proposed that the compact, simple genetic switches
found in bacteria evolved in response to large population sizes where
competition for growth put selective pressure on bacteria to maintain small
genome sizes. In contrast, there appears to have been little selective pressure
to “streamline” the genomes of multicellular organisms.

Figure 7-16 Transcriptional activation at a distance. (A) The NtrC protein is a
bacterial transcription regulator that activates transcription by directly
contacting RNA polymerase. (B) The interaction of NtrC and RNA polymerase, with
the intervening DNA looped out, can be seen in the electron microscope.
(B, courtesy of Harrison Echols and Sydney Kustu.)

Complex Switches Control Gene Transcription in Eukaryotes

When compared to the situation in bacteria, transcription regulation in
eukaryotes involves many more proteins and much longer stretches of DNA. It
often seems bewilderingly complex. Yet many of the same principles apply. As in
bacteria, the time and place that each gene is to be transcribed is specified by
its cis-regulatory sequences, which are "read" by the transcription regulators
that bind to them. Once bound to DNA, positive transcription regulators
(activators) help RNA polymerase begin transcribing genes, and negative
regulators (repressors) block this from happening. In bacteria, as we have seen,
most of the interactions between DNA-bound transcription regulators and RNA
polymerases (whether they activate or repress transcription) are direct. In
contrast, these interactions are almost always indirect in eukaryotes: many
intermediate proteins, including the histones, act between the DNA-bound
transcription regulator and RNA polymerase. Moreover, in multicellular
organisms, it is common for dozens of transcription regulators to control a
single gene, with cis-regulatory sequences spread over tens of thousands of
nucleotide pairs. DNA looping allows the DNA-bound regulatory proteins to
interact with each other and ultimately with RNA polymerase at the promoter.
Finally, because nearly all of the DNA in eukaryotic organisms is compacted by
nucleosomes and higher-order structures, transcription initiation in eukaryotes
must overcome this inherent block.

In the next sections, we discuss these features of transcription initiation in 
eukaryotes, emphasizing how they provide extra levels of control not found in 
bacteria. 

A Eukaryotic Gene Control Region Consists of a Promoter Plus
Many cis-Regulatory Sequences

In eukaryotes, RNA polymerase II transcribes all the protein-coding genes and
many noncoding RNA genes, as we saw in Chapter 6. This polymerase requires five
general transcription factors (27 subunits in toto; see Table 6-3, p. 311), in
contrast to bacterial RNA polymerase, which needs only a single general
transcription factor (the σ subunit). As we have seen, the stepwise assembly of
the general transcription factors at a eukaryotic promoter provides, in
principle, multiple steps at which the cell can speed up or slow down the rate
of transcription initiation in response to transcription regulators.

Because the many cis-regulatory sequences that control the expression of a
typical gene are often spread over long stretches of DNA, we use the term gene
control region to describe the whole expanse of DNA involved in regulating and
initiating transcription of a eukaryotic gene. This includes the promoter, where
the general transcription factors and the polymerase assemble, plus all of the
cis-regulatory sequences to which transcription regulators bind to control the
rate of the assembly processes at the promoter (Figure 7-17). In animals and
plants, it is not unusual to find the regulatory sequences of a gene dotted over
stretches of DNA as large as 100,000 nucleotide pairs. Some of this DNA is
transcribed (but not translated), and we discuss these long noncoding RNAs
(lncRNAs) later in this chapter. For now, we can regard much of this DNA as
“spacer” sequences that transcription regulators do not directly recognize. It
is important to keep in mind that, like other regions of eukaryotic chromosomes,
most of the DNA in gene control regions is packaged into nucleosomes and
higher-order forms of chromatin, thereby compacting its overall length and
altering its properties.

In this chapter, we shall loosely use the term gene to refer to a segment of DNA
that is transcribed into a functional RNA molecule, one that either codes for a
protein or has a different role in the cell (see Table 6-1, p. 305). However,
the classical view of a gene includes the gene control region as well, since
mutations in it can produce an altered phenotype. Alternative RNA splicing
further complicates the definition of a gene, a point we shall return to later.

In contrast to the small number of general transcription factors, which are
abundant proteins that assemble on the promoters of all genes transcribed by
RNA polymerase II, there are thousands of different transcription regulators
devoted to turning individual genes on and off. In eukaryotes, operons (sets of
genes transcribed as a unit) are rare, and, instead, each gene is regulated
individually. Not surprisingly, the regulation of each gene is different in
detail from that of every other gene, and it is difficult to formulate simple
rules for gene regulation that apply in every case. We can, however, make some
generalizations about how transcription regulators, once bound to gene control
regions on DNA, set in motion the series of events that lead to gene activation
or repression.

Eukaryotic Transcription Regulators Work in Groups 

In bacteria, we saw that proteins such as the tryptophan repressor, the Lac
repressor, and the CAP protein bind to DNA on their own and directly affect RNA
polymerase at the promoter. Eukaryotic transcription regulators, in contrast,
usually assemble in groups at their cis-regulatory sequences. Often two or more
regulators bind cooperatively, as discussed earlier in the chapter. In addition,
a broad class of multisubunit proteins termed coactivators and co-repressors
assemble on DNA with them. Typically, these coactivators and co-repressors do
not recognize specific DNA sequences themselves; they are brought to those
sequences by the transcription regulators. Often the protein-protein
interactions between transcription regulators and between regulators and
coactivators are too weak for them to assemble in solution; however, the
appropriate combination of cis-regulatory sequences can "crystallize" the
assembly of these complexes on DNA (Figure 7-18).

As their names imply, coactivators are typically involved in activating
transcription and co-repressors in repressing it. In the following sections, we
will see that coactivators and co-repressors can act in a variety of different
ways to influence transcription after they have been localized on the genome by
transcription regulators.

As shown in Figure 7-18, an individual transcription regulator can often
participate in more than one type of regulatory complex. A protein might
function, for example, in one case as part of a complex that activates
transcription and in another case as part of a complex that represses
transcription. Thus, individual eukaryotic transcription regulators function as
regulatory parts that are used to build complexes whose function depends on the
final assembly of all of the individual components. Each eukaryotic gene is
therefore regulated by a “committee” of proteins, all of which must be present
to express the gene at its proper level.

Figure 7-17 The gene control region for a typical eukaryotic gene. The promoter
is the DNA sequence where the general transcription factors and the polymerase
assemble (see Figure 6-15). The cis-regulatory sequences are binding sites for
transcription regulators, whose presence on the DNA affects the rate of
transcription initiation. These sequences can be located adjacent to the
promoter, far upstream of it, or even within introns or entirely downstream of
the gene. The broken stretches of DNA signify that the length of DNA between the
cis-regulatory sequences and the start of transcription varies, sometimes
reaching tens of thousands of nucleotide pairs in length. The TATA box is a DNA
recognition sequence for the general transcription factor TFIID. As shown in the
lower panel, DNA looping allows transcription regulators bound at any of these
positions to interact with the proteins that assemble at the promoter. Many
transcription regulators act through Mediator (described in Chapter 6), while
some interact with the general transcription factors and RNA polymerase
directly. Transcription regulators also act by recruiting proteins that alter
the chromatin structure of the promoter (not shown, but discussed below).
Whereas Mediator and the general transcription factors are the same for all RNA
polymerase II-transcribed genes, the transcription regulators and the locations
of their binding sites relative to the promoter differ for each gene.


Figure 7-18 Eukaryotic transcription regulators assemble into complexes on DNA.
(A) Seven transcription regulators are shown. The nature and function of the
complex they form depend on the specific cis-regulatory sequences that seed
their assembly. (B) Some assembled complexes activate gene transcription, while
another represses transcription. Note that the light green and dark green
proteins are shared by both activating and repressing complexes. Proteins that
do not themselves bind DNA but assemble on other DNA-bound transcription
regulators are termed coactivators or co-repressors. In some cases (lower
right), RNA molecules are found in these assemblies. As described later in this
chapter, these RNAs often act as scaffolds to hold a group of proteins together.


Activator Proteins Promote the Assembly of RNA Polymerase at 
the Start Point of Transcription 

The cis-regulatory sequences to which eukaryotic transcription activator
proteins bind were originally called enhancers because their presence “enhanced”
the rate of transcription initiation. It came as a surprise when it was
discovered that these sequences could be found tens of thousands of nucleotide
pairs away from the promoter; as we have seen, DNA looping, which was not widely
appreciated at the time, can now explain this initially puzzling observation.

Once bound to DNA, how do assemblies of activator proteins increase the rate of
transcription initiation? At most genes, several mechanisms work in concert.
Their function is both to attract and position RNA polymerase II at the promoter
and to release it so that transcription can begin.

Some activator proteins bind directly to one or more of the general
transcription factors, accelerating their assembly on a promoter that has been
brought in proximity (through DNA looping) to that activator. Most transcription
activators, however, attract coactivators that then perform the biochemical
tasks needed to initiate transcription. One of the most prevalent coactivators
is the large Mediator protein complex, composed of more than 30 subunits. About
the same size as RNA polymerase itself, Mediator serves as a bridge between
DNA-bound transcription activators, RNA polymerase, and the general
transcription factors, facilitating their assembly at the promoter (see
Figure 7-17).


Eukaryotic Transcription Activators Direct the Modification of Local 
Chromatin Structure 

The eukaryotic general transcription factors and RNA polymerase are unable, on
their own, to assemble on a promoter that is packaged in nucleosomes. Thus, in
addition to directing the assembly of the transcription machinery at the
promoter, eukaryotic transcription activators promote transcription by
triggering changes to the chromatin structure of the promoters, making the
underlying DNA more accessible.

The most important ways of locally altering chromatin are through covalent
histone modifications, nucleosome remodeling, nucleosome removal, and histone
replacement (discussed in Chapter 4). Eukaryotic transcription activators use
all four of these mechanisms: thus they attract coactivators that include
histone modification enzymes, ATP-dependent chromatin remodeling complexes, and
histone chaperones, each of which can alter the chromatin structure of promoters
(Figure 7-19). These local alterations in chromatin structure provide greater
access to DNA, thereby facilitating the assembly of the general transcription
factors at the promoter. In addition, some histone modifications specifically
attract these proteins to the promoter. These mechanisms often work together
during transcription initiation (Figure 7-20). Finally, as discussed earlier in
this chapter, the local chromatin changes directed by one transcriptional
regulator can allow the binding of additional regulators. By repeated use of
this principle, large assemblies of proteins can form on control regions of
genes to regulate their transcription.

The alterations of chromatin structure that occur during transcription
initiation can persist for different lengths of time. In some cases, as soon as
the transcription regulator dissociates from DNA, the chromatin modifications
are rapidly reversed, restoring the gene to its pre-activated state. This rapid
reversal is especially important for genes that the cell must quickly switch on
and off in response to external signals. In other cases, the altered chromatin
structure persists, even after the transcription regulator that directed its
establishment has dissociated from DNA. In principle, this memory can extend
into the next cell generation because, as discussed in Chapter 4, chromatin
structure can be self-renewing (see Figure 4-44). The fact that different
histone modifications persist for different times provides the cell with a
mechanism that makes possible both longer- and shorter-term memory of gene
expression patterns.

A special type of chromatin modification occurs as RNA polymerase II transcribes
through a gene. The histones just ahead of the polymerase can be acetylated by
enzymes carried by the polymerase, removed by histone chaperones, and deposited
behind the moving polymerase. These histones are then rapidly deacetylated and
methylated, also by complexes that are carried by the polymerase, leaving behind
nucleosomes that are especially resistant to transcription. This remarkable
process seems to prevent spurious transcription reinitiation behind a moving
polymerase, which, in essence, must clear a path through chromatin as it
transcribes. Later in this chapter, when we discuss RNA interference, the
potential dangers to the cell of such inappropriate transcription will become
especially obvious. The modification of nucleosomes behind a moving RNA
polymerase also plays an important role in RNA splicing (see p. 323).

Figure 7-19 Eukaryotic transcription activator proteins direct local alterations
in chromatin structure. Nucleosome remodeling, nucleosome removal, histone
replacement, and certain types of histone modifications favor transcription
initiation (see Figure 4-39). These alterations increase the accessibility of
DNA and facilitate the binding of RNA polymerase and the general transcription
factors.

Figure 7-20 Successive histone modifications during transcription initiation. In
this example, taken from the human interferon gene promoter, a transcription
activator binds to DNA packaged into chromatin and attracts a histone acetyl
transferase that acetylates lysine 9 of histone H3 and lysine 8 of histone H4.
Then a histone kinase, also attracted by the transcription activator,
phosphorylates serine 10 of histone H3, but it can only do so after lysine 9 has
been acetylated. This serine modification signals the histone acetyl transferase
to acetylate position K14 of histone H3. Next, the general transcription factor
TFIID and a chromatin remodeling complex bind to the chromatin to promote the
subsequent steps of transcription initiation. TFIID and the remodeling complex
both recognize acetylated histone tails through a bromodomain, a protein domain
specialized to read this particular mark on histones; a bromodomain is carried
in a subunit of each protein complex. The histone acetyl transferase, the
histone kinase, and the chromatin remodeling complex are all coactivators. The
order of events shown applies to a specific promoter; at other genes, the steps
may occur in a different order or individual steps may be omitted altogether.
(Adapted from T. Agalioti, G. Chen and D. Thanos, Cell 111:381-392, 2002. With
permission from Elsevier.)

Transcription Activators Can Promote Transcription by Releasing 
RNA Polymerase from Promoters 







In some cases, transcription initiation requires that a DNA-bound transcription
activator releases RNA polymerase from the promoter so as to allow it to begin
transcribing the gene. In other cases, the RNA polymerase halts after
transcribing about 50 nucleotides of RNA, and further elongation requires a
transcription activator bound behind it (Figure 7-21). These paused polymerases
are common in humans, where a significant fraction of genes that are not being
transcribed have a paused polymerase located just downstream from the promoter.

The release of RNA polymerase can occur in several ways. In some cases, the
activator brings in a chromatin remodeling complex that removes a nucleosome
block to the elongating RNA polymerase. In other cases, the activator
communicates with RNA polymerase (typically through a coactivator), signaling it
to move ahead. Finally, as we saw in Chapter 6, RNA polymerase requires
elongation factors to effectively transcribe through chromatin. In some cases,
the key step in gene activation is the loading of these factors onto RNA
polymerase, which can be directed by DNA-bound transcription activators. Once
loaded, these factors allow the polymerase to move through blocks imposed by
chromatin structure and begin transcribing the gene in earnest. Having RNA
polymerase already poised on a promoter in the beginning stages of transcription
bypasses the step of assembling many components at the promoter, which is often
slow. This mechanism can therefore allow cells to begin transcribing a gene as a
rapid response to an extracellular signal.



Transcription Activators Work Synergistically

Figure 7-21 Transcription activators can act at different steps. In addition to
(A) promoting binding of additional transcription regulators and (B) assembling
RNA polymerase at promoters, transcription activators are often needed (C) to
release already assembled RNA polymerases from promoters or (D) to release RNA
polymerase molecules that become stalled after transcribing about 50 nucleotides
of RNA. The activities shown in Figure 7-19 can affect each of these four steps.

We have seen that complexes of transcription activators and coactivators
assemble cooperatively on DNA. We have also seen that these assemblies can
promote different steps in transcription initiation. In general, where several
factors work together to enhance a reaction rate, the joint effect is not merely
the sum of the enhancements that each factor alone contributes, but the product.
If, for example, factor A lowers the free-energy barrier for a reaction by a
certain amount and thereby speeds up the reaction 100-fold, and factor B, by
acting on another aspect of the reaction, does likewise, then A and B acting in
parallel will lower the barrier by a double amount and speed up the reaction
10,000-fold. Even if A and B work simply by attracting the same protein, the
affinity of that protein for the reaction site increases multiplicatively. Thus,
transcription activators often exhibit transcriptional synergy, where several
DNA-bound activator proteins working together produce a transcription rate that
is much higher than the sum of their transcription rates working alone
(Figure 7-22).
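
The arithmetic behind this multiplicative effect follows from the exponential
dependence of reaction rate on the free-energy barrier: barrier reductions add,
so the corresponding rate enhancements multiply. The sketch below illustrates
this for the 100-fold example; the temperature of 37°C and the function names
are assumptions made for illustration, not values taken from the text.

```python
import math

R = 8.314462618  # molar gas constant, J/(mol*K)
T = 310.15       # assumed temperature (37 C), in kelvin


def barrier_reduction_for(fold_speedup: float) -> float:
    """Free-energy barrier reduction (J/mol) that gives the stated fold speedup."""
    return R * T * math.log(fold_speedup)


def fold_speedup_for(barrier_reduction: float) -> float:
    """Fold increase in rate produced by lowering the barrier by the given amount."""
    return math.exp(barrier_reduction / (R * T))


if __name__ == "__main__":
    ddg_a = barrier_reduction_for(100.0)  # factor A alone: 100-fold
    ddg_b = barrier_reduction_for(100.0)  # factor B alone: 100-fold
    combined = fold_speedup_for(ddg_a + ddg_b)  # barrier reductions add
    print(f"A alone: 100-fold, B alone: 100-fold, together: {combined:,.0f}-fold")
    print("sum of individual enhancements would be only 200-fold")
```

Doubling the barrier reduction squares the rate enhancement, giving the
10,000-fold figure in the text rather than the 200-fold that simple addition
would predict.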

An important point is that a transcription activator protein must be bound 
to DNA to influence transcription of its target gene. And the rate of transcription 
of a gene ultimately depends upon the spectrum of regulatory proteins bound
upstream and downstream of its transcription start site, along with the coactivator 
proteins they bring to DNA. 


Eukaryotic Transcription Repressors Can Inhibit Transcription in 
Several Ways 

Figure 7-22 Transcriptional synergy. This experiment compares the rate of
transcription produced by three experimentally constructed regulatory regions in
a eukaryotic cell and reveals transcriptional synergy, a greater than additive
effect of multiple activators working together. For simplicity, coactivators
have been omitted from the diagram. Such transcriptional synergy is not only
observed between different transcription activators from the same organism; it
is also seen between activator proteins from different eukaryotic species when
they are experimentally introduced into the same cell. This last observation
reflects the high degree of conservation of the machinery responsible for
eukaryotic transcription initiation. (Panel labels: no transcription; 1 unit of
transcription; 2 units of transcription; 100 units of transcription.)

Figure 7-23 Six ways in which eukaryotic repressor proteins can operate.
(A) Activator proteins and repressor proteins compete for binding to the same
regulatory DNA sequence. (B) Both proteins bind DNA, but the repressor prevents
the activator from carrying out its functions. (C) The repressor blocks assembly
of the general transcription factors. (D) The repressor recruits a chromatin
remodeling complex, which returns the nucleosomal state of the promoter region
to its pre-transcriptional form. (E) The repressor attracts a histone
deacetylase to the promoter. As we have seen, histone acetylation can stimulate
transcription initiation (see Figure 7-20), and the repressor simply reverses
this modification. (F) The repressor attracts a histone methyl transferase,
which modifies certain positions on histones by attaching methyl groups; the
methylated histones, in turn, are bound by proteins that maintain the chromatin
in a transcriptionally silent form.

Although the “default” state of eukaryotic DNA packaged into nucleosomes is
resistant to transcription, eukaryotes nonetheless use transcription regulators
to repress the transcription of genes. These transcription repressors can both
depress the rate of transcription below the default value and rapidly shut off
genes that were previously activated. We saw in Chapter 4 that large regions of
the genome can be shut down by the packaging of DNA into especially resistant
forms of chromatin. However, eukaryotic genes are rarely organized along the
genome according to function, and this strategy is not generally applicable for
shutting off a set of genes that work together. Instead, most eukaryotic
repressors work on a gene-by-gene basis. Unlike bacterial repressors, eukaryotic
repressors do not directly compete with the RNA polymerase for access to the
DNA; rather, they use a variety of other mechanisms, some of which are
illustrated in Figure 7-23. Although all of these mechanisms ultimately block
transcription by RNA polymerase, eukaryotic transcription repressors typically
act by bringing co-repressors to DNA. Like
transcription activation, transcription repression can act through more than one 
mechanism at a given target gene, thereby ensuring especially efficient repres¬ 
sion. 

Gene repression is especially important to animals and plants whose growth
depends on elaborate and complex developmental programs. Misexpression of a
single gene at a critical time can have disastrous consequences for the individual.
For this reason, many of the genes encoding the most important developmental
regulatory proteins are kept tightly repressed when they are not needed.

Figure 7-24 Schematic diagram summarizing the properties of insulators and barrier sequences. (A) Insulators
directionally block the action of cis-regulatory sequences, whereas barrier sequences prevent the spread of heterochromatin.
How barrier sequences likely function is depicted in Figure 4-41. (B) Insulator-binding proteins (purple) hold chromatin in loops,
thereby favoring "correct" cis-regulatory sequence-gene associations. Thus, gene B is properly regulated, and gene B's
cis-regulatory sequences are prevented from influencing the transcription of gene A.


Insulator DNA Sequences Prevent Eukaryotic Transcription 
Regulators from Influencing Distant Genes 

We have seen that all genes have control regions, which dictate at which times,
under what conditions, and in what tissues the gene will be expressed. We have
also seen that eukaryotic transcription regulators can act across very long stretches
of DNA, with the intervening DNA looped out. How, then, are control regions of
different genes kept from interfering with one another? For example, what keeps
a transcription regulator bound on the control region of one gene from looping in
the wrong direction and inappropriately influencing the transcription of an
adjacent gene?

To avoid such cross-talk, several types of DNA elements compartmentalize
the genome into discrete regulatory domains. In Chapter 4, we discussed barrier
sequences that prevent the spread of heterochromatin into genes that need to be
expressed. A second type of DNA element, called an insulator, prevents
cis-regulatory sequences from running amok and activating inappropriate genes
(Figure 7-24). Insulators function by forming loops of chromatin, an effect mediated
by specialized proteins that bind them (see Figures 4-48 and 7-24B). The loops hold
a gene and its control region in rough proximity and help to prevent the control
region from "spilling over" to adjacent genes. Importantly, these loops can be
different in different cell types, depending on the particular proteins and
chromatin structures that are present.

The distribution of insulators and barrier sequences in a genome is thought to
divide it into independent domains of gene regulation and chromatin structure
(see pp. 207-208). Aspects of this organization can be visualized by staining whole
chromosomes for the specialized proteins that bind these DNA elements (Figure
7-25).

Figure 7-25 Localization of a Drosophila insulator-binding protein on polytene chromosomes.
A polytene chromosome (discussed in Chapter 4) was stained with propidium iodide (red) to
show its banding patterns, with bands appearing bright red and interbands as dark gaps in the
pattern (top). The positions on this polytene chromosome that are bound by a particular insulator
protein are stained bright green using antibodies directed against the protein (bottom). This protein
is preferentially localized to interband regions, reflecting its role in organizing chromosomes into
structural, as well as functional, domains. For convenience, these two micrographs of the same
polytene chromosome are arranged as mirror images. (Courtesy of Uli Laemmli, from K. Zhao et al.,
Cell 81:879-889, 1995. With permission from Elsevier.)

Although chromosomes are organized into orderly domains that discourage
control regions from acting indiscriminately, there are special circumstances
where a control region located on one chromosome has been found to activate
a gene located on a different chromosome. Although there is much we do not
understand about this mechanism, it indicates the extreme versatility of
transcriptional regulation strategies.

Summary 

Transcription regulators switch the transcription of individual genes on and off in
cells. In prokaryotes, these proteins typically bind to specific DNA sequences close
to the RNA polymerase start site and, depending on the nature of the regulatory
protein and the precise location of its binding site relative to the start site, either
activate or repress transcription of the gene. The flexibility of the DNA helix,
however, also allows proteins bound at distant sites to affect the RNA polymerase at
the promoter by the looping out of the intervening DNA. The regulation of higher
eukaryotic genes is much more complex, commensurate with a larger genome size
and the large variety of cell types that are formed. A single eukaryotic gene is
typically controlled by many transcription regulators bound to sequences that can
be tens or even hundreds of thousands of nucleotide pairs from the promoter that
directs transcription of the gene. Eukaryotic activators and repressors act by a wide
variety of mechanisms, generally altering chromatin structure and controlling the
assembly of the general transcription factors and RNA polymerase at the promoter.
They do this by attracting coactivators and co-repressors, protein complexes that
perform the necessary biochemical reactions. The time and place that each gene
is transcribed, as well as its rates of transcription under different conditions, are
determined by the particular spectrum of transcription regulators that bind to the
regulatory region of the gene.

MOLECULAR GENETIC MECHANISMS THAT CREATE 
AND MAINTAIN SPECIALIZED CELL TYPES 

Although all cells must be able to switch genes on and off in response to changes
in their environments, the cells of multicellular organisms have evolved this
capacity to an extreme degree. In particular, once a cell in a multicellular
organism becomes committed to differentiate into a specific cell type, the cell
maintains this choice through many subsequent cell generations, which means that
it remembers the changes in gene expression involved in the choice. This
phenomenon of cell memory is a prerequisite for the creation of organized tissues and
for the maintenance of stably differentiated cell types. In contrast, other changes
in gene expression in eukaryotes, as well as most such changes in bacteria, are
only transient. The tryptophan repressor, for example, switches off the tryptophan
genes in bacteria only in the presence of tryptophan; as soon as tryptophan is
removed from the medium, the genes are switched back on, and the descendants
of the cell will have no memory that their ancestors had been exposed to
tryptophan.

In this section, we shall examine not only cell memory mechanisms, but
also how gene regulatory devices can be combined to create the "logic circuits"
through which cells integrate signals and remember events in their past. We begin
by considering one such complex gene control region in detail.

Complex Genetic Switches That Regulate Drosophila 
Development Are Built Up from Smaller Molecules

We have seen that transcription regulators can be positioned at multiple sites
along long stretches of DNA and that these proteins can bring into play
coactivators and co-repressors. Here, we discuss how the numerous transcription
regulators that are bound to the control region of a gene can cause the gene to be
transcribed at the proper place and time.

Figure 7-26 The nonuniform distribution of transcription regulators in an early
Drosophila embryo. At this stage, the embryo is a syncytium; that is, multiple
nuclei are contained in a common cytoplasm. Although not shown in these
drawings, all of these proteins are concentrated in the nuclei. How such
differences are established is discussed in Chapter 21.


Consider the Drosophila Even-skipped (Eve) gene, whose expression plays an
important part in the development of the Drosophila embryo. If this gene is
inactivated by mutation, many parts of the embryo fail to form, and the embryo dies
early in development. As discussed in Chapter 21, at the stage of development
when Eve begins to be expressed, the embryo is a single giant cell containing
multiple nuclei in a common cytoplasm. This cytoplasm contains a mixture of
transcription regulators that are distributed unevenly along the length of the embryo,
thus providing positional information that distinguishes one part of the embryo
from another (Figure 7-26). Although the nuclei are initially identical, they
rapidly begin to express different genes because they are exposed to different
transcription regulators. For example, the nuclei near the anterior end of the
developing embryo are exposed to a set of transcription regulators that is distinct from
the set that influences nuclei at the middle or at the posterior end of the embryo.

The regulatory DNA sequences that control the Eve gene have evolved to
"read" the concentrations of transcription regulators at each position along the
length of the embryo, and they cause the Eve gene to be expressed in seven
precisely positioned stripes, each initially five to six nuclei wide (Figure 7-27). How is
this remarkable feat of information processing carried out? Although there is still
much to learn, several general principles have emerged from studies of Eve and
other genes that are similarly regulated.

The regulatory region of the Eve gene is very large (approximately 20,000
nucleotide pairs). It is formed from a series of relatively simple regulatory
modules, each of which contains multiple cis-regulatory sequences and is responsible
for specifying a particular stripe of Eve expression along the embryo. This modular
organization of the Eve gene control region was revealed by experiments in which
a particular regulatory module (say, that specifying stripe 2) is removed from its
normal setting upstream of the Eve gene, placed in front of a reporter gene, and
reintroduced into the Drosophila genome. When developing embryos derived
from flies carrying this genetic construct are examined, the reporter gene is found
to be expressed in precisely the position of stripe 2 (Figure 7-28). Similar
experiments reveal the existence of other regulatory modules, each of which specifies
other stripes.


Figure 7-27 The seven stripes of the protein encoded by the Even-skipped (Eve)
gene in a developing Drosophila embryo. Two and one-half
hours after fertilization, the egg was fixed and stained with antibodies that
recognize the Eve protein (green) and antibodies that recognize the Giant
protein (red). Where Eve and Giant proteins are both present, the staining
appears yellow. At this stage in development, the egg contains approximately
4000 nuclei. The Eve and Giant proteins are both located in the nuclei, and
the Eve stripes are about four nuclei wide. The pattern for the Giant protein is
also shown in Figure 7-26. (Courtesy of Michael Levine.)

Figure 7-28 Experiment demonstrating the modular construction of the Eve gene regulatory region. (A) A 480-nucleotide-pair
section of the Eve regulatory region was removed and (B) inserted upstream of a test promoter that directs the synthesis of
the enzyme β-galactosidase (the product of the E. coli LacZ gene; see Figure 7-15). (C, D) When this artificial construct was
reintroduced into the genome of Drosophila embryos, the embryos (D) expressed β-galactosidase (detectable by histochemical
staining) precisely in the position of the second of the seven Eve stripes (C). β-Galactosidase is simple to detect and thus
provides a convenient way to monitor the expression specified by a gene control region. As used here, β-galactosidase is said
to serve as a reporter, since it "reports" the activity of a gene control region. (C and D, courtesy of Stephen Small and Michael
Levine.)


The Drosophila Eve Gene Is Regulated by Combinatorial Controls 

A detailed study of the stripe 2 regulatory module has provided insights into how
it reads and interprets positional information. The module contains recognition
sequences for two transcription regulators (Bicoid and Hunchback) that
activate Eve transcription and for two transcription regulators (Krüppel and Giant)
that repress it (Figure 7-29). The relative concentrations of these four proteins
determine whether the protein complexes that form at the stripe 2 module
activate transcription of the Eve gene. Figure 7-30 shows the distributions of the four
transcription regulators across the region of a Drosophila embryo where stripe 2
forms. It is thought that either of the two repressor proteins, when bound to the
DNA, will turn off the stripe 2 module, whereas both Bicoid and Hunchback must
bind for this module's maximal activation. This simple regulatory scheme suffices
to turn on the stripe 2 module (and therefore the expression of the Eve gene) only
in those nuclei located where the levels of both Bicoid and Hunchback are high
and both Krüppel and Giant are absent, a combination that occurs in only one
region of the early embryo. It is not known exactly how these four transcription
regulators interact with coactivators and co-repressors to specify the final level of
transcription across the stripe, but the outcome very likely relies on competition
between activators and repressors that act by the mechanisms outlined in Figures
7-17, 7-19, and 7-23.
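
The regulatory scheme described above can be read as a simple logic rule: the module is on only where both activators are present and neither repressor is. The sketch below is a deliberately crude Boolean caricature of that rule, not a model of the real embryo. The function name, the 0-to-1 concentration scale, the 0.5 threshold, and the five sample positions are all hypothetical values invented for illustration.

```python
def stripe2_module_on(bicoid, hunchback, kruppel, giant, threshold=0.5):
    """Toy Boolean reading of the stripe 2 module.

    Levels are on an arbitrary 0-to-1 scale; a regulator counts as
    "present" when its level reaches the (assumed) threshold.
    """
    both_activators_present = bicoid >= threshold and hunchback >= threshold
    any_repressor_present = kruppel >= threshold or giant >= threshold
    return both_activators_present and not any_repressor_present

# Hypothetical regulator levels at five positions, anterior to posterior.
positions = [
    {"bicoid": 0.9, "hunchback": 0.9, "kruppel": 0.0, "giant": 0.8},  # Giant present
    {"bicoid": 0.8, "hunchback": 0.9, "kruppel": 0.0, "giant": 0.1},  # all conditions met
    {"bicoid": 0.6, "hunchback": 0.7, "kruppel": 0.7, "giant": 0.0},  # Krüppel present
    {"bicoid": 0.3, "hunchback": 0.4, "kruppel": 0.9, "giant": 0.0},  # activators too low
    {"bicoid": 0.1, "hunchback": 0.1, "kruppel": 0.2, "giant": 0.6},  # activators too low
]

pattern = [stripe2_module_on(**levels) for levels in positions]
print(pattern)
```

In this toy input only the second position satisfies the rule, giving a single "stripe." Real enhancers respond to graded concentrations and competition for overlapping binding sites, which a hard threshold does not capture.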

The stripe 2 element is autonomous, inasmuch as it specifies stripe 2 when
isolated from its normal context (see Figure 7-28). The other stripe regulatory
modules are thought to be constructed similarly, reading positional information
provided by other combinations of transcription regulators. The entire Eve gene
control region binds more than 20 different transcription regulators. Seven
combinations of regulators (one combination for each stripe) specify Eve
expression, while many other combinations (all those found in the interstripe regions of
the embryo) keep the stripe elements silent. A large and complex control region is
thereby built from a series of smaller modules, each of which consists of a unique
arrangement of short cis-regulatory sequences recognized by specific
transcription regulators.

Figure 7-29 The Eve stripe 2 unit. The segment of the Eve gene control region
identified in Figure 7-28 contains cis-regulatory sequences for four transcription
regulators. It is known from genetic experiments that these four regulatory
proteins are responsible for the proper expression of Eve in stripe 2. Flies that
are deficient in the two gene activators Bicoid and Hunchback, for example, fail
to efficiently express Eve in stripe 2. In flies deficient in either of the two gene
repressors, Giant and Krüppel, stripe 2 expands and covers an abnormally broad
region of the embryo. As indicated, in some cases the binding sites for the
transcription regulators overlap, and the proteins can compete for binding to the
DNA. For example, binding of Krüppel and binding of Bicoid to the site at the far
right is mutually exclusive.

The Eve gene itself encodes a transcription regulator, which, after its pattern of
expression is set up in seven stripes, controls the expression of other Drosophila
genes. As development proceeds, the embryo is thus subdivided into finer and
finer regions that eventually give rise to the different body parts of the adult fly, as
discussed in Chapter 21.

Eve exemplifies the complex control regions found in plants and animals. As
this example shows, control regions can respond to many different inputs,
integrate this information, and produce a complex spatial and temporal output as
development proceeds. However, exactly how all these mechanisms work together
to produce the final output is understood only in broad outline (Figure 7-31).

Transcription Regulators Are Brought Into Play by Extracellular
Signals

The above example from Drosophila clearly illustrates the power of combinatorial
control, but this case is unusual in that the nuclei are exposed directly to
positional cues in the form of concentrations of transcription regulators. In embryos
of most other organisms and in all adults, individual nuclei are in separate cells,
and extracellular information (including positional cues) must be passed across
the plasma membrane so as to generate signals in the cytosol that cause different
transcription regulators to become active in different cell types. Some of the
different mechanisms that are known to be used to activate transcription regulators
are diagrammed in Figure 7-32, and in Chapter 15, we discuss how extracellular
signals trigger these changes.

Figure 7-30 Distribution of the transcription regulators responsible
for ensuring that Eve is expressed in stripe 2. The distributions of these proteins
were visualized by staining a developing Drosophila embryo with antibodies directed
against each of the four proteins. The expression of Eve in stripe 2 occurs only
at the position where the two activators (Bicoid and Hunchback) are present and
the two repressors (Giant and Krüppel) are absent. In fly embryos that lack Krüppel,
for example, stripe 2 expands posteriorly. Likewise, stripe 2 expands posteriorly if the
DNA-binding sites for Krüppel in the stripe 2 module are inactivated by mutation (see
also Figures 7-26 and 7-27).

Figure 7-31 The integration of multiple inputs at a promoter. Multiple sets of
transcription regulators, coactivators, and co-repressors can work together
to influence transcription initiation at a promoter, as they do in the Eve stripe
2 module illustrated in Figure 7-29. It is not yet understood in detail how the cell
achieves integration of multiple inputs, but it is likely that the final transcriptional
activity of the gene results from a competition between activators and repressors
that act by the mechanisms summarized in Figures 7-17, 7-19, and 7-23.

Figure 7-32 Some ways in which the activity of transcription regulators is
controlled inside eukaryotic cells. (A) The protein is synthesized only when needed
and is rapidly degraded by proteolysis so that it does not accumulate. (B) Activation
by ligand binding. (C) Activation by covalent modification. Phosphorylation is
shown here, but many other modifications are possible (see Table 3-3, p. 165).
(D) Formation of a complex between a DNA-binding protein and a separate protein
with a transcription-activating domain. (E) Unmasking of an activation domain by
the phosphorylation of an inhibitor protein. (F) Stimulation of nuclear entry by removal
of an inhibitory protein that otherwise keeps the regulatory protein from entering
the nucleus. (G) Release of a transcription regulator from a membrane bilayer by
regulated proteolysis.

We have seen that transcription regulators can act in combination to control the
expression of an individual gene. It is also generally true that each transcription
regulator in an organism contributes to the control of many genes. This point is
illustrated schematically in Figure 7-33, which shows how combinatorial gene
control makes it possible to generate a great deal of biological complexity even
with relatively few transcription regulators.

Due to combinatorial control, a given transcription regulator does not
necessarily have a single, simply definable function as commander of a particular
battery of genes or specifier of a particular cell type. Rather, transcription regulators
can be likened to the words of a language: they are used with different meanings
in a variety of contexts and rarely alone; it is the well-chosen combination that
conveys the information that specifies a gene regulatory event.

Combinatorial gene control causes the effect of adding a new transcription
regulator to a cell to depend on that cell's past history, since it is this history that
determines which transcription regulators are already present. Thus, during
development, a cell can accumulate a series of transcription regulators that need
not initially alter gene expression. The addition of the final members of the
requisite combination of transcription regulators will complete the regulatory
message, and can lead to large changes in gene expression.
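
The stepwise accumulation of regulators can be sketched as a small simulation in the spirit of the idealized scheme of Figure 7-33: at each division the two daughters add different regulators, and every regulator, once made, is inherited by all descendants. The function name and the specific pairing of regulators with divisions below are assumptions made for illustration. In particular, the sketch assumes that the first division adds regulator 1 to only one of the two daughters.

```python
def differentiate(regulator_pairs):
    """Return the regulator set of each final cell, ordered left to right.

    regulator_pairs holds one (left, right) pair per round of division;
    None means that daughter adds no new regulator in that round.
    """
    cells = [frozenset()]  # the embryonic cell starts with no regulators
    for left_reg, right_reg in regulator_pairs:
        next_cells = []
        for cell in cells:
            for new_reg in (left_reg, right_reg):
                # A daughter keeps everything its parent made (cell memory)
                # and may add one new regulator of its own.
                added = {new_reg} if new_reg is not None else set()
                next_cells.append(cell | added)
        cells = next_cells
    return cells

# Assumed pairing: round 1 adds regulator 1 to the right daughter only;
# round 2 adds 2 (left) or 3 (right); round 3 adds 4 (left) or 5 (right).
pairs = [(None, 1), (2, 3), (4, 5)]
final_cells = differentiate(pairs)

for cell in final_cells:
    print(sorted(cell))
```

Under these assumptions, five regulators and three rounds of division yield eight final cells, each with a different combination of regulators.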

The importance of combinations of transcription regulators for the specification
of cell types is most easily demonstrated by their ability, when expressed
artificially, to convert one type of cell to another. Thus, the artificial expression
of three neuron-specific transcription regulators in liver cells can convert the liver
cells into functional nerve cells (Figure 7-34). In some cases, expression of even a
single transcription regulator is sufficient to convert one cell type to another. For
example, when the gene encoding the transcription regulator MyoD is artificially
introduced into fibroblasts cultured from skin connective tissue, the fibroblasts
form muscle-like cells. As discussed in Chapter 22, fibroblasts, which are derived
from the same broad class of embryonic cells as muscle cells, have already
accumulated many of the other necessary transcription regulators required for the
combinatorial control of the muscle-specific genes, and the addition of MyoD
completes the unique combination required to direct the cells to become muscle.
An even more striking example is seen by artificially expressing, early in
development, a single Drosophila transcription regulator (Eyeless) in groups of cells
that would normally go on to form leg parts. Here, this abnormal gene expression
change causes eye-like structures to develop in the legs (Figure 7-35).

Figure 7-33 The importance of combinatorial gene control for
development. Combinations of a few transcription regulators can generate many
cell types during development. In this simple, idealized scheme, a "decision" to
make one of a pair of different transcription regulators (shown as numbered circles)
is made after each cell division. Sensing its relative position in the embryo, the
daughter cell toward the left side of the embryo is always induced to synthesize the
even-numbered protein of each pair, while the daughter cell toward the right side of
the embryo is induced to synthesize the odd-numbered protein. The production of
each transcription regulator is assumed to be self-perpetuating once it has become
initiated (see Figure 7-39). In this way, through cell memory, the final combinatorial
specification is built up step by step. In this purely hypothetical example, five different
transcription regulators have created eight final cell types (G-N).

Figure 7-34 A small set of transcription regulators can convert one differentiated cell type
into another. In this experiment, (A) liver cells grown in culture were converted into (B) neuronal
cells via the artificial expression of three nerve-specific transcription regulators. Both types of cells
express an artificial red fluorescent protein, which is used to visualize them. This conversion involves
the activation of many nerve-specific genes as well as the repression of many liver-specific genes.
Scale bars, 50 μm.
(From S. Marro et al., Cell Stem Cell 9:374-382, 2011. With permission from Elsevier.)

Specialized Cell Types Can Be Experimentally Reprogrammed to 
Become Pluripotent Stem Cells 

Manipulation of transcription regulators can also coax various differentiated
cells to de-differentiate into pluripotent stem cells that are capable of giving rise
to the different cell types in the body, much like the embryonic stem (ES) cells
discussed in Chapter 22. When three specific transcription regulators are
artificially expressed in cultured mouse fibroblasts, a number of cells become induced
pluripotent stem cells (iPS cells), cells that look and behave like the
pluripotent ES cells that are derived from embryos (Figure 7-36). This approach has been
adapted to produce iPS cells from a variety of specialized cell types, including
cells taken from humans. Such human iPS cells can then be directed to generate
a population of differentiated cells for use in the study or treatment of disease, as
we discuss in Chapter 22.

Although it was once thought that cell differentiation was irreversible, it is now
clear that by manipulating combinations of master transcription regulators, cell
types and differentiation pathways can be readily altered.

Combinations of Master Transcription Regulators Specify Cell 
Types by Controlling the Expression of Many Genes 

As we saw in the introduction of this chapter, different cell types of multicellular
organisms differ enormously in the proteins and RNAs they express. For example,
only muscle cells express special types of actin and myosin that form the contractile
apparatus, while nerve cells must make and assemble all the proteins needed to
form dendrites and synapses. We have seen that these patterns of
cell-type-specific expression are orchestrated by a combination of master transcription
regulators. In many cases, these proteins bind directly to cis-regulatory sequences of
the genes particular to that cell type. Thus, MyoD binds directly to cis-regulatory
sequences located in the control regions of the muscle-specific genes. In other
cases, the master regulators control the expression of "downstream" transcription
regulators which, in turn, bind to the control regions of other cell-type-specific
genes and control their synthesis.

Figure 7-35 Expression of the Drosophila Eyeless gene in precursor
cells of the leg triggers the development of an eye on the leg. (A) Simplified
diagrams showing the result when a fruit fly larva contains either the normally expressed
Eyeless gene (left) or an Eyeless gene that is additionally expressed artificially in cells
that normally give rise to leg tissue (right). (B) Photograph of an abnormal leg that
contains a misplaced eye (see also Figure 21-2). The transcription regulator was
named Eyeless because its inactivation in otherwise normal flies causes the loss of
eyes. (B, courtesy of Walter Gehring.)

Figure 7-36 A combination of transcription regulators can induce a
differentiated cell to de-differentiate into a pluripotent cell. The artificial expression
of a set of three genes, each of which encodes a transcription regulator, can
reprogram a fibroblast into a pluripotent cell with embryonic stem (ES)-cell-like
properties. Like ES cells, such induced pluripotent stem (iPS) cells can proliferate
indefinitely in culture and can be stimulated by appropriate extracellular signal
molecules to differentiate into almost any cell type found in the body. Transcription
regulators such as Oct4, Sox2, and Klf4 are often called master transcription regulators
because their expression is sufficient to trigger a change in cell identity.

Figure 7-37 A portion of the transcription network specifying
embryonic stem cells. (A) The three master transcription regulators in Figure
7-36 are shown as large circles. Genes whose cis-regulatory sequences are bound
by each regulator in embryonic stem cells are indicated by a small dot (representing
the gene) connected by a thin line (representing the binding reaction). Note
that many of the target genes are bound by more than one of the regulators.
(B) The master regulators control their own expression. As shown here, the three
transcriptional regulators bind to their own control regions (indicated by feedback
loops), as well as those of the other master regulators (indicated by straight
arrows). (Courtesy of Trevor Sorrells, based on data from J. Kim et al.,
Cell 132:1049-1061, 2008.)

The specification of a particular cell type typically involves changes in the
expression of several thousand genes. Genes whose protein products are required
in the cell type are expressed at high levels, while those not needed are typically
down-regulated. As might be imagined, the pattern of binding between the
master regulators and all of the regulated genes can be extremely elaborate (Figure
7-37). When we consider that many of these regulated genes have control regions
that span tens of thousands of nucleotide pairs, commensurate with the Eve
example discussed above, we can begin to appreciate the enormous complexity
of cell-type specification.

An outstanding question in biology is how the information in a genome is used
to specify a multicellular organism. Although we have the general outline of the
answer, we are far from understanding how a single cell type is completely
specified, let alone a whole organism.

Specialized Cells Must Rapidly Turn Sets of Genes On and Off 

Although they generally maintain their identities, specialized cells must
constantly respond to changes in their environment. Among the most important
changes are signals from other cells that coordinate the behavior of the whole
organism. Many of these signals induce transient changes in gene transcription,
and we discuss the nature of these signals in detail in Chapter 15. Here, we
consider how specialized cell types rapidly and decisively switch groups of genes
on and off in response to their environment. Even though control of gene expression
is combinatorial, the effect of a single transcription regulator can still be decisive
in switching any particular gene on or off, simply by completing the combination
needed to maximally activate or repress that gene. This situation is analogous to
dialing in the final number of a combination lock: the lock will spring open with
only this simple addition if all of the other numbers have been previously entered.
Moreover, the same number can complete the combination for many different
locks. Likewise, the addition of a particular protein can turn on many different
genes.
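
The combination-lock analogy can be made concrete with a small sketch. The
regulator names (R1 to R4 and X) and the three genes below are hypothetical,
invented only for illustration, and the all-or-nothing rule is a deliberate
simplification of real, graded transcriptional responses. Each gene is treated
as "on" only when every regulator in its required combination is present.

```python
# Hypothetical requirements: each gene needs its full set of regulators.
REQUIRED = {
    "gene_1": {"R1", "R2", "X"},
    "gene_2": {"R3", "X"},
    "gene_3": {"R1", "R4"},
}

def genes_on(present):
    """Return the genes whose complete combination of regulators is present."""
    present = set(present)
    return sorted(g for g, needed in REQUIRED.items() if needed <= present)

cell_regulators = {"R1", "R2", "R3"}         # regulators already in this cell
before = genes_on(cell_regulators)           # no combination is complete yet
after = genes_on(cell_regulators | {"X"})    # X is the "final number"

print(before)  # []
print(after)   # ['gene_1', 'gene_2']
```

Adding the single regulator X completes two different combinations at once,
while gene_3 stays off because its own combination (which needs R4) is still
incomplete.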

An example is the rapid control of gene expression by the human glucocorticoid
receptor protein. To bind to its cis-regulatory sequences in the genome, this
transcription regulator must first form a complex with a molecule of a
glucocorticoid steroid hormone, such as cortisol (see Figure 15-64). The body
releases this hormone during times of starvation and intense physical activity,
and among its other activities, it stimulates liver cells to increase the
production of glucose from amino acids and other small molecules. To respond in
this way, liver cells increase the expression of many different genes that code
for metabolic enzymes, such as tyrosine aminotransferase, as we discussed earlier
in this chapter (see Figure 7-3). Although these genes all have different and
complex control regions, their maximal expression depends on the binding of the
hormone-glucocorticoid receptor complex to its cis-regulatory sequence, which is
present in the control region of each gene. When the body has recovered and the
hormone is no longer present, the expression of each of these genes drops to its
normal level in the liver. In this way, a single transcription regulator can
rapidly control the expression of many different genes (Figure 7-38).

The effects of the glucocorticoid receptor are not confined to cells of the liver.
In other cell types, activation of this transcription regulator by hormone also
causes changes in the expression levels of many genes; the genes affected,
however, are usually different from those affected in liver cells. As we have
seen, each cell type has an individualized set of transcription regulators, and
because of combinatorial control, these critically influence the action of the
glucocorticoid receptor. Because the receptor is able to assemble with many
different sets of cell-type-specific transcription regulators, switching it on
with hormone produces a different spectrum of effects in each cell type.


Differentiated Cells Maintain Their Identity 

Once a cell has become differentiated into a particular cell type, it will generally
remain differentiated, and all its progeny cells will remain that same cell type.
Some highly specialized cells, including skeletal muscle cells and neurons, never
divide again once they have differentiated; that is, they are terminally
differentiated (as discussed in Chapter 17). But many other differentiated cells,
such as fibroblasts, smooth muscle cells, and liver cells, will divide many times
in the life of an individual. When they do, these specialized cell types give rise
only to cells like themselves: smooth muscle cells do not give rise to liver cells,
nor liver cells to fibroblasts.

Figure 7-38 A single transcription regulator can coordinate the expression
of many different genes. The action of the glucocorticoid receptor is illustrated
schematically. On the left is a series of genes, each of which has various
transcription regulators bound to its regulatory region. However, these bound
proteins are not sufficient on their own to fully activate transcription. On the
right is shown the effect of adding an additional transcription regulator (the
glucocorticoid receptor in a complex with glucocorticoid hormone) that has a
cis-regulatory sequence in the control region of each gene. The glucocorticoid
receptor completes the combination of transcription regulators required for
maximal initiation of transcription, and the genes are now switched on as a set.
When the hormone is no longer present, the glucocorticoid receptor dissociates
from DNA and the genes return to their pre-stimulated levels.

For a proliferating cell to maintain its identity (a property called cell
memory), the patterns of gene expression responsible for that identity must be
remembered and passed on to its daughter cells through subsequent cell divisions.
Thus, in the model we discussed in Figure 7-33, the production of each
transcription regulator, once begun, has to be continued in the daughter cells of
each cell division. How is such perpetuation accomplished?

Cells have several ways of ensuring that their daughters “remember” what
kind of cells they are. One of the simplest and most important is through a
positive feedback loop, where a master cell-type transcription regulator activates
transcription of its own gene, in addition to that of other cell-type-specific genes.
Each time a cell divides, the regulator is distributed to both daughter cells, where
it continues to stimulate the positive feedback loop, making more of itself each
division. Positive feedback is crucial for establishing “self-sustaining” circuits of
gene expression that allow a cell to commit to a particular fate and then to
transmit that information to its progeny (Figure 7-39).
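
A minimal numerical sketch, under assumed parameters, shows how such a loop can
store a memory of a transient signal. The rate equation, the Hill-type
self-activation term, and all constants are illustrative assumptions rather than
measured values; the decay term lumps together degradation and dilution by cell
growth and division. With the assumed half-activation level K = 0.3, the equation
dA/dt = signal + A^2/(K^2 + A^2) - A has stable states at A = 0 and A = 0.9,
separated by a threshold at A = 0.1.

```python
def simulate(pulse, t_end=30.0, dt=0.01):
    """Euler integration of dA/dt = signal + A^2/(K^2 + A^2) - A."""
    K = 0.3    # assumed level of A giving half-maximal self-activation
    a = 0.0    # level of master regulator A (arbitrary units)
    for i in range(int(round(t_end / dt))):
        t = i * dt
        # The external signal is present only between t = 5 and t = 10.
        signal = 0.5 if (pulse and 5.0 <= t < 10.0) else 0.0
        self_activation = a * a / (K * K + a * a)
        a += dt * (signal + self_activation - a)
    return a

with_pulse = simulate(pulse=True)    # signal was seen, then removed
no_pulse = simulate(pulse=False)     # signal was never seen

print(round(with_pulse, 2), no_pulse)
```

In this sketch the regulator stays near 0.9 long after the signal has gone, while
a cell that never received the signal stays at zero, so the two lineages differ
only in their history.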

As was previously shown in Figure 7-37B, the master regulators needed to
maintain the pluripotency of iPS cells bind to cis-regulatory sequences in their
own control regions, providing examples of this type of positive feedback loop. In
addition, most of these pluripotent cell regulators also activate transcription of
other master regulators, resulting in a complex series of indirect feedback loops.
For example, if A activates B, and B activates A, this forms a positive feedback
loop where A activates its own expression, albeit indirectly. The series of direct
and indirect feedback loops observed in the iPS circuit is typical of other
specialized cell circuits. Such a network structure strengthens cell memory,
increasing the probability that a particular pattern of gene expression is
transmitted through successive generations. For example, if the level of A drops
below the critical threshold to stimulate its own synthesis, regulator B can
rescue it. By successive application of this mechanism, a complex series of
positive feedback loops among multiple transcription regulators can stably
maintain a differentiated state through many cell divisions.

Figure 7-39 A positive feedback loop can create cell memory. Protein A is a
master transcription regulator that activates the transcription of its own gene,
as well as other cell-type-specific genes (not shown). All of the descendants of
the original cell will therefore “remember” that the progenitor cell had
experienced a transient signal that initiated the production of protein A.

Positive feedback loops formed by transcription regulators are probably the
most prevalent way of ensuring that daughter cells remember what kind of cells
they are meant to be, and they are found in all species on Earth. For example,
many bacteria and single-cell eukaryotes form different types of cells, and
positive feedback loops lie at the heart of mechanisms that maintain their cell
types through many rounds of cell division. Plants and animals also make extensive
use of transcription feedback loops; as we shall discuss later in the chapter,
they have additional, more specialized mechanisms for making cell memory even
stronger. But first, we briefly consider how combinations of transcription
regulators and cis-regulatory sequences can be combined to create useful logic
devices for the cell.

Transcription Circuits Allow the Cell to Carry Out Logic Operations 

Simple gene regulatory switches can be combined to create all sorts of control
devices, just as simple electronic switching elements in a computer can be linked
to perform different types of operations. An analysis of gene regulatory circuits
reveals that certain simple types of arrangements (called network motifs) are
found over and over again in cells from widely different species. For example,
positive and negative feedback loops are common in all cells (Figure 7-40). Whereas
the former provides a simple memory device, the latter is often used to keep the
expression of a gene close to a standard level despite the variations in
biochemical conditions inside a cell. Suppose, for example, that a transcription
repressor protein binds to the regulatory region of its own gene and exerts a
strong negative feedback, such that transcription falls to a very low rate when
the concentration of the repressor protein is above some critical value
(determined by its affinity for its DNA binding site). The concentration of the
protein can then be held close to the critical value, since any circumstance that
causes a fall below that value can lead to a steep increase in synthesis, and any
that causes a rise above that value will lead to synthesis being switched off.
Such adjustments will, however, take time, so that an abrupt change of conditions
will cause a disturbance of gene expression that is strong but transient. If there
is a delay in the feedback loop, the result may be spontaneous oscillations in the
expression of the gene (see Figure 15-18). The different types of behavior produced
by a feedback loop will depend on the details of the system; for example, how
tightly the transcription regulator binds to its cis-regulatory sequence, its rate
of synthesis, and its rate of decay. We discuss these issues in quantitative terms
and in more detail in Chapter 8.
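
The buffering effect of negative feedback can be illustrated with a short
sketch. The model below is an assumption made for illustration: a repressor R
shuts off its own synthesis through a steep Hill-type term with critical value K,
and decays at a unit rate. The function name, the Hill exponent n = 8, and the
synthesis rates are hypothetical choices. The sketch compares how the final level
responds to a fourfold change in maximal synthesis rate, with and without feedback.

```python
def steady_level(beta, feedback, K=1.0, n=8, t_end=20.0, dt=0.01):
    """Level of R after relaxing from zero (simple Euler integration).

    beta     -- maximal synthesis rate (varies with cellular conditions)
    feedback -- if True, R represses its own gene around the critical value K
    """
    r = 0.0
    for _ in range(int(round(t_end / dt))):
        synthesis = beta / (1.0 + (r / K) ** n) if feedback else beta
        r += dt * (synthesis - r)    # synthesis minus first-order decay
    return r

# Fourfold change in synthesis capacity (5 versus 20, arbitrary units).
regulated_ratio = steady_level(20.0, True) / steady_level(5.0, True)
unregulated_ratio = steady_level(20.0, False) / steady_level(5.0, False)

print(round(regulated_ratio, 2), round(unregulated_ratio, 2))
```

Under these assumed values the autoregulated gene changes its level only
modestly (roughly 1.2-fold), whereas the unregulated gene follows the full
fourfold change.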

With two or more transcription regulators, the possible range of circuit
behaviors becomes more complex. Some bacterial viruses contain a common type of
two-gene circuit that can flip-flop between expression of one gene and expression
of the other. Another common circuit arrangement is called a feed-forward loop;
such a loop can serve as a filter, responding to input signals that are prolonged
but disregarding those that are brief (Figure 7-41). These various network motifs
resemble miniature logic devices, and they can process information in
surprisingly sophisticated ways.
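
The filtering behavior of a feed-forward loop can be sketched numerically. The
assumptions here are ours, chosen for illustration: A is active exactly while the
input signal is present, B is made only while A is active and decays at a unit
rate, and the target gene Z is transcribed only when A is active and B exceeds a
threshold (AND logic). The threshold of 0.6 and the time units are arbitrary.

```python
def z_on_time(pulse_length, threshold=0.6, dt=0.001, t_end=10.0):
    """Total time Z is transcribed for a signal pulse that starts at t = 1."""
    b = 0.0        # level of regulator B
    total = 0.0    # accumulated time with Z switched on
    for i in range(int(round(t_end / dt))):
        t = i * dt
        a_active = 1.0 <= t < 1.0 + pulse_length
        if a_active and b > threshold:    # Z needs both A and enough B
            total += dt
        b += dt * ((1.0 if a_active else 0.0) - b)
    return total

brief = z_on_time(0.5)    # B cannot reach the threshold in time
long_ = z_on_time(5.0)    # B accumulates, then Z is transcribed

print(brief, round(long_, 2))
```

With these values a pulse of length 0.5 never switches Z on, because B reaches
only about 0.39 before A turns off, whereas a pulse of length 5 switches Z on
after a delay of roughly one time unit.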

The simple types of devices just illustrated are found to be interwoven in
eukaryotic cells, creating exceedingly complex circuits (Figure 7-42). Each cell in
a developing multicellular organism is equipped with similarly complex control
machinery, and it must, in effect, use its intricate system of interlocking
transcription switches to compute how it should behave at each time point in
response to the many different past and present inputs received. We are only
beginning to understand how to study such complex intracellular control networks.
Indeed, without new approaches, coupled with quantitative information that is far
more precise and complete than we now possess, it will be impossible to predict
the behavior of a system such as that shown in Figure 7-42. As explained in
Chapter 8, a circuit diagram by itself is not enough.

Figure 7-40 Common types of network motifs in transcription circuits. A and B
represent transcription regulators, arrows indicate positive transcription
control, while lines with bars depict negative transcription control. In the
feed-forward loop, A and B represent transcription regulators that both activate
the transcription of target gene Z (see also Figure 8-86).


Figure 7-41 How a feed-forward loop can measure the duration of a signal.
(A) In this theoretical example, transcription regulators A and B are both
required for transcription of Z, and A becomes active only when an input signal
is present. (B) If the input signal to A is brief, A does not stay active long
enough for B to accumulate, and the Z gene is not transcribed. (C) If the signal
to A persists, B accumulates, A remains active, and Z is transcribed. This
arrangement allows the cell to ignore rapid fluctuations of the input signal and
respond only to persistent levels. This strategy could be used, for example, to
distinguish between random noise and a true signal.

The behavior shown here was computed for one particular set of parameter values
describing the quantitative properties of A, B, and the product of Z, along with
their syntheses. With different values of these parameters, feed-forward loops
can in principle perform other types of “calculations.” Many feed-forward loops
have been discovered in cells, and theoretical analysis helps researchers to
discern, and subsequently test, the different ways in which they may function
(see Figure 8-86). (Adapted from S.S. Shen-Orr et al., Nat. Genet. 31:64-68,
2002. With permission from Macmillan Publishers Ltd.)



Figure 7-42 The exceedingly complex gene circuit that specifies a portion of
the developing sea urchin embryo. Each colored small box represents a different
gene. Those in yellow code for transcription regulators and those in green and
blue code for proteins that give cells of the mesoderm and endoderm, respectively,
their specialized characteristics. Genes depicted in gray are largely active in
the mother and provide the egg with cues needed for proper development. As in
Figure 7-40, arrows depict instances in which a transcription regulator activates
the transcription of another gene. Lines ending in bars indicate examples of gene
repression. (From I.S. Peter and E.H. Davidson, Nature 474:635-639, 2011. With
permission from Macmillan Publishers Ltd.)

Summary

The many types of cells in animals and plants are created largely through
mechanisms that cause different sets of genes to be transcribed in different
cells. The transcription of any particular gene is generally controlled by a
combination of transcription regulators. Each type of cell in a higher eukaryotic
organism contains a specific set of transcription regulators that ensures the
expression of only those genes appropriate to that type of cell. A given
transcription regulator may be active in a variety of circumstances and is
typically involved in the regulation of many different genes.

Since specialized animal cells can maintain their unique character through
many cell-division cycles, and even when grown in culture, the gene regulatory
mechanisms involved in creating them must be stable once established and
heritable when the cell divides. These features reflect the cell's memory of its
developmental history. Direct or indirect positive feedback loops, which enable
transcription regulators to perpetuate their own synthesis, provide the simplest
mechanism for cell memory. Transcription circuits also provide the cell with the
means to carry out other types of logic operations. Simple transcription circuits
combined into large regulatory networks drive highly sophisticated programs of
embryonic development that will require new approaches to decipher.

MECHANISMS THAT REINFORCE CELL MEMORY IN 
PLANTS AND ANIMALS 

Thus far in this chapter, we have emphasized the regulation of gene transcription
by proteins that associate either directly or indirectly with DNA. However, DNA
itself can be covalently modified, and certain types of chromatin states appear
to be inherited. In this section, we shall see how these phenomena also provide
opportunities for the regulation of gene expression. At the end of this section, we
discuss how, in mice and humans, an entire chromosome can be transcriptionally
inactivated using such mechanisms, and how this state can be maintained
through many cell divisions.

Patterns of DNA Methylation Can Be Inherited When Vertebrate
Cells Divide

In vertebrate cells, the methylation of cytosine provides a mechanism through
which gene expression patterns can be passed on to progeny cells. The methylated
form of cytosine, 5-methyl cytosine (5-methyl C), has the same relation to
cytosine that thymine has to uracil, and the modification likewise has no effect on
base-pairing (Figure 7-43). DNA methylation in vertebrate DNA occurs on cytosine
(C) nucleotides largely in the sequence CG, which is base-paired to exactly
the same sequence (in opposite orientation) on the other strand of the DNA helix.
Consequently, a simple mechanism permits the existing pattern of DNA methylation
to be inherited directly by the daughter DNA strands. An enzyme called
maintenance methyl transferase acts preferentially on those CG sequences that
are base-paired with a CG sequence that is already methylated. As a result, the
pattern of DNA methylation on the parental DNA strand serves as a template for
the methylation of the daughter DNA strand, causing this pattern to be inherited
directly following DNA replication (Figure 7-44).
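
The templating logic of maintenance methylation can be sketched as follows. This
is a simplified illustration under stated assumptions, not a model of the enzyme:
the sequence is hypothetical, each CG site is identified by the position of its C
on the top strand, and the methylation state of each strand is stored as a set of
those positions.

```python
def find_cg_sites(seq):
    """Positions of the C in every CG dinucleotide on the top strand."""
    return [i for i in range(len(seq) - 1) if seq[i:i + 2] == "CG"]

def replicate(duplex):
    """Semiconservative replication: each daughter duplex keeps one parental
    strand (with its methyl groups) and gains a new, unmethylated strand."""
    top, bottom = duplex
    return (set(top), set()), (set(), set(bottom))

def maintenance_methylate(duplex):
    """Methylate the new strand only at CG sites whose partner strand is
    already methylated (hemimethylated sites); other sites are left alone."""
    top, bottom = duplex
    marked = top | bottom
    return (set(marked), set(marked))

seq = "ACGTTCGACGGA"         # hypothetical sequence with three CG sites
sites = find_cg_sites(seq)   # [1, 5, 8]
parent = ({1, 8}, {1, 8})    # sites 1 and 8 methylated on both strands, site 5 not

hemimethylated = replicate(parent)
daughters = [maintenance_methylate(d) for d in hemimethylated]
print(sites, daughters)
```

Both daughter duplexes end up with the parental pattern (sites 1 and 8
methylated, site 5 unmethylated). If the maintenance step is skipped, the
daughters remain hemimethylated, which corresponds to passive loss of methyl
groups over successive rounds of replication.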

Although DNA methylation patterns can be maintained in differentiated
cells by the mechanism shown in Figure 7-44, methylation patterns are dynamic
during mammalian development. Shortly after fertilization, there is a
genome-wide wave of demethylation, when the vast majority of methyl groups are
lost from the DNA. This demethylation may occur either by suppression of
maintenance DNA methyl transferase activity, resulting in the passive loss of
methyl groups during each round of DNA replication, or by demethylating enzymes
(discussed below). Later in development, new methylation patterns are established
by several de novo DNA methyl transferases that are directed to DNA by
sequence-specific DNA-binding proteins. Once the new patterns of methylation are
established, they can be propagated through rounds of DNA replication by the
maintenance methyl transferases.

Figure 7-43 Formation of 5-methyl cytosine occurs by methylation of a
cytosine base in the DNA double helix. In vertebrates, this event is largely
confined to selected cytosine (C) nucleotides located in the sequence CG. CG
sequences are sometimes denoted as CpG sequences, where the p indicates a
phosphate linkage to distinguish it from a CG base pair. In this chapter, we will
continue to use the simpler nomenclature CG to indicate this dinucleotide.

Figure 7-44 How DNA methylation patterns are faithfully inherited. In vertebrate
DNA, a large fraction of the cytosine nucleotides in the sequence CG is methylated
(see Figure 7-43). Because of the existence of a methyl-directed methylating
enzyme (the maintenance methyl transferase), once a pattern of DNA methylation is
established, that pattern of methylation is inherited in the progeny DNA, as shown.

DNA methylation has several uses in the vertebrate cell. A very important
role is to work in conjunction with other gene expression control mechanisms
to establish a particularly efficient form of gene repression. This combination of
mechanisms ensures that unneeded eukaryotic genes can be repressed to very
high degrees. For example, the rate at which a vertebrate gene is transcribed can
vary 10^6-fold between one tissue and another. The unexpressed vertebrate genes
are much less “leaky” in terms of transcription than bacterial genes, in which the
largest known differences in transcription rates between expressed and
unexpressed gene states are about 1000-fold.

DNA methylation helps to repress transcription in several ways. The methyl
groups on methylated cytosines lie in the major groove of DNA and interfere
directly with the binding of proteins (transcription regulators as well as the
general transcription factors) required for transcription initiation. In addition,
the cell contains a repertoire of proteins that bind specifically to methylated
DNA. The best characterized of these associate with histone modifying enzymes,
leading to a repressive chromatin state where chromatin structure and DNA
methylation act synergistically (Figure 7-45). One reflection of the importance of
DNA methylation to humans is the widespread involvement of “incorrect” DNA
methylation patterns in cancer progression (discussed in Chapter 20).

CG-Rich Islands Are Associated with Many Genes in Mammals 

Because of the way in which DNA repair enzymes work, methylated C nucleotides
in the vertebrate genome tend to be eliminated in the course of evolution.
Accidental deamination of an unmethylated C gives rise to U (see Figure 5-38),
which is not normally present in DNA and thus is recognized easily by the DNA
repair enzyme uracil DNA glycosylase, excised, and then replaced with a C (as
discussed in Chapter 5). But accidental deamination of a 5-methyl C cannot be
repaired in this way, for the deamination product is a T and so is
indistinguishable from the other, nonmutant T nucleotides in the DNA. Although a
special repair system exists to remove these mutant T nucleotides, many of the
deaminations escape detection, so that those C nucleotides in the genome that are
methylated tend to mutate to T over evolutionary time.

Figure 7-45 Multiple mechanisms contribute to stable gene repression.
In this schematic example, histone reader and writer proteins (discussed in
Chapter 4), under the direction of transcription regulators, establish a
repressive form of chromatin. A de novo DNA methylase is attracted by the histone
reader and methylates nearby cytosines in DNA, which are, in turn, bound by DNA
methyl-binding proteins. During DNA replication, some of the modified (blue dot)
histones will be inherited by one daughter chromosome, some by the other, and in
each daughter they can induce reconstruction of the same pattern of chromatin
modifications (discussed in Chapter 4). At the same time, the mechanism shown in
Figure 7-44 will cause both daughter chromosomes to inherit the same methylation
pattern. In these cases where DNA methylation stimulates the activity of the
histone writer, the two inheritance mechanisms will be mutually reinforcing. This
scheme can account for the inheritance by daughter cells of both the histone and
the DNA modifications. It can also explain the tendency of some chromatin
modifications to spread along a chromosome (see Figure 4-44).


During the course of evolution, more than three out of every four CGs have
been lost in this way, leaving vertebrates with a remarkable deficiency of this
dinucleotide. The CG sequences that remain are very unevenly distributed in the
genome; they are present at 10 times their average density in selected regions,
called CG islands, which average 1000 nucleotide pairs in length. The human
genome contains roughly 20,000 CG islands and they usually include promoters
of genes. For example, 60% of human protein-coding genes have promoters
embedded in CG islands and these include virtually all the promoters of the
so-called housekeeping genes: those genes that code for the many proteins that
are essential for cell viability and are therefore expressed in nearly all cells
(Figure 7-46). Over evolutionary timescales, the CG islands were spared the
accelerated mutation rate of bulk CG sequences because they remained unmethylated
in the germ line (Figure 7-47).
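
One simple way to quantify the CG deficiency of a stretch of DNA is to compare
the number of CG dinucleotides observed with the number expected from its C and
G content alone. The sketch below computes this observed/expected ratio; the
measure, the function name, and the two short test sequences are our
illustrative assumptions, not something defined in the text. If CGs were
originally present at the frequency expected from base composition, losing three
out of every four would correspond to a ratio of roughly 0.25, while a CG island
would score far higher than the surrounding DNA.

```python
def cg_obs_exp(seq):
    """Observed/expected CG ratio: (#CG * length) / (#C * #G)."""
    seq = seq.upper()
    n_c = seq.count("C")
    n_g = seq.count("G")
    if n_c == 0 or n_g == 0:
        return 0.0               # no CG dinucleotide is possible
    n_cg = seq.count("CG")       # CG cannot overlap itself, so count() is exact
    return n_cg * len(seq) / (n_c * n_g)

island_like = "CGCGGCGCTACGCGATCG"    # hypothetical CG-rich stretch
depleted = "CAGTCTGGACTGCATGCA"       # hypothetical stretch with no CG at all

island_ratio = cg_obs_exp(island_like)
depleted_ratio = cg_obs_exp(depleted)
print(round(island_ratio, 2), depleted_ratio)
```

Real analyses apply a measure of this kind in windows along a genome; the toy
sequences here are far too short for that and serve only to show the arithmetic.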

CG islands also remain unmethylated in most somatic tissues whether or
not the associated gene is expressed. The unmethylated state is maintained
by sequence-specific DNA-binding proteins, many of whose cis-regulatory
sequences contain a CG. By binding to these sequences, which are spread across
CG islands, they protect the DNA from methyl transferases. These proteins also
recruit DNA demethylases, which convert 5-methyl C to hydroxy-methyl C, which
is later replaced by C either through DNA repair (see Figure 5-41A) or, passively,
through multiple rounds of DNA replication. Unmethylated CG islands have several
properties that make them particularly suitable for promoters. For example,
some of the same proteins that bind to CG islands and protect them from
methylation recruit histone modifying enzymes that make the islands particularly
“promoter friendly.” As a result, RNA polymerase is often found bound to promoters
within CG islands, even when the associated gene is not being actively transcribed.
At unmethylated CG islands, the balance between polymerase and nucleosome
assembly is thus tipped toward the former. Additional steps are needed to “push”
the bound polymerase into transcribing the adjacent gene, and these are directed
by transcription regulators that bind to cis-regulatory sequences of DNA (often
well upstream from the CG islands). These regulators serve to release the
polymerase with the appropriate elongation factors (see Figure 7-21C and D).

Figure 7-46 The CG islands surrounding the promoter in three mammalian
housekeeping genes. The yellow boxes show the extent of each island. As for
most genes in mammals, the exons (dark red) are very short relative to the introns
(light red). (Adapted from A.P. Bird, Trends Genet. 3:342-347, 1987. With
permission from Elsevier.)


Genomic Imprinting Is Based on DNA Methylation


Mammalian cells are diploid, containing one set of genes inherited from the father
and one set from the mother. The expression of a small minority of genes depends
on whether they have been inherited from the mother or the father: when the
paternally inherited gene copy is active, the maternally inherited gene copy is
silent, or vice versa. This phenomenon is called genomic imprinting.

Roughly 300 genes are imprinted in humans. Because only one copy of an
imprinted gene is expressed, imprinting can “unmask” mutations that would
normally be covered by the other, functional copy. For example, Angelman
syndrome, a disorder of the nervous system in humans that causes reduced mental
ability and severe speech impairment, results from a gene deletion on one
chromosomal homolog and the silencing, by imprinting, of the intact gene on the
other homolog.

The insulin-like growth factor-2 (Igf2) gene in the mouse provides a
well-studied example of imprinting. Mice that do not express Igf2 at all are born
half the size of normal mice. However, only the paternal copy of Igf2 is
transcribed, and only this gene copy matters for the phenotype. As a result, mice
with a mutated paternally derived Igf2 gene are stunted, while mice with a mutated
maternally derived Igf2 gene are normal.

Figure 7-47 A mechanism to explain both the marked overall deficiency
of CG sequences and their clustering into CG islands in vertebrate
genomes. White lines mark the location of CG dinucleotides in the DNA
sequences, while red circles indicate the presence of a methyl group on the
CG dinucleotide. CG sequences that lie in regulatory sequences of genes
that are transcribed in germ cells are unmethylated and therefore tend to be
retained in evolution. Methylated CG sequences, on the other hand, tend to
be lost through deamination of 5-methyl C to T, unless the CG sequence is
critical for survival.

Figure 7-48 Imprinting in the mouse. The top portion of the figure shows a pair
of homologous chromosomes in the somatic cells of two adult mice, one male and one
female. In this example, both mice have inherited the top homolog from their father
and the bottom homolog from their mother, and the paternal copy of a gene subject
to imprinting (indicated in orange) is methylated, preventing its expression. The
maternally derived copy of the same gene (yellow) is expressed. The remainder of
the figure shows the outcome of a cross between these two mice. During germ-cell
formation, but before meiosis, the imprints are erased and then, much later in
germ-cell development, they are reimposed in a sex-specific pattern (middle
portion of figure). In eggs produced from the female, neither allele of the A gene
is methylated. In sperm from the male, both alleles of gene A are methylated. Shown
at the bottom of the figure are two of the possible imprinting patterns inherited
by the progeny mice; the mouse on the left has the same imprinting pattern as each
of the parents, whereas the mouse on the right has the opposite pattern. If the two
alleles of gene A are distinct, these different imprinting patterns can cause
phenotypic differences in the progeny mice, even though they carry exactly the
same DNA sequences of the two A gene alleles. Imprinting provides an important
exception to classical genetic behavior, and several hundred mouse genes are
thought to be affected in this way. However, the majority of mouse genes are not
imprinted, and therefore the rules of Mendelian inheritance apply to most of the
mouse genome.


In the early embryo, genes subject to imprinting are marked by methylation
according to whether they were derived from a sperm or an egg chromosome.
In this way, DNA methylation is used as a mark to distinguish two copies of a
gene that may be otherwise identical (Figure 7-48). Because imprinted genes
are somehow protected from the wave of demethylation that takes place shortly
after fertilization (see pp. 404-405), this mark enables somatic cells to
“remember” the parental origin of each of the two copies of the gene and to
regulate their expression accordingly. In most cases, the methyl imprint silences
nearby gene expression. In some cases, however, it can activate expression of a
gene. In the case of Igf2, for example, methylation of an insulator element on the
paternally derived chromosome blocks its function and allows distant
cis-regulatory sequences to activate transcription of the Igf2 gene. On the
maternally derived chromosome, the insulator is not methylated and the Igf2 gene
is therefore not transcribed (Figure 7-49A).
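
The logic of Igf2 imprinting can be summarized in a few lines. The sketch below
encodes only what is stated in the text: a methylated insulator is inactive, an
active insulator blocks the distant cis-regulatory sequences, and only the
paternal copy carries the methyl imprint. The function names and the two-valued
"normal"/"stunted" phenotype are hypothetical simplifications for illustration.

```python
def igf2_transcribed(insulator_methylated):
    """Methylation inactivates the insulator, letting the distant
    cis-regulatory sequences activate the Igf2 gene."""
    insulator_active = not insulator_methylated
    return not insulator_active

def mouse_phenotype(paternal_gene_intact, maternal_gene_intact):
    """Phenotype given which inherited copies of Igf2 are unmutated."""
    # Imprint: insulator methylated on the paternal chromosome only.
    paternal_product = igf2_transcribed(True) and paternal_gene_intact
    maternal_product = igf2_transcribed(False) and maternal_gene_intact
    return "normal" if (paternal_product or maternal_product) else "stunted"

print(mouse_phenotype(paternal_gene_intact=False, maternal_gene_intact=True))
print(mouse_phenotype(paternal_gene_intact=True, maternal_gene_intact=False))
```

The first call, with a mutated paternal copy, returns "stunted" even though an
intact maternal copy is present; the second, with a mutated maternal copy,
returns "normal", matching the mouse results described earlier in this section.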

Other cases of imprinting involve long noncoding RNAs (lncRNAs), which are defined
as RNA molecules over 200 nucleotides in length that do not code for proteins. We
discuss lncRNAs broadly at the end of this chapter; here, we focus on the role of
a specific lncRNA in imprinting. In the case of the Kcnq1 gene, which codes for a
voltage-gated potassium channel needed for proper heart function, the lncRNA is
made from the paternal allele (which is unmethylated) but it is not released by the
RNA polymerase, remaining instead at its site of synthesis on the DNA template.
This RNA in turn recruits histone-modifying and DNA-methylating enzymes that
direct the formation of repressive chromatin, which silences the associated
protein-coding gene on the paternally derived chromosome (Figure 7-49B). The
maternally derived gene, on the other hand, is immune to these effects because the
specific methylation present from imprinting blocks the synthesis of the lncRNA but
allows transcription of the protein-coding gene. As with Igf2, the specificity of Kcnq1
imprinting arises from the inherited methylation patterns; the difference lies in
the way these patterns bring about differential expression of the imprinted gene.
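
The two imprinting mechanisms can be contrasted as simple logic. The sketch below is only an illustration of the reasoning in the text; the function names and the Boolean treatment of methylation are assumptions made for the example, not a quantitative model.

```python
def igf2_expressed(insulator_methylated: bool) -> bool:
    """Igf2: methylation of the insulator prevents CTCF binding, so the
    insulator no longer blocks the distant cis-regulatory sequences."""
    ctcf_bound = not insulator_methylated
    insulator_blocks_enhancer = ctcf_bound
    return not insulator_blocks_enhancer


def kcnq1_expressed(imprint_methylated: bool) -> bool:
    """Kcnq1: methylation blocks synthesis of the lncRNA; when the lncRNA
    is made, it stays at the locus and directs repressive chromatin."""
    lncrna_made = not imprint_methylated
    return not lncrna_made


# Methylation state of each parental copy, as described in the text.
methylation = {
    "Igf2": {"maternal": False, "paternal": True},
    "Kcnq1": {"maternal": True, "paternal": False},
}
rules = {"Igf2": igf2_expressed, "Kcnq1": kcnq1_expressed}

for gene, copies in methylation.items():
    for parent, methylated in copies.items():
        state = "expressed" if rules[gene](methylated) else "silent"
        print(f"{gene} ({parent} copy, methylated={methylated}): {state}")
```

In both cases the inherited methyl mark decides which copy is read, but the intermediate step differs: an insulator protein for Igf2, a retained lncRNA for Kcnq1.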

Why imprinting should exist at all is a mystery. In vertebrates, it is restricted to
placental mammals, and many of the imprinted genes are involved in fetal development.
One idea is that imprinting reflects a middle ground in the evolutionary
struggle between males to produce larger offspring and females to limit offspring
size. Whatever its purpose might be, imprinting provides startling evidence that
features of DNA other than its sequence of nucleotides can be inherited.

Chromosome-Wide Alterations in Chromatin Structure Can Be 
Inherited 

We have seen that DNA methylation and certain types of chromatin structure
can be heritable, preserving patterns of gene expression across cell generations.
Perhaps the most striking example of this effect occurs in mammals, in which an
alteration in the chromatin structure of an entire chromosome can modulate the
levels of expression of most genes on that chromosome.


Figure 7-49 Mechanisms of imprinting.
(A) On chromosomes inherited from the female, a protein called CTCF binds to
an insulator (see Figure 7-24), blocking communication between cis-regulatory
sequences (green) and the Igf2 gene (orange). Igf2 is therefore not expressed
from the maternally inherited chromosome. Because of imprinting, the insulator on the
male-derived chromosome is methylated (red circles); this inactivates the insulator
by blocking the binding of the CTCF protein, and allows the cis-regulatory
sequences to activate transcription of the Igf2 gene. In other examples of imprinting,
methylation simply blocks gene expression by interfering with the binding of proteins
required for a gene's transcription.
(B) Imprinting of the mouse Kcnq1 gene. On the maternally derived chromosome,
synthesis of the lncRNA is blocked by methylation of the DNA (red circles),
and the Kcnq1 gene is expressed. On the paternally derived chromosome, the
lncRNA is synthesized, remains in place, and by directing alterations in chromatin
structure blocks expression of the Kcnq1 gene. Although shown as directly binding
to lncRNA, the histone-modifying enzymes are likely to be recruited indirectly, through
additional proteins.

Males and females differ in their sex chromosomes. Females have two X chromosomes,
whereas males have one X and one Y chromosome. As a result, female
cells contain twice as many copies of X-chromosome genes as do male cells. In
mammals, the X and Y sex chromosomes differ radically in gene content: the X
chromosome is large and contains more than a thousand genes, whereas the Y
chromosome is small and contains fewer than 100 genes. Mammals have evolved a
dosage compensation mechanism to equalize the dosage of X-chromosome gene
products between males and females. The correct ratio of X chromosome to autosome
(non-sex chromosome) gene products is carefully controlled, and mutations
that interfere with this dosage compensation are generally lethal.

Mammals achieve dosage compensation by the transcriptional inactivation of
one of the two X chromosomes in female somatic cells, a process known as
X-inactivation. As a result of X-inactivation, two X chromosomes can coexist within
the same nucleus, exposed to the same diffusible transcription regulators, yet
differ entirely in their expression.

Early in the development of a female embryo, when it consists of a few hundred
cells, one of the two X chromosomes in each cell becomes highly condensed
into a type of heterochromatin. The initial choice of which X chromosome to
inactivate, the maternally inherited one (Xm) or the paternally inherited one (Xp), is
random. Once either Xp or Xm has been inactivated, it remains silent throughout
all subsequent cell divisions of that cell and its progeny, indicating that the
inactive state is faithfully maintained through many cycles of DNA replication and
mitosis. Because X-inactivation is random and takes place after several hundred
cells have already formed in the embryo, every female is a mosaic of clonal groups
of cells in which either Xp or Xm is silenced (Figure 7-50). These clonal groups are
distributed in small clusters in the adult animal because sister cells tend to remain
close together during later stages of development (Figure 7-51). For example,
X-chromosome inactivation causes the orange and black "tortoiseshell" coat
coloration of some female cats. In these cats, one X chromosome carries a gene
that produces orange hair color, and the other X chromosome carries an allele
of the same gene that results in black hair color; it is the random X-inactivation
that produces patches of cells of two distinctive colors. In contrast, male cats of
this genetic stock are either solid orange or solid black, depending on which X
chromosome they inherit from their mothers. Although X-chromosome inactivation
is maintained over thousands of cell divisions, it is reversed during germ-cell
formation, so that all haploid oocytes contain an active X chromosome and can
express X-linked gene products.

Figure 7-50 X-inactivation. The clonal inheritance in female mammals of a
condensed, inactive X chromosome. (Figure labels: cell in early embryo; only Xm
active in this clone; only Xp active in this clone.)

Figure 7-51 Photoreceptor cells in the retina of a female mouse showing
patterns of X-chromosome inactivation. Using genetic engineering
techniques (described in Chapter 8), the germ line of a mouse was modified
so that one copy of the X chromosome (if active) makes a green fluorescent
protein and the other a red fluorescent protein. Both proteins concentrate in
the nucleus and, in the field of cells shown here, it is clear that only one of
the two X chromosomes is active in each cell. (From H. Wu et al., Neuron
81:103-119, 2014. With permission from Elsevier.)
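
The origin of the mosaic can be illustrated with a short simulation. This is a minimal sketch under stated assumptions: the founder-cell count (300), the number of later divisions (5), and the function name are hypothetical values chosen only to mirror the description of random choice followed by faithful clonal inheritance.

```python
import random


def simulate_x_inactivation(founder_cells=300, later_divisions=5, seed=0):
    """Return one clone per founder cell; each clone is a list naming the
    X chromosome (Xm or Xp) that remains active in every descendant."""
    rng = random.Random(seed)
    # Random choice, made once per cell in the early embryo.
    founders = [rng.choice(("Xm", "Xp")) for _ in range(founder_cells)]
    # The inactive state is inherited, so all descendants share the choice.
    clone_size = 2 ** later_divisions
    return [[active_x] * clone_size for active_x in founders]


clones = simulate_x_inactivation()
xm_clones = sum(1 for clone in clones if clone[0] == "Xm")
print(f"{xm_clones} of {len(clones)} clones keep Xm active; the rest keep Xp active")
```

Every clone is uniform, yet the animal as a whole contains both kinds of clone in roughly equal numbers, which is the pattern seen in a tortoiseshell coat.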

How is an entire chromosome transcriptionally inactivated? X-chromosome
inactivation is initiated and spreads from a single site near the middle of the X
chromosome, the X-inactivation center (XIC). Within the XIC is a transcribed
20,000-nucleotide lncRNA (called Xist), which is expressed solely from the inactive
X chromosome. Xist RNA spreads from the XIC over the entire chromosome
and directs gene silencing. Although we do not know exactly how this is
accomplished, it likely involves recruitment of histone-modifying enzymes and other
proteins to form a repressive form of chromatin analogous to that of Figure 7-45.
Curiously, about 10% of the genes on the X chromosome (including Xist itself)
escape this silencing and remain active.

The spread of Xist RNA along the X chromosome does not proceed linearly
along the DNA. Rather, starting at its site of synthesis, it is first handed off across
the base of the DNA loops that make up the chromosome; these shortcuts explain
how Xist can spread rapidly, by a "hand-over-hand" mechanism, along the X
chromosome once the inactivation process begins (Figure 7-52). It also helps to
explain why the inactivation does not spread to the other, active X chromosome.

Imprinting and X-chromosome inactivation are examples of monoallelic gene
expression, where in a diploid genome, only one of the two copies of a gene is
expressed. In addition to the approximately 1000 genes on the X chromosome and
the 300 or so genes that are imprinted, there are another 1000-2000 human genes
that exhibit monoallelic expression. Like X-chromosome inactivation (but unlike
imprinting), the choice of which copy of the gene is to be expressed and which is
to be silenced often appears random. Yet once the choice is made, it can persist for
many cell divisions. Because the choice is often made relatively late in development,
cells of the same tissue in the same individual can express different copies
of a given gene. In other words, somatic tissues are often mosaics, where different
clones of cells have subtly different patterns of gene expression. The mechanisms
responsible for this type of monoallelic expression are not known in detail, and
its general purpose, if any, is poorly understood. Several different mechanisms
may contribute to such epigenetic inheritance, as we explain next.





Epigenetic Mechanisms Ensure That Stable Patterns of Gene 
Expression Can Be Transmitted to Daughter Cells 

As we have seen, once a cell in an organism differentiates into a particular cell
type, it generally remains specialized in that way; if it divides, its daughters inherit
the same specialized character. Perhaps the simplest way for a cell to remember
its identity is through a positive feedback loop in which a key transcription
regulator activates, either directly or indirectly, the transcription of its own gene (see
Figure 7-39). Interlocking positive feedback loops of the type shown in Figure
7-37 provide greater stability by buffering the circuit against fluctuations in the
level of any one transcription regulator. Because transcription regulators are
synthesized in the cytosol and diffuse throughout the nucleus, feedback loops based
on this mechanism will affect both copies of a gene in a diploid cell. However, as
discussed in this section, the expression pattern of a gene on one chromosome
can differ from the copy of the same gene on the other chromosome (as in
X-chromosome inactivation or in imprinting), and such differences can also be inherited
through many cell divisions.
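
How a positive feedback loop can store memory is easier to see in a toy model. The sketch below is an assumption-laden illustration, not a description of any particular regulator: the Hill-type production term, the parameter values (K = 0.5, n = 4), and the unit degradation rate are all hypothetical and chosen only so that the circuit has two stable states.

```python
def simulate_feedback(pulse_duration, t_end=50.0, dt=0.01, K=0.5, n=4):
    """Euler integration of dx/dt = signal + x^n / (K^n + x^n) - x, where x
    is the level of a regulator that activates its own gene. The external
    signal is present only for the first `pulse_duration` time units."""
    x = 0.0
    for step in range(round(t_end / dt)):
        t = step * dt
        signal = 1.0 if t < pulse_duration else 0.0
        self_activation = x**n / (K**n + x**n)
        x += dt * (signal + self_activation - x)
    return x


never_induced = simulate_feedback(pulse_duration=0.0)
brief_pulse = simulate_feedback(pulse_duration=0.3)
long_pulse = simulate_feedback(pulse_duration=5.0)
print(f"no signal:   {never_induced:.3f}")
print(f"brief pulse: {brief_pulse:.3f}")
print(f"long pulse:  {long_pulse:.3f}")
```

In this sketch a sufficiently long transient signal switches the regulator on, and it stays on long after the signal has gone, whereas a pulse too brief to cross the threshold is forgotten.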

The ability of a daughter cell to retain a memory of the gene expression patterns
that were present in the parent cell is an example of epigenetic inheritance:
a heritable alteration in a cell or organism's phenotype that does not result from
changes in the nucleotide sequence of DNA (discussed in Chapter 4). (Unfortunately,
the term epigenetic is sometimes also used to refer to all covalent modifications
to histones and DNA, whether or not they are self-propagating; many of
these modifications are erased each time a cell divides and do not generate cell
memory.)

In Figure 7-53, we contrast two self-propagating epigenetic mechanisms
that work in cis, affecting only one chromosomal copy, with two self-propagating
mechanisms that work in trans, affecting both chromosomal copies of a gene.
Cells can combine these mechanisms to ensure that patterns of gene expression
are maintained and inherited accurately and reliably, over a period of up to a
hundred years or more in our own case.

We can get some idea of the prevalence of epigenetic changes by comparing
identical twins. Their genomes have the same sequence of nucleotides, and,
obviously, many features of identical twins, such as their appearance, are strongly
determined by the genome sequences they inherit. When their gene expression,
histone modification, and DNA methylation patterns are compared, however,
many differences are observed. Because these differences are roughly correlated
not only with age but also with the time that the twins have spent apart from each
other, it has been proposed that some of these differences are heritable from cell
to cell and are the result of environmental factors. Although these studies are in
early stages, the idea that environmental events can be permanently registered
as epigenetic changes in our cells is a fascinating one that presents an important
challenge to the next generation of biological scientists.


Figure 7-52 Mammalian X-chromosome inactivation. X-chromosome inactivation
begins with the synthesis of Xist (X-inactivation specific transcript) RNA from
the XIC (X-inactivation center) locus and moves outward to the chromosome ends.
According to the model depicted here, the long (~20,000 nucleotides) Xist RNA
has many low-affinity binding sites for the structural components of chromosomes
and spreads by releasing its hold on one portion of the chromosome while grasping
another. The continued synthesis of Xist from the center of the chromosome drives
it to the ends. As shown, Xist RNA does not move linearly along the chromosomal
DNA, but, instead, moves first across the base of chromosome loops. It has been
proposed that the portions of chromosomal DNA at the tips of long loops contain the
10% of genes that escape X-chromosome inactivation.

Summary

Eukaryotic cells can use inherited forms of DNA methylation and inherited states
of chromatin condensation as additional mechanisms for generating cell memory
of gene expression patterns. An especially dramatic case that involves chromatin
condensation is the inactivation of an entire X chromosome in female mammals.
DNA methylation underlies the phenomenon in mammals of genomic imprinting,
in which the expression of a gene depends on whether it was inherited from the
mother or the father.

POST-TRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression can be
controlled. Indeed, one can find examples of each type of regulation, and many genes
are regulated by multiple mechanisms. As we have seen, controls on the initiation
of gene transcription are a critical form of regulation for all genes. But other
controls can act later in the pathway from DNA to protein to modulate the amount
of gene product that is made and, in some cases, to determine the exact amino
acid sequence of the protein product. These post-transcriptional controls, which
operate after RNA polymerase has bound to the gene's promoter and has begun
RNA synthesis, are crucial for the regulation of many genes.

In the following sections, we consider the varieties of post-transcriptional
regulation in temporal order, according to the sequence of events that an RNA
molecule might experience after its transcription has begun (Figure 7-54).


Figure 7-53 Four distinct mechanisms that can produce an epigenetic
form of inheritance in an organism.
(A) Epigenetic mechanisms that act in cis. As discussed in this chapter, a
maintenance methylase can propagate specific patterns of cytosine methylation
(see Figure 7-44). As discussed in Chapter 4, a histone modifying enzyme
that replicates the same modification that attracts it to chromatin can result in the
modification being self-propagating (see Figure 4-44).
(B) Epigenetic mechanisms that act in trans. Positive feedback loops
formed by transcriptional regulators are found in all species and are probably the
most common form of cell memory. As discussed in Chapter 3, some proteins
can form self-propagating prions (Figure 3-33). If these proteins are involved in gene
expression, they can transmit patterns of gene expression to daughter cells.

Figure 7-54 Post-transcriptional controls of gene expression. The final
synthesis rate of a protein can, in principle, be controlled at any of the steps listed in
capital letters. In addition, RNA splicing, RNA editing, and translation recoding can
also alter the sequence of amino acids in a protein, making it possible for the cell to
produce more than one protein variant from the same gene. Only a few of the steps
depicted here are likely to be critical for the regulation of any one particular protein.

Transcription Attenuation Causes the Premature Termination of
Some RNA Molecules

It has long been known that the expression of some genes is inhibited by premature
termination of transcription, a phenomenon called transcription attenuation.
In some of these cases, the nascent RNA chain adopts a structure that causes
it to interact with the RNA polymerase in such a way as to abort its transcription.
When the gene product is required, regulatory proteins bind to the nascent RNA
chain and remove the attenuation, allowing the transcription of a complete RNA
molecule.

A well-studied example of transcription attenuation occurs during the life
cycle of HIV, the human immunodeficiency virus that is the causative agent of
acquired immune deficiency syndrome, or AIDS. Once the HIV genome has been
integrated into the host genome, the viral DNA is transcribed by the cell's RNA
polymerase II (see Figure 5-62). However, this polymerase usually terminates
transcription after synthesizing transcripts of several hundred nucleotides and
therefore fails to efficiently transcribe the entire viral genome. When conditions
for viral growth are optimal, a virus-encoded protein called Tat, which binds to
a specific stem-loop structure in the nascent RNA that contains a "bulged base,"
prevents this premature termination (see Figure 6-89). Once bound to this specific
RNA structure (called TAR), Tat assembles several host-cell proteins that
allow the RNA polymerase to continue transcribing. The normal role of at least
some of these proteins is to prevent pausing and premature termination by RNA
polymerase when it transcribes normal cell genes. Thus, a normal cell mechanism
has apparently been hijacked by HIV to permit transcription of its genome to be
controlled by a single viral protein.

Riboswitches Probably Represent Ancient Forms of Gene Control 

In Chapter 6, we discussed the idea that, before modern cells arose on Earth, RNA
played the role of both DNA and proteins, both storing hereditary information
and catalyzing chemical reactions (see pp. 362-366). The discovery of riboswitches
shows that RNA can also form control devices. Riboswitches are short sequences
of RNA that change their conformation on binding small molecules, such as
metabolites. Each riboswitch recognizes a specific small molecule and the resulting
conformational change is used to regulate gene expression. Riboswitches are
often located near the 5' end of mRNAs, and they fold while the mRNA is being
synthesized, blocking or permitting progress of the RNA polymerase according to
whether the regulatory small molecule is bound (Figure 7-55).

Riboswitches are particularly common in bacteria, in which they sense key
small metabolites in the cell and adjust gene expression accordingly. Perhaps
their most remarkable feature is the high specificity and affinity with which each
recognizes only the appropriate small molecule; in many cases, every chemical
feature of the small molecule is read by the RNA (Figure 7-55C). Moreover, the
binding affinities observed are as tight as those typically observed between small
molecules and proteins.





Figure 7-55 A riboswitch that responds to guanine. (A) In this example from
bacteria, the riboswitch controls expression of the purine biosynthetic genes. When
guanine levels in cells are low, an elongating RNA polymerase transcribes
the purine biosynthetic genes, and the enzymes needed for guanine synthesis are
therefore expressed. (B) When guanine is abundant, it binds the riboswitch, causing
it to undergo a conformational change that forces the RNA polymerase to terminate
transcription (see Figure 6-11). (C) Guanine (red) bound to the riboswitch. Only those
nucleotides that form the guanine-binding pocket are shown. Many other riboswitches
exist, including those that recognize S-adenosylmethionine, coenzyme B12,
flavin mononucleotide, adenine, lysine, and glycine. (Adapted from M. Mandal and
R.R. Breaker, Nat. Rev. Mol. Cell Biol. 5:451-463, 2004. With permission from
Macmillan Publishers Ltd; and C.K. Vanderpool and S. Gottesman, Mol.
Microbiol. 54:1076-1089, 2004. With permission from Blackwell Publishing.)

Figure 7-56 Five patterns of alternative RNA splicing. In each case, a
single type of RNA transcript is spliced in two alternative ways to produce
two distinct mRNAs (1 and 2). The dark blue boxes mark exon sequences
that are retained in both mRNAs. The light blue boxes mark possible exon
sequences that are included in only one of the mRNAs. The boxes are
joined by red lines to indicate where intron sequences (yellow) are removed.
(Adapted from H. Keren et al. Nat. Rev. Genet. 11:345-355, 2010. With
permission from Macmillan Publishers Ltd.)

Riboswitches are perhaps the most economical examples of gene control
devices, inasmuch as they bypass the need for regulatory proteins altogether. In
the example shown in Figure 7-55, the riboswitch controls transcription elongation,
but riboswitches can also regulate other steps in gene expression, as we shall see later
in this chapter. Clearly, highly sophisticated gene control devices can be made
from short sequences of RNA, a fact that supports the hypothesis of an early "RNA
world."

Alternative RNA Splicing Can Produce Different Forms of a Protein
from the Same Gene

As discussed in Chapter 6 (see Figure 6-26), RNA splicing shortens the transcripts
of many eukaryotic genes by removing the intron sequences from the mRNA
precursor. We also saw that a cell can splice an RNA transcript differently and thereby
make different polypeptide chains from the same gene, a process called alternative
RNA splicing (Figure 7-56). A substantial proportion of animal genes (estimated
at 90% in humans) produce multiple proteins in this way.

When different splicing possibilities exist at several positions in the transcript,
a single gene can produce dozens of different proteins. In one extreme case, a
single Drosophila gene may produce as many as 38,000 different proteins
through alternative splicing (Figure 7-57), although only a fraction of these
forms have thus far been experimentally observed. Considering that the Drosophila
genome has approximately 14,000 identified genes, it is clear that the protein
complexity of an organism can greatly exceed the number of its genes. This example
also illustrates the perils in equating gene number with an organism's complexity.
For example, alternative splicing is rare in single-celled budding yeasts
but very common in flies. Budding yeast has ~6200 genes, only about 300 of which
are subject to splicing, and nearly all of these have only a single intron. To say that
flies have only 2-3 times as many genes as yeasts greatly underestimates the
difference in complexity of these two genomes.

Figure 7-57 Alternative splicing of RNA transcripts of the Drosophila Dscam gene. DSCAM proteins have several different functions. In cells
of the fly immune system, they mediate the phagocytosis of bacterial pathogens. In cells of the nervous system, DSCAM proteins are needed for
proper wiring of neurons. The final mRNA contains 24 exons, four of which (denoted A, B, C, and D) are present in the Dscam gene as arrays of
alternative exons. Each RNA contains 1 of 12 alternatives for exon A (red), 1 of 48 alternatives for exon B (green), 1 of 33 alternatives for exon C
(blue), and 1 of 2 alternatives for exon D (yellow). This figure shows only one of the many possible splicing patterns (indicated by the red line and
by the mature mRNA below it). Each variant DSCAM protein would fold into roughly the same structure (predominantly a series of extracellular
immunoglobulin-like domains linked to a membrane-spanning region; see Figure 24-48), but the amino acid sequence of the domains varies
according to the splicing pattern. The diversity of DSCAM variants contributes to the plasticity of the immune system as well as the formation of
complex neural circuits; we take up the specific role of the DSCAM variants in more detail when we describe the development of the nervous system
in Chapter 21. (Adapted from D.L. Black, Cell 103:367-370, 2000. With permission from Elsevier.)
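
The figure of roughly 38,000 Dscam variants follows directly from the exon counts given in the legend to Figure 7-57, assuming that the choice at each of the four positions is independent of the others. The short calculation below makes that arithmetic explicit; the variable names are ours, and the gene count is the approximate value quoted in the text.

```python
from math import prod

# Number of alternative exons at each variable position (Figure 7-57).
dscam_alternatives = {"A": 12, "B": 48, "C": 33, "D": 2}

# One exon is chosen at each position, so the counts multiply.
possible_mrnas = prod(dscam_alternatives.values())

drosophila_genes = 14_000  # approximate gene count quoted in the text
print(possible_mrnas)                               # 38016
print(round(possible_mrnas / drosophila_genes, 1))  # 2.7
```

A single gene can thus, in principle, specify more distinct proteins than there are genes in the entire fly genome.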

In some cases, alternative RNA splicing occurs because there is an intron
sequence ambiguity: the standard spliceosome mechanism for removing intron
sequences (discussed in Chapter 6) is unable to distinguish clearly between two
or more alternative pairings of 5' and 3' splice sites, so that different choices are
made by chance on different individual transcripts. Where such constitutive
alternative splicing occurs, several versions of the protein encoded by the gene are
made in all cells in which the gene is expressed.

In many cases, however, alternative RNA splicing is regulated. In the simplest
examples, regulated splicing is used to switch from the production of a
nonfunctional protein to the production of a functional one (or the other way around).
The transposase that catalyzes the transposition of the Drosophila P element,
for example, is produced in a functional form in germ cells and a nonfunctional
form in somatic cells of the fly, allowing the P element to spread throughout the
genome of the fly without causing damage in somatic cells (see Figure 5-61). The
difference in transposon activity has been traced to the presence of an intron
sequence in the transposase RNA that is removed only in germ cells.

In addition to enabling switching from the production of a functional protein
to the production of a nonfunctional one (or vice versa), the regulation of RNA
splicing can generate different versions of a protein in different cell types,
according to the needs of the cell. Tropomyosin, for example, is produced in specialized
forms in different types of cells (see Figure 6-26). Cell-type-specific forms of many
other proteins are produced in the same way.

RNA splicing can be regulated either negatively, by a regulatory molecule that
prevents the splicing machinery from gaining access to a particular splice site
on the RNA, or positively, by a regulatory molecule that helps direct the splicing
machinery to an otherwise overlooked splice site (Figure 7-58).

Because of the plasticity of RNA splicing, the blocking of a "strong" splicing
site will often expose a "weak" site and result in a different pattern of splicing.
Thus, the splicing of a pre-mRNA molecule can be thought of as a delicate balance
between competing splice sites, a balance that can easily be tipped by the effects
of regulatory proteins on splicing.
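
The idea of competing splice sites can be caricatured in a few lines of code. This is a deliberately simplified sketch: the numerical "strengths," the size of the activator boost, and the rule that the strongest accessible site always wins are hypothetical assumptions made for illustration, since real splice-site choice is probabilistic and depends on many factors.

```python
def choose_splice_site(strengths, blocked=(), activated=(), boost=1.0):
    """Pick the splice site with the highest effective strength.

    strengths: site name -> intrinsic strength (arbitrary units)
    blocked:   sites masked by a repressor (negative control)
    activated: sites helped by an activator (positive control)
    """
    effective = {
        site: strength + (boost if site in activated else 0.0)
        for site, strength in strengths.items()
        if site not in blocked
    }
    return max(effective, key=effective.get)


sites = {"strong": 0.9, "weak": 0.4}
print(choose_splice_site(sites))                      # strong
print(choose_splice_site(sites, blocked={"strong"}))  # weak
print(choose_splice_site(sites, activated={"weak"}))  # weak
```

Either a repressor on the strong site or an activator on the weak site is enough to tip the balance toward the alternative splicing pattern.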


The Definition of a Gene Has Been Modified Since the Discovery
of Alternative RNA Splicing

The discovery that eukaryotic genes usually contain introns and that their coding
sequences can be assembled in more than one way raised new questions about
the definition of a gene. A gene was first clearly defined in molecular terms in
the early 1940s from work on the biochemical genetics of the fungus Neurospora.
Until then, a gene had been defined operationally as a region of the genome
that segregates as a single unit during meiosis and gives rise to a definable
phenotypic trait, such as a red or a white eye in Drosophila or a round or wrinkled
seed in peas. The work on Neurospora showed that most genes correspond to a
region of the genome that directs the synthesis of a single enzyme. This led to the
hypothesis that one gene encodes one polypeptide chain. The hypothesis proved
fruitful for subsequent research; as more was learned about the mechanism of
gene expression in the 1960s, a gene became identified as that stretch of DNA that
was transcribed into the RNA coding for a single polypeptide chain (or a single
structural RNA such as a tRNA or an rRNA molecule). The discovery of split genes
and introns in the late 1970s could be readily accommodated by the original
definition of a gene, provided that a single polypeptide chain was specified by the
RNA transcribed from any one DNA sequence. But it is now clear that many DNA
sequences in higher eukaryotic cells can produce a set of distinct (but related)
proteins by means of alternative RNA splicing. How, then, is a gene to be defined?

Figure 7-58 Negative and positive control of alternative RNA splicing.
(A) In negative control, a repressor protein binds to a specific sequence in the
pre-mRNA transcript and blocks access of the splicing machinery to a splice junction.
This often results in the use of a secondary splice site, thereby producing an altered
pattern of splicing (see Figure 7-56).
(B) In positive control, the splicing machinery is unable to remove a particular
intron sequence efficiently without assistance from an activator protein.
Because RNA is flexible, the nucleotide sequences that bind these activators can
be located many nucleotide pairs from the splice junctions they control, and they
are often called splicing enhancers, by analogy with the transcriptional enhancers
mentioned earlier in this chapter.

In those relatively rare cases in which a single transcription unit produces two
very different eukaryotic proteins, the two proteins are considered to be produced
by distinct genes that overlap on the chromosome. It seems unnecessarily complex,
however, to consider most of the protein variants produced by alternative
RNA splicing as being derived from overlapping genes. A more sensible alternative
is to modify the original definition to count a DNA sequence that is transcribed as
a single unit and encodes one set of closely related polypeptide chains (protein
isoforms) as a single protein-coding gene. This definition also accommodates
those DNA sequences that encode protein variants produced by post-transcriptional
processes other than RNA splicing, such as transcript cleavage and RNA
editing (discussed below).

A Change in the Site of RNA Transcript Cleavage and Poly-A 
Addition Can Change the C-terminus of a Protein 

We saw in Chapter 6 that the 3' end of a eukaryotic mRNA molecule is not formed
by the termination of RNA synthesis by the RNA polymerase, as it is in bacteria.
Instead, it results from an RNA cleavage reaction that is catalyzed by additional
proteins while the transcript is elongating (see Figure 6-34). A cell can control
the site of this cleavage so as to change the C-terminus of the resultant protein. In
the simplest cases, one protein variant is simply a truncated version of the other;
in many other cases, however, the alternative cleavage and polyadenylation sites
lie within intron sequences and the pattern of splicing is thereby altered. This
process can produce two closely related proteins differing only in the amino acid
sequences at their C-terminal ends. Close analysis of RNAs produced from the
human genome in a variety of cell types (see Figure 7-3) indicates that as many as
50% of human protein-coding genes produce mRNA species that differ at their
site of polyadenylation.

A well-studied example of regulated polyadenylation is the switch from the
synthesis of membrane-bound to secreted antibody molecules that occurs during
the development of B lymphocytes (see Figure 24-22). Early in the life history of
a B lymphocyte, the antibody it produces is anchored in the plasma membrane,
where it serves as a receptor for antigen. Antigen stimulation causes B lymphocytes
to multiply and to begin secreting their antibody. The secreted form of the
antibody is identical to the membrane-bound form except at the extreme
C-terminus. In this part of the protein, the membrane-bound form has a long string
of hydrophobic amino acids that traverses the lipid bilayer of the membrane,
whereas the secreted form has a much shorter string of hydrophilic amino acids.
The switch from membrane-bound to secreted antibody is generated through a
change in the site of RNA cleavage and polyadenylation, as shown in Figure 7-59.

The change is caused by an increase in the concentration of a subunit of a
protein (CstF) that promotes RNA cleavage (see Figure 6-34). The first cleavage/
poly-A addition site that a transcribing RNA polymerase encounters is suboptimal
and is usually skipped in unstimulated B lymphocytes, leading to production
of the longer RNA transcript. When activated to produce antibodies, the B lymphocyte
increases its CstF concentration; as a result, cleavage now occurs at the
suboptimal site, and the shorter transcript is produced. In this way, a change in
concentration of a general RNA-processing factor has a dramatic effect on the
expression of a particular gene.

RNA Editing Can Change the Meaning of the RNA Message 

The molecular mechanisms used by cells are a continual source of surprises. An
example is the process of RNA editing, which alters the nucleotide sequences of
RNA transcripts once they are synthesized and thereby changes the coded message
they carry. We saw in Chapter 6 that tRNA and rRNA molecules are chemically
modified after they are synthesized; here we focus on changes to mRNAs.

In animals, two principal types of mRNA editing occur: the deamination of
adenine to produce inosine (A-to-I editing) and, less frequently, the deamination
of cytosine to produce uracil (C-to-U editing), as shown in Figure 5-43. Because
these chemical modifications alter the pairing properties of the bases (I pairs with
C, and U pairs with A), they can have profound effects on the meaning of the RNA.
If the edit occurs in a coding region, it can change the amino acid sequence of the
protein or produce a truncated protein by creating a premature stop codon. Edits
that occur outside coding sequences can affect the pattern of pre-mRNA splicing,
the transport of mRNA from the nucleus to the cytosol, the efficiency with which
the RNA is translated, or the base-pairing between microRNAs (miRNAs) and
their mRNA targets, a form of regulation that will be discussed later in the chapter.
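The effect of the two edit types on decoding can be made concrete with a short sketch. The Python below is illustrative only: the function names, the four-entry codon table, and the example codons (CAG and CAA) are assumptions chosen for the illustration rather than details given in the text. Inosine is treated as G during decoding because it pairs with C.

```python
# Minimal sketch of how A-to-I and C-to-U edits change the meaning of a codon.
# Only the codons needed for the illustration are listed (standard genetic code).
CODON_TABLE = {
    "CAG": "Gln", "CGG": "Arg",
    "CAA": "Gln", "UAA": "Stop",
}

def apply_edit(codon, position, edit):
    """Return the codon as it is decoded after a single edit.

    edit is "A-to-I" or "C-to-U"; position is 0-based within the codon.
    Inosine pairs with C, so it is decoded as if it were G.
    """
    expected, decoded = {"A-to-I": ("A", "G"), "C-to-U": ("C", "U")}[edit]
    if codon[position] != expected:
        raise ValueError(f"{edit} requires {expected} at position {position}")
    return codon[:position] + decoded + codon[position + 1:]

def describe(codon, position, edit):
    """Summarize the change in meaning caused by one edit."""
    edited = apply_edit(codon, position, edit)
    return f"{codon} ({CODON_TABLE[codon]}) -> {edited} ({CODON_TABLE[edited]})"

print(describe("CAG", 1, "A-to-I"))  # a glutamine codon decoded as arginine
print(describe("CAA", 0, "C-to-U"))  # a glutamine codon converted to a stop codon
```

The first call shows an amino acid substitution and the second a premature stop codon, the two coding-region outcomes described above.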

The process of A-to-I editing is particularly prevalent in humans, where it
occurs in approximately 1000 genes. Enzymes called ADARs (adenosine deaminases
acting on RNA) perform this type of editing; these enzymes recognize a
double-stranded RNA structure that is formed through base-pairing between the
site to be edited and a complementary sequence located elsewhere on the same
RNA molecule, typically in an intron (Figure 7-60). The structure of the double-stranded
RNA specifies whether the mRNA is to be edited, and if so, where
the edit should be made. An especially important example of A-to-I editing takes
place in the mRNA that codes for a transmitter-gated ion channel in the brain. A
single edit changes a glutamine to an arginine; the affected amino acid lies on
the inner wall of the channel, and the editing change alters the Ca²⁺ permeability
of the channel. Mutant mice that cannot make this edit are prone to epileptic
seizures and die during or shortly after weaning, showing that editing of the ion
channel RNA is normally crucial for proper brain development.

C-to-U editing, which is carried out by a different set of enzymes, is also crucial
in mammals. For example, in certain cells of the gut, the mRNA for apolipoprotein
B undergoes a C-to-U edit that creates a premature stop codon and therefore
produces a shorter form of the protein. In cells of the liver, the editing enzyme is
not expressed, and the full-length apolipoprotein B is produced. The two protein
isoforms have different properties, and each plays a role in lipid metabolism that
is specific to the organ that produces it (Figure 7-61).

Figure 7-59 Regulation of the site of RNA cleavage and poly-A addition
determines whether an antibody molecule is secreted or remains
membrane-bound. In unstimulated B lymphocytes (left), a long RNA transcript is
produced, and the intron sequence (yellow) near its 3' end is removed by RNA splicing
to provide an mRNA molecule that codes for a membrane-bound antibody molecule.
Only a portion of the antibody gene is shown in the figure; the actual gene and
its mRNA would extend further to the left of the diagram. After antigen stimulation
(right), the RNA transcript is cleaved and polyadenylated upstream from the
intron's 3' splice site. As a result, some of the intron sequence remains as a
coding sequence in the short transcript and specifies the hydrophilic C-terminal
portion of the secreted antibody molecule (brown). (Adapted from D. Di Giammartino
et al., Mol. Cell 43:853-866, 2011. With permission from Elsevier.)

Figure 7-60 Mechanism of A-to-I RNA editing in mammals. Typically, a sequence
complementary to the position of the edit is present in an intron, and the resulting
double-stranded RNA structure attracts an A-to-I editing enzyme (ADAR). In the case
illustrated, the edit is made in an exon; in most cases, however, this occurs in
noncoding portions of the mRNA. Editing by ADAR takes place in the nucleus, before
the pre-mRNA has been fully processed. Mice and humans have two ADAR genes:
ADR1 is expressed in many tissues and is required in the liver for proper red blood
cell development; ADR2 is expressed only in the brain, where it is required for proper
brain development.

Figure 7-61 C-to-U RNA editing produces a truncated form of
apolipoprotein B.

Why RNA editing exists at all is a mystery. One idea is that it arose in evolution
to correct "mistakes" in the genome. Another is that it arose as a somewhat slapdash
way for the cell to produce subtly different proteins from the same gene. A
third possibility is that RNA editing originally evolved as a defense mechanism
against retroviruses and retrotransposons and was later adapted by the cell to
change the meanings of certain mRNAs. Indeed, RNA editing still plays important
roles in cell defense. Some retroviruses, including HIV, are extensively edited after
they infect cells. This hyperediting creates many harmful mutations in the viral
RNA genome and also causes viral mRNAs to be retained in the nucleus, where
they are eventually degraded. Although some modern retroviruses protect themselves
against this defense mechanism, RNA editing presumably helps to hold
many viruses in check.

RNA Transport from the Nucleus Can Be Regulated

It has been estimated that in mammals only about one-twentieth of the total mass
of RNA synthesized ever leaves the nucleus. We saw in Chapter 6 that most mammalian
RNA molecules undergo extensive processing and that the "leftover" RNA
fragments (excised introns and RNA sequences 3' to the cleavage/poly-A site) are
degraded in the nucleus. Incompletely processed and otherwise damaged RNAs
are also eventually degraded as part of the quality control system for RNA production.

As described in Chapter 6, the export of RNA molecules from the nucleus is
delayed until processing has been completed. However, mechanisms that deliberately
override this control point can be used to regulate gene expression. This
strategy forms the basis for one of the best-understood examples of regulated
nuclear transport of mRNA, which occurs in the human AIDS virus, HIV.

As we saw in Chapter 5, HIV, once inside the cell, directs the formation of a
double-stranded DNA copy of its genome, which is then inserted into the genome
of the host (see Figure 5-62). Once inserted, the viral DNA can be transcribed as
one long RNA molecule by the host cell's RNA polymerase II. This transcript is
then spliced in many different ways to produce over 30 different species of mRNA,
which in turn are translated into a variety of different proteins (Figure 7-62). In
order to make progeny virus, entire, unspliced viral transcripts must be exported
from the nucleus to the cytosol, where they are packaged into viral capsids and
serve as the viral genome. This large transcript, as well as alternatively spliced
HIV mRNAs that the virus needs to move to the cytoplasm for protein synthesis,
still carries complete introns. The host cell's normal block to the nuclear export of
unspliced RNAs therefore presents a special problem for HIV.









The block is overcome in an ingenious way. The virus encodes a protein (called
Rev) that binds to a specific RNA sequence (called the Rev responsive element,
RRE) located within a viral intron. The Rev protein interacts with a nuclear export
receptor (Crm1), which directs the movement of viral RNAs through nuclear pores
into the cytosol despite the presence of intron sequences. We discuss in detail the
way in which export receptors function in Chapter 12.

The regulation of nuclear export by Rev has several important consequences
for HIV growth and pathogenesis. In addition to ensuring the nuclear export
of specific unspliced RNAs, it divides the viral infection into an early phase (in
which Rev is translated from a fully spliced RNA and all of the intron-containing
viral RNAs are retained in the nucleus and degraded) and a late phase (in which
unspliced RNAs are exported due to Rev function). This timing helps the virus
replicate by providing the gene products in roughly the order in which they are
needed (Figure 7-63). Regulation by Rev and by Tat, the HIV protein that counteracts
premature transcription termination (see p. 414), allows the virus to achieve
latency, a condition in which the HIV genome has become integrated into the
host-cell genome but the production of viral proteins has temporarily ceased.

Figure 7-62 The compact genome of HIV, the human AIDS virus. The positions
of the nine HIV genes are shown in green. The red double line indicates a DNA copy
of the viral genome that has become integrated into the host DNA (gray). Note
that the coding regions of many genes overlap, and that those of Tat and Rev are
split by introns. The blue line at the bottom of the figure represents the pre-mRNA
transcript of the viral DNA and shows the locations of all the possible splice
sites (arrows). There are many alternative ways of splicing the viral transcript; for
example, the Env mRNAs retain the intron that has been spliced out of the Tat and
Rev mRNAs. The Rev response element (RRE) is indicated by a blue ball and stick.
It is a 234-nucleotide-long stretch of RNA that folds into a defined structure; Rev
recognizes a particular hairpin within this larger structure.

The Gag gene codes for a protein that is cleaved into several smaller proteins
that form the viral capsid. The Pol gene codes for a protein that is cleaved to
produce reverse transcriptase (which transcribes RNA into DNA), as well as the
integrase involved in integrating the viral genome (as double-stranded DNA) into
the host genome. The Env gene codes for the envelope proteins (see Figure 5-62).
Tat, Rev, Vif, Vpr, Vpu, and Nef are small proteins with a variety of functions. For
example, Rev regulates nuclear export (see Figure 7-63) and Tat regulates the
elongation of transcription across the integrated viral genome (see p. 414).

Figure 7-63 Regulation of nuclear export by the HIV Rev protein. (A) Early in HIV
infection, only the fully spliced RNAs (which contain the coding sequences for Rev, Tat,
and Nef) are exported from the nucleus and translated. (B) Once sufficient Rev protein
has accumulated and been transported into the nucleus, unspliced viral RNAs can
be exported from the nucleus. Many of these RNAs are translated into protein, and
the full-length transcripts are packaged into new viral particles.

If, after its initial entry into a host cell, conditions become unfavorable for viral
transcription and replication, Rev and Tat are made at levels too low to promote
transcription and export of unspliced RNA. This situation stalls the viral growth
cycle until conditions improve, whereupon Rev and Tat levels increase, and the
virus enters the replication cycle.

Some mRNAs Are Localized to Specific Regions of the Cytosol

Once a newly made eukaryotic mRNA molecule has passed through a nuclear
pore and entered the cytosol, it is typically met by ribosomes, which translate
it into a polypeptide chain (see Figure 6-8). Once the first round of translation
"passes" the nonsense-mediated decay test (see Figure 6-76), the mRNA is usually
translated in earnest. If the mRNA encodes a protein that is destined to be
secreted or expressed on the cell surface, a signal sequence at the protein's N-terminus
will direct it to the endoplasmic reticulum (ER). In this case, as discussed
in Chapter 12, components of the cell's protein-sorting apparatus recognize the
signal sequence as soon as it emerges from the ribosome and direct the entire
complex of ribosome, mRNA, and nascent protein to the membrane of the ER,
where the remainder of the polypeptide chain is synthesized. In other cases, free
ribosomes in the cytosol synthesize the entire protein, and signals in the completed
polypeptide chain may then direct the protein to other sites in the cell.

Many mRNAs are themselves directed to specific intracellular locations before
their efficient translation begins, allowing the cell to position its mRNAs close
to the sites where the encoded protein is needed. RNA localization has been
observed in many organisms, including unicellular fungi, plants, and animals,
and it is likely to be a common mechanism that cells use to concentrate high-level
production of proteins at specific sites. This strategy also provides the cell with
other advantages. For example, it allows the establishment of asymmetries in the
cytosol of the cell, a key step in many stages of development. Localized mRNA,
coupled with translational control, also allows the cell to regulate gene expression
independently in different regions. This feature is particularly important in large,
highly polarized cells such as neurons, where it plays a central role in synaptic
function.

Several mechanisms for mRNA localization have been discovered (Figure
7-64), all of which require specific signals in the mRNA itself. These signals are
usually concentrated in the 3' untranslated region (UTR), the region of RNA that
extends from the stop codon that terminates protein synthesis to the start of the
poly-A tail (Figure 7-65). The mRNA localization is usually coupled with translational
controls to ensure that the mRNA remains quiescent until it has been
moved into place.

Figure 7-64 Mechanisms for the localization of mRNAs. The mRNA to
be localized leaves the nucleus through nuclear pores (top). Some localized mRNAs
(left diagram) travel to their destination by associating with cytoskeletal motors, which
use the energy of ATP hydrolysis to move the mRNAs unidirectionally along filaments
in the cytoskeleton (red) (see Chapter 16). At their destination, the mRNAs are held
in place by anchor proteins (black). Other mRNAs randomly diffuse through the
cytosol and are simply trapped by anchor proteins at their sites of localization
(center diagram). Some mRNAs (right diagram) are degraded in the cytosol unless
they have bound, through random diffusion, a localized protein complex that anchors
and protects the mRNA from degradation (black). Each mechanism requires
specific signals on the mRNA, which are typically located in the 3' UTR. Additional
components can block the translation of the mRNA until it is properly localized. (Adapted
from H.D. Lipshitz and C.A. Smibert, Curr. Opin. Genet. Dev. 10:476-488, 2000. With
permission from Elsevier.)

Figure 7-65 An experiment demonstrating the importance of the 3'
UTR in localizing mRNAs to specific regions of the cytoplasm. For
this experiment, two different fluorescently labeled RNAs were prepared by
transcribing DNA in vitro in the presence of fluorescently labeled derivatives of
UTP. One RNA (labeled with a red fluorochrome) contains the coding region
for the Drosophila Hairy protein and includes the adjacent 3' UTR (see Figure
6-21). The other RNA (labeled green) contains the Hairy coding region with
the 3' UTR deleted. The two RNAs were mixed and injected into a Drosophila
embryo at a stage of development when multiple nuclei reside in a common
cytoplasm (see Figure 7-26). When the fluorescent RNAs were visualized
10 minutes later, the full-length hairy RNA (red) was localized to the apical
side of nuclei (blue) but the transcript missing the 3' UTR (green) failed to
localize. Hairy is one of many transcriptional regulators that specify positional
information in the developing Drosophila embryo (discussed in Chapter 21),
and the localization of its mRNA (shown in this experiment to depend on its
3' UTR) is critical for proper fly development. (Courtesy of Simon Bullock and
David Ish-Horowicz.)

The Drosophila egg exhibits an especially striking example of mRNA localization.
The mRNA encoding the Bicoid transcription regulator is localized by
attachment to the cytoskeleton at the anterior tip of the developing egg. When
fertilization triggers the translation of this mRNA, it generates a gradient of the
Bicoid protein that plays a crucial part in directing the development of the anterior
part of the embryo (see Figure 7-26). Many mRNAs in somatic cells are also
localized in a similar way. The mRNA that encodes actin, for example, is localized
to the actin-filament-rich cell cortex in mammalian fibroblasts by means of a 3'
UTR signal.

We saw in Chapter 6 that mRNA molecules exit from the nucleus bearing
numerous markings in the form of RNA modifications (the 5' cap and the 3'
poly-A tail) and bound proteins (exon-junction complexes, for example) that signify
the successful completion of the different pre-mRNA processing steps. As just
described, the 3' UTR of an mRNA can be thought of as a "zip code" that directs
mRNAs to different places in the cell. Below, we will also see that mRNAs carry
information specifying their average lifetime in the cytosol and the efficiency with
which they are translated into protein. In a broad sense, the untranslated regions
of eukaryotic mRNAs resemble the transcriptional control regions of genes: their
nucleotide sequences contain information specifying the way the RNA is to be
used, and proteins interpret this information by binding specifically to these
sequences. Thus, over and above the specification of the amino acid sequences of
proteins, mRNA molecules are rich with information.

The 5' and 3' Untranslated Regions of mRNAs Control Their
Translation

Once an mRNA has been synthesized, one of the most common ways of regulating
the levels of its protein product is to control the step that initiates translation. Even
though the details of translation initiation differ between eukaryotes and bacteria
(as we saw in Chapter 6), they each use some of the same basic regulatory strategies.

In bacterial mRNAs, a conserved stretch of nucleotides, the Shine-Dalgarno
sequence, is always found a few nucleotides upstream of the initiating AUG codon.
In bacteria, translational control mechanisms are carried out by proteins or by
RNA molecules, and they generally involve either exposing or blocking the Shine-Dalgarno
sequence (Figure 7-66).
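The logic of exposing or blocking the Shine-Dalgarno sequence can be sketched in a few lines of Python. This is a toy model under stated assumptions: the motif AGGAGG is used as a stand-in consensus (real sites vary), the 4 to 10 nucleotide spacing window is a hypothetical choice, and "blocking" is represented simply as an index range covered by a bound protein, antisense RNA, or stem-loop. The function names and the example sequence are invented for illustration.

```python
import re

SD_MOTIF = "AGGAGG"  # assumed consensus; natural Shine-Dalgarno sequences vary

def find_sd_start_sites(mrna, min_gap=4, max_gap=10):
    """Return (sd_index, aug_index) pairs in which an SD motif lies
    min_gap..max_gap nucleotides upstream of an AUG codon."""
    sites = []
    for match in re.finditer(f"(?={SD_MOTIF})", mrna):
        sd_end = match.start() + len(SD_MOTIF)
        for gap in range(min_gap, max_gap + 1):
            aug = sd_end + gap
            if mrna[aug:aug + 3] == "AUG":
                sites.append((match.start(), aug))
    return sites

def is_translatable(mrna, blocked=None):
    """Report whether at least one SD/AUG site is accessible.

    blocked is an optional (start, end) index range, end exclusive, that is
    masked by a repressor or by RNA secondary structure.
    """
    for sd, _aug in find_sd_start_sites(mrna):
        sd_start, sd_end = sd, sd + len(SD_MOTIF)
        if blocked is None or blocked[1] <= sd_start or blocked[0] >= sd_end:
            return True
    return False

mrna = "GGCUAGGAGGUUAACCAUGGCUAAAUAA"  # hypothetical message
print(find_sd_start_sites(mrna))
print(is_translatable(mrna))                   # SD exposed
print(is_translatable(mrna, blocked=(2, 12)))  # SD covered
```

In this picture, each regulatory mechanism differs only in what determines the masked range: a bound protein, a temperature-sensitive stem-loop, a ligand-induced rearrangement, or a base-paired antisense RNA.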

Eukaryotic mRNAs do not contain such a sequence. Instead, as discussed in
Chapter 6, the selection of an AUG codon as a translation start site is largely determined
by its proximity to the cap at the 5' end of the mRNA molecule, which is the
site at which the small ribosomal subunit binds to the mRNA and begins scanning
for an initiating AUG codon. In eukaryotes, translational repressors can bind to
the 5' end of the mRNA and thereby inhibit translation initiation. Other repressors
recognize nucleotide sequences in the 3' UTR of specific mRNAs and decrease
translation initiation by interfering with the communication between the 5' cap
and 3' poly-A tail, a step required for efficient translation (see Figure 6-70). A
particularly important type of translational control in eukaryotes relies on small
RNAs (termed microRNAs or miRNAs) that bind to mRNAs and reduce protein
output, as described later in this chapter.

The Phosphorylation of an Initiation Factor Regulates Protein
Synthesis Globally

Eukaryotic cells decrease their overall rate of protein synthesis in response to a
variety of situations, including deprivation of growth factors or nutrients, infection
by viruses, and sudden increases in temperature. Much of this decrease is
caused by the phosphorylation of the translation initiation factor eIF2 by specific
protein kinases that respond to the changes in conditions.

The normal function of eIF2 was outlined in Chapter 6. It forms a complex
with GTP and mediates the binding of the methionyl initiator tRNA to the small
ribosomal subunit, which then binds to the 5' end of the mRNA and begins scanning
along the mRNA. When an AUG codon is recognized, the eIF2 protein hydrolyzes
the bound GTP to GDP, causing a conformational change in the protein and
releasing it from the small ribosomal subunit. The large ribosomal subunit then
joins the small one to form a complete ribosome that begins protein synthesis.

Figure 7-66 Mechanisms of translational control. Although these examples are from bacteria, many of the same principles operate in eukaryotes.
(A) Sequence-specific RNA-binding proteins repress translation of specific mRNAs by blocking access of the ribosome to the Shine-Dalgarno
sequence (orange). For example, some ribosomal proteins repress translation of their own RNA. This mechanism allows the cell to maintain correctly
balanced quantities of the various components needed to form ribosomes. (B) An RNA "thermosensor" permits efficient translation initiation only at
elevated temperatures at which the stem-loop structure has been melted. An example occurs in the human pathogen Listeria monocytogenes, in
which the translation of its virulence genes increases at 37°C, the temperature of the host. (C) Binding of a small molecule to a riboswitch causes
a major rearrangement of the RNA forming a different set of stem-loop structures. In the bound structure, the Shine-Dalgarno sequence (orange)
is sequestered and translation initiation is thereby blocked. In many bacteria, S-adenosylmethionine acts in this manner to block production of
the enzymes that synthesize it. (D) An "antisense" RNA produced elsewhere from the genome base-pairs with a specific mRNA and blocks its
translation. Many bacteria regulate expression of iron-storage proteins in this way.

Because eIF2 binds very tightly to GDP, a guanine nucleotide exchange factor
(see p. 157), designated eIF2B, is required to cause GDP release so that a new
GTP molecule can bind and eIF2 can be reused (Figure 7-67A). The reuse of eIF2
is inhibited when it is phosphorylated—the phosphorylated eIF2 binds to eIF2B
unusually tightly, inactivating eIF2B. There is more eIF2 than eIF2B in cells, and
even a fraction of phosphorylated eIF2 can trap nearly all of the eIF2B. This prevents
the reuse of the nonphosphorylated eIF2 and greatly slows protein synthesis
(Figure 7-67B).
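The arithmetic behind this sequestration effect is easy to sketch. The Python below is a deliberately simplified illustration, not a kinetic model: it assumes that each phosphorylated eIF2 traps one eIF2B molecule essentially irreversibly, and the molecule counts (a fivefold excess of eIF2 over eIF2B) are hypothetical values chosen only to show the trend.

```python
def free_eif2b_fraction(eif2_total, eif2b_total, phospho_fraction):
    """Fraction of eIF2B left free, assuming each phosphorylated eIF2
    traps one eIF2B molecule essentially irreversibly (a simplification)."""
    trapped = min(phospho_fraction * eif2_total, eif2b_total)
    return (eif2b_total - trapped) / eif2b_total

# Hypothetical amounts: five times more eIF2 than eIF2B.
EIF2_TOTAL, EIF2B_TOTAL = 500, 100

for f in (0.0, 0.05, 0.10, 0.20, 0.50):
    free = free_eif2b_fraction(EIF2_TOTAL, EIF2B_TOTAL, f)
    print(f"{f:.0%} of eIF2 phosphorylated -> {free:.0%} of eIF2B free")
```

Under these assumptions, phosphorylating only one-fifth of the eIF2 is enough to remove all of the free exchange factor, which is why a modest degree of phosphorylation can stall recycling of the unphosphorylated majority.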

Regulation of the level of active eIF2 is especially important in mammalian
cells; eIF2 is part of the mechanism that allows cells to enter a nonproliferating,
resting state (called G0) in which the rate of total protein synthesis is reduced to
about one-fifth the rate in proliferating cells.


Initiation at AUG Codons Upstream of the Translation Start Can 
Regulate Eukaryotic Translation Initiation 

We saw in Chapter 6 that eukaryotic translation typically begins at the first AUG
downstream of the 5' end of the mRNA, which is the first AUG encountered by a
scanning small ribosomal subunit. But the nucleotides immediately surrounding
the AUG also influence the efficiency of translation initiation. If the recognition
site is poor enough, scanning ribosomal subunits will sometimes ignore the first
AUG codon in the mRNA and skip to the second or third AUG codon instead. This
phenomenon, known as "leaky scanning," is a strategy frequently used to produce
two or more closely related proteins, differing only in their N-termini, from the
same mRNA. A particularly important use of this mechanism is the production
of the same protein with and without a signal sequence attached at its N-terminus.
This allows the protein to be directed to two different locations in the cell (for
example, to both mitochondria and the cytosol). Cells can regulate the relative
abundance of the protein isoforms produced by leaky scanning; for example, a
cell-type-specific increase in the abundance of the initiation factor eIF4F favors
the use of the AUG closest to the 5' end of the mRNA.
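Leaky scanning can be pictured as a series of independent decisions made by the scanning subunit at each AUG it meets. The sketch below rests on that simplifying assumption; the function name and the recognition probabilities are hypothetical and serve only to show how the ratio of isoforms follows from the strength of each site.

```python
def initiation_fractions(recognition_probs):
    """Given the probability that a scanning subunit recognizes each AUG
    (listed in 5'-to-3' order), return the fraction of subunits that
    initiate at each one. Subunits that skip an AUG keep scanning."""
    fractions = []
    still_scanning = 1.0
    for p in recognition_probs:
        fractions.append(still_scanning * p)
        still_scanning *= (1.0 - p)
    return fractions

# Hypothetical values: a weak first AUG followed by a strong second AUG.
baseline = initiation_fractions([0.3, 0.9])
# The same mRNA when the first AUG is recognized more efficiently.
enhanced = initiation_fractions([0.6, 0.9])

for label, (first, second) in (("baseline", baseline), ("enhanced", enhanced)):
    print(f"{label}: long isoform {first:.2f}, short isoform {second:.2f}")
```

Raising the recognition probability of the first AUG shifts output toward the longer, N-terminally extended isoform, in the way described above for an increase in eIF4F abundance.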

Another type of control found in eukaryotes uses one or more short open reading
frames—short stretches of DNA that begin with a start codon (ATG) and end
with a stop codon, with no stop codons in between—that lie between the 5' end of
the mRNA and the beginning of the gene. Often, the amino acid sequences coded
by these upstream open reading frames (uORFs) are not important; rather, the
uORFs serve a purely regulatory function. An uORF present on an mRNA molecule
will generally decrease translation of the downstream gene by trapping a
scanning ribosome initiation complex and causing the ribosome to translate the
uORF and dissociate from the mRNA before it reaches the bona fide protein-coding
sequence.
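The definition of a uORF given above translates directly into a simple search. The following Python sketch works on the mRNA (so the start codon is written AUG rather than ATG); the function name, the index convention, and the example sequence are assumptions made for illustration, and the search ignores the sequence context that influences how efficiently a real AUG is recognized.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}

def find_uorfs(mrna, main_start):
    """Return (start, end) index pairs of open reading frames that begin
    with an AUG lying entirely upstream of main_start and run to the first
    in-frame stop codon. end is exclusive and includes the stop codon."""
    uorfs = []
    for start in range(0, main_start - 2):
        if mrna[start:start + 3] != "AUG":
            continue
        # Step through the reading frame codon by codon until a stop appears.
        for pos in range(start + 3, len(mrna) - 2, 3):
            if mrna[pos:pos + 3] in STOP_CODONS:
                uorfs.append((start, pos + 3))
                break
    return uorfs

# Hypothetical mRNA: a short uORF in the 5' leader, then the main coding sequence.
mrna = "GGAUGCCCUAAGGCACCAUGGCUUGGUAA"
main_start = mrna.index("AUGGCU")  # position of the main start codon

for start, end in find_uorfs(mrna, main_start):
    print(start, end, mrna[start:end])
```

A scanning subunit that initiates at such an upstream AUG would translate the short frame and tend to dissociate at its stop codon, never reaching the main start codon farther downstream.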

When the activity of a general translation factor (such as the eIF2 discussed
above) is reduced, one might expect that the translation of all mRNAs would be
reduced equally. Contrary to this expectation, however, the phosphorylation of
eIF2 can have selective effects, even enhancing the translation of specific mRNAs
that contain uORFs. This can enable cells, for example, to adapt to starvation for
specific nutrients by shutting down the synthesis of all proteins except those that
are required for synthesis of the missing nutrients. The details of this mechanism
have been worked out for a specific yeast mRNA that encodes a protein called
Gcn4, a transcription regulator that activates many genes that encode proteins
that are important for amino acid synthesis.

The Gcn4 mRNA contains several short uORFs, and when amino acids are
abundant, ribosomes translate the uORFs and generally dissociate before they
reach the Gcn4 coding region. A global decrease in eIF2 activity brought about
by amino acid starvation makes it more likely that a scanning small ribosomal
subunit will move across the uORFs (without translating them) before it acquires
a molecule of eIF2 (see Figure 6-70). Such a ribosomal subunit is then free to initiate
translation on the actual Gcn4 sequences. The increased level of this transcription
regulator increases production of the amino acid biosynthetic enzymes.

Internal Ribosome Entry Sites Provide Opportunities for
Translational Control

Although approximately 90% of eukaryotic mRNAs are translated beginning with
the first AUG downstream from the 5' cap, certain AUGs, as we saw in the previous
section, can be skipped over during the scanning process. In this section,
we discuss yet another way that cells can initiate translation at positions distant
from the 5' end of the mRNA, using a specialized type of RNA sequence called an
internal ribosome entry site (IRES). In some cases, two distinct protein-coding
sequences are carried in tandem on the same eukaryotic mRNA; translation of
the first occurs by the usual scanning mechanism, and translation of the second
occurs through an IRES. IRESs are typically several hundred nucleotides in length
and fold into specific structures that bind many, but not all, of the same proteins
that are used to initiate normal 5' cap-dependent translation (Figure 7-68). In
fact, different IRESs require different subsets of initiation factors. However, all of
them bypass the need for a 5' cap structure and the translation initiation factor
that recognizes it, eIF4E.

Figure 7-68 Two mechanisms of translation initiation. (A) The normal,
cap-dependent mechanism requires a set of initiation factors whose assembly on the
mRNA is stimulated by the presence of a 5' cap and a poly-A tail (see also Figure 6-70).
(B) The IRES-dependent mechanism, seen mainly in viruses, requires only a subset of
the normal translation initiating factors, and these assemble directly on the folded IRES.
(Adapted from A. Sachs, Cell 101:243-245, 2000. With permission from Elsevier.)

Some viruses use IRESs as part of a strategy to get their own mRNA molecules
translated while blocking normal 5' cap-dependent translation of host mRNAs.
On infection, these viruses produce a protease (encoded in the viral genome) that
cleaves the host-cell translation factor eIF4G, rendering it unable to bind to eIF4E,
the cap-binding complex. This shuts down most of the host cell's translation and
effectively diverts the translation machinery to the IRES sequences present on the
viral mRNAs. (The truncated eIF4G remains competent to initiate translation at
these internal sites.)

The many ways in which viruses manipulate their host's protein-synthesis 
machinery for their own advantage continue to surprise cell biologists. Studying 
this "arms race" between humans and pathogens has led to many fundamental
insights into the workings of the cell, and we revisit this topic in more detail in 
Chapter 23. 


Changes in mRNA Stability Can Regulate Gene Expression 

Most mRNAs in a bacterial cell are very unstable, having half-lives of less than a
couple of minutes. Exonucleases, which degrade in the 3'-to-5' direction, are usually
responsible for the rapid destruction of these mRNAs. Because its mRNAs are
both rapidly synthesized and rapidly degraded, a bacterium can adapt quickly to
environmental changes.

As a general rule, the mRNAs in eukaryotic cells are more stable. Some, such
as that encoding β-globin, have half-lives of more than 10 hours, but most have
considerably shorter half-lives, typically less than 30 minutes. The mRNAs that
code for proteins such as growth factors and transcription regulators, whose production
rates need to change rapidly in cells, have especially short half-lives.
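To see what these half-lives mean in practice, it helps to compute how much of an mRNA population survives over a fixed interval. The sketch below assumes simple first-order (exponential) decay with no new synthesis, which is a simplification; the specific half-life values are illustrative picks from the ranges quoted above, and the function name is hypothetical.

```python
def fraction_remaining(minutes_elapsed, half_life_minutes):
    """Fraction of an mRNA population left after a given time, assuming
    simple first-order (exponential) decay and no new synthesis."""
    return 0.5 ** (minutes_elapsed / half_life_minutes)

# Illustrative half-lives, in minutes, taken from the ranges quoted in the text.
examples = {
    "bacterial mRNA (2 min)": 2,
    "typical eukaryotic mRNA (30 min)": 30,
    "long-lived eukaryotic mRNA (600 min)": 600,
}

for name, half_life in examples.items():
    left = fraction_remaining(60, half_life)
    print(f"{name}: {left:.3g} of the population left after 1 hour")
```

The comparison makes the regulatory point plain: an hour after transcription stops, a short-lived message has essentially vanished, whereas a long-lived one is still largely intact, so only the former allows protein production to be shut off quickly.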

We saw in Chapter 6 that the cell has several mechanisms that rapidly destroy
incorrectly processed RNAs; here, we consider the fate of the typical "normal"
eukaryotic mRNA. Two general mechanisms exist for eventually destroying each
mRNA that is made by the cell. Both begin with the gradual shortening of the
poly-A tail by an exonuclease, a process that starts as soon as the mRNA reaches
the cytosol. In a broad sense, this poly-A shortening acts as a timer that counts
down the lifetime of each mRNA. Once the poly-A tail is reduced to a critical
length (about 25 nucleotides in humans), the two pathways diverge. In one, the 5'
cap is removed (a process called decapping) and the "exposed" mRNA is rapidly
degraded from its 5' end. In the other, the mRNA continues to be degraded from
the 3' end, through the poly-A tail, into the coding sequences (Figure 7-69).

Nearly all mRNAs are subject to both types of decay, and the specific sequences
of each mRNA determine how fast each step occurs and therefore how long
each mRNA will persist in the cell and be able to produce protein. The 3' UTR
sequences are especially important in controlling mRNA lifetimes, and they often
carry binding sites for specific proteins that increase or decrease the rate of poly-A
shortening, decapping, or 3'-to-5' degradation. The half-life of an mRNA is also
affected by how efficiently it is translated. Poly-A shortening and decapping compete
directly with the machinery that translates the mRNA; therefore, any factors
that affect the translation efficiency of an mRNA will tend to have the opposite
effect on its degradation (Figure 7-70).

Figure 7-69 Two mechanisms of eukaryotic mRNA decay. A critical
threshold of poly-A tail length induces rapid 3'-to-5' degradation, which may
be triggered by the loss of the poly-A-binding proteins. As shown in Figure 7-70,
a deadenylase associates with both the 3' poly-A tail and the 5' cap, and this
connection may be involved in signaling decapping after poly-A shortening.
Although 5'-to-3' and 3'-to-5' degradation are shown here on separate RNA
molecules, these two processes can occur together on the same molecule. (Adapted
from C.A. Beelman and R. Parker, Cell 81:179-183, 1995. With permission from
Elsevier.)

Figure 7-70 The competition between mRNA translation and mRNA decay.
The same two features of an mRNA molecule (its 5' cap and the 3' poly-A
tail) are used in both translation initiation and deadenylation-dependent mRNA
decay (see Figure 7-69). The deadenylase that shortens the poly-A tail in the
3'-to-5' direction associates with the 5' cap. As described in Chapter 6 (see Figure
6-70), the translation initiation machinery also associates with both the 5' cap and
the poly-A tail. (Adapted from M. Gao et al., Mol. Cell 5:479-488, 2000. With
permission from Elsevier.)

Although poly-A shortening controls the half-life of most eukaryotic mRNAs,
some mRNAs can be degraded by a specialized mechanism that bypasses this
step altogether. In these cases, specific nucleases cleave the mRNA internally,
effectively decapping one end and removing the poly-A tail from the other so
that both halves are rapidly degraded. The mRNAs that are destroyed in this way
carry specific nucleotide sequences, often in the 3' UTRs, that serve as recognition
sequences for these endonucleases. This strategy makes it especially simple
to tightly regulate the stability of these mRNAs by blocking or exposing the
endonuclease site in response to extracellular signals. For example, the addition of iron
to cells decreases the stability of the mRNA that encodes the receptor protein that
binds the iron-transporting protein transferrin, causing less of this receptor to
be made. This effect is mediated by the iron-sensitive RNA-binding protein
aconitase. Aconitase can bind to the 3' UTR of the transferrin receptor mRNA and
increase receptor production by blocking endonucleolytic cleavage of the mRNA.
On the addition of iron, aconitase is released from the mRNA, exposing the cleavage
site and thereby decreasing the stability of the mRNA (Figure 7-71).
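
The iron example can be summarized as a small truth table. The sketch below is only a qualitative restatement of the logic in the text and in the caption of Figure 7-71 (which adds that aconitase bound to the 5' UTR of ferritin mRNA blocks translation initiation); the function and key names are hypothetical, and real cells respond in a graded rather than all-or-none fashion.

```python
def iron_response(iron_abundant: bool) -> dict[str, bool]:
    """Qualitative model of the two aconitase-mediated controls.

    Aconitase binds its stem-loop sites on mRNA only when it is NOT bound
    to iron, so abundant iron releases it from both mRNAs.
    """
    aconitase_on_mrna = not iron_abundant
    return {
        # Aconitase on the 5' UTR of ferritin mRNA blocks translation initiation.
        "ferritin_translated": not aconitase_on_mrna,
        # Aconitase on the 3' UTR of transferrin receptor mRNA hides the
        # endonuclease cleavage site, which stabilizes the mRNA.
        "transferrin_receptor_mrna_stable": aconitase_on_mrna,
    }


for iron in (False, True):
    print("iron abundant" if iron else "iron starved", iron_response(iron))
```

The same regulatory protein therefore produces opposite outcomes for the two mRNAs, because it acts on translation initiation in one case and on mRNA stability in the other.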


Regulation of mRNA Stability Involves P-bodies and Stress
Granules 


We saw in Chapters 3 and 6 that large aggregates of proteins and nucleic acids
that work together are often held in proximity by loose, low-affinity connections
(see Figure 3-36). In this way, they function as “organelles” even though they are
not surrounded by membranes. Many of the events discussed in the previous
section, including decapping and RNA degradation, take place in aggregates
known as Processing- or P-bodies, which are present in the cytosol (Figure 7-72).

Figure 7-71 Two post-transcriptional controls mediated by iron. (A) During
iron starvation, the binding of aconitase to the 5' UTR of the ferritin mRNA blocks
translation initiation; its binding to the 3' UTR of the transferrin receptor mRNA
blocks an endonuclease cleavage site and thereby stabilizes the mRNA.
(B) In response to an increase in iron concentration in the cytosol, a cell
increases its synthesis of ferritin in order to bind the extra iron and decreases its
synthesis of transferrin receptors in order to import less iron across the plasma
membrane. Both responses are mediated by the same iron-responsive regulatory
protein, aconitase, which recognizes common features in a stem-loop structure
in the mRNAs encoding ferritin and the transferrin receptor. Aconitase dissociates
from the mRNA when it binds iron. But because the transferrin receptor and
ferritin are regulated by different types of mechanisms, their levels respond
oppositely to iron concentrations even though they are regulated by the same
iron-responsive regulatory protein. (Adapted from M.W. Hentze et al., Science
238:1570-1573, 1987 and J.L. Casey et al., Science 240:924-928, 1988. With
permission from AAAS.)

Figure 7-72 Visualization of P-bodies. Human cells were stained with antibodies
to a component of the mRNA decapping enzyme Dcp1a (left panels) and to the
Argonaute protein (middle panels). As described later in this chapter, Argonaute
is a key component of RNA interference pathways. The merged image (right panels)
shows that the two proteins co-localize to P-bodies in the cytoplasm. (Adapted from
J. Liu et al., Nat. Cell Biol. 7:719-723, 2005. With permission from Macmillan
Publishers Ltd.)

Although many mRNAs are eventually degraded in P-bodies, some remain
intact and are later returned to the pool of translating mRNAs. To be “rescued”
in this way, mRNAs move from P-bodies to another type of aggregate known as a
stress granule, which contains translation initiation factors, poly-A-binding
protein, and small ribosomal subunits. Translation itself does not occur in stress
granules, but mRNAs can become “translation-ready” as the proteins bound to
them in P-bodies are replaced with those in stress granules. The movement of
mRNAs between active translation, P-bodies, and stress granules can be seen as
an mRNA cycle (Figure 7-73) where the competition between translation and
mRNA degradation is carefully controlled. Thus, when translation initiation is
blocked (by starvation, drugs, or genetic manipulation), stress granules enlarge
as more and more nontranslated mRNAs are moved directly into them for storage.
Clearly, once a cell has made the large investment in producing a properly
processed mRNA molecule, it carefully controls its subsequent fate.

Summary 

Many steps in the pathway from RNA to protein are regulated by cells in order to
control gene expression. Most genes are regulated at multiple levels, in addition
to being controlled at the initiation stage of transcription. The regulatory
mechanisms include (1) attenuation of the RNA transcript by its premature termination,
(2) alternative RNA splice-site selection, (3) control of 3'-end formation by cleavage
and poly-A addition, (4) RNA editing, (5) control of transport from the nucleus to
the cytosol, (6) localization of mRNAs to particular parts of the cell, (7) control of
translation initiation, and (8) regulated mRNA degradation. Most of these control
processes require the recognition of specific sequences or structures in the RNA
molecule being regulated, a task performed by either regulatory proteins or regulatory
RNA molecules.

Figure 7-73 Possible fates of an mRNA molecule. An mRNA molecule released
from the nucleus can be actively translated (center), stored in stress granules
(right), or degraded in P-bodies (left). As the needs of the cell change, mRNAs can
be shuffled from one pool to the next, as indicated by the arrows.

REGULATION OF GENE EXPRESSION BY NONCODING RNAs

In the previous chapter, we introduced the central dogma, according to which the
flow of genetic information proceeds from DNA through RNA to protein (Figure
6-1). But we have seen throughout this book that RNA molecules perform many
critical tasks in the cell besides serving as intermediate carriers of genetic
information. Among these noncoding RNAs are the rRNA and tRNA molecules, which
are responsible for reading the genetic code and synthesizing proteins. The RNA
molecule in telomerase serves as a template for the replication of chromosome
ends, snoRNAs modify ribosomal RNA, and snRNAs carry out the major events of
RNA splicing. And we saw in the previous section that Xist RNA has an important
role in inactivating one copy of the X chromosome in females.

A series of recent discoveries has revealed that noncoding RNAs are even more
prevalent than previously imagined. We now know that such RNAs play widespread
roles in regulating gene expression and in protecting the genome from
viruses and transposable elements. These newly discovered RNAs are the subject
of this section.

Small Noncoding RNA Transcripts Regulate Many Animal and 
Plant Genes Through RNA Interference

We begin our discussion with a group of short RNAs that carry out RNA
interference, or RNAi. Here, short single-stranded RNAs (20-30 nucleotides) serve as
guide RNAs that selectively recognize and bind, through base-pairing, other
RNAs in the cell. When the target is a mature mRNA, the small noncoding RNAs
can inhibit its translation or even catalyze its destruction. If the target RNA
molecule is in the process of being transcribed, the small noncoding RNA can bind to
it and direct the formation of certain types of repressive chromatin on its attached
DNA template (Figure 7-74). Three classes of small noncoding RNAs work in this
way: microRNAs (miRNAs), small interfering RNAs (siRNAs), and piwi-interacting
RNAs (piRNAs). We discuss them in turn in the next sections. Although
they differ in the way the short pieces of single-stranded RNA are generated, all
three types of short RNAs locate their targets through RNA-RNA base-pairing, and
they generally cause reductions in gene expression.

miRNAs Regulate mRNA Translation and Stability 

Over 1000 different microRNAs (miRNAs) are produced from the human genome,
and these appear to regulate at least one-third of all human protein-coding genes.
Once made, miRNAs base-pair with specific mRNAs and fine-tune their translation
and stability. The miRNA precursors are synthesized by RNA polymerase II
and are capped and polyadenylated. They then undergo a special type of processing,
after which the miRNA (typically 23 nucleotides in length) is assembled with
a set of proteins to form an RNA-induced silencing complex or RISC. Once formed,
the RISC seeks out its target mRNAs by searching for complementary nucleotide
sequences (Figure 7-75). This search is greatly facilitated by the Argonaute protein,
a component of RISC, which holds the 5' region of the miRNA so that it is
optimally positioned for base-pairing to another RNA molecule (Figure 7-76). In
animals, the extent of base-pairing is typically at least seven nucleotide pairs, and
this pairing most often occurs in the 3' UTR of the target mRNA.

Figure 7-74 RNA interference in eukaryotes. Single-stranded interfering
RNAs are generated from double-stranded RNA. They locate target RNAs through
base-pairing and, at this point, several fates are possible, as shown. As described in
the text, there are several types of RNA interference; the way the double-stranded
RNA is produced and processed and the ultimate fate of the target RNA depend on
the particular system.

Once an mRNA has been bound by an miRNA, several outcomes are possible.
If the base-pairing is extensive (which is unusual in humans but common in many
plants), the mRNA is cleaved (sliced) by the Argonaute protein, effectively removing
the mRNA's poly-A tail and exposing it to exonucleases (see Figure 7-69). Following
cleavage of the mRNA, the RISC with its associated miRNA is released, and
it can seek out additional mRNAs (see Figure 7-75). Thus, a single miRNA can act
catalytically to destroy many complementary mRNAs. These miRNAs can thus be
thought of as guide sequences that bring destructive nucleases into contact with
specific mRNAs.

If the base-pairing between the miRNA and the mRNA is less extensive (as 
observed for most human miRNAs), Argonaute does not slice the mRNA; rather, 
translation of the mRNA is repressed and the mRNA is shuttled to P-bodies (see 
Figure 7-73) where, sequestered from ribosomes, it eventually undergoes poly-A 
tail shortening, decapping, and degradation.
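
The seed-based target search described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the seed is taken to be nucleotides 2 to 8 of the miRNA (a common convention that the text does not specify), only Watson-Crick pairs are counted, and the sequences are made up for the example rather than taken from any real miRNA or mRNA. The function names are hypothetical.

```python
# Watson-Crick complements for RNA (G-U wobble pairs are ignored here).
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the sequence that pairs antiparallel with `rna`, written 5'-to-3'."""
    return rna.translate(COMPLEMENT)[::-1]


def seed_sites(mirna: str, utr: str, seed_start: int = 1, seed_len: int = 7) -> list[int]:
    """Return 0-based positions in `utr` that are perfectly complementary
    to the miRNA seed (by default nucleotides 2-8 counted from the 5' end)."""
    seed = mirna[seed_start:seed_start + seed_len]
    site = reverse_complement(seed)
    positions = []
    start = utr.find(site)
    while start != -1:
        positions.append(start)
        start = utr.find(site, start + 1)
    return positions


# Made-up sequences for illustration only.
mirna = "UACGGAUCCAGUUCGAUAGCUA"          # 22-nt toy miRNA; seed is ACGGAUC
site = reverse_complement(mirna[1:8])     # GAUCCGU
utr = "AAGC" + site + "UUACCG" + site + "AA"

print(seed_sites(mirna, utr))             # two seed matches in this toy 3' UTR
```

A short seed means that one miRNA can find matching sites in many different mRNAs, and that one 3' UTR can carry sites for several miRNAs.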

Several features make miRNAs especially useful regulators of gene expression.
First, a single miRNA can regulate a whole set of different mRNAs, so long as the
mRNAs carry a common short sequence in their UTRs. This situation is common
in humans, where a single miRNA can control hundreds of different mRNAs.
Second, regulation by miRNAs can be combinatorial. When the base-pairing between
the miRNA and mRNA fails to trigger cleavage, additional miRNAs binding to the
same mRNA lead to further reductions in its translation. As discussed earlier for
transcription regulators, combinatorial control greatly expands the possibilities
available to the cell by linking gene expression to a combination of different
regulators rather than a single regulator. Third, an miRNA occupies relatively little
space in the genome when compared with a protein. Indeed, their small size is
one reason that miRNAs were discovered only recently. Although we are only
beginning to appreciate the full impact of miRNAs, it is clear that they represent
an important part of the cell's equipment for regulating the expression of genes.
We discuss specific examples of miRNAs that have key roles in development in
Chapter 21.

Figure 7-75 miRNA processing and mechanism of action. The precursor
miRNA, through complementarity between one part of its sequence and another,
forms a double-stranded structure. This RNA is cropped while still in the nucleus
and then exported to the cytosol, where it is further cleaved by the Dicer enzyme
to form the miRNA proper. Argonaute, in conjunction with other components of
RISC, initially associates with both strands of the miRNA and then cleaves and
discards one of them. The other strand guides RISC to specific mRNAs through
base-pairing. If the RNA-RNA match is extensive, as is commonly seen in plants,
Argonaute cleaves the target mRNA, causing its rapid degradation. In mammals,
the miRNA-mRNA match often does not extend beyond a short seven-nucleotide
“seed” region near the 5' end of the miRNA. This less extensive base-pairing
leads to inhibition of translation, mRNA destabilization, and transfer of the mRNA to
P-bodies, where it is eventually degraded.

Figure 7-76 Human Argonaute protein carrying an miRNA. The protein is folded
into four structural domains, each indicated by a different color. The miRNA is held
in an extended form that is optimal for forming RNA-RNA base pairs. The active site
of Argonaute that “slices” a target RNA, when it is extensively base-paired with the
miRNA, is indicated in red. Many Argonaute proteins (three out of the four human
proteins, for example) lack the catalytic site and therefore bind target RNAs without
slicing them. (Adapted from C.D. Kuhn and L. Joshua-Tor, Trends Biochem. Sci.
38:263-271, 2013. With permission from Cell Press.)

RNA Interference Is Also Used as a Cell Defense Mechanism

Many of the proteins that participate in the miRNA regulatory mechanisms just
described also serve a second function as a defense mechanism: they orchestrate
the degradation of foreign RNA molecules, specifically those that occur in
double-stranded form. Many transposable elements and viruses produce
double-stranded RNA, at least transiently, in their life cycles, and RNA interference
helps to keep these potentially dangerous invaders in check. As we shall see, this
form of RNAi also provides scientists with a powerful experimental technique to
turn off the expression of individual genes.

The presence of double-stranded RNA in the cell triggers RNAi by attracting
a protein complex containing Dicer, the same nuclease that processes miRNAs
(see Figure 7-75). This protein cleaves the double-stranded RNA into small
fragments (approximately 23 nucleotide pairs) called small interfering RNAs
(siRNAs). These double-stranded siRNAs are then bound by Argonaute and other
components of RISC. As we saw above for miRNAs, one strand of the duplex RNA
is then cleaved by Argonaute and discarded. The single-stranded siRNA molecule
that remains directs RISC back to complementary RNA molecules produced by
the virus or transposable element. Because the match is usually exact, Argonaute
cleaves these molecules, leading to their rapid destruction.

Each time RISC cleaves a new RNA molecule, the RISC is released; thus, as
we saw for miRNAs, a single RNA molecule can act catalytically to destroy many
complementary RNAs. Some organisms employ an additional mechanism that
amplifies the RNAi response even further. In these organisms, RNA-dependent
RNA polymerases use siRNAs as primers to produce additional copies of
double-stranded RNAs, which are then cleaved into siRNAs. This amplification ensures
that, once initiated, RNA interference can continue even after all the initiating
double-stranded RNA has been degraded or diluted out. For example, it permits
progeny cells to continue carrying out the specific RNA interference that was
provoked in the parent cells.

In some organisms, the RNA interference activity can be spread by the transfer
of RNA fragments from cell to cell. This is particularly important in plants (whose
cells are linked by fine connecting channels, as discussed in Chapter 19), because
it allows an entire plant to become resistant to an RNA virus after only a few of
its cells have been infected. In a broad sense, the RNAi response resembles certain
aspects of the animal immune system; in both, an invading organism elicits a
customized response, and, through amplification of the “attack” molecules, the
host becomes systemically protected.

We have seen that although miRNAs and siRNAs are generated in slightly different
ways, they rely on the same proteins and seek out their targets in a fundamentally
similar manner. Because siRNAs are found in widespread species, they
are believed to be the most ancient form of RNA interference, with miRNAs being
a later refinement. These siRNA-mediated defense mechanisms are crucial for
plants, worms, and insects. In mammals, a protein-based system (described in
Chapter 24) has largely taken over the task of fighting off viruses.

RNA Interference Can Direct Heterochromatin Formation

The siRNA interference pathway just described does not necessarily stop with
the destruction of target RNA molecules. In some cases, the RNA interference
machinery can also selectively shut off synthesis of the target RNAs. For this to
occur, the short siRNAs produced by the Dicer protein are assembled with a group
of proteins (including Argonaute) to form the RITS (RNA-induced transcriptional
silencing) complex. Using single-stranded siRNA as a guide sequence, this complex
binds complementary RNA transcripts as they emerge from a transcribing
RNA polymerase II (Figure 7-77). Positioned on the genome in this way, the RITS
complex attracts proteins that covalently modify nearby histones and eventually
direct the formation of heterochromatin to prevent further transcription initiation.
In some cases, an RNA-dependent RNA polymerase and a Dicer enzyme are
also recruited by the RITS complex to continually generate additional siRNAs in
situ. This positive feedback loop ensures continued repression of the target gene
even after the initiating siRNA molecules have disappeared.

RNAi-directed heterochromatin formation is an important cell defense mechanism
that limits the spread of transposable elements in genomes by maintaining
their DNA sequences in a transcriptionally silent form. However, this same
mechanism is also used in some normal processes in the cell. For example, in
many organisms, the RNA interference machinery maintains the heterochromatin
formed around centromeres. Centromeric DNA sequences are transcribed
in both directions, producing complementary RNA transcripts that can base-pair
to form double-stranded RNA. This double-stranded RNA triggers the RNA
interference pathway and stimulates formation of the heterochromatin that surrounds
centromeres, which is necessary for the centromeres to segregate chromosomes
accurately during mitosis.

Figure 7-77 RNA interference directed by siRNAs. In many organisms,
double-stranded RNA can trigger both the destruction of complementary mRNAs
(left) and transcriptional silencing (right). The change in chromatin structure
induced by the bound RITS (RNA-induced transcriptional silencing) complex
resembles that in Figure 7-45.

piRNAs Protect the Germ Line from Transposable Elements

A third system of RNA interference relies on piRNAs (piwi-interacting RNAs,
named for Piwi, a class of proteins related to Argonaute). piRNAs are made
specifically in the germ line, where they block the movement of transposable elements.
Found in many organisms, including humans, genes coding for piRNAs consist
largely of sequence fragments of transposable elements. These clusters of fragments
are transcribed and broken up into short, single-stranded piRNAs. The processing
differs from that for miRNAs and siRNAs (for one thing, the Dicer enzyme
is not involved), and the resulting piRNAs are slightly longer than miRNAs and
siRNAs; moreover, they are complexed with Piwi rather than Argonaute proteins.
Once formed, the piRNAs seek out RNA targets by base-pairing and, much like
siRNAs, transcriptionally silence intact transposon genes and destroy any RNA
(including mRNAs) produced by them.

Many mysteries surround piRNAs. Over a million piRNA species are coded in
the genomes of many mammals and expressed in the testes, yet only a small fraction
seem to be directed against the transposons present in those genomes. Are
the piRNAs remnants of past invaders? Do they cover so much “sequence space”
that they are broadly protective for any foreign DNA? Another curious feature
of piRNAs is that many of them (particularly if base-pairing does not have to be
perfect) should, in principle, attack the normal mRNAs made by the organism,
yet they do not. It has been proposed that these large numbers of piRNAs may
form a system to distinguish “self” RNAs from “foreign” RNAs and attack only the
latter. If this is the case, there must be a special way for the cell to spare its own
RNAs. One idea is that RNAs produced in the previous generation of an organism
are somehow registered and set aside from piRNA attack in subsequent generations.
Whether or not this mechanism truly exists, and, if so, how it might work,
are questions that demonstrate our incomplete understanding of the full implications
of RNA interference.

RNA Interference Has Become a Powerful Experimental Tool

Although it likely arose as a defense mechanism against viruses and transposable
elements, RNA interference, as we have seen, has become thoroughly integrated
into many aspects of normal cell biology, ranging from the control of gene expression
to the structure of chromosomes. It has also been developed by scientists
into a powerful experimental tool that allows almost any gene to be inactivated by
evoking an RNAi response to it. This technique, which can be readily carried out
in cultured cells and, in many cases, whole animals and plants, has made possible
new genetic approaches in cell and molecular biology. We shall discuss it in
detail in the following chapter where we cover modern genetic methods used to
study cells (see pp. 499-501). RNAi also has potential in treating human disease.
Since many human disorders result from the misexpression of genes, the ability to
turn these genes off by experimentally introducing complementary siRNA molecules
holds great medical promise. Although the mechanism of RNA interference
was discovered a few decades ago, we are still being surprised by its mechanistic
details and by its broad biological implications.

Bacteria Use Small Noncoding RNAs to Protect Themselves from 
Viruses 

Bacteria make up the vast majority of the Earth's biomass and, not surprisingly,
viruses that infect bacteria greatly outnumber plant and animal viruses. These
viruses generally have DNA genomes. A recent discovery revealed that many
species of bacteria (and almost all species of archaebacteria) use a repository of
small noncoding RNA molecules to seek out and destroy the DNA of the invading
viruses. Many features of this defense mechanism, known as the CRISPR system,
resemble those we saw above for miRNAs and siRNAs, but there are two important
differences. First, when bacteria and archaea are first infected by a virus, they
have a mechanism that causes short fragments of that viral DNA to become integrated
into their genomes. These serve as "vaccinations," in the sense that they
become the templates for producing small noncoding RNAs known as crRNAs
(CRISPR RNAs) that will thereafter destroy the virus should it reinfect the descendants
of the original cell. This aspect of the CRISPR system is similar in principle
to adaptive immunity in mammals, in that the cell carries a record of past exposures
that is used to protect against future exposures. The second distinguishing
feature of the CRISPR system is that these crRNAs then become associated with
special proteins that allow them to seek out and destroy double-stranded DNA
molecules, rather than single-stranded RNA molecules.

Although many details of CRISPR-mediated immunity remain to be discovered,
we can outline the general process in three steps (Figure 7-78). In the first,
viral DNA sequences are integrated into special regions of the bacterial genome
known as CRISPR (clustered regularly interspersed short palindromic repeat)
loci, named for the peculiar structure that first drew the attention of scientists.
In its simplest form, a CRISPR locus consists of several hundred repeats of a host
DNA sequence interspersed with a large collection of sequences (typically 25-70
nucleotide pairs each) that has been derived from prior exposures to viruses and
other foreign DNA. The newest viral sequence is always integrated at the 5' end of
the CRISPR locus, the end that is transcribed first. Each locus, therefore, carries
a temporal record of prior infections. Many bacterial and archaeal species carry
several large CRISPR loci in their genomes and are thus immune to a wide range
of viruses.

In the second step, the CRISPR locus is transcribed to produce a long RNA
molecule, which is then processed into the much shorter (approximately 30
nucleotides) crRNAs. In the third step, crRNAs complexed with Cas (CRISPR-associated)
proteins seek out complementary viral DNA sequences and direct their destruction
by nucleases. Although structurally dissimilar, Cas proteins are analogous to
the Argonaute and Piwi proteins discussed above: they hold small single-stranded
RNAs in an extended configuration that is optimized, in this case, for seeking and
forming complementary base pairs with DNA.
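
The third step, together with the requirement for an adjacent PAM described in the caption of Figure 7-78, can be sketched as a simple sequence search. This is a minimal illustration under stated assumptions: the crRNA guide is represented by the equivalent DNA sequence of its target, only one DNA strand is searched, the PAM is taken to lie immediately 3' of the matched sequence, and the spacer, repeat, and PAM strings are all invented for the example. Real PAM sequences and their positions differ between CRISPR systems.

```python
def find_targets(spacer: str, dna: str, pam: str) -> list[int]:
    """Return 0-based positions where `spacer` occurs in `dna` and is
    immediately followed by `pam` (all written 5'-to-3' on one strand)."""
    hits = []
    start = dna.find(spacer)
    while start != -1:
        end = start + len(spacer)
        if dna[end:end + len(pam)] == pam:
            hits.append(start)
        start = dna.find(spacer, start + 1)
    return hits


# All sequences below are made up for illustration.
SPACER = "ATGCCGTTAGCATCGGATACCTGAAGTCCA"   # 30-nt viral-derived sequence
PAM = "TGG"                                 # hypothetical PAM
REPEAT = "GTTTTAGAGC"                       # hypothetical host repeat

viral_dna = "CCA" + SPACER + PAM + "AAT"    # invading DNA carries spacer + PAM
crispr_locus = REPEAT + SPACER + REPEAT     # host copy is flanked by repeats

print(find_targets(SPACER, viral_dna, PAM))     # match found in the virus
print(find_targets(SPACER, crispr_locus, PAM))  # host locus has no PAM
```

Because the PAM lies outside the stored sequence, the host's own CRISPR locus matches the guide but lacks the adjacent motif, so in this sketch it is not flagged as a target.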

We still have much to learn about CRISPR-based immunity in bacteria and
archaebacteria. The mechanism through which viral sequences are first identified
and integrated into the host genome is poorly understood, as is the way that the
crRNAs find their complementary sequences in double-stranded DNA. Moreover,
in different species of bacteria and archaebacteria, crRNAs are processed in different
ways, and in some cases, the crRNAs can attack viral RNAs as well as DNAs.

We shall see in the following chapter that bacterial CRISPR systems have
already been artificially “moved” into plants and animals, where they have
become very powerful experimental tools for manipulating genomes.

Figure 7-78 CRISPR-mediated immunity in bacteria and archaebacteria. After
infection by a virus (left panel), a small bit of DNA from the viral genome is inserted
into the CRISPR locus. For this to happen, a small fraction of infected cells must
survive the initial viral infection. The surviving cells, or more generally their
descendants, transcribe the CRISPR locus and process the transcript into crRNAs
(middle panel). Upon reinfection with a virus that the population has already been
“vaccinated” against, the incoming viral DNA is destroyed by a complementary crRNA
(right panel). For a CRISPR system to be effective, the crRNAs must not destroy
the CRISPR locus itself, even though the crRNAs are complementary in sequence
to it. In many species, in order for crRNAs to attack an invading DNA molecule,
there must be additional short nucleotide sequences that are carried by the target
molecule. Because these sequences, known as PAMs (protospacer adjacent
motifs), lie outside the crRNA sequences, the host CRISPR locus is spared (see
Figure 8-55).

Long Noncoding RNAs Have Diverse Functions in the Cell

In this and the preceding chapters, we have seen that noncoding RNA molecules
have many functions in the cell. Yet, as is the case with proteins, there remain
many noncoding RNAs whose function is still unknown. Many RNAs of unknown
function belong to a group known as long noncoding RNA (lncRNA). These are
arbitrarily defined as RNAs longer than 200 nucleotides that do not code for protein.
As methods have improved for determining the nucleotide sequences of all
the RNA molecules produced by a cell line or tissue, the sheer number of lncRNAs
(an estimated 8000 for the human genome, for example) came as a surprise to
scientists. Most lncRNAs are transcribed by RNA polymerase II and have 5' caps and
poly-A tails, and, in many cases, they are spliced. It has been difficult to annotate
lncRNAs because low levels of RNA are now known to be made from 75% of the
human genome. Most of these RNAs are thought to result from the background
"noise" of transcription and RNA processing. According to this idea, such nonfunctional
RNAs provide no fitness advantage or disadvantage to the organism
and are a tolerated by-product of the complex patterns of gene expression that
need to be produced in multicellular organisms. For these reasons, it is difficult to
estimate the number of lncRNAs that are likely to have a function in the cell and to
distinguish them from the background transcription.

We have already encountered a few lncRNAs, including the RNA in telomerase
(see Figure 5-33), Xist RNA (see Figure 7-52), and an RNA involved in imprinting
(see Figure 7-49). Other lncRNAs have been implicated in controlling the
enzymatic activity of proteins, inactivating transcription regulators, affecting splicing
patterns, and blocking translation of certain mRNAs.

In terms of biological function, lncRNA should be considered a catch-all phrase
encompassing a great diversity of functions. Nevertheless, there are two unifying
features of lncRNAs that can account for their many roles in the cell. The first is
that lncRNAs can function as scaffold RNA molecules, holding together groups of
proteins to coordinate their functions (Figure 7-79A). We have already seen an
example in telomerase, where the RNA molecule holds together and organizes
protein components. These RNA-based scaffolds are analogous to the protein
scaffolds we discussed in Chapter 3 (see Figure 3-78) and Chapter 6 (see Figure 6-47).
RNA molecules are well suited to act as scaffolds: small bits of RNA sequence,
often those portions that form stem-loop structures, can serve as binding sites
for proteins, and these can be strung together with random sequences of RNA
in between. This property may be one reason that lncRNAs show relatively little
primary-sequence conservation across species.

The second key feature of lncRNAs is their ability to serve as guide sequences,
binding to specific RNA or DNA target molecules through base-pairing. By doing
so, they bring proteins that are bound to them into close proximity with the DNA
and RNA sequences (Figure 7-79B). This behavior is similar to that of snoRNAs
(see Figure 6-41), crRNAs (see Figure 7-78), and miRNAs (see Figure 7-75), all of
which act in this way to guide protein enzymes to specific nucleic acid sequences.

Figure 7-79 Roles of long noncoding
RNA (lncRNA). (A) lncRNAs can serve as
scaffolds, bringing together proteins that
function in the same process. As described
in Chapter 6, RNAs can fold into specific
three-dimensional structures that are often
recognized by proteins. (B) In addition to
serving as scaffolds, lncRNAs can, through
formation of complementary base pairs,
localize proteins to specific sequences on
RNA or DNA molecules. (C) In some cases,
lncRNAs act only in cis, for example,
when the RNA is held in place by RNA
polymerase (top). Other lncRNAs, however,
diffuse from their sites of synthesis and
therefore act in trans.

In some cases, lncRNAs work simply by base-pairing, without bringing in
enzymes or other proteins. For example, a number of lncRNA genes are
embedded in protein-coding genes, but they are transcribed in the "wrong direction."
These antisense RNAs can form complementary base pairs with the mRNA
(transcribed in the "correct" direction) and block its translation into protein (see Figure
7-66D). Other antisense lncRNAs base-pair with pre-mRNAs as they are
synthesized and change the pattern of RNA splicing by masking splice-site sequences.
Still others act as "sponges," base-pairing with miRNAs and thereby reducing their
effects. 
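
The base-pairing between an antisense RNA and its mRNA can be made concrete with a minimal sketch. The sequence and function names are hypothetical, and the model considers only perfect Watson-Crick pairing over the full length; real antisense RNAs need not pair along their entire length.

```python
# Watson-Crick complements for RNA bases.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    """Return the reverse complement of an RNA sequence written 5' to 3'."""
    return rna.translate(COMPLEMENT)[::-1]


def pairs_fully(rna_a, rna_b):
    """True if the two RNAs can form an uninterrupted antiparallel duplex."""
    return len(rna_a) == len(rna_b) and reverse_complement(rna_b) == rna_a


mrna = "AUGGCUUAA"                    # hypothetical sense transcript
antisense = reverse_complement(mrna)  # transcript from the opposite strand
duplex_possible = pairs_fully(mrna, antisense)
```

Because the antisense transcript is copied from the opposite DNA strand, it is the reverse complement of the mRNA, which is what allows the two to pair.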

Finally, we note that some lncRNAs can act only in cis; that is, they affect only
the chromosome from which they are transcribed. This readily occurs when the
transcribed RNA has not yet been released from RNA polymerases (Figure 7-79C).
Many lncRNAs, however, diffuse from their site of synthesis and act in trans.
Although the best understood lncRNAs work in the nucleus, many are found in the
cytosol. The functions, if any, of the great majority of these cytosolic lncRNAs
remain undiscovered.


WHAT WE DON'T KNOW


• How is the final rate of transcription
of a gene specified by the hundreds of
proteins that assemble on its control
regions? Will we ever be able to predict
this rate from inspection of the DNA
sequences of control regions?

• How does the collection of cis- 
regulatory sequences embedded in a 
genome orchestrate the developmental 
program of a multicellular organism? 

• How much of the human genome 
sequence is functional, and why is the 
remainder retained? 

• Which of the thousands of unstudied
noncoding RNAs have functions in the
cell, and what are these functions?


• Were introns present in early cells
(and subsequently lost in some
organisms), or did they arise at later
times?


Summary

RNA molecules have many uses in the cell besides carrying the information needed
to specify the order of amino acids during protein synthesis. Although we have
encountered noncoding RNAs in other chapters (tRNAs, rRNAs, snoRNAs, for
example), the sheer number of noncoding RNAs produced by cells has surprised
scientists. One well understood use of noncoding RNAs occurs in RNA interference, where
guide RNAs (miRNAs, siRNAs, piRNAs) base-pair with mRNAs. RNA interference
can cause mRNAs to be either destroyed or translationally repressed. It can also
cause specific genes to be packaged into heterochromatin, suppressing their
transcription. In bacteria and archaebacteria, RNA interference is used as an adaptive
immune response to destroy viruses that infect them. A large family of long
noncoding RNAs (lncRNAs) has recently been discovered. Although the function of most of
these RNAs is unknown, some serve as RNA scaffolds to bring specific proteins and
RNA molecules together to speed up needed reactions.


PROBLEMS 


Which statements are true? Explain why or why not.


7-1 In terms of the way it interacts with DNA, the
helix-loop-helix motif is more closely related to the
leucine zipper motif than it is to the helix-turn-helix motif.

7-2 Once cells have differentiated to their final
specialized forms, they never again alter expression of their
genes.

7-3 CG islands are thought to have arisen during
evolution because they were associated with portions of the
genome that remained unmethylated in the germ line.

7-4 In most differentiated tissues, daughter cells retain
a memory of gene expression patterns that were present
in the parent cell through mechanisms that do not involve
changes in the sequence of their genomic DNA.

Discuss the following problems.


7-5 A small portion of a two-dimensional display
of proteins from human brain is shown in Figure Q7-1.
These proteins were separated on the basis of size in one
dimension and electrical charge (isoelectric point) in the
other. Not all protein spots on such displays are products
of different genes; some represent modified forms of a
protein that migrate to different positions. Pick out a couple
of sets of spots that could represent proteins that differ by
the number of phosphates they carry. Explain the basis for
your selection.

Figure Q7-1 Two-dimensional separation of
proteins from the human brain (Problem 7-5). The
proteins were displayed using two-dimensional
gel electrophoresis. Only a small portion of the
protein spectrum is shown. (Courtesy of Tim Myers and
Leigh Anderson, Large Scale Biology Corporation.)

7-6 Comparisons of the patterns of mRNA levels
across different human cell types show that the level of
expression of almost every active gene is different. The
patterns of mRNA abundance are so characteristic of cell
type that they can be used to determine the tissue of origin
of cancer cells, even though the cells may have
metastasized to different parts of the body. By definition, however,
cancer cells are different from their noncancerous
precursor cells. How do you suppose then that patterns of mRNA
expression might be used to determine the tissue source of
a human cancer?

7-7 What are the two fundamental components of a
genetic switch?

7-8 The nucleus of a eukaryotic cell is much larger
than a bacterium, and it contains much more DNA. As a
consequence, a transcription regulator in a eukaryotic cell
must be able to select its specific binding site from among
many more unrelated sequences than does a transcription
regulator in a bacterium. Does this present any special
problems for eukaryotic gene regulation?

Consider the following situation. Assume that the
eukaryotic nucleus and the bacterial cell each have a
single copy of the same DNA binding site. In addition, assume
that the nucleus is 500 times the volume of the bacterium,
and has 500 times as much DNA. If the concentration of the
transcription regulator that binds the site were the same
in the nucleus and in the bacterium, would the regulator
occupy its binding site equally well in the eukaryotic
nucleus as it does in the bacterium? Explain your answer.

7-9 Some transcription regulators bind to DNA and
cause the double helix to bend at a sharp angle. Such
"bending proteins" can affect the initiation of
transcription without directly contacting any other protein. Can you
devise a plausible explanation for how such proteins might
work to modulate transcription? Draw a diagram that
illustrates your explanation.

7-10 How is it that protein-protein interactions that
are too weak to cause proteins to assemble in solution
can nevertheless allow the same proteins to assemble into
complexes on DNA?

7-11 Imagine the two situations shown in Figure Q7-2.
In cell 1, a transient signal induces the synthesis of
protein A, which is a transcription activator that turns on
many genes including its own. In cell 2, a transient signal
induces the synthesis of protein R, which is a transcription
repressor that turns off many genes including its own. In
which, if either, of these situations will the descendants of
the original cell "remember" that the progenitor cell had
experienced the transient signal? Explain your reasoning.

Figure Q7-2 Gene regulatory circuits and cell memory (Problem 7-11).
(A) Induction of synthesis of transcription activator A by a transient
signal. (B) Induction of synthesis of transcription repressor R by a
transient signal.

7-12 Examine the two pedigrees shown in Figure Q7-3.
One results from deletion of a maternally imprinted
autosomal gene. The other pedigree results from deletion of a
paternally imprinted autosomal gene. In both pedigrees,
affected individuals (red symbols) are heterozygous for
the deletion. These individuals are affected because one
copy of the chromosome carries an imprinted, inactive
gene, while the other carries a deletion of the gene. Dotted
yellow symbols indicate individuals that carry the deleted
locus, but do not display the mutant phenotype. Which
pedigree is based on paternal imprinting and which on
maternal imprinting? Explain your answer.

Figure Q7-3 Pedigrees reflecting maternal and paternal imprinting
(Problem 7-12). In one pedigree, the gene is paternally imprinted; in
the other, it is maternally imprinted. In generations 3 and 4, only one
of the two parents in the indicated matings is shown; the other parent
is a normal individual from outside this pedigree. Affected individuals
are represented by red circles for females and red squares for males.
Dotted yellow symbols indicate individuals that carry the deletion but
do not display the phenotype.

7-13 If you insert a β-galactosidase gene lacking its own
transcription control region into a cluster of piRNA genes
in Drosophila, you find that β-galactosidase expression
from a normal copy elsewhere in the genome is strongly
inhibited in the fly's germ cells. If the inactive
β-galactosidase gene is inserted outside the piRNA gene cluster, the
normal gene is properly expressed. What do you suppose
is the basis for this observation? How would you test your
hypothesis?


Progress in science is often driven by advances in technology. The entire field
of cell biology, for example, came into being when optical craftsmen learned to
grind small lenses of sufficiently high quality to observe cells and their
substructures. Innovations in lens grinding, rather than any conceptual or philosophical
advance, allowed Hooke and van Leeuwenhoek to discover a previously unseen
cellular world, where tiny creatures tumble and twirl in a small droplet of water
(Figure 8-1).

The twenty-first century is a particularly exciting time for biology. New
methods for analyzing cells, proteins, DNA, and RNA are fueling an information
explosion and allowing scientists to study cells and their macromolecules in
previously unimagined ways. We now have access to the sequences of many billions of
nucleotides, providing the complete molecular blueprints for hundreds of
organisms, from microbes and mustard weeds to worms, flies, mice, dogs,
chimpanzees, and humans. And powerful new techniques are helping us to decipher that
information, allowing us not only to compile huge, detailed catalogs of genes and
proteins but also to begin to unravel how these components work together to form
functional cells and organisms. The long-range goal is nothing short of obtaining
a complete understanding of what takes place inside a cell as it responds to its
environment and interacts with its neighbors.

In this chapter, we present some of the principal methods used to study cells
and their molecular components. We consider how to separate cells of different
types from tissues, how to grow cells outside the body, and how to disrupt cells
and isolate their organelles and constituent macromolecules in pure form. We
also present the techniques used to determine protein structure, function, and
interactions, and we discuss the breakthroughs in DNA technology that continue
to revolutionize our understanding of cell function. We end the chapter with
an overview of some of the mathematical approaches that are helping us deal
with the enormous complexity of cells. By considering cells as dynamic systems
with many moving parts, mathematical approaches can reveal hidden insights
into how the many components of cells work together to produce the special
qualities of life.

ISOLATING CELLS AND GROWING THEM IN CULTURE

Although the organelles and large molecules in a cell can be visualized with
microscopes, understanding how these components function requires a detailed
biochemical analysis. Most biochemical procedures require that large numbers of
cells be physically disrupted to gain access to their components. If the sample is
a piece of tissue, composed of different types of cells, heterogeneous cell
populations will be mixed together. To obtain as much information as possible about the
cells in a tissue, biologists have developed ways of dissociating cells from tissues
and separating them according to type. These manipulations result in a relatively
homogeneous population of cells that can then be analyzed, either directly or
after their number has been greatly increased by allowing the cells to proliferate
in culture.

Cells Can Be Isolated from Tissues 

Intact tissues provide the most realistic source of material, as they represent the
actual cells found within the body. The first step in isolating individual cells is to
disrupt the extracellular matrix and cell-cell junctions that hold the cells together.
For this purpose, a tissue sample is typically treated with proteolytic enzymes
(such as trypsin and collagenase) to digest proteins in the extracellular matrix and
with agents (such as ethylenediaminetetraacetic acid, or EDTA) that bind, or
chelate, the Ca2+ on which cell-cell adhesion depends. The tissue can then be teased
apart into single cells by gentle agitation.

For some biochemical preparations, the protein of interest can be obtained in
sufficient quantity without having to separate the tissue or organ into cell types.
Examples include the preparation of histones from calf thymus, actin from
rabbit muscle, or tubulin from cow brain. In other cases, obtaining the desired
protein requires enrichment for a specific cell type of interest. Several approaches
are used to separate the different cell types from a mixed cell suspension. One
of the most sophisticated cell-separation techniques uses an antibody coupled
to a fluorescent dye to label specific cells. An antibody is chosen that
specifically binds to the surface of only one cell type in the tissue. The labeled cells can
then be separated from the unlabeled ones in a fluorescence-activated cell sorter.
In this remarkable machine, individual cells traveling single file in a fine stream
pass through a laser beam, and the fluorescence of each cell is rapidly measured.
A vibrating nozzle generates tiny droplets, most containing either one cell or no
cells. The droplets containing a single cell are automatically given a positive or a
negative charge at the moment of formation, depending on whether the cell they
contain is fluorescent; they are then deflected by a strong electric field into an
appropriate container. Occasional clumps of cells, detected by their increased
light scattering, are left uncharged and are discarded into a waste container. Such
machines can accurately select 1 fluorescent cell from a pool of 1000 unlabeled
cells and sort several thousand cells each second (Figure 8-2). 
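
The sorting decision described above can be sketched as a small function. This is only an illustration under stated assumptions: the threshold values, the parameter names, and the choice of a positive charge for fluorescent cells are all hypothetical, since the text says only that the sign of the charge depends on whether the cell is fluorescent.

```python
def droplet_charge(fluorescence, scatter,
                   fluor_threshold=100.0,   # hypothetical cutoff for "fluorescent"
                   min_scatter=10.0,        # hypothetical: below this, no cell present
                   clump_scatter=500.0):    # hypothetical: at or above this, a clump
    """Return +1, -1, or 0 for the charge given to a droplet.

    +1: single fluorescent cell (sign convention assumed here)
    -1: single nonfluorescent cell
     0: empty droplet or clump, left uncharged
    """
    if scatter < min_scatter:
        return 0          # no cell detected
    if scatter >= clump_scatter:
        return 0          # clump detected by increased light scattering
    return 1 if fluorescence >= fluor_threshold else -1


# Container reached after deflection in the electric field.
DESTINATION = {1: "fluorescent tube", -1: "nonfluorescent tube", 0: "waste"}

events = [(250.0, 120.0), (5.0, 120.0), (900.0, 800.0), (0.0, 0.0)]
routes = [DESTINATION[droplet_charge(f, s)] for f, s in events]
```

Leaving clumps and empty droplets uncharged means they pass undeflected through the field, so a single waste container can collect both.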

Cells Can Be Grown in Culture 

Although molecules can be extracted from whole tissues, this is often not the
most convenient or useful source of material. The complexity of intact tissues and
organs is an inherent disadvantage when trying to purify particular molecules.
Cells grown in culture provide a more homogeneous population of cells from
which to extract material, and they are also much more convenient to work with
in the laboratory. Given appropriate surroundings, most plant and animal cells
can live, multiply, and even express differentiated properties in a culture dish. The
cells can be watched continuously under the microscope or analyzed
biochemically, and the effects of adding or removing specific molecules, such as hormones
or growth factors, can be systematically explored.

Experiments performed on cultured cells are sometimes said to be carried out
in vitro (literally, "in glass") to contrast them with experiments using intact
organisms, which are said to be carried out in vivo (literally, "in the living organism").





Figure 8-1 Microscopic life. A sample
of "diverse animalcules" seen by van
Leeuwenhoek using his simple microscope.
(A) Bacteria seen in material he excavated
from between his teeth. Those in fig. B he
described as "swimming first forward and
then backwards" (1692). (B) The eukaryotic
green alga Volvox (1700). (Courtesy of the
John Innes Foundation.)

Figure 8-2 A fluorescence-activated
cell sorter. A cell passing through the
laser beam is monitored for fluorescence.
Droplets containing single cells are given a
negative or positive charge, depending on
whether the cell is fluorescent or not. The 
droplets are then deflected by an electric 
field into collection tubes according to their 
charge. Note that the cell concentration 
must be adjusted so that most droplets 
contain no cells and flow to a waste 
container together with any cell clumps. 
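
Why a dilute suspension leaves most droplets empty can be estimated with a short calculation. The sketch below assumes that cells are distributed among droplets at random, so that the number of cells per droplet follows a Poisson distribution; that model, the function name, and the example mean of 0.1 cells per droplet are assumptions made for illustration and are not taken from the figure.

```python
import math


def droplet_occupancy(mean_cells_per_droplet):
    """Return (P(no cell), P(exactly one cell), P(two or more cells)).

    Assumes Poisson-distributed occupancy with the given mean.
    """
    lam = mean_cells_per_droplet
    p_empty = math.exp(-lam)
    p_single = lam * p_empty
    p_multiple = 1.0 - p_empty - p_single
    return p_empty, p_single, p_multiple


# Hypothetical dilution: on average one cell per ten droplets.
p_empty, p_single, p_multiple = droplet_occupancy(0.1)
```

Under this assumption, lowering the mean occupancy makes droplets holding two or more cells much rarer than droplets holding one, at the cost of producing many empty droplets.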


These terms can be confusing, however, because they are often used in a very
different sense by biochemists. In the biochemistry lab, in vitro refers to reactions
carried out in a test tube in the absence of living cells, whereas in vivo refers to
any reaction taking place inside a living cell, even if that cell is growing in culture.

Tissue culture began in 1907 with an experiment designed to settle a
controversy in neurobiology. The hypothesis under examination was known as the
neuronal doctrine, which states that each nerve fiber is the outgrowth of a single
nerve cell and not the product of the fusion of many cells. To test this contention,
small pieces of spinal cord were placed on clotted tissue fluid in a warm, moist
chamber and observed at regular intervals under the microscope. After a day or
so, individual nerve cells could be seen extending long, thin filaments (axons) into
the clot. Thus, the neuronal doctrine received strong support, and the foundation
was laid for the cell-culture revolution.

These original experiments on nerve fibers used cultures of small tissue
fragments called explants. Today, cultures are more commonly made from
suspensions of cells dissociated from tissues. Unlike bacteria, most tissue cells are not
adapted to living suspended in fluid and require a solid surface on which to grow
and divide. For cell cultures, this support is usually provided by the surface of a
plastic culture dish. Cells vary in their requirements, however, and many do not
proliferate or differentiate unless the culture dish is coated with materials that
cells like to adhere to, such as polylysine or extracellular matrix components.

Cultures prepared directly from the tissues of an organism are called primary
cultures. These can be made with or without an initial fractionation step to
separate different cell types. In most cases, cells in primary cultures can be removed
from the culture dish and recultured repeatedly in so-called secondary cultures;
in this way, they can be repeatedly subcultured (passaged) for weeks or months.
Such cells often display many of the differentiated properties appropriate to their
origin (Figure 8-3): fibroblasts continue to secrete collagen; cells derived from
embryonic skeletal muscle fuse to form muscle fibers that contract spontaneously
in the culture dish; nerve cells extend axons that are electrically excitable and
make synapses with other nerve cells; and epithelial cells form extensive sheets
with many of the properties of an intact epithelium. Because these properties are
maintained in culture, they are accessible to study in ways that are often not
possible in intact tissues.

Figure 8-3 Light micrographs of cells in culture. (A) Mouse fibroblasts.
(B) Chick myoblasts fusing to form multinucleate muscle cells. (C) Purified rat
retinal ganglion nerve cells. (D) Tobacco cells in liquid culture. (A, courtesy of
Daniel Zicha; B, courtesy of Rosalind Zalin; C, from A. Meyer-Franke et al.,
Neuron 15:805-819, 1995. With permission from Elsevier; D, courtesy of
Gethin Roberts.)

Cell culture is not limited to animal cells. When a piece of plant tissue is
cultured in a sterile medium containing nutrients and appropriate growth
regulators, many of the cells are stimulated to proliferate indefinitely in a disorganized
manner, producing a mass of relatively undifferentiated cells called a callus. If
the nutrients and growth regulators are carefully manipulated, one can induce
the formation of a shoot and then root apical meristems within the callus, and, in
many species, regenerate a whole new plant. Similar to animal cells, callus
cultures can be mechanically dissociated into single cells, which will grow and divide
as a suspension culture (see Figure 8-3D).



Eukaryotic Cell Lines Are a Widely Used Source of Homogeneous 
Cells 

The cell cultures obtained by disrupting tissues tend to suffer from a problem:
eventually the cells die. Most vertebrate cells stop dividing after a finite number of
cell divisions in culture, a process called replicative cell senescence (discussed in
Chapter 17). Normal human fibroblasts, for example, typically divide only 25-40
times in culture before they stop. In these cells, the limited proliferation capacity
reflects a progressive shortening and uncapping of the cell's telomeres, the
repetitive DNA sequences and associated proteins that cap the ends of each
chromosome (discussed in Chapter 5). Human somatic cells in the body have turned off
production of the enzyme, called telomerase, that normally maintains the
telomeres, which is why their telomeres shorten with each cell division. Human
fibroblasts can often be coaxed to proliferate indefinitely by providing them with the
gene that encodes the catalytic subunit of telomerase; in this case, they can be
propagated as an “immortalized” cell line. 
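
The link between telomere shortening and a finite division limit can be illustrated with a toy model. The starting length, the loss per division, and the critical length used below are hypothetical numbers chosen only so that the result falls in the 25-40 range quoted above; they are not measurements, and the model ignores uncapping and every other aspect of senescence.

```python
def divisions_until_senescence(telomere_bp, loss_per_division_bp,
                               critical_bp, telomerase_active=False):
    """Count divisions before the telomere would drop below `critical_bp`.

    Returns None when telomerase is active, because in this toy model
    the telomere is then fully maintained and no limit is reached.
    """
    if telomerase_active:
        return None
    divisions = 0
    # Divide only while the next division keeps the telomere at or above
    # the critical length.
    while telomere_bp - loss_per_division_bp >= critical_bp:
        telomere_bp -= loss_per_division_bp
        divisions += 1
    return divisions


# Hypothetical values, in base pairs.
limit = divisions_until_senescence(8000, 100, 5000)
immortalized = divisions_until_senescence(8000, 100, 5000,
                                          telomerase_active=True)
```

In this sketch, supplying telomerase removes the limit entirely, which corresponds to the immortalized cell line described in the text.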

Some human cells, however, cannot be immortalized by this trick. Although
their telomeres remain long, they still stop dividing after a limited number of
divisions because culture conditions cause excessive mitogenic stimulation, which
activates a poorly understood protective mechanism (discussed in Chapter 17)
that stops cell division, a process sometimes called "culture shock." To
immortalize these cells, one has to do more than introduce telomerase. One must also
inactivate the protective mechanisms, which can be done by introducing certain
cancer-promoting oncogenes (discussed in Chapter 20). Unlike human cells,
most rodent cells do not turn off production of telomerase and therefore their
telomeres do not shorten with each cell division. Therefore, if culture shock can
be avoided, some rodent cell types will divide indefinitely in culture. In addition,
rodent cells often undergo spontaneous genetic changes in culture that inactivate
their protective mechanisms, thereby producing immortalized cell lines.

Cell lines can often be most easily generated from cancer cells, but these
cultures, referred to as transformed cell lines, differ from those prepared from
normal cells in several ways. Transformed cell lines often grow without attaching to a
surface, for example, and they can proliferate to a much higher density in a culture
dish. Similar properties can be induced experimentally in normal cells by
transforming them with a tumor-inducing virus or chemical. The resulting transformed
cell lines can usually cause tumors if injected into a susceptible animal.

Transformed and nontransformed cell lines are extremely useful in cell 
research as sources of very large numbers of cells of a uniform type, especially 
since they can be stored in liquid nitrogen at -196°C for an indefinite period and 
retain their viability when thawed. It is important to keep in mind, however, that
cell lines nearly always differ in important ways from their normal progenitors in 
the tissues from which they were derived. 

Some widely used cell lines are listed in Table 8-1. Different lines have
different advantages; for example, the PtK epithelial cell lines derived from the rat
kangaroo, unlike many other cell types, remain flat during mitosis, allowing the
mitotic apparatus to be readily observed in action.


Table 8-1 Some Commonly Used Cell Lines

Cell line*    Cell type and origin

3T3           Fibroblast (mouse)
BHK21         Fibroblast (Syrian hamster)
MDCK          Epithelial cell (dog)
HeLa          Epithelial cell (human)
PtK1          Epithelial cell (rat kangaroo)
L6            Myoblast (rat)
PC12          Chromaffin cell (rat)
SP2           Plasma cell (mouse)
COS           Kidney (monkey)
293           Kidney (human); transformed with adenovirus
CHO           Ovary (Chinese hamster)
DT40          Lymphoma cell for efficient targeted recombination (chick)
R1            Embryonic stem cell (mouse)
E14.1         Embryonic stem cell (mouse)
H1, H9        Embryonic stem cell (human)
S2            Macrophage-like cell (Drosophila)
BY2           Undifferentiated meristematic cell (tobacco)

*Many of these cell lines were derived from tumors. All of them are capable of indefinite
replication in culture and express at least some of the special characteristics of their
cells of origin.

Hybridoma Cell Lines Are Factories That Produce Monoclonal 
Antibodies 

As we see throughout this book, antibodies are particularly useful tools for cell
biology. Their great specificity allows precise visualization of selected proteins
among the many thousands that each cell typically produces. Antibodies are
often produced by inoculating animals with the protein of interest and
subsequently isolating the antibodies specific to that protein from the serum of the
animal. However, only limited quantities of antibodies can be obtained from a
single inoculated animal, and the antibodies produced will be a heterogeneous
mixture, differing from animal to animal, of antibodies that recognize a variety of
different antigenic sites on a macromolecule. Moreover, antibodies specific for
the antigen will constitute only a fraction of the antibodies found in the serum. An
alternative technology, which allows the production of an unlimited quantity of
identical antibodies and greatly increases the specificity and convenience of
antibody-based methods, is the production of monoclonal antibodies by hybridoma
cell lines.

This technology, developed in 1975, revolutionized the production of
antibodies for use as tools in cell biology, as well as for the diagnosis and treatment of
certain diseases, including rheumatoid arthritis and cancer. The procedure requires
hybrid cell technology (Figure 8-4), and it involves propagating a clone of cells
from a single antibody-secreting B lymphocyte to obtain a homogeneous
preparation of antibodies in large quantities. B lymphocytes normally have a limited
life-span in culture, but individual antibody-producing B lymphocytes from an
immunized mouse, when fused with cells derived from a transformed B
lymphocyte cell line, can give rise to hybrids that have both the ability to make a particular
antibody and the ability to multiply indefinitely in culture. These hybridomas are
propagated as individual clones, each of which provides a permanent and stable
source of a single type of monoclonal antibody. Each type of monoclonal
antibody recognizes a single type of antigenic site: for example, a particular cluster
of five or six amino acid side chains on the surface of a protein. Their uniform
specificity makes monoclonal antibodies much more useful than conventional
antisera for many purposes.

An important advantage of the hybridoma technique is that monoclonal
antibodies can be made against molecules that constitute only a minor component
of a complex mixture. In an ordinary antiserum made against such a mixture, the
proportion of antibody molecules that recognize the minor component would be
too small to be useful. But if the B lymphocytes that produce the various
components of this antiserum are made into hybridomas, it becomes possible to screen
individual hybridoma clones from the large mixture to select one that produces
the desired type of monoclonal antibody and to propagate the selected hybridoma
indefinitely so as to produce that antibody in unlimited quantities. In principle,
therefore, a monoclonal antibody can be made against any protein in a biological
sample. Once an antibody has been made, it can be used to localize the protein
in cells and tissues, to follow its movement, and to purify the protein to study its
structure and function.

Figure 8-4 The production of hybrid
cells. It is possible to fuse one cell
with another to form a heterokaryon, a
combined cell with two separate nuclei.
Typically, a suspension of cells is treated
with certain inactivated viruses or with
polyethylene glycol, each of which alters
the plasma membranes of cells in a way
that induces them to fuse. Eventually,
a heterokaryon proceeds to mitosis
and produces a hybrid cell in which the
two separate nuclear envelopes have
been disassembled, allowing all the
chromosomes to be brought together in a
single large nucleus. Such hybrid cells can
give rise to immortal hybrid cell lines. If one
of the parent cells was from a tumor cell
line, the hybrid cell is called a hybridoma.

Summary 

Tissues can be dissociated into their component cells, from which individual cell
types can be purified and used for biochemical analysis or for the establishment of
cell cultures. Many animal and plant cells survive and proliferate in a culture dish if
they are provided with a suitable culture medium containing nutrients and appropriate
signal molecules. Although many animal cells stop dividing after a finite
number of cell divisions, cells that have been immortalized through spontaneous
mutations or genetic manipulation can be maintained indefinitely as cell lines.
Hybridoma cells are widely employed to produce unlimited quantities of uniform
monoclonal antibodies, which are used to detect and purify cell proteins, as well as
to diagnose and treat diseases.

PURIFYING PROTEINS 

The challenge of isolating a single type of protein from the thousands of other
proteins present in a cell is a formidable one, but must be overcome in order to
study protein function in vitro. As we shall see later in this chapter, recombinant
DNA technology can enormously simplify this task by “tricking” cells into producing
large quantities of a given protein, thereby making its purification much easier.
Whether the source of the protein is an engineered cell or a natural tissue, a
purification procedure usually starts with subcellular fractionation to reduce the
complexity of the material, and is then followed by purification steps of increasing
specificity.

Cells Can Be Separated into Their Component Fractions 

To purify a protein, it must first be extracted from inside the cell. Cells can be
broken up in various ways: they can be subjected to osmotic shock or ultrasonic
vibration, forced through a small orifice, or ground up in a blender. These procedures
break many of the membranes of the cell (including the plasma membrane
and endoplasmic reticulum) into fragments that immediately reseal to form small
closed vesicles. If carefully carried out, however, the disruption procedures leave
organelles such as nuclei, mitochondria, the Golgi apparatus, lysosomes, and
peroxisomes largely intact. The suspension of cells is thereby reduced to a thick
slurry (called a homogenate or extract) that contains a variety of membrane-enclosed
organelles, each with a distinctive size, charge, and density. Provided that
the homogenization medium has been carefully chosen (by trial and error for
each organelle), the various components, including the vesicles derived from
the endoplasmic reticulum (called microsomes), retain most of their original
biochemical properties.

The different components of the homogenate must then be separated. Such
cell fractionations became possible only after the commercial development in
the early 1940s of an instrument known as the preparative ultracentrifuge, which
rotates extracts of broken cells at high speeds (Figure 8-5). This treatment
separates cell components by size and density: in general, the largest objects
experience the largest centrifugal force and move the most rapidly. At relatively low
speed, large components such as nuclei sediment to form a pellet at the bottom
of the centrifuge tube; at slightly higher speed, a pellet of mitochondria is deposited;
and at even higher speeds and with longer periods of centrifugation, first the
small closed vesicles and then the ribosomes can be collected (Figure 8-6). All
of these fractions are impure, but many of the contaminants can be removed by
resuspending the pellet and repeating the centrifugation procedure several times.

Figure 8-5 The preparative ultracentrifuge. (A) The sample is contained in tubes that are
inserted into a ring of angled cylindrical holes in a metal rotor. Rapid rotation of the rotor generates
enormous centrifugal forces, which cause particles in the sample to sediment against the bottom
sides of the sample tubes, as shown here. The vacuum reduces friction, preventing heating of the
rotor and allowing the refrigeration system to maintain the sample at 4°C. (B) Some fractionation
methods require a different type of rotor called a swinging-bucket rotor. In this case, the sample
tubes are placed in metal tubes on hinges that allow the tubes to swing outward when the rotor
spins. Sample tubes are therefore horizontal during spinning, and samples are sedimented toward
the bottom, not the sides, of the tube, providing better separation of differently sized components
(see Figures 8-6 and 8-7).





Centrifugation is the first step in most fractionations, but it separates only
components that differ greatly in size. A finer degree of separation can be achieved by
layering the homogenate in a thin band on top of a salt solution that fills a
centrifuge tube. When centrifuged, the various components in the mixture move as a
series of distinct bands through the solution, each at a different rate, in a process
called velocity sedimentation (Figure 8-7A). For the procedure to work effectively,
the bands must be protected from convective mixing, which would normally
occur whenever a denser solution (for example, one containing organelles) finds
itself on top of a lighter one (the salt solution). This is achieved by augmenting
the solution in the tube with a shallow gradient of sucrose prepared by a special
mixing device. The resulting density gradient, with the dense end at the bottom
of the tube, keeps each region of the solution denser than any solution above it,
and it thereby prevents convective mixing from distorting the separation.

When sedimented through sucrose gradients, different cell components separate
into distinct bands that can be collected individually. The relative rate at which
each component sediments depends primarily on its size and shape, and is normally
described in terms of its sedimentation coefficient, or S value. Present-day
ultracentrifuges rotate at speeds of up to 80,000 rpm and produce forces as high as
500,000 times gravity. These enormous forces drive even small macromolecules,
such as tRNA molecules and simple enzymes, to sediment at an appreciable rate
and allow them to be separated from one another by size.
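
The link between rotor speed and the forces quoted above can be checked with a short calculation. The sketch below converts revolutions per minute into relative centrifugal force (a multiple of gravity) using the standard relation RCF = ω²r/g. The rotor radius of 7 cm is an assumption made for illustration only; it is not stated in the text, and real rotors differ.

```python
import math

STANDARD_GRAVITY = 9.80665  # m/s^2


def relative_centrifugal_force(rpm, radius_m):
    """Return the centrifugal acceleration as a multiple of gravity.

    rpm      -- rotor speed in revolutions per minute
    radius_m -- distance from the rotation axis in meters (assumed value)
    """
    omega = 2.0 * math.pi * rpm / 60.0  # angular velocity in rad/s
    return omega ** 2 * radius_m / STANDARD_GRAVITY


if __name__ == "__main__":
    assumed_radius_m = 0.07  # hypothetical rotor radius, not from the text
    for rpm in (10_000, 40_000, 80_000):
        rcf = relative_centrifugal_force(rpm, assumed_radius_m)
        print(f"{rpm:>6} rpm -> about {rcf:,.0f} x gravity")
```

Because the force grows with the square of the rotor speed, halving the speed cuts the force to one quarter, which is why the very high forces are reached only at the top of the speed range.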

The ultracentrifuge is also used to separate cell components on the basis of
their buoyant density, independently of their size and shape. In this case, the
sample is sedimented through a steep density gradient that contains a very high
concentration of sucrose or cesium chloride. Each cell component begins to move
down the gradient as in Figure 8-7A, but it eventually reaches a position where the
density of the solution is equal to its own density. At this point, the component
floats and can move no farther. A series of distinct bands is thereby produced in
the centrifuge tube, with the bands closest to the bottom of the tube containing
the components of highest buoyant density (Figure 8-7B). This method, called
equilibrium sedimentation, is so sensitive that it can separate macromolecules
that have incorporated heavy isotopes, such as 13C or 15N, from the same
macromolecules that contain the lighter, common isotopes (12C or 14N). In fact, the
cesium-chloride method was developed in 1957 to separate the labeled from the
unlabeled DNA produced after exposure of a growing population of bacteria to
nucleotide precursors containing 15N; this classic experiment provided direct
evidence for the semiconservative replication of DNA (see Figure 5-5).

Figure 8-6 Cell fractionation by centrifugation. Repeated centrifugation
at progressively higher speeds will fractionate homogenates of cells into their
components. In general, the smaller the subcellular component, the greater
the centrifugal force required to sediment it. Typical values for the various
centrifugation steps referred to in the figure are:
low speed: 1000 times gravity for 10 minutes
medium speed: 20,000 times gravity for 20 minutes
high speed: 80,000 times gravity for 1 hour
very high speed: 150,000 times gravity for 3 hours

Figure 8-7 Comparison of velocity sedimentation and equilibrium
sedimentation. (A) In velocity sedimentation, subcellular components
sediment at different speeds according to their size and shape when layered over a
solution containing sucrose. To stabilize the sedimenting bands against convective
mixing caused by small differences in temperature or solute concentration,
the tube contains a continuous shallow gradient of sucrose, which increases
in concentration toward the bottom of the tube (typically from 5% to 20%
sucrose). After centrifugation, the different components can be collected individually,
most simply by puncturing the plastic centrifuge tube with a needle and collecting
drops from the bottom, as illustrated here. (B) In equilibrium sedimentation,
subcellular components move up or down when centrifuged in a gradient until they
reach a position where their density matches their surroundings. Although a sucrose
gradient is shown here, denser gradients, which are especially useful for protein and
nucleic acid separation, can be formed from cesium chloride. The final bands, at
equilibrium, can be collected as in (A).

Cell Extracts Provide Accessible Systems to Study Cell Functions 

Studies of organelles and other large subcellular components isolated in the
ultracentrifuge have contributed enormously to our understanding of the functions of
different cell components. Experiments on mitochondria and chloroplasts purified
by centrifugation, for example, demonstrated the central function of these
organelles in converting energy into forms that the cell can use. Similarly, resealed
vesicles formed from fragments of rough and smooth endoplasmic reticulum
(microsomes) have been separated from each other and analyzed as functional
models of these compartments of the intact cell.

Similarly, highly concentrated cell extracts, especially extracts of Xenopus laevis
(African clawed frog) oocytes, have played a critical role in the study of such
complex and highly organized processes as the cell-division cycle, the separation
of chromosomes on the mitotic spindle, and the vesicular-transport steps
involved in the movement of proteins from the endoplasmic reticulum through
the Golgi apparatus to the plasma membrane.

Cell extracts also provide, in principle, the starting material for the complete
separation of all of the individual macromolecular components of the cell. We
now consider how this separation is achieved, focusing on proteins.

Proteins Can Be Separated by Chromatography 

Proteins are most often fractionated by column chromatography, in which a
mixture of proteins in solution is passed through a column containing a porous
solid matrix. Different proteins are retarded to different extents by their interaction
with the matrix, and they can be collected separately as they flow out of the
bottom of the column (Figure 8-8). Depending on the choice of matrix, proteins
can be separated according to their charge (ion-exchange chromatography), their
hydrophobicity (hydrophobic chromatography), their size (gel-filtration
chromatography), or their ability to bind to particular small molecules or to other
macromolecules (affinity chromatography).

Many types of matrices are available. Ion-exchange columns (Figure 8-9A)
are packed with small beads that carry either a positive or a negative charge, so
that proteins are fractionated according to the arrangement of charges on their
surface. Hydrophobic columns are packed with beads from which hydrophobic
side chains protrude, selectively retarding proteins with exposed hydrophobic
regions. Gel-filtration columns (Figure 8-9B), which separate proteins according
to their size, are packed with tiny porous beads: molecules that are small enough
to enter the pores linger inside successive beads as they pass down the column,
while larger molecules remain in the solution flowing between the beads and
therefore move more rapidly, emerging from the column first. Besides providing
a means of separating molecules, gel-filtration chromatography is a convenient
way to estimate their size.

Figure 8-8 The separation of molecules by column chromatography. The sample,
a solution containing a mixture of different molecules, is applied to the top of a
cylindrical glass or plastic column filled with a permeable solid matrix, such as
cellulose. A large amount of solvent is then passed slowly through the column and
collected in separate tubes as it emerges from the bottom. Because various
components of the sample travel at different rates through the column, they are
fractionated into different tubes.

Affinity chromatography (Figure 8-9C) takes advantage of the biologically
important binding interactions that occur on protein surfaces. If a substrate molecule
is covalently coupled to an inert matrix such as a polysaccharide bead, the
enzyme that operates on that substrate will often be specifically retained by the
matrix and can then be eluted (washed out) in nearly pure form. Likewise, short
DNA oligonucleotides of a specifically designed sequence can be immobilized in
this way and used to purify DNA-binding proteins that normally recognize this
sequence of nucleotides in chromosomes. Alternatively, specific antibodies can
be coupled to a matrix to purify protein molecules recognized by the antibodies.
Because of the great specificity of all such affinity columns, 1000- to 10,000-fold
purifications can sometimes be achieved in a single pass.

If one starts with a complex mixture of proteins, a single passage through an
ion-exchange or a gel-filtration column does not produce very highly purified
fractions, since these methods individually increase the proportion of a given protein
in the mixture no more than twentyfold. Because most individual proteins
represent less than 1/1000 of the total cell protein, it is usually necessary to use
several different types of columns in succession to attain sufficient purity, with
affinity chromatography being the most efficient (Figure 8-10).
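
The arithmetic behind this argument can be made explicit. The sketch below is a deliberately simple model, not a description of any real purification: it assumes each column multiplies the fraction of the target protein by a fixed fold enrichment, capped at 100% purity. The starting fraction (1/1000) and the per-step limits (twentyfold, 1000-fold) come from the text; the function name and the choice of steps are illustrative assumptions.

```python
def purity_after_steps(initial_fraction, fold_enrichments):
    """Track the target protein's share of total protein across columns.

    initial_fraction -- starting share of total protein (for example 0.001)
    fold_enrichments -- fold purification assumed for each successive column
    Returns a list with the share before the first step and after each step.
    """
    purity = initial_fraction
    history = [purity]
    for fold in fold_enrichments:
        # A protein cannot make up more than 100% of the sample.
        purity = min(1.0, purity * fold)
        history.append(purity)
    return history


if __name__ == "__main__":
    # Two twentyfold steps (ion exchange, gel filtration), then an
    # assumed 1000-fold affinity step.
    for label, steps in (("two conventional columns", [20, 20]),
                         ("plus an affinity column", [20, 20, 1000])):
        final = purity_after_steps(0.001, steps)[-1]
        print(f"{label}: about {final:.0%} pure")
```

Under these assumptions, two conventional columns leave the protein at well under half of the total, which is why a highly specific affinity step is so valuable.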

Inhomogeneities in the matrices (such as cellulose), which cause an uneven
flow of solvent through the column, limit the resolution of conventional column
chromatography. Special chromatography resins (usually silica-based) composed
of tiny spheres (3-10 μm in diameter) can be packed with a special apparatus
to form a uniform column bed. Such high-performance liquid chromatography
(HPLC) columns attain a high degree of resolution. In HPLC, the solutes
equilibrate very rapidly with the interior of the tiny spheres, and so solutes with
different affinities for the matrix are efficiently separated from one another even
at very fast flow rates. HPLC is therefore the method of choice for separating many
proteins and small molecules.

Immunoprecipitation Is a Rapid Affinity Purification Method

Immunoprecipitation is a useful variation on the theme of affinity chromatography.
Specific antibodies that recognize the protein to be purified are attached to
small agarose beads. Rather than being packed into a column, as in affinity
chromatography, a small quantity of the antibody-coated beads is simply added to a
protein extract in a test tube and mixed in suspension for a short period of time,
thereby allowing the antibodies to bind the desired protein. The beads are then
collected by low-speed centrifugation, and the unbound proteins in the supernatant
are discarded. This method is commonly used to purify small amounts of
enzymes from cell extracts for analysis of enzymatic activity or for studies of
associated proteins.


Figure 8-9 Three types of matrices used for chromatography. (A) In
ion-exchange chromatography, the insoluble matrix carries ionic charges that retard
the movement of molecules of opposite charge. Matrices used for separating
proteins include diethylaminoethylcellulose (DEAE-cellulose), which is positively
charged, and carboxymethylcellulose (CM-cellulose) and phosphocellulose,
which are negatively charged. Analogous matrices based on agarose or other
polymers are also frequently used. The strength of the association between the
dissolved molecules and the ion-exchange matrix depends on both the ionic strength
and the pH of the solution that is passing down the column, which may therefore be
varied systematically (as in Figure 8-10) to achieve an effective separation. (B) In
gel-filtration chromatography, the small beads that form the matrix are inert but
porous. Molecules that are small enough to penetrate into the matrix beads are
thereby delayed and travel more slowly through the column than larger molecules
that cannot penetrate. Beads of cross-linked polysaccharide (dextran, agarose,
or acrylamide) are available commercially in a wide range of pore sizes, making them
suitable for the fractionation of molecules of various molecular mass, from less than
500 daltons to more than 5 × 10^6 daltons. (C) Affinity chromatography uses an
insoluble matrix that is covalently linked to a specific ligand, such as an antibody
molecule or an enzyme substrate, that will bind a specific protein. Enzyme molecules
that bind to immobilized substrates on such columns can be eluted with a
concentrated solution of the free form of the substrate molecule, while molecules
that bind to immobilized antibodies can be eluted by dissociating the antibody-antigen
complex with concentrated salt solutions or solutions of high or low pH. High degrees
of purification can be achieved in a single pass through an affinity column.

Genetically Engineered Tags Provide an Easy Way to Purify
Proteins

Using the recombinant DNA methods discussed in subsequent sections, any gene
can be modified to produce its protein with a special recognition tag attached to
it, so as to make subsequent purification of the protein simple and rapid. Often
the recognition tag is itself an antigenic determinant, or epitope, which can be
recognized by a highly specific antibody. The antibody can then be used to purify
the protein by affinity chromatography or immunoprecipitation (Figure 8-11).
Other types of tags are specifically designed for protein purification. For example,
a repeated sequence of the amino acid histidine binds to certain metal ions,
including nickel and copper. If genetic engineering techniques are used to attach
a short string of histidines to one end of a protein, the slightly modified protein
can be retained selectively on an affinity column containing immobilized nickel
ions. Metal affinity chromatography can thereby be used to purify the modified
protein from a complex molecular mixture.


Figure 8-10 Protein purification by chromatography. Typical results obtained
when three different chromatographic steps are used in succession to purify a protein.
In this example, a homogenate of cells was first fractionated by allowing it to percolate
through an ion-exchange resin packed into a column (A). The column was washed to
remove all unbound contaminants, and the bound proteins were then eluted by pouring
a solution containing a gradually increasing concentration of salt onto the top of the
column. Proteins with the lowest affinity for the ion-exchange resin passed directly
through the column and were collected in the earliest fractions eluted from the bottom
of the column. The remaining proteins were eluted in sequence according to
their affinity for the resin, with those proteins binding most tightly to the resin requiring
the highest concentration of salt to remove them. The protein of interest was eluted
in several fractions and was detected by its enzymatic activity. The fractions with
activity were pooled and then applied to a gel-filtration column (B). The elution
position of the still-impure protein was again determined by its enzymatic activity,
and the active fractions were pooled and purified to homogeneity on an affinity
column (C) that contained an immobilized substrate of the enzyme.

Figure 8-11 Epitope tagging for the purification of proteins. Using standard
genetic engineering techniques, a short peptide tag can be added to a
protein of interest. If the tag is itself an antigenic determinant, or epitope, it can
be targeted by an appropriate antibody, which can be used to purify the protein
by immunoprecipitation or affinity chromatography.

In other cases, an entire protein is used as the recognition tag. When cells
are engineered to synthesize the small enzyme glutathione S-transferase (GST)
attached to a protein of interest, the resulting fusion protein can be purified from
the other contents of the cell with an affinity column containing glutathione, a
substrate molecule that binds specifically and tightly to GST.

As a further refinement of purification methods using recognition tags, an
amino acid sequence that forms a cleavage site for a highly specific proteolytic
enzyme can be engineered between the protein of choice and the recognition tag.
Because the amino acid sequences at the cleavage site are very rarely found by
chance in proteins, the tag can later be cleaved off without destroying the purified
protein.

This type of specific cleavage is used in an especially powerful purification
methodology known as tandem affinity purification tagging (TAP-tagging). Here,
one end of a protein is engineered to contain two recognition tags that are separated
by a protease cleavage site. The tag on the very end of the construct is chosen
to bind irreversibly to an affinity column, allowing the column to be washed
extensively to remove all contaminating proteins. Protease cleavage then releases
the protein, which is then further purified using the second tag. Because this two-step
strategy provides an especially high degree of protein purification with relatively
little effort, it is used extensively in cell biology. Thus, for example, a set of
approximately 6000 yeast strains, each with a different gene fused to DNA that
encodes a TAP-tag, has been constructed to allow any yeast protein to be rapidly
purified.

Purified Cell-free Systems Are Required for the Precise Dissection
of Molecular Functions

Purified cell-free systems provide a means of studying biological processes free
from all of the complex side reactions that occur in a living cell. To make this possible,
cell homogenates are fractionated with the aim of purifying each of the individual
macromolecules that are needed to catalyze a biological process of interest.
For example, the experiments to decipher the mechanisms of protein synthesis
began with a cell homogenate that could translate RNA molecules to produce
proteins. Fractionation of this homogenate, step by step, produced in turn the
ribosomes, tRNAs, and various enzymes that together constitute the protein-synthetic
machinery. Once individual pure components were available, each could
be added or withheld separately to define its exact role in the overall process.

A major goal for cell biologists is the reconstitution of every biological process
in a purified cell-free system. Only in this way can we define all of the components
needed for the process and control their concentrations, which is required to work
out their precise mechanism of action. Although much remains to be done, a
great deal of what we know today about the molecular biology of the cell has been
discovered by studies in such cell-free systems. They have been used, for example,
to decipher the molecular details of DNA replication and DNA transcription, RNA
splicing, protein translation, muscle contraction, particle transport along
microtubules, and many other processes that occur in cells.

Summary 

Populations of cells can be analyzed biochemically by disrupting them and fractionating
their contents, allowing functional cell-free systems to be developed. Highly
purified cell-free systems are needed for determining the molecular details of complex
cell processes, and the development of such systems requires extensive purification
of all the proteins and other components involved. The proteins in soluble
cell extracts can be purified by column chromatography; depending on the type of
column matrix, biologically active proteins can be separated on the basis of their
molecular weight, hydrophobicity, charge characteristics, or affinity for other molecules.
In a typical purification, the sample is passed through several different columns
in turn, with the enriched fractions obtained from one column being applied
to the next. Recombinant DNA techniques (described later) allow special recognition
tags to be attached to proteins, thereby greatly simplifying their purification.

ANALYZING PROTEINS

Proteins perform most cellular processes: they catalyze metabolic reactions, use
nucleotide hydrolysis to do mechanical work, and serve as the major structural
elements of the cell. The great variety of protein structures and functions has
stimulated the development of a multitude of techniques to study them.

Proteins Can Be Separated by SDS Polyacrylamide-Gel 
Electrophoresis 

Proteins usually possess a net positive or negative charge, depending on the mixture
of charged amino acids they contain. An electric field applied to a solution
containing a protein molecule causes the protein to migrate at a rate that depends
on its net charge and on its size and shape. The most popular application of this
property is SDS polyacrylamide-gel electrophoresis (SDS-PAGE). It uses a highly
cross-linked gel of polyacrylamide as the inert matrix through which the proteins
migrate. The gel is prepared by polymerization of monomers; the pore size of the
gel can be adjusted so that it is small enough to retard the migration of the protein
molecules of interest. The proteins are dissolved in a solution that includes a
powerful negatively charged detergent, sodium dodecyl sulfate, or SDS (Figure 8-12).
Because this detergent binds to hydrophobic regions of the protein molecules,
causing them to unfold into extended polypeptide chains, the individual
protein molecules are released from their associations with other proteins or lipid
molecules and rendered freely soluble in the detergent solution. In addition, a
reducing agent such as β-mercaptoethanol (see Figure 8-12) is usually added to
break any S-S linkages in the proteins, so that all of the constituent polypeptides
in multisubunit proteins can be analyzed separately.

What happens when a mixture of SDS-solubilized proteins is run through
a slab of polyacrylamide gel? Each protein molecule binds large numbers of
the negatively charged detergent molecules, which mask the protein's intrinsic
charge and cause it to migrate toward the positive electrode when a voltage
is applied. Proteins of the same size tend to move through the gel with similar
speeds because (1) their native structure is completely unfolded by the SDS, so
that their shapes are the same, and (2) they bind the same amount of SDS and
therefore have the same amount of negative charge. Larger proteins, with more
charge, are subjected to larger electrical forces but also to a larger drag. In free
solution, the two effects would cancel out, but, in the mesh of the polyacrylamide
gel, which acts as a molecular sieve, large proteins are retarded much more than
small ones. As a result, a complex mixture of proteins is fractionated into a series
of discrete protein bands arranged in order of molecular weight (Figure 8-13).
The major proteins are readily detected by staining the proteins in the gel with
a dye such as Coomassie blue. Even minor proteins are seen in gels treated with
a silver stain, so that as little as 10 ng of protein can be detected in a band. For
some purposes, specific proteins can also be labeled with a radioactive isotope
tag; exposure of the gel to film results in an autoradiograph on which the labeled
proteins are visible (see Figure 8-16).

SDS-PAGE is widely used because it can separate all types of proteins, including
those that are normally insoluble in water, such as the many proteins in
membranes. And because the method separates polypeptides by size, it provides
information about the molecular weight and the subunit composition of proteins.
Figure 8-14 presents a photograph of a gel that has been used to analyze each of
the successive stages in the purification of a protein.
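
One common way to turn band positions into approximate molecular weights is to run marker proteins of known size alongside the sample and fit the logarithm of molecular weight against migration distance. The text does not spell out this procedure; the sketch below is a generic least-squares version of it, and the marker sizes and relative mobilities are made-up illustrative numbers, not measurements.

```python
import math


def fit_log_mw_line(markers):
    """Fit log10(molecular weight) = slope * mobility + intercept.

    markers -- list of (relative_mobility, molecular_weight_in_daltons)
    Returns (slope, intercept) from an ordinary least-squares fit.
    """
    xs = [mobility for mobility, _ in markers]
    ys = [math.log10(weight) for _, weight in markers]
    n = len(markers)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_mw(mobility, slope, intercept):
    """Estimate molecular weight (daltons) for a band at the given mobility."""
    return 10 ** (slope * mobility + intercept)


if __name__ == "__main__":
    # Hypothetical marker lane: (relative mobility, molecular weight).
    hypothetical_markers = [
        (0.15, 97_000), (0.30, 66_000), (0.50, 45_000),
        (0.70, 30_000), (0.90, 20_000),
    ]
    slope, intercept = fit_log_mw_line(hypothetical_markers)
    unknown_band = 0.60  # hypothetical mobility of the protein of interest
    print(f"estimated size: {estimate_mw(unknown_band, slope, intercept):,.0f} daltons")
```

Such an estimate is only approximate; as the caption to Figure 8-13 notes, heavily glycosylated or otherwise modified proteins migrate anomalously.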

Two-Dimensional Gel Electrophoresis Provides Greater Protein
Separation

Because different proteins can have similar sizes, shapes, masses, and overall
charges, most separation techniques such as SDS polyacrylamide-gel electrophoresis
or ion-exchange chromatography cannot typically separate all the proteins
in a cell or even in an organelle. In contrast, two-dimensional gel electrophoresis,
which combines two different separation procedures, can resolve up to 2000
proteins in the form of a two-dimensional protein map.

Figure 8-12 The detergent sodium dodecyl sulfate (SDS) and the reducing
agent β-mercaptoethanol. These two chemicals are used to solubilize proteins for
SDS polyacrylamide-gel electrophoresis. The SDS is shown here in its ionized form.

Figure 8-13 SDS polyacrylamide-gel electrophoresis (SDS-PAGE). (A) An
electrophoresis apparatus. (B) Individual polypeptide chains form a complex
with negatively charged molecules of sodium dodecyl sulfate (SDS) and
therefore migrate as a negatively charged SDS-protein complex through a
porous gel of polyacrylamide. Because the speed of migration under these
conditions is greater the smaller the polypeptide, this technique can be used
to determine the approximate molecular weight of a polypeptide chain as
well as the subunit composition of a protein. If the protein contains a large
amount of carbohydrate, however, it will move anomalously on the gel and its
apparent molecular weight estimated by SDS-PAGE will be misleading. Other
modifications, such as phosphorylation, can also cause small changes in a
protein’s migration in the gel.

Figure 8-14 Analysis of protein samples by SDS polyacrylamide-gel
electrophoresis. The photograph shows a Coomassie-stained gel that
has been used to detect the proteins present at successive stages in the
purification of an enzyme. The leftmost lane (lane 1) contains the complex
mixture of proteins in the starting cell extract, and each succeeding lane
analyzes the proteins obtained after a chromatographic fractionation of the
protein sample analyzed in the previous lane (see Figure 8-10). The same
total amount of protein (10 µg) was loaded onto the gel at the top of each
lane. Individual proteins normally appear as sharp, dye-stained bands; a band
broadens, however, when it contains a large amount of protein. (From
T. Formosa and B.M. Alberts, J. Biol. Chem. 261:6107-6118, 1986.)

In the first step, the proteins are separated by their intrinsic charges. The sample
is dissolved in a small volume of a solution containing a nonionic (uncharged)
detergent, together with β-mercaptoethanol and the denaturing reagent urea.
This solution solubilizes, denatures, and dissociates all the polypeptide chains
but leaves their intrinsic charge unchanged. The polypeptide chains are then
separated in a pH gradient by a procedure called isoelectric focusing, which takes
advantage of the variation in the net charge on a protein molecule with the pH of
its surrounding solution. Every protein has a characteristic isoelectric point, the
pH at which the protein has no net charge and therefore does not migrate in an
electric field. In isoelectric focusing, proteins are separated electrophoretically in
a narrow tube of polyacrylamide gel in which a gradient of pH is established by
a mixture of special buffers. Each protein moves to a position in the gradient that
corresponds to its isoelectric point and remains there (Figure 8-15). This is the
first dimension of two-dimensional polyacrylamide-gel electrophoresis.
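
The idea of an isoelectric point can be made concrete with a small calculation. The sketch below is only an illustration, not a method given in this chapter: it assumes that each ionizable group titrates independently according to the Henderson-Hasselbalch relation, and it uses rounded, generic pKa values (the tables `PKA_BASIC` and `PKA_ACIDIC` are assumptions; real values shift with the local environment of each group). The function names are hypothetical.

```python
# Approximate pKa values for ionizable groups. These are assumed, rounded
# averages chosen for illustration; real proteins deviate from them.
PKA_BASIC = {"K": 10.5, "R": 12.5, "H": 6.0, "N_TERM": 9.0}
PKA_ACIDIC = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1, "C_TERM": 2.0}


def net_charge(sequence: str, ph: float) -> float:
    """Estimate the net charge of a polypeptide chain at a given pH."""
    counts = {aa: sequence.count(aa) for aa in "KRHDECY"}
    counts["N_TERM"] = 1
    counts["C_TERM"] = 1
    charge = 0.0
    # Basic groups carry +1 when protonated.
    for group, pka in PKA_BASIC.items():
        charge += counts[group] / (1.0 + 10 ** (ph - pka))
    # Acidic groups carry -1 when deprotonated.
    for group, pka in PKA_ACIDIC.items():
        charge -= counts[group] / (1.0 + 10 ** (pka - ph))
    return charge


def isoelectric_point(sequence: str, lo: float = 0.0, hi: float = 14.0,
                      tol: float = 1e-4) -> float:
    """Find the pH of zero net charge by bisection.

    Net charge falls monotonically as pH rises, so bisection is sufficient.
    """
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if net_charge(sequence, mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0
```

Comparing two short made-up sequences such as `isoelectric_point("AAKAA")` and `isoelectric_point("AAEAA")` shows how exchanging a single charged residue moves the predicted focusing position along the pH gradient.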

In the second step, the narrow tube gel containing the separated proteins is
again subjected to electrophoresis but in a direction that is at a right angle to the
direction used in the first step. This time SDS is added, and the proteins separate
according to their size, as in one-dimensional SDS-PAGE: the original tube gel is
soaked in SDS and then placed along the top edge of an SDS polyacrylamide-gel
slab, through which each polypeptide chain migrates to form a discrete spot. This
is the second dimension of two-dimensional polyacrylamide-gel electrophoresis.
The only proteins left unresolved are those that have both identical sizes and
identical isoelectric points, a relatively rare situation. Even trace amounts of each
polypeptide chain can be detected on the gel by various staining procedures, or
by autoradiography if the protein sample was initially labeled with a radioisotope
(Figure 8-16). The technique has such great resolving power that it can distinguish
between two proteins that differ in only a single charged amino acid, or a
single negatively charged phosphorylation site.


Figure 8-15 Separation of protein molecules by isoelectric focusing.
At low pH (high H+ concentration), the carboxylic acid groups of proteins tend to
be uncharged (-COOH) and their nitrogen-containing basic groups fully charged (for
example, -NH3+), giving most proteins a net positive charge. At high pH, the
carboxylic acid groups are negatively charged (-COO-) and the basic groups
tend to be uncharged (for example, -NH2), giving most proteins a net negative charge.
At its isoelectric pH, a protein has no net charge since the positive and negative
charges balance. Thus, when a tube containing a fixed pH gradient is subjected
to a strong electric field in the appropriate direction, each protein species migrates
until it forms a sharp band at its isoelectric pH, as shown.

Figure 8-16 Two-dimensional polyacrylamide-gel electrophoresis.
All the proteins in an E. coli bacterial cell are separated in this gel, in which each spot
corresponds to a different polypeptide chain. The proteins were first separated on the
basis of their isoelectric points by isoelectric focusing in the horizontal dimension. They
were then further fractionated according to their molecular mass by electrophoresis from
top to bottom in the presence of SDS. Note that different proteins are present in very
different amounts. The bacteria were fed with a mixture of radioisotope-labeled amino acids
so that all of their proteins were radioactive and could be detected by autoradiography.
(Courtesy of Patrick O’Farrell.)


Specific Proteins Can Be Detected by Blotting with Antibodies

A specific protein can be identified after its fractionation on a polyacrylamide
gel by exposing all the proteins present on the gel to a specific antibody that has
been labeled with a radioactive isotope or a fluorescent dye. This procedure is
normally carried out after transferring all of the separated proteins present in the
gel onto a sheet of nitrocellulose paper or nylon membrane. Placing the membrane
over the gel and driving the proteins out of the gel with a strong electric
current transfers the protein onto the membrane. The membrane is then soaked
in a solution of labeled antibody to reveal the protein of interest. This method of
detecting proteins is called Western blotting, or immunoblotting (Figure 8-17).
Sensitive Western-blotting methods can detect very small amounts of a specific
protein (1 nanogram or less) in a total cell extract or some other heterogeneous
protein mixture. The method can be very useful when assessing the amounts of a
specific protein in the cell or when measuring changes in those amounts under
various conditions.

Hydrodynamic Measurements Reveal the Size and Shape of a 
Protein Complex 

Most proteins in a cell act as part of larger complexes, and knowledge of the size
and shape of these complexes often leads to insights regarding their function. This
information can be obtained in several important ways. Sometimes, a complex
can be directly visualized using electron microscopy, as described in Chapter 9.
A complementary approach relies on the hydrodynamic properties of a complex;
that is, its behavior as it moves through a liquid medium. Usually, two separate
measurements are made. One measure is the velocity of a complex as it moves
under the influence of a centrifugal field produced by an ultracentrifuge (see
Figure 8-7A). The sedimentation coefficient (or S value) obtained depends on both
the size and the shape of the complex and does not, by itself, convey especially
useful information. However, once a second hydrodynamic measurement is
performed, by charting the migration of a complex through a gel-filtration
chromatography column (see Figure 8-9B), both the approximate shape of a complex
and its molecular weight can be calculated.
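
One common way to combine the two measurements, not spelled out in the text, treats the gel-filtration result as a Stokes radius and inserts it, together with the sedimentation coefficient, into the Svedberg relation. The sketch below assumes that approach. The solvent viscosity, solvent density, and partial specific volume are assumed typical values for a protein in dilute aqueous buffer at 20°C, and the function names and example inputs are hypothetical.

```python
import math

AVOGADRO = 6.02214076e23  # molecules per mole


def molecular_weight(s_svedberg, stokes_radius_nm,
                     viscosity=1.002e-3,               # Pa*s, water at 20 C (assumed)
                     partial_specific_volume=0.73e-3,  # m^3/kg, typical protein (assumed)
                     solvent_density=998.0):           # kg/m^3, dilute buffer (assumed)
    """Estimate molecular weight (g/mol) from an S value and a Stokes radius.

    Uses M = 6*pi*eta*N_A*R_s*s / (1 - vbar*rho).
    """
    s = s_svedberg * 1e-13            # 1 Svedberg unit = 1e-13 seconds
    r = stokes_radius_nm * 1e-9       # nanometers to meters
    buoyancy = 1.0 - partial_specific_volume * solvent_density
    kg_per_mol = 6.0 * math.pi * viscosity * AVOGADRO * r * s / buoyancy
    return kg_per_mol * 1000.0


def frictional_ratio(mw, stokes_radius_nm, partial_specific_volume=0.73e-3):
    """Ratio of the Stokes radius to the radius of a compact sphere of equal mass.

    Values near 1 suggest a compact, roughly spherical particle; larger
    values suggest an elongated or highly hydrated one.
    """
    volume = (mw / 1000.0) * partial_specific_volume / AVOGADRO  # m^3 per molecule
    r_sphere = (3.0 * volume / (4.0 * math.pi)) ** (1.0 / 3.0)
    return (stokes_radius_nm * 1e-9) / r_sphere
```

With illustrative inputs such as `molecular_weight(11.3, 5.2)`, the first function returns a mass estimate, and passing that estimate to `frictional_ratio` gives a rough indication of shape. This mirrors the point made above that neither measurement is very informative alone.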

Molecular weight can also be determined more directly by using an analytical
ultracentrifuge, a complex device that allows protein absorbance measurements
to be made on a sample while it is subjected to centrifugal forces. In this approach,
the sample is centrifuged until it reaches equilibrium, where the centrifugal force
on a protein complex exactly balances its tendency to diffuse away. Because this
balancing point is dependent on a complex's molecular weight but not on its
particular shape, the molecular weight can be directly calculated.

Mass Spectrometry Provides a Highly Sensitive Method for 
Identifying Unknown Proteins 

A frequent problem in cell biology and biochemistry is the identification of a protein
or collection of proteins that has been obtained by one of the purification
procedures discussed in the preceding pages. Because the genome sequences of most
experimental organisms are now known, catalogs of all the proteins produced in
those organisms are available. The task of identifying an unknown protein (or
collection of unknown proteins) thus reduces to matching some of the amino acid
sequences present in the unknown sample with known cataloged genes. This task
is now performed almost exclusively by using mass spectrometry in conjunction
with computer searches of databases.

Figure 8-17 Western blotting. All the proteins from dividing tobacco
cells in culture were first separated by two-dimensional polyacrylamide-gel
electrophoresis. In (A), the positions of the proteins are revealed by a sensitive
protein stain. In (B), the separated proteins on an identical gel were then transferred
to a sheet of nitrocellulose and exposed to an antibody that recognizes only those
proteins that are phosphorylated on threonine residues during mitosis. The
positions of the few proteins that are recognized by this antibody are revealed by
an enzyme-linked second antibody. (From J.A. Traas et al., Plant J. 2:723-732, 1992.
With permission from Blackwell Publishing.)

Figure 8-18 The mass spectrometer. (A) Mass spectrometers used in biology contain an ion source that generates gaseous
peptides or other molecules under conditions that render most molecules positively charged. The two major types of ion source
are MALDI and electrospray, as described in the text. Ions are accelerated into a mass analyzer, which separates the ions
on the basis of their mass and charge by one of three major methods: 1. Time-of-flight (TOF) analyzers determine the
mass-to-charge ratio of each ion in the mixture from the rate at which it travels from the ion source to the detector. 2. Quadrupole
mass filters contain a long chamber lined by four electrodes that produce oscillating electric fields that govern the trajectory of
ions; by varying the properties of the electric field over a wide range, a spectrum of ions with specific mass-to-charge ratios is
allowed to pass through the chamber to the detector, while other ions are discarded. 3. Ion traps contain doughnut-shaped
electrodes producing a three-dimensional electric field that traps all ions in a circular chamber; the properties of the electric
field can be varied over a wide range to eject a spectrum of specific ions to a detector. (B) Tandem mass spectrometry typically
involves two mass analyzers separated by a collision chamber containing an inert, high-energy gas. The electric field of the
first mass analyzer is adjusted to select a specific peptide ion, called a precursor ion, which is then directed to the collision
chamber. Collision of the peptide with gas molecules causes random peptide fragmentation, primarily at peptide bonds,
resulting in a highly complex mixture of fragments containing one or more amino acids from throughout the original peptide. The
second mass analyzer is then used to measure the masses of the fragments (called product or daughter ions). With computer
assistance, the pattern of fragments can be used to deduce the amino acid sequence of the original peptide.

Charged particles have very precise dynamics when subjected to electrical and
magnetic fields in a vacuum. Mass spectrometry exploits this principle to separate
ions according to their mass-to-charge (m/z) ratio. It is an enormously sensitive
technique. It requires very little material and is capable of determining the precise
mass of intact proteins and of peptides derived from them by enzymatic or chemical
cleavage. Masses can be obtained with great accuracy, often with an error of
less than one part in a million.

Mass spectrometry is performed using complex instruments with three major
components (Figure 8-18A). The first is the ion source, which transforms tiny
amounts of a peptide sample into a gas containing individual charged peptide
molecules. These ions are accelerated by an electric field into the second component,
the mass analyzer, where electric or magnetic fields are used to separate the
ions on the basis of their mass-to-charge ratios. Finally, the separated ions collide
with a detector, which generates a mass spectrum containing a series of peaks
representing the masses of the molecules in the sample.
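
For the time-of-flight analyzer listed in Figure 8-18A, the link between mass-to-charge ratio and travel time follows from elementary physics. The sketch below assumes an idealized instrument in which every ion starts at rest, gains kinetic energy zeV from a single accelerating voltage, and then drifts through a field-free tube. The default voltage and tube length are hypothetical values, and the function names are chosen for illustration only.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, coulombs
DALTON = 1.66053906660e-27   # atomic mass constant, kilograms


def flight_time_us(mass_da, charge, voltage=20_000.0, tube_length_m=1.0):
    """Drift time in microseconds for an ion of given mass (Da) and charge (z).

    Idealized: the ion gains kinetic energy z*e*V, then drifts at constant speed.
    """
    kinetic_energy = charge * E_CHARGE * voltage            # joules
    velocity = math.sqrt(2.0 * kinetic_energy / (mass_da * DALTON))
    return tube_length_m / velocity * 1e6


def mz_from_flight_time(time_us, voltage=20_000.0, tube_length_m=1.0):
    """Invert the relation above: m/z = 2*e*V*t^2 / L^2, expressed in Da per charge."""
    t = time_us * 1e-6
    return 2.0 * E_CHARGE * voltage * t ** 2 / (tube_length_m ** 2 * DALTON)
```

Because only the ratio m/z enters the flight time, a doubly charged ion of 3000 daltons arrives together with a singly charged ion of 1500 daltons in this idealized picture.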

There are many different types of mass spectrometer, varying mainly in the
nature of their ion sources and mass analyzers. One of the most common ion
sources depends on a technique called matrix-assisted laser desorption ionization
(MALDI). In this approach, the proteins in the sample are first cleaved into short
peptides by a protease such as trypsin. These peptides are mixed with an organic
acid and then dried onto a metal or ceramic slide. A brief laser burst is directed
toward the sample, producing a gaseous puff of ionized peptides, each carrying
one or more positive charges. In many cases, the MALDI ion source is coupled to
a mass analyzer called a time-of-flight (TOF) analyzer, which is a long chamber
through which the ionized peptides are accelerated by an electric field toward a
detector. Their mass and charge determine the time it takes them to reach the
detector: large peptides move more slowly, and more highly charged molecules
move more quickly. By analyzing those ionized peptides that bear a single charge,
the precise masses of peptides present in the original sample can be determined.
This information is then used to search genomic databases, in which the masses
of all proteins and of all their predicted peptide fragments have been tabulated
from the genomic sequences of the organism. An unambiguous match to a
particular open reading frame can often be made by knowing the mass of only a few
peptides derived from a given protein.
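
The database search described above can be sketched in a few lines. The example below is a deliberately simplified illustration, not the algorithm used by any particular search program. It digests each candidate sequence with a common simplified trypsin rule (cut after lysine or arginine unless proline follows, which is an assumption about the rule), sums standard monoisotopic residue masses, and counts how many observed masses fall within a tolerance. The two sequences in `TOY_DATABASE` are invented.

```python
# Monoisotopic residue masses in daltons (standard values).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056

# Hypothetical sequences, invented purely for this illustration.
TOY_DATABASE = {
    "protA": "MKTAYIAKQRQISFVK",
    "protB": "GASPVTCLNDQKEMHFRYW",
}


def tryptic_digest(protein):
    """Cleave after K or R, except when the next residue is P."""
    peptides, start = [], 0
    for i, aa in enumerate(protein):
        next_aa = protein[i + 1] if i + 1 < len(protein) else ""
        if aa in "KR" and next_aa != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])
    return peptides


def peptide_mass(peptide):
    """Neutral monoisotopic mass: residues plus one water for the free termini."""
    return sum(MONO[aa] for aa in peptide) + WATER


def best_match(observed_masses, database, tol_ppm=10.0):
    """Score each protein by the number of observed masses it can explain."""
    scores = {}
    for name, sequence in database.items():
        predicted = [peptide_mass(p) for p in tryptic_digest(sequence)]
        scores[name] = sum(
            any(abs(obs - m) / m * 1e6 <= tol_ppm for m in predicted)
            for obs in observed_masses
        )
    return max(scores, key=scores.get), scores
```

Real search engines also allow for missed cleavages, chemical modifications, and statistical scoring. The sketch shows only why a handful of accurate peptide masses can be enough to single out one open reading frame.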

By employing two mass analyzers in tandem (an arrangement known as MS/MS;
Figure 8-18B), it is possible to directly determine the amino acid sequences of
individual peptides in a complex mixture. The MALDI-TOF instrument described
above is not ideal for this method. Instead, MS/MS typically involves an
electrospray ion source, which produces a continuous thin stream of peptides that are
ionized and accelerated into the first mass analyzer. The mass analyzer is typically
either a quadrupole or ion trap, which employs large electrodes to produce
oscillating electric fields inside the chamber containing the ions. These instruments
act as mass filters: the electric field is adjusted over a broad range to select
a single peptide ion and discard all the others in the peptide mixture. In tandem
mass spectrometry, this single ion is then exposed to an inert, high-energy gas,
which collides with the peptide, resulting in fragmentation, primarily at peptide
bonds. The second mass analyzer then determines the masses of the peptide
fragments, which can be used by computational methods to determine the amino
acid sequence of the original peptide and thereby identify the protein from which
it came.

Tandem mass spectrometry is also useful for detecting and precisely mapping
post-translational modifications of proteins, such as phosphorylations or
acetylations. Because these modifications impart a characteristic mass increase to an
amino acid, they are easily detected during the analysis of peptide fragments in
the second mass analyzer, and the precise site of the modification can often be
deduced from the spectrum of peptide fragments.
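
How a fragment spectrum encodes both sequence and modification site can be illustrated with the two fragment ladders most often considered for peptide-bond cleavage, conventionally called b ions (N-terminal pieces) and y ions (C-terminal pieces). The names and the ladder formulas are standard conventions rather than material from this chapter. The sketch assumes singly charged fragments and uses the mass shift of a phosphate group (about 79.966 daltons). The test peptide is arbitrary and the function names are hypothetical.

```python
# Monoisotopic residue masses in daltons (standard values).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
PROTON = 1.007276
PHOSPHO = 79.96633   # mass added by one phosphate group (HPO3)


def fragment_ions(peptide, mods=None):
    """Return singly charged b-ion and y-ion m/z ladders.

    mods maps a 0-based residue position to an added mass, for example
    {3: PHOSPHO} for a phosphorylated fourth residue.
    """
    mods = mods or {}
    masses = [MONO[aa] + mods.get(i, 0.0) for i, aa in enumerate(peptide)]
    n = len(peptide)
    b = [sum(masses[:i]) + PROTON for i in range(1, n)]
    y = [sum(masses[-i:]) + WATER + PROTON for i in range(1, n)]
    return b, y


def shifted_ions(reference, observed, shift, tol=0.01):
    """List the 1-based ladder positions whose m/z moved by the given shift."""
    return [i + 1 for i, (r, o) in enumerate(zip(reference, observed))
            if abs(o - r - shift) <= tol]
```

Successive differences within one ladder equal residue masses, which is what allows a sequence to be read off. When the ladder of `fragment_ions("PEPTIDE", {3: PHOSPHO})` is compared with the unmodified ladder, only the fragments that contain the fourth residue are shifted, which pins the modification to that position.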

A powerful, "two-dimensional" mass spectrometry technique can be used to
determine all of the proteins present in an organelle or another complex mixture
of proteins. First, the mixture of proteins present is digested with trypsin to
produce short peptides. Next, these peptides are separated by automated
high-performance liquid chromatography (LC). Every peptide fraction from the
chromatographic column is injected directly into an electrospray ion source on a tandem
mass spectrometer (MS/MS), providing the amino acid sequence and
post-translational modifications for every peptide in the mixture. This method, often called
LC-MS/MS, is used to identify hundreds or thousands of proteins in complex
protein mixtures from specific organelles or from whole cells. It can also be used to
map all of the phosphorylation sites in the cell, or all of the proteins targeted by
other post-translational modifications such as acetylation or ubiquitylation.

Sets of Interacting Proteins Can Be Identified by Biochemical 
Methods 

Because most proteins in the cell function as part of complexes with other
proteins, an important way to begin to characterize the biological role of an unknown
protein is to identify all of the other proteins to which it specifically binds.

A key method for identifying proteins that bind to one another tightly is
co-immunoprecipitation. A specific target protein is immunoprecipitated from a cell
lysate using specific antibodies coupled to beads, as described earlier. If the target
protein is associated tightly enough with another protein when it is captured by
the antibody, the partner precipitates as well and can be identified by mass
spectrometry. This method is useful for identifying proteins that are part of a complex
inside cells, including those that interact only transiently, for example, when
extracellular signal molecules stimulate cells (discussed in Chapter 15).

In addition to capturing protein complexes on columns or in test tubes,
researchers are developing high-density protein arrays to investigate protein
interactions. These arrays, which contain thousands of different proteins or
antibodies spotted onto glass slides or immobilized in tiny wells, allow one to examine
the biochemical activities and binding profiles of a large number of proteins
at once. For example, if one incubates a fluorescently labeled protein with arrays
containing thousands of immobilized proteins, the spots that remain fluorescent
after extensive washing each contain a protein that specifically binds the labeled
protein.

Optical Methods Can Monitor Protein Interactions 

Once two proteins (or a protein and a small molecule) are known to associate,
it becomes important to characterize their interaction in more detail. Proteins
can associate with each other more or less permanently (like the subunits of RNA
polymerase or the proteasome), or engage in transient encounters that may last
only a few milliseconds (like a protein kinase and its substrate). To understand
how a protein functions inside a cell, we need to determine how tightly it binds to
other proteins, how rapidly it dissociates from them, and how covalent
modifications, small molecules, or other proteins influence these interactions.

As we discussed in Chapter 3 (see Figure 3-44), the extent to which two proteins
interact is determined by the rates at which they associate and dissociate.
These rates depend, respectively, on the association rate constant (k_on) and
dissociation rate constant (k_off). The kinetic rate constant k_off is a particularly useful
number because it provides valuable information about how long two proteins
remain bound to one another. The ratio of the two kinetic constants (k_on/k_off)
yields another very useful number called the equilibrium constant (K, also known
as K_eq or K_a), the inverse of which is the more commonly used dissociation
constant K_d. The equilibrium constant is useful as a general indicator of the affinity
of the interaction, and it can be used to estimate the amount of bound complex at
different concentrations of the two protein partners, thereby providing insights
into the importance of the interaction at the protein concentrations found inside
the cell.
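
The relationships among these constants are simple enough to express directly. The sketch below assumes a one-to-one binding reaction. The function names are hypothetical, and `fraction_bound` further assumes that the concentration supplied is that of the free partner, which is approximately the total concentration when that partner is in large excess.

```python
import math


def dissociation_constant(k_on, k_off):
    """K_d in molar units, for k_on in M^-1 s^-1 and k_off in s^-1."""
    return k_off / k_on


def complex_half_life(k_off):
    """Time in seconds for half of the existing complexes to dissociate."""
    return math.log(2) / k_off


def fraction_bound(free_partner_conc, k_d):
    """Fraction of a protein in complex at a given free partner concentration.

    Both arguments must be in the same concentration units.
    """
    return free_partner_conc / (k_d + free_partner_conc)
```

For example, with illustrative values of k_on = 1e6 M^-1 s^-1 and k_off = 1e-2 s^-1, `dissociation_constant` gives a K_d of 1e-8 M (10 nM), `complex_half_life(1e-2)` is a little over a minute, and `fraction_bound` returns exactly one-half when the free partner concentration equals K_d.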

A wide range of methods can be used to determine binding constants for a
two-protein complex. In a simple equilibrium binding experiment, two proteins
are mixed at a range of concentrations, allowed to reach equilibrium, and the
amount of bound complex is measured; half-maximal binding is reached
at a concentration that is equal to K_d. Equilibrium experiments often involve the
use of radioactive or fluorescent tags on one of the protein partners, coupled with
biochemical or optical methods for measuring the amount of bound protein. In
a more complex kinetic binding experiment, the kinetic rate constants are
determined using rapid methods that allow real-time measurement of the formation
of a bound complex over time (to determine k_on) or the dissociation of a bound
complex over time (to determine k_off).

Optical techniques provide particularly rapid, convenient, and accurate binding
measurements, and in some cases the proteins do not even need to be labeled.
Certain amino acids (tryptophan, for example) exhibit weak fluorescence that can
be detected with sensitive fluorimeters. In many cases, the fluorescence intensity,
or the emission spectrum of fluorescent amino acids located in a protein-protein
interface, will change when two proteins associate. When this change can
be detected by fluorimetry, it provides a simple and sensitive measure of protein
binding that is useful in both equilibrium and kinetic binding experiments. A
related but more widely useful optical binding technique is based on fluorescence
anisotropy, a change in the polarized light that is emitted by a fluorescently tagged
protein in the bound and free states (Figure 8-19).

Figure 8-19 Measurement of binding with fluorescence anisotropy. This method depends on a fluorescently tagged
protein that is illuminated with polarized light at the appropriate wavelength for excitation; a fluorimeter is used to measure the
intensity and polarization of the emitted light. If the fluorescent protein is fixed in position and therefore does not rotate during
the brief period between excitation and emission, then the emitted light will be polarized at the same angle as the excitation
light. This directional effect is called fluorescence anisotropy. Protein molecules in solution rotate or tumble rapidly, however,
so that there is a decrease in the amount of anisotropic fluorescence. Larger molecules tumble at a slower rate and therefore
have higher fluorescence anisotropy. (A) To measure the binding between a small molecule and a large receptor protein, the
smaller molecule is labeled with a fluorophore. In the absence of its binding partner, the molecule tumbles rapidly, resulting in
low fluorescence anisotropy (top). When the small molecule binds to its larger partner, however, it tumbles less rapidly, resulting
in an increase in fluorescence anisotropy (bottom). (B) In the equilibrium binding experiment shown here, a small, fluorescent
peptide ligand was present at a low concentration, and the amount of fluorescence anisotropy (in millipolarization units, mP) was
measured after incubation with various concentrations of a larger protein receptor for the ligand. From the hyperbolic curve that
fits the data, it can be seen that 50% binding occurred at about 10 µM, which is equal to the dissociation constant K_d for the
binding interaction.
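
The hyperbolic fit mentioned in the caption of Figure 8-19 can be sketched as follows. This is a minimal illustration on simulated, noise-free data, not a reconstruction of the experiment in the figure. It assumes that the labeled ligand is present far below K_d, so that free receptor is approximately total receptor, and that the observed anisotropy is a simple weighted average of free and bound values. The anisotropy limits of 60 and 200 mP are hypothetical. For each trial K_d the model is linear in its other two parameters, so an ordinary straight-line fit is nested inside a scan over K_d.

```python
def simulate_anisotropy(receptor_um, kd_um, r_free=60.0, r_bound=200.0):
    """Expected anisotropy (mP) at a given receptor concentration (micromolar).

    The limiting values r_free and r_bound are hypothetical.
    """
    bound = receptor_um / (kd_um + receptor_um)
    return r_free + (r_bound - r_free) * bound


def fit_kd(concs_um, signals, kd_grid):
    """Return (K_d, r_free, r_bound) minimizing the squared error over kd_grid."""
    best = None
    n = len(concs_um)
    for kd in kd_grid:
        x = [c / (kd + c) for c in concs_um]       # fraction bound at this K_d
        mean_x = sum(x) / n
        mean_y = sum(signals) / n
        sxx = sum((xi - mean_x) ** 2 for xi in x)
        sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, signals))
        amplitude = sxy / sxx                      # r_bound - r_free
        baseline = mean_y - amplitude * mean_x     # r_free
        sse = sum((baseline + amplitude * xi - yi) ** 2
                  for xi, yi in zip(x, signals))
        if best is None or sse < best[0]:
            best = (sse, kd, baseline, baseline + amplitude)
    return best[1:]


# Simulated titration with a true K_d of 10 micromolar.
CONCS_UM = [0.5, 1, 2, 5, 10, 20, 50, 100, 200]
SIGNALS_MP = [simulate_anisotropy(c, kd_um=10.0) for c in CONCS_UM]
# Trial K_d values, log-spaced from 0.1 to 1000 micromolar.
KD_GRID = [10 ** (-1 + 4 * i / 400) for i in range(401)]
```

Calling `fit_kd(CONCS_UM, SIGNALS_MP, KD_GRID)` recovers the K_d used to simulate the data. With real measurements, noise and ligand depletion would call for a more careful treatment than this grid scan.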


Another optical method for probing protein interactions uses green fluorescent
protein (discussed in detail below) and its derivatives of different colors. In this
application, two proteins of interest are each labeled with a different fluorescent
protein, such that the emission spectrum of one fluorescent protein overlaps the
absorption spectrum of the second. If the two proteins come very close to each
other (within about 1-5 nm), the energy of the absorbed light is transferred from
one fluorescent protein to the other. The energy transfer, called fluorescence
resonance energy transfer (FRET), is determined by illuminating the first fluorescent
protein and measuring emission from the second (see Figure 9-26). When
combined with fluorescence microscopy, this method can be used to characterize
protein-protein interactions at specific locations inside living cells (discussed in
Chapter 9).
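
The steep distance dependence that makes FRET a useful proximity sensor is usually described by the Förster relation, which is standard photophysics rather than something derived in this chapter: the transfer efficiency falls off with the sixth power of the distance between the two fluorophores. In the sketch below the Förster radius, the distance at which half the energy is transferred, is set to 5 nm as an assumed round figure. The actual value depends on the particular pair of fluorescent proteins and their relative orientation. The function names are hypothetical.

```python
def fret_efficiency(distance_nm, forster_radius_nm=5.0):
    """Fraction of donor excitation energy transferred to the acceptor."""
    return 1.0 / (1.0 + (distance_nm / forster_radius_nm) ** 6)


def distance_from_efficiency(efficiency, forster_radius_nm=5.0):
    """Invert the relation to estimate separation from a measured efficiency."""
    return forster_radius_nm * (1.0 / efficiency - 1.0) ** (1.0 / 6.0)
```

Under this assumption, moving the fluorophores from 2 nm to 10 nm apart takes the efficiency from nearly complete transfer to almost none, which is why a FRET signal is taken as evidence that two proteins are in close contact.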

Protein Function Can Be Selectively Disrupted With Small
Molecules

Small chemical inhibitors of specific proteins have contributed a great deal to the
development of cell biology. For example, the microtubule inhibitor colchicine
is routinely used to test whether microtubules are required for a given biological
process; it also led to the first purification of tubulin several decades ago. In the
past, these small molecules were usually natural products; that is, they were
synthesized by living creatures. Although natural products have been extraordinarily
useful in science and medicine (see, for example, Table 6-4, p. 352), they act on
a limited number of biological processes. However, the recent development of
methods to synthesize hundreds of thousands of small molecules and to carry out
large-scale automated screens holds the promise of identifying chemical inhibitors
for virtually any biological process. In such approaches, large collections of
small chemical compounds are simultaneously tested, either on living cells or in
cell-free assays. Once an inhibitor is identified, it can be used as a probe to
identify, through affinity chromatography or other means, the protein to which the
inhibitor binds. This general strategy, sometimes called chemical biology, has
successfully identified inhibitors of many proteins that carry out key processes
in cell biology. An inhibitor of a kinesin protein that functions in mitosis, for
example, was identified by this method (Figure 8-20). Chemical inhibitors give
the cell biologist great control over the timing of inhibition, as drugs can be
rapidly added to or removed from cells, allowing protein function to be switched on
or off quickly.

Figure 8-20 Small-molecule inhibitors for manipulating living cells.
(A) Chemical structure of monastrol, a kinesin inhibitor identified in a
large-scale screen for small molecules that disrupt mitosis. (B) Normal mitotic
spindle seen in an untreated cell. The microtubules are stained green and
chromosomes blue. (C) Monopolar spindle that forms in cells treated with
monastrol, which inhibits a kinesin protein required for separation of the
spindle poles in early mitosis. (B and C, from T.U. Mayer et al., Science
286:971-974, 1999. With permission from AAAS.)

Protein Structure Can Be Determined Using X-Ray Diffraction 

The main technique that has been used to discover the three-dimensional structure
of molecules, including proteins, at atomic resolution is x-ray crystallography.
X-rays, like light, are a form of electromagnetic radiation, but they have a
much shorter wavelength, typically around 0.1 nm (the diameter of a hydrogen
atom). If a narrow beam of parallel x-rays is directed at a sample of a pure protein,
most of the x-rays pass straight through it. A small fraction, however, are scattered
by the atoms in the sample. If the sample is a well-ordered crystal, the scattered
waves reinforce one another at certain points and appear as diffraction spots
when recorded by a suitable detector (Figure 8-21).

The position and intensity of each spot in the x-ray diffraction pattern contain
information about the locations of the atoms in the crystal that gave rise to it.
Deducing the three-dimensional structure of a large molecule from the diffraction
pattern of its crystal is a complex task and was not achieved for a protein molecule
until 1960. But in recent years x-ray diffraction analysis has become increasingly
automated, and now the slowest step is likely to be the generation of suitable
protein crystals. This step requires large amounts of very pure protein and often
involves years of trial and error to discover the proper crystallization conditions;
the pace has greatly accelerated with the use of recombinant DNA techniques to
produce pure proteins and robotic techniques to test large numbers of
crystallization conditions.

Analysis of the resulting diffraction pattern produces a complex three-dimensional
electron-density map. Interpreting this map (translating its contours into
a three-dimensional structure) is a complicated procedure that requires knowledge
of the amino acid sequence of the protein. Largely by trial and error, the
sequence and the electron-density map are correlated by computer to give the
best possible fit. The reliability of the final atomic model depends on the
resolution of the original crystallographic data: 0.5 nm resolution might produce a
low-resolution map of the polypeptide backbone, whereas a resolution of 0.15 nm
allows all of the non-hydrogen atoms in the molecule to be reliably positioned.
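
The connection between resolution and where diffraction spots fall can be illustrated with Bragg's law, the standard condition for reinforcement of waves scattered from planes a distance d apart: nλ = 2d sin θ. Bragg's law is not stated in the text and is introduced here only as an aid. The sketch assumes the 0.1 nm wavelength mentioned earlier in this section, and the function names are hypothetical.

```python
import math


def bragg_angle_deg(d_spacing_nm, wavelength_nm=0.1, order=1):
    """Angle theta (degrees) at which planes of the given spacing diffract."""
    ratio = order * wavelength_nm / (2.0 * d_spacing_nm)
    if ratio > 1.0:
        raise ValueError("spacing too small to diffract at this wavelength")
    return math.degrees(math.asin(ratio))


def finest_spacing_nm(max_scattering_angle_deg, wavelength_nm=0.1):
    """Smallest spacing recorded, given the largest scattering angle (2*theta)."""
    theta = math.radians(max_scattering_angle_deg / 2.0)
    return wavelength_nm / (2.0 * math.sin(theta))
```

Finer detail diffracts to larger angles: `bragg_angle_deg(0.15)` is several times larger than `bragg_angle_deg(0.5)`. A crystal must therefore give measurable spots far from the direct beam before a map at 0.15 nm resolution can be calculated.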

A complete atomic model is often too complex to appreciate directly, but
simplified versions that show a protein's essential structural features can be readily
derived from it (see Panel 3-2, pp. 142-143). The three-dimensional structures of
tens of thousands of different proteins have been determined by x-ray
crystallography or by NMR spectroscopy (see page 461), enough to allow the grouping of
common structures into families (Movie 8.1). These structures or protein folds
often seem to be more conserved in evolution than are the amino acid sequences
that form them (see Figure 3-13).

X-ray crystallographic techniques can also be applied to the study of
macromolecular complexes. The method was used, for example, to determine the
structure of the ribosome, a large and complex machine made of several RNAs and
more than 50 proteins (see Figure 6-62). The determination required the use of a
synchrotron, a radiation source that generates x-rays with the intensity needed to
analyze the crystals of such large macromolecular complexes.

Figure 8-21 X-ray crystallography. (A) A narrow beam of x-rays is directed
at a well-ordered crystal (B). Shown here is a protein crystal of ribulose
bisphosphate carboxylase, an enzyme with a central role in CO2 fixation
during photosynthesis. The atoms in the crystal scatter some of the beam,
and the scattered waves reinforce one another at certain points and appear
as a pattern of diffraction spots (C). This diffraction pattern, together with
the amino acid sequence of the protein, can be used to produce an atomic
model (D). The complete atomic model is hard to interpret, but this simplified
version, derived from the x-ray diffraction data, shows the protein’s structural
features clearly (α helices, green; β strands, red). The components pictured in
A to D are not shown to scale. (B, courtesy of C. Branden; C, courtesy of J.
Hajdu and I. Andersson; D, adapted from original provided by B. Furugren.)


NMR Can Be Used to Determine Protein Structure in Solution

Nuclear magnetic resonance (NMR) spectroscopy has been widely used for
many years to analyze the structure of small molecules, small proteins, or protein
domains. Unlike x-ray crystallography, NMR does not depend on having a
crystalline sample. It simply requires a small volume of concentrated protein
solution that is placed in a strong magnetic field; indeed, it is the main technique that
yields detailed evidence about the three-dimensional structure of molecules in
solution.


Certain atomic nuclei, particularly hydrogen nuclei, have a magnetic moment
or spin: that is, they have an intrinsic magnetization, like a bar magnet. The spin
aligns along the strong magnetic field, but it can be changed to a misaligned,
excited state in response to applied radiofrequency (RF) pulses of electromagnetic
radiation. When the excited hydrogen nuclei return to their aligned state,
they emit RF radiation, which can be measured and displayed as a spectrum. The
nature of the emitted radiation depends on the environment of each hydrogen
nucleus, and if one nucleus is excited, it influences the absorption and emission
of radiation by other nuclei that lie close to it. It is consequently possible, by an
ingenious elaboration of the basic NMR technique known as two-dimensional
NMR, to distinguish the signals from hydrogen nuclei in different amino acid
residues, and to identify and measure the small shifts in these signals that occur when
these hydrogen nuclei lie close enough together to interact. Because the size of
such a shift reveals the distance between the interacting pair of hydrogen atoms,
NMR can provide information about the distances between the parts of the protein
molecule. By combining this information with a knowledge of the amino acid
sequence, it is possible in principle to compute the three-dimensional structure
of the protein (Figure 8-22).

Figure 8-22 NMR spectroscopy.
(A) An example of the data from an NMR machine. This two-dimensional NMR
spectrum is derived from the C-terminal domain of the enzyme cellulase. The spots
represent interactions between hydrogen atoms that are near neighbors in the
protein and hence reflect the distance that separates them. Complex computing
methods, in conjunction with the known amino acid sequence, enable possible
compatible structures to be derived.
(B) Ten structures of the enzyme, which all satisfy the distance constraints equally well,
are shown superimposed on one another, giving a good indication of the probable
three-dimensional structure. (Courtesy of P. Kraulis.)

For technical reasons, the structure of small proteins of about 20,000 daltons or
less can be most readily determined by NMR spectroscopy. Resolution decreases
as the size of a macromolecule increases. But recent technical advances have now
pushed the limit to about 100,000 daltons, thereby making the majority of
proteins accessible for structural analysis by NMR.

Because NMR studies are performed in solution, this method also offers a
convenient means of monitoring changes in protein structure, for example during
protein folding or when the protein binds to another molecule. NMR is also used
widely to investigate molecules other than proteins and is valuable, for example,
as a method to determine the three-dimensional structures of RNA molecules
and the complex carbohydrate side chains of glycoproteins.

A third major method for the determination of protein structure, and
particularly the structure of large protein complexes, is single-particle analysis by
electron microscopy. We discuss this approach in Chapter 9.

Protein Sequence and Structure Provide Clues About Protein 
Function 

Having discussed methods for purifying and analyzing proteins, we now turn to
a common situation in cell and molecular biology: an investigator has identified
a gene important for a biological process but has no direct knowledge of the
biochemical properties of its protein product.

Thanks to the proliferation of protein and nucleic acid sequences that are
cataloged in genome databases, the function of a gene—and its encoded protein—
can often be predicted by simply comparing its sequence with those of previously
characterized genes. Because amino acid sequence determines protein structure,
and structure dictates biochemical function, proteins that share a similar amino
acid sequence usually have the same structure and usually perform similar
biochemical functions, even when they are found in distantly related organisms. In
modern cell biology, the study of a newly discovered protein usually begins with
a search for previously characterized proteins that are similar in their amino acid
sequences.

[Figure 8-23: BLAST output aligning human Cdc2 (Query, residues 57-345) with
maize Cdc2 (Sbjct, residues 1-290). Score = 399 bits (1025), Expect = e-111;
Identities = 198/290 (68%), Positives = 241/290 (82%), Gaps = 1/290.]

Searching a collection of known sequences for similar genes or proteins is
typically done over the Internet, and it simply involves selecting a database and
entering the desired sequence. A sequence-alignment program—the most popular
is BLAST—scans the database for similar sequences by sliding the submitted
sequence along the archived sequences until a cluster of residues falls into full or
partial alignment (Figure 8-23). Such comparisons can predict the functions of
individual proteins, families of proteins, or even most of the protein complement
of a newly sequenced organism.
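
The idea of sliding one sequence along another can be illustrated with a minimal
sketch. This is not how BLAST itself is implemented (BLAST uses word seeding,
substitution matrices, gaps, and statistical scoring); the function name
`best_ungapped_alignment`, the `min_overlap` parameter, and the short test
sequences are all assumptions made for illustration. The sketch simply tries
every offset and counts identical residues, with no gaps allowed.

```python
def best_ungapped_alignment(query: str, subject: str, min_overlap: int = 5):
    """Slide `query` along `subject` (no gaps) and return a tuple
    (offset, identities, overlap) for the offset with the most identities.

    `offset` is the position in `subject` at which query[0] is placed;
    it may be negative if the query overhangs the start of the subject.
    """
    best = (0, 0, 0)
    first = -(len(query) - min_overlap)
    last = len(subject) - min_overlap
    for offset in range(first, last + 1):
        # Overlapping region, expressed in subject coordinates.
        start = max(0, offset)
        end = min(len(subject), offset + len(query))
        if end - start < min_overlap:
            continue
        identities = sum(
            1 for i in range(start, end) if subject[i] == query[i - offset]
        )
        if identities > best[1]:
            best = (offset, identities, end - start)
    return best


if __name__ == "__main__":
    # Hypothetical short sequences, chosen only to exercise the function.
    subject = "MEQYEKVEKIGEGTYGVVYK"
    query = "KVEKIGEGSYG"
    offset, identities, overlap = best_ungapped_alignment(query, subject)
    print(f"offset={offset}, identities={identities}/{overlap} "
          f"({100 * identities / overlap:.0f}%)")
```

A real search also needs a way to judge whether a match is better than chance,
which is the role of the Score and Expect values reported in the BLAST output.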

As was explained in Chapter 3, many proteins that adopt the same conformation
and have related functions are too distantly related to be identified from
a comparison of their amino acid sequences alone (see Figure 3-13). Thus, an
ability to reliably predict the three-dimensional structure of a protein from its
amino acid sequence would improve our ability to infer protein function from the
sequence information in genomic databases. In recent years, major progress has
been made in predicting the precise structure of a protein. These predictions are
based, in part, on our knowledge of the thousands of protein structures that have
already been determined by x-ray crystallography and NMR spectroscopy and,
in part, on computations using our knowledge of the physical forces acting on
the atoms. However, it remains a substantial and important challenge to predict
the structures of proteins that are large or have multiple domains, or to predict
structures at the very high levels of resolution needed to assist in computer-based
drug discovery.

While finding related sequences and structures for a new protein will provide
many clues about its function, it is usually necessary to test these insights through
direct experimentation. However, the clues generated from sequence comparisons
typically point the investigator in the correct experimental direction, and
their use has therefore become one of the most important strategies in modern
cell biology.

Summary 

Many methods exist for identifying proteins and analyzing their biochemical
properties, structures, and interactions with other proteins. Small-molecule inhibitors
allow the functions of proteins they act upon to be studied in living cells. Because
proteins with similar structures often have similar functions, the biochemical
activity of a protein can often be predicted by searching databases for previously
characterized proteins that are similar in their amino acid sequences.


Figure 8-23 Results of a BLAST search. Sequence databases can be searched
to find similar amino acid or nucleic acid sequences. Here, a search for proteins
similar to the human cell-cycle regulatory protein Cdc2 (Query) locates maize
Cdc2 (Sbjct), which is 68% identical to human Cdc2 in its amino acid sequence.
The alignment begins at residue 57 of the Query protein, suggesting that the
human protein has an N-terminal region that is absent from the maize protein.
The green blocks indicate differences in sequence, and the yellow bar summarizes
the similarities: when the two amino acid sequences are identical, the residue is
shown; similar amino acid substitutions are indicated by a plus sign (+). Only one
small gap has been introduced—indicated by the red arrow at position 194 in the
Query sequence—to align the two sequences maximally. The alignment score
(Score), which is expressed in two different types of units, takes into account
penalties for substitutions and gaps; the higher the alignment score, the better
the match. The significance of the alignment is reflected in the Expectation (E)
value, which specifies how often a match this good would be expected to occur
by chance. The lower the E value, the more significant the match; the extremely
low value here (e-111) indicates certain significance. E values much higher than
0.1 are unlikely to reflect true relatedness. For example, an E value of 0.1 means
there is a 1 in 10 likelihood that such a match would arise solely by chance.


ANALYZING AND MANIPULATING DNA

Until the early 1970s, DNA was the most difficult biological molecule for the
biochemist to analyze. Enormously long and chemically monotonous, the string of
nucleotides that forms the genetic material of an organism could be examined
only indirectly, by protein sequencing or by genetic analysis. Today, the situation
has changed entirely. From being the most difficult macromolecule of the cell to
analyze, DNA has become the easiest. It is now possible to determine the entire
nucleotide sequence of a bacterial or fungal genome in a matter of hours, and the
sequence of an individual human genome in less than a day. Once the nucleotide
sequence of a genome is known, any individual gene can be easily isolated, and
large quantities of the gene product (be it RNA or protein) can be made either by
introducing the gene into bacteria or animal cells and coaxing these cells to
overexpress the foreign gene or by synthesizing the gene product in vitro. In this way,
proteins and RNA molecules that might be present in only tiny amounts in living
cells can be produced in large quantities for biochemical and structural analyses.
And this approach can also be used to produce large quantities of human proteins
(such as insulin, or interferon, or blood-clotting proteins) for use as human
pharmaceuticals. As we will see later in this chapter, it is also possible for scientists to
alter an isolated gene and transfer it back into the germ line of an animal or plant,
so as to become a functional and heritable part of the organism's genome. In this
way, the biological roles of any gene can be assessed by observing—in the whole
organism—the results of modifying it.

The ability to manipulate DNA with precision in a test tube or an organism,
known as recombinant DNA technology, has had a dramatic impact on all aspects
of cell and molecular biology, allowing us to routinely study cells and their
macromolecules in ways that were unimaginable even twenty years ago. Central to the
technology are the following manipulations:

1. Cleavage of DNA at specific sites by restriction nucleases, which greatly
facilitates the isolation and manipulation of individual pieces of a genome.

2. DNA ligation, which makes it possible to seamlessly join together DNA
molecules from widely different sources.

3. DNA cloning (through the use of either cloning vectors or the polymerase
chain reaction) in which a portion of a genome (often an individual gene)
is “purified” away from the remainder of the genome by repeatedly copying
it to generate many billions of identical molecules.

4. Nucleic acid hybridization, which makes it possible to identify any specific
sequence of DNA or RNA with great accuracy and sensitivity based on its
ability to selectively bind a complementary nucleic acid sequence.

5. DNA synthesis, which makes it possible to chemically synthesize DNA
molecules with any sequence of nucleotides, whether or not the sequence
occurs in nature.

6. Rapid determination of the sequence of nucleotides of any DNA or RNA
molecule.

In the following sections, we describe each of these basic techniques which,
together, have revolutionized the study of cell and molecular biology.

Restriction Nucleases Cut Large DNA Molecules into Specific
Fragments

Unlike a protein, a gene does not exist as a discrete entity in cells, but rather as a
small region of a much longer DNA molecule. Although the DNA molecules in a
cell can be randomly broken into small pieces by mechanical force, a fragment
containing a single gene in a mammalian genome would still be only one among
a hundred thousand or more DNA fragments, indistinguishable in their average
size. How could such a gene be separated from all the others? Because all DNA
molecules consist of an approximately equal mixture of the same four
nucleotides, they cannot be readily separated, as proteins can, on the basis of their
different charges and biochemical properties. The solution to this problem began
to emerge with the discovery of restriction nucleases. These enzymes, which are
purified from bacteria, cut the DNA double helix at specific sites defined by the
local nucleotide sequence, thereby cleaving a long, double-stranded DNA
molecule into fragments of strictly defined sizes.

Like many of the tools of recombinant DNA technology, restriction nucleases
were discovered by researchers trying to understand an intriguing biological
phenomenon. It had been observed that certain bacteria always degraded
“foreign” DNA that was introduced into them experimentally. A search for the
mechanism responsible revealed a then unanticipated class of bacterial nucleases that
cleave DNA at specific nucleotide sequences. The bacterium's own DNA is
protected from cleavage by methylation of these same sequences, thereby protecting
a bacterium's own genome from being overrun by foreign DNA. Because these
enzymes restrict the transfer of DNA into bacteria, they were called restriction
nucleases. The pursuit of this seemingly arcane biological puzzle set off the
development of technologies that have forever changed the way cell and molecular
biologists study living things.

Different bacterial species produce different restriction nucleases, each
cutting at a different, specific nucleotide sequence (Figure 8-24). Because these
target sequences are short—generally four to eight nucleotide pairs—many sites
of cleavage will occur, purely by chance, in any long DNA molecule. The reason
restriction nucleases are so useful in the laboratory is that each enzyme will
always cut a particular DNA molecule at the same sites. Thus for a given sample of
DNA (which contains many identical molecules), a particular restriction nuclease
will reliably generate the same set of DNA fragments.

The size of the resulting fragments depends on the length of the target
sequences of the restriction nucleases. As shown in Figure 8-24, the enzyme
HaeIII cuts at a sequence of four nucleotide pairs; a sequence this long would
be expected to occur purely by chance approximately once every 256 nucleotide
pairs (1 in 4⁴). In comparison, a restriction nuclease with a target sequence that
is eight nucleotides long would be expected to cleave DNA on average once every
65,536 nucleotide pairs (1 in 4⁸). This difference in sequence selectivity makes it
possible to cleave a long DNA molecule into the fragment sizes that are most
suitable for a given application.
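
The arithmetic above, and the idea that an enzyme always cuts a given molecule
at the same sites, can be sketched in a few lines of Python. The function names
and the test sequence are assumptions made for illustration. The recognition
sequences and cut positions for HaeIII, EcoRI, and HindIII are the standard
ones; NotI is not mentioned in the text and is included here only as an example
of an eight-nucleotide target. The expected spacing assumes that the four
nucleotides are equally frequent and randomly ordered, which real genomes only
approximate.

```python
# Recognition sequence (top strand, 5'->3') and the number of nucleotides
# before the cut on that strand, e.g. HaeIII cuts GG^CC and EcoRI cuts G^AATTC.
RESTRICTION_ENZYMES = {
    "HaeIII": ("GGCC", 2),
    "EcoRI": ("GAATTC", 1),
    "HindIII": ("AAGCTT", 1),
    "NotI": ("GCGGCCGC", 2),  # eight-nucleotide example, not from the text
}


def expected_spacing(site: str) -> int:
    """Average distance between chance occurrences of `site`, assuming
    four equally frequent, randomly ordered nucleotides."""
    return 4 ** len(site)


def find_sites(dna: str, site: str) -> list[int]:
    """Return the start index of every occurrence of `site` in `dna`."""
    positions = []
    start = dna.find(site)
    while start != -1:
        positions.append(start)
        start = dna.find(site, start + 1)
    return positions


def fragment_lengths(dna: str, enzyme: str) -> list[int]:
    """Lengths of the top-strand fragments produced by cutting a linear
    DNA molecule at every site for `enzyme`."""
    site, cut_offset = RESTRICTION_ENZYMES[enzyme]
    cuts = [p + cut_offset for p in find_sites(dna, site)]
    edges = [0] + cuts + [len(dna)]
    return [end - begin for begin, end in zip(edges, edges[1:])]


if __name__ == "__main__":
    for name, (site, _) in RESTRICTION_ENZYMES.items():
        print(f"{name:8s} {site:9s} about once every {expected_spacing(site):,} bp")
    print(fragment_lengths("AAGGCCTTGGCCGGCC", "HaeIII"))
```

Because the cut positions depend only on the sequence, running the same digest
on identical molecules always yields the same list of fragment lengths.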

Gel Electrophoresis Separates DNA Molecules of Different Sizes

The same types of gel-electrophoresis methods that have proved so useful in the
analysis of proteins (see Figure 8-13) can be applied to DNA molecules. The
procedure is actually simpler than for proteins: because each nucleotide in a nucleic
acid molecule already carries a single negative charge (on the phosphate group),
there is no need to add the negatively charged detergent SDS that is required to
make protein molecules move uniformly toward the positive electrode. Larger
DNA fragments will migrate more slowly because their progress is impeded to a
greater extent by the gel matrix. Over several hours, the DNA fragments become
spread out across the gel according to size, forming a ladder of discrete bands,
each composed of a collection of DNA molecules of identical length (Figure 8-25A
and B). To separate DNA molecules longer than 500 nucleotide pairs, the gel is
made of a diluted solution of agarose (a polysaccharide isolated from seaweed).
For DNA fragments less than 500 nucleotides long, specially designed
polyacrylamide gels allow the separation of molecules that differ in length by as little as a
single nucleotide (see Figure 8-25C).
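
Fragment sizes are usually read off a gel by comparison with marker fragments
of known size run alongside the sample. The sketch below shows one common way
to do this numerically. It rests on an assumption that is not stated in the
text: that, over the useful range of a gel, migration distance is roughly
linear in the logarithm of fragment length. The function name `estimate_size`
and the marker values are hypothetical.

```python
import math


def estimate_size(distance_mm: float, ladder: list[tuple[int, float]]) -> float:
    """Estimate a fragment's length (nucleotide pairs) from its migration
    distance by interpolating log10(size) between the two flanking markers.

    `ladder` is a list of (size_bp, distance_mm) pairs for the marker bands.
    """
    markers = sorted(ladder, key=lambda marker: marker[1])
    for (size_a, dist_a), (size_b, dist_b) in zip(markers, markers[1:]):
        if dist_a <= distance_mm <= dist_b:
            fraction = (distance_mm - dist_a) / (dist_b - dist_a)
            log_size = math.log10(size_a) + fraction * (
                math.log10(size_b) - math.log10(size_a)
            )
            return 10 ** log_size
    raise ValueError("band lies outside the range of the marker ladder")


if __name__ == "__main__":
    # Hypothetical markers: larger fragments travel a shorter distance.
    ladder = [(10_000, 10.0), (1_000, 30.0), (100, 50.0)]
    print(round(estimate_size(20.0, ladder)))
```

The estimate is only as good as the markers that bracket the band, which is one
reason bands outside the ladder are rejected rather than extrapolated.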


Figure 8-24 Restriction nucleases cleave DNA at specific nucleotide
sequences. Like the sequence-specific DNA-binding proteins we encountered in
Chapter 7 (see Figure 7-8), restriction enzymes often work as dimers, and the
DNA sequence they recognize and cleave is often symmetrical around a central
point. Here, both strands of the DNA double helix are cut at specific points
within the target sequence (orange). Some enzymes, such as HaeIII, cut straight
across the double helix and leave two blunt-ended DNA molecules; with others,
such as EcoRI and HindIII, the cuts on each strand are staggered. These
staggered cuts generate “sticky ends”—short, single-stranded overhangs that help
the cut DNA molecules join back together through complementary base-pairing.
This rejoining of DNA molecules becomes important for DNA cloning, as we
discuss below. Restriction nucleases are usually obtained from bacteria, and their
names reflect their origins: for example, the enzyme EcoRI comes from
Escherichia coli. There are currently hundreds of different restriction enzymes
available; they can be ordered from companies that commercially produce them.
A variation of agarose-gel electrophoresis, called pulsed-field gel
electrophoresis, makes it possible to separate extremely long DNA molecules, even those found
in whole chromosomes. Ordinary gel electrophoresis fails to separate very large
DNA molecules because the steady electric field stretches them out so that they
travel end-first through the gel in snakelike configurations at a rate that is
independent of their length. In pulsed-field gel electrophoresis, by contrast, the
direction of the electric field changes periodically, which forces the molecules to
reorient before continuing to move snakelike through the gel. This reorientation
takes much more time for larger molecules, so that longer molecules move more
slowly than shorter ones. As a consequence, entire bacterial or yeast
chromosomes separate into discrete bands in pulsed-field gels and so can be sorted and
identified on the basis of their size (Figure 8-25D). Although a typical mammalian
chromosome of 10⁸ nucleotide pairs is still too long to be sorted even in this way,
large segments of these chromosomes are readily separated and identified if the
chromosomal DNA is first cut with a restriction nuclease selected to recognize
sequences that occur only rarely.

The DNA bands on agarose or polyacrylamide gels are invisible unless the
DNA is labeled or stained in some way. A particularly sensitive method of staining
DNA is to soak the gel in the dye ethidium bromide, which fluoresces under
ultraviolet light when it is bound to DNA (see Figure 8-25B and D). Even more sensitive
detection methods incorporate a radioisotope or chemical marker into the DNA
molecules before electrophoresis, as we next describe.




Figure 8-25 DNA molecules can be separated by size using gel
electrophoresis. (A) Schematic illustration comparing the results of cutting the
same DNA molecule (in this case, the genome of a virus that infects wasps) with
two different restriction nucleases, EcoRI (middle) and HindIII (right). The
fragments are then separated by gel electrophoresis using a gel matrix of agarose.
Because larger fragments migrate more slowly than smaller ones, the lowermost
bands on the gel contain the smallest DNA fragments. The sizes of the fragments
can be estimated by comparing them to a set of DNA fragments of known sizes
(left). (B) Photograph of an actual agarose gel showing DNA “bands” that have
been stained with ethidium bromide. (C) A polyacrylamide gel with small pores
was used to separate short DNA molecules that differ by only a single nucleotide.
Shown here are the results of a dideoxy sequencing reaction, explained later in
this chapter. From left to right, the bands in the four lanes were produced by
adding G, A, T, and C chain-terminating nucleotides (see Panel 8-1). The DNA
molecules were labeled with ³²P, and the image shown was produced by laying a
piece of photographic film over the gel and allowing the ³²P to expose the film,
producing the dark bands observed when the film was developed. (D) The
technique of pulsed-field agarose-gel electrophoresis was used to separate the 16
different chromosomes of the yeast species Saccharomyces cerevisiae, which
range in size from 220,000 to 2.5 million nucleotide pairs. The DNA was stained
as in (B). DNA molecules as large as 10⁷ nucleotide pairs can be separated in this
way. (B, from U. Albrecht et al., J. Gen. Virol. 75: 3353-3363, 1994; C, courtesy
of Leander Lauffer and Peter Walter; D, from D. Vollrath and R.W. Davis,
Nucleic Acids Res. 15:7865-7876, 1987. With permission from Oxford University
Press.)
Purified DNA Molecules Can Be Specifically Labeled with
Radioisotopes or Chemical Markers in vitro

The DNA polymerases that synthesize and repair DNA (discussed in Chapter 5)
have become important tools in experimentally manipulating DNA. Because they
synthesize sequences complementary to an existing DNA molecule, they are often
used in the test tube to create exact copies of existing DNA molecules. The copies
can include specially modified nucleotides (Figure 8-26). To synthesize DNA in
this way, the DNA polymerase is presented with a template and a pool of
nucleotide precursors that contain the modification. As long as the polymerase can use
these precursors, it automatically makes new, modified molecules that match the
sequence of the template. Modified DNA molecules have many uses. DNA labeled
with the radioisotope ³²P can be detected following gel electrophoresis by placing
the gel next to a piece of photographic film (see Figure 8-25C). The ³²P atoms emit
β particles which expose the film, producing a visible record of every band on the
gel. Alternatively, the gel can be scanned by a detector that measures the β
emissions directly. Other types of modified DNA, such as that labeled by digoxigenin
(see Figure 8-26B), are useful for visualizing DNA molecules in whole cells, a topic
we discuss later in this chapter.

Figure 8-26 Methods for labeling DNA molecules in vitro. (A) A purified DNA
polymerase enzyme can incorporate radiolabeled nucleotides as it synthesizes
new DNA molecules. In this way, radiolabeled versions of any DNA sequence can
be prepared in the laboratory. (B) The method in (A) is also used to produce
nonradioactive DNA molecules that carry a specific chemical marker that can be
detected with an appropriate antibody. The base on the nucleoside triphosphate
shown is an analog of thymine, in which the methyl group on T has been replaced
by a spacer arm linked to the plant steroid digoxigenin. An anti-digoxigenin
antibody coupled to a visible marker such as a fluorescent dye is then used to
visualize the DNA. Other chemical labels, such as biotin, can be attached to
nucleotides and used in the same way. The only requirements are that the
modified nucleotides properly base-pair and appear “normal” to the DNA
polymerase.

Genes Can Be Cloned Using Bacteria

Any DNA fragment can be cloned. In molecular biology, the term DNA cloning
is used in two senses. It literally refers to the act of making many identical
copies (typically billions) of a DNA molecule—the amplification of a particular DNA
sequence. However, the term also describes the isolation of a particular stretch of
DNA (often a particular gene) from the rest of the cell’s genome; the same term
is used because this isolation is usually accomplished by making many identical
copies of only the DNA of interest. We note that elsewhere in the book, cloning,
particularly when used in the context of developmental biology, can also refer
to the generation of many genetically identical cells starting from a single cell or
even to the generation of genetically identical organisms (see, for example, Figure
7-2). In all cases, cloning refers to the act of making many identical copies, and
in this section, we use the term to refer to methods designed to generate many
identical copies of a defined segment of nucleic acid.

Figure 8-27 The insertion of a DNA fragment into a bacterial plasmid with
the enzyme DNA ligase. The plasmid is cut open with a restriction nuclease (in
this case, one that produces staggered ends) and is mixed with the DNA fragment
to be cloned (which has been prepared with the same restriction nuclease). DNA
ligase and ATP are added. The staggered ends base-pair, and DNA ligase seals the
nicks in the DNA backbone, producing a complete recombinant DNA molecule.
In the accompanying micrographs, the inserted DNA is colored red. (Micrographs
courtesy of Huntington Potter and David Dressler.)

Figure 8-28 DNA ligase can join together any two DNA fragments in
vitro to produce recombinant DNA molecules. ATP provides the energy
necessary to reseal the sugar-phosphate backbone of DNA (see Figure 5-12).
(A) DNA ligase can readily join two DNA fragments produced by the same
restriction nuclease, in this case EcoRI. Note that the staggered ends produced
by this enzyme enable the ends of the two fragments to base-pair correctly with
each other, greatly facilitating their rejoining. (B) DNA ligase can also be used to
join DNA fragments produced by different restriction nucleases—for example,
EcoRI and HaeIII. In this case, before the fragments undergo ligation, DNA
polymerase plus a mixture of deoxyribonucleoside triphosphates (dNTPs) are
used to fill in the staggered cut produced by EcoRI. Each DNA fragment shown in
the figure is oriented so that its 5' ends are at the left end of the upper strand
and the right end of the lower strand, as indicated.

Figure 8-29 A DNA fragment can be replicated inside a bacterial cell. To
clone a particular fragment of DNA, it is first inserted into a plasmid vector, as
shown in Figure 8-27. The resulting recombinant plasmid DNA is then introduced
into a bacterium, where it is replicated many millions of times as the bacterium
multiplies. For simplicity, the genome of the bacterial cell is not shown.

DNA cloning can be accomplished in several ways. One of the simplest
involves inserting a particular fragment of DNA into the purified DNA genome of
a self-replicating genetic element—usually a plasmid. The plasmid vectors most
widely used for gene cloning are small, circular molecules of double-stranded
DNA derived from plasmids that occur naturally in bacterial cells. They generally
account for only a minor fraction of the total host bacterial cell DNA, but owing
to their small size, they can easily be separated from the much larger
chromosomal DNA molecules, which precipitate as a pellet upon centrifugation. For use
as cloning vectors, the purified plasmid DNA circles are first cut with a
restriction nuclease to create linear DNA molecules. The DNA to be cloned is added to
the cut plasmid and then covalently joined using the enzyme DNA ligase (Figure
8-27 and Figure 8-28). As discussed in Chapter 5, this enzyme is used by the cell
to stitch together the Okazaki fragments produced during DNA replication. The
recombinant DNA circle is introduced back into bacterial cells that have been
made transiently permeable to DNA. As the cells grow and divide, doubling in
number every 30 minutes, the recombinant plasmids also replicate to produce an
enormous number of copies of DNA circles containing the foreign DNA (Figure
8-29). Once the cells are lysed and the plasmid DNA isolated, the cloned DNA
fragment can be readily recovered by cutting it out of the plasmid DNA with the
same restriction nuclease that was used to insert it, and then separating it from
the plasmid DNA by gel electrophoresis. Together, these steps allow the
amplification and purification of any segment of DNA from the genome of any organism.
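
The cut, ligate, and recover cycle described above can be mimicked on
sequence strings. The sketch below is a deliberately simplified model and not a
laboratory protocol: it tracks only the top strand, ignores the orientation of
the insert, and treats ligation as simple concatenation. The function names and
the short plasmid and insert sequences are hypothetical; the only fact taken as
given is that EcoRI cuts its site after the first G (G^AATTC).

```python
ECORI_SITE = "GAATTC"
CUT_OFFSET = 1  # EcoRI cuts the top strand after the G: G^AATTC


def cut_circular(circle: str) -> list[str]:
    """Cut a circular DNA molecule (written as a string whose end joins its
    start) at every EcoRI site and return the linear top-strand fragments."""
    n = len(circle)
    # Append the first few bases so that a site spanning the join is found.
    wrapped = circle + circle[: len(ECORI_SITE) - 1]
    cuts = [i + CUT_OFFSET for i in range(n) if wrapped.startswith(ECORI_SITE, i)]
    if not cuts:
        return [circle]
    # Rotate the circle so that it starts at the first cut.
    origin = cuts[0] % n
    rotated = circle[origin:] + circle[:origin]
    edges = sorted((c - cuts[0]) % n for c in cuts) + [n]
    return [rotated[a:b] for a, b in zip(edges, edges[1:])]


def ligate(opened_vector: str, insert: str) -> str:
    """Join an EcoRI-opened vector and an EcoRI-cut insert into one circle."""
    return opened_vector + insert


if __name__ == "__main__":
    plasmid = "TTGACAGAATTCCGTAAC"       # hypothetical vector with one EcoRI site
    insert = "AATTC" + "ATGGCTTAA" + "G"  # hypothetical fragment with EcoRI ends
    opened = cut_circular(plasmid)[0]     # linear vector with matching ends
    recombinant = ligate(opened, insert)
    print(cut_circular(recombinant))      # recutting releases insert and vector
```

Ligation regenerates an EcoRI site at each junction, which is why cutting the
recombinant circle with the same enzyme releases the insert intact.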

A particularly useful plasmid vector is based on the naturally occurring F
plasmid of E. coli. Unlike smaller bacterial plasmids, the F plasmid—and its
engineered derivative, the bacterial artificial chromosome (BAC)—is present in only
one or two copies per E. coli cell. The fact that BACs are kept in such low numbers
means that they can stably maintain very long DNA sequences, up to 1 million
nucleotide pairs in length. With only a few BACs present per bacterium, it is less
likely that the cloned DNA fragments will become scrambled by recombination
with sequences carried on other copies of the plasmid. Because of their stability,
ability to accept large DNA inserts, and ease of handling, BACs are now the
preferred vector for handling large fragments of foreign DNA. As we will see below,
BACs were instrumental in determining the complete nucleotide sequence of the
human genome.

An Entire Genome Can Be Represented in a DNA Library 

Often it is useful to break up a genome into much smaller fragments and clone 
every fragment, separately, using a plasmid vector. This approach is useful 
because it allows scientists to worlc with easily managed, discrète pièces of a 
genome instead of whole, unwieldy chromosomes. 

This strategy involves cleaving genomic DNA into small pieces using a 
restriction nuclease (or, in some cases, by mechanically shearing the DNA) and 
ligating the entire collection of DNA fragments into plasmid vectors, using conditions 
that favor the insertion of a single DNA fragment into each plasmid molecule. 
These recombinant plasmids are then introduced into E. coli at a concentration 
that ensures that no more than one plasmid molecule is taken up by each 
bacterium. The collection of cloned plasmid molecules is known as a DNA library. 
Because the DNA fragments were derived directly from the chromosomal DNA of 
the organism of interest, the resulting collection—called a genomic library—will 
represent the entire genome of that organism (Figure 8-30), spread out over tens 
of thousands of individual bacterial colonies. 
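
The fragmentation step described above can be illustrated with a short sketch. It assumes a toy DNA string and uses the restriction nuclease EcoRI, which recognizes GAATTC and cuts after the first G; the function names and the numbering of "colonies" are hypothetical and stand in for the one-fragment-per-bacterium outcome described in the text.

```python
import re

# EcoRI recognizes GAATTC and cuts between the G and the AATTC.
ECORI_SITE = "GAATTC"
CUT_OFFSET = 1  # the cut falls after the first base of the site


def digest(genome: str, site: str = ECORI_SITE, offset: int = CUT_OFFSET) -> list[str]:
    """Cut a linear DNA string at every occurrence of the recognition site."""
    cut_points = [m.start() + offset for m in re.finditer(site, genome)]
    bounds = [0] + cut_points + [len(genome)]
    return [genome[a:b] for a, b in zip(bounds, bounds[1:]) if b > a]


def build_library(genome: str) -> dict[int, str]:
    """Give each fragment its own (hypothetical) colony number."""
    return {colony: frag for colony, frag in enumerate(digest(genome), start=1)}


toy_genome = "ATGGAATTCCGTAGGAATTCTTAA"  # illustrative sequence only
library = build_library(toy_genome)
for colony, fragment in library.items():
    print(colony, fragment)
```

Joining the fragments in order restores the toy genome, which mirrors the statement that the library as a whole represents the entire genome. A real library does not preserve this order, which is why overlapping clones are needed to reconstruct it.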

An alternative strategy, one that enriches for protein-coding genes, is to 
begin the cloning process by selecting only those DNA sequences that are 
transcribed into mRNA and thus correspond to protein-encoding genes. This is done 
by extracting the mRNA from cells and then making a DNA copy of each mRNA 

Figure 8-30 Human genomic libraries containing DNA fragments 
that represent the whole human genome can be constructed using 
restriction nucleases and DNA ligase. Such a genomic library consists 
of a set of bacteria, each carrying a different fragment of human DNA. For 
simplicity, only the colored DNA fragments are shown in the library; in reality, 
all of the different gray fragments will also be represented. 
molecule present—a so-called complementary DNA, or cDNA. The copying 
reaction is catalyzed by the reverse transcriptase enzyme of retroviruses, which 
synthesizes a complementary DNA chain on an RNA template. The single-stranded 
cDNA molecules synthesized by the reverse transcriptase are converted by DNA 
polymerase into double-stranded cDNA molecules, and these molecules are 
inserted into a plasmid or virus vector and cloned (Figure 8-31). Each clone 
obtained in this way is called a cDNA clone, and the entire collection of clones 
derived from one mRNA preparation constitutes a cDNA library. 
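
The two copying steps just described, reverse transcription of the mRNA followed by second-strand synthesis, amount to taking complements of the sequence. The sketch below shows this on a made-up mRNA with a short poly-A tail; the function names are hypothetical, and the enzymology (primers, RNase H, incomplete 5' ends) is deliberately ignored.

```python
RNA_TO_DNA = {"A": "T", "U": "A", "G": "C", "C": "G"}
DNA_TO_DNA = {"A": "T", "T": "A", "G": "C", "C": "G"}


def first_strand_cdna(mrna: str) -> str:
    """DNA strand complementary to the mRNA, written 5' to 3'."""
    return "".join(RNA_TO_DNA[base] for base in reversed(mrna))


def second_strand_cdna(first_strand: str) -> str:
    """DNA strand complementary to the first cDNA strand, written 5' to 3'."""
    return "".join(DNA_TO_DNA[base] for base in reversed(first_strand))


mrna = "AUGGCCUAA" + "AAAAAA"      # illustrative coding region plus poly-A tail
first = first_strand_cdna(mrna)    # begins with a run of T opposite the poly-A tail
second = second_strand_cdna(first)
print(first)
print(second)
```

The second strand carries the same sequence as the mRNA, with T in place of U, which is why a cDNA clone contains the uninterrupted sequence of the mature transcript.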

Figure 8-32 illustrates some important differences between genomic DNA 
clones and cDNA clones. Genomic clones represent a random sample of all of 
the DNA sequences in an organism—both coding and noncoding—and, with very 
rare exceptions, are the same regardless of the cell type used to prepare them. By 
contrast, cDNA clones contain only those regions of the genome that have been 
transcribed into mRNA. Because the cells of different tissues produce distinct sets 
of mRNA molecules, a distinct cDNA library is obtained for each type of cell used 
to prepare the library. 
Figure 8-31 The synthesis of cDNA. Total mRNA is extracted from a particular tissue, and the 
enzyme reverse transcriptase (see Figure 5-62) is used to produce DNA copies (cDNA) of the mRNA 
molecules. For simplicity, the copying of just one of these mRNAs into cDNA is illustrated. A short 
oligonucleotide complementary to the poly-A tail at the 3' end of the mRNA (discussed in Chapter 
6) is first hybridized to the RNA to act as a primer for the reverse transcriptase, which then copies 
the RNA into a complementary DNA chain, thereby forming a DNA-RNA hybrid helix. Treating the 
DNA-RNA hybrid with a specialized nuclease (RNase H) that attacks only the RNA produces nicks 
and gaps in the RNA strand. DNA polymerase then copies the remaining single-stranded cDNA 
into double-stranded cDNA. Because DNA polymerase can synthesize through the bound RNA 
molecules, the RNA fragment that is base-paired to the 3' end of the first DNA strand usually acts 
as the primer for the second strand synthesis, as shown. Any remaining RNA is eventually degraded 
during subsequent cloning steps. As a result, the nucleotide sequences at the extreme 5' ends of 
the original mRNA molecules are often absent from cDNA libraries. 
Figure 8-32 The differences between 
cDNA clones and genomic DNA clones 
derived from the same region of DNA. 

In this example, gene A is infrequently 
transcribed, whereas gene B is frequently 
transcribed, and both genes contain introns 
(orange). In the genomic DNA library, both 
the introns and the nontranscribed DNA 
(gray) are included in the clones, and most 
clones contain, at most, only part of the 
coding sequence of a gene (red). In the 
cDNA clones, the intron sequences (yellow) 
have been removed by RNA splicing during 
the formation of the mRNA (blue), and a 
continuous coding sequence is therefore 
present in each clone. Because gene B 
is transcribed more frequently than gene 
A in the cells from which the cDNA library 
was made, it is represented much more 
frequently than A in the cDNA library. In 
contrast, A and B are represented equally 
in the genomic DNA library. 


Genomic and cDNA Libraries Have Different Advantages and 
Drawbacks 

Genomic libraries are especially useful in determining the nucleotide sequences 
of a whole genome. For example, to determine the nucleotide sequence of the 
human genome, it was broken up into roughly 100,000-nucleotide-pair pieces, 
each of which was inserted into a BAC plasmid and amplified in E. coli. The 
resulting genomic library consisted of tens of thousands of bacterial colonies, each 
containing a different human DNA insert. The nucleotide sequence of each insert 
was determined separately and the sequence of the entire genome was stitched 
together from the pieces. 

The most important advantage of cDNA clones, over genomic clones, is that 
they contain the uninterrupted coding sequence of a gene. When the aim of the 
cloning, for example, is to produce the protein in large quantities by expressing 
the cloned gene in a bacterial or yeast cell, it is preferable to start with cDNA. 

Genomic and cDNA libraries are inexhaustible resources, which are widely 
shared among investigators. Today, many such libraries are also available from 
commercial sources. Because the identity of each insert in a library is often known 
(through sequencing the insert), it is often possible to order a particular region of 
a chromosome (or, in the case of cDNA, a complete, intron-less protein-coding 
gene) and have it delivered by mail. 

Cloning DNA by using bacteria revolutionized the study of genomes and is still 
in wide use today. However, there is an even simpler way to clone DNA, one that 
can be carried out entirely in vitro. We discuss this approach, called the polymerase 
chain reaction, below. First, however, we need to review a fundamental, 
far-reaching property of DNA and RNA called hybridization. 
Hybridization Provides a Powerful, But Simple Way to Detect 
Specific Nucleotide Sequences 

Under normal conditions, the two strands of a DNA double helix are held together 
by hydrogen bonds between the complementary base pairs (see Figure 4-3). But 
these relatively weak, noncovalent bonds can be fairly easily broken. Such DNA 
denaturation will release the two strands from each other, but does not break the 
covalent bonds that link together the nucleotides within each strand. Perhaps 
the simplest way to achieve this separation involves heating the DNA to around 
90°C. When the conditions are reversed—by slowly lowering the temperature— 
the complementary strands will readily come back together to re-form a double 
helix. This hybridization, or DNA renaturation, is driven by the re-formation of 
the hydrogen bonds between complementary base pairs (Figure 8-33). We saw 
in Chapter 5 that DNA hybridization underlies the crucial process of homologous 
recombination (see Figure 5-47). 

This fundamental capacity of a single-stranded nucleic acid molecule, either 
DNA or RNA, to form a double helix with a single-stranded molecule of a 
complementary sequence provides a powerful and sensitive technique for detecting 
specific nucleotide sequences. Today, one simply designs a short, single-stranded 
DNA molecule (often called a DNA probe) that is complementary to the 
nucleotide sequence of interest. Because the nucleotide sequences of so many genomes 
are known—and are stored in publicly accessible databases—designing a probe 
to hybridize anywhere in a genome is straightforward. Probes are single-stranded, 
typically 30 nucleotides in length, and are usually synthesized chemically by a 
commercial service for pennies per nucleotide. A DNA sequence of 30 nucleotides 
will occur by chance only once every 1 × 10^18 nucleotides (4^30); so, even in the 
human genome of 3 × 10^9 nucleotide pairs, a DNA probe designed to match a 
particular 30-nucleotide sequence will be highly unlikely to hybridize—by chance— 
anywhere else on the genome. This, of course, presumes that the sequence 
complementary to the probe does not occur multiple times in the genome, a condition 
that can be checked beforehand by scanning the genomic sequence in silico (using 
a computer) and designing probes that match only one spot. The hybridization 
conditions can be set so that even a single mismatch will prevent hybridization 
to “near-miss” sequences. The exquisite specificity of nucleic acid hybridization 
can be easily appreciated by the in situ (Latin for “in place”) hybridization 
experiment shown in Figure 8-34. As we will see throughout this chapter, nucleic acid 

Figure 8-34 In situ hybridization can be used to locate genes on 
isolated chromosomes. Here, six different DNA probes have been used 
to mark the locations of their complementary nucleotide sequences on 
human Chromosome 5, isolated from a mitotic cell in metaphase (see 
Figure 4-59 and Panel 17-1, pp. 980-981). The DNA probes have been 
labeled with different chemical groups (see Figure 8-26B) and are detected 
using fluorescent antibodies specific for those groups. The chromosomal 
DNA has been partially denatured to allow the probes to base-pair with 
their complementary sequences. Both the maternal and paternal copies of 
Chromosome 5 are shown, aligned side by side. Each probe produces two 
dots on each chromosome because chromosomes undergoing mitosis have 
already replicated their DNA; therefore, each chromosome contains two 
identical DNA helices. The technique employed here is nicknamed FISH, for 
fluorescence in situ hybridization. (Courtesy of David C. Ward.) 


Figure 8-33 A molecule of DNA can 
undergo denaturation and renaturation 
(hybridization). For two single-stranded 
molecules to hybridize, they must have 
complementary nucleotide sequences that 
allow base-pairing. In this example, the red 
and orange strands are complementary to 
each other, and the blue and green strands 
are complementary to each other. Although 
denaturation by heating is shown, DNA can 
also be renatured after being denatured by 
alkali treatment. 
hybridization has many uses in modern cell and molecular biology; one of the 
most powerful is in the cloning of DNA by the polymerase chain reaction, as we 
next discuss. 
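
The probe-specificity estimate given in this section can be reproduced in a few lines, along with a toy version of the in silico uniqueness check. This is a sketch under simplifying assumptions: the genome is treated as a random sequence for the chance estimate, only exact matches are counted, and the sequences and function names are illustrative rather than taken from any real tool.

```python
PROBE_LENGTH = 30
HUMAN_GENOME_SIZE = 3 * 10**9          # nucleotide pairs, as quoted in the text

# Number of distinct 30-nucleotide sequences.
possible_sequences = 4 ** PROBE_LENGTH

# Expected chance matches if the genome were random, counting both strands.
expected_chance_sites = 2 * HUMAN_GENOME_SIZE / possible_sequences

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    return dna.translate(COMPLEMENT)[::-1]


def count_overlapping(text: str, pattern: str) -> int:
    """Count occurrences of pattern in text, allowing overlaps."""
    count, start = 0, text.find(pattern)
    while start != -1:
        count += 1
        start = text.find(pattern, start + 1)
    return count


def count_probe_sites(top_strand: str, probe: str) -> int:
    """Count loci where the probe can base-pair with either strand of a duplex."""
    target = reverse_complement(probe)           # site on the strand given
    sites = count_overlapping(top_strand, target)
    if probe != target:                          # site on the opposite strand
        sites += count_overlapping(top_strand, probe)
    return sites


print(f"{possible_sequences:.2e} possible probes")
print(f"{expected_chance_sites:.1e} expected chance sites in the human genome")
print(count_probe_sites("TTGACCATGGTCAA", "GACCA"))
```

A probe would be accepted only if the count for its intended genome is exactly one. Real genomes are far from random, which is why the explicit scan matters more than the probabilistic estimate.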


Genes Can Be Cloned in vitro Using PCR 

Genomic and cDNA libraries were once the only route to cloning genes and they 
are still used for cloning very large genes and for sequencing whole genomes. 
However, a powerful and versatile method for amplifying DNA, known as the 
polymerase chain reaction (PCR), provides a more rapid and straightforward 
approach to DNA cloning, particularly in organisms whose complete genome 
sequence is known. Today, since genome sequences are abundant, most cloning 
is carried out by PCR. 

Invented in the 1980s, PCR revolutionized the way that DNA and RNA are 
analyzed. The technique can amplify any nucleotide sequence selectively and is 
performed entirely in a test tube. Eliminating the need for bacteria makes PCR 
convenient and rapid—billions of copies of a nucleotide sequence can be generated in a 
matter of hours. Starting with an entire genome, PCR allows DNA from a specified 
region—selected by the experimenter—to be greatly amplified, effectively 
“purifying” this DNA away from the remainder of the genome, which remains 
unamplified. Because of its power to greatly amplify nucleic acids, PCR is remarkably 
sensitive: the method can be used to detect the trace amounts of DNA in a drop of 
blood left at a crime scene or a few copies of a viral genome in a patient's blood 
sample. 

The success of PCR depends both on the selectivity of DNA hybridization 
and on the ability of DNA polymerase to copy a DNA template faithfully through 
repeated rounds of replication in vitro. As discussed in Chapter 5, this enzyme 
adds nucleotides to the 3' end of a growing strand of DNA (see Figure 5-4). To copy 
DNA, the polymerase requires a primer—a short nucleotide sequence that 
provides a 3' end from which synthesis can begin. For PCR, the primers are designed 
by the experimenter, synthesized chemically, and, by hybridizing to genomic 
DNA, “tell” the polymerase which part of the genome to copy. As discussed in the 
previous section, DNA primers (in essence, the same type of molecules as DNA 
probes but without a radioactive or fluorescent label) can be designed to uniquely 
locate any position on a genome. 
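
How a pair of primers picks out one region of a template can be shown with a minimal "in silico PCR" sketch. It assumes exact primer matches on a single linear template and uses invented sequences; the function name in_silico_pcr is hypothetical. The forward primer matches the strand as written, while the reverse primer matches the opposite strand, so its reverse complement is what appears in the written sequence.

```python
from typing import Optional

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    return dna.translate(COMPLEMENT)[::-1]


def in_silico_pcr(template: str, forward_primer: str, reverse_primer: str) -> Optional[str]:
    """Return the segment bracketed by (and including) the primers, or None."""
    start = template.find(forward_primer)
    if start == -1:
        return None
    # The reverse primer pairs with the written strand at this site.
    reverse_site = reverse_complement(reverse_primer)
    end = template.find(reverse_site, start + len(forward_primer))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


template = "GGGATCCATGAAACCCGGGTTTTAGCTCGAGAAA"   # illustrative sequence only
amplicon = in_silico_pcr(template, "ATGAAA", "CTAAAA")
print(amplicon)
```

Only the sequence between the two primer sites is returned; the flanking template is left out, which is the computational counterpart of the rest of the genome remaining unamplified.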

PCR is an iterative process in which the cycle of amplification is repeated 
dozens of times. At the start of each cycle, the two strands of the double-stranded 
DNA template are separated and a different primer is annealed to each. These 
primers mark the right and left boundaries of the DNA to be amplified. DNA 
polymerase is then allowed to replicate each strand independently (Figure 8-35). In 
Figure 8-35 A pair of primers directs the synthesis of a desired segment of DNA in a test tube. Each cycle of PCR includes three steps: 
(1) The double-stranded DNA is heated briefly to separate the two strands. (2) The DNA is exposed to a large excess of a pair of specific primers— 
designed to bracket the region of DNA to be amplified—and the sample is cooled to allow the primers to hybridize to complementary sequences 
in the two DNA strands. (3) This mixture is incubated with DNA polymerase and the four deoxyribonucleoside triphosphates so that DNA can be 
synthesized, starting from the two primers. To amplify the DNA, the cycle is repeated many times by reheating the sample to separate the newly 
synthesized DNA strands (see Figure 8-36). 

The technique depends on the use of a special DNA polymerase isolated from a thermophilic bacterium; this polymerase is stable at much higher 
temperatures than eukaryotic DNA polymerases, so it is not denatured by the heat treatment shown in step 1. The enzyme therefore does not have 
to be added again after each cycle. 
Figure 8-36 PCR uses repeated rounds of strand separation, hybridization, and synthesis to amplify DNA. As the 
procedure outlined in Figure 8-35 is repeated, all the newly synthesized fragments serve as templates in their turn. Because the 
polymerase and the primers remain in the sample after the first cycle, PCR involves simply heating and then cooling the same 
sample, in the same test tube, again and again. Each cycle doubles the amount of DNA synthesized in the previous cycle, so 
that within a few cycles, the predominant DNA is identical to the sequence bracketed by and including the two primers in the 
original template. In the example illustrated here, three cycles of reaction produce 16 DNA chains, 8 of which (boxed in yellow) 
correspond exactly to one or the other strand of the original bracketed sequence. After four more cycles, 240 of the 256 DNA 
chains will correspond exactly to the original sequence, and after several more cycles, essentially all of the DNA strands will be 
this length. Typically, 20-30 cycles are carried out to effectively clone a region of DNA starting from genomic DNA; the rest of the 
genome remains unamplified, and its concentration is therefore negligible compared with that of the amplified region (Movie 8.2). 


subsequent cycles, all the newly synthesized DNA molecules produced by the 
polymerase serve as templates for the next round of replication (Figure 8-36). 
Through this iterative amplification process, many copies of the original sequence 
can be made—billions after about 20 to 30 cycles. 
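
The strand counts quoted in the legend of Figure 8-36 follow from simple bookkeeping, sketched below for an idealized reaction in which every strand is copied in every cycle. The function name and dictionary keys are hypothetical. Each original strand gives rise to one new "long" strand per cycle (defined by a primer at one end only), and every other new strand has exactly the length bracketed by the two primers.

```python
def pcr_strand_counts(cycles: int, starting_duplexes: int = 1) -> dict[str, int]:
    """Idealized single-strand counts after a given number of PCR cycles."""
    original = 2 * starting_duplexes          # strands of the starting template
    total = original * 2 ** cycles            # perfect doubling each cycle
    long_products = original * cycles         # copied directly from the originals
    exact_length = total - original - long_products
    return {
        "total": total,
        "original": original,
        "long": long_products,
        "exact": exact_length,
    }


for n in (3, 7, 20, 30):
    counts = pcr_strand_counts(n)
    print(n, counts["total"], counts["exact"])
```

Three cycles give 16 chains with 8 of exact length, and seven cycles give 256 with 240 of exact length, in agreement with the figure legend. Thirty cycles starting from a single duplex give about 2 billion strands; real reactions fall short of perfect doubling as primers and nucleotides are consumed.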

PCR is now the method of choice for cloning relatively short DNA fragments 
(say, under 10,000 nucleotide pairs). Each cycle takes only about five minutes, 
and automation of the whole procedure enables cell-free cloning of a DNA 
fragment in a few hours. The original template for PCR can be either DNA or RNA, so 
this method can be used to obtain either a genomic clone (complete with introns 
and exons) or a cDNA copy of an mRNA (Figure 8-37). 

PCR Is Also Used for Diagnostic and Forensic Applications 

The PCR method is extraordinarily sensitive; it can detect a single DNA 
molecule in a sample if at least part of the sequence of that molecule is known. Trace 
amounts of RNA can be analyzed in the same way by first transcribing them into 
DNA with reverse transcriptase. For these reasons, PCR is frequently employed for 
uses that go beyond simple cloning. For example, it can be used to detect invading 
pathogens at very early stages of infection. In this case, short sequences 
complementary to a segment of the infectious agent's genome are used as primers and 
following many cycles of amplification, even a few copies of an invading bacterial 
or viral genome in a patient's sample can be detected (Figure 8-38). For many 
infections, PCR has replaced the use of antibodies against microbial molecules to 
Figure 8-37 PCR can be used to 
obtain either genomic or cDNA clones. 

(A) To use PCR to clone a segment of 
chromosomal DNA, total genomic DNA 
is first purified from cells. PCR primers 
that flank the stretch of DNA to be cloned 
are added, and many cycles of PCR are 
completed (see Figure 8-36). Because 
only the DNA between (and including) the 
primers is amplified, PCR provides a way 
to obtain selectively any short stretch of 
chromosomal DNA in an effectively pure 
form. (B) To use PCR to obtain a cDNA 
clone of a gene, total mRNA is first purified 
from cells. The first primer is added to 
the population of mRNAs, and reverse 
transcriptase is used to make a DNA 
strand complementary to the specific RNA 
sequence of interest. The second primer 
is then added, and the DNA molecule is 
amplified through many cycles of PCR. 


detect the presence of the invader. It is also used to verify the authenticity of a food 
source—for example, whether a sample of beef actually came from a cow. 

Finally, PCR is now widely used in forensics. The method's extreme sensitivity 
allows forensic investigators to isolate DNA from minute traces of human blood or 
other tissue to obtain a DNA fingerprint of the person who left the sample behind. 
Figure 8-38 PCR can be used to detect the presence of a viral genome in a sample of blood. 

Because of its ability to amplify enormously the signal from a single molecule of nucleic acid, PCR 
is an extraordinarily sensitive method for detecting trace amounts of virus in a sample of blood or 
tissue, without the need to purify the virus. For HIV, the virus that causes AIDS, the genome is a 
single-stranded molecule of RNA, as illustrated here. In addition to HIV, many other viruses that 
infect humans are now detected in this way. 
Figure 8-39 PCR is used in forensic science to distinguish one individual from another. The DNA sequences analyzed are short tandem 
repeats (STRs) composed of sequences such as CACACA... or GTGTGT... STRs are found in various positions (loci) in the human genome. The 
number of repeats in each STR locus is highly variable in the population, ranging from 4 to 40 in different individuals. Because of the variability in 
these sequences, individuals will usually inherit a different number of repeats at each STR locus from their mother and from their father; two unrelated 
individuals, therefore, rarely contain the same pair of sequences at a given STR locus. (A) PCR using primers that recognize unique sequences on 
either side of one particular STR locus produces a pair of bands of amplified DNA from each individual, one band representing the maternal STR 
variant and the other representing the paternal STR variant. The length of the amplified DNA, and thus its position after gel electrophoresis, will 
depend on the exact number of repeats at the locus. (B) In the schematic example shown here, the same three STR loci are analyzed in samples 
from three suspects (individuals A, B, and C), producing six bands for each individual. Although different people can have several bands in common, 
the overall pattern is quite distinctive for each person. The band pattern can therefore serve as a DNA fingerprint to identify an individual nearly 
uniquely. The fourth lane (F) contains the products of the same PCR amplifications carried out on a hypothetical forensic DNA sample, which could 
have been obtained from a single hair or a tiny spot of blood left at a crime scene. 

The more loci that are examined, the more confident one can be about the results. When examining the variability at 5-10 different STR loci, the 
odds that two random individuals would share the same fingerprint by chance are approximately one in 10 billion. In the case shown here, individuals 
A and C can be eliminated from inquiries, while B is a clear suspect. A similar approach is used routinely in paternity testing. 
With the possible exception of identical twins, the genome of each human differs 
in DNA sequence from that of every other person on Earth. Using primer pairs 
targeted at genome sequences that are known to be highly variable in the human 
population, PCR makes it possible to generate a distinctive DNA fingerprint for 
any individual (Figure 8-39). Such forensic analyses can be used not only to help 
identify those who have done wrong, but also—equally important—to exonerate 
those who have been wrongfully accused. 
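
The comparison of band patterns described for Figure 8-39 can be expressed as a small matching routine. Everything in this sketch is hypothetical: the locus names, the repeat counts, and the genotype frequency of 0.1 per locus are invented for illustration, and multiplying frequencies across loci assumes that the loci are inherited independently.

```python
from math import prod

# Hypothetical (maternal, paternal) repeat counts at three hypothetical STR loci.
suspects = {
    "A": {"locus1": (7, 12), "locus2": (9, 11), "locus3": (15, 21)},
    "B": {"locus1": (8, 14), "locus2": (10, 13), "locus3": (15, 18)},
    "C": {"locus1": (7, 14), "locus2": (9, 13), "locus3": (17, 21)},
}
forensic_sample = {"locus1": (14, 8), "locus2": (13, 10), "locus3": (18, 15)}


def normalize(profile: dict) -> dict:
    """A gel shows band sizes only, not which parent each band came from."""
    return {locus: tuple(sorted(alleles)) for locus, alleles in profile.items()}


def matching_individuals(sample: dict, candidates: dict) -> list[str]:
    target = normalize(sample)
    return [name for name, profile in candidates.items() if normalize(profile) == target]


def random_match_probability(genotype_frequencies: list[float]) -> float:
    """Chance that an unrelated person shares the genotype at every locus."""
    return prod(genotype_frequencies)


print(matching_individuals(forensic_sample, suspects))
print(random_match_probability([0.1] * 10))
```

With the assumed frequency of one in ten per locus, ten loci give a match probability of about one in 10 billion, the order of magnitude quoted in the figure legend. Actual genotype frequencies must be taken from population data.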

Both DNA and RNA Can Be Rapidly Sequenced 

Most current methods of manipulating DNA, RNA, and proteins rely on prior 
knowledge of the nucleotide sequence of the genome of interest. But how were 
these sequences determined in the first place? And how are new DNA and RNA 
molecules sequenced today? In the late 1970s, researchers developed several 
strategies for determining, simply and quickly, the nucleotide sequence of any 
purified DNA fragment. The one that became the most widely used is called 
dideoxy sequencing or Sanger sequencing (Panel 8-1). This method was used to 
determine the nucleotide sequence of many genomes, including those of E. coli, 
fruit flies, nematode worms, mice, and humans. Today, cheaper and faster 
methods are routinely used to sequence DNA, and even more efficient strategies are 
being developed (see Panel 8-1). The original “reference” sequence of the human 
genome, completed in 2003, cost over $1 billion and required many scientists from 
around the world working together for 13 years. The enormous progress made in 
the past decade makes it possible for a single person to complete the sequence of 
an individual human genome in less than a day. 

The methods summarized in Panel 8-1 for rapidly sequencing DNA can also 
be applied to RNA. Although methods are being developed to sequence RNA 
directly, it is most commonly carried out by converting the RNA to 
complementary DNA (using reverse transcriptase) and using one of the methods described for 
DNA sequencing. It is important to keep in mind that although genomes remain 
the same from cell to cell and from tissue to tissue, the RNA produced from the 
genome can vary enormously. We will see later in this chapter that sequencing 
the entire repertoire of RNA from a cell or tissue (known as deep RNA 
sequencing, or RNA-seq) is a powerful way to understand how the information present in 
the genome is used by different cells under different circumstances. In the next 
section, we shall see how RNA-seq has also become a valuable tool for annotating 
genomes. 

To Be Useful, Genome Sequences Must Be Annotated 

Long strings of nucleotides, at first glance, reveal nothing about how this genetic 
information directs the development of a living organism—or even what types 
of DNA, protein, and RNA molecules are produced by a genome. The process of 
genome annotation attempts to mark out all the genes (both protein-coding and 
noncoding) in a genome and ascribe a role to each. It also seeks to understand 
more subtle types of genome information, such as the cis-regulatory sequences 
that specify the time and place that a given gene is expressed and whether its 
mRNA undergoes alternative splicing to produce different protein isotypes. 
Clearly, this is a daunting task, and we are far short of completing it for any form of 
life, even the simplest bacterium. For many organisms, we know the approximate 
number of genes, and, for very simple organisms, we understand the functions of 
about half their genes. 

In this section, we discuss broadly how genes are identified in genome 
sequences and what clues we can discern about their roles from simply 
inspecting their sequences. Later in the chapter, we turn to the more difficult problem of 
experimentally determining gene function. 

How does one begin to make sense of a genome sequence? The first step is 
usually to translate in silico the entire genome into protein. There are six 
different reading frames for any piece of double-stranded DNA (three on each strand). 
We saw in Chapter 6 that a random sequence of nucleotides, read in frame, will 
PANEL 8-1: DNA Sequencing Methods 

MANUAL DIDEOXY SEQUENCING 

To determine the complete sequence of a 
single-stranded fragment of DNA (gray), the DNA 
is first hybridized with a short DNA primer (orange) 
that is labeled with a fluorescent dye or 
radioisotope. DNA polymerase and an excess of all 
four normal deoxyribonucleoside triphosphates 
(blue A, C, G, or T) are added to the primed DNA, 
which is then divided into four reaction tubes. 

Each of these tubes receives a small amount of a 
single chain-terminating dideoxyribonucleoside 
triphosphate (red A, C, G, or T). Because these will 
be incorporated only occasionally, each reaction 
produces a set of DNA copies that terminate at 
different points in the sequence. The products of 
these four reactions are separated by 
electrophoresis in four parallel lanes of a 
polyacrylamide gel (labeled here A, T, C, and G). In 
each lane, the bands represent fragments that 
have terminated at a given nucleotide but at 
different positions in the DNA. By reading off the 
bands in order, starting at the bottom of the gel 
and reading across all lanes, the DNA sequence of 
the newly synthesized strand can be determined 
(see Figure 8-25C). The sequence, which is given in 
the green arrow to the right of the gel, is 
complementary to the sequence of the original 
gray single-stranded DNA. 
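
The logic of reading a dideoxy gel can be mimicked in a few lines. This is a simplified sketch with hypothetical function names: it assumes that termination occurs at every possible position, measures fragment length from the first nucleotide added after the primer, and uses an invented template.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    return dna.translate(COMPLEMENT)[::-1]


def dideoxy_lanes(template: str) -> dict[str, list[int]]:
    """Fragment lengths expected in each of the four lanes (A, T, C, G)."""
    new_strand = reverse_complement(template)    # strand made by the polymerase
    lanes: dict[str, list[int]] = {base: [] for base in "ATCG"}
    for length, base in enumerate(new_strand, start=1):
        # A fragment of this length ends with the dideoxy form of `base`.
        lanes[base].append(length)
    return lanes


def read_gel(lanes: dict[str, list[int]]) -> str:
    """Read bands from shortest (bottom of gel) to longest across all lanes."""
    base_at_length = {n: base for base, lengths in lanes.items() for n in lengths}
    return "".join(base_at_length[n] for n in sorted(base_at_length))


template = "TACGGATC"                 # illustrative single-stranded template
lanes = dideoxy_lanes(template)
print(lanes)
print(read_gel(lanes))                # sequence of the newly synthesized strand
```

The sequence read from the gel is that of the new strand, so it is the reverse complement of the template, as stated at the end of the panel text.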


AUTOMATED DIDEOXY SEQUENCING 
Fully automated machines can run dideoxy sequencing reactions. 
(A) The automated method uses an excess amount of normal 
dNTPs plus a mixture of four different chain-terminating ddNTPs, 
each of which is labeled with a fluorescent tag of a different color. 
The reaction products are loaded onto a long, thin capillary gel 
and separated by electrophoresis. A camera (not shown) reads the 
color of each band as it moves through the gel and feeds the data 
to a computer that assembles the sequence. (B) A tiny part of the 
data from such an automated sequencing run. Each colored peak 
represents a nucleotide in the DNA sequence. 

SEQUENCING WHOLE GENOMES 


Shotgun sequencing: To determine the nucleotide sequence of a whole genome, 
the genomic DNA is first fragmented into small pieces and a genomic library is 
constructed, typically using plasmids and bacteria (see Figure 8-30). In shotgun 
sequencing, the nucleotide sequence of tens of thousands of individual clones is 
determined; the full genome sequence is then reconstructed by stitching together 
(in silico) the nucleotide sequence of each clone, using the overlaps between clones 
as a guide. The shotgun method works well for small genomes (such as those of 
viruses and bacteria) that lack repetitive DNA. 
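
Stitching reads together by their overlaps can be illustrated with a toy greedy assembler. This is a sketch of one simple strategy, not the method used by any particular sequencing project: it assumes error-free reads from one strand, a made-up sequence with no repeats, and a hypothetical minimum overlap of three nucleotides.

```python
def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of a that is also a prefix of b."""
    for size in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:size]):
            return size
    return 0


def greedy_assemble(reads: list[str], min_len: int = 3) -> list[str]:
    """Repeatedly merge the pair of sequences with the largest overlap."""
    contigs = list(dict.fromkeys(reads))          # drop exact duplicate reads
    while len(contigs) > 1:
        best_size, best_i, best_j = 0, -1, -1
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    size = overlap(a, b, min_len)
                    if size > best_size:
                        best_size, best_i, best_j = size, i, j
        if best_size == 0:
            break                                  # no overlaps left to use
        merged = contigs[best_i] + contigs[best_j][best_size:]
        contigs = [c for k, c in enumerate(contigs) if k not in (best_i, best_j)]
        contigs.append(merged)
    return contigs


genome = "ATGGCGTACCTTAGGACT"                     # illustrative sequence only
reads = ["TTAGGACT", "ATGGCGTA", "GTACCTTA"]      # overlapping pieces, shuffled
print(greedy_assemble(reads))
```

For this repeat-free example the reads collapse into a single sequence identical to the starting one. If the same stretch occurred at several places, more than one merge would look equally good, which is the difficulty that repetitive DNA poses for the shotgun method.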


BAC clones: Most plant and animal genomes are large 
(often over 10^9 nucleotide pairs) and contain extensive 
amounts of repetitive DNA spread throughout the 
genome. Because a nucleotide sequence of a fragment of 
repetitive DNA will "overlap" every instance of the 
repeated DNA, it is difficult, if not impossible, to assemble 
the fragments into a unique order solely by the shotgun 
method. 
To circumvent this problem, the human genome was first 
broken down into very large DNA fragments (each 
approximately 100,000 nucleotide pairs) and cloned into 
BACs (see p. 469). The order of the BACs along a 
chromosome was determined by comparing the pattern of 
restriction enzyme cleavage sites in a given BAC clone with 
that of the whole genome. In this way, a given BAC clone 
can be mapped, say, to the left arm of human 
Chromosome 3. Once a collection of BAC clones was 
obtained that spanned the entire genome, each individual BAC 
was sequenced by the shotgun method. At the end, the 
sequences of all the BAC inserts were stitched together using the 
knowledge of the position of each BAC insert in the human 
genome. In all, approximately 30,000 BAC clones were sequenced 
to complete the human genome. 
Thousands of genomes from 
individual humans have now been 
sequenced and it is not necessary to 
painstakingly reconstruct the order of 
DNA sequence "reads" each time; 
they are simply assembled using the 
order determined from the original 
human genome sequencing project. 
For this reason, resequencing, the 
term applied when the genome of a 
species is sequenced again (even 
though it may be from a different 
individual), is far easier than the 
original sequencing. 


SECOND-GENERATION SEQUENCING TECHNOLOGIES 

The dideoxy method made it possible to sequence the genomes 
of humans and most of the other organisms discussed in this 
book. But newer methods, developed since 2005, have made 
genome sequencing even more rapid—and very much cheaper. 

With these so-called second-generation sequencing methods, 
the cost of sequencing DNA has decreased dramatically. Not 
surprisingly, the number of genomes that have been sequenced 
has increased enormously. These rapid methods allow multiple 
genomes to be sequenced in parallel in a matter of weeks, 
enabling investigators to examine thousands of individual 
human genomes, catalog the variation in nucleotide sequences 
from people around the world, and uncover the mutations that 
increase the risk of various diseases, from cancer to autism. 

These methods have also made it possible to determine the 
genome sequence of extinct species, including Neanderthal man 
and the woolly mammoth. By sequencing genomes from many 
closely related species, they have also made it possible to 
understand the molecular basis of key evolutionary events in the 
tree of life, such as the "inventions" of multicellularity, vision, 
and language. The ability to rapidly sequence DNA has had 
major impacts on all branches of biology and medicine; it is 
almost impossible to imagine where we would be without it. 

ILLUMINA® SEQUENCING 

Several second-generation sequencing methods are now in wide 
use, and we will discuss two of the most common. Both rely on 
the construction of libraries of DNA fragments that 
represent—in toto—the DNA of the genome. Instead of using 
bacterial cells to generate these libraries (as we saw in Figure 
8-30), they are made using PCR amplification of billions of DNA 
fragments, each attached to a solid support. The amplification is 
carried out so that the PCR-generated copies, instead of floating 
away in solution, remain bound in proximity to the original DNA 
fragment. This process generates clusters of DNA fragments, 
where each cluster contains about 1000 identical copies of a 
small bit of the genome. These clusters—a billion of which can 
fit in a single slide or plate—are then sequenced at the same 
time; that is, in parallel. 



A slide showing individual clusters of PCR-generated DNA molecules.
Each cluster carries about 1000 identical DNA molecules; the four
colors are produced by incorporation of C, G, A, or T, each of which
has a different color fluorophore. The image has been taken just after
a fluorescent nucleotide has been incorporated into each growing DNA
chain. (From Illumina Sequencing Overview, 2013.)


One method, known as Illumina sequencing, is based on the
dideoxy method described above, but it incorporates several
innovations. Here, each nucleotide is attached to a removable
fluorescent molecule (a different color for each of the four
bases) as well as a special chain-terminating chemical adduct:
instead of simply lacking a 3'-OH group, as in conventional
dideoxy sequencing, the nucleotides carry a chemical group that
blocks elongation by DNA polymerase but can be removed
chemically. Sequencing is then carried out as follows: the four
fluorescently labeled nucleotides, along with DNA polymerase, are
added to billions of DNA clusters immobilized on a slide. Only the
appropriate nucleotide (the one complementary to the next
nucleotide in the template) is covalently incorporated at each
cluster; the unincorporated nucleotides are washed away, and
a high-resolution digital camera takes an image that registers
which of the four nucleotides was added to the chain at each
cluster. The fluorescent label and the 3' blocking group are
then removed chemically and washed away, and the process is
repeated many times. In this way, billions of sequencing
reactions are carried out simultaneously. By keeping track of
the color changes occurring at each cluster, the DNA sequence
represented by each spot can be read. Although each
individual sequence read is relatively short (approximately
200 nucleotides), the billions that are carried out
simultaneously can produce several human genomes' worth
of sequence in about a day.

Principle behind Illumina sequencing. This reaction is carried out
stepwise, on billions of DNA clusters at once. The method relies
on a color digital camera that rapidly scans all the DNA clusters
after each round of modified nucleotide incorporation. The DNA
sequence of each cluster is then determined by the sequence of
color changes it undergoes as the elongation reaction proceeds
stepwise. Each round of modified nucleotide incorporation,
image acquisition, and removal of the 3' block and the
fluorescent group takes less than an hour. Each cluster on the
slide contains many copies of a different, random bit of the
genome; in preparing the clusters, a DNA sequence (specified
by the experimenter) is joined to each copy in every cluster,
and a primer complementary to this sequence is used to begin
the elongation reaction by DNA polymerase.
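The readout step of Illumina sequencing, in which the sequence at each cluster is inferred from the succession of colors recorded over many cycles, can be illustrated with a minimal sketch. The color-to-base assignment, the function names, and the sample data below are all hypothetical and chosen only for illustration; real instruments use their own dye chemistry and image analysis.

```python
# Hypothetical color assignment: one fluorophore color per nucleotide.
COLOR_TO_BASE = {"green": "A", "blue": "C", "yellow": "G", "red": "T"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def call_bases(cycle_images):
    """Build one read per cluster from a list of per-cycle images.

    Each image is a dict mapping a cluster id to the color seen at that
    cluster after one round of nucleotide incorporation.
    """
    reads = {}
    for image in cycle_images:
        for cluster_id, color in image.items():
            reads.setdefault(cluster_id, []).append(COLOR_TO_BASE[color])
    return {cluster_id: "".join(bases) for cluster_id, bases in reads.items()}


def template_of(read):
    """Return the template strand (written 5'-to-3') that directed the read.

    The colors report the nucleotides added to the growing chain, so the
    template is the reverse complement of the read.
    """
    return read.translate(_COMPLEMENT)[::-1]


# Three cycles imaged for two clusters (made-up data).
cycle_images = [
    {"cluster_1": "green", "cluster_2": "blue"},
    {"cluster_1": "red", "cluster_2": "blue"},
    {"cluster_1": "yellow", "cluster_2": "green"},
]
reads = call_bases(cycle_images)
print(reads)                             # {'cluster_1': 'ATG', 'cluster_2': 'CCA'}
print(template_of(reads["cluster_1"]))   # CAT
```

Because every cluster is handled independently, the same loop applies whether the image contains two clusters or billions; only the bookkeeping grows.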
ION TORRENT™ SEQUENCING


Another widely used strategy for rapid DNA sequencing is
called the ion torrent method. Here, a genome is fragmented,
and the individual fragments are attached to microscopic
beads. Using PCR, each DNA fragment is then amplified so
that copies of it eventually coat the bead to which it was
initially attached. This process produces a library of billions of
individual beads, each covered with identical copies of a
particular DNA fragment.

Like eggs in a carton, the beads are placed into individual
wells on an array that can hold a billion beads in a square
inch. Beginning with a primer, DNA synthesis is then initiated
on each bead. A hydrogen ion (H+) is released (along with
pyrophosphate) each time a nucleotide is incorporated into a
growing DNA chain (see Figure 5-3), and the ion torrent
method is based on this simple fact. Each of the four
nucleotides is washed in, one at a time, over the array of
beads; when a nucleotide is incorporated in the DNA of a
given bead, the release of an H+ ion changes the pH, which
is registered by a semiconductor chip placed beneath the
array of wells. In this way, the DNA sequence on a given
bead can be read from the pattern of pH changes observed
as nucleotides are washed over it. Like a high-resolution
sensor in a digital camera, the ion torrent semiconductor
chip can register enormous amounts of information and can
thus keep track of billions of parallel sequencing reactions.
With this technology it is currently possible, using a single
chip, to determine the nucleotide sequences of several
human genomes in just a few hours.
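The way a sequence is read from the pattern of pH changes can be sketched for a single bead. The sketch assumes that the size of the pH signal reports how many identical nucleotides were added during one wash (an assumption needed to handle runs such as TT, and not stated in the text above); the wash order "TACG" and both function names are likewise hypothetical.

```python
def simulate_flows(new_strand, flow_cycle="TACG"):
    """Return (flow_order, signals) for synthesizing new_strand on one bead.

    Nucleotides are washed in one at a time, cycling through flow_cycle.
    Each signal is the number of nucleotides incorporated during that wash
    (0 means no pH change was seen).
    """
    if set(new_strand) - set(flow_cycle):
        raise ValueError("new_strand contains a base that is never washed in")
    flows, signals = [], []
    pos = 0
    wash = 0
    while pos < len(new_strand):
        base = flow_cycle[wash % len(flow_cycle)]
        count = 0
        # The polymerase keeps adding this base for as long as it matches.
        while pos < len(new_strand) and new_strand[pos] == base:
            count += 1
            pos += 1
        flows.append(base)
        signals.append(count)
        wash += 1
    return "".join(flows), signals


def decode_flows(flow_order, signals):
    """Reconstruct the newly synthesized strand from the pH signals."""
    if len(flow_order) != len(signals):
        raise ValueError("one signal is required per wash")
    return "".join(base * count for base, count in zip(flow_order, signals))


flow_order, signals = simulate_flows("ATTG")
print(flow_order, signals)                # TACGTACG [0, 1, 0, 0, 2, 0, 0, 1]
print(decode_flows(flow_order, signals))  # ATTG
```

As in the figure for this method, the decoded strand is the one being synthesized; each incorporated A implies a T at that position of the template, and so on.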


DNA sequencing by the ion torrent method. Beads, each coated with a DNA
molecule that has been amplified many times, are placed in wells along with
primers and DNA polymerase. As nucleotides are sequentially washed over the
beads, those incorporated by the polymerase cause a pH change. In the
example shown, an A is incorporated; thus, the template must have a T in this
position. As the four nucleotides are sequentially washed over the beads, the
sequence of the DNA on each bead can be "read" by the pattern of pH
fluctuations. Billions of beads are monitored at once by a voltage-sensitive
semiconductor chip placed below the array of beads.

THE FUTURE OF DNA SEQUENCING

Even newer, potentially faster, methods of sequencing DNA are being
developed. Some of these "third-generation" technologies bypass the
DNA amplification steps altogether and determine the sequence of single
molecules of DNA. In one technique, a DNA molecule is pushed through a
tiny channel, like a thread through the eye of a needle. As the DNA
molecule moves through the pore, it generates electrical currents that
depend on its sequence of nucleotides; the pattern of currents can then
be used to deduce the nucleotide sequence. Other methods visualize
single DNA molecules using electron or atomic force microscopy; the
nucleotide sequence is read from the small differences in the
"appearance" of the DNA as it is scanned. Finally, another method is
based on immobilizing a single DNA polymerase molecule (with a
template) and measuring the "dwell" time of each of the four
nucleotides, which are labeled with different removable fluorescent dyes.
Nucleotides that reside longer on the polymerase (before their dye is
removed) are those incorporated by the polymerase. Although the two
methods we have described in detail (Illumina and ion torrent) are now
used extensively, it is likely that faster and cheaper methods will
continue to be developed.


Shown here is the cost of sequencing a human genome, which fell from
$100 million in 2001 to about a thousand dollars by the end of 2014.
(Data from the National Human Genome Research Institute.)

contain a stop codon about every 20 amino acids; protein-coding regions will,
in contrast, usually contain much longer stretches without stop codons (Figure
8-40). Known as open reading frames (ORFs), these usually signify bona fide
protein-coding genes. This assignment is often "double-checked" by comparing
the ORF amino acid sequence to the many databases of documented proteins
from other species. If a match is found, even an imperfect one, it is very likely that
the ORF will code for a functional protein (see Figure 8-23).
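The search for open reading frames can be sketched in a few lines. In the standard genetic code, 3 of the 64 codons are stop codons, so a random sequence read in any one frame meets a stop about every 64/3, or roughly 21, codons, in line with the figure of about 20 quoted above; a long stop-free stretch is therefore unlikely to arise by chance. The sketch below scans all six reading frames for stretches that run from an ATG to the next in-frame stop. The function names and the `min_codons` threshold are illustrative assumptions, not part of any particular gene-finding program.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the opposite strand, written 5'-to-3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_orfs(dna, min_codons=100):
    """List ORFs as (strand, frame, start, end, n_codons) tuples.

    An ORF here runs from an ATG to the next in-frame stop codon.
    start and end are 0-based, end-exclusive positions on the strand being
    read (for "-", positions on the reverse complement); end includes the
    stop codon, while n_codons counts the codons before the stop.
    """
    dna = dna.upper()
    orfs = []
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for frame in range(3):
            start = None
            for i in range(frame, len(seq) - 2, 3):
                codon = seq[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    n_codons = (i - start) // 3
                    if n_codons >= min_codons:
                        orfs.append((strand, frame + 1, start, i + 3, n_codons))
                    start = None
    return orfs


# A toy sequence with one short ORF in reading frame 3 of the "+" strand.
toy = "CC" + "ATG" + "GCT" * 5 + "TAA" + "G"
print(find_orfs(toy, min_codons=3))   # [('+', 3, 2, 23, 6)]
```

For real genomes the threshold would be set much higher than in the toy example, and candidate ORFs would still need the database comparison described in the text.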

This strategy works very well for compact genomes, where intron sequences
are rare and ORFs often extend for many hundreds of amino acids. However, in
many animals and plants, the average exon size is 150-200 nucleotide pairs (see
Figure 6-31) and additional information is usually required to unambiguously
locate all the exons of a gene. Although it is possible to search genomes for
splicing signals and other features that help to identify exons (codon bias, for
example), one of the most powerful methods is simply to sequence the total RNA
produced from the genome in living cells. As can be seen in Figure 7-3, this RNA-seq
information, when mapped onto the genome sequence, can be used to accurately
locate all the introns and exons of even complex genes. By sequencing total RNA
from different cell types, it is also possible to identify cases of alternative splicing
(see Figure 6-26).
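The idea of locating exons from mapped RNA-seq data can be illustrated with a deliberately simplified sketch. It assumes that the reads have already been mapped and summarized as a read depth at each genome position, and it simply calls every continuously covered stretch an exon and every gap between two such stretches an intron. The function names, the `min_depth` parameter, and the sample depths are hypothetical; real analyses rely on reads that span splice junctions rather than on coverage alone.

```python
def call_exons(coverage, min_depth=1):
    """Return covered stretches as (start, end) pairs, end-exclusive.

    coverage is a list giving the RNA-seq read depth at each position of
    a genomic region.
    """
    exons = []
    start = None
    for pos, depth in enumerate(coverage):
        if depth >= min_depth and start is None:
            start = pos                      # a covered stretch begins
        elif depth < min_depth and start is not None:
            exons.append((start, pos))       # the stretch has ended
            start = None
    if start is not None:
        exons.append((start, len(coverage)))
    return exons


def introns_between(exons):
    """Return the uncovered gaps that separate consecutive exons."""
    return [(left_end, right_start)
            for (_, left_end), (right_start, _) in zip(exons, exons[1:])]


coverage = [0, 0, 5, 6, 7, 0, 0, 0, 4, 4, 0]   # made-up read depths
exons = call_exons(coverage)
print(exons)                    # [(2, 5), (8, 10)]
print(introns_between(exons))   # [(5, 8)]
```

Running the same procedure on RNA from different cell types and comparing the resulting exon lists is one simple way to picture how alternative splicing shows up in such data.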

RNA-seq also identifies noncoding RNAs produced by a genome. Although the
function of some of them can be readily recognized (tRNAs or snoRNAs, for
example), many have unknown functions and still others probably have no function at
all (discussed in Chapter 7, pp. 429-436). The existence of the many noncoding
RNAs and our relative ignorance of their function is the main reason that we know
only the approximate number of genes in the human genome.

But even for protein-coding genes that have been unambiguously identified,
we still have much to learn. Thousands of genomes have been sequenced, and we
know from comparative genomics that many organisms share the same basic set
of proteins. However, the functions of a very large number of identified proteins
remain unknown. Depending on the organism, approximately one-third of the
proteins encoded by a sequenced genome do not clearly resemble any protein
that has been studied biochemically. This observation underscores a limitation of
the emerging field of genomics: although comparative analysis of genomes reveals
a great deal of information about the relationships between genes and organisms,
it often does not provide immediate information about how these genes function,
or what roles they have in the physiology of an organism. Comparison of the full
gene complement of several thermophilic bacteria, for example, does not reveal
why these bacteria thrive at temperatures exceeding 70°C. And examination of the
genome of the incredibly radioresistant bacterium Deinococcus radiodurans does
not explain how this organism can survive a blast of radiation that can shatter
glass. Further biochemical and genetic studies, like those described in the other
sections of this chapter, are required to determine how genes, and the proteins
they produce, function in the context of living organisms.

Figure 8-40 Finding the regions in a DNA sequence that encode a protein.
(A) Any region of the DNA sequence can, in principle, code for six different
amino acid sequences, because any one of three different reading frames can be
used to interpret the nucleotide sequence on each strand. Note that a nucleotide
sequence is always read in the 5'-to-3' direction and encodes a polypeptide from
the N-terminus to the C-terminus. For a random nucleotide sequence read in a
particular frame, a stop signal for protein synthesis is encountered, on average,
about once every 20 amino acids. In this sample sequence of 48 base pairs, each
such signal (stop codon) is colored blue, and only reading frame 2 lacks a stop
signal. (B) Search of a 1700-base-pair DNA sequence for a possible
protein-encoding sequence. The information is displayed as in (A), with each
stop signal for protein synthesis denoted by a blue line. In addition, all of the
regions between possible start and stop signals for protein synthesis (see
pp. 347-349) are displayed as red bars. Only reading frame 1 actually encodes a
protein, which is 475 amino acid residues long.

DNA Cloning Allows Any Protein to be Produced in Large 
Amounts 

In the last section, we saw how protein-coding genes can be identified in genome
sequences. Using the genetic code (and provided the intron and exon boundaries
are known), the amino acid sequence of any protein coded in a genome can be
deduced. As was discussed earlier, this sequence can often provide an important
clue to the protein's function if found to be similar to the amino acid sequence of
a protein that has already been studied (see Figure 8-23). Although this strategy
is often successful, it typically provides only the likely biochemical function of the
protein; for example, whether the protein resembles a kinase or a protease. It
usually remains for the experimenter to verify (or refute) this assignment and, most
importantly, to discover the protein's biological function in the whole organism;
that is, to what attributes of the organism does the kinase or the protease
contribute and in what molecular pathways does it function? Nowadays, most new
proteins are "discovered" through genome sequencing, and it often remains a great
challenge to ascertain their functions.

An important approach in determining gene function is to alter the gene (or in
some cases, its expression pattern), to put the altered copy back into the germ line
of the organism, and to deduce the function of the normal gene by the changes
caused by its alteration. Various techniques to implement this strategy are
discussed in the next section of this chapter. But it is equally important to study the
biochemical and structural properties of a gene product, as outlined in the first
part of this chapter. One of the most important contributions of DNA cloning to
cell and molecular biology is the ability to produce any protein, even the rare ones,
in nearly unlimited amounts, as long as the gene coding for it is known. Such
high-level production is usually carried out in living cells using expression vectors
(Figure 8-41). These are generally plasmids that have been designed to produce a
large amount of stable mRNA that can be efficiently translated into protein when
the plasmid is introduced into bacterial, yeast, insect, or mammalian cells. To
prevent the high level of the foreign protein from interfering with the cell's growth,
the expression vector is often designed to delay the synthesis of the foreign mRNA
and protein until shortly before the cells are harvested and lysed (Figure 8-42).

Because the desired protein made from an expression vector is produced
inside a cell, it must be purified away from the host-cell proteins by
chromatography following cell lysis; but because it is such a plentiful species in
the cell (often 1-10% of the total cell protein), the purification is usually easy to
accomplish in only a few steps. As we saw in the first part of this chapter, many
expression vectors have been designed to add a molecular tag (a cluster of
histidine residues or a small marker protein) to the expressed protein to facilitate
easy purification by affinity chromatography (see Figure 8-11). A variety of
expression vectors is available, each engineered to function in the type of cell in
which the protein is to be made.

Figure 8-41 Production of large amounts of a protein from a protein-
coding DNA sequence cloned into an expression vector and introduced
into cells. A plasmid vector has been engineered to contain a highly active
promoter, which causes unusually large amounts of mRNA to be produced
from an adjacent protein-coding gene inserted into the plasmid vector.
Depending on the characteristics of the cloning vector, the plasmid is
introduced into bacterial, yeast, insect, or mammalian cells, where the inserted
gene is efficiently transcribed and translated into protein. If the gene to be
overexpressed has no introns (typical for genes from bacteria, archaea, and
simple eukaryotes), it can simply be cloned from genomic DNA by PCR. For
cloned animal and plant genes, it is often more convenient to obtain the gene
as cDNA, either from a cDNA library (see Figure 8-32) or cloned directly by
PCR from RNA isolated from the organism (see Figure 8-37). Alternatively, the
DNA coding for the protein can be made by chemical synthesis (see p. 472).

This technology is also used to make large amounts of many medically
useful proteins, including hormones (such as insulin and growth factors) used as
human pharmaceuticals, and viral coat proteins for use in vaccines. Expression
vectors also allow scientists to produce many proteins of biological interest in
large enough amounts for detailed structural studies. Nearly all
three-dimensional protein structures depicted in this book are of proteins produced
in this way. Recombinant DNA techniques thus allow scientists to move with ease
from protein to gene, and vice versa, so that the functions of both can be explored
on multiple fronts (Figure 8-43).

Summary 

DNA cloning allows a copy of any specific part of a DNA or RNA sequence to be
selected from the millions of other sequences in a cell and produced in unlimited
amounts in pure form. DNA sequences can be amplified after breaking up
chromosomal DNA and inserting the resulting DNA fragments into the chromosome of
a self-replicating genetic element such as a plasmid. The resulting "genomic DNA
library" is housed in millions of bacterial cells, each carrying a different cloned DNA
fragment. Individual cells from this library that are allowed to proliferate produce
large amounts of a single cloned DNA fragment. Bypassing cloning vectors and
bacterial cells altogether, the polymerase chain reaction (PCR) allows DNA cloning to
be performed directly with a DNA polymerase and DNA primers, provided that
the DNA sequence of interest is already known.

The procedures used to obtain DNA clones that correspond in sequence to
mRNA molecules are the same, except that a DNA copy of the mRNA sequence,
called cDNA, is first made. Unlike genomic DNA clones, cDNA clones lack intron
sequences, making them the clones of choice for analyzing the protein product of a
gene.

Nucleic acid hybridization reactions provide a sensitive means of detecting
any nucleotide sequence of interest. The enormous specificity of this hybridization
reaction allows any single-stranded sequence of nucleotides to be labeled with a
radioisotope or chemical and used as a probe to find a complementary partner
strand, even in a cell or cell extract that contains millions of different DNA and RNA
sequences. DNA hybridization also makes it possible to use PCR to amplify any
section of any genome once its sequence is known.

The nucleotide sequence of any genome can be determined rapidly and simply
by using highly automated techniques based on several different strategies.
Comparison of the genome sequences of different organisms allows us to trace the
evolutionary relationships among genes and organisms, and it has proved valuable for
discovering new genes and predicting their functions.

Taken together, these techniques for analyzing and manipulating DNA have
made it possible to sequence, identify, and isolate genes from any organism of
interest. Related technologies allow scientists to produce the protein products of these
genes in the large quantities needed for detailed analyses of their structure and
function, as well as for medical purposes.

Figure 8-42 Production of large amounts of a protein by using a plasmid
expression vector. In this example, an expression vector that overproduces a DNA
helicase has been introduced into bacteria. In this expression vector, transcription
of the helicase coding sequence is under the control of a viral promoter that
becomes active only at a temperature of 37°C or higher. The total cell protein,
either from bacteria grown at 25°C (no helicase protein made) or after a shift of
the same bacteria to 42°C for up to 2 hours (helicase protein has become the most
abundant protein species in the lysate), has been analyzed by SDS
polyacrylamide-gel electrophoresis. (Courtesy of Jack Barry.)

Figure 8-43 Recombinant DNA techniques make it possible to move experimentally
from gene to protein and from protein to gene. If a gene has been identified
(right), its protein-coding sequence can be inserted into an expression vector to
produce large quantities of the protein (see Figure 8-41), which can then be
studied biochemically or structurally. If a protein has been purified based on its
biochemical properties, mass spectrometry (see Figure 8-18) can be used to obtain
a partial amino acid sequence, which is used to search a genome sequence for the
corresponding nucleotide sequence. The complete gene can then be cloned by PCR
from a sequenced genome (see Figure 8-37). The gene can also be manipulated and
introduced into cells or organisms to study its function, a topic covered in the
next section of this chapter.

STUDYING GENE EXPRESSION AND FUNCTION 

Ultimately, one wishes to determine how genes, and the proteins they encode,
function in the intact organism. Although it may seem counterintuitive, one of the
most direct ways to find out what a gene does is to see what happens to the
organism when that gene is missing. Studying mutant organisms that have acquired
changes or deletions in their nucleotide sequences is a time-honored practice
in biology and forms the basis of the important field of genetics. Because
mutations can disrupt cell processes, mutants often hold the key to understanding
gene function. In the classical genetic approach, one begins by isolating mutants
that have an interesting or unusual appearance: fruit flies with white eyes or curly
wings, for example. Working backward from the phenotype (the appearance or
behavior of the individual), one then determines the organism's genotype, the
form of the gene responsible for that characteristic (Panel 8-2).

Today, with numerous genome sequences available, the exploration of gene
function often begins with a DNA sequence. Here, the challenge is to translate
sequence into function. One approach, discussed earlier in the chapter, is to
search databases for well-characterized proteins that have similar amino acid
sequences to the protein encoded by a new gene. From there, the protein (or for
noncoding genes, the RNA molecule) can be overexpressed and purified and the
methods described in the first part of this chapter can be employed to study its
three-dimensional structure and its biochemical properties. But to determine
directly a gene's function in a cell or organism, the most effective approach
involves studying mutants that either lack the gene or express an altered version
of it. Determining which cell processes have been disrupted or compromised in
such mutants will usually shed light on a gene's biological role.

In this section, we describe several approaches to determining a gene's
function, starting either from an individual with an interesting phenotype or from a
DNA sequence. We begin with the classical genetic approach, which starts with
a genetic screen for isolating mutants of interest and then proceeds toward
identification of the gene or genes responsible for the observed phenotype. We then
describe the set of techniques that are collectively called reverse genetics, in which
one begins with a gene or gene sequence and attempts to determine its
function. This approach often involves some intelligent guesswork (searching for
similar sequences in other organisms or determining when and where a gene is
expressed) as well as generating mutant organisms and characterizing their
phenotype.

Classical Genetics Begins by Disrupting a Cell Process by 
Random Mutagenesis 

Before the advent of gene cloning technology, most genes were identified by the
abnormalities produced when the gene was mutated. Indeed, the very concept of
the gene was deduced from the heritability of such abnormalities. This classical
genetic approach (identifying the genes responsible for mutant phenotypes) is
most easily performed in organisms that reproduce rapidly and are amenable to
genetic manipulation, such as bacteria, yeasts, nematode worms, and fruit flies.
Although spontaneous mutants can sometimes be found by examining extremely
large populations (thousands or tens of thousands of individual organisms),
isolating mutant individuals is much more efficient if one generates mutations with
chemicals or radiation that damage DNA. By treating organisms with such
mutagens, very large numbers of mutant individuals can be created quickly and then
screened for a particular defect of interest, as we discuss shortly.

PANEL 8-2: Review of Classical Genetics

GENES AND PHENOTYPES

Gene: a functional unit of inheritance, usually corresponding to the segment of
DNA coding for a single protein.
Genome: all of an organism's DNA sequences.
locus: the site of the gene in the genome
alleles: alternative forms of a gene
GENOTYPE: the specific set of alleles forming the genome of an individual
PHENOTYPE: the visible character of the individual
Wild-type: the normal, naturally occurring type
Mutant: differing from the wild-type because of a genetic change (a mutation)

[Diagram: individuals that are homozygous A/A, heterozygous a/A, and
homozygous a/a.]

allele A is dominant (relative to a); allele a is recessive (relative to A)

In the example above, the phenotype of the heterozygote is the same as that of
one of the homozygotes; in cases where it is different from both, the two alleles
are said to be co-dominant.

CHROMOSOMES

[Diagram labels: centromere; short "p" arm; long "q" arm.]

a chromosome at the beginning of the cell cycle, in G1 phase; the single long bar
represents one long double helix of DNA

a chromosome near the end of the cell cycle, in metaphase; it is duplicated and
condensed, consisting of two identical sister chromatids (each containing one DNA
double helix) joined at the centromere.

A normal diploid chromosome set, as seen in a metaphase spread, prepared by
bursting open a cell at metaphase and staining the scattered chromosomes. In the
example shown schematically here, there are three pairs of autosomes
(chromosomes inherited symmetrically from both parents, regardless of sex) and
two sex chromosomes: an X from the mother and a Y from the father. The numbers
and types of sex chromosomes and their role in sex determination are variable
from one class of organisms to another, as is the number of pairs of autosomes.

[Diagram labels: pair of autosomes; numbered maternal and paternal
chromosomes; sex chromosomes.]

THE HAPLOID-DIPLOID CYCLE OF SEXUAL REPRODUCTION

For simplicity, the cycle is shown for only one chromosome/chromosome pair.

MEIOSIS AND GENETIC RECOMBINATION

[Diagram: in a diploid germ cell (genotype AB/ab), the maternal chromosome
carries alleles A and B and the paternal chromosome carries alleles a and b.
Meiosis and recombination, with a site of crossing-over between the two loci,
produce haploid gametes (eggs or sperm), including recombinant gametes such
as genotype aB.]

The greater the distance between two loci on a single chromosome, the greater is
the chance that they will be separated by crossing over occurring at a site between
them. If two genes are thus reassorted in x% of gametes, they are said to be
separated on a chromosome by a genetic map distance of x map units (or
x centimorgans).

TYPES OF MUTATIONS

POINT MUTATION: maps to a single site in the genome, corresponding to a single
nucleotide pair or a very small part of a single gene

INVERSION: inverts a segment of a chromosome

DELETION: deletes a segment of a chromosome

TRANSLOCATION: breaks off a segment from one chromosome and attaches it to
another

lethal mutation: causes the developing organism to die prematurely.

conditional mutation: produces its phenotypic effect only under certain
conditions, called the restrictive conditions. Under other conditions (the
permissive conditions) the effect is not seen. For a temperature-sensitive
mutation, the restrictive condition typically is high temperature, while the
permissive condition is low temperature.

loss-of-function mutation: either reduces or abolishes the activity of the gene.
These are the most common class of mutations. Loss-of-function mutations are
usually recessive: the organism can usually function normally as long as it retains
at least one normal copy of the affected gene.

null mutation: a loss-of-function mutation that completely abolishes the activity
of the gene.

gain-of-function mutation: increases the activity of the gene or makes it active
in inappropriate circumstances; these mutations are usually dominant.

dominant-negative mutation: dominant-acting mutation that blocks gene activity,
causing a loss-of-function phenotype even in the presence of a normal copy of the
gene. This phenomenon occurs when the mutant gene product interferes with the
function of the normal gene product.

suppressor mutation: suppresses the phenotypic effect of another mutation, so
that the double mutant seems normal. An intragenic suppressor mutation lies
within the gene affected by the first mutation; an extragenic suppressor mutation
lies in a second gene, often one whose product interacts directly with the product
of the first.

TWO GENES OR ONE?

Given two mutations that produce the same phenotype, how can we tell whether
they are mutations in the same gene? If the mutations are recessive (as they most
often are), the answer can be found by a complementation test.

In the simplest type of complementation test, an individual who is homozygous
for one mutation is mated with an individual who is homozygous for the other.
The phenotype of the offspring gives the answer to the question.

[Diagram, first cross: homozygous mutant mother and homozygous mutant father;
hybrid offspring shows normal phenotype: one normal copy of each gene is
present.]

[Diagram, second cross: homozygous mutant mother and homozygous mutant
father; hybrid offspring shows mutant phenotype: no normal copies of the
mutated gene are present.]
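The definition of genetic map distance given in Panel 8-2 (two genes reassorted in x% of gametes lie x map units, or x centimorgans, apart) amounts to a simple calculation, sketched below. The function name, the way gametes are labeled, and the counts are all made up for illustration.

```python
def map_distance(gamete_counts, parental=("AB", "ab")):
    """Return the map distance in centimorgans between two loci.

    gamete_counts maps each gamete genotype (for example "AB" or "aB") to
    the number of gametes observed with it. Gametes whose genotype is not
    one of the two parental combinations are counted as recombinant.
    """
    total = sum(gamete_counts.values())
    if total == 0:
        raise ValueError("no gametes were scored")
    recombinant = sum(count for genotype, count in gamete_counts.items()
                      if genotype not in parental)
    return 100.0 * recombinant / total


# Made-up counts from a germ cell of genotype AB/ab.
counts = {"AB": 45, "ab": 43, "Ab": 7, "aB": 5}
print(map_distance(counts))   # 12.0
```

Here 12 of 100 gametes carry a new combination of alleles, so the two loci would be placed 12 map units apart.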

An alternative approach to chemical or radiation mutagenesis is called
insertional mutagenesis. This method relies on the fact that exogenous DNA inserted
randomly into the genome can produce mutations if the inserted fragment
interrupts a gene or its regulatory sequences. The inserted DNA, whose sequence is
known, then serves as a molecular tag that aids in the subsequent identification
and cloning of the disrupted gene (Figure 8-44). In Drosophila, the use of
the transposable P element to inactivate genes has revolutionized the study of
gene function in the fly. Transposable elements (see Table 5-4, p. 288) have also
been used to generate mutations in bacteria, yeast, mice, and the flowering plant
Arabidopsis.

Genetic Screens Identify Mutants with Specific Abnormalities

Once a collection of mutants in a model organism such as yeast or fly has been
produced, one generally must examine thousands of individuals to find the
altered phenotype of interest. Such a search is called a genetic screen, and the
larger the genome, the less likely it is that any particular gene will be mutated.
Therefore, the larger the genome of an organism, the bigger the screening task
becomes. The phenotype being screened for can be simple or complex. Simple
phenotypes are easiest to detect: one can screen many organisms rapidly, for
example, for mutations that make it impossible for the organism to survive in the
absence of a particular amino acid or nutrient.

More complex phenotypes, such as defects in learning or behavior, may require
more elaborate screens (Figure 8-45). But even genetic screens that are used to
dissect complex physiological systems can be simple in design, which permits the
simultaneous examination of large numbers of mutants. As an example, one particularly
elegant screen was designed to search for genes involved in visual processing
in zebrafish. The basis of this screen, which monitors the fishes' response
to motion, is a change in behavior. Wild-type fish tend to swim in the direction of
a perceived motion, whereas mutants with defects in their visual processing systems
swim in random directions—a behavior that is easily detected. One mutant
discovered in this screen is called lakritz, which is missing 80% of the retinal ganglion
cells that help to relay visual signals from the eye to the brain. As the cellular
organization of the zebrafish retina is similar to that of all vertebrates, the study
of such mutants should also provide insights into visual processing in humans.

Because defects in genes that are required for fundamental cell processes—
RNA synthesis and processing or cell-cycle control, for example—are usually
lethal, the functions of these genes are often studied in individuals with



Figure 8-44 Insertional mutant of the 
snapdragon, Antirrhinum. A mutation 
in a single gene coding for a regulatory 
protein causes leafy shoots (left) to develop 
in place of flowers, which occur in the 
normal plant (right). The mutation causes 
cells to adopt a character that would be 
appropriate to a different part of the normal 
plant, so instead of a flower, the cells 
produce a leafy shoot. (Courtesy of Enrico 
Coen and Rosemary Carpenter.) 





Figure 8-45 A behavioral phenotype 
detected in a genetic screen. (A) Wild- 
type C. elegans engage in social feeding. 
The worms migrate around until they 
encounter their neighbors and commence 
feeding on bacteria. (B) Mutant animals
feed by themselves. (Courtesy of Cornelia 
Bargmann, Cell 94: cover, 1998. With 
permission from Elsevier.) 
[Figure 8-46 panel labels: mutagenized cells proliferate and form colonies at 23°C; colonies are replicated onto two identical plates and incubated at two different temperatures; mutant cells proliferate and form a colony at the permissive temperature; mutant cells fail to proliferate and form a colony at the nonpermissive temperature.]


conditional mutations. The mutant individuals function normally as long as
"permissive" conditions prevail, but demonstrate abnormal gene function when
subjected to "nonpermissive" (restrictive) conditions. In organisms with
temperature-sensitive mutations, for example, the abnormality can be switched on
and off experimentally simply by changing the ambient temperature; thus, a cell
containing a temperature-sensitive mutation in a gene essential for survival will
die at a nonpermissive temperature but proliferate normally at the permissive
temperature (Figure 8-46). The temperature-sensitive gene in such a mutant usually
contains a point mutation that causes a subtle change in its protein product;
for example, the mutant protein may function normally at low temperatures but
unfold at higher temperatures.
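
The scoring step of a replica-plating screen can be sketched in a few lines of Python. This is only an illustration of the logic: the colony names and growth records below are hypothetical, and a real screen scores growth by eye or by imaging the plates rather than from a table.

```python
# Hypothetical growth records from a replica-plating experiment:
# colony name -> (grows at permissive temperature, grows at nonpermissive temperature)
growth = {
    "colony_1": (True, True),    # behaves like wild type
    "colony_2": (True, False),   # candidate temperature-sensitive mutant
    "colony_3": (True, True),
    "colony_4": (True, False),
}

def temperature_sensitive(records):
    """Return the colonies that grow only at the permissive temperature."""
    return sorted(
        name
        for name, (permissive, nonpermissive) in records.items()
        if permissive and not nonpermissive
    )

ts_candidates = temperature_sensitive(growth)
print(ts_candidates)
```

Colonies that fail to grow on either plate are excluded on purpose, since they cannot be distinguished from cells that simply did not transfer during replica plating.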

Temperature-sensitive mutations were crucial for finding the bacterial genes that
encode the proteins required for DNA replication. The mutants were identified
by screening populations of mutagen-treated bacteria for cells that stop making
DNA when they are warmed from 30°C to 42°C. These mutants were later used
to identify and characterize the corresponding DNA replication proteins (discussed
in Chapter 5). Similarly, screens for temperature-sensitive mutations led
to the identification of many proteins involved in regulating the cell cycle, as well
as many proteins involved in moving proteins through the secretory pathway
in yeast. Related screening approaches demonstrated the function of enzymes
involved in the principal metabolic pathways of bacteria and yeast (discussed in
Chapter 2) and identified many of the gene products responsible for the orderly
development of the Drosophila embryo (discussed in Chapter 21).

Mutations Can Cause Loss or Gain of Protein Function 

Gene mutations are generally classed as "loss of function" or "gain of function." A
loss-of-function mutation results in a gene product that either does not work or
works too little; thus, it can reveal the normal function of the gene. A gain-of-function
mutation results in a gene product that works too much, works at the wrong
time or place, or works in a new way (Figure 8-47).

An important early step in the genetic analysis of any mutant cell or organism
is to determine whether the mutation causes a loss or a gain of function. A
standard test is to determine whether the mutation is dominant or recessive. A
dominant mutation is one that still causes the mutant phenotype in the presence
of a single copy of the wild-type gene. A recessive mutation is one that is no longer
able to cause the mutant phenotype in the presence of a single wild-type copy of
the gene. Although cases have been described in which a loss-of-function mutation
is dominant or a gain-of-function mutation is recessive, in the vast majority of
cases, recessive mutations are loss of function and dominant mutations are gain


[Figure 8-47 panel labels: wild type; loss-of-function mutation (point mutation, truncation, deletion); conditional loss-of-function mutation (37°C, 25°C).]


Figure 8-46 Screening for temperature-sensitive
bacterial or yeast mutants.
Mutagenized cells are plated out at the
permissive temperature. They divide
and form colonies, which are transferred
to two identical Petri dishes by replica
plating. One of these plates is incubated
at the permissive temperature, the other
at the nonpermissive temperature. Cells
containing a temperature-sensitive
mutation in a gene essential for proliferation
can divide at the normal, permissive
temperature but fail to divide at the
elevated, nonpermissive temperature.
Temperature-sensitive mutations of this
type were especially useful for identifying
genes needed for DNA replication, an
essential process.


Figure 8-47 Gene mutations that affect 
their protein product in different ways. 

In this example, the wild-type protein has
a specific cell function denoted by the red
rays. Mutations that eliminate this function
or inactivate it at higher temperatures are
shown. The conditional mutant protein
carries an amino acid substitution (red)
that prevents its proper folding at 37°C,
but allows the protein to fold and function
normally at 25°C. Such temperature-sensitive
conditional mutations are
especially useful for studying essential
genes; the organism can be grown under
the permissive condition and then be moved
to the nonpermissive condition to study the
consequences of losing the gene product.
of function. It is easy to determine if a mutation is dominant or recessive. One
simply mates a mutant with a wild type to obtain diploid cells or organisms. The
progeny from the mating will be heterozygous for the mutation. If the mutant phenotype
is no longer observed, one can conclude that the mutation is recessive and
is very likely to be a loss-of-function mutation (see Panel 8-2).

Complementation Tests Reveal Whether Two Mutations Are in the
Same Gene or Different Genes

A large-scale genetic screen can turn up many different mutations that show the
same phenotype. These defects might lie in different genes that function in the
same process, or they might represent different mutations in the same gene. Alternative
forms of the same gene are known as alleles. The most common difference
between alleles is a substitution of a single nucleotide pair, but different alleles
can also bear deletions, substitutions, and duplications. How can we tell, then,
whether two mutations that produce the same phenotype occur in the same gene
or in different genes? If the mutations are recessive—if, for example, they represent
a loss of function of a particular gene—a complementation test can be used
to ascertain whether the mutations fall in the same gene or in different genes. To
test complementation in a diploid organism, an individual that is homozygous
for one mutation—that is, it possesses two identical alleles of the mutant gene in
question—is mated with an individual that is homozygous for the other mutation.
If the two mutations are in the same gene, the offspring show the mutant
phenotype, because they still will have no normal copies of the gene in question
(Figure 8-48). If, in contrast, the mutations fall in different genes, the resulting
offspring show a normal phenotype, because they retain one normal copy (and
one mutant copy) of each gene; the mutations thereby complement one another
and restore a normal phenotype. Complementation testing of mutants identified
during genetic screens has revealed, for example, that 5 different genes are
required for yeast to digest the sugar galactose, 20 genes are needed for E. coli to
build a functional flagellum, 48 genes are involved in assembling bacteriophage
T4 viral particles, and hundreds of genes are involved in the development of an
adult nematode worm from a fertilized egg.
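
The bookkeeping behind a set of complementation tests can be illustrated with a short Python sketch. The mutant names, the hidden gene assignments, and the `cross` helper are all hypothetical; the sketch also assumes that every mutation is recessive and that failure to complement always means the two mutations lie in the same gene.

```python
def complementation_groups(mutants, complements):
    """Group recessive mutants so that mutants failing to complement share a group.

    complements(a, b) returns True when the a x b offspring look normal
    (mutations in different genes) and False when they look mutant.
    """
    groups = []  # each group is a list of mutants inferred to affect one gene
    for m in mutants:
        for group in groups:
            # Failure to complement the group's first member places m in that group.
            if not complements(m, group[0]):
                group.append(m)
                break
        else:
            groups.append([m])
    return groups

# Hypothetical truth, unknown to the experimenter: the gene hit in each mutant.
hidden_gene = {"m1": "geneA", "m2": "geneB", "m3": "geneA", "m4": "geneC", "m5": "geneB"}

def cross(a, b):
    # Offspring are normal only if the two mutations lie in different genes.
    return hidden_gene[a] != hidden_gene[b]

groups = complementation_groups(["m1", "m2", "m3", "m4", "m5"], cross)
print(groups)
```

The number of groups returned is an estimate of the number of genes identified by the screen, which is how counts such as those quoted above are obtained.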

Gene Products Can Be Ordered in Pathways by Epistasis Analysis 

Once a set of genes involved in a particular biological process has been identified, 
the next step is often to determine in which order the genes function. Gene order
is perhaps easiest to explain for metabolic pathways, where, for example, enzyme
A is necessary to produce the substrate for enzyme B. In this case, we would say
that the gene encoding enzyme A acts before (upstream of) the gene encoding
enzyme B in the pathway. Similarly, where one protein regulates the activity of
another protein, we would say that the former gene acts before the latter. Gene 
order can, in many cases, be determined purely by genetic analysis without any 
knowledge of the mechanism of action of the gene products involved. 

Suppose we have a biosynthetic process consisting of a sequence of steps,
such that performance of step B is conditional on completion of the preceding
step A; and suppose gene A is required for step A, and gene B is required for step
B. Then a null mutation (a mutation that abolishes function) in gene A will arrest
the process at step A, regardless of whether gene B is functional or not, whereas
a null mutation in gene B will cause arrest at step B only if gene A is still active.
In such a case, gene A is said to be epistatic to gene B. By comparing the phenotypes
of the different combinations of mutations, we can therefore discover the
order in which the genes act. This type of analysis is called epistasis analysis. As
an example, the pathway of protein secretion in yeast has been analyzed in this
way. Different mutations in this pathway cause proteins to accumulate aberrantly
in the endoplasmic reticulum (ER) or in the Golgi apparatus. When a yeast cell is
engineered to carry both a mutation that blocks protein processing in the ER and
a mutation that blocks processing in the Golgi apparatus, proteins accumulate in



Figure 8-48 A complementation test can
reveal that mutations in two different
genes are responsible for the same
abnormal phenotype. When an albino
(white) bird from one strain is bred with an
albino from a different strain, the resulting
offspring (bottom) have normal coloration.
This restoration of the wild-type plumage
indicates that the two white breeds lack
color because of recessive mutations
in different genes. (From W. Bateson,
Mendel’s Principles of Heredity, 1st ed.
Cambridge, UK: Cambridge University
Press, 1913.)
[Figure 8-49 panel labels: normal cell, secretory mutant A, secretory mutant B, double mutant AB; outcome labels: protein secreted, protein accumulates in ER.]

Figure 8-49 Using genetics to determine
the order of function of genes. In normal
cells, secretory proteins are loaded into
vesicles, which fuse with the plasma
membrane to secrete their contents into
the extracellular medium. Two mutants, A
and B, fail to secrete proteins. In mutant A,
secretory proteins accumulate in the ER. In
mutant B, secretory proteins accumulate
in the Golgi. In the double mutant AB,
proteins accumulate in the ER; this
indicates that the gene defective in mutant
A acts before the gene defective in mutant
B in the secretory pathway.


the ER. This indicates that proteins must pass through the ER before being sent to
the Golgi before secretion (Figure 8-49). Strictly speaking, an epistasis analysis
can only provide information about gene order in a pathway when both mutations
are null alleles. When the mutations retain partial function, their epistasis
interactions can be difficult to interpret.
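
The inference drawn from the secretion example can be written as a small Python function. It is a sketch under the assumptions stated in the text: both mutations are null alleles, and the pathway is a linear sequence of dependent steps, so the double mutant resembles the single mutant blocked at the earlier step. The phenotype strings are hypothetical labels.

```python
def upstream_gene(phenotype_a, phenotype_b, phenotype_double):
    """Infer which gene acts first from single- and double-mutant phenotypes.

    Assumes null alleles in a linear pathway of dependent steps.
    Returns "A", "B", or None when the double mutant matches neither
    single mutant (or matches both, which is uninformative).
    """
    matches_a = phenotype_double == phenotype_a
    matches_b = phenotype_double == phenotype_b
    if matches_a and not matches_b:
        return "A"
    if matches_b and not matches_a:
        return "B"
    return None

result = upstream_gene(
    "accumulates in ER",      # secretory mutant A
    "accumulates in Golgi",   # secretory mutant B
    "accumulates in ER",      # double mutant AB
)
print(result)
```

A return value of None covers the cases the text warns about, such as a double mutant with a new phenotype, where simple ordering is not justified.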

Sometimes, a double mutant will show a new or more severe phenotype than 
either single mutant alone. This type of genetic interaction is called a synthetic 
phenotype, and if the phenotype is death of the organism, it is called synthetic 
lethality. In most cases, a synthetic phenotype indicates that the two genes act in 
two different parallel pathways, either of which is capable of mediating the same 
cell process. Thus, when both pathways are disrupted in the double mutant, the
process fails altogether, and the synthetic phenotype is observed. 

Mutations Responsible for a Phenotype Can Be Identified Through 
DNA Analysis 

Once a collection of mutant organisms with interesting phenotypes has been
obtained, the next task is to identify the gene or genes responsible for the altered
phenotype. If the phenotype has been produced by insertional mutagenesis,
locating the disrupted gene is fairly simple. DNA fragments containing the insertion
(a transposon or a retrovirus, for example) are amplified by PCR, and the
nucleotide sequence of the flanking DNA is determined. The gene affected by
the insertion can then be identified by a computer-aided search of the complete
genome sequence of the organism.

If a DNA-damaging chemical was used to generate the mutations, identifying
the inactivated gene is often more laborious, but there are several powerful strategies
available. If the genome size of the organism is small (for example, for bacteria
or simple eukaryotes), it is possible to simply determine the genome sequence
of the mutant organism and identify the affected gene by comparison with the
wild-type sequence. Because of the continuous accumulation of neutral mutations,
there will probably be differences between the two genome sequences in
addition to the mutation responsible for the phenotype. One way of proving that
a mutation is causative is to introduce the putative mutation back into a normal
organism and determine whether or not it causes the mutant phenotype. We will
discuss how this is accomplished later in the chapter.
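
Comparing a mutant sequence with the wild-type sequence amounts to listing the positions at which they differ. The Python sketch below handles only single-nucleotide substitutions in two sequences that are already aligned and of equal length; the sequences are invented for illustration, and real genome comparisons rely on alignment and variant-calling software.

```python
def point_differences(wild_type, mutant):
    """List (position, wild-type base, mutant base) for each mismatch.

    Positions are 1-based. The two sequences must already be aligned
    and of equal length; insertions and deletions are not handled.
    """
    if len(wild_type) != len(mutant):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (i, w, m)
        for i, (w, m) in enumerate(zip(wild_type, mutant), start=1)
        if w != m
    ]

# Hypothetical 21-nucleotide stretch of a gene and its mutant counterpart
wild_type = "ATGGCCATTGTAATGGGCCGC"
mutant = "ATGGCCATTGTAATGAGCCGC"

diffs = point_differences(wild_type, mutant)
print(diffs)
```

Each difference found this way is only a candidate: as noted above, neutral differences will also appear in the list, so the causative change still has to be confirmed experimentally.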

Rapid and Cheap DNA Sequencing Has Revolutionized Human 
Genetic Studies 

Genetic screens in model experimental organisms have been spectacularly successful
in identifying genes and relating them to various phenotypes, including
many that are conserved between these organisms and humans. But how can we
study humans directly? They do not reproduce rapidly, cannot be treated with
mutagens, and, if they have a defect in an essential process such as DNA replication,
would die long before birth.

Despite their limitations compared to model organisms, humans are becoming
increasingly attractive subjects for genetic studies. Because the human
population is so large, spontaneous, nonlethal mutations have arisen in all human
genes—many times over. A substantial proportion of these remain in the genomes
of present-day humans. The most deleterious of these mutations are discovered
when the mutant individuals call attention to themselves by seeking medical help.

With the recent advances that have enabled the sequencing of entire human
genomes cheaply and quickly, we can now identify such mutations and study
their evolution and inheritance in ways that were impossible even a few years ago.
By comparing the sequences of thousands of human genomes from all around the
world, we can begin to identify directly the DNA differences that distinguish one
individual from another. These differences hold clues to our evolutionary origins
and can be used to explore the roots of disease.

Linked Blocks of Polymorphisms Have Been Passed Down from
Our Ancestors 

When we compare the sequences of multiple human genomes, we find that any
two individuals will differ in roughly 1 nucleotide pair in 1000. Most of these variations
are common and relatively harmless. When two sequence variants coexist
in the population and both are common, the variants are called polymorphisms.
The majority of polymorphisms are due to the substitution of a single nucleotide,
called single-nucleotide polymorphisms or SNPs (Figure 8-50). The rest are due
largely to insertions or deletions—called indels when the change is small, or copy
number variations (CNVs) when it is large. Although these common variants can
be found throughout the genome, they are not scattered randomly—or even independently.
Instead, they tend to travel in groups called haplotype blocks—combinations
of polymorphisms that are inherited as a unit.

To understand why such haplotype blocks exist, we need to consider our evolutionary
history. It is thought that modern humans expanded from a relatively
small population—perhaps around 10,000 individuals—that existed in Africa
about 60,000 years ago. Among that small group of our ancestors, some individuals
will have carried one set of genetic variants, others a different set. The chromosomes
of a present-day human represent a shuffled combination of chromosome
segments from different members of this small ancestral group of people. Because
only about two thousand generations separate us from them, large segments of
these ancestral chromosomes have passed from parent to child, unbroken by the
crossover events that occur during meiosis. As described in Chapter 5, only a few
crossovers occur between each set of homologous chromosomes during each
meiosis (see Figure 5-53).

As a result, certain sets of DNA sequences—and their associated polymorphisms—have
been inherited in linked groups, with little genetic rearrangement
across the generations. These are the haplotype blocks. Like genes that exist in
different allelic forms, haplotype blocks also come in a limited number of variants
that are common in the human population, each representing a combination of
DNA polymorphisms passed down from a particular ancestor long ago.


[Figure 8-50 labels: sequences of individuals A, B, C, and D aligned over about 1000 nucleotide pairs, with three variable sites marked SNP1, SNP2, and SNP3.]

Figure 8-50 Single-nucleotide polymorphisms (SNPs) are sites in the genome where two or
more alternative choices of a nucleotide are common in the population. Most such variations
in the human genome occur at locations where they do not significantly affect a gene’s function.

Polymorphisms Can Aid the Search for Mutations Associated with
Disease

Mutations that give rise, in a reproducible way, to rare but clearly defined abnormalities,
such as albinism, hemophilia, or congenital deafness, can often be identified
by studies of affected families. Such single-gene, or monogenic, disorders
are often referred to as Mendelian because their pattern of inheritance is easy to
track. Moreover, individuals who inherit the causative mutation will exhibit the
abnormality irrespective of environmental factors such as diet or exercise. But for
many common diseases, the genetic roots are more complex. Instead of a single
allele of a single gene, such disorders stem from a combination of contributions
from multiple genes. And often, environmental factors have strong influences
on the severity of the disorder. For these multigenic conditions, such as diabetes
or arthritis, population studies are often helpful in tracking down the genes that
increase the risk of getting the disease.

In population studies, investigators collect DNA samples from a large number
of people who have the disease and compare them to samples from a group of
people who do not have the disease. They look for variants—SNPs, for example—
that are more common among the people who have the disease. Because DNA
sequences that are close together on a chromosome tend to be inherited together,
the presence of such SNPs could indicate that an allele that increases the risk of
the disease might lie nearby (Figure 8-51). Although, in principle, the disease
could be caused by the SNP itself, the culprit is much more likely to be a change
that is merely linked to the SNP as part of a haplotype block.
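
The comparison at the heart of such a study can be sketched in Python for a single SNP. The genotypes below are invented, the sample is far smaller than any real study, and the allelic odds ratio is just one simple measure of association chosen for illustration; real analyses also test statistical significance and correct for testing many SNPs at once.

```python
def allele_frequency(genotypes, allele):
    """Fraction of chromosomes carrying `allele`; each genotype is a 2-letter string."""
    total = 2 * len(genotypes)
    return sum(g.count(allele) for g in genotypes) / total

def odds_ratio(cases, controls, allele):
    """Allelic odds ratio: odds of carrying `allele` in cases versus controls.

    Assumes the allele frequency is strictly between 0 and 1 in both groups.
    """
    p_case = allele_frequency(cases, allele)
    p_ctrl = allele_frequency(controls, allele)
    return (p_case / (1 - p_case)) / (p_ctrl / (1 - p_ctrl))

# Hypothetical genotypes at one SNP with alleles "A" and "G"
cases = ["AA", "AG", "AG", "AA", "GG", "AG", "AA", "AG"]
controls = ["GG", "AG", "GG", "AG", "GG", "GG", "AG", "AA"]

freq_cases = allele_frequency(cases, "A")
freq_controls = allele_frequency(controls, "A")
ratio = odds_ratio(cases, controls, "A")
print(freq_cases, freq_controls, round(ratio, 2))
```

An odds ratio well above 1 marks the SNP as associated with the disease in this sample, but, as the text stresses, it does not show that the SNP itself is the cause.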

Such genome-wide association studies have been used to search for genes that
predispose individuals to common diseases, including diabetes, coronary artery
disease, rheumatoid arthritis, and even depression. For many of these conditions,
the DNA polymorphisms identified increase the risk of disease only slightly.
Moreover, environmental factors (diet and exercise, for example) play an important
role in the onset and severity of the disease. Nonetheless, the identification of
genes affected by these polymorphisms is leading to a mechanistic understanding
of some of our most common disorders.


Genomics Is Accelerating the Discovery of Rare Mutations That
Predispose Us to Serious Disease

The genetic variants that have thus far allowed us to identify some of the genes
that increase our risk of disease are common ones. They arose long ago in our evolutionary
past and are now present, in one form or another, in a substantial portion
(1% or more) of the population. Such polymorphisms are thought to account

Figure 8-51 Genes that affect the
risk of developing a common disease
can often be tracked down through
linkage to SNPs. Here, the patterns
of SNPs are compared between two
sets of individuals—a set of healthy
controls and a set affected by a particular
common disease. A segment of a
typical chromosome is shown. For most
polymorphic sites in this segment, it is
a random matter whether an individual
has one SNP variant (red vertical bars)
or another (blue vertical bars); this same
randomness is seen both for the control
group and for the affected individuals.
However, in the part of the chromosome
that is shaded in darker gray, a bias is
seen: most normal individuals have the
blue SNP variants, whereas most affected
individuals have the red SNP variants.
This suggests that this region contains,
or is close to, a gene that is genetically
linked to these red SNP variants and that
predisposes individuals to the disease.
Using carefully selected controls and
thousands of affected individuals, this
approach can help track down disease-related
genes, even when they confer only
a slight increase in the risk of developing
the disease.

for about 90% of the differences between one person's genome and another's. But
when we try to tie these common variants to differences in disease susceptibility
or other heritable traits, such as height, we find that they do not have as much
predictive power as we had anticipated: thus, for example, most confer relatively
small increases—less than twofold—in the risk of developing a common disease.

In contrast to polymorphisms, rare DNA variants—those much less frequent
in humans than SNPs—can have large effects on the risk of developing some common
diseases. For example, a number of different loss-of-function mutations,
each individually rare, have been found to increase greatly the predisposition
to autism and schizophrenia. Many of these are de novo mutations, which arose
spontaneously in the germ-line cells of one or the other parent. The fact that these
mutations arise spontaneously with some frequency could help explain why
these common disorders—each observed in about 1% of the population—remain
with us, even though the affected individuals leave few or no descendants. These
rare mutations may arise in any one of hundreds of different genes, which could
explain much of the clinical variability of autism and schizophrenia. Because they
are kept rare by natural selection, most such variants with a large effect on risk
would be missed by genome-wide association studies.

Now that DNA sequencing has become fast and inexpensive, the most efficient
and cost-effective way to identify these rare, large-effect mutations is by sequencing
the genomes of affected individuals, along with those of their parents and siblings
as controls.
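
Once a family has been sequenced, candidate de novo mutations are the variants found in the affected child but in neither parent. The Python sketch below shows this as a set operation; the variant tuples are hypothetical, and it ignores sequencing errors, which in practice account for many apparent de novo calls and must be filtered out.

```python
def de_novo_variants(child, mother, father):
    """Return variants seen in the child but in neither parent.

    Each argument is a set of (chromosome, position, variant base) tuples.
    """
    return child - (mother | father)

# Hypothetical variant calls for one family
mother = {("chr1", 1050, "T"), ("chr2", 300, "G")}
father = {("chr1", 1050, "T"), ("chr7", 8800, "A")}
child = {("chr1", 1050, "T"), ("chr7", 8800, "A"), ("chr3", 4120, "C")}

candidates = de_novo_variants(child, mother, father)
print(candidates)
```

Unaffected siblings can be processed in the same way, which gives a baseline for how many de novo variants are expected in the absence of disease.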

Reverse Genetics Begins with a Known Gene and Determines
Which Cell Processes Require Its Function

As we have seen, classical genetics starts with a mutant phenotype (or, in the case
of humans, a range of characteristics) and identifies the mutations, and consequently
the genes, responsible for it. Recombinant DNA technology has made
possible a different type of genetic approach, one that is used widely in a variety
of genetically tractable species. Instead of beginning with a mutant organism and
using it to identify a gene and its protein, an investigator can start with a particular
gene and proceed to make mutations in it, creating mutant cells or organisms
so as to analyze the gene's function. Because this approach reverses the traditional
direction of genetic discovery—proceeding from genes to mutations, rather
than vice versa—it is commonly referred to as reverse genetics. And because the
genome of the organism is deliberately altered in a particular way, this approach
is also called genome engineering or genome editing. We shall see in this chapter
that this approach can be scaled upward so that whole collections of organisms
can be created, each of which has a different gene altered.

There are several ways a gene of interest can be altered. In the simplest, the
gene can simply be deleted from the genome, although in a diploid organism,
this requires that both copies—one on each chromosome homolog—be deleted.
Although somewhat counterintuitive, one of the best ways to discover the function
of a gene is by observing the effects of not having it. Such "gene knockouts"
are especially useful if the gene is not essential. Through reverse genetics, the gene
in question (even if it is essential) can also be replaced by one that is expressed in
the wrong tissue or at the wrong time in development; this type of manipulation
often provides important clues to the gene's normal function. For example, a gene
of interest can be modified to be expressed at will by the experimenter (Figure
8-52). Finally, genes can also be engineered so that they are expressed normally
in most cell types and tissues but deleted in certain cell types or tissues selected
by the experimenter (see Figure 5-66). This approach is especially useful when a
gene has different roles in different tissues.

It is also possible to make subtler changes to a gene. It is sometimes useful to
make slight changes in a protein's structure so that one can begin to dissect which
portions of a protein are important for its function. The activity of an enzyme, for
example, can be studied by changing a single amino acid in its active site. It is
also possible, through genome engineering, to create new types of proteins in an

Figure 8-52 Engineered genes can be turned on and off with small molecules. Here, the DNA-binding portion of a bacterial
protein (the tetracycline, Tet, repressor) has been fused to a portion of a mammalian transcriptional activator and expressed
in cultured mammalian cells. The engineered gene X, present in place of the normal gene, has its usual gene control region
replaced by cis-regulatory sequences recognized by the tetracycline repressor. In the absence of doxycycline (a particularly
stable version of tetracycline), the engineered gene is expressed; in the presence of doxycycline, the gene is turned off because
the drug causes the tetracycline repressor to dissociate from the DNA. This strategy can also be used in mice by incorporating
the engineered genes into the germ line. In many tissues, the gene can be turned on and off simply by adding or removing
doxycycline from the animal’s water. If the tetracycline repressor construct is placed under the control of a tissue-specific gene
control region, the engineered gene will be turned on and off only in that tissue.


animal. For example, a gene can be fused to the gene for a fluorescent protein. 
When this altered gene is introduced into the genome, the protein can be tracked 
in the living organism by monitoring its fluorescence. 

Altered genes can be created in several ways. Perhaps the simplest is to chemically
synthesize the DNA that makes up the gene. In this way, the investigator
can specify any type of variant of the normal gene. It is also possible to construct
altered genes using recombinant DNA technology, as described earlier in this
chapter. Once obtained, altered genes can be introduced into cells in a variety
of ways. DNA can be microinjected into mammalian cells with a glass micropipette
or introduced by a virus that has been engineered to carry foreign genes. In
plant cells, genes are frequently introduced by a technique called particle bombardment:
DNA samples are painted onto tiny gold beads and then literally shot
through the cell wall with a specially modified gun. Electroporation is the method
of choice for introducing DNA into bacteria and some other cells. In this technique,
a brief electric shock renders the cell membrane temporarily permeable,
allowing foreign DNA to enter the cytoplasm.

To be most useful to experimenters, the altered gene, once it is introduced into
a cell, must recombine with the cell's genome so that the normal gene is replaced.
In simple organisms such as bacteria and yeasts, this process occurs with high frequency
using the cell's own homologous recombination machinery, as described
in Chapter 5. In more complex organisms that have elaborate developmental
programs, the procedure is more complicated because the altered gene must be
introduced into the germ line, as we next describe.

Animals and Plants Can Be Genetically Altered

Animals and plants that have been genetically engineered by gene deletion or
gene replacement are called transgenic organisms, and any foreign or modified
genes that are added are called transgenes. We discuss transgenic plants later in
this chapter and, for now, concentrate our discussion on transgenic mice, as enormous
progress has been made in this area. If a DNA molecule carrying a mutated
mouse gene is transferred into a mouse cell, it often inserts into the chromosomes
at random, but methods have been developed to direct the mutant gene to replace
the normal gene by homologous recombination. By exploiting these "gene targeting"
events, any specific gene can be altered or inactivated in a mouse cell by a
direct gene replacement. In the case in which both copies of the gene of interest
are completely inactivated or deleted, the resulting animal is called a "knockout"
mouse. The technique is summarized in Figure 8-53.
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Figure 8-53 Summary of the procedures 
used for making gene replacements 
in mice. In the first step (A), an altered 
version of the gene is introduced into 
cultured ES (embryonic stem) cells. These 
cells are described in detail in Chapter 
22. Only a few ES cells will have their
corresponding normal genes replaced by 
the altered gene through a homologous 
recombination event. These cells can be 
identified by PCR and cultured to produce 
many descendants, each of which carries 
an altered gene in place of one of its two 
normal corresponding genes. In the next 
step of the procedure (B), these altered ES 
cells are injected into a very early mouse 
embryo; the cells are incorporated into the 
growing embryo, and a mouse produced 
by such an embryo will contain some 
somatic cells (indicated by orange) that 
carry the altered gene. Some of these mice 
will also contain germ-line cells that contain 
the altered gene; when bred with a normal 
mouse, some of the progeny of these mice 
will contain one copy of the altered gene in 
all of their cells.

The mice with the transgene in their 
germ line are then bred to produce 
both a male and a female animal, each 
heterozygous for the gene replacement 
(that is, they have one normal and one
mutant copy of the gene). When these two 
mice are mated (not shown), one-fourth of 
their progeny will be homozygous for the 
altered gene. 
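
The one-fourth figure follows from simple Mendelian segregation. As a minimal sketch (the function name `cross` and the allele symbols `+` for the normal copy and `m` for the altered copy are illustrative choices, not notation from the text), the four equally likely allele combinations from two heterozygous parents can be enumerated directly:

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent1: str, parent2: str) -> dict:
    """Return the expected genotype fractions among the progeny.

    Each parent is written as a two-character string of alleles, and each
    parent passes on one of its two alleles with equal probability.
    """
    # Sort each allele pair so that "+m" and "m+" count as the same genotype.
    counts = Counter("".join(sorted(pair)) for pair in product(parent1, parent2))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Both parents are heterozygous: one normal (+) and one altered (m) copy.
offspring = cross("+m", "+m")
for genotype, fraction in sorted(offspring.items()):
    print(genotype, fraction)
```

Under these assumptions, one-quarter of the progeny carry two altered copies, one-half are heterozygous, and one-quarter carry two normal copies.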


The ability to prepare transgenic mice lacking a known normal gene has been
a major advance, and the technique has been used to determine the functions of
many mouse genes (Figure 8-54). If the gene functions in early development, a
knockout mouse will usually die before it reaches adulthood. These lethal defects
can be carefully analyzed to help determine the function of the missing gene.
As described in Chapter 5, an especially useful type of transgenic animal takes
advantage of a site-specific recombination system to excise, and thus disable,
the target gene in a particular place or at a particular time (see Figure 5-66). In this
case, the target gene in embryonic stem (ES) cells is replaced by a fully functional
version of the gene that is flanked by a pair of the short DNA sequences, called lox
sites, that are recognized by the Cre recombinase protein. The transgenic mice that
result are phenotypically normal. They are then mated with transgenic mice that
express the Cre recombinase gene under the control of an inducible promoter. In
the specific cells or tissues in which Cre is switched on, it catalyzes recombination
between the lox sequences, excising a target gene and eliminating its activity
(see Figure 22-5).


The Bacterial CRISPR System Has Been Adapted to Edit 
Genomes in a Wide Variety of Species 


One of the difficulties in making transgenic mice by the procedure just described
is that the introduced DNA molecule (bearing the experimentally altered gene)
often inserts at random in the genome, and many ES cells must therefore be
screened individually to find one that has the "correct" gene replacement.

Creative use of the CRISPR system, discovered in bacteria as a defense against
viruses, has largely solved this problem. As described in Chapter 7, the CRISPR
system uses a guide RNA sequence to target (through complementary base-pairing)
double-stranded DNA, which it then cleaves (see Figure 7-78). The gene coding
for the key component of this system, the bacterial Cas9 protein, has been
transferred into a variety of organisms, where it greatly simplifies the process of
making transgenic organisms (Figure 8-55A and B). The basic strategy is as
follows: Cas9 protein is expressed in ES cells along with a guide RNA designed by the
experimenter to target a particular location on the genome. The Cas9 and guide
RNA associate, the complex is brought to the matching sequence on the genome,
and the Cas9 protein makes a double-strand break. As we saw in Chapter 5,
double-strand breaks are often repaired by homologous recombination; here, the
template chosen by the cell to repair the damage is often the altered gene, which
is introduced to ES cells by the experimenter. In this way, the normal gene can be
selectively damaged by the CRISPR system and replaced at high efficiency by the
experimentally altered gene.
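
Because targeting depends only on base-pairing plus a short adjacent motif (the PAM described in Figure 8-55), candidate target sites can be found by a simple sequence scan. The sketch below is illustrative only. It assumes a 20-nucleotide targeting region and the PAM pattern NGG, which is the motif recognized by the commonly used Cas9 from Streptococcus pyogenes; neither value is given in the text. It scans only the strand supplied, and the function names are hypothetical.

```python
import re


def find_guide_sites(genome: str, guide_len: int = 20, pam: str = "NGG") -> list:
    """Find sites where a guide_len-nucleotide target is immediately followed by the PAM.

    Returns a list of (start_position, target_sequence, pam_sequence) tuples
    for the strand that was passed in.
    """
    genome = genome.upper()
    # Turn the PAM pattern into a regular expression: N stands for any base.
    pam_regex = "".join("[ACGT]" if base == "N" else base for base in pam)
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(rf"(?=([ACGT]{{{guide_len}}})({pam_regex}))")
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(genome)]


def guide_rna_for(target: str) -> str:
    """Targeting portion of the guide RNA: same sequence as the target, with U for T.

    This RNA base-pairs with the complementary DNA strand at the target site.
    """
    return target.upper().replace("T", "U")


# Toy example: one candidate site in a short made-up sequence.
toy_genome = "A" * 20 + "TGG" + "CCC"
sites = find_guide_sites(toy_genome)
for start, target, pam_seq in sites:
    print(start, target, pam_seq, guide_rna_for(target))
```

A real design tool would also scan the opposite strand and check each candidate against the rest of the genome for near-matches; those steps are omitted here.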

The CRISPR system has a variety of other uses. Its particular power lies with its
ability to target Cas9 to thousands of different positions across a genome through
the simple rules of complementary base-pairing. Thus, if a catalytically inactive
Cas9 protein is fused to a transcription activator or repressor, it is possible, in
principle, to turn any gene on or off (Figure 8-55C and D).

Figure 8-54 Transgenic mice engineered
to express a mutant DNA helicase 
show premature aging. The helicase,
encoded by the Xpd gene, is involved
in both transcription and DNA repair.
Compared with a wild-type mouse of the
same age (A), a transgenic mouse that
expresses a defective version of Xpd
(B) exhibits many of the symptoms of
premature aging, including osteoporosis,
emaciation, early graying, infertility, and
reduced life-span. The mutation in Xpd
used here impairs the activity of the
helicase and mimics a mutation that in
humans causes trichothiodystrophy, a
disorder characterized by brittle hair,
skeletal abnormalities, and a very reduced
life expectancy. These results indicate
that an accumulation of DNA damage can 
contribute to the aging process in both 
humans and mice. (From J. de Boer et 
al., Science 296:1276-1279, 2002. With 
permission from AAAS.) 


Figure 8-55 Use of CRISPR to study 
gene function in a wide variety of 
species. (A) The Cas9 protein (artificially 
expressed in the species of interest) binds to 
a guide RNA, designed by the experimenter
and also expressed. The portion of RNA
in light blue is needed for associations
with Cas9; that in dark blue is specified by
the experimenter to match a position on
the genome. The only other requirement 
is that the adjacent genome sequence 
includes a short PAM (protospacer adjacent 
motif) that is needed for Cas9 to cleave. 

As described in Chapter 7, this sequence 
is how the CRISPR system in bacteria
distinguishes its own genome from that of
invading viruses. (B) When directed to make
double-strand breaks, the CRISPR system
greatly improves the ability to replace an
endogenous gene with an experimentally
altered gene since the altered gene is used
to “repair” the double-strand break. (C, D)
By using a mutant form of Cas9 that can no
longer cleave DNA, Cas9 can be used to 
activate a normally dormant gene (C) or turn 
off an actively expressed gene (D). (Adapted 
from P. Mali et al., Nat. Methods 10:957- 
963, 2013. With permission from Macmillan 
Publishers Ltd.)

The CRISPR system has several advantages over other strategies for experimentally
manipulating gene expression. First, it is relatively easy for the experimenter
to design the guide RNA: it simply follows standard base-pairing convention.
Second, the gene to be controlled does not have to be modified; the CRISPR
strategy exploits DNA sequences already present in the genome. Third, numerous
genes can be controlled simultaneously. Cas9 has to be expressed only once,
but many guide RNAs can be expressed in the same cell; this strategy allows the
experimenter to turn on or off a whole set of genes at once.

The export of the CRISPR system from bacteria to virtually all other experimental
organisms (including mice, zebrafish, worms, flies, rice, and wheat) has
revolutionized the study of gene function. Like the earlier discovery of restriction
enzymes, this breakthrough came from scientists studying a fascinating phenomenon
in bacteria without, at first, realizing the enormous impact these discoveries
would have on all aspects of biology.

Large Collections of Engineered Mutations Provide a Tool for 
Examining the Function of Every Gene in an Organism 

Extensive collaborative efforts have produced comprehensive libraries of mutations
in a variety of model organisms, including S. cerevisiae, C. elegans, Drosophila,
Arabidopsis, and even the mouse. The ultimate aim in each case is to produce a
collection of mutant strains in which every gene in the organism has been
systematically deleted or altered in such a way that it can be conditionally disrupted.
Collections of this type provide an invaluable resource for investigating gene function
on a genomic scale. For example, a large collection of mutant organisms can be
screened for a particular phenotype. Like the classic genetic approaches described
earlier, this is one of the most powerful ways to identify the genes responsible for
a particular phenotype. Unlike the classical genetic approach, however, the set of
mutants is “pre-engineered,” so that there is no need to rely on chance events such
as spontaneous mutations or transposon insertions. In addition, each of the
individual mutations within the collection is often engineered to contain a distinct
molecular “barcode” (in the form of a unique DNA sequence) designed to make
identification of the altered gene rapid and routine (Figure 8-56).

Figure 8-56 Making barcoded collections of mutant organisms. A deletion construct for use
in yeast contains DNA sequences (red) homologous to each end of a target gene x, a selectable 
marker gene (blue), and a unique “barcode” sequence approximately 20 nucleotide pairs in
length (green). This DNA is introduced into yeast cells, where it readily replaces the target gene
by homologous recombination. Cells that carry a successful gene replacement are identified by
expression of the selectable marker gene, typically a gene that provides resistance to a drug. By
using a collection of such constructs, each specific for one gene, a library of yeast mutants was
constructed containing a mutant for every gene. Essential genes cannot be studied this way, as
their deletion from the genome causes the cells to die. In this case, the target gene is replaced by a
version of the gene that can be regulated by the experimenter (see Figure 8-52). The gene can then
be turned off and the effect of this can be monitored before the cells die.

Figure 8-57 Genome-wide screens for fitness using a large pool of
barcoded yeast deletion mutants. A large pool of yeast mutants, each
with a different gene deleted and present in equal amounts, is grown under
conditions selected by the experimenter. Some mutants (blue) grow normally,
but others show reduced growth (orange and green) or no growth at all (red).
The fitness of each mutant is experimentally determined in the following way.
After the growth phase is completed, genomic DNA (isolated from the mixture
of strains) is purified and the relative abundance of each mutant is determined
by quantifying the level of the DNA barcode matched to each deletion.

This can be done by sequencing the pooled genomic DNA or hybridizing 
it to microarrays (see Figure 8-64) that contain DNA oligonucleotides 
complementary to each barcode. In this way, the contribution of every gene 
to growth under the specified condition can be rapidly ascertained. This type 
of study has revealed that of the approximately 6000 coding genes in yeast, 
only about 1000 are essential under standard growth conditions. 

In S. cerevisiae, the task of generating a complete set of 6000 mutants, each
missing only one gene, was accomplished several years ago. Because each mutant
strain has an individual barcode sequence embedded in its genome, a large mixture
of engineered strains can be grown under various selective test conditions
(such as nutritional deprivation, a temperature shift, or the presence of various
drugs), and the cells that survive can be rapidly identified by the unique sequence
tags present in their genomes. By assessing how well each mutant in the mixture
fares, one can begin to discern which genes are essential, useful, or irrelevant for
growth under the various conditions (Figure 8-57).
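
The bookkeeping behind such a pooled screen can be sketched in a few lines. The example below is a simplified illustration, not the analysis used in any particular study: it assumes barcode counts (for instance, sequencing read counts) are available for the pool before and after growth, and it scores each strain by the log2 change in its share of the pool. The function name, the strain names, the counts, and the pseudocount are all hypothetical.

```python
import math


def barcode_fitness(before: dict, after: dict, pseudocount: float = 0.5) -> dict:
    """Score each barcoded strain by the log2 change in its relative abundance.

    `before` and `after` map barcode (strain) names to counts. Negative scores
    mean the strain became rarer in the pool during growth.
    """
    total_before = sum(before.values())
    total_after = sum(after.values())
    scores = {}
    for barcode, n_before in before.items():
        n_after = after.get(barcode, 0)
        # The pseudocount avoids taking the logarithm of zero for strains
        # that disappear from the pool entirely.
        freq_before = (n_before + pseudocount) / total_before
        freq_after = (n_after + pseudocount) / total_after
        scores[barcode] = math.log2(freq_after / freq_before)
    return scores


# Made-up counts: all strains start at equal abundance.
counts_before = {"strain_normal": 1000, "strain_slow": 1000, "strain_dead": 1000}
counts_after = {"strain_normal": 4000, "strain_slow": 1000, "strain_dead": 0}

fitness = barcode_fitness(counts_before, counts_after)
for strain, score in sorted(fitness.items(), key=lambda item: item[1]):
    print(f"{strain}: {score:+.2f}")
```

Strains whose deleted gene is needed under the test condition fall to the bottom of the ranking, while strains unaffected by the deletion keep or increase their share of the pool.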

The insights generated by examining mutant libraries can be considerable.
For example, studies of an extensive collection of mutants in Mycoplasma
genitalium (the organism with the smallest known genome) have identified the
minimum complement of genes essential for cellular life. Growth under laboratory
conditions requires about three-quarters of the 480 protein-coding genes in
M. genitalium. Approximately 100 of these essential genes are of unknown function,
which suggests that a surprising number of the basic molecular mechanisms
that underlie life have yet to be discovered.

Collections of mutant organisms are also available for many animal and plant
species. For example, it is possible to “order,” by phone or email from a consortium
of investigators, a deletion or insertion mutant for almost all coding genes in
Drosophila. Likewise, a nearly complete set of mutants exists for the “model” plant
Arabidopsis. And the adaptation of the CRISPR system for use in mice means that,
in the near future, we can expect to be able to turn on or off, at will, each gene
in the mouse genome. Although we are still ignorant about the function of most
genes in most organisms, these technologies allow an exploration of gene function
on a scale that was unimaginable a decade ago.

RNA Interference Is a Simple and Rapid Way to Test Gene
Function

Although knocking out (or conditionally expressing) a gene in an organism and
studying the consequences is the most powerful approach for understanding the
functions of the gene, RNA interference (RNAi, for short) is an alternative,
particularly convenient approach. As discussed in Chapter 7, this method exploits a
natural mechanism used in many plants, animals, and fungi to protect themselves
against viruses and transposable elements. The technique introduces into a cell or
organism a double-stranded RNA molecule whose nucleotide sequence matches
that of part of the gene to be inactivated. After the RNA is processed, it
hybridizes with the target-gene RNA (either mRNA or noncoding RNA) and reduces its
expression by the mechanisms shown in Figure 7-75.

RNAi is frequently used to inactivate genes in Drosophila and mammalian cell
culture lines. Indeed, a set of 15,000 Drosophila RNAi molecules (one for every
coding gene) allows researchers, in several months, to test the role of every fly
gene in any process that can be monitored using cultured cells. RNAi has also been
widely used to study gene function in whole organisms, including the nematode

Figure 8-58 Gene function can be tested by RNA interference. (A) Double-stranded RNA
(dsRNA) can be introduced into C. elegans by (1) feeding the worms E. coli that express the dsRNA
or (2) injecting the dsRNA directly into the animal's gut. (B) In a wild-type worm embryo, the egg and
sperm pronuclei (red arrowheads) come together in the posterior half of the embryo shortly after
fertilization. (C) In an embryo in which a particular gene has been inactivated by RNAi, the pronuclei
fail to migrate. This experiment revealed an important but previously unknown function of this gene
in embryonic development. (B and C, from P. Gönczy et al., Nature 408:331-336, 2000. With
permission from Macmillan Publishers Ltd.)





C. elegans. When working with worms, introducing the double-stranded RNA is 
quite simple: the RNA can be injected directly into the intestine of the animal, or 
the worm can be fed with E. coli engineered to produce the RNA (Figure 8-58). The 
RNA is amplified (see p. 431) and distributed throughout the body of the worm, 
where it inhibits expression of the target gene in different tissue types. RNAi is 
being used to help in assigning functions to the entire complement of worm genes
(Figure 8-59). 

A related technique has also been applied to mice. In this case, the RNAi molecules
are not injected or fed to the mouse; rather, recombinant DNA techniques
are used to make transgenic animals that express the RNAi under the control of an
inducible promoter. Often this is a specially designed RNA that can fold back on
itself and, through base-pairing, produce a double-stranded region that is
recognized by the RNAi machinery. In the simplest cases, the process inactivates only
the genes that exactly match the RNAi sequence. Depending on the inducible
promoter used, the RNAi can be produced only in a specified tissue or only at a
particular time in development, allowing the functions of the target genes to be
analyzed in elaborate detail.

RNAi has made reverse genetics simple and efficient in many organisms, but
it has several potential limitations compared with true genetic knockouts. For

Figure 8-59 RNA interference provides
a convenient method for conducting 
genome-wide genetic screens. In this 
experiment, each well in this 96-well 
plate is filled with E. coli that produce
a different double-stranded RNA. Each
interfering RNA matches the nucleotide
sequence of a single C. elegans gene, 
thereby inactivating it. About 10 worms 
are added to each well, where they ingest 
the genetically modified bacteria. The plate 
is incubated for several days, which gives 
the RNAs time to inactivate their target 
genes—and the worms time to grow, mate, 
and produce offspring. The plate is then 
examined in a microscope, which can be 
controlled robotically, to screen for genes 
that affect the worms’ ability to survive, 
reproduce, develop, and behave. Shown 
here are normal worms alongside worms 
that show an impaired ability to reproduce 
due to inactivation of a particular “fertility” 
gene. (From B. Lehner et al., Nat. Genet. 
38:896-903, 2006. With permission from 
Macmillan Publishers Ltd.)

unknown reasons, RNAi does not efficiently inactivate all genes. Moreover, within
whole organisms, certain tissues may be resistant to the action of RNAi (for example,
neurons in nematodes). Another problem arises because many organisms
contain large gene families, the members of which exhibit sequence similarity.
RNAi therefore sometimes produces “off-target” effects, inactivating related genes
in addition to the targeted gene. One strategy to avoid such problems is to use
multiple small RNA molecules matched to different regions of the same gene.
Ultimately, the results of any RNAi experiment must be viewed as a strong clue to,
but not necessarily a proof of, normal gene function.
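
The off-target problem can be illustrated with a simple sequence check. The sketch below is a toy model under stated assumptions: sequences are written as DNA, a site counts as a potential target if it matches the interfering RNA's target sequence with no more than a chosen number of mismatches, and all gene names and sequences are invented for the example. Real off-target behavior depends on more than sequence identity.

```python
def count_mismatches(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    return sum(1 for x, y in zip(a, b) if x != y)


def off_target_hits(target_site: str, transcripts: dict, intended: str,
                    max_mismatches: int = 0) -> list:
    """Return names of transcripts, other than the intended one, that contain
    a site matching target_site with at most max_mismatches mismatches."""
    hits = []
    site_len = len(target_site)
    for name, sequence in transcripts.items():
        if name == intended:
            continue
        # Slide a window along the transcript and compare it with the site.
        for start in range(len(sequence) - site_len + 1):
            window = sequence[start:start + site_len]
            if count_mismatches(window, target_site) <= max_mismatches:
                hits.append(name)
                break
    return hits


# Invented example: geneA and its paralog share the targeted 21-nucleotide site.
site = "ATGGCTAGCTAGGATCCGATC"
example_transcripts = {
    "geneA": "CCCC" + site + "GGGG",
    "geneA_paralog": "TTTT" + site + "AAAA",
    "geneB": "ACGT" * 10,
}
hits = off_target_hits(site, example_transcripts, intended="geneA")
print(hits)
```

Running the same check for several sites chosen from different regions of the intended gene mirrors the strategy described above: a related gene is unlikely to match all of them.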

Reporter Genes Reveal When and Where a Gene Is Expressed 

In the preceding section, we discussed how genetic approaches can be used to
assess a gene's function in cultured cells or, even better, in the intact organism.
Although this information is crucial to understanding gene function, it does not
generally reveal the molecular mechanisms through which the gene product
works in the cell. For example, genetics on its own rarely tells us all the places
in the organism where the gene is expressed, or how its expression is controlled.
It does not necessarily reveal whether the gene acts in the nucleus, the cytosol,
on the cell surface, or in one of the numerous other compartments of the cell.
And it does not reveal how a gene product might change its location or its expression
pattern when the external environment of the cell changes. Key insights into
gene function can be obtained by simply observing when and where a gene is
expressed. A variety of approaches, most involving some form of genetic engineering,
can easily provide this critical information.

As discussed in detail in Chapter 7, cis-regulatory DNA sequences, located
upstream or downstream of the coding region, control gene transcription. These
regulatory sequences, which determine precisely when and where the gene is
expressed, can be easily studied by placing a reporter gene under their control
and introducing these recombinant DNA molecules into cells (Figure 8-60). In
this way, the normal expression pattern of a gene can be determined, as well as

Figure 8-60 Using a reporter protein
to determine the pattern of a gene's
expression. (A) In this example, the coding 
sequence for protein X is replaced by 
the coding sequence for reporter protein 
Y. The expression patterns for X and Y 
are the same. (B) Various fragments of 
DNA containing candidate cis-regulatory
sequences are added in combinations to
produce test DNA molecules encoding
reporter gene Y. These recombinant DNA
molecules are then tested for expression
after introducing them into a variety of 
different types of mammalian cells. The 
results are summarized in (C). 

For experiments in eukaryotic cells, two 
commonly used reporter proteins are the 
enzyme β-galactosidase (β-gal) (see Figure
7-28) and green fluorescent protein (GFP) 
(see Figure 9-22).

the contribution of individual cis-regulatory sequences in establishing this pattern
(see also Figure 7-29).

Reporter genes also allow any protein to be tracked over time in living cells.
Here, the reporter gene typically encodes a fluorescent protein, often green
fluorescent protein (GFP), the molecule that gives luminescent jellyfish their
greenish glow. The GFP is simply attached, in the coding frame, to the protein-coding
gene of interest. The resulting GFP fusion protein often behaves in the same
way the normal protein does and its location can be monitored by fluorescence
microscopy, a topic that is discussed in the next chapter (see Figure 9-25). GFP
fusion has become a standard strategy for tracking not only the location but also
the movement of specific proteins in living cells. In addition, the use of multiple
GFP variants that fluoresce at different wavelengths can provide insights into how
different cells interact in a living tissue (Figure 8-61).

In situ Hybridization Can Reveal the Location of mRNAs and 
Noncoding RNAs 

It is also possible to directly observe the time and place that an RNA product of a
gene is expressed using in situ hybridization. For protein-coding genes, this strategy
often provides the same general information as the reporter gene approaches
described above; however, it is crucial for genes whose final product is RNA rather
than protein. We encountered in situ hybridization earlier in the chapter (see
Figure 8-34); it relies on the basic principles of nucleic acid hybridization. Typically,
tissues are gently fixed so that their RNA is retained in an exposed form that can
hybridize with a labeled complementary DNA or RNA probe. In this way, the patterns
of differential gene expression can be observed in tissues, and the location
of specific RNAs can be determined (Figure 8-62). An advantage of in situ hybridization
over other approaches is that genetic engineering is not required. Thus,
it is often simpler and faster and can be used for genetically intractable species.

Expression of Individual Genes Can Be Measured Using 
Quantitative RT-PCR 

Although reporter genes and in situ hybridization accurately reveal patterns of
gene expression, they are not the most powerful methods for quantifying amounts
of individual RNAs in cells. We have seen that RNA sequencing can provide information
about the relative abundance of different RNA molecules (see Figure 7-3).
Here, the number of "sequence reads" (short bits of nucleotide sequence) is
proportional to the abundance of the RNA species. But this method is limited to RNAs


Figure 8-61 GFPs that fluoresce at 
different wavelengths help reveal the 
connections that individual neurons 
make within the brain. This image shows 
differently colored neurons in one region
of a mouse brain. The neurons randomly 
express different combinations of differently 
colored GFPs (see Figure 9-13), making 
it possible to distinguish and trace many 
individual neurons within a population. 
These images were obtained by genetically 
engineering the genes for four different 
fluorescent proteins, each flanked by loxP 
sites of recombination (see Figure 5-66), 
and integrating them into the mouse 
germ line. When crossed to a mouse that 
produced the Cre recombinase in neuronal 
cells, the fluorescent protein genes were 
randomly excised, producing neurons that 
express many different combinations of 
the four fluorescent proteins. Over 100 
combinations of fluorescent protein can be 
produced, allowing scientists to distinguish 
one neuron from the next. The stunning 
appearance of these labeled neurons has 
earned these animals the colorful nickname
“brainbow mice.” (From J. Livet et al., 
Nature 450:56-62, 2007. With permission 
from Macmillan Publishers Ltd.) 





Figure 8-62 In situ hybridization to 
mRNAs has been used to generate an 
atlas of gene expression in the mouse 
brain. This computer-generated image 
shows the expression of several different 
mRNAs specific to an area of the brain
associated with learning and memory.
Similar maps of expression patterns of
all known genes in the mouse brain are
compiled in the brain atlas project, which is 
available online. (From M. Hawrylycz et al., 
PLoS Comput. Biol. 7:e1001065, 2011.)

Figure 8-63 RNA levels can be measured by quantitative RT-PCR. The
fluorescence measured is generated by a dye that fluoresces only when
bound to the double-stranded DNA products of the RT-PCR (see Figure 
8-36). The red sample has a higher concentration of the mRNA being 
measured than does the blue sample, since it requires fewer PCR cycles to 
reach the same half-maximal concentration of double-stranded DNA. Based 
on this difference, the relative amounts of the mRNA in the two samples can
be precisely determined. 

that are expressed at reasonably high levels, and it is difficult to quantify (or even
identify) rare RNAs. A more accurate method is based on the principles of PCR
(Figure 8-63). Called quantitative RT-PCR (reverse transcription-polymerase
chain reaction), this method begins with the total population of RNA molecules
purified from a tissue or a cell culture. It is important that no DNA be present in
the preparation; it must be purified away or enzymatically degraded. Two DNA
primers that specifically match the mRNA of interest are added, along with reverse
transcriptase, DNA polymerase, and the four deoxyribonucleoside triphosphates
needed for DNA synthesis. The first round of synthesis is the reverse transcription
of the RNA into DNA using one of the primers. Next, a series of heating and
cooling cycles allows the amplification of that DNA strand by PCR (see Figure 8-36).
The quantitative part of this method relies on a direct relationship between the
rate at which the PCR product is generated and the original concentration of the
mRNA species of interest. By adding chemical dyes to the PCR that fluoresce only
when bound to double-stranded DNA, a simple fluorescence measurement can
be used to track the progress of the reaction and thereby accurately deduce the
starting concentration of the mRNA that is amplified. Although it seems
complicated, this quantitative RT-PCR technique is relatively fast and simple to perform
in the laboratory; it is currently the method of choice for accurately quantifying
mRNA levels from any given gene.
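
The arithmetic linking cycle number to starting concentration can be made concrete. The sketch below is an idealized model rather than a laboratory protocol: it assumes the amount of product is multiplied by a fixed factor each cycle (exactly doubled when the efficiency is 1.0) and that both samples are read at the same fluorescence level. The function name and the cycle values are hypothetical.

```python
def relative_abundance(cycles_sample_a: float, cycles_sample_b: float,
                       efficiency: float = 1.0) -> float:
    """Estimate how many times more template sample A contained than sample B.

    cycles_sample_a and cycles_sample_b are the numbers of PCR cycles each
    sample needed to reach the same fluorescence level. With efficiency 1.0
    the product doubles every cycle, so each cycle of head start corresponds
    to a twofold difference in starting material.
    """
    amplification_per_cycle = 1.0 + efficiency
    return amplification_per_cycle ** (cycles_sample_b - cycles_sample_a)


# Hypothetical readings: sample A crosses the threshold 3 cycles earlier.
fold = relative_abundance(cycles_sample_a=20, cycles_sample_b=23)
print(f"Sample A started with about {fold:.0f} times more of the mRNA.")
```

Because the estimate grows exponentially with the cycle difference, small errors in the assumed efficiency matter more as the difference between the two samples increases.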

Analysis of mRNAs by Microarray or RNA-seq Provides a 
Snapshot of Gene Expression 

As discussed in Chapter 7, a cell expresses only a subset of the many thousands 
of genes available in its genome; moreover, this subset differs from one cell type 
to another or, in the same cell, from one environment to the next. One way to 
détermine which genes are being expressed by a population of cells or a tissue is 
to analyze which mRNAs are being produced. 

The first tool that allowed investigators to analyze simultaneously the thousands
of different RNAs produced by cells or tissues was the DNA microarray.
Developed in the 1990s, DNA microarrays are glass microscope slides that contain
hundreds of thousands of DNA fragments, each of which serves as a probe for the
mRNA produced by a specific gene. Such microarrays allow investigators to monitor
the expression of every gene in a genome in a single experiment. To do the
analysis, mRNAs are extracted from cells or tissues and converted to cDNAs (see
Figure 8-31). The cDNAs are fluorescently labeled and allowed to hybridize to the
fragments bound to the microarray. An automated fluorescence microscope then
determines which mRNAs were present in the original sample based on the array
positions to which the cDNAs are bound (Figure 8-64).

Although microarrays are relatively inexpensive and easy to use, they suffer
from one obvious drawback: the sequences of the mRNA samples to be analyzed
must be known in advance and represented by a corresponding probe on the
array. With the development of improved sequencing technologies, investigators
increasingly use RNA-seq, discussed earlier, as a more direct approach for
cataloging the RNAs produced by a cell. For example, this approach can readily detect
alternative RNA splicing, RNA editing, and the many noncoding RNAs produced
from a complex genome.

DNA microarrays and RNA-seq analysis have been used to examine everything
from the changes in gene expression that make strawberries ripen to the gene
expression "signatures" of different types of human cancer cells; or from changes

Figure 8-64 DNA microarrays are used to analyze the production of
thousands of different mRNAs in a single experiment. In this example,
mRNA is collected from two different cell samples (for example, cells
treated with a hormone and untreated cells of the same type) to allow for
a direct comparison of the specific genes expressed under both conditions.
The mRNAs are converted to cDNAs that are labeled with a red fluorescent 
dye for one sample and a green fluorescent dye for the other. The labeled 
samples are mixed and then allowed to hybridize to the microarray. Each 
microscopic spot on the microarray is a 50-nucleotide DNA molecule of
defined sequence made by chemical synthesis and spotted on the array.

The DNA sequence represented by each spot is different, and the hundreds 
of thousands of such spots are designed to span the sequence of the 
genome. The DNA sequence of each spot is kept track of by computer. After 
incubation, the array is washed and the fluorescence scanned. Only a small 
proportion of the microarray, representing 676 genes, is shown. Red spots 
indicate that the gene in sample 1 is expressed at a higher level than the 
corresponding gene in sample 2, and the green spots indicate the opposite. 
Yellow spots reveal genes that are expressed at about equal levels in both cell 
samples. The intensity of the fluorescence provides an estimate of how much 
RNA is present from a gene. Dark spots indicate little or no expression of the
gene whose probe is located at that position in the array. 
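
The color coding described in this caption amounts to comparing two intensities per spot. The sketch below shows one way such a call could be made; it is an illustration only. The intensity units, the twofold cutoff, the minimum-signal threshold, and the function name are all assumptions chosen for the example, not values from the figure.

```python
import math


def classify_spot(red: float, green: float,
                  min_signal: float = 100.0, fold: float = 2.0) -> str:
    """Assign a color call to one microarray spot from its two intensities.

    red   : fluorescence from sample 1 (red-labeled cDNA)
    green : fluorescence from sample 2 (green-labeled cDNA)
    """
    # Little or no signal in either channel: the gene is not detectably expressed.
    if red < min_signal and green < min_signal:
        return "dark"
    # Work with the log ratio so that up and down changes are symmetric.
    log_ratio = math.log2((red + 1.0) / (green + 1.0))
    cutoff = math.log2(fold)
    if log_ratio >= cutoff:
        return "red"      # higher in sample 1
    if log_ratio <= -cutoff:
        return "green"    # higher in sample 2
    return "yellow"       # about equal in both samples


# Made-up intensities for four spots.
spots = {"gene1": (5000, 500), "gene2": (500, 5000),
         "gene3": (3000, 2800), "gene4": (20, 30)}
calls = {gene: classify_spot(red, green) for gene, (red, green) in spots.items()}
print(calls)
```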


that occur as cells progress through the cell cycle to those made in response to 
sudden shifts in température. Indeed, because these approaches allow the simul- 
taneous monitoring of large numbers of RNAs, they can detect subtle changes in a 
cell, changes that might not be manifested in its outward appearance or behavior. 
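
The color coding of a two-color microarray can be made concrete with a short sketch. The Python below is only an illustration: the function name, the twofold threshold, the intensity floor, and the example intensities are all assumptions chosen for this example and are not taken from any particular experiment.

```python
import math

def classify_spot(red, green, fold=2.0, floor=50.0):
    """Classify one microarray spot from its two fluorescence intensities.

    red, green: background-subtracted intensities for sample 1 and sample 2.
    fold and floor are illustrative thresholds (assumptions for this sketch).
    """
    if red < floor and green < floor:
        return "dark"            # little or no expression in either sample
    # A pseudocount of 1 avoids taking the logarithm of zero.
    log_ratio = math.log2((red + 1.0) / (green + 1.0))
    if log_ratio >= math.log2(fold):
        return "red"             # expressed at a higher level in sample 1
    if log_ratio <= -math.log2(fold):
        return "green"           # expressed at a higher level in sample 2
    return "yellow"              # expressed at about equal levels

# Hypothetical (red, green) intensities for four spots
spots = {"geneA": (4000, 500), "geneB": (300, 2600),
         "geneC": (1500, 1400), "geneD": (10, 20)}
calls = {gene: classify_spot(r, g) for gene, (r, g) in spots.items()}
print(calls)
```

Working with the logarithm of the ratio treats a twofold increase and a twofold decrease symmetrically, which is why it is a common choice for this kind of comparison.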

Comprehensive studies of gene expression also provide information that is
useful for predicting gene function. Earlier in this chapter, we discussed how
identifying a protein's interaction partners can yield clues about that protein's
function. A similar principle holds true for genes: information about a gene's
function can be deduced by identifying genes that share its expression pattern.
Using an approach called cluster analysis, one can identify sets of genes that are
coordinately regulated. Genes that are turned on or turned off together under
different circumstances are likely to work in concert in the cell: they may encode
proteins that are part of the same multiprotein machine, or proteins that are
involved in a complex coordinated activity, such as DNA replication or RNA
splicing. Characterizing a gene whose function is unknown by grouping it with
known genes that share its transcriptional behavior is sometimes called "guilt by
association." Cluster analyses have been used to analyze the gene expression
profiles that underlie many interesting biological processes, including wound
healing in humans (Figure 8-65).
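
The idea of "guilt by association" can be sketched in a few lines of Python. This is a deliberately simplified stand-in for the hierarchical clustering used in published analyses: genes are joined into one group whenever their expression profiles have a Pearson correlation at or above a threshold. The gene names, the expression values, and the 0.9 threshold are hypothetical.

```python
from itertools import combinations
from statistics import mean

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den if den else 0.0

def cluster_by_correlation(profiles, threshold=0.9):
    """Group genes linked, directly or through a chain, by correlation >= threshold."""
    parent = {g: g for g in profiles}

    def find(g):                       # union-find lookup with path compression
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for g1, g2 in combinations(profiles, 2):
        if pearson(profiles[g1], profiles[g2]) >= threshold:
            parent[find(g1)] = find(g2)

    clusters = {}
    for g in profiles:
        clusters.setdefault(find(g), []).append(g)
    return sorted(sorted(c) for c in clusters.values())

# Hypothetical log2 expression changes at five time points
profiles = {
    "known_repair_1": [0.1, 1.0, 2.1, 2.9, 3.0],
    "known_repair_2": [0.0, 0.8, 1.9, 3.1, 3.2],
    "unknown_orf":    [0.2, 1.1, 2.0, 2.8, 3.1],
    "cell_cycle_1":   [0.0, -1.0, -2.2, -2.8, -3.0],
    "cell_cycle_2":   [0.1, -0.9, -2.0, -3.0, -3.1],
}
clusters = cluster_by_correlation(profiles)
print(clusters)
```

In this toy data set the uncharacterized gene falls into the same group as the two genes of known function, which is the kind of observation that would suggest a hypothesis about its role.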
Figure 8-65 Using cluster analysis to identify sets of genes that are
coordinately regulated. Genes that have the same expression pattern are likely
to be involved in common pathways or processes. To perform a cluster analysis,
RNA-seq or microarray data are obtained from cell samples exposed to a variety
of different conditions, and genes that show coordinate changes in their
expression pattern are grouped together. In this experiment, human fibroblasts
were deprived of serum for 48 hours; serum was then added back to the cultures
at time 0 and the cells were harvested for microarray analysis at different time
points. Of the 8600 genes depicted here (each represented by a thin, vertical
line), just over 300 showed threefold or greater variation in their expression
patterns in response to serum reintroduction. Here, red indicates an increase in
expression; green indicates a decrease. On the basis of the results of many other
experiments, the 8600 genes have been grouped in clusters based on similar
patterns of expression. The results of this analysis show that genes involved in
wound healing are turned on in response to serum, while genes involved in
regulating cell-cycle progression and cholesterol biosynthesis are shut down.
(From M.B. Eisen et al., Proc. Natl Acad. Sci. USA 95:14863-14868, 1998. With
permission from National Academy of Sciences.)

Figure 8-66 Chromatin immunoprecipitation. This method allows the
identification of all the sites in a genome that a transcription regulator
occupies in vivo. The identities of the precipitated, amplified DNA fragments
are determined by DNA sequencing.


Genome-wide Chromatin Immunoprecipitation Identifies Sites on
the Genome Occupied by Transcription Regulators

We have discussed several strategies to measure the levels of individual RNAs in a
cell and to monitor changes in their levels in response to external signals. But this
information does not tell us how such changes are brought about. We saw in
Chapter 7 that transcription regulators, by binding to cis-regulatory sequences in
DNA, are responsible for establishing and changing patterns of transcription.
Typically, these proteins do not occupy all of their potential cis-regulatory
sequences in the genome under all conditions. For example, in some cell types, the
regulatory protein may not be expressed, or it may be present but lack an
obligatory partner protein, or it may be excluded from the nucleus until an
appropriate signal is received from the cell's environment. Even if the protein is
present in the nucleus and is competent to bind DNA, other transcription
regulators or components of chromatin can occupy overlapping DNA sequences
and thereby occlude some of its cis-regulatory sequences in the genome.

Chromatin immunoprecipitation provides a way to experimentally determine
all the cis-regulatory sequences in a genome that are occupied by a given
transcription regulator under a particular set of conditions (Figure 8-66). In this
approach, proteins are covalently cross-linked to DNA in living cells, the cells are
broken open, and the DNA is mechanically sheared into small fragments.
Antibodies directed against a given transcription regulator are then used to purify
the DNA that became covalently cross-linked to that protein in the cell. This DNA
is then sequenced using the rapid methods discussed earlier; the precise location
of each precipitated DNA fragment along the genome is determined by comparing
its DNA sequence to that of the whole genome sequence (Figure 8-67). In
this way, all of the sites occupied by the transcription regulator in the cell sample
can be mapped across the cell's genome (see Figure 7-37). In combination with
microarray or RNA-seq information, chromatin immunoprecipitation can identify
the key transcriptional regulator responsible for specifying a particular pattern
of gene expression.
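
The mapping step can be illustrated with a toy example. In the sketch below, short "reads" are located on a made-up reference sequence by exact string matching, the number of reads covering each base is tallied, and stretches of high coverage are reported as occupied sites. Real analyses use dedicated alignment and peak-calling programs; the sequences, function names, and depth threshold here are hypothetical.

```python
def map_reads(genome, reads):
    """Return per-base coverage from exact matches of each read to the genome."""
    coverage = [0] * len(genome)
    for read in reads:
        start = genome.find(read)       # first exact match; -1 if the read is absent
        if start == -1:
            continue
        for i in range(start, start + len(read)):
            coverage[i] += 1
    return coverage

def call_peaks(coverage, min_depth):
    """Return half-open (start, end) intervals where coverage >= min_depth."""
    peaks, start = [], None
    for i, depth in enumerate(coverage):
        if depth >= min_depth and start is None:
            start = i
        elif depth < min_depth and start is not None:
            peaks.append((start, i))
            start = None
    if start is not None:
        peaks.append((start, len(coverage)))
    return peaks

# Hypothetical reference sequence and sequenced fragments
genome = "ACGTTGCATGCCGATTAGGCTAACGTTAGCCATGCAAT"
reads = ["GCCGATTAGG", "CGATTAGGCT", "CCGATTAGGC", "AACGTTAGCC"]
coverage = map_reads(genome, reads)
peaks = call_peaks(coverage, min_depth=3)
print(peaks)
```

Three of the four reads overlap the same stretch of sequence, so that stretch stands out above the single read that maps elsewhere; the same logic, applied genome-wide, yields the kind of occupancy profile shown in Figure 8-67.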

Chromatin immunoprecipitation can also be used to deduce the cis-regulatory
sequences recognized by a given transcription regulator. Here, all the DNA
sequences precipitated by the regulator are lined up (by computer) and features
in common are tabulated to produce the spectrum of cis-regulatory sequences
recognized by the protein (see Figure 7-9A). Chromatin immunoprecipitation is
also used routinely to identify the positions along a genome that are bound by the
various types of modified histones discussed in Chapter 4. In this case, antibodies
specific to the particular histone modification are employed (see Figure 8-67). A
variation of the technique can also be used to map positions of chromosomes that
are in physical proximity (see Figure 4-48).
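
Tabulating the features that precipitated sequences have in common amounts to counting, position by position, how often each nucleotide appears. The following sketch assumes the sequences have already been aligned and trimmed to the same length; the seven-nucleotide sites are invented for illustration.

```python
from collections import Counter

def position_frequency_matrix(sites):
    """Tabulate base counts at each position of equal-length, aligned sites."""
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("sites must be aligned to the same length")
    return [Counter(site[i] for site in sites) for i in range(length)]

def consensus(pfm):
    """Most frequent base at each position (ties broken alphabetically)."""
    return "".join(min(col, key=lambda b: (-col[b], b)) for col in pfm)

# Hypothetical aligned binding sites recovered from precipitated DNA
sites = ["TGACTCA", "TGAGTCA", "TGACTCA", "TTACTCA", "TGACTAA"]
pfm = position_frequency_matrix(sites)
motif = consensus(pfm)
print(motif)
for i, col in enumerate(pfm, start=1):
    print(i, dict(col))
```

The per-position counts, rather than the consensus alone, capture the spectrum of sequences the protein tolerates, which is the information displayed in a sequence logo.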
Ribosome Profiling Reveals Which mRNAs Are Being Translated in
the Cell

In preceding sections, we discussed several ways that RNA levels in the cell can be
monitored. But for mRNAs, this represents only one step in gene expression, and
we are often more interested in the final level of the protein produced by the gene.
As described in the first part of this chapter, mass-spectrometry methods can be
used to monitor the levels of all proteins in the cell, including modified forms of
the proteins. However, if we want to understand how synthesis of proteins is
controlled by the cell, we need to consider the translation step of gene expression.

An approach called ribosome profiling provides an instantaneous map of the
position of ribosomes on each mRNA in the cell and thereby identifies those
mRNAs that are being actively translated. To accomplish this, total RNA from a
cell line or tissue is exposed to RNAses under conditions where only those RNA
sequences covered by ribosomes are spared. The protected RNAs are released
from ribosomes, converted to DNA, and the nucleotide sequence of each is
determined (Figure 8-68). When these sequences are mapped on the genome, the
position of ribosomes across each mRNA species can be ascertained.

Ribosome profiling has revealed many cases where mRNAs are abundant but
are not translated until the cell receives an external signal. It has also shown that
many open reading frames (ORFs) that were too short to be annotated as genes
are actively translated and probably encode functional, albeit very small, proteins
(Figure 8-69). Finally, ribosome profiling has revealed the ways that cells rapidly
and globally change their translation patterns in response to sudden changes in
temperature, nutrient availability, or chemical stress.
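
One simple way to spot mRNAs that are abundant but poorly translated is to compare, gene by gene, the number of ribosome-footprint reads with the number of mRNA reads. The sketch below computes this ratio, often called translation efficiency. The counts, gene names, and cutoffs are hypothetical, and the sketch assumes the counts have already been normalized for gene length and sequencing depth.

```python
def translation_efficiency(footprints, mrna, min_mrna=10):
    """Ratio of ribosome-footprint reads to mRNA reads for each gene.

    Both inputs map gene name -> normalized read count (normalization is
    assumed to have been done elsewhere). Genes with fewer than min_mrna
    mRNA reads are skipped because their ratios would be unreliable.
    """
    return {gene: footprints.get(gene, 0) / count
            for gene, count in mrna.items() if count >= min_mrna}

# Hypothetical normalized read counts
mrna       = {"geneA": 800, "geneB": 750, "geneC": 40, "geneD": 3}
footprints = {"geneA": 820, "geneB": 15,  "geneC": 90}

te = translation_efficiency(footprints, mrna)
# Abundant mRNAs with very few footprints are candidates for translational control.
poised = [g for g, ratio in te.items() if mrna[g] >= 500 and ratio < 0.1]
print(te)
print(poised)
```

Here geneA and geneB have similar mRNA levels, but only geneA is covered by ribosomes; an RNA-seq experiment alone would not distinguish them.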


Figure 8-67 Results of several chromatin immunoprecipitations showing
proteins bound to the control region that controls expression of the Oct4 gene.
In this series of chromatin immunoprecipitation experiments, antibodies directed
against a transcription regulator (first three panels) or a particular histone
modification (fourth panel) were used to precipitate bound, cross-linked DNA.
Precipitated DNA was sequenced, and the positions across the genome were
mapped. (Only the small part of the mouse genome containing the Oct4 gene is
shown.) The results show that, in the embryonic stem cells analyzed in these
experiments, Oct4 binds upstream of its own gene and that Sox2 and Nanog are
bound in close proximity. Oct4, Sox2, and Nanog are key regulators in embryonic
stem cells (discussed in Chapter 22) and this experiment reveals the position on
the genome through which they exert their effects on Oct4 expression. In the
fourth panel, the positions of a histone modification associated with actively
transcribed genes are shown (see Figure 4-39). Finally, the bottom panel shows
the RNA produced from the Oct4 gene under the same conditions used for the
chromatin immunoprecipitations. Note that the introns and exons are relatively
easy to identify from these RNA-seq data.


Recombinant DNA Methods Have Revolutionized Human Health

We have seen that nucleic acid methodologies developed in the past 40 years have
completely changed the way that cell and molecular biology is studied. But they
have also had a profound effect on our day-to-day lives. Many human
pharmaceuticals in routine use (insulin, human growth hormone, blood-clotting
factors, and interferon, for example) are based on cloning human genes and
expressing the encoded proteins in large amounts. As DNA sequencing continues
to drop in cost, more and more individuals will elect to have their genome
sequenced; this information can be used to predict susceptibility to diseases
(often with the option of minimizing this possibility by appropriate behavior) or to
predict the way an individual will respond to a given drug. The genomes of tumor
cells from an individual can be sequenced to determine the best type of anticancer
treatment. And mutations that cause or greatly increase the risk of disease
continue to be identified at an unprecedented pace. Using the recombinant DNA
technologies discussed in this chapter, these mutations can then be introduced
into animals, such as mice, that can be studied in the laboratory. The resulting
transgenic animals, which often mimic some of the phenotypic abnormalities
associated with the condition in patients, can be used to explore the cellular and
molecular basis of the disease and to screen for drugs that could potentially be
used therapeutically in humans.

Figure 8-68 Ribosome profiling. RNA is purified from cells and digested with
an RNAse to leave only those portions of the mRNAs that are protected by a
bound ribosome. These short pieces of protected RNA (approximately 20
nucleotides in length) are converted to DNA and sequenced. The resulting
information is displayed as the number of sequence reads along each position of
the genome. In the diagram here, the data for only one gene, whose mRNA is
being efficiently translated, are shown. Ribosome profiling provides this type of
information for every mRNA produced by the cell.

Transgenic Plants Are Important for Agriculture

Although we tend to think of recombinant DNA research in terms of animal
biology, these techniques have also had a profound impact on the study of plants.
In fact, certain features of plants make them especially amenable to recombinant
DNA methods.

When a piece of plant tissue is cultured in a sterile medium containing nutrients
and appropriate growth regulators, some of the cells are stimulated to proliferate
indefinitely in a disorganized manner, producing a mass of relatively
undifferentiated cells called a callus. If the nutrients and growth regulators are
carefully manipulated, one can induce the formation of a shoot within the callus,
and in many species a whole new plant can be regenerated from such shoots. In a
number of plants, including tobacco, petunia, carrot, potato, and Arabidopsis, a
single cell from such a callus (known as a totipotent cell) can be grown into a small
clump of cells from which a whole plant can be regenerated (see Figure 7-2B). Just
as mutant mice can be derived by the genetic manipulation of embryonic stem
cells in culture, so transgenic plants can be created from plant cells transfected
with DNA in culture (Figure 8-70).

Figure 8-69 Ribosome profiling can identify new genes. This experiment
shows the discovery of a previously unrecognized gene, one that encodes a
protein of only 20 amino acids. At the top is shown a portion of a viral genome
with two previously annotated genes. Below are the results of a ribosome profiling
experiment, displayed across the same section of the genome, after human cells
were infected with the virus. The results show that the left-hand gene is not
expressed under these conditions, the right-hand gene is expressed at low levels,
and a previously unrecognized gene that lies between them is expressed at high
levels.

Figure 8-70 Transgenic plants can be made using recombinant DNA
techniques optimized for plants. A disc is cut out of a leaf and incubated in
a culture of Agrobacterium that carries a recombinant plasmid with both a
selectable marker and a desired genetically engineered gene. The wounded plant
cells at the edge of the disc release substances that attract the bacteria, which
inject their DNA into the plant cells. Only those plant cells that take up the
appropriate DNA and express the selectable marker gene survive and proliferate
and form a callus. The manipulation of growth factors supplied to the callus
induces it to form shoots, which subsequently root and grow into adult plants
carrying the engineered gene.

The ability to produce transgenic plants has greatly accelerated progress in
many areas of plant cell biology. It has played an important part, for example, in
isolating receptors for growth regulators and in analyzing the mechanisms of
morphogenesis and of gene expression in plants. These techniques have also
opened up many new possibilities in agriculture that could benefit both the farmer
and the consumer. They have made it possible, for example, to modify the ratio of
lipid, starch, and protein in seeds, to impart pest and virus resistance to plants,
and to create modified plants that tolerate extreme habitats such as salt marshes
or water-stressed soil. One variety of rice has been genetically engineered to
produce β-carotene, the precursor of vitamin A. Were it to replace conventional
rice, this "golden rice" (so called because of its faint yellow color) could help to
alleviate severe vitamin A deficiency, which causes blindness in hundreds of
thousands of children in the developing world each year.


Summary 

Genetics and genetic engineering provide powerful tools for understanding the
function of individual genes in cells and organisms. In the classical genetic
approach, random mutagenesis is coupled with screening to identify mutants that
are deficient in a particular biological process. These mutants are then used to
locate and study the genes responsible for that process.

Gene function can also be ascertained by reverse genetic techniques. DNA
engineering methods can be used to alter genes and to re-insert them into a cell's
chromosomes so that they become a permanent part of the genome. If the cell
used for this gene transfer is a fertilized egg (for an animal) or a totipotent plant
cell in culture, transgenic organisms can be produced that express the mutant
gene and pass it on to their progeny. Especially important for cell and molecular
biology is the ability to alter cells and organisms in highly specific ways, allowing
one to discern the effect on the cell or the organism of a designed change in a
single protein or RNA molecule. For example, genomes can be altered so that the
expression of any gene can be switched on or off by the experimenter.

Many of these methods are being expanded to investigate gene function on a
genome-wide scale. The generation of mutant libraries in which every gene in an
organism has been systematically deleted, disrupted, or made controllable by the
experimenter provides invaluable tools for exploring the role of each gene in the
elaborate molecular collaboration that gives rise to life. Technologies such as
RNA-seq and DNA microarrays can monitor the expression of tens of thousands
of genes simultaneously, providing detailed, comprehensive snapshots of the
dynamic patterns of gene expression that underlie complex cell processes.

MATHEMATICAL ANALYSIS OF CELL FUNCTIONS 

Quantitative experiments combined with mathematical theory mark the
beginning of modern science. Galileo, Kepler, Newton, and their contemporaries
did more than set out some rules of mechanics and offer an explanation of the
movements of the planets around the Sun: they showed how a quantitative
mathematical approach could provide a depth and precision of understanding, at
least for physical systems, that had never before been dreamed to be possible.

What is it that gives mathematics this almost magical power to explain the
natural world, and why has mathematics played so much more important a part in
physical sciences than in biology? What do biologists need to know about
mathematics?

Mathematics can be viewed as a tool for deriving logical consequences from
propositions. It differs from ordinary intuitive reasoning in its insistence on
rigorous, accurate logic and the precise treatment of quantitative information. If
the initial propositions are correct, then the deductions drawn from them by
mathematics will be true. The surprising power of mathematics comes from the
length of the chains of reasoning that rigorous logic and mathematical arguments
make possible, and from the unexpectedness of the conclusions that can be
reached, often revealing connections that one would not otherwise have guessed
at. Reversing the argument, mathematics provides a way to test experimental
hypotheses: if mathematical reasoning from a given hypothesis leads to a
prediction that is not true, then the hypothesis is not true.

Clearly, mathematics is not much use unless we can frame our ideas (our
initial hypotheses) about the given system in a precise, quantitative form. A
mathematical edifice raised on a rickety or, even worse, a vague or
overcomplicated set of propositions is likely to lead us astray. For mathematics to
be useful, we must focus our analysis on simple subsystems in which we can pick
out key quantitative parameters and frame well-defined hypotheses. This
approach has been used with great success in physics for centuries, but it has been
less common in biology. But times are changing, and more and more it is becoming
possible for biologists to exploit the power of quantitative mathematical analysis.

In this final section of our methods chapter, we do not attempt to teach
readers every way in which mathematics can be fruitfully applied to biological
problems. Rather, we simply aim to give a sense of what mathematics and
quantitative approaches can do for us in modern biology. We focus primarily on the
important principles that mathematics teaches us about the dynamics of molecular
interactions, and how mathematics can unveil surprising and useful features of
complex systems containing feedback. We will illustrate these principles using the
regulation of gene expression by transcription regulators like those discussed in
Chapter 7. The same principles apply to the post-transcriptional regulatory
systems that govern cell signaling (Chapter 15), cell-cycle control (Chapter 17),
and essentially all cell processes.

Regulatory Networks Depend on Molecular Interactions

Cell function and regulation depend on transient interactions among thousands
of different macromolecules in the cell. We often summarize these interactions in
this book with schematic cartoons. These diagrams are useful, but a complete
picture requires a deeper, more quantitative level of understanding. To
meaningfully assess the biological impact of any interaction in the cell, we need to
know in precise terms how the molecules interact, how they catalyze reactions,
and, most importantly, how the behaviors of the molecules change over time. If a
cartoon shows that protein A activates protein B, for example, we cannot judge the
importance of this relationship without quantitative details about the
concentrations, affinities, and kinetic behaviors of proteins A and B.

Let us begin by defining two different types of regulatory interaction in our
cartoons: one designating inhibition and the other designating activation. If the
protein product of gene X is a transcription repressor that inhibits the expression
of gene Z, we depict the relationship as a red bar-headed line (⊣) drawn between
genes X and Z (Figure 8-71). If the protein product of gene Y is a transcription
activator that induces the expression of gene Z, then a green arrow (→) is drawn
between genes Y and Z.

The regulation of one gene's expression by another is more complicated than
a single arrow connecting them, and a complete understanding of this regulation
requires that we tease apart the underlying biochemical processes. Figure 8-72A
sketches some of the biochemical steps in the activation of gene expression by a
transcription activator. A gene encoding the activator, designated as gene A, will
produce its product, protein A, via an RNA intermediate. This protein A will then
bind to p_X, the regulatory promoter of gene X, to form the complex A:p_X. Once
the A:p_X complex forms, it stimulates the production of an RNA transcript that is
subsequently translated to produce protein X.

We will focus here on the binding interaction that lies at the heart of this
regulatory system: the interaction between protein A and the promoter p_X. Any
molecule of protein A that is bound to p_X can also dissociate from it. The steps
represented by the green activation arrow in Figure 8-72A include both the
binding of A to p_X and the dissociation of the complex A:p_X to re-form A and
p_X, as illustrated by the notation in Figure 8-72B. This reaction notation is more
informative than the diagrams in our figures, but has its own limitations. Suppose
that the concentration of A increases by a factor of ten as a response to an
environmental input. If A increases, we intuitively know that A:p_X should
increase too, but we cannot determine the amount of the increase without
additional information. We need to know the affinity of the binding interaction
and the concentrations of the components. With this information in hand, we can
rigorously derive the answer.

Figure 8-71 Diagrams that summarize biochemical relationships. Here, a simple
cartoon indicates that gene X represses gene Z (left) whereas gene Y activates
gene Z (right).

(C) At steady state:

\[
[A{:}p_X] = \frac{k_{\text{on}}}{k_{\text{off}}}\,[A][p_X] = K[A][p_X]
\qquad \text{(Equation 8-1)}
\]

(D)

\[
[p_X^T] = [p_X] + [A{:}p_X]
\]

Substituting [p_X] from the above equation into Equation 8-1 yields:

\[
[A{:}p_X] = K[A]\left([p_X^T] - [A{:}p_X]\right)
\]
\[
[A{:}p_X]\,(1 + K[A]) = K[A]\,[p_X^T]
\]
\[
[A{:}p_X] = \frac{K[A]\,[p_X^T]}{1 + K[A]}
\qquad \text{(Equation 8-2)}
\]

(E)

\[
\text{bound fraction} = \frac{[A{:}p_X]}{[p_X^T]} = \frac{K[A]}{1 + K[A]}
\qquad \text{(Equation 8-3)}
\]

Figure 8-72 A simple transcriptional interaction. (A) Genes A and X each
produce a protein, with the product of gene A serving as a transcription activator
to stimulate expression of gene X. As indicated by the green arrow, stimulation
depends in part on the binding of protein A to the promoter region of gene X,
designated as p_X. (B) The binding of protein A to the gene promoter is
determined by the concentrations of the two binding partners (denoted as [A] and
[p_X], in units of mol/liter, or M), the association rate constant k_on (in units of
sec^-1 M^-1), and the dissociation rate constant k_off (in units of sec^-1). (C) At
steady state, the rates of association and dissociation are equal, and the
concentration of the bound complex is determined by Equation 8-1, in which the
two rate constants are combined in the equilibrium constant K. (D) Equation 8-2
can be derived to calculate the steady-state concentration of bound complex at a
known total concentration of the promoter [p_X^T]. (E) Rearrangement of
Equation 8-2 yields Equation 8-3, which allows calculation of the fraction of
promoter p_X that is occupied by protein A.

As discussed earlier and in Chapter 3 (see Figure 3-44), we know that the
formation of a complex between two binding partners, such as A and p_X, depends
on a rate constant k_on, which describes how many productive collisions occur per
unit time per protein at a given concentration of p_X. The rate of complex
formation equals the product of this rate constant k_on and the concentrations of
A and p_X (see Figure 8-72B). Complex dissociation occurs at a rate k_off
multiplied by the concentration of the complex. The rate constant k_off can differ
by orders of magnitude for different DNA sequences because it depends on the
strength of the noncovalent bonds formed between A and p_X.

We are primarily interested in understanding the amount of bound promoter
complex at equilibrium or steady state, where the rate of complex formation
equals the rate of complex dissociation. Under these conditions, the concentration
of the promoter complex is specified by a simple equation that combines the
two rate constants into a single equilibrium constant K = k_on/k_off (Equation 8-1;
Figure 8-72C). K is sometimes called the association constant, K_a. The larger this
constant K, the stronger the interaction between A and p_X (see Figure 3-44). The
reciprocal of K is the dissociation constant, K_d.

To calculate the steady-state concentration of promoter complex using
Equation 8-1, we need to account for another complication: both A and p_X exist in
two forms, free in solution and bound to each other. In most cases, we know the
total concentration of p_X and not the free or bound concentrations, so we must
find a way to use the total concentration in our calculations. To do this, we first
specify that the total concentration of p_X ([p_X^T]) is the sum of the
concentrations of free ([p_X]) and bound ([A:p_X]) forms (Figure 8-72D). This
leads to a new equation that allows us to use [p_X^T] to calculate the steady-state
concentration of the promoter complex ([A:p_X]) (Equation 8-2, Figure 8-72D).

Protein A also exists in two forms: free ([A]) and bound to p_X ([A:p_X]). In a
cell, there are typically one or two copies of p_X (assuming there is only one gene X
per haploid genome) and multiple copies of A. As a result, we can safely assume
that from the viewpoint of A, [A:p_X] is negligible relative to the total [A^T]. This
means that [A] ≈ [A^T], and we can just plug in the values of total [A^T] in
Equation 8-2 without incurring appreciable error in the calculation of [A:p_X].
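
The size of the error introduced by this approximation can be checked directly. The sketch below compares Equation 8-2, with total [A] substituted for free [A], against the exact steady-state solution obtained when the depletion of free A by binding is taken into account. The exact solution is the smaller root of a quadratic equation; the function names are ours, and the numerical values are those used in the worked example in this section.

```python
import math

def complex_exact(K, a_total, p_total):
    """Steady-state [A:pX] without assuming free [A] equals total [A].

    Solves K * (a_total - c) * (p_total - c) = c for the smaller root c,
    written in the numerically stable form of the quadratic formula.
    """
    b = K * (a_total + p_total) + 1.0
    disc = b * b - 4.0 * K * K * a_total * p_total
    return 2.0 * K * a_total * p_total / (b + math.sqrt(disc))

def complex_approx(K, a_total, p_total):
    """Equation 8-2 with free [A] replaced by total [A]."""
    return K * a_total * p_total / (1.0 + K * a_total)

K, a_total, p_total = 1e8, 1e-9, 1e-10     # M^-1, M, M
exact = complex_exact(K, a_total, p_total)
approx = complex_approx(K, a_total, p_total)
rel_error = abs(approx - exact) / exact
print(f"exact = {exact:.3e} M, approx = {approx:.3e} M, error = {rel_error:.1%}")
```

With a tenfold excess of A over promoter the discrepancy is on the order of 1 percent, and it shrinks further as the excess of A grows.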

Now, we are ready to determine the effects of increasing the concentration of
A. Suppose that K = 10^8 M^-1, which is a typical value for many such interactions.
The starting concentration of A is [A^T] = 10^-9 M, and [p_X^T] = 10^-10 M
(assuming there is one copy of gene X in a haploid yeast cell, for example, with a
volume of around 2 × 10^-14 L). Using Equation 8-2, we find that a tenfold increase
in the concentration of A causes the amount of promoter complex [A:p_X] to
increase 5.5-fold, from 0.09 × 10^-10 M to 0.5 × 10^-10 M at steady state. The
effects of a tenfold increase in the concentration of A will vary dramatically
depending on its starting concentration relative to the equilibrium constant. Only
through this mathematical approach can we achieve a thorough understanding of
what these effects will be and what impact they will have on the biological
response.
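
The arithmetic behind this example, and the way the outcome depends on the starting concentration, can be reproduced with a few lines of Python. The function name is ours; the first starting concentration is the one used above, and the other two are additional illustrative values chosen to lie well below and well above 1/K.

```python
def bound_complex(K, a, p_total):
    """Equation 8-2: steady-state concentration of A:pX, in M."""
    return K * a * p_total / (1.0 + K * a)

K = 1e8            # M^-1
p_total = 1e-10    # M

# The case worked through in the text: [A] rises from 1e-9 M to 1e-8 M.
before = bound_complex(K, 1e-9, p_total)
after = bound_complex(K, 1e-8, p_total)
fold = after / before
print(f"{before:.2e} M -> {after:.2e} M ({fold:.1f}-fold)")

# The same tenfold jump from other starting points (illustrative values).
folds = {}
for a_start in (1e-11, 1e-9, 1e-7):
    folds[a_start] = (bound_complex(K, 10 * a_start, p_total)
                      / bound_complex(K, a_start, p_total))
    print(f"start [A] = {a_start:.0e} M: {folds[a_start]:.2f}-fold")
```

When A starts far below 1/K, the promoter is mostly empty and the complex rises almost tenfold; when A starts far above 1/K, the promoter is already nearly saturated and the same tenfold increase changes very little.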

To assess the biological impact of a change in transcription activator levels, it is
also important in many cases to determine the fraction of the target gene promoter
that is bound by the activator, since this number will be directly proportional to
the activity of the gene's promoter. In our case, we can calculate the fraction of the
gene X promoter, p_X, that has protein A bound to it by rearranging Equation 8-2
(Equation 8-3, Figure 8-72E). This fraction can be viewed as the probability that
promoter p_X is occupied, averaged over time. It is also equal to the average
occupancy across a large population of cells at any instant in time. When there is
no protein A present, p_X is always free, the bound fraction is zero, and
transcription is off. When [A] = 1/K, the promoter p_X has a 50% chance of being
occupied. When [A] greatly exceeds 1/K, the bound fraction is almost equal to one,
meaning that p_X is fully occupied and transcription is maximal.
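
These three regimes are easy to see by evaluating Equation 8-3 over a range of activator concentrations. In the sketch below, K is set to the value used earlier in this section, so that 1/K corresponds to 10^-8 M; the function name and the list of concentrations are illustrative choices.

```python
def bound_fraction(K, a):
    """Equation 8-3: fraction of promoter pX occupied by protein A."""
    return K * a / (1.0 + K * a)

K = 1e8   # M^-1, so 1/K = 1e-8 M
for a in (0.0, 1e-9, 1e-8, 1e-7, 1e-6):
    print(f"[A] = {a:.0e} M  ->  bound fraction = {bound_fraction(K, a):.3f}")
```

The bound fraction rises steeply only in the neighborhood of [A] = 1/K; far below or far above that concentration, even large changes in [A] have little effect on promoter occupancy.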

Differential Equations Help Us Predict Transient Behavior

The most important and basic insights for which we, as biologists, depend on
mathematics concern the behavior of regulatory systems over time. This is the
central theme of dynamics, and it was for the solution of problems in dynamics
that the techniques of calculus were developed, by Newton and Leibniz, in the
seventeenth century. Briefly, the general problem is this: if we are given the rates
of change of a set of variables that characterize the system at any instant, how can
we compute its future state? The problem becomes especially interesting, and the
predictions often remarkable, when the rates of change themselves depend on the
values of the state variables, as in systems with feedback.

Let us return to Equation 8-2 (Figure 8-72D), which tells us that when [A]
changes, [A:p_X] at steady state will also change to a new concentration that we
can calculate with precision. However, [A:p_X] does not change instantaneously to
this value. If we hope to understand the behavior of this system in detail, we must
also ask how long it takes [A:p_X] to get to its new steady-state value inside the
cell. Equation 8-2 cannot answer this question. We need calculus.

The most common strategy for solving this problem is to use ordinary
differential equations. The equations that describe biochemical reactions have a
simple premise: the rate of change in the concentration of any molecular species X
(that is, d[X]/dt) is given by the balance of the rate of its appearance with that of
its disappearance. For our example, the rate of change in the concentration of the
bound promoter complex, [A:p_X], is determined by the rates of complex assembly
and disassembly. We can incorporate these rates into the differential equation
shown in Figure 8-73A (Equation 8-4). When [A] changes, Equation 8-4 can be
solved to generate the concentration of [A:p_X] as a function of time. Notice that
when k_on[A][p_X] = k_off[A:p_X], then d[A:p_X]/dt = 0 and [A:p_X] stops
changing. At this point, the system has reached the steady state.

Calculation of all [A:pX] values as a function of time, using Equation 8-4, allows
us to determine the rate at which [A:pX] reaches its steady-state value. Because
this value is attained asymptotically, it is often most useful to compare the times
needed to get to 50, 90, or 99 percent of this new steady state. The simplest way to
determine these values is to solve Equation 8-4 with a method called numerical
integration, which involves plugging in values for all of the parameters (k_on, k_off,
etc.) and then using a computer to determine the values of [A:pX] over time, starting
from given initial concentrations of [A] and [pX]. For k_on = 0.5 × 10^7 sec^-1 M^-1,
k_off = 0.5 × 10^-1 sec^-1 (K = 10^8 M^-1 as above), and [pX] = 10^-10 M, it takes [A:pX]
about 5, 20, and 40 seconds to reach 50, 90, and 99 percent of the new steady-state
value following a sudden tenfold change in [A] (Figure 8-73B). Thus, a sudden
jump in [A] does not have instantaneous effects, as we might have assumed from
looking at the cartoon in Figure 8-72A.
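
The numerical integration described above can be sketched in a few lines of Python. This is a minimal illustration, not the calculation behind Figure 8-73B: it assumes that [A] stays constant after its jump, that total promoter is conserved at 10^-10 M, and, because the text does not give them, that [A] jumps from 10^-9 M to 10^-8 M. The function names and the step size are our own choices.

```python
import math

def steady_state_bound(k_on, k_off, a_conc, p_total):
    """Steady-state [A:pX] from the equilibrium relation, with K = k_on / k_off."""
    K = k_on / k_off
    return p_total * K * a_conc / (1.0 + K * a_conc)

def integrate_bound(k_on, k_off, a_conc, p_total, bound0, dt=0.001, t_end=120.0):
    """Forward-Euler integration of d[A:pX]/dt = k_on[A][pX] - k_off[A:pX]."""
    times, values = [0.0], [bound0]
    bound = bound0
    for step in range(1, int(round(t_end / dt)) + 1):
        free = p_total - bound  # free promoter [pX]; total promoter is conserved
        bound += dt * (k_on * a_conc * free - k_off * bound)
        times.append(step * dt)
        values.append(bound)
    return times, values

def time_to_fraction(times, values, start, final, fraction):
    """First time at which [A:pX] has covered `fraction` of the rise to `final`."""
    target = start + fraction * (final - start)
    for t, v in zip(times, values):
        if v >= target:
            return t
    return None

# Rate constants and promoter concentration from the text; [A] values are assumed.
K_ON, K_OFF, P_TOTAL = 0.5e7, 0.5e-1, 1e-10   # sec^-1 M^-1, sec^-1, M
A_BEFORE, A_AFTER = 1e-9, 1e-8                # M (hypothetical tenfold jump)

start = steady_state_bound(K_ON, K_OFF, A_BEFORE, P_TOTAL)
final = steady_state_bound(K_ON, K_OFF, A_AFTER, P_TOTAL)
times, values = integrate_bound(K_ON, K_OFF, A_AFTER, P_TOTAL, start)

for fraction in (0.5, 0.9, 0.99):
    t = time_to_fraction(times, values, start, final, fraction)
    print(f"{fraction:.0%} of the new steady state after {t:.1f} sec")
```

With these assumed concentrations the three times come out on the same scale of seconds to tens of seconds as the values quoted in the text; the exact numbers depend on the concentrations of A actually used for the figure.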

Differential equations therefore allow us to understand the transient dynamics
of biochemical reactions. This tool is critical for achieving a deep understanding
of cell behavior, in part because it allows us to determine the dependence of the
dynamics inside cells on parameters that are specific to the particular molecules
involved. For example, if we double the values of both k_on and k_off, then
Equation 8-1 (Figure 8-72C) indicates that the steady-state value of [A:pX] does not
change. However, the time it takes to reach 50% of this steady state after a tenfold


$$\frac{d[A{:}p_X]}{dt} = \text{rate of complex formation} - \text{rate of complex dissociation}$$

$$\frac{d[A{:}p_X]}{dt} = k_{on}[A][p_X] - k_{off}[A{:}p_X] \qquad \text{(Equation 8-4)}$$

(A)


Figure 8-73 Using differential equations to study the dynamics and
steady-state behavior of a biological system.
(A) Equation 8-4 is an ordinary differential
equation for calculating the rate of change
in the formation of bound promoter
complex in response to a change in other
components. (B) Formation of [A:pX] after
a tenfold increase in [A], as determined by
solving Equation 8-4. In blue is the solution
corresponding to k_on = 0.5 × 10^7 sec^-1 M^-1
and k_off = 0.5 × 10^-1 sec^-1. In this case, it
takes [A:pX] about 5, 20, and 40 seconds
to reach 50, 90, and 99 percent of the new
steady-state value. For the red curve, the
k_on and k_off values are doubled, and the
system reaches the same steady state
more rapidly.
change in [A] in our example changes from about 5 seconds to 2 seconds (see
Figure 8-73B). These insights are not accessible from either cartoons or equilibrium
equations. This is an unusually simple example; mathematical descriptions such
as differential equations become more indispensable for understanding biological
interactions as the number of interactions increases.
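
The effect of doubling both rate constants can also be seen by hand. As a sketch, assume that [A] is held constant after its jump and that total promoter, written here as [p_X^T] = [p_X] + [A:p_X] (a symbol introduced only for this illustration), is conserved. Equation 8-4 is then linear in [A:p_X] and has an exponential solution:

```latex
\frac{d[A{:}p_X]}{dt} = k_{on}[A]\,[p_X^{T}] - \bigl(k_{on}[A] + k_{off}\bigr)[A{:}p_X]

[A{:}p_X](t) = [A{:}p_X]_{st} + \bigl([A{:}p_X]_{0} - [A{:}p_X]_{st}\bigr)\,
               e^{-(k_{on}[A] + k_{off})\,t}

t_{1/2} = \frac{\ln 2}{k_{on}[A] + k_{off}}
```

Doubling k_on and k_off leaves their ratio K, and therefore the steady state, unchanged, but it doubles the exponent, so every response time is halved. This is consistent with the drop from about 5 seconds to about 2 seconds described in the text.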


Figure 8-74 Effect of protein lifetime
on the timing of the response.
(A) Equations for calculation of the rates of
gene X transcription, protein X production,
and protein X degradation, as explained
in the text. (B) Equation 8-5 is an ordinary
differential equation for calculating the
rate of change in protein X in response to
changes in other components. (C) When
the rate of change in protein X is zero
(steady state), its concentration can be
calculated with Equation 8-6, revealing
a direct relationship with protein lifetime
(τ). (D) The solution of Equation 8-5
specifies the concentration of protein X
over time as it approaches its steady-state
concentration. (E) Response time depends
on protein lifetime. As described in the text,
the time that it takes a protein to reach
a new steady state is greater when the
protein is more stable. Here, the blue line
corresponds to a protein with a lifetime that
is 2.5-fold shorter than the lifetime of the
protein in red.


Both Promoter Activity and Protein Degradation Affect the Rate of
Change of Protein Concentration

To understand our gene regulatory system further, we also need to describe the
dynamics of protein X production in response to changes in the amount of
transcription activator protein A. Here again, we use an ordinary differential equation
for the rate of change of protein X concentration, determined by the balance of
the rate of production of protein X through expression of gene X and the protein's
rate of degradation.

Let us begin with the rate of protein X production, which is determined
primarily by the occupancy of the promoter of gene X by protein A. The binding and
dissociation of a transcription regulator at a promoter generally occurs on a much
faster time scale than transcription initiation, causing many binding and unbinding
events to occur before transcription proceeds. As a result, we can assume that
the binding reaction is at equilibrium on the time scale of transcription, and we
can calculate promoter occupancy by protein A using the equilibrium equation
discussed earlier (Equation 8-3, Figure 8-72E). To determine transcription rate,
we simply multiply the occupied promoter fraction by a transcription rate
constant, β, that represents the binding of RNA polymerase and the subsequent steps
that lead to production of mRNA and protein (Figure 8-74A). If each mRNA
molecule produces, on average, m molecules of protein product, then we can
determine protein production rate by multiplying the transcription rate by m
(Figure 8-74A).

Now let us consider the factors that influence protein X degradation and
its dilution due to cell growth. Degradation generally results in an exponential
decline in protein levels, and the average time required for a specific protein to be
degraded is defined as its mean lifetime, τ. In our current example, the rate of
degradation of protein X depends on its mean lifetime τ_X, which takes into account
active degradation as well as its dilution as the cell grows. The degradation rate
depends on the concentration of protein X and is calculated by dividing this
concentration by the lifetime (Figure 8-74A).

With equations for rates of production and degradation in hand, we can now
generate a differential equation to determine the rate of change of protein X as a
function of time (Equation 8-5, Figure 8-74B). This equation can be solved by the
numerical methods mentioned earlier. According to the solution of this equation,
when transcription begins, the concentration of protein X rises to a steady-state
level at which the concentration of X is not changing anymore; that is, its rate of
change is zero. When this occurs, rearrangement of Equation 8-5 yields an
equation that can be used to determine the steady-state value of X, [X_st] (Equation 8-6,
Figure 8-74C). An important concept emerges from the mathematics: the
steady-state concentration of a gene product is directly proportional to its lifetime. If
lifetime doubles, protein concentration doubles as well.
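
The equations of Figure 8-74 are not reproduced in this text. A reconstruction from the verbal description above (production rate equal to β times m times the bound promoter fraction, degradation rate equal to [X] divided by τ_X) is offered here as a sketch; the figure's exact notation may differ.

```latex
% Rate of change of protein X (the form described for Equation 8-5)
\frac{d[X]}{dt} = \beta\, m\, \frac{K[A]}{1 + K[A]} - \frac{[X]}{\tau_X}

% Setting d[X]/dt = 0 and rearranging (the form described for Equation 8-6)
[X_{st}] = \beta\, m\, \tau_X\, \frac{K[A]}{1 + K[A]}
```

Because τ_X appears as a simple multiplying factor in the second expression, doubling the lifetime doubles [X_st], as stated above.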

The Time Required to Reach Steady State Depends on Protein
Lifetime

We can see from Equation 8-6 (see Figure 8-74C) that when the concentration
of protein A rises, protein X increases to a new steady-state value. But this
cannot happen instantaneously. Instead, X changes dynamically according to the
solution of its differential rate equation (Equation 8-5). The solution of this
equation reveals that the concentration of X over time is related to its steady-state
concentration according to the equation in Figure 8-74D. Once again, mathematics
uncovers a simple but important concept that is not intuitively obvious: following
a sudden increase in [A], [X] rises to a new steady state at an exponential rate that
is inversely related to its lifetime; the faster X is degraded, the less time it takes it to
reach its new steady-state value (Figure 8-74E). The faster response time comes at
a higher metabolic cost, however, since proteins with a rapid response time must
be produced and degraded at a high rate. For proteins that are not rapidly turned
over, the response time is very long, and protein concentration is determined
primarily by the dilution that results from cell growth and division.
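
The dependence of response time on lifetime can be illustrated with a short calculation. The sketch below assumes that the production rate is constant after the jump in [A], in which case the linear rate equation has the standard exponential solution used in `protein_level`. The production rate and the two lifetimes are hypothetical values, chosen only to mirror the 2.5-fold difference mentioned in the legend of Figure 8-74E.

```python
import math

def protein_level(t, production_rate, lifetime, x0=0.0):
    """Solution of d[X]/dt = production_rate - [X]/lifetime with [X](0) = x0."""
    x_st = production_rate * lifetime  # steady state is proportional to lifetime
    return x_st + (x0 - x_st) * math.exp(-t / lifetime)

def half_rise_time(lifetime):
    """Time needed to cover half of the distance to the new steady state."""
    return lifetime * math.log(2)

PRODUCTION = 1.0           # arbitrary concentration units per minute (assumed)
STABLE, UNSTABLE = 50.0, 20.0   # mean lifetimes in minutes (assumed, 2.5-fold apart)

for tau in (STABLE, UNSTABLE):
    t_half = half_rise_time(tau)
    print(f"lifetime {tau:5.1f} min: steady state {PRODUCTION * tau:5.1f}, "
          f"half-rise time {t_half:5.1f} min")
```

The shorter-lived protein reaches its steady state 2.5 times sooner, but its steady-state level is also 2.5 times lower unless production is increased to compensate, which is the metabolic cost noted above.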

Quantitative Methods Are Similar for Transcription Repressors and 
Activators 

Positive control is not the only mechanism that cells use to regulate the
expression of their genes. As we discussed in Chapter 7, cells also actively shut off genes,
often by employing transcription repressor proteins that bind to specific sites on
target genes, thereby blocking access to RNA polymerase. We can analyze the
function of these repressors by the same quantitative methods described above
for transcription activators. If a repressor protein R binds to the regulatory region
of gene X and represses its transcription, then the fraction of gene binding sites
occupied by the repressor is specified by the same equation we used earlier for
the transcription activator (Figure 8-75A). In this case, however, it is only when
the DNA is free that RNA polymerase can bind to the promoter and transcribe the
gene. Thus, the quantity of interest is the unbound fraction, which can be viewed
as the probability that the site is free, averaged over multiple binding and unbinding
events. When the repressor concentration is zero, the unbound fraction is 1
and the promoter is fully active; when the repressor concentration greatly exceeds
1/K, the unbound fraction approaches zero. Figures 8-75B and C compare these
relationships for a transcription activator and a transcription repressor.

We can create a differential equation that provides the rate of change in
protein X when repressor concentrations change (Equation 8-7, Figure 8-75D). As in
the case of the transcription activator, the steady-state concentration of protein
X increases as its lifetime increases, but it decreases as the concentration of the
transcription repressor increases.
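
The relationship between the bound and unbound fractions can be written out in a few lines. This is a sketch under the assumptions already made in the text (a single site, binding at equilibrium); the function names, and the lumping of β and m into one `production` argument, are our own.

```python
def bound_fraction(K, conc):
    """Fraction of promoters occupied by a regulator with association constant K."""
    return K * conc / (1.0 + K * conc)

def unbound_fraction(K, conc):
    """Fraction of promoters left free; this sets gene activity for a repressor."""
    return 1.0 / (1.0 + K * conc)

def steady_state_repressed(production, lifetime, K_R, r_conc):
    """Steady-state [X] when production is scaled by the unbound fraction."""
    return production * lifetime * unbound_fraction(K_R, r_conc)

K_R = 1e8  # M^-1, hypothetical value
for r_conc in (0.0, 1e-9, 1e-8, 1e-7):
    print(f"[R] = {r_conc:.0e} M: unbound fraction "
          f"{unbound_fraction(K_R, r_conc):.3f}")
```

The unbound fraction falls from 1 in the absence of repressor to one-half when [R] equals 1/K_R, and approaches zero when [R] greatly exceeds 1/K_R.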
Figure 8-75 How promoter occupancy
depends on the binding affinity of a
transcription regulator protein. (A) The
fraction of a binding site that is occupied by
a transcription repressor R is determined
by an equation that is similar to the one
we used for a transcription activator (see
Figure 8-72E), except that in the case of a
repressor we are interested primarily in the
unbound fraction. (B) For a transcription
activator A, half of the promoters are
occupied when [A] = 1/K_A. Gene activity is
proportional to this bound fraction. (C) For
a transcription repressor R, gene activity
is proportional to the unbound fraction
of promoters. As indicated, this fraction
is reduced to half of its maximal value
when [R] = 1/K_R. (D) As in the case of the
transcription activator A (see Figure 8-74),
we can derive equations to assess the
timing of protein X production as a function
of repressor concentrations.




Negative Feedback Is a Powerful Strategy in Cell Regulation

Thus far, we have considered simple regulatory systems of just a few components.
In most of the complex regulatory systems that govern cell behaviors, multiple
modules are linked to produce larger circuits that we call network motifs, which
can produce surprisingly complex and biologically useful responses whose
properties become apparent only through mathematical analysis. A particularly
common and important network motif is the negative feedback loop, which can have
dramatically different functions depending on how it is structured.

We take as a first example a network motif consisting of two linked modules
(Figure 8-76A). Here, an input signal initiates the transcription of gene A, which
produces a transcription activator protein A. This activates gene R, which
synthesizes a transcription repressor protein R. Protein R in turn binds to the promoter
of gene A to inhibit its expression. This cyclical organization creates a negative
feedback loop that one can intuitively understand as a mechanism to prevent
proteins from accumulating to high levels. But what can we learn about negative
feedback loops, and their value in biology, by using mathematics to model them?

The negative feedback loop in Figure 8-76A can be modeled using Equation
8-7 (see Figure 8-75D) for the repression of gene A and Equation 8-5 (see Figure
8-74B) for the activation of gene R. Thus, for proteins A and R, we use the set of
differential equations (Equation set 8-8) shown in Figure 8-76B. The two equations
in this set are coupled, which means that they must be solved together to describe

Figure 8-76 A simple negative feedback motif. (A) Gene A negatively
regulates its own expression by activating gene R. The product of gene R is a
transcription repressor that inhibits gene A. (B) Equation set 8-8 can be solved
to determine the dynamics of system components over time. (C) A system with
negative feedback (blue) reaches its steady state faster than a system with
no feedback (red). The plots indicate the levels of protein A, expressed as a
fraction of the steady-state level. The blue line reflects the solution of Equation
set 8-8, which includes negative feedback of gene A by the repressor R. The
red line represents the solution when the rate of synthesis of A was set to a
constant value that is unaffected by the repressor R.
the behavior of A and R over time for any value of the input. As before, we plug in
values for the parameters (β_R, τ_R, etc.) and then use a computer to determine the
values of [A] and [R] as a function of time after a sudden input activates gene A.

The results reveal several important properties of negative feedback. First,
rather surprisingly, negative feedback increases the speed of the response to the
activating input. As shown in Figure 8-76C, the system with negative feedback
reaches its new steady state faster than the system with no feedback.

Second, negative feedback is useful for protecting cells from perturbations
that continuously arise in the cell's internal environment, due either to random
variations in the birth and death of molecules or to fluctuations in environmental
variables such as temperature and nutritional supplies. Let us imagine, for
example, that β_A, the transcription rate constant for gene A, fluctuates by 25% of its
value and ask whether and how much the levels of protein R are affected. The
results, shown in Figure 8-77, reveal that a change in β_A causes a smaller change
in the steady-state value of R when the network has negative feedback.
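
Both properties can be explored with a small simulation of the two-gene loop. The code below is a sketch, not the model behind Figures 8-76 and 8-77: the equations are our reading of the verbal description (gene A repressed by R, gene R activated by A), β and m are lumped into single production constants, and all parameter values are hypothetical, in arbitrary units. They were chosen so that the promoter of gene R is far from saturation; other choices can change the size of the effect.

```python
def simulate_loop(beta_a, feedback, beta_r=4.0, k_a=0.1, k_r=10.0,
                  tau_a=1.0, tau_r=1.0, dt=0.001, t_end=40.0):
    """Forward-Euler solution for [A] and [R], both starting from zero."""
    a = r = 0.0
    times, a_values, r_values = [0.0], [a], [r]
    for step in range(1, int(round(t_end / dt)) + 1):
        # Gene A is active only when its promoter is free of repressor R.
        free_promoter = 1.0 / (1.0 + k_r * r) if feedback else 1.0
        da = beta_a * free_promoter - a / tau_a
        # Gene R is active in proportion to the promoter fraction bound by A.
        dr = beta_r * k_a * a / (1.0 + k_a * a) - r / tau_r
        a, r = a + dt * da, r + dt * dr
        times.append(step * dt)
        a_values.append(a)
        r_values.append(r)
    return times, a_values, r_values

def half_rise_time(times, values):
    """First time at which a curve reaches half of its final (steady-state) value."""
    target = 0.5 * values[-1]
    return next(t for t, v in zip(times, values) if v >= target)

def relative_change_in_r(feedback):
    """Fractional drop in steady-state [R] when beta_A falls from 4 to 3 (25%)."""
    r_normal = simulate_loop(4.0, feedback)[2][-1]
    r_perturbed = simulate_loop(3.0, feedback)[2][-1]
    return (r_normal - r_perturbed) / r_normal

for feedback in (True, False):
    times, a_values, _ = simulate_loop(4.0, feedback)
    label = "with feedback" if feedback else "no feedback  "
    print(label,
          "half-rise time of A:", round(half_rise_time(times, a_values), 3),
          "| change in R:", round(relative_change_in_r(feedback), 3))
```

In this sketch the loop with feedback reaches half of its steady-state level of A sooner, because production starts at the full rate but the final level is lower, and the fractional change in R caused by the 25% drop in β_A is smaller than in the loop without feedback.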


Figure 8-77 The effect of fluctuations
in kinetic rate constants on a system
with negative feedback compared to
one without feedback. The plot at left
represents the levels of protein R after a
sudden activating stimulus, according to
the regulatory scheme in Figure 8-76A and
determined by the solution of Equation set
8-8 (see Figure 8-76B). A perturbation was
induced by changing β_A from 4 M/min (red
line) to 3 M/min (blue line). The plot at right
shows the results when negative feedback
was removed. The system with negative
feedback deviates less from its normal
operation as β_A changes than does the
system with no feedback. Notice that, as
in Figure 8-76C, the system with negative
feedback also reaches its steady state
more rapidly.


Delayed Negative Feedback Can Induce Oscillations

A beautiful thing happens when a negative feedback loop contains some delay
mechanism that slows the feedback signal through the loop: rather than
generating a new stable state as in a rapid negative feedback loop, a delayed loop
generates pulses, or oscillations, in the levels of its components. This can be seen,
for example, if the number of components in a negative feedback loop increases,
which leads to delays in the amount of time required for the cycle of signals to be
completed. Figure 8-78 compares the behavior of two network motifs, one with
a three-stage and one with a five-stage negative feedback loop. Using the same
kinetic parameters at each stage in the two loops, one finds that stable oscillations
arise in the longer loop, while in the shorter loop the same parameters lead to
relatively rapid convergence to a stable steady state.
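
The comparison of a short and a long loop can be reproduced in outline with a toy model. The sketch below is not the circuit of Figure 8-78: it assumes a ring in which every protein represses the next one (an odd number of repressions gives net negative feedback), identical parameters at every stage, and a cooperative repression term with exponent h = 2, because in this simple form noncooperative binding gives no sustained oscillation in either ring. All values are hypothetical and in arbitrary units.

```python
def simulate_ring(n, beta=10.0, h=2.0, dt=0.01, t_end=200.0):
    """Ring of n repressors: dx_i/dt = beta / (1 + x_(i-1)^h) - x_i.

    Returns the time course of the first component (forward-Euler steps).
    """
    x = [1.0 + 0.5 * i for i in range(n)]  # unequal starting levels
    trace = []
    for _ in range(int(round(t_end / dt))):
        # x[i - 1] wraps around for i = 0, which closes the loop.
        dx = [beta / (1.0 + x[i - 1] ** h) - x[i] for i in range(n)]
        x = [xi + dt * dxi for xi, dxi in zip(x, dx)]
        trace.append(x[0])
    return trace

def late_amplitude(trace, tail_fraction=0.25):
    """Peak-to-trough swing over the last part of the simulation."""
    tail = trace[int(len(trace) * (1.0 - tail_fraction)):]
    return max(tail) - min(tail)

for n in (3, 5):
    amplitude = late_amplitude(simulate_ring(n))
    print(f"{n}-stage loop: late-time swing in the first component = {amplitude:.4f}")
```

With the same parameters at every stage, the swing in the three-stage ring dies away to essentially nothing, whereas the five-stage ring settles into sustained oscillations. The extra stages add delay, which is the point made in the text.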

Changes in the parameters of a delayed negative feedback loop (binding
affinities, transcription rates, or protein stabilities, for example) can change the
amplitude and period of the oscillations, providing a remarkably versatile
mechanism for generating all sorts of oscillators that can be used for various purposes
in the cell. Indeed, many naturally occurring oscillators, including the calcium
oscillators described in Chapter 15 and the cell-cycle network described in
Chapter 17, use delayed negative feedback as the basis for biologically important
oscillations. Not all of the oscillations observed in cells are thought to have a function,
however. Oscillations become inevitable in a highly complex, multicomponent
biochemical pathway like glycolysis, due simply to the large number of feedback
loops that appear to be required for its regulation.

DNA Binding By a Repressor or an Activator Can Be Cooperative 

We have focused thus far on the binding of a single transcription regulator to a
single site in a gene promoter. Many promoters, however, contain multiple
adjacent binding sites for the same transcription regulator, and it is not uncommon for
these regulators to interact with each other on the DNA to form dimers or larger
oligomers. These interactions can result in a cooperative form of DNA binding,
such that DNA-binding affinity increases at higher concentrations of the
transcription regulator. Cooperativity produces a steeper transcriptional response to
increasing regulator concentration than the response that can be generated by the
binding of a monomeric protein to a single site. A steep transcriptional response
of this sort, when present in conjunction with positive feedback, is an important
ingredient for producing systems with the ability to switch between different
discrete phenotypic states. To begin to understand how this occurs, we need to
modify our equations to include cooperativity.

Cooperative binding events can produce steep S-shaped (or sigmoidal)
relationships between the concentration of regulatory protein and the amount bound
on the DNA (see Figure 15-16). In this case, a number called the Hill coefficient (h)
describes the degree of cooperativity, and we can include this coefficient in our
equations for calculating the bound fraction of promoter (Figure 8-79A). As the
Hill coefficient increases, the dependence of binding on protein concentration
becomes steeper (Figure 8-79B). In principle, the Hill coefficient is similar to the
number of molecules that must come together to generate a reaction. In practice,
however, cooperativity is rarely complete, and the Hill coefficient does not reach
this number.
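
One way to put a number on "steeper" is to ask how large a change in regulator concentration is needed to move the promoter from 10% to 90% occupancy. The sketch below uses the Hill form of the bound fraction, (K[A])^h / (1 + (K[A])^h); the function names and the choice of the 10% and 90% marks are ours.

```python
def hill_bound_fraction(K, conc, h=1.0):
    """Bound fraction of promoter with Hill coefficient h (h = 1: no cooperativity)."""
    y = (K * conc) ** h
    return y / (1.0 + y)

def concentration_for_fraction(K, fraction, h=1.0):
    """Invert the Hill equation: the concentration that gives a chosen bound fraction."""
    return (fraction / (1.0 - fraction)) ** (1.0 / h) / K

def switch_width(h):
    """Fold change in concentration needed to go from 10% to 90% bound."""
    return (concentration_for_fraction(1.0, 0.9, h)
            / concentration_for_fraction(1.0, 0.1, h))

for h in (1, 2, 4):
    print(f"h = {h}: {switch_width(h):.1f}-fold change in concentration "
          "takes occupancy from 10% to 90%")
```

The required fold change is 81 raised to the power 1/h: an 81-fold change without cooperativity, 9-fold for h = 2, and only 3-fold for h = 4. Half-maximal binding stays at a concentration of 1/K regardless of h.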


$$\text{bound fraction} = \frac{(K_A[A])^h}{1 + (K_A[A])^h} \quad \text{for activators, or} \quad \frac{(K_R[R])^h}{1 + (K_R[R])^h} \quad \text{for repressors}$$

(A)




Figure 8-78 Oscillations arising
from delayed negative feedback.
A transcriptional circuit with three
components (A, B) is less likely to oscillate
than a transcriptional circuit with five
components (C, D). The X (light blue),
Y (dark blue), and Z (brown) here represent
transcription regulatory proteins. For the
simulations in (B) and (D), the system was
initiated from random initial conditions for
X, Y, and Z. Oscillations are produced by
a delay induced as the signal propagates
through the loop.


Figure 8-79 How the cooperative
binding of transcription regulatory
proteins affects the fraction of
promoters bound. (A) Cooperativity is
incorporated into our mathematical models
by including a Hill coefficient (h) in the
equations used previously to determine the
fraction of bound promoter (see Figures
8-72E and 8-75A). When h is 1, the
equations shown here become identical to
the equations used previously, and there is
no cooperativity. (B) The left panel depicts a
cooperatively bound transcription activator
and the right panel depicts a cooperatively
bound transcription repressor. Recall
from Figure 8-75B that gene activity
is proportional to bound activator (left
panel) or unbound repressor (right panel).
Note that the plots get steeper as the Hill
coefficient increases.
Positive Feedback Is Important for Switchlike Responses and
Bistability

We turn now to positive feedback and its very important consequences. First and
foremost, positive feedback can make a system bistable, enabling it to persist in
either of two (or more) alternative steady states. The idea is simple and can be
conveyed by drawing an analogy with a candle, which can exist either in a burning
state or in an unlit state. The burning state is maintained by positive feedback: the
heat generated by burning keeps the flame alight. The unlit state is maintained
by the absence of this feedback signal: so long as sufficient heat has never been
applied, the candle will stay unlit.

For the biological system, as for the candle, bistability has an important
corollary: it means that the system has a memory, such that its present state depends
on its history. If we start with the system in an Off state and gradually increase the
concentration of the activator protein, there will come a point where
autostimulation becomes self-sustaining (the candle lights), and the system moves rapidly to
an On state. If we now intervene to decrease the level of activator, there will come
a point where the same thing happens in reverse, and the system moves rapidly
back to an Off state. But the transition points for switching on and switching off
are different, and so the current state of the system depends on the route by which
it has been taken in the past, a phenomenon called hysteresis.

A simple case of positive feedback can be seen in a regulatory system in which
a transcription regulator activates (directly or indirectly) its own expression, as in
Figure 8-80A. Positive feedback can also arise in a circuit with many intervening
repressors or activators, so long as the net overall effect of the interactions is
activation (Figure 8-80B and C).

To illustrate how positive feedback can generate stable states, let us focus on a
simple positive feedback loop containing two repressors, X and Y, each of which
inhibits expression of the other (Figure 8-81A). As we saw with Equation set 8-8
(Figure 8-76B) earlier, we can create differential equations describing the rate
of change of [X] and [Y] (Equation set 8-9, Figure 8-81B). We can further
modify these equations to include cooperativity by adding Hill coefficients. As we did
earlier, we can then create equations for calculating [X] and [Y] when the system
reaches a steady state (that is, when d[X]/dt = 0 and
d[Y]/dt = 0; Equations 8-10 and 8-11, Figure 8-81C).

Equations 8-10 and 8-11 can be used to carry out an intriguing mathematical
procedure called a nullcline analysis. These equations define the relationships
between the concentration of X at steady state, [X_st], and the concentration of Y
at steady state, [Y_st], which must be simultaneously satisfied. We can plug in
different values for [Y_st] in Equation 8-10, and calculate the corresponding [X_st] for
each of these values. We can then graph [X_st] as a function of [Y_st]. Next, we repeat
the process by varying [X_st] in Equation 8-11 to graph the resulting [Y_st]. The
intersections of these two graphs determine the theoretically possible steady states of
the system. For systems in which the Hill coefficients h_X and h_Y are much larger
than 1, the lines in the two graphs intersect at three locations (Figure 8-81D). In
other systems that have the same arrangement of regulators but different
parameters, there might only be one intersection, indicating the presence of only a single
Figure 8-80 Positive feedback of a gene
onto itself through serially connected
interactions. A sequence of activators and
repressors of any length can be connected
to produce a positive feedback loop, as
long as the overall sign is positive. Because
the negative of a negative is positive, not
only circuits (A) and (B) but also circuit (C)
create positive feedback.
Figure 8-81 A graphical nullcline analysis. (A) X inhibits Y and Y inhibits X, resulting
in a positive feedback loop. (B) Equation set 8-9 can be used to determine the rate of
change in the concentrations of proteins X and Y. (C) Equations 8-10 and 8-11 provide
the concentrations of proteins X and Y, respectively, when these concentrations reach
a steady state. (D, E) Blue curves (called nullclines) are plots of [X_st] calculated from
Equation 8-10 over a range of concentrations of [Y_st]. Red curves indicate values of [Y_st]
calculated from Equation 8-11 over a range of concentrations of [X_st]. At an intersection
of the two lines, both [X] and [Y] are at steady state. For plot (D), the binding of both
proteins to their target gene promoters was cooperative (h_X and h_Y much larger than 1),
resulting in the presence of multiple intersections of the nullclines, suggesting that the
system can assume multiple discrete steady states. In plot (E), the binding of protein
X to the promoter of gene Y was not cooperative (h_X close to 1), resulting in only one
nullcline intersection and thus just one likely steady state.


steady state. For example, when there is a low cooperativity of protein X binding
to the promoter of gene Y (that is, a small Hill coefficient, h_X, in Equation 8-11),
the plot of [Y] is less curved (Figure 8-81E), and it is less likely that there will be
multiple intersections of the two curves.
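
A nullcline analysis of this kind is easy to carry out numerically. The sketch below assumes steady-state relations of the form implied by the text, in which each protein's level equals its maximal level divided by 1 plus the cooperative binding term of the other protein. The parameter values are hypothetical and in arbitrary units, and the intersections are found simply by scanning for sign changes on a fine grid.

```python
def x_nullcline(y, beta_x=4.0, k_y=1.0, h_y=4.0, tau_x=1.0):
    """[X_st] for a given [Y]: gene X is repressed by Y with Hill coefficient h_y."""
    return beta_x * tau_x / (1.0 + (k_y * y) ** h_y)

def y_nullcline(x, beta_y=4.0, k_x=1.0, h_x=4.0, tau_y=1.0):
    """[Y_st] for a given [X]: gene Y is repressed by X with Hill coefficient h_x."""
    return beta_y * tau_y / (1.0 + (k_x * x) ** h_x)

def nullcline_intersections(h_x, h_y, x_max=5.0, n=5000):
    """Find points lying on both nullclines by scanning x for sign changes."""
    found = []
    prev_x = prev_gap = None
    for i in range(n + 1):
        x = x_max * i / n
        # The gap is zero exactly where x also satisfies the X nullcline.
        gap = x - x_nullcline(y_nullcline(x, h_x=h_x), h_y=h_y)
        if prev_gap is not None and prev_gap * gap < 0:
            x_mid = 0.5 * (prev_x + x)
            found.append((round(x_mid, 3), round(y_nullcline(x_mid, h_x=h_x), 3)))
        prev_x, prev_gap = x, gap
    return found

print("h_X = 4, h_Y = 4:", nullcline_intersections(4.0, 4.0))
print("h_X = 1, h_Y = 4:", nullcline_intersections(1.0, 4.0))
```

With strong cooperativity on both promoters the scan finds three intersections, whereas lowering h_X to 1 leaves a single one, mirroring the difference between panels D and E of Figure 8-81.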

We emphasized earlier that positive feedback typically generates a bistable
system with two stable steady states. Why does the system modeled in Figure 8-81D
have three? This conundrum can be explained by solving the reaction rate
equations (Equation set 8-9, Figure 8-81B) for various different starting conditions of
[X] and [Y], determining all values of [X] and [Y] as a function of time. Starting
with each set of initial concentrations of [X] and [Y], these calculations produce
a so-called trajectory of points, each indicated by a curved green line on Figure
8-82A. A fascinating pattern emerges: each trajectory moves across the plot and
settles in one of two steady states, but never in the third (middle steady state).
We conclude that the middle steady state is unstable because it cannot "attract"
any trajectories. The system therefore has only two stable steady states. Thus, the
number of stable steady states in a system need not be equal to the total number
of its theoretically possible steady states. In fact, stable steady states are usually
separated by unstable ones, as in our example.
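
The behavior of the trajectories can be checked with a minimal simulation. The model below is a hypothetical symmetric version of the mutual-repression circuit (identical parameters for X and Y, Hill coefficient 4, arbitrary units), integrated with simple forward-Euler steps from a handful of starting points chosen for illustration.

```python
def simulate_toggle(x0, y0, beta=4.0, h=4.0, dt=0.01, t_end=50.0):
    """Follow one trajectory of two mutually repressing genes; return its end point."""
    x, y = x0, y0
    for _ in range(int(round(t_end / dt))):
        dx = beta / (1.0 + y ** h) - x   # X is repressed by Y
        dy = beta / (1.0 + x ** h) - y   # Y is repressed by X
        x, y = x + dt * dx, y + dt * dy
    return x, y

# Starting concentrations of (X, Y); none lies exactly on the line X = Y.
STARTS = [(0.5, 1.0), (1.0, 0.5), (3.0, 3.5), (3.5, 3.0), (0.1, 0.2), (2.0, 0.1)]

for x0, y0 in STARTS:
    x_end, y_end = simulate_toggle(x0, y0)
    print(f"start ({x0}, {y0}) -> end ({x_end:.3f}, {y_end:.3f})")
```

Every trajectory ends with one protein high and the other low; none comes to rest at the middle steady state, where the two concentrations are equal.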

Once this system adopts a fate by settling in one of the two steady states, does
it have the ability to switch to the other state? The numerical solution of Equation







Figure 8-82 Analysis of the stability of
a system's steady states. (A) The dotted
lines are the nullclines for the system shown
in Figure 8-81. Also shown are dynamic
trajectories (green) that show the changes
over time in [X] and [Y], starting at a variety
of different initial concentrations (determined
by solution of Equation set 8-9; see Figure
8-81B). By plotting [X] versus [Y] at each
time point, we find that, although there are
three possible steady states in this system,
the dynamic trajectories converge on only
two of them. The middle steady state is
avoided: it is unstable, being unable to
attract any trajectories. (B) Imagine that
the system is at the upper-left steady state
and experiences a perturbation (black
arrows), such as a random fluctuation in
the production rates of X and/or Y. If the
perturbation is small (arrow 1), the system
will return to the same steady state. On
the other hand, a perturbation that drives
the system beyond the unstable (middle)
steady state (arrow 2) causes it to switch
to the lower-right steady state. The set of
perturbations that a system can withstand
without switching from one steady state
to the other is known as the region of
attraction of that steady state.
set 8-9 can again provide an answer. In Figure 8-82B, we show the solution of this
equation set for two perturbations from the upper-left steady state. For a small
perturbation, the system returns to its original steady state. But the larger
perturbation causes the system to switch to the alternate steady state. Thus, this system
can be switched from one stable steady state to the other by subjecting it to an
input (or a perturbation) that is large enough to make the other steady state more
attractive. More generally, every stable steady state has a corresponding region of
attraction, which can be intuitively thought of as the range of perturbations (of [X]
or [Y] in this example) for which the dynamic trajectories converge back to that
particular steady state, rather than switch to the other one.
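
The two perturbations can be mimicked in a small simulation. As before, this is a sketch with a hypothetical symmetric mutual-repression model (identical parameters for X and Y, arbitrary units), not the calculation behind Figure 8-82B. In such a symmetric model the boundary between the two regions of attraction is the line [X] = [Y].

```python
def simulate_toggle(x0, y0, beta=4.0, h=4.0, dt=0.01, t_end=50.0):
    """Forward-Euler trajectory of two mutually repressing genes; returns the end point."""
    x, y = x0, y0
    for _ in range(int(round(t_end / dt))):
        dx = beta / (1.0 + y ** h) - x   # X is repressed by Y
        dy = beta / (1.0 + x ** h) - y   # Y is repressed by X
        x, y = x + dt * dx, y + dt * dy
    return x, y

# Upper-left steady state of this model: X low, Y high.
X_LOW, Y_HIGH = simulate_toggle(0.0, 4.0)

# Perturb the system by suddenly adding protein X (hypothetical amounts).
for added_x in (1.0, 5.0):
    x_end, y_end = simulate_toggle(X_LOW + added_x, Y_HIGH)
    outcome = "returns to X low / Y high" if y_end > x_end else "switches to X high / Y low"
    print(f"add {added_x} units of X -> ({x_end:.3f}, {y_end:.3f}): {outcome}")
```

The smaller addition leaves [X] below [Y], so the system relaxes back to where it started. The larger one carries the system across the boundary, and it settles in the other stable steady state.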

The concept of a region of attraction has interesting implications for the
heritability of transcriptional states and the transition rates between them. If the region
of attraction around one steady state is large, for example, then most cells in the
population will assume this particular state. Furthermore, this state is likely to be
inherited by daughter cells, since minor perturbations, like those ensuing from
an asymmetric distribution of molecules during cell division, will rarely be
sufficient to induce switching to the other steady state. We should expect that the use
of positive feedback, coupled to cooperativity, will quite often be associated with
systems requiring stable cell memory.

Robustness Is an Important Characteristic of Biological Networks 

Biological regulatory systems are exposed to frequent and sometimes extreme 
variations in external conditions or in the concentrations or activities of key 
components. The ability of these systems to function normally in the face of such 
perturbations is called robustness. If we understand a complex system to the extent 
that we can reproduce its behavior with a computational model, then the robustness 
of the system can be assessed by determining how well its normal function persists 
following changes in various parameters, such as rate constants and component 
concentrations. We have already seen, for example, how the presence of negative 
feedback reduces the sensitivity of the steady state to changes in the values of the 
system's parameters (see Figure 8-77). Considerations of robustness also apply 
to dynamic behaviors. Thus, for example, when discussing negative feedback, we 
described how the behavior of a system tends to become more oscillatory as the 
number of components that constitute the feedback loop increases. If we use 
different values of the parameters in models derived for systems like those in 
Figure 8-78, we find that the system with the longer loop tends to exhibit stable 
oscillations within a much broader range of parameters, indicating that this system 
provides a more robust oscillator. We can perform similar calculations to determine 
the ability of different systems to achieve robust bistability arising from positive 
feedback. Thus, one benefit of computational models is that they allow us to probe 
the robustness of biological networks in a systematic and rigorous way. 
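
As a minimal sketch of such a robustness test, the following Python fragment compares how the steady state of an unregulated gene and of a negatively autoregulated gene responds when the synthesis rate is doubled. The rate law beta/(1 + K·X) for the autoregulated gene, and the names beta, delta, and K, are illustrative assumptions rather than the specific models behind Figure 8-77.

```python
import math


def steady_state_unregulated(beta, delta):
    """Steady state of dX/dt = beta - delta*X."""
    return beta / delta


def steady_state_negative_feedback(beta, delta, K):
    """Steady state of dX/dt = beta/(1 + K*X) - delta*X (assumed rate law).

    Setting the derivative to zero gives delta*K*X**2 + delta*X - beta = 0;
    the positive root is returned.
    """
    return (-1.0 + math.sqrt(1.0 + 4.0 * K * beta / delta)) / (2.0 * K)


def fold_change_on_doubling(beta, delta, K=None):
    """Ratio of steady states after and before doubling the synthesis rate."""
    if K is None:
        before = steady_state_unregulated(beta, delta)
        after = steady_state_unregulated(2.0 * beta, delta)
    else:
        before = steady_state_negative_feedback(beta, delta, K)
        after = steady_state_negative_feedback(2.0 * beta, delta, K)
    return after / before


if __name__ == "__main__":
    print("unregulated:", fold_change_on_doubling(1.0, 1.0))
    print("negative feedback:", fold_change_on_doubling(1.0, 1.0, K=10.0))
```

Under these assumptions the unregulated steady state doubles, whereas the autoregulated one changes by less than twofold, which is one simple way of quantifying the reduced sensitivity described above.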

Two Transcription Regulators That Bind to the Same Gene 
Promoter Can Exert Combinatorial Control 

Thus far, we have discussed how one transcription regulator can modulate the 
expression level of a gene. Most genes, however, are controlled by more than one 
type of transcription regulator, providing combinatorial control that allows two or 
more inputs to influence the expression of one gene. We can use computational 
methods to unveil some of the important regulatory features of combinatorial 
control systems. 

Consider a gene whose promoter contains binding sites for two regulatory 
proteins, A and R, which bind to their individual sites independently. There are 
four possible binding configurations (Figure 8-83A). Suppose that A is a 
transcription activator, R is a transcription repressor, and the gene is only active 
when A is bound and R is not bound. We learned earlier that the probability that A 
is bound and the probability that R is not bound can be determined by the 
equations in Figure 8-84A. The product of these two probabilities gives us the 
probability of gene activation. 

This example illustrates an AND NOT logic function (A and not R) (see Figure 
8-83A). Maximal activation of this gene is accomplished when [A] is high and [R] 
is zero. However, intermediate levels of gene activation are also possible, 
depending on the levels of A and R and also on the binding affinities of A and R 
for their respective sites (that is, K_A and K_R). When K_A ≫ K_R, even a small 
concentration of A is capable of overcoming repression by R. Conversely, if 
K_A ≪ K_R, then much more A is needed to activate the gene (Figure 8-84B and C). 
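
The AND NOT calculation can be sketched in a few lines of Python. The function names are hypothetical, concentrations and association constants are in arbitrary reciprocal units, and independent binding of A and R is assumed, as in the text.

```python
def fraction_bound(K, conc):
    """Probability that a site with association constant K is occupied."""
    return K * conc / (1.0 + K * conc)


def p_active_and_not(K_A, conc_A, K_R, conc_R):
    """Probability that A is bound AND R is NOT bound (independent sites)."""
    return fraction_bound(K_A, conc_A) * (1.0 - fraction_bound(K_R, conc_R))


if __name__ == "__main__":
    # Compare a strongly binding activator with a strongly binding repressor.
    for K_A, K_R in [(10.0, 0.1), (0.1, 10.0)]:
        for conc_A in (0.1, 1.0, 10.0):
            p = p_active_and_not(K_A, conc_A, K_R, conc_R=1.0)
            print(f"K_A={K_A}, K_R={K_R}, [A]={conc_A}, [R]=1.0 -> P={p:.3f}")
```

Scanning [A] and [R] over a grid with this function would give the kind of two-input map shown in Figure 8-84B and C, under the stated assumptions.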

Many other logic functions can govern combinatorial gene regulation. For 
example, an AND logic gate results when two activators, A1 and A2, are both 
required for a gene to be transcribed (Figures 8-83B and 8-84D). In E. coli cells, 
the AraJ gene controls some aspects of arabinose sugar metabolism: its 
expression requires two transcription regulators, one activated by arabinose and 
the other activated by the small molecule cAMP (Figure 8-84E). 


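\[
\text{fraction of A bound} = \frac{K_A[\mathrm{A}]}{1 + K_A[\mathrm{A}]}
\]

\[
\text{fraction of R not bound} = \frac{1}{1 + K_R[\mathrm{R}]}
\]

\[
P(\mathrm{A}, \mathrm{R}) = \frac{K_A[\mathrm{A}]}{1 + K_A[\mathrm{A}]} \cdot \frac{1}{1 + K_R[\mathrm{R}]}
\]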

Figure 8-83 Combinatorial control of 
gene expression. There are many ways in 
which gene expression can be controlled 
by two transcription regulators. To define 
precisely the relationship between the two 
inputs and the gene expression output, a 
regulatory circuit is often described as a 
specific type of logic gate, a term borrowed 
from electronic circuit design. A simple 
example is the OR logic gate (not shown 
here), in which a gene is controlled by two 
transcription activators, and one or the 
other can activate gene expression. (A) In 
a system with an activator A and repressor 
R, if transcription is turned on only when A 
is bound and R is not, then the result is an 
AND NOT logic gate. We saw an example 
of this logic in Chapter 7 (Figure 7-15). 
(B) An AND gate results when two 
transcription activators, A1 and A2, are 
both required to turn on a gene. 


Figure 8-84 How the quantitative output of 
a gene depends on both its combinatorial 
logic and the affinities of transcription 
regulators. (A) In a combinatorial gene 
regulatory system like that illustrated in Figure 
8-83A, the fraction of promoters bound by 
activator A and not bound by repressor R are 
each determined as shown here. The product 
of these probabilities provides the probability, 
P(A, R), that a gene promoter is active. 
(B-E) In these four panels, red indicates 
high gene expression and blue indicates low 
gene expression. (B) and (C) depict gene 
expression from the system described 
in panel (A). The two panels demonstrate 
how the system behaves when the relative 
affinities of the two transcription regulators 
change as indicated above each panel. 
(D) Gene expression in a case where the 
gene turns on only at high levels of both 
activating inputs (A1 and A2), as shown in 
Figure 8-83B. (E) Experimental data showing 
measured expression of a gene in E. coli 
that is combinatorially regulated by two 
inputs: arabinose and cAMP. Note the close 
resemblance to panel (D). (E, adapted from 
S. Kaplan et al., Mol. Cell 29:786-792, 2008.) 

An Incoherent Feed-forward Interaction Generates Pulses 

Imagine that a sudden input signal immediately activates a transcription activator 
A and that the same input signal induces the much slower synthesis of a 
transcription repressor protein R that acts on the same gene X. If A and R control 
gene expression by an AND NOT logic function like that described above, our 
intuition tells us that this system should be able to generate a pulse of 
transcription: when A is activated (and R is absent), the transcription of gene X 
will begin and cause an increase in the concentration of protein X, but then 
transcription will shut off when the concentration of R increases to a sufficiently 
high value. 

Arrangements of this type are common in the cell. In E. coli, for example, 
galactose metabolic genes are positively regulated by the catabolite activator 
protein (CAP), which is activated at high levels of cAMP. The same genes are 
repressed by the GalS repressor protein, which is encoded by a gene whose 
transcription is likewise activated by CAP. Thus, an increase in input (cAMP) 
activates A (CAP), and transcription of the galactose genes begins. But activation 
of A also causes a subsequent buildup of R (GalS), which causes the same genes to 
be repressed after a delay. This arrangement is an incoherent feed-forward motif 
(Figure 8-85A). 

The response of the incoherent feed-forward motif will vary, depending on 
the parameters of the system. Suppose, for example, that the transcription 
activator protein A binds more weakly to the gene regulatory region than does the 
transcription repressor protein R (K_A < K_R). In this case, there will be a 
transient burst of protein synthesized by the affected gene (gene X) in response to 
a sudden activating input (Figure 8-85B). In contrast, the output will be more 
sustained if K_A is much larger than K_R, because the repression will be too weak 
to overcome the gene activation (Figure 8-85C). Other properties of this network, 
such as the dependence of the amplitude of the pulse on the various rate constants 
in the system, can be explored with the same computational tools. Thus, our 
intuitive guess about how this system would behave was only partially correct; 
even the simplest of networks depends on precise interaction strengths, 
demonstrating yet again why mathematics is needed to complement cartoon drawings. 
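
A rough numerical sketch of this motif is given below. It integrates two ordinary differential equations with a simple Euler scheme, using the AND NOT promoter probability for gene X. All rate constants, the fixed activator level, and the function name are illustrative assumptions, not the values used to draw Figure 8-85.

```python
def simulate_incoherent_ffl(K_A, K_R, t_end=100.0, dt=0.01):
    """Return the time course of protein X after a sustained input at t = 0.

    Assumptions: A is fully active (concentration 1) as soon as the input
    arrives; R is made slowly in response to A; X is made at a rate
    proportional to P(A bound and R not bound) and degraded at unit rate.
    """
    conc_A = 1.0
    conc_R = 0.0
    conc_X = 0.0
    beta_R, delta_R = 0.1, 0.1   # slow synthesis and turnover of repressor
    beta_X, delta_X = 1.0, 1.0   # faster synthesis and turnover of X
    course = []
    for _ in range(int(t_end / dt)):
        p_active = (K_A * conc_A / (1.0 + K_A * conc_A)) / (1.0 + K_R * conc_R)
        d_R = beta_R * conc_A - delta_R * conc_R
        d_X = beta_X * p_active - delta_X * conc_X
        conc_R += d_R * dt
        conc_X += d_X * dt
        course.append(conc_X)
    return course


if __name__ == "__main__":
    for K_A, K_R in [(1.0, 100.0), (100.0, 1.0)]:
        x = simulate_incoherent_ffl(K_A, K_R)
        print(f"K_A={K_A}, K_R={K_R}: peak={max(x):.3f}, final={x[-1]:.3f}")
```

With these assumed parameters, the weak-activator case returns close to baseline after its peak, while the strong-activator case settles at a large fraction of its peak, in qualitative agreement with the pulse and sustained responses described in the text.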


Figure 8-85 How an incoherent feed- 
forward motif can generate a brief 
pulse of gene activation in response 
to a sustained input. (A) Diagram of an 
incoherent feed-forward motif in which the 
transcription activator A and the repressor 
R control the expression of gene X using 
the AND NOT logic of Figure 8-83A. 
(B) When K_A ≪ K_R, this motif generates a 
pulse of protein X expression, such that the 
output goes back down even if the input 
remains high. (C) When K_A ≫ K_R, the same 
motif responds to a sustained input by 
generating a sustained output. 


A Coherent Feed-forward Interaction Detects Persistent Inputs 

In the bacterium E. coli, the sugar arabinose is only consumed when the preferred 
sugar, glucose, is scarce. The strategy that cells use to assess the presence of 
arabinose and absence of glucose involves a feed-forward arrangement that is 
different from the one just described. In this case, depletion of glucose causes an 
increase of cAMP, which is sensed by the CAP transcription activator protein, as 
described previously. In this case, however, CAP also induces the synthesis of a 
second transcription activator, AraC. Both activator proteins are necessary to 
activate arabinose metabolic genes (the AND logic function in Figure 8-83B). 

This arrangement, known as a coherent feed-forward motif, has the interesting 
characteristics illustrated in Figure 8-86. Imagine that two activators, A1 and 
A2, are both required to initiate transcription of a gene. The input to the network 
activates A1 directly, but only activates A2 through this A1 activation. Thus, for 
a protein to be synthesized from this gene, long-term inputs are required that 
allow both A1 and A2 to be produced in active form. Brief input pulses are either 
ignored or produce small outputs. The requirement for a long input is important if 
assurances about a signal are needed before a costly cellular program is triggered. 
For example, glucose is the sugar on which E. coli cells grow best. Before cells 
trigger arabinose metabolism in the example above, it might be beneficial to be 
sure that glucose has been depleted (a sustained CAP pulse), rather than inducing 
the arabinose program during a transient glucose fluctuation. 
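
The filtering of brief inputs can be illustrated with a small simulation. In this sketch, A1 follows the input instantly, A2 accumulates slowly while A1 is present, and gene X obeys AND logic; the rate constants, the shared affinity K, and the function name are assumptions chosen only for illustration.

```python
def peak_output_coherent_ffl(input_duration, t_end=60.0, dt=0.01, K=1.0):
    """Return the peak level of protein X for a square input pulse.

    Assumptions: A1 is active (concentration 1) only while the input is on;
    A2 is synthesized slowly in response to A1; X is made at a rate
    proportional to P(A1 bound) * P(A2 bound) and degraded at unit rate.
    """
    conc_A2 = 0.0
    conc_X = 0.0
    peak = 0.0
    for step in range(int(t_end / dt)):
        t = step * dt
        conc_A1 = 1.0 if t < input_duration else 0.0
        p_A1 = K * conc_A1 / (1.0 + K * conc_A1)
        p_A2 = K * conc_A2 / (1.0 + K * conc_A2)
        d_A2 = 0.2 * conc_A1 - 0.2 * conc_A2   # slow build-up of A2
        d_X = p_A1 * p_A2 - conc_X             # AND logic drives X
        conc_A2 += d_A2 * dt
        conc_X += d_X * dt
        peak = max(peak, conc_X)
    return peak


if __name__ == "__main__":
    print("brief input:", round(peak_output_coherent_ffl(1.0), 3))
    print("prolonged input:", round(peak_output_coherent_ffl(30.0), 3))
```

Under these assumptions, a brief input yields only a small output because A2 has little time to accumulate, whereas a prolonged input drives X close to its maximal level.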

The Same Network Can Behave Differently in Different Cells Due 
to Stochastic Effects 

Up to this point, we have assumed that all cells in a population produce identical 
behaviors if they contain the same network. It is important, however, to account 
for the fact that cells often show considerable individuality in their responses. 
Consider a situation in which a single mother cell divides into two daughter cells 
of equal volume. If the mother cell has only one molecule of a given protein, then 
only one daughter will inherit it. The daughters, though genetically identical, are 
already different. This variability is most pronounced for molecules that are 
present in small numbers. Nevertheless, even when there are many copies of a 
particular protein (or RNA), it is very unlikely that both daughter cells will end 
up with exactly the same number of molecules. 
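
The partitioning argument can be made concrete with a short calculation. The sketch below assumes that each molecule is segregated independently and with equal probability to either daughter (binomial partitioning); the function names are hypothetical.

```python
import math
import random


def partition_molecules(n_molecules, rng):
    """Assign each molecule independently to daughter 1 with probability 0.5."""
    to_first = sum(1 for _ in range(n_molecules) if rng.random() < 0.5)
    return to_first, n_molecules - to_first


def probability_of_equal_split(n_molecules):
    """Exact binomial probability that both daughters get the same number."""
    if n_molecules % 2 == 1:
        return 0.0  # an odd number can never be split evenly
    return math.comb(n_molecules, n_molecules // 2) / 2 ** n_molecules


if __name__ == "__main__":
    rng = random.Random(0)
    for n in (1, 10, 1000):
        print(n, partition_molecules(n, rng), probability_of_equal_split(n))
```

Even with 1000 copies, the probability of a perfectly even split under this assumption is only a few percent, so some difference between the daughters is the rule rather than the exception.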

This is just one illustration of a universal feature of cells: their behaviors are 
often stochastic, meaning that they display variability in their protein content and 
therefore exhibit variations in phenotypes. In addition to the asymmetric 
partitioning of molecules following cell division, variability can originate from 
many chemical reactions. Imagine, for example, that our mother cell contains a 
simple gene regulatory circuit with a positive feedback loop like that shown in 
Figure 8-80B. Even if both daughter cells receive a copy of this circuit, including 
one copy of the initial transcription activator protein, there will be variability in 
the time required for promoter binding, and it will be statistically nearly 
impossible for the genes in the two daughter cells to become activated at precisely 
the same time. If the system is bistable and poised near a switching point, then 
variability in the response might flip the switch in only one daughter cell. Two 
daughter cells that were born identical can thereby acquire, by chance, a dramatic 
difference in phenotype. 

More generally, isogenic populations of cells grown in the same environment 
display diversity in size, shape, cell-cycle position, and gene expression. These 
differences arise because biochemical reactions require probabilistic collisions 


Figure 8-86 How a coherent feed- 
forward motif responds to various 
inputs. (A) Diagram of a coherent feed- 
forward motif in which the transcription 
activators A1 and A2 together activate 
expression of gene X using the AND logic 
of Figure 8-83B. (B) The response to a 
brief input can be either weak (as shown) or 
nonexistent. This allows the motif to ignore 
random fluctuations in the concentration of 
signaling molecules. (C) A prolonged input 
produces a strong response that can turn 
off rapidly. 

between randomly moving molecules, with each event resulting in changes in the 
number of molecular species by integer amounts. The amplified effect of 
fluctuations in a molecular reactant, or the compounded effects of fluctuations 
across many molecular reactants, often accumulates as an observable phenotype. 
This can endow a cell with individuality and generate non-genetic cell-to-cell 
variability in a population. 

Non-genetic variability can be studied in the laboratory by single-cell 
measurements of fluorescent proteins expressed from genes under the control of a 
specific promoter. Live cells can be mounted on a slide and viewed through a 
fluorescence microscope, revealing the striking variability in protein expression 
levels (Figure 8-87). Another approach is to use flow cytometry, which works by 
streaming a dilute suspension of cells past an illuminator and measuring the 
fluorescence of individual cells as they flow past the detector (see Figure 8-2). 
Fluorescence values can be used to build histograms that reveal the variability in 
a process across a population of cells, with a broad histogram indicating higher 
variability. 

Several Computational Approaches Can Be Used to Model the 
Reactions in Cells 

We have focused primarily on the use of ordinary differential equations to model 
the dynamics of simple regulatory circuits. These models are called deterministic, 
because they do not incorporate stochastic variability and will always produce 
the same result from a specific set of parameters. As we have seen, such models 
can provide useful insights, particularly in the detailed mechanistic analysis of 
small regulatory circuits. However, other types of computational approaches are 
also needed to comprehend the great complexity of cell behavior. Stochastic 
models, for example, attempt to account for the very important problem of random 
variability in molecular networks. These models do not provide deterministic 
predictions about the behavior of molecules; instead, they incorporate random 
variation into molecule numbers and interactions, and the purpose of these models 
is to obtain a better understanding of the probability that a system will exist in a 
certain state over time. 
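
The text does not name a particular stochastic method. As one commonly used possibility, the sketch below applies the Gillespie stochastic simulation algorithm to the simplest case, a protein that is synthesized at a constant rate and degraded in proportion to its copy number. The rate constants and function name are illustrative assumptions.

```python
import random


def gillespie_birth_death(k_syn, k_deg, t_end, seed=0):
    """Simulate synthesis (rate k_syn) and degradation (rate k_deg * n).

    Returns the event times and the molecule count after each event.
    """
    rng = random.Random(seed)
    t, n = 0.0, 0
    times, counts = [0.0], [0]
    while True:
        rate_syn = k_syn
        rate_deg = k_deg * n
        rate_total = rate_syn + rate_deg
        # Waiting time to the next reaction is exponentially distributed.
        t += rng.expovariate(rate_total)
        if t > t_end:
            break
        # Choose which reaction fires, in proportion to its rate.
        if rng.random() * rate_total < rate_syn:
            n += 1
        else:
            n -= 1
        times.append(t)
        counts.append(n)
    return times, counts


if __name__ == "__main__":
    for seed in (1, 2, 3):
        _, counts = gillespie_birth_death(k_syn=10.0, k_deg=1.0, t_end=20.0, seed=seed)
        print("seed", seed, "final copy number", counts[-1])
```

Each run changes the copy number in integer steps and gives a different trajectory, in contrast to a deterministic model, which would return the same curve for the same parameters every time.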

Numerous other modeling strategies have been or are being developed. Boolean 
networks are used for the qualitative analysis of complex gene regulatory 
networks containing large numbers of interacting components. In these models, 
each molecule is a node that can exist in either the active or inactive state, thereby 
affecting the state of the nodes it is linked to. Models of this sort provide insights 
into the flow of information through a network, and they were useful in helping 
us understand the complex gene regulatory network that controls the early 
development of the sea urchin (see Figure 7-43). Boolean networks therefore reduce 
complex networks to a highly simplified (and potentially inaccurate) form. At the 
other extreme are agent-based simulations, in which thousands of molecules (or 
"agents") in a system are modeled individually, and their probable behaviors and 
interactions with each other over time are calculated on the basis of predicted 
physical and chemical behaviors, often while taking stochastic variation into 
account. Agent-based approaches are computationally demanding but have the 
potential to generate highly lifelike simulations of real biological systems. 
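
A Boolean model can be very compact. The following sketch encodes the incoherent feed-forward motif discussed earlier as a Boolean network with synchronous updates; the node names, the update rules, and the choice of synchronous updating are assumptions made for illustration.

```python
def step(state, rules):
    """Apply every rule to the current state at once (synchronous update)."""
    return {node: rule(state) for node, rule in rules.items()}


# Each node is either active (True) or inactive (False).
RULES = {
    "input": lambda s: s["input"],          # input is held constant
    "A": lambda s: s["input"],              # activator follows the input
    "R": lambda s: s["A"],                  # repressor is induced by A
    "X": lambda s: s["A"] and not s["R"],   # AND NOT logic at gene X
}


def trajectory(initial_state, rules, n_steps):
    """Return the list of states visited, starting with the initial state."""
    states = [initial_state]
    for _ in range(n_steps):
        states.append(step(states[-1], rules))
    return states


if __name__ == "__main__":
    start = {"input": True, "A": False, "R": False, "X": False}
    for state in trajectory(start, RULES, 4):
        print(state)
```

In this toy network X switches on for a single step and then off again, a qualitative echo of the pulse produced by the differential-equation model, but with all quantitative detail removed.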

Statistical Methods Are Critical for the Analysis of Biological Data 

Dynamics, differential equations, and theoretical modeling are not the be-all and 
end-all of mathematics. Other branches of the subject are no less important for 
biologists. Statistics (the mathematics of probabilistic processes and noisy 
datasets) is an inescapable part of every biologist's life. 

This is true in two main ways. First, imperfect measurement devices and other 
errors generate experimental noise in our data. Second, all cell-biological 
processes depend on the stochastic behavior of individual molecules, as we just 
discussed, and this results in biological noise in our results. How, in the face of 
all this noise, do we come to conclusions about the truth of hypotheses? The answer 
is statistical analysis, which shows how to move from one level of description to 



Figure 8-87 Different levels of gene 
expression in individual cells within a 
population of E. coli bacteria. For this 
experiment, two different reporter proteins 
(one fluorescing green, the other red), 
controlled by a copy of the same promoter, 
have been introduced into all of the 
bacteria. Some cells express only one gene 
copy, and so appear either red or green, 
while others express both gene copies, 
and so appear yellow. This experiment 
reveals variable levels of fluorescence, 
indicating variable levels of gene expression 
within an apparently uniform population of 
cells. (From M.B. Elowitz et al., Science 
297:1183-1186, 2002. With permission 
from AAAS.) 

another: from a set of erratic individual data points to a simpler description of the 
key features of the data. 

Statistics teaches us that the more times we repeat our measurements, the better 
and more refined the conclusions we can draw from them. Given many repetitions, 
it becomes possible to describe our data in terms of variables that summarize the 
features that matter: the mean value of the measured variable, taken over 
the set of data points; the magnitude of the noise (the standard deviation of the 
set of data points); the likely error in our estimate of the mean value (the standard 
error of the mean); and, for specialists, the details of the probability distribution 
describing the likelihood that an individual measurement will yield a given value. 
For all these things, statistics provides recipes and quantitative formulas that 
biologists must understand if they are to make rigorous conclusions on the basis of 
variable results. 
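
The three summary quantities named above can be computed directly with the Python standard library. The measurement values below are invented for illustration only, and the sample (n - 1) form of the standard deviation is assumed.

```python
import math
import statistics


def summarize(values):
    """Return the mean, sample standard deviation, and standard error of the mean."""
    n = len(values)
    mean = statistics.mean(values)
    sd = statistics.stdev(values)   # sample standard deviation (n - 1 denominator)
    sem = sd / math.sqrt(n)         # standard error of the mean
    return mean, sd, sem


if __name__ == "__main__":
    # Hypothetical repeated fluorescence measurements (arbitrary units).
    measurements = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
    mean, sd, sem = summarize(measurements)
    print(f"mean={mean:.3f}, sd={sd:.3f}, sem={sem:.3f}")
```

Because the standard error of the mean shrinks with the square root of the number of measurements, repeating an experiment tightens the estimate of the mean even though the underlying noise (the standard deviation) is unchanged.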


Summary 

Quantitative mathematical analysis can provide a powerful extra dimension in our 
understanding of cell regulation and function. Cell regulatory systems often depend 
on macromolecular interactions, and mathematical analysis of the dynamics of 
these interactions can unveil important insights into the importance of binding 
affinities and protein stability in the generation of transcriptional or other signals. 
Regulatory systems often employ network motifs that generate useful behaviors: 
a rapid negative feedback loop dampens the response to input signals; a delayed 
negative feedback loop creates a biochemical oscillator; positive feedback yields a 
system that alternates between two stable states; and feed-forward motifs provide 
systems that generate transient signal pulses or respond only to sustained inputs. 
The dynamic behavior of these network motifs can be dissected in detail with 
deterministic and stochastic mathematical modeling. 


WHAT WE DON'T KNOW 


• Many of the tools that revolutionized 
DNA technology were discovered by 
scientists studying basic biological 
problems that had no obvious 
applications. What are the best 
strategies to ensure that such crucially 
important technologies will continue to 
be discovered? 

• As the cost of DNA sequencing 
decreases and the amount of 
sequence data accumulates, how 
are we going to keep track of and 
meaningfully analyze this vast amount 
of information? What new questions 
will this information allow us to answer? 

• Can we develop tools to analyze 
each of the post-transcriptional 
modifications on the proteins in living 
cells, so as to follow all of their 
changes in real time? 

• Can we develop mathematical 
models to accurately describe the 
enormous complexity of cellular 
networks and to predict undiscovered 
components and mechanisms? 


PROBLEMS 

Which statements are true? Explain why or why not. 

8-1 Because a monoclonal antibody recognizes a specific 
antigenic site (epitope), it binds only to the specific 
protein against which it was made. 

8-2 Given the inexorable march of technology, it 
seems inevitable that the sensitivity of detection of 
molecules will ultimately be pushed beyond the yoctomole 
level (10^-24 mole). 

8-3 If each cycle of PCR doubles the amount of DNA 
synthesized in the previous cycle, then 10 cycles will give a 
10^3-fold amplification, 20 cycles will give a 10^6-fold 
amplification, and 30 cycles will give a 10^9-fold amplification. 

8-4 To judge the biological importance of an interaction 
between protein A and protein B, we need to know 
quantitative details about their concentrations, affinities, 
and kinetic behaviors. 

8-5 The rate of change in the concentration of any 
molecular species X is given by the balance between its 
rate of appearance and its rate of disappearance. 

8-6 After a sudden increase in transcription, a protein 
with a slow rate of degradation will reach a new steady-state 
level more quickly than a protein with a rapid rate of 
degradation. 


Discuss the following problems. 

8-7 A common step in the isolation of cells from a 
sample of animal tissue is to treat the tissue with trypsin, 
collagenase, and EDTA. Why is such a treatment necessary, 
and what does each component accomplish? And 
why does this treatment not kill the cells? 

8-8 Tropomyosin, at 93 kd, sediments at 2.6S, whereas 
the 65-kd protein, hemoglobin, sediments at 4.3S. (The 
sedimentation coefficient S is a linear measure of the rate 
of sedimentation.) These two proteins are drawn to scale 
in Figure Q8-1. How is it that the bigger protein sediments 
more slowly than the smaller one? Can you think of an 
analogy from everyday experience that might help you 
with this problem? 



Figure Q8-1 Scale models of tropomyosin 
and hemoglobin (Problem 8-8). 

8-9 Hybridoma technology allows one to generate 
monoclonal antibodies to virtually any protein. Why is 
it, then, that genetically tagging proteins with epitopes is 
such a commonly used technique, especially since an 
epitope tag has the potential to interfere with the function of 
the protein? 

8-10 How many copies of a protein need to be present 
in a cell in order for it to be visible as a band on an SDS 
gel? Assume that you can load 100 μg of cell extract onto 
a gel and that you can detect 10 ng in a single band by 
silver staining the gel. The concentration of protein in cells 
is about 200 mg/mL, and a typical mammalian cell has a 
volume of about 1000 μm^3 and a typical bacterium a 
volume of about 1 μm^3. Given these parameters, calculate 
the number of copies of a 120-kd protein that would need 
to be present in a mammalian cell and in a bacterium in 
order to give a detectable band on a gel. You might try an 
order-of-magnitude guess before you make the 
calculations. 

8-11 You have isolated the proteins from two adjacent 
spots after two-dimensional polyacrylamide-gel 
electrophoresis and digested them with trypsin. When the masses 
of the peptides were measured by MALDI-TOF mass 
spectrometry, the peptides from the two proteins were found to 
be identical except for one (Figure Q8-2). For this peptide, 
the mass-to-charge (m/z) values differed by 80, a value 
that does not correspond to a difference in amino acid 
sequence. (For example, glutamic acid instead of valine at 
one position would give an m/z difference of around 30.) 
Can you suggest a possible difference between the two 
peptides that might account for the observed m/z 
difference? 

Figure Q8-2 Masses of peptides measured by MALDI-TOF mass 
spectrometry (Problem 8-11). Only the numbered peaks differ between 
the two protein samples. [Plot x-axis: m/z (mass-to-charge ratio).] 


DNA to be amplified 

5'-GACCTGTGGAAGC          CATACGGGATTGA-3' 
3'-CTGGACACCTTCG          GTATGCCCTAACT-5' 

primers 

(1) 5'-GACCTGTGGAAGC-3' 
(2) 5'-CTGGACACCTTCG-3' 
(3) 5'-CGAAGGTGTCCAG-3' 
(4) 5'-GCTTCCACAGGTC-3' 
(5) 5'-CATACGGGATTGA-3' 
(6) 5'-GTATGCCCTAACT-3' 
(7) 5'-TGTTAGGGCATAC-3' 
(8) 5'-TCAATCCCGTATG-3' 

Figure Q8-3 DNA to be amplified and potential PCR primers 
(Problem 8-12). 


8-12 You want to amplify the DNA between the two 
stretches of sequence shown in Figure Q8-3. Of the listed 
primers, choose the pair that will allow you to amplify the 
DNA by PCR. 

8-13 In the very first round of PCR using genomic DNA, 
the DNA primers prime synthesis that terminates only 
when the cycle ends (or when a random end of DNA is 
encountered). Yet, by the end of 20 to 30 cycles, a typical 
amplification, the only visible product is defined 
precisely by the ends of the DNA primers. In what cycle is a 
double-stranded fragment of the correct size first 
generated? 

8-14 Explain the difference between a gain-of-function 
mutation and a dominant-negative mutation. Why are 
both these types of mutation usually dominant? 

8-15 Discuss the following statement: "We would 
have no idea today of the importance of insulin as a 
regulatory hormone if its absence were not associated with 
the human disease diabetes. It is the dramatic 
consequences of its absence that focused early efforts on the 
identification of insulin and the study of its normal role in 
physiology." 

8-16 You have just gotten back the results from an 
RNA-seq analysis of mRNAs from liver. You had anticipated 
counting the number of reads of each mRNA to determine 
the relative abundance of different mRNAs. But you 
are puzzled because many of the mRNAs have given you 
results like those shown in Figure Q8-4. How is it that 
different parts of an mRNA can be represented at different 
levels? 

8-17 Examine the network motifs in Figure Q8-5. 
Decide which ones are negative feedback loops and which 
are positive. Explain your reasoning. 

8-18 Imagine that a random perturbation positions a 
bistable system precisely at the boundary between two 
stable states (at the orange dot in Figure Q8-6). How would 
the system respond? 



Figure Q8-4 RNA-seq reads for a liver mRNA (Problem 8-16). The 
exon structure of the mRNA is indicated, with protein-coding segments 
indicated in light blue and untranslated regions in dark blue. The 
numbers of sequencing reads are indicated by the heights of the 
vertical lines above the mRNA. 

[Figure Q8-5 diagram: panels (A) and (B), each showing an activating input 
and genes X, Y, and Z.] 


Figure Q8-6 Perturbations of a bistable system (Problem 8-18). As 
shown by the green lines, after perturbation 1 the system returns to 
its original stable state (green dot at left), and after perturbation 2, the 
system moves to the other stable state (green dot at right). Perturbation 
3 moves the system to the precise boundary between the two stable 
states (orange dot). 

Figure Q8-5 Network motifs composed of transcription activators and 
repressors (Problem 8-17). 

8-19 Detailed analysis of the regulatory region of the 
Lac operon has revealed surprising complexity. Instead 
of a single binding site for the Lac repressor, as might be 
expected, there are three sites termed operators: O1, O2, 
and O3, arrayed along the DNA as shown in Figure Q8-7. 
To probe the functions of these three sites, you make a 
series of constructs in which various combinations of 
operator sites are present. You examine their ability to repress 
expression of β-galactosidase, using either tetrameric 
(wild type) or dimeric (mutant) forms of the Lac 
repressor. The dimeric form of the repressor can bind to a single 
operator (with the same affinity as the tetramer) with each 
monomer binding to half the site. The tetramer, the form 
normally expressed in cells, can bind to two sites 
simultaneously. When you measure repression of β-galactosidase 
expression, you find the results shown in Figure Q8-7, with 
higher numbers indicating more effective repression. 

A. Which single operator site is the most important 
for repression? How can you tell? 

B. Do combinations of operator sites (Figure Q8-7, 
constructs 1, 2, 3, and 5) substantially increase repression 
by the dimeric repressor? Do combinations of operator 
sites substantially increase repression by the tetrameric 
repressor? If the two repressors behave differently, offer an 
explanation for the difference. 

C. The wild-type repressor binds O3 very weakly 
when it is by itself on a segment of DNA. However, if O1 is 
included on the same segment of DNA, the repressor binds 
O3 quite well. How can that be? 

[Figure Q8-7 diagram: operator separations of 92 bp and 401 bp; repression 
columns for 2-mer and 4-mer repressors.] 

Figure Q8-7 Repression of β-galactosidase by promoter regions that 
contain different combinations of Lac repressor binding sites (Problem 
8-19). The base-pair (bp) separation of the three operator sites is 
shown. Numbers at right refer to the level of repression, with higher 
numbers indicating more effective repression by dimeric (2-mer) or 
tetrameric (4-mer) repressors. (From S. Oehler et al., EMBO J. 
9:973-979, 1990. With permission from John Wiley and Sons.) 

CHAPTER 9 

Visualizing Cells 


Understanding the structural organization of cells is essential for learning how 
they function. In this chapter, we briefly describe some of the principal 
microscopy methods used to study cells. Optical microscopy will be our starting 
point because cell biology began with the light microscope, and it is still an 
indispensable tool. The development of methods for the specific labeling and 
imaging of individual cellular constituents and the reconstruction of their 
three-dimensional architecture has meant that, far from falling into disuse, optical 
microscopy continues to increase in importance. One advantage of optical 
microscopy is that light is relatively nondestructive. By tagging specific cell 
components with fluorescent probes, such as intrinsically fluorescent proteins, we 
can watch their movement, dynamics, and interactions in living cells. Although 
conventional optical microscopy is limited in resolution by the wavelength of 
visible light, new methods cleverly bypass this limitation and allow the position of 
even single molecules to be mapped. By using a beam of electrons instead of visible 
light, electron microscopy can image the interior of cells, and their macromolecular 
components, at almost atomic resolution, and in three dimensions. 

This chapter is intended as a companion, rather than an introduction, to the 
chapters that follow; readers may wish to refer back to it as they encounter 
applications of microscopy to basic biological problems in the later pages of the book. 


IN THIS CHAPTER 

LOOKING AT CELLS IN THE 
LIGHT MICROSCOPE 

LOOKING AT CELLS AND 
MOLECULES IN THE ELECTRON 
MICROSCOPE 


LOOKING AT CELLS IN THE LIGHT MICROSCOPE 

A typical animal cell is 10-20 μm in diameter, which is about one-fifth the size of 
the smallest object that we can normally see with the naked eye. Only after good 
light microscopes became available in the early part of the nineteenth century did 
Schleiden and Schwann propose that all plant and animal tissues were aggregates 
of individual cells. Their proposal in 1838, known as the cell doctrine, marks the 
formal birth of cell biology. 

Animal cells are not only tiny, but they are also colorless and translucent. The
discovery of their main internal features, therefore, depended on the
development, in the late nineteenth century, of a variety of stains that provided sufficient
contrast to make those features visible. Similarly, the far more powerful electron
microscope introduced in the early 1940s required the development of new
techniques for preserving and staining cells before the full complexities of their
internal fine structure could begin to emerge. To this day, microscopy often relies as
much on techniques for preparing the specimen as on the performance of the
microscope itself. In the following discussions, we therefore consider both
instruments and specimen preparation, beginning with the light microscope.

The images in Figure 9-1 illustrate a stepwise progression from a thumb to a
cluster of atoms. Each successive image represents a tenfold increase in
magnification. The naked eye can see features in the first two panels, the light microscope
allows us to see details corresponding to about the fourth or fifth panel, and the
electron microscope takes us to about the seventh or eighth panel. Figure 9-2
shows the sizes of various cellular and subcellular structures and the ranges of
size that different types of microscopes can visualize.

The Light Microscope Can Resolve Details 0.2 μm Apart

For well over 100 years, all microscopes were constrained by a fundamental
limitation: that a given type of radiation cannot be used to probe structural details
much smaller than its own wavelength. A limit to the resolution of a light
microscope was therefore set by the wavelength of visible light, which ranges from
about 0.4 μm (for violet) to 0.7 μm (for deep red). In practical terms, bacteria and
mitochondria, which are about 500 nm (0.5 μm) wide, are generally the smallest
objects whose shape we can clearly discern in the light microscope; details
smaller than this are obscured by effects resulting from the wavelike nature of
light. To understand why this occurs, we must follow the behavior of a beam of
light as it passes through the lenses of a microscope (Figure 9-3).

Figure 9-1 A sense of scale between living cells and atoms. Each diagram
shows an image magnified by a factor of ten in an imaginary progression from a
thumb, through skin cells, to a ribosome, to a cluster of atoms forming part of
one of the many protein molecules in our body. Atomic details of biological
macromolecules, as shown in the last two panels, are usually beyond the power of
the electron microscope. While color has been used here in all the panels, it is not a
feature of objects much smaller than the wavelength of light, so the last five panels
should really be in black and white.

Figure 9-2 Resolving power. Sizes of cells and their components are drawn on a logarithmic scale, indicating the range of
objects that can be readily resolved by the naked eye and in the light and electron microscopes. Note that new superresolution
microscopy techniques, discussed in detail later, allow an improvement in resolution by an order of magnitude compared with
conventional light microscopy.

Because of its wave nature, light does not follow the idealized straight ray
paths that geometrical optics predicts. Instead, light waves travel through an
optical system by many slightly different routes, like ripples in water, so that they
interfere with one another and cause optical diffraction effects. If two trains of
waves reaching the same point by different paths are precisely in phase, with crest
matching crest and trough matching trough, they will reinforce each other so as
to increase brightness. In contrast, if the trains of waves are out of phase, they will
interfere with each other in such a way as to cancel each other partly or entirely
(Figure 9-4). The interaction of light with an object changes the phase
relationships of the light waves in a way that produces complex interference effects. At
high magnification, for example, the shadow of an edge that is evenly illuminated
with light of uniform wavelength appears as a set of parallel lines (Figure 9-5),
whereas that of a circular spot appears as a set of concentric rings. For the same
reason, a single point seen through a microscope appears as a blurred disc, and
two point objects close together give overlapping images and may merge into one.
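The reinforcement and cancellation just described can be made concrete with a small numerical sketch. It assumes two idealized sinusoidal waves of the same wavelength and treats brightness as the square of the resultant amplitude; the function names are illustrative only and are not part of any microscopy software.

```python
import math

def resultant_amplitude(a1, a2, phase_difference):
    """Amplitude of the sum of two sinusoidal waves of equal wavelength.

    phase_difference is in radians: 0 means precisely in phase,
    pi means exactly out of phase.
    """
    squared = a1 * a1 + a2 * a2 + 2.0 * a1 * a2 * math.cos(phase_difference)
    # Guard against tiny negative values caused by floating-point rounding.
    return math.sqrt(max(squared, 0.0))

def relative_brightness(a1, a2, phase_difference):
    """Brightness taken as the square of the resultant amplitude (an assumption)."""
    return resultant_amplitude(a1, a2, phase_difference) ** 2

if __name__ == "__main__":
    cases = [("in phase", 0.0),
             ("quarter cycle apart", math.pi / 2),
             ("out of phase", math.pi)]
    for label, phase in cases:
        print(label, round(relative_brightness(1.0, 1.0, phase), 3))
```

For two waves of equal amplitude, the in-phase case gives four times the brightness of a single wave, whereas the out-of-phase case gives essentially none, which is the partial or complete cancellation shown in Figure 9-4.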


The following units of length are
commonly employed in microscopy:
μm (micrometer) = 10⁻⁶ m
nm (nanometer) = 10⁻⁹ m
Å (Ångström unit) = 10⁻¹⁰ m


Figure 9-3 A light microscope.
(A) Diagram showing the light path in a compound microscope. Light is focused on
the specimen by lenses in the condenser. A combination of objective lenses, tube
lenses, and eyepiece lenses is arranged to focus an image of the illuminated specimen
in the eye (on the retina). (B) A modern research light microscope. (B, courtesy of
Carl Zeiss Microscopy, GmbH.)

Although no amount of refinement of the lenses can overcome the diffraction
limit imposed by the wavelike nature of light, other ways of cleverly bypassing this
limit have emerged, creating so-called superresolution imaging techniques that
can even detect the position of single molecules.

The limiting separation at which two objects appear distinct, the so-called
limit of resolution, depends on both the wavelength of the light and the numerical
aperture of the lens system used. The numerical aperture affects the light-gathering
ability of the lens and is related both to the angle of the cone of light that
can enter it and to the refractive index of the medium the lens is operating in;
the wider the microscope opens its eye, so to speak, the more sharply it can see
(Figure 9-6). The refractive index is the ratio of the speed of light in a vacuum to
the speed of light in a particular transparent medium. For example, for water this
is 1.33, meaning that light travels 1.33 times slower in water than in a vacuum.
Under the best conditions, with violet light (wavelength = 0.4 μm) and a numerical
aperture of 1.4, the basic light microscope can theoretically achieve a limit
of resolution of about 0.2 μm, or 200 nm. Some microscope makers at the end
of the nineteenth century achieved this resolution, and it is now routinely matched in
contemporary, factory-produced microscopes. Although it is possible to enlarge
an image as much as we want (for example, by projecting it onto a screen), it
is not possible, in a conventional light microscope, to resolve two objects
that are separated by less than about 0.2 μm; they will appear
as a single object. It is important, however, to distinguish between resolution and
detection. If a small object, below the resolution limit, itself emits light, then we
may still be able to see or detect it. Thus, we can see a single fluorescently labeled
microtubule even though it is about ten times thinner than the resolution limit of
the light microscope. Diffraction effects, however, will cause it to appear blurred
and at least 0.2 μm thick (see Figure 9-16). In a similar way, we can see the stars in
the night sky, even though their diameters are far below the angular resolution of
our unaided eyes: they all appear as similar, slightly blurred points of light,
differing only in their color and brightness.
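The numbers quoted above can be checked with a short calculation based on the formula given in Figure 9-6, resolution = 0.61 λ / (n sin θ), where n sin θ is the numerical aperture. The sketch below is only an illustration: the function names are our own, and the refractive index and half-angle used for the oil-immersion example are assumed round values rather than the specification of any particular lens.

```python
import math

def aperture_from_angle(refractive_index, half_angle_degrees):
    """Numerical aperture n * sin(theta), with theta the half-angle of the cone of rays."""
    return refractive_index * math.sin(math.radians(half_angle_degrees))

def resolution_limit_um(wavelength_um, numerical_aperture):
    """Limit of resolution in micrometers: 0.61 * wavelength / numerical aperture."""
    if numerical_aperture <= 0:
        raise ValueError("numerical aperture must be positive")
    return 0.61 * wavelength_um / numerical_aperture

if __name__ == "__main__":
    # Best case from the text: violet light (0.4 um) and a numerical aperture of 1.4.
    print(round(resolution_limit_um(0.4, 1.4), 3))
    # White light (0.53 um, as in Figure 9-6) with a dry lens at its ceiling of 1.0.
    print(round(resolution_limit_um(0.53, 1.0), 3))
    # Assumed oil-immersion values (n = 1.51, half-angle = 68 degrees), illustration only.
    print(round(aperture_from_angle(1.51, 68.0), 2))
```

With violet light and a numerical aperture of 1.4, the formula gives a little over 0.17 μm, consistent with the limit of about 0.2 μm stated in the text; with white light and a dry lens the limit is noticeably coarser.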

Figure 9-4 Interference between light waves. When two light waves combine in
phase, the amplitude of the resultant wave is larger and the brightness is increased.
Two light waves that are out of phase cancel each other partly and produce a
wave whose amplitude, and therefore brightness, is decreased.

Figure 9-5 Images of an edge and of a point of light. (A) The interference effects,
or fringes, seen at high magnification when light of a specific wavelength passes the
edge of a solid object placed between the light source and the observer. (B) The
image of a point source of light. Diffraction spreads this out into a complex, circular
pattern, whose width depends on the numerical aperture of the optical system:
the smaller the aperture, the bigger (more blurred) the diffracted image. Two point
sources can be just resolved when the center of the image of one lies on the first
dark ring in the image of the other: this is used to define the limit of resolution.

LENSES: the objective lens collects a cone of light rays to create an image; the
condenser lens focuses a cone of light rays onto each point of the specimen.

RESOLUTION: the resolving power of the microscope depends on the width of the
cone of illumination and therefore on both the condenser and the objective lens. It is
calculated using the formula

$$\text{resolution} = \frac{0.61\,\lambda}{n \sin\theta}$$

where:

θ = half the angular width of the cone of rays collected by the objective lens
from a typical point in the central region of the specimen (since the
maximum width is 180°, sin θ has a maximum value of 1)
n = the refractive index of the medium (usually air or oil) separating the
specimen from the objective and condenser lenses
λ = the wavelength of light used (for white light a figure of 0.53 μm is commonly
assumed)

NUMERICAL APERTURE: n sin θ in the equation above is called the numerical aperture
of the lens and is a function of its light-collecting ability. For dry lenses this cannot be
more than 1, but for oil-immersion lenses it can be as high as 1.4. The higher the
numerical aperture, the greater the resolution and the brighter the image (brightness
is important in fluorescence microscopy). However, this advantage does necessitate
very short working distances and a very small depth of field.

Figure 9-6 Numerical aperture. The path of light rays passing through a transparent
specimen in a microscope illustrates the concept of numerical aperture and its
relation to the limit of resolution.

Photon Noise Creates Additional Limits to Resolution When Light
Levels Are Low

Any image, whether produced by an electron microscope or by an optical
microscope, is made by particles (electrons or photons) striking a detector of some
sort. But these particles are governed by quantum mechanics, so the numbers
reaching the detector are predictable only in a statistical sense. Finite samples,
collected by imaging for a limited period of time (that is, by taking a snapshot), will
show random variation: successive snapshots of the same scene will not be exactly
identical. Moreover, every detection method has some level of background
signal or noise, adding to the statistical uncertainty. With bright illumination,
corresponding to very large numbers of photons or electrons, the features of the imaged
specimen are accurately determined based on the distribution of these particles
at the detector. However, with smaller numbers of particles, the structural details
of the specimen are obscured by the statistical fluctuations in the numbers of
particles detected in each region, which give the image a speckled appearance and
limit its precision. The term noise describes this random variability.
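A rough feel for how this noise scales with light level comes from a simple model. The sketch below assumes that the number of particles counted at each detector position follows Poisson statistics, so that the random fluctuation is the square root of the mean count; this is a common modeling assumption that the text itself does not spell out, and the function names are illustrative.

```python
import math

def shot_noise(mean_count):
    """Standard deviation of a Poisson-distributed count with the given mean."""
    if mean_count < 0:
        raise ValueError("mean count cannot be negative")
    return math.sqrt(mean_count)

def signal_to_noise(mean_signal, mean_background=0.0):
    """Signal divided by the fluctuation of the total (signal + background) count."""
    total = mean_signal + mean_background
    if total <= 0:
        return 0.0
    return mean_signal / shot_noise(total)

if __name__ == "__main__":
    # Hypothetical counts per pixel, from dim to bright illumination.
    for photons in (10, 100, 10_000):
        print(photons, round(signal_to_noise(photons), 1))
    # The same 100 signal photons on top of 300 background counts.
    print(round(signal_to_noise(100, 300), 1))
```

Under this assumption, collecting 100 times more particles improves the signal-to-noise ratio only tenfold, and a background signal lowers it further, which is why dim images look speckled.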

Living Cells Are Seen Clearly in a Phase-Contrast or a 
Differential-Interference-Contrast Microscope 

There are many ways in which contrast in a specimen can be generated (Figure
9-7A). While fixing and staining a specimen can generate contrast through color,
microscopists have always been challenged by the possibility that some
components of the cell may be lost or distorted during specimen preparation. The only
certain way to avoid the problem is to examine cells while they are alive, without
fixing or freezing. For this purpose, light microscopes with special optical systems
are especially useful.

In the normal bright-field microscope, light passing through a cell in culture
forms the image directly. Another system, dark-field microscopy, exploits the fact
that light rays can be scattered in all directions by small objects in their path. If
oblique lighting from the condenser is arranged, which does not directly enter
the objective, focused but unstained objects in a living cell can scatter the rays,
some of which then enter the objective to create a bright image against a black
background (Figure 9-7B).

When light passes through a living cell, the phase of the light wave is changed
according to the cell's refractive index: a relatively thick or dense part of the cell,
such as a nucleus, slows the light passing through it. The phase of the light,
consequently, is shifted relative to light that has passed through an adjacent thinner
region of the cytoplasm (Figure 9-7C). The phase-contrast microscope and, in a
more complex way, the differential-interference-contrast microscope increase
these phase differences so that the waves are more nearly out of phase,
producing amplitude differences when the sets of waves recombine, thereby creating an
image of the cell's structure. Both types of light microscopy are widely used to
visualize living cells (see Movie 17.2). Figure 9-8 compares images of the same
cell obtained by four kinds of light microscopy.

Figure 9-7 Contrast in light microscopy. (A) The stained portion of the cell will absorb light of some wavelengths, which
depends on the stain, but will allow other wavelengths to pass through it. A colored image of the cell is thereby obtained that is
visible in the normal bright-field light microscope. (B) In the dark-field microscope, oblique rays of light focused on the specimen
do not enter the objective lens, but light that is scattered by components in the living cell can be collected to produce a bright
image on a dark background. (C) Light passing through the unstained living cell experiences very little change in amplitude,
and the structural details cannot be seen even if the image is highly magnified. The phase of the light, however, is altered by
its passage through either thicker or denser parts of the cell, and small phase differences can be made visible by exploiting
interference effects using a phase-contrast or a differential-interference-contrast microscope.
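The size of the phase shift that these microscopes exploit can be estimated with a short calculation. The sketch below assumes the standard optical relation that the optical path difference equals the difference in refractive index multiplied by the thickness traversed, and that one full wavelength of path difference corresponds to one complete cycle of phase. The refractive indices and thickness in the example are hypothetical round numbers chosen for illustration, not measured values.

```python
import math

def optical_path_difference_um(n_region, n_reference, thickness_um):
    """Extra optical path (micrometers) through a region relative to its surroundings."""
    return (n_region - n_reference) * thickness_um

def phase_shift_cycles(n_region, n_reference, thickness_um, wavelength_um):
    """Phase shift expressed as a fraction of one full wave cycle."""
    return optical_path_difference_um(n_region, n_reference, thickness_um) / wavelength_um

def phase_shift_radians(n_region, n_reference, thickness_um, wavelength_um):
    """The same phase shift expressed in radians."""
    return 2.0 * math.pi * phase_shift_cycles(
        n_region, n_reference, thickness_um, wavelength_um)

if __name__ == "__main__":
    # Hypothetical values: a denser region (n = 1.39) beside cytoplasm (n = 1.37),
    # 5 um thick, viewed with green light of wavelength 0.55 um.
    print(round(phase_shift_cycles(1.39, 1.37, 5.0, 0.55), 2))
```

With these assumed values the light is delayed by a bit less than one-fifth of a wavelength: too small a change to alter the brightness directly, but large enough to be converted into visible contrast by interference.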


Phase-contrast, differential-interference-contrast, and dark-field microscopy
make it possible to watch the movements involved in such processes as mitosis
and cell migration. Since many cellular motions are too slow to be seen in real
time, it is often helpful to make time-lapse movies in which the camera records
successive frames separated by a short time delay, so that when the resulting
picture series is played at normal speed, events appear greatly sped up.
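The apparent speed-up of a time-lapse movie follows directly from the delay between frames and the playback rate. The sketch below shows the arithmetic; the interval of 30 seconds and the playback rate of 30 frames per second are hypothetical values chosen for illustration, and the function names are our own.

```python
def speedup_factor(capture_interval_s, playback_fps):
    """How many times faster than real time events appear on playback."""
    if capture_interval_s <= 0 or playback_fps <= 0:
        raise ValueError("interval and frame rate must be positive")
    return capture_interval_s * playback_fps

def playback_duration_s(real_duration_s, capture_interval_s, playback_fps):
    """Length of the finished movie in seconds, counting the frame taken at time zero."""
    frames = int(real_duration_s // capture_interval_s) + 1
    return frames / playback_fps

if __name__ == "__main__":
    # One frame every 30 s, played back at 30 frames per second (assumed values).
    print(speedup_factor(30, 30))
    # One hour of recording under the same assumed settings.
    print(round(playback_duration_s(3600, 30, 30), 2))
```

With these settings an hour of cell behavior is compressed into about four seconds of movie, a 900-fold acceleration.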

Images Can Be Enhanced and Analyzed by Digital Techniques 

In recent years, electronic, or digital, imaging systems, and the associated
technology of image processing, have had a major impact on light microscopy.
Certain practical limitations of microscopes relating to imperfections in the optical
system have been largely overcome. Electronic imaging systems have also
circumvented two fundamental limitations of the human eye: the eye cannot see
well in extremely dim light, and it cannot perceive small differences in light
intensity against a bright background. To increase our ability to observe cells in these
difficult conditions, we can attach a sensitive digital camera to a microscope.
These cameras detect light by means of charge-coupled devices (CCDs), or high
sensitivity complementary metal-oxide semiconductor (CMOS) sensors, similar
to those found in digital cameras. Such image sensors are 10 times more sensitive
than the human eye and can detect 100 times more intensity levels. It is therefore
possible to observe cells for long periods at very low light levels, thereby avoiding
the damaging effects of prolonged bright light (and heat). Such low-light
cameras are especially important for viewing fluorescent molecules in living cells, as
explained below.

Figure 9-8 Four types of light microscopy. Four images are shown of the same fibroblast cell in culture. All images can
be obtained with most modern microscopes by interchanging optical components. (A) Bright-field microscopy, in which light
is transmitted straight through the specimen. (B) Phase-contrast microscopy, in which phase alterations of light transmitted
through the specimen are translated into brightness changes. (C) Differential-interference-contrast microscopy, which highlights
edges where there is a steep change of refractive index. (D) Dark-field microscopy, in which the specimen is lit from the side and
only the scattered light is seen.

Because images produced by digital cameras are in electronic form, they can
be processed in various ways to extract latent information. Such image processing
makes it possible to compensate for several optical faults in microscopes.
Moreover, by digital image processing, contrast can be greatly enhanced to overcome
the eye's limitations in detecting small differences in light intensity, and
background irregularities in the optical system can be digitally subtracted. This
procedure reveals small transparent objects that were previously impossible to
distinguish from the background.
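The two operations mentioned above, background subtraction and contrast enhancement, can be sketched in a few lines. This is a minimal illustration that represents an image as a list of rows of pixel values and uses simple linear stretching; it is one of many possible approaches and is not meant to describe the algorithm of any particular imaging system. The pixel values are invented for the example.

```python
def subtract_background(image, background):
    """Subtract a background frame pixel by pixel, clamping negative results to zero."""
    return [[max(pixel - bg, 0) for pixel, bg in zip(row, bg_row)]
            for row, bg_row in zip(image, background)]

def stretch_contrast(image, out_max=255):
    """Linearly rescale so the darkest pixel maps to 0 and the brightest to out_max."""
    flat = [pixel for row in image for pixel in row]
    lowest, highest = min(flat), max(flat)
    if highest == lowest:
        # A perfectly uniform image carries no contrast to enhance.
        return [[0 for _ in row] for row in image]
    scale = out_max / (highest - lowest)
    return [[round((pixel - lowest) * scale) for pixel in row] for row in image]

if __name__ == "__main__":
    # A faint object (center pixel) on an unevenly illuminated field.
    raw = [[101, 103, 105],
           [101, 108, 105],
           [101, 103, 105]]
    # The same field of view recorded without the object.
    background = [[100, 102, 104],
                  [100, 102, 104],
                  [100, 102, 104]]
    cleaned = subtract_background(raw, background)
    print(stretch_contrast(cleaned))
```

In the raw image the object is only slightly brighter than the uneven illumination around it; after subtraction and stretching it stands out at full brightness against a uniformly dark field.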


Intact Tissues Are Usually Fixed and Sectioned Before Microscopy 


Because most tissue samples are too thick for their individual cells to be examined
directly at high resolution, they are often cut into very thin transparent slices, or
sections. To preserve the cells within the tissue they must be treated with a
fixative. Common fixatives include glutaraldehyde, which forms covalent bonds with
the free amino groups of proteins, cross-linking them so they are stabilized and
locked into position.

Because tissues are generally soft and fragile, even after fixation, they need to
be either frozen or embedded in a supporting medium before being sectioned.
The usual embedding media are waxes or resins. In liquid form, these media both
permeate and surround the fixed tissue; they can then be hardened (by cooling or
by polymerization) to form a solid block, which is readily sectioned with a
microtome. This is a machine with a sharp blade, usually of steel or glass, which
operates like a meat-slicer (Figure 9-9). The sections (typically 0.5-10 μm thick) are
then laid flat on the surface of a glass microscope slide.

There is little in the contents of most cells (which are 70% water by weight) to
impede the passage of light rays. Thus, most cells in their natural state, even if fixed
and sectioned, are almost invisible in an ordinary light microscope. We have seen
that cellular components can be made visible by techniques such as
phase-contrast and differential-interference-contrast microscopy, but these methods tell us
almost nothing about the underlying chemistry. There are three main approaches
to working with thin tissue sections that reveal differences in types of molecules
that are present.

First, and traditionally, sections can be stained with organic dyes that have
some specific affinity for particular subcellular components. The dye
hematoxylin, for example, has an affinity for negatively charged molecules and therefore
reveals the distribution of DNA, RNA, and acidic proteins in a cell (Figure 9-10).
The chemical basis for the specificity of many dyes, however, is not known.


Figure 9-9 Making tissue sections.
This illustration shows how an embedded tissue is sectioned with a microtome in
preparation for examination in the light microscope. (Diagram label: ribbon of
sections on glass slide, stained and mounted under a glass cover slip.)

Figure 9-10 Staining of cell components.
(A) This section of cells in the urine-collecting ducts of the kidney was stained
with hematoxylin and eosin, two dyes commonly used in histology. Each duct
is made of closely packed cells (with nuclei stained red) that form a ring.
The ring is surrounded by extracellular matrix, stained purple. (B) This section
of a young plant root is stained with two dyes, safranin and fast green. The fast
green stains the cellulosic cell walls while the safranin stains the lignified xylem cell
walls bright red. (A, from P.R. Wheater et al., Functional Histology, 2nd ed. London:
Churchill Livingstone, 1987; B, courtesy of Stephen Grace.)


Second, sectioned tissues can be used to visualize specific patterns of
differential gene expression. In situ hybridization, discussed earlier (see Figure 8-34),
reveals the cellular distribution and abundance of specific expressed RNA
molecules in sectioned material or in whole mounts of small organisms or organs.
This is particularly effective when used in conjunction with fluorescent probes
(Figure 9-11).

A third and very sensitive approach, generally and widely applicable for
localizing proteins of interest, also depends on using fluorescent probes and markers,
as we explain next.


Specific Molecules Can Be Located in Cells by Fluorescence
Microscopy

Fluorescent molecules absorb light at one wavelength and emit it at another,
longer wavelength (Figure 9-12A). If we illuminate such a molecule at its absorbing
wavelength and then view it through a filter that allows only light of the emitted
wavelength to pass, it will glow against a dark background. Because the
background is dark, even a minute amount of the glowing fluorescent dye can be
detected. In contrast, the same number of molecules of a nonfluorescent stain,
viewed conventionally, would be practically indiscernible because the absorption
of light by molecules in the stain would result in only the faintest tinge of color in
the light transmitted through that part of the specimen.
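The shift to a longer wavelength reflects a loss of energy between absorption and emission, because the energy of a photon is inversely proportional to its wavelength (E = hc/λ). The sketch below computes this energy difference. The physical constants are the defined SI values; the excitation and emission wavelengths of 490 nm and 520 nm are assumed, approximate figures for a green-emitting dye and are used purely for illustration.

```python
PLANCK_J_S = 6.62607015e-34       # Planck constant, joule seconds
LIGHT_SPEED_M_S = 299_792_458.0   # speed of light in a vacuum, meters per second
JOULES_PER_EV = 1.602176634e-19   # one electron volt in joules

def photon_energy_ev(wavelength_nm):
    """Energy of a single photon in electron volts, from E = h * c / wavelength."""
    if wavelength_nm <= 0:
        raise ValueError("wavelength must be positive")
    wavelength_m = wavelength_nm * 1e-9
    return PLANCK_J_S * LIGHT_SPEED_M_S / wavelength_m / JOULES_PER_EV

def energy_lost_ev(excitation_nm, emission_nm):
    """Energy given up between absorbing one photon and emitting another."""
    if emission_nm < excitation_nm:
        raise ValueError("fluorescence emission is at a longer wavelength than excitation")
    return photon_energy_ev(excitation_nm) - photon_energy_ev(emission_nm)

if __name__ == "__main__":
    # Assumed wavelengths for illustration: blue excitation, green emission.
    print(round(photon_energy_ev(490.0), 2), round(photon_energy_ev(520.0), 2))
    print(round(energy_lost_ev(490.0, 520.0), 3))
```

For these assumed wavelengths each absorbed photon carries roughly 2.5 eV and each emitted photon roughly 2.4 eV, so a small fraction of the absorbed energy is lost within the molecule before the light is re-emitted.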

Figure 9-11 RNA in situ hybridization.
As described in Chapter 8 (see Figure 8-62), it is possible to visualize
the distribution of different RNAs in tissues using in situ hybridization. Here,
the transcription pattern of five different genes involved in patterning the early fly
embryo is revealed in a single embryo. Each RNA probe has been fluorescently
labeled in a different way, some directly and some indirectly; the resulting images
are displayed each in a different color (“false-colored”) and combined to give an
image where different color combinations represent different sets of genes expressed.
The genes whose expression pattern is revealed here are wingless (yellow),
engrailed (blue), short gastrulation (red), intermediate neuroblasts defective (green),
and muscle specific homeobox (purple). (From D. Kosman et al., Science 305:846,
2004. With permission from AAAS.)

Figure 9-12 Fluorescence and the fluorescence microscope. (A) An orbital electron of a fluorochrome molecule can be
raised to an excited state following the absorption of a photon. Fluorescence occurs when the electron returns to its ground
state and emits a photon of light at a longer wavelength. Too much exposure to light, or too bright a light, can also destroy
the fluorochrome molecule, in a process called photobleaching. (B) In the fluorescence microscope, a filter set consists of two
barrier filters (1 and 3) and a dichroic (beam-splitting) mirror (2). This example shows the filter set for detection of the fluorescent
molecule fluorescein. High-numerical-aperture objective lenses are especially important in this type of microscopy because,
for a given magnification, the brightness of the fluorescent image is proportional to the fourth power of the numerical aperture
(see also Figure 9-6).

The fluorescent dyes used for staining cells are visualized with a fluorescence
microscope. This microscope is similar to an ordinary light microscope except
that the illuminating light, from a very powerful source, is passed through two
sets of filters: one to filter the light before it reaches the specimen and one to
filter the light obtained from the specimen. The first filter passes only the
wavelengths that excite the particular fluorescent dye, while the second filter blocks
out this light and passes only those wavelengths emitted when the dye fluoresces
(Figure 9-12B).
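The caption of Figure 9-12 notes that, for a given magnification, the brightness of a fluorescent image is proportional to the fourth power of the numerical aperture of the objective. The short sketch below turns that statement into a comparison between two lenses; the function name and the numerical apertures chosen are illustrative assumptions only.

```python
def relative_fluorescence_brightness(numerical_aperture, reference_aperture):
    """Brightness relative to a reference lens at the same magnification.

    Uses the fourth-power dependence on numerical aperture stated
    in the caption of Figure 9-12.
    """
    if numerical_aperture <= 0 or reference_aperture <= 0:
        raise ValueError("numerical apertures must be positive")
    return (numerical_aperture / reference_aperture) ** 4

if __name__ == "__main__":
    # Hypothetical comparison: an oil-immersion lens (1.4) against a dry lens (0.7).
    print(round(relative_fluorescence_brightness(1.4, 0.7), 1))
```

Doubling the numerical aperture therefore gives a sixteenfold brighter fluorescent image, which helps explain why high-numerical-aperture objectives are preferred for this kind of microscopy.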

Fluorescence microscopy is most often used to detect specific proteins or
other molecules in cells and tissues. A very powerful and widely used technique
is to couple fluorescent dyes to antibody molecules, which then serve as highly
specific and versatile staining reagents that bind selectively to the particular
macromolecules they recognize in cells or in the extracellular matrix. Two fluorescent
dyes that have been commonly used for this purpose are fluorescein, which emits
an intense green fluorescence when excited with blue light, and rhodamine, which
emits deep red fluorescence when excited with green-yellow light (Figure 9-13).
By coupling one antibody to fluorescein and another to rhodamine, the
distributions of different molecules can be compared in the same cell; the two molecules
are visualized separately in the microscope by switching back and forth between
two sets of filters, each specific for one dye. As shown in Figure 9-14, three
fluorescent dyes can be used in the same way to distinguish among three types of
molecules in the same cell. Many newer fluorescent dyes, such as Cy3, Cy5, and
the Alexa dyes, have been specifically developed for fluorescence microscopy
(see Figure 9-13) but, like many organic fluorochromes, they fade fairly rapidly
when continuously illuminated. More stable fluorochromes have been developed
based on inorganic chemistry. Tiny crystals of semiconductor material, called
nanoparticles, or quantum dots, can be excited to fluoresce by a broad spectrum
of blue light. Their emitted light has a color that depends on the exact size of the
nanocrystal, between 2 and 10 nm in diameter, and additionally the fluorescence
fades only slowly with time (Figure 9-15). These nanoparticles, when coupled to
other probes such as antibodies, are therefore ideal for tracking molecules over
time. If introduced into a living cell, in an embryo for example, the progeny of that
cell can be followed many days later by their fluorescence, allowing cell lineages
to be tracked.

Figure 9-13 Fluorescent probes. The maximum excitation and emission
wavelengths of several commonly used fluorescent probes are shown in
relation to the corresponding colors of the spectrum. The photon emitted
by a fluorescent molecule is necessarily of lower energy (longer wavelength)
than the absorbed photon and this accounts for the difference between the
excitation and emission peaks. CFP, GFP, YFP, and RFP are cyan, green,
yellow, and red fluorescent proteins, respectively. DAPI is widely used as a
general fluorescent DNA probe, which absorbs ultraviolet light and fluoresces
bright blue. FITC is an abbreviation for fluorescein isothiocyanate, a widely
used derivative of fluorescein, which fluoresces bright green. The other
probes are all commonly used to fluorescently label antibodies and other
proteins. The use of fluorescent proteins will be discussed later in the chapter.

Figure 9-14 Different fluorescent probes can be visualized in the same cell. In this
composite micrograph of a cell in mitosis, three different fluorescent probes have
been used to label three different cellular components (Movie 9.1). The spindle
microtubules are revealed with a green fluorescent antibody, centromeres with a
red fluorescent antibody, and the DNA of the condensed chromosomes with the
blue fluorescent dye DAPI. (Courtesy of Kevin F. Sullivan.)


[Figure 9-15 panel labels: 0 sec, 60 sec, 180 sec]

Figure 9-15 Fluorescent nanoparticles or quantum dots. (A) Quantum dots are tiny particles
of cadmium selenide, a semiconductor, with a coating to make them water-soluble. They can
be coupled to protein molecules such as antibodies or streptavidin and, when introduced into
a cell, will bind to a target protein of interest. Different-sized quantum dots emit light of different
colors (the larger the dot, the longer the wavelength), but they are all excited by the same blue
light. Quantum dots can keep shining for weeks, unlike most fluorescent organic dyes. (B) In this
cell, microtubules are labeled (green) with an organic fluorescent dye (Alexa 488), while a nuclear
protein is stained (red) with quantum dots bound to streptavidin. On continuous exposure to strong
blue light, the fluorescent dye fades quickly while the quantum dots continue to shine. (C) In this
cell, the labeling pattern is reversed; a nuclear protein is labeled (green) with an organic fluorescent
dye (Alexa 488), while microtubules are labeled (red) with quantum dots. Again, the quantum dots
far outlast the fluorescent dye. (B and C, from L. Medintz et al., Nat. Mater. 4:435-446, 2005. With
permission from Macmillan Publishers Ltd.)

Figure 9-16 Immunofluorescence.
(A) A transmission electron micrograph of the periphery of a cultured epithelial cell
showing the distribution of microtubules and other filaments. (B) The same area
stained with fluorescent antibodies against tubulin, the protein that assembles to
form microtubules, using the technique of indirect immunocytochemistry (see Figure
9-17). Red arrows indicate individual microtubules that are readily recognizable
in both images. Note that, because of diffraction effects, the microtubules in the
light microscope appear 0.2 μm wide rather than their true width of 0.025 μm.
(From M. Osborn, R. Webster and K. Weber, J. Cell Biol. 77:R27-R34,
1978. With permission from The Rockefeller University Press.)

Later in the chapter, additional fluorescence microscopy methods will be
discussed that can be used to monitor changes in the concentration and location of
specific molecules inside living cells.


Antibodies Can Be Used to Detect Spécifie Molécules 

Antibodies are proteins produced by the vertebrate immune system as a defense
against infection (discussed in Chapter 24). They are unique among proteins in
that they are made in billions of different forms, each with a different binding site
that recognizes a specific target molecule (or antigen). The precise antigen
specificity of antibodies makes them powerful tools for the cell biologist. When labeled
with fluorescent dyes, antibodies are invaluable for locating specific molecules
in cells by fluorescence microscopy (Figure 9-16); labeled with electron-dense
particles such as colloidal gold spheres, they are used for similar purposes in the
electron microscope (discussed below). The antibodies employed in microscopy
are commonly either purified from antiserum so as to remove all nonspecific
antibodies, or they are specific monoclonal antibodies that only recognize the target
molecule.

When we use antibodies as probes to detect and assay specific molecules in
cells, we frequently use chemical methods to amplify the fluorescent signal they
produce. For example, although a marker molecule such as a fluorescent dye
can be linked directly to an antibody (the primary antibody), a stronger signal
is achieved by using an unlabeled primary antibody and then detecting it with a
group of labeled secondary antibodies that bind to it (Figure 9-17). This process is
called indirect immunocytochemistry.

Some amplification methods use an enzyme as a marker molecule attached
to the secondary antibody. The enzyme alkaline phosphatase, for example, in the
presence of appropriate chemicals, produces inorganic phosphate that in turn


[Figure 9-17 diagram labels: primary antibody (rabbit antibody directed against antigen A); immobilized antigen A; secondary antibodies (marker-coupled antibodies directed against rabbit antibodies); marker]

Figure 9-17 Indirect immunocytochemistry.

This detection method is very sensitive
because many molecules of the secondary
antibody recognize each primary antibody.
The secondary antibody is covalently
coupled to a marker molecule that makes
it readily detectable. Commonly used
marker molecules include fluorescent
probes (for fluorescence microscopy), the
enzyme horseradish peroxidase (for either
conventional light microscopy or electron
microscopy), colloidal gold spheres (for
electron microscopy), and the enzymes
alkaline phosphatase or peroxidase (for
biochemical detection).
leads to the local formation of a colored precipitate. This reveals the location of
the secondary antibody and hence the location of the antibody-antigen complex.
Since each enzyme molecule acts catalytically to generate many thousands of
molecules of product, even tiny amounts of antigen can be detected. Although
the enzyme amplification makes enzyme-linked methods sensitive, diffusion of
the colored precipitate away from the enzyme limits the spatial resolution of this
method for microscopy, and fluorescent labels are usually used for the most
sensitive and precise optical localization.


Imaging of Complex Three-Dimensional Objects Is Possible with 
the Optical Microscope 

For ordinary light microscopy, as we have seen, a tissue has to be sliced into thin
sections to be examined; the thinner the section, the crisper the image. Since
information about the third dimension is lost upon sectioning, how, then, can we
get a picture of the three-dimensional architecture of a cell or tissue, and how can
we view the microscopic structure of a specimen that, for one reason or another,
cannot first be sliced into sections? Although an optical microscope is focused on
a particular focal plane within a three-dimensional specimen, all the other parts
of the specimen, above and below the plane of focus, are also illuminated and
the light originating from these regions contributes to the image as “out-of-focus”
blur. This can make it very hard to interpret the image in detail and can lead to fine
image structure being obscured by the out-of-focus light.

Two distinct but complementary approaches solve this problem: one is
computational, the other optical. These three-dimensional microscopic imaging
methods make it possible to focus on a chosen plane in a thick specimen while
rejecting the light that comes from out-of-focus regions above and below that
plane. Thus one sees a crisp, thin optical section. From a series of such optical
sections taken at different depths and stored in a computer, a three-dimensional
image can be reconstructed. The methods do for the microscopist what the
computed tomography (CT) scanner does (by different means) for the radiologist
investigating a human body: both machines give detailed sectional views of the
interior of an intact structure.

The computational approach is often called image deconvolution. To
understand how it works, remember that the wavelike nature of light means that the
microscope lens system produces a small blurred disc as the image of a point
light source (see Figure 9-5), with increased blurring if the point source lies above
or below the focal plane. This blurred image of a point source is called the point
spread function (see Figure 9-36). An image of a complex object can then be
thought of as being built up by replacing each point of the specimen by a
corresponding blurred disc, resulting in an image that is blurred overall. For
deconvolution, we first obtain a series of (blurred) images, usually with a cooled CCD
camera or more recently a CMOS camera, focusing the microscope in turn on a
series of focal planes; in effect, this yields a (blurred) three-dimensional image. Digital
processing of the stack of digital images then removes as much of the blur as
possible. In essence, the computer program uses the measured point spread
function of a point source of light from that microscope to determine what the effect
of the blurring would have been on the image, and then applies an equivalent
“deblurring” (deconvolution), turning the blurred three-dimensional image into
a series of clean optical sections, albeit still constrained by the diffraction limit.
Figure 9-18 shows an example.
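
As a rough illustration of the computational idea, the following Python sketch blurs a synthetic two-dimensional specimen with a point spread function and then applies a regularized inverse filter (a Wiener-style filter) to undo part of the blur. The Gaussian point spread function, the two-point specimen, and the noise_to_signal parameter are all assumptions made for this example; practical deconvolution software typically works with a measured three-dimensional point spread function and often uses iterative algorithms instead.

```python
import numpy as np

def gaussian_psf(shape, sigma):
    """Hypothetical stand-in for a measured point spread function (PSF)."""
    ys, xs = np.indices(shape)
    cy, cx = shape[0] // 2, shape[1] // 2
    psf = np.exp(-((ys - cy) ** 2 + (xs - cx) ** 2) / (2.0 * sigma ** 2))
    return psf / psf.sum()  # normalize so total intensity is preserved

def blur(image, psf):
    """Forward model: every specimen point is replaced by a copy of the PSF
    (circular convolution, computed in the Fourier domain)."""
    otf = np.fft.fft2(np.fft.ifftshift(psf))
    return np.real(np.fft.ifft2(np.fft.fft2(image) * otf))

def wiener_deconvolve(blurred, psf, noise_to_signal=1e-3):
    """Regularized inverse filter; noise_to_signal is an assumed tuning value
    that keeps weakly transmitted frequencies from being over-amplified."""
    otf = np.fft.fft2(np.fft.ifftshift(psf))
    inverse_filter = np.conj(otf) / (np.abs(otf) ** 2 + noise_to_signal)
    return np.real(np.fft.ifft2(np.fft.fft2(blurred) * inverse_filter))

# Synthetic specimen: two point sources on a dark background.
specimen = np.zeros((64, 64))
specimen[20, 20] = 1.0
specimen[40, 44] = 1.0

psf = gaussian_psf(specimen.shape, sigma=2.0)
blurred = blur(specimen, psf)
restored = wiener_deconvolve(blurred, psf)

print("squared error before:", np.sum((blurred - specimen) ** 2))
print("squared error after: ", np.sum((restored - specimen) ** 2))
```

The restored image is still not a perfect copy of the specimen: spatial frequencies that the point spread function barely transmits cannot be recovered, which is one way of seeing why the result remains constrained by the diffraction limit.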

The Confocal Microscope Produces Optical Sections by Excluding 
Out-of-Focus Light 

The confocal microscope achieves a result similar to that of deconvolution, but
does so by manipulating the light before it is measured; it is an analog technique
rather than a digital one. The optical details of the confocal microscope are
complex, but the basic idea is simple, as illustrated in Figure 9-19, and the results are




Figure 9-18 Image deconvolution. 

(A) A light micrograph of the large polytene 
chromosomes from Drosophila, stained 
with a fluorescent DNA-binding dye. 

(B) The same field of view after image 
deconvolution clearly reveals the banding 
pattern on the chromosomes. Each band 
is about 0.25 μm thick, approaching the
diffraction limit of the light microscope. 
(Courtesy of the John Sedat Laboratory.) 
Figure 9-19 The confocal fluorescence
microscope. (A) This simplified diagram
shows that the basic arrangement of
optical components is similar to that of the
standard fluorescence microscope shown
in Figure 9-12, except that a laser is used
to illuminate a small pinhole whose image
is focused at a single point in the
three-dimensional (3-D) specimen. (B) Emitted
fluorescence from this focal point in the
specimen is focused at a second (confocal)
pinhole. (C) Emitted light from elsewhere
in the specimen is not focused at the
pinhole and therefore does not contribute
to the final image. By scanning the beam
of light across the specimen, a very sharp
two-dimensional image of the exact plane
of focus is built up that is not significantly
degraded by light from other regions of the
specimen.


[Figure 9-19 diagram labels: fluorescent specimen is illuminated with a focused point of light from a pinhole; emitted fluorescent light from in-focus point is focused at pinhole and reaches detector; emitted light from out-of-focus point is out of focus at pinhole and is largely excluded from detector]


far superior to those obtained by conventional light microscopy (Figure 9-20A 
and B). 

The confocal microscope is generally used with fluorescence optics (see
Figure 9-12), but instead of illuminating the whole specimen at once, in the usual
way, the optical system at any instant focuses a spot of light onto a single point at
a specific depth in the specimen. This requires a source of pinpoint illumination
that is usually supplied by a laser whose light has been passed through a pinhole.
The fluorescence emitted from the illuminated material is collected at a suitable
light detector and used to generate an image. A pinhole aperture is placed in front
of the detector, at a position that is confocal with the illuminating pinhole, that
is, precisely where the rays emitted from the illuminated point in the specimen
come to a focus. Thus, the light from this point in the specimen converges on this
aperture and enters the detector.

By contrast, the light from regions out of the plane of focus of the spotlight is
also out of focus at the pinhole aperture and is therefore largely excluded from
the detector (see Figure 9-19). To build up a two-dimensional image, data from
each point in the plane of focus are collected sequentially by scanning across the
field from left to right in a regular pattern of pixels and are displayed on a
computer screen. Although not shown in Figure 9-19, the scanning is usually done by
deflecting the beam with an oscillating mirror placed between the dichroic
mirror and the objective lens in such a way that the illuminating spotlight and the



Figure 9-20 Confocal fluorescence 
microscopy produces clear optical 
sections and three-dimensional data 
sets. The first two micrographs are of the 
same intact gastrula-stage Drosophila 
embryo, which has been stained with a 
fluorescent probe for actin filaments. 

(A) The conventional, unprocessed image 
is blurred by the presence of fluorescent 
structures above and below the plane 
of focus. (B) In the confocal image, this 
out-of-focus information is removed, 
resulting in a crisp optical section of 
the cells in the embryo. (C) A three- 
dimensional reconstruction of an object 
can be assembled from a stack of such 
optical sections. In this case, the complex 
branching structure of the mitochondrial 
compartment in a single live yeast cell
is shown. (A and B, courtesy of Richard 
Warn and Peter Shaw; C, courtesy of 
Stefan Hell.) 
Figure 9-21 Multiphoton imaging. Infrared laser light causes less damage to living cells than
visible light and can also penetrate further, allowing microscopists to peer deeper into living tissues. 
The two-photon effect, in which a fluorochrome can be excited by two coincident infrared photons 
instead of a single high-energy photon, allows us to see nearly 0.5 mm inside the cortex of a live 
mouse brain. A dye, whose fluorescence changes with the calcium concentration, reveals active 
synapses (yellow) on the dendritic spines (red) that change as a function of time; in this case, there 
is a day between each image. (Courtesy of Thomas Oertner and Karel Svoboda.) 


confocal pinhole at the detector remain strictly in register. Variations in design 
now allow the rapid collection of data at video rates. 

The confocal microscope has been used to resolve the structure of numerous
complex three-dimensional objects (Figure 9-20C) including the networks of
cytoskeletal fibers in the cytoplasm and the arrangements of chromosomes and
genes in the nucleus.

The relative merits of deconvolution methods and confocal microscopy for
three-dimensional optical microscopy depend on the specimen being imaged.
Confocal microscopes tend to be better for thicker specimens with high levels of
out-of-focus light. They are also generally easier to use than deconvolution
systems and the final optical sections can be seen quickly. In contrast, the cooled
CCD or CMOS cameras used for deconvolution systems are extremely efficient at
collecting small amounts of light, and they can be used to make detailed
three-dimensional images from specimens that are too weakly stained or too easily
damaged by the bright light used for confocal microscopy.

Both methods, however, have another drawback; neither is good at coping
with very thick specimens. Deconvolution methods quickly become ineffective
any deeper than about 40 μm into a specimen, while confocal microscopes can
only obtain images up to a depth of about 150 μm. Special microscopes can now
take advantage of the way in which fluorescent molecules are excited, to probe
even deeper into a specimen. Fluorescent molecules are usually excited by a
single high-energy photon, of shorter wavelength than the emitted light, but they can
in addition be excited by the absorption of two (or more) photons of lower energy,
as long as they both arrive within a femtosecond or so of each other. The use of
this longer-wavelength excitation has some important advantages. In addition
to reducing background noise, red or near-infrared light can penetrate deeper
into a specimen. Multiphoton microscopes, constructed to take advantage of this
two-photon effect, can obtain sharp images, sometimes even at a depth of 250 μm
within a specimen. This is particularly valuable for studies of living tissues,
notably in imaging the dynamic activity of synapses and neurons just below the
surface of living brains (Figure 9-21).
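
The energy bookkeeping behind two-photon excitation can be checked with a few lines of Python. This is only a sketch of the arithmetic: the wavelengths of 450 nm and 900 nm are hypothetical values chosen for illustration, and the function name is not taken from any library. Photon energy is inversely proportional to wavelength, so two photons of twice the wavelength together deliver the energy of one photon at the original wavelength.

```python
PLANCK_EV_S = 4.135667696e-15     # Planck constant in eV*s
LIGHT_SPEED_NM_S = 2.99792458e17  # speed of light in nm/s

def photon_energy_ev(wavelength_nm):
    """Energy of a single photon, E = h*c / wavelength, in electron volts."""
    return PLANCK_EV_S * LIGHT_SPEED_NM_S / wavelength_nm

# Hypothetical example: a fluorochrome normally excited by one 450 nm photon.
single_photon = photon_energy_ev(450.0)
two_infrared_photons = 2 * photon_energy_ev(900.0)

print(f"one 450 nm photon:  {single_photon:.3f} eV")
print(f"two 900 nm photons: {two_infrared_photons:.3f} eV")
```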

Individual Proteins Can Be Fluorescently Tagged in Living Cells 
and Organisms 

Even the most stable cell structures must be assembled, disassembled, and
reorganized during the cell's life cycle. Other structures, often enormous on the
molecular scale, rapidly change, move, and reorganize themselves as the cell conducts
its internal affairs and responds to its environment. Complex, highly organized
pieces of molecular machinery move components around the cell, controlling
traffic into and out of the nucleus, from one organelle to another, and into and out
of the cell itself.

Various techniques have been developed to visualize the specific components
involved in such dynamic phenomena. Many of these methods use fluorescent
proteins, and they require a trade-off between structural preservation and
efficient labeling. All of the fluorescent molecules discussed so far are made outside
the cell and then artificially introduced into it. But use of genes coding for protein
molecules that are themselves inherently fluorescent also enables the creation of
organisms and cell lines that make their own visible tags and labels, without the
introduction of foreign molecules. These cellular exhibitionists display their inner
workings in glowing fluorescent color.

Foremost among the fluorescent proteins used for these purposes by cell
biologists is the green fluorescent protein (GFP), isolated from the jellyfish
Aequorea victoria. This protein is encoded by a single gene, which can be cloned and
introduced into cells of other species. The freshly translated protein is not
fluorescent, but within an hour or so (less for some alleles of the gene, more for
others) it undergoes a self-catalyzed post-translational modification to generate an
efficient fluorochrome, shielded within the interior of a barrel-like protein, which
will now fluoresce when illuminated appropriately with blue light (Figure 9-22).
Extensive site-directed mutagenesis performed on the original gene sequence
has resulted in multiple variants that can be used effectively in organisms ranging
from animals and plants to fungi and microbes. The fluorescence efficiency has
also been improved, and variants have been generated with altered absorption
and emission spectra from the blue-green, like blue fluorescent protein or BFP, to
the far visible red. Other, related fluorescent proteins have since been discovered
(for example, in corals) that also extend the range into the red region of the
spectrum, like red fluorescent protein or RFP.

One of the simplest uses of GFP is as a reporter molecule, a fluorescent probe to
monitor gene expression. A transgenic organism can be made with the GFP-coding
sequence placed under the transcriptional control of the promoter belonging
to a gene of interest, giving a directly visible readout of the gene's expression
pattern in the living organism (Figure 9-23). In another application, a peptide
location signal can be added to the GFP to direct it to a particular cell compartment,
such as the endoplasmic reticulum or a mitochondrion, lighting up these
organelles so they can be observed in the living state (see Figure 12-31).

The GFP DNA coding sequence can also be inserted at the beginning or end of
the gene for another protein, yielding a chimeric product consisting of that
protein with a GFP domain attached. In many cases, this GFP fusion protein behaves
in the same way as the original protein, directly revealing its location and
activities by means of its genetically encoded fluorescence (Figure 9-24). It is often
possible to prove that the GFP fusion protein is functionally equivalent to the
untagged protein, for example by using it to rescue a mutant lacking that protein.
GFP tagging is the clearest and most unequivocal way of showing the distribution
and dynamics of a protein in a living organism (Figure 9-25 and see Movie 16.8).

Protein Dynamics Can Be Followed in Living Cells 

Fluorescent proteins are now exploited not just to see where in a cell a
particular protein is located, but also to uncover its kinetic properties and to find out
whether it might interact with other molecules. We now describe three techniques
in which fluorescent proteins are used in this way.

First, interactions between one protein and another can be monitored by
fluorescence resonance energy transfer, also called Förster resonance energy


Figure 9-23 Green fluorescent protein (GFP) as a reporter. For this 
experiment, carried out in the fruit fly, the GFP gene was joined (using 
recombinant DNA techniques) to a fly promoter that is active only in a 
specialized set of neurons. This image of a live fly embryo was captured 
by a fluorescence microscope and shows approximately 20 neurons, each 
with long projections (axons and dendrites) that communicate with other 
(nonfluorescent) cells. These neurons are located just under the surface of the 
animal and allow it to sense its immediate environment. (From W.B. Grueber
et al., Curr. Biol. 13:618-626, 2003. With permission from Elsevier.) 



Figure 9-22 Green fluorescent protein 
(GFP). The structure of GFP, shown here 
schematically, highlights the eleven
β strands that form the staves of a barrel.
Buried within the barrel is the active
chromophore (dark green) that is formed
post-translationally from the protruding side
chains of three amino acid residues. (From
M. Ormö et al., Science 273:1392-1395,
1996. With permission from AAAS.)
Figure 9-24 GFP-tagged proteins. This living cell from a tobacco plant
is expressing high levels of green fluorescent protein, fused to a protein 
that is targeted to mitochondria, which accordingly appear green. The 
mitochondria are seen to cluster around the chloroplasts, whose chlorophyll 
autofluorescence marks them out in red. (Courtesy of Olivier Grandjean.) 


transfer, both abbreviated FRET. In this technique, two molecules of interest are
each labeled with a different fluorochrome, chosen so that the emission spectrum
of one fluorochrome, the donor, overlaps with the absorption spectrum of the
other, the acceptor. If the two proteins bind so as to bring their fluorochromes into
very close proximity (closer than about 5 nm), one fluorochrome, when excited,
can transfer energy from the absorbed light directly (by resonance, nonradiatively)
to the other. Thus, when the complex is illuminated at the excitation wavelength of
the first fluorochrome, fluorescent light is produced at the emission wavelength of
the second. This method can be used with two different spectral variants of GFP
as fluorochromes to monitor processes such as the interaction of signaling
molecules with their receptors, or proteins in macromolecular complexes at specific
locations inside living cells (Figure 9-26). The FRET can be measured by
quantifying the reduction of the donor fluorescence in the presence of the acceptor.
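
The donor-quenching measurement can be turned into a number with a short calculation. The Python sketch below is illustrative only: the function names are hypothetical, the intensity values are made up, and the characteristic distance r0_nm = 5.0 is an assumed value chosen to match the "about 5 nm" range mentioned above (the actual value depends on the particular donor and acceptor pair). The first function computes the transfer efficiency from the drop in donor fluorescence; the second uses the standard Förster relation, in which efficiency falls off with the sixth power of distance, to show why FRET reports only very close proximity.

```python
def fret_efficiency_from_donor(donor_alone, donor_with_acceptor):
    """Efficiency E = 1 - F_DA / F_D, where F_D is the donor fluorescence
    without acceptor and F_DA is the donor fluorescence with acceptor."""
    if donor_alone <= 0:
        raise ValueError("donor_alone must be positive")
    return 1.0 - donor_with_acceptor / donor_alone

def fret_efficiency_at_distance(r_nm, r0_nm=5.0):
    """Forster relation E = 1 / (1 + (r / R0)^6).
    r0_nm is an assumed characteristic distance, not a measured constant."""
    return 1.0 / (1.0 + (r_nm / r0_nm) ** 6)

# Made-up intensities: donor signal drops from 100 to 60 units when the
# acceptor-tagged partner is bound.
print("efficiency from quenching:", fret_efficiency_from_donor(100.0, 60.0))

# Steep distance dependence around the assumed R0 of 5 nm.
for r in (2.5, 5.0, 7.5, 10.0):
    print(f"r = {r:4.1f} nm  E = {fret_efficiency_at_distance(r):.3f}")
```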

A second example of a fluorescence-tagging technique that allows detailed
observations of proteins within cells involves synthesizing an inactive form of the
fluorescent molecule of interest, introducing it into the cell, and then activating
it suddenly at a chosen site in the cell by focusing a spot of light on it. This
process is referred to as photoactivation. Many inactive photosensitive precursors
of this type, often called caged molecules, have been made based on a variety of
fluorescent molecules. A microscope can be used to focus a strong pulse of light
from a laser on any tiny region of the cell, so that the experimenter can control
exactly where and when the fluorescent molecule is photoactivated. The
technique allows us to follow complex and rapid intracellular processes, such as the
actions of signaling molecules or the movements of cytoskeletal proteins.

When a photoactivatable fluorescent tag is attached to a purified protein, it
is important that the modified protein remain biologically active: labeling with a
caged fluorescent dye adds a bulky group to the surface of a protein, which can
easily change the protein's properties. A satisfactory labeling protocol is usually
found by trial and error. Once a biologically active labeled protein has been
produced, it needs to be introduced into the living cell where its behavior can be
followed. Tubulin labeled with caged fluorescein, for example, can be injected into
a dividing cell, where it is incorporated into microtubules of the mitotic spindle.
When a small region of the spindle is illuminated with a laser, the labeled tubulin
Figure 9-25 Dynamics of GFP tagging. This sequence of micrographs shows a set of
three-dimensional images of a living nucleus taken over the course of 135 minutes. Tobacco cells have
been stably transformed with GFP fused to a spliceosomal protein that is concentrated in small
nuclear bodies called Cajal bodies (see Figure 6-46). The fluorescent Cajal bodies, easily visible in
a living cell with confocal microscopy, are dynamic structures that move around within the nucleus.
(Courtesy of Kurt Boudonck, Liam Dolan, and Peter Shaw.)
becomes fluorescent, so that its movement along the spindle microtubules can be
readily followed (Figure 9-27). 

A further development in photoactivation is the discovery that the genes
encoding GFP and related fluorescent proteins can be engineered to produce
protein variants, usually with one or more amino acid changes, that fluoresce
only weakly under normal excitation conditions, but can be induced to fluoresce
either more strongly or with a color shift (for example, from green to red) by
activating them with a strong pulse of light at a different wavelength. In principle,
the microscopist can then follow the local in vivo behavior of any protein that
can be expressed as a fusion with one of these GFP variants. These genetically
encoded, photoactivatable fluorescent proteins allow the lifetime and behavior
of any protein to be studied independently of other newly synthesized proteins
(Figure 9-28).

A third way to exploit GFP fused to a protein of interest is known as
fluorescence recovery after photobleaching (FRAP). Here, one uses a strong focused
beam of light from a laser to extinguish the GFP fluorescence in a specified region
of the cell, after which one can analyze the way in which remaining unbleached
fluorescent protein molecules move into the bleached area as a function of time.



Figure 9-26 Fluorescence resonance
energy transfer (FRET). To determine
whether (and when) two proteins interact
inside a cell, the proteins are first produced
as fusion proteins attached to different
color variants of green fluorescent protein
(GFP). (A) In this example, protein X is
coupled to a blue fluorescent protein, which
is excited by violet light (370-440 nm) and
emits blue light (440-480 nm); protein Y
is coupled to a green fluorescent protein,
which is excited by blue light (440-480 nm)
and emits green light (510 nm). (B) If protein
X and Y do not interact, illuminating the
sample with violet light yields fluorescence
from the blue fluorescent protein only.
(C) When protein X and protein Y interact,
the resonance transfer of energy, FRET,
can now occur. Illuminating the sample
with violet light excites the blue fluorescent
protein, which transfers its energy to the
green fluorescent protein, resulting in an
emission of green light. The fluorochromes
must be quite close together (within about
1-5 nm of one another) for FRET to occur.
Because not every molecule of protein X
and protein Y is bound at all times, some
blue light may still be detected. But as the
two proteins begin to interact, emission
from the donor blue fluorescent protein
falls as the emission from the acceptor
GFP rises.


Figure 9-27 Determining microtubule 
flux in the mitotic spindle with caged 
fluorescein linked to tubulin. 

(A) A metaphase spindle formed in vitro 
from an extract of Xenopus eggs has 
incorporated three fluorescent markers: 
rhodamine-labeled tubulin (red) to mark 
all the microtubules, a blue DNA-binding
dye that labels the chromosomes, and
caged-fluorescein-labeled tubulin, which is
also incorporated into all the microtubules
but is invisible because it is nonfluorescent
until activated by ultraviolet (UV) light.
(B) A beam of UV light activates, or
“uncages,” the caged-fluorescein-labeled
tubulin locally, mainly just to the left side
of the metaphase plate. Over the next
few minutes (after 1.5 minutes in (C) and
after 2.5 minutes in (D)) the
uncaged-fluorescein-tubulin signal moves toward
the left spindle pole, indicating that tubulin
is continuously moving poleward even
though the spindle (visualized by the red
rhodamine-labeled tubulin fluorescence)
remains largely unchanged. (From
K.E. Sawin and T.J. Mitchison, J. Cell Biol.
112:941-954, 1991. With permission
from The Rockefeller University Press.)
This technique is usually carried out with a confocal microscope and, like
photoactivation, can deliver valuable quantitative data about a protein's kinetic
parameters, such as diffusion coefficients, active transport rates, or binding and
dissociation rates from other proteins (Figure 9-29).
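
To give a sense of how a recovery curve becomes quantitative data, the Python sketch below extracts two commonly reported summary values from a FRAP time course: the mobile fraction (how much of the prebleach signal eventually returns) and the half-time of recovery. Neither quantity is defined in the text above, so treat them, the function name, and the synthetic single-exponential curve as assumptions made for illustration. The sketch also assumes a noise-free, steadily rising curve; real data would need background correction, normalization, and model fitting.

```python
import numpy as np

def frap_summary(times, intensities, prebleach_intensity):
    """Return (mobile_fraction, half_time) from a recovery curve.

    times and intensities start at the first frame after bleaching.
    Assumes the intensities rise monotonically to a plateau.
    """
    t = np.asarray(times, dtype=float)
    f = np.asarray(intensities, dtype=float)
    f0 = f[0]                                # signal right after the bleach
    n_tail = max(3, len(f) // 10)
    plateau = f[-n_tail:].mean()             # plateau estimated from last frames
    mobile_fraction = (plateau - f0) / (prebleach_intensity - f0)
    half_level = f0 + 0.5 * (plateau - f0)   # halfway between f0 and plateau
    # np.interp needs increasing x-values, hence the monotonic assumption.
    half_time = float(np.interp(half_level, f, t)) - t[0]
    return mobile_fraction, half_time

# Synthetic example: prebleach signal 1.0, bleached down to 0.2, recovering
# to 0.8 with a hypothetical time constant of 5 seconds.
tau = 5.0
t = np.linspace(0.0, 60.0, 241)
f = 0.2 + 0.6 * (1.0 - np.exp(-t / tau))

mobile, t_half = frap_summary(t, f, prebleach_intensity=1.0)
print(f"mobile fraction: {mobile:.3f}")
print(f"half-time of recovery: {t_half:.2f} s")
```

Turning the half-time into a diffusion coefficient or a binding rate requires a physical model of the bleached region and of the protein's behavior, which is beyond this sketch.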


Figure 9-28 Photoactivation. 

Photoactivation is the light-induced
activation of an inert molecule to an
active state. In this experiment, shown
schematically in (A), a photoactivatable
variant of GFP is expressed in a cultured
animal cell. (B) Before activation (time
0 sec), little or no GFP fluorescence is
detected in the selected region (red circle)
when excited by blue light at 488 nm. After
activation of the GFP with an ultraviolet
laser pulse at 413 nm, it rapidly fluoresces
brightly in the selected region (green). The
movement of GFP, as it diffuses out of this
region, can be measured. Since only the
photoactivated proteins are fluorescent
within the cell, the trafficking, turnover, and
degradative pathways of proteins can be
monitored. (B, from J. Lippincott-Schwartz
and G.H. Patterson, Science 300:87-91,
2003.)


Light-Emitting Indicators Can Measure Rapidly Changing 
Intracellular Ion Concentrations 


One way to study the chemistry of a single living cell is to insert the tip of a fine,
glass, ion-sensitive microelectrode directly into the cell interior through the
plasma membrane. This technique is used to measure the intracellular
concentrations of common inorganic ions, such as H⁺, Na⁺, K⁺, Cl⁻, and Ca²⁺. However,
ion-sensitive microelectrodes reveal the ion concentration only at one point
in a cell, and for an ion present at a very low concentration, such as Ca²⁺, their
responses are slow and somewhat erratic. Thus, these microelectrodes are not
ideally suited to record the rapid and transient changes in the concentration of
cytosolic Ca²⁺ that have an important role in allowing cells to respond to
extracellular signals. Such changes can be analyzed with ion-sensitive indicators, whose
light emission reflects the local concentration of the ion. Some of these
indicators are luminescent (emitting light spontaneously), while others are fluorescent
(emitting light on exposure to light).



[Figure 9-29 diagram labels: photobleach selected region; fluorescence recovery; untreated; control cell; panel (B)]


Figure 9-29 Fluorescence recovery
after photobleaching (FRAP). A
strong focused pulse of laser light will
extinguish, or bleach, the fluorescence
of GFP. By selectively photobleaching
a set of fluorescently tagged protein
molecules within a defined region of a cell,
the microscopist can monitor recovery
over time, as the remaining fluorescent
molecules move into the bleached region
(see Movie 10.6). (A) The experiment
shown uses monkey cells in culture that
express galactosyltransferase, an enzyme
that constantly recycles between the Golgi
apparatus and the endoplasmic reticulum
(ER). The Golgi apparatus in one of the
two cells is selectively photobleached,
while the production of new fluorescent
protein is blocked by treating the cells with
cycloheximide. The recovery, resulting from
fluorescent enzyme molecules moving from
the ER to the Golgi, can then be followed
over a period of time. (B) Schematic
diagram of the experiment shown in (A).
(A, from J. Lippincott-Schwartz,
Histochem. Cell Biol. 116:97-107, 2001.
With permission from Springer-Verlag.)
Figure 9-30 Aequorin, a luminescent protein. The luminescent protein
aequorin emits blue light in the presence of free Ca²⁺. Here, an egg of the
medaka fish has been injected with aequorin, which has diffused throughout
the cytosol, and the egg has then been fertilized with a sperm and examined
with the help of a very sensitive camera. The four photographs were taken
looking down on the site of sperm entry at intervals of 10 seconds and
reveal a wave of release of free Ca²⁺ into the cytosol from internal stores
just beneath the plasma membrane. This wave sweeps across the egg
starting from the site of sperm entry, as indicated in the diagrams on the
left. (Photographs reproduced from J.C. Gilkey, L.F. Jaffe, E.B. Ridgway and
G.T. Reynolds, J. Cell Biol. 76:448-466, 1978. With permission from The
Rockefeller University Press.)


Aequorin is a luminescent protein isolated from the same marine jellyfish
that produces GFP. It emits blue light in the presence of Ca²⁺ and responds to
changes in Ca²⁺ concentration in the range of 0.5-10 μM. If microinjected into an
egg, for example, aequorin emits a flash of light in response to the sudden
localized release of free Ca²⁺ into the cytoplasm that occurs when the egg is fertilized
(Figure 9-30). Aequorin has also been expressed transgenically in plants and
other organisms to provide a method of monitoring Ca²⁺ in all their cells without
the need for microinjection, which can be a difficult procedure.

Bioluminescent molecules like aequorin emit tiny amounts of light (at best, 
a few photons per indicator molecule) that are difficult to measure. Fluorescent 
indicators produce orders of magnitude more photons per molecule; they are 
therefore easier to measure and can give better spatial resolution. Genetically 
encoded fluorescent Ca²⁺ indicators have been synthesized that bind Ca²⁺ 
tightly and are excited by or emit light at slightly different wavelengths when 
they are free of Ca²⁺ than when they are in their Ca²⁺-bound form. By measuring 
the ratio of fluorescence intensity at two excitation or emission wavelengths, 
we can determine the concentration ratio of the Ca²⁺-bound indicator to the 
Ca²⁺-free indicator, thereby providing an accurate measurement of the free 
Ca²⁺ concentration (see Movie 15.4). Indicators of this type are widely used for 
second-by-second monitoring of changes in intracellular Ca²⁺ concentration, or 
other ion concentrations, in the different parts of a cell viewed in a fluorescence 
microscope (Figure 9-31). 
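
As a rough illustration of how a two-wavelength ratio is turned into a concentration, the sketch below applies the commonly used ratiometric calibration relation, [Ca²⁺] = Kd × β × (R − Rmin)/(Rmax − R), where R is the measured intensity ratio, Rmin and Rmax are the ratios for Ca²⁺-free and Ca²⁺-saturated indicator, and β is the ratio of free to bound indicator fluorescence at the second wavelength. The function name and every calibration value below are assumptions chosen for illustration; real values must be measured for the particular indicator and microscope.

```python
def free_calcium(ratio, r_min, r_max, kd, beta=1.0):
    """Estimate free Ca2+ (same units as kd) from a fluorescence ratio.

    ratio : measured intensity ratio at the two wavelengths
    r_min : ratio measured with Ca2+-free indicator
    r_max : ratio measured with Ca2+-saturated indicator
    kd    : dissociation constant of the indicator for Ca2+
    beta  : free/bound fluorescence ratio at the second wavelength
    """
    if not (r_min <= ratio < r_max):
        raise ValueError("ratio must lie in the interval [r_min, r_max)")
    return kd * beta * (ratio - r_min) / (r_max - ratio)


# Hypothetical calibration values, for illustration only.
KD_NM = 225.0   # assumed dissociation constant, in nM
R_MIN = 0.3     # assumed ratio at zero Ca2+
R_MAX = 8.0     # assumed ratio at saturating Ca2+
BETA = 5.0      # assumed free/bound fluorescence ratio

if __name__ == "__main__":
    for r in (0.5, 1.0, 2.0, 4.0):
        ca = free_calcium(r, R_MIN, R_MAX, KD_NM, BETA)
        print(f"R = {r:.1f} -> free Ca2+ of about {ca:.0f} nM")
```

Because the calculation depends only on a ratio, it is largely insensitive to indicator concentration, cell thickness, and illumination intensity, which is the main reason ratio measurements are preferred over single-wavelength intensities.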

Similar fluorescent indicators measure other ions; some detect H⁺, for example, 
and hence measure intracellular pH. Some of these indicators can enter cells 
by diffusion and thus need not be microinjected; this makes it possible to 
monitor large numbers of individual cells simultaneously in a fluorescence 
microscope. New types of indicators, used in conjunction with modern 
image-processing methods, make possible similarly rapid and precise methods for 
analyzing changes in the concentrations of many types of small molecules in cells. 

Single Molecules Can Be Visualized by Total Internal Reflection 
Fluorescence Microscopy 

In ordinary microscopes, single fluorescent molecules such as tagged proteins 
cannot be reliably detected. The limitation has nothing to do with the resolution 
limit, but instead arises from the strong background due to light emitted or 
scattered by out-of-focus molecules. This tends to blot out the fluorescence from the 

Figure 9-31 Visualizing intracellular Ca²⁺ concentrations by using a 
fluorescent indicator. The branching tree of dendrites of a Purkinje cell in 
the cerebellum receives more than 100,000 synapses from other neurons. 
The output from the cell is conveyed along the single axon seen leaving the 
cell body at the bottom of the picture. This image of the intracellular Ca²⁺ 
concentration in a single Purkinje cell (from the brain of a guinea pig) was 
taken with a low-light camera and the Ca²⁺-sensitive fluorescent indicator 
fura-2. The concentration of free Ca²⁺ is represented by different colors, red 
being the highest and blue the lowest. The highest Ca²⁺ levels are present in 
the thousands of dendritic branches. (Courtesy of D.W. Tank, J.A. Connor, 
M. Sugimori, and R.R. Llinas.) 

Figure 9-32 TIRF microscopy allows the detection of single fluorescent molecules. (A) TIRF microscopy uses excitatory 
laser light to illuminate the cover-slip surface at the critical angle at which all the light is reflected by the glass-water interface. 
Some electromagnetic energy extends a short distance across the interface as an evanescent wave that excites just those 
molecules that are attached to the cover slip or are very close to its surface. (B) TIRF microscopy is used here to image 
individual myosin-GFP molecules (green dots) attached to nonfluorescent actin filaments (C), which are invisible but stuck to the 
surface of the cover slip. (Courtesy of Dmitry Cherny and Clive R. Bagshaw.) 


particular molecule of interest. This problem can be solved by the use of a special 
optical technique called total internal reflection fluorescence (TIRF) microscopy. 
In a TIRF microscope, laser light shines onto the cover-slip surface at the precise 
critical angle at which total internal reflection occurs (Figure 9-32A). Because of 
total internal reflection, the light does not enter the sample, and the majority of 
fluorescent molecules are not, therefore, illuminated. However, electromagnetic 
energy does extend, as an evanescent field, for a very short distance beyond the 
surface of the cover slip and into the specimen, allowing just those molecules in 
the layer closest to the surface to become excited. When these molecules 
fluoresce, their emitted light is no longer competing with out-of-focus light from the 
overlying molecules, and can now be detected. TIRF has allowed several dramatic 
experiments, for instance imaging of single motor proteins moving along 
microtubules or single actin filaments forming and branching. At present, the 
technique is restricted to a thin layer within only 100-200 nm of the cell surface (Figure 
9-32B and C). 
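
To give a feel for why the excited layer is so thin, the sketch below evaluates the standard expressions for the critical angle, θc = arcsin(n2/n1), and for the 1/e decay depth of the evanescent field intensity, d = λ / (4π √(n1² sin²θ − n2²)). The refractive indices, wavelength, and incidence angle used here are assumed, typical-looking values rather than figures from the text, and the function names are illustrative.

```python
import math


def critical_angle_deg(n_glass, n_water):
    """Angle of incidence (degrees) beyond which light is totally reflected."""
    return math.degrees(math.asin(n_water / n_glass))


def penetration_depth_nm(wavelength_nm, angle_deg, n_glass, n_water):
    """1/e decay depth (nm) of the evanescent field intensity."""
    term = (n_glass * math.sin(math.radians(angle_deg))) ** 2 - n_water ** 2
    if term <= 0:
        # At or below the critical angle there is no evanescent field:
        # the light is refracted into the specimen instead.
        raise ValueError("angle must exceed the critical angle")
    return wavelength_nm / (4 * math.pi * math.sqrt(term))


if __name__ == "__main__":
    N_GLASS = 1.518       # assumed refractive index of the cover slip
    N_WATER = 1.33        # assumed refractive index of the aqueous specimen
    WAVELENGTH_NM = 488   # assumed laser wavelength

    print(f"critical angle: {critical_angle_deg(N_GLASS, N_WATER):.1f} degrees")
    for angle in (62, 65, 70):
        d = penetration_depth_nm(WAVELENGTH_NM, angle, N_GLASS, N_WATER)
        print(f"incidence {angle} degrees -> depth of about {d:.0f} nm")
```

Under these assumptions the depth falls from a few hundred nanometers just above the critical angle to under 100 nm at steeper angles, which is in line with the thin excited layer described above.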

Individual Molecules Can Be Touched, Imaged, and Moved Using 
Atomic Force Microscopy 

While TIRF allows single molecules to be visualized under certain conditions, it is 
strictly a passive observation method. In order to probe molecular function, it is 
ultimately useful to be able to manipulate individual molecules themselves, and 
atomic force microscopy (AFM) provides a method to do just that. In an AFM device, 
an extremely small and sharply pointed tip, often of silicon or silicon nitride, is 
made using nanofabrication methods similar to those used in the semiconductor 
industry. The tip of the AFM probe is attached to a springy cantilever arm mounted 
on a highly precise positioning system that allows it to be moved over very small 
distances. In addition to this precise movement capability, the AFM device is 
able to collect information about a variety of forces that it encounters (including 
electrostatic, van der Waals, and mechanical forces), which are felt by its tip as 
it moves close to or touches the surface (Figure 9-33A). When AFM was first 
developed, it was intended as an imaging technology to measure molecular-scale 

Figure 9-33 Single molecules can be imaged and manipulated by atomic force microscopy. (A) Schematic diagram of the key components of 
an atomic force microscope (AFM), showing the force-sensing tip attached to one end of a single protein molecule, as in the experiment described in 
(D). (B) and (C) An AFM in imaging mode created these images of a single heteroduplex DNA molecule with a MutS protein dimer (larger white regions) 
bound near its center, at the point of a mismatched base pair. MutS is the first protein that binds to DNA when the mismatch repair process is initiated 
(see Figure 5-19). The smaller white dots are single streptavidin molecules, used to label the two ends of each DNA molecule. (D) Titin is an enormous 
protein molecule that provides muscle with its passive elasticity (see Figure 16-34). The extensibility of this protein can be directly tested using a short, 
artificially produced protein that contains eight repeated immunoglobulin (Ig) domains from one region of the titin protein. In this experiment, the tip 
of the AFM is used to pick up, and progressively stretch, a single molecule until it eventually ruptures. As force is applied, each Ig domain suddenly 
begins to unfold, and the force needed in each case (about 200 pN) can be recorded. The region of the force-extension curve shaded green records 
the sequential unfolding event for each of the eight protein domains. (B and C, from Y. Jiang and P.E. Marszalek, EMBO J. 30:2881-2893, 2011. 
Reprinted with permission of John Wiley & Sons; D, adapted from W.A. Linke et al., J. Struct. Biol. 137:194-205, 2002. With permission from Elsevier.) 


features on a surface. When used in this mode, the probe is scanned over the 
surface, moving up and down as necessary to maintain a constant interaction force 
with the surface, thus revealing any objects such as proteins or other molecules 
that might be present on the otherwise flat surface (Figure 9-33B and C). AFM is 
not limited to simply imaging surfaces, however, and can also be used to pick up 
and move single molecules that adsorb strongly to the tip. Using this technology, 
the mechanical properties of individual protein molecules can be measured in 
detail. For example, AFM has been used to unfold a single protein molecule in 
order to measure the energetics of domain folding (Figure 9-33D). 

Superresolution Fluorescence Techniques Can Overcome 
Diffraction-Limited Resolution 

The variations on light microscopy we have described so far are all constrained 
by the classic diffraction limit to resolution described earlier; that is, to about 200 
nm (see Figure 9-6). Yet many cellular structures, from nuclear pores to 
nucleosomes and clathrin-coated pits, are much smaller than this and so are 
unresolvable by conventional light microscopy. Several approaches, however, are now 
available that bypass the limit imposed by the diffraction of light, and successfully 
allow objects as small as 20 nm to be imaged and clearly resolved: a remarkable, 
order-of-magnitude improvement. 

The first of these so-called superresolution approaches, structured 
illumination microscopy (SIM), is a fluorescence imaging method with a resolution of 
about 100 nm, or twice the resolution of conventional bright-field and confocal 
microscopy. SIM overcomes the diffraction limit by using a grated or structured 
pattern of light to illuminate the sample. The microscope's physical set-up and 
operation is quite complex, but the general principle can be thought of as 
similar to creating a moiré pattern, an interference pattern created by overlaying two 
grids with different angles or mesh sizes (Figure 9-34). In a similar way to 
creating a moiré pattern, the illuminating grid and the sample features combine into 
an interference pattern, from which the original high-resolution contributions 
to the image of features beyond the classical resolution limit can be calculated. 
Illumination by a grid means that the parts of the sample in the dark stripes of 
the grid are not illuminated and therefore not imaged, so the imaging is repeated 
several times (usually three) after translating the grid through a fraction of the 
grid spacing between each image. As the interference effect is strongest for image 
components close to the direction of the grid bars, the whole process is repeated 
with the grid pattern rotated through a series of angles to obtain an equivalent 
enhancement in all directions. Finally, mathematically combining all these 
separate images by computer creates an enhanced superresolution image. SIM is 
versatile because it can be used with any fluorescent dye or protein, and combining 
SIM images captured at consecutive focal planes can create three-dimensional 
data sets (Figure 9-35). 
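
The moiré argument can be made concrete with a little spatial-frequency arithmetic. In the simplified one-dimensional sketch below, a sample feature with spatial frequency k_s, illuminated by a stripe pattern of frequency k_i, produces a moiré (difference) component at |k_s − k_i|; that component is visible only if it falls below the cutoff frequency of the optics. The function names are illustrative, and the 200 nm periods are assumed values taken to match the diffraction limit quoted in the text.

```python
def moire_frequency(sample_freq, illum_freq):
    """Difference (beat) frequency produced when two patterns are multiplied."""
    return abs(sample_freq - illum_freq)


def is_recoverable(sample_freq, illum_freq, cutoff_freq):
    """True if the moire component passes through the optics."""
    return moire_frequency(sample_freq, illum_freq) <= cutoff_freq


def finest_period_nm(cutoff_period_nm, illum_period_nm):
    """Smallest sample period whose moire component is still transmitted."""
    cutoff_freq = 1.0 / cutoff_period_nm
    illum_freq = 1.0 / illum_period_nm
    return 1.0 / (cutoff_freq + illum_freq)


if __name__ == "__main__":
    CUTOFF_NM = 200.0  # assumed finest period the optics can transmit
    ILLUM_NM = 200.0   # assumed finest stripe period that can be projected

    for period in (250.0, 150.0, 120.0, 80.0):
        ok = is_recoverable(1 / period, 1 / ILLUM_NM, 1 / CUTOFF_NM)
        print(f"sample period {period:.0f} nm -> recoverable: {ok}")
    print(f"finest recoverable period: {finest_period_nm(CUTOFF_NM, ILLUM_NM):.0f} nm")
```

Because the illumination stripes are themselves formed by diffraction-limited optics, k_i cannot exceed the cutoff, so the largest recoverable sample frequency is about twice the cutoff. This is the origin of the roughly twofold gain in resolution.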


Figure 9-34 Structured illumination 
microscopy. The principle, illustrated here, 
is to illuminate a sample with patterned 
light and measure the moiré pattern. 
Shown are (A) the pattern from an unknown 
structure and (B) a known pattern. 
(C) When these are combined, the resulting 
moiré pattern contains more information 
than is easily seen in (A), the original 
pattern. If the known pattern (B) has higher 
spatial frequencies, then better resolution 
will result. However, because the spatial 
patterns that can be created optically 
are also diffraction-limited, SIM can only 
improve the resolution by about a factor 
of two. (From B.O. Leung and K.C. Chou, 
Appl. Spectrosc. 65:967-980, 2011.) 



Figure 9-35 Structured illumination microscopy can be used to create three-dimensional data. These three-dimensional projections of the 
meiotic chromosomes at pachytene in a maize cell show the paired lateral elements of the synaptonemal complexes. (A) The chromosome set has 
been stained with a fluorescent antibody to cohesin and is viewed here by conventional fluorescence microscopy. Because the distance between 
the two lateral elements is about 200 nm, the diffraction limit, the two lateral elements that make up each complex are not resolved. (B) In the 
three-dimensional SIM image, the improved resolution enables each lateral element, about 100 nm across, to be clearly resolved, and the two 
chromosomes can clearly be seen to coil around each other. (C) Because the complete three-dimensional data set for the whole nucleus is available, 
the path of each separate pair of chromosomes can be traced and artificially assigned a different color. (Courtesy of C.J. Rachel Wang, Peter Carlton 
and Zacheus Cande.) 

Figure 9-36 The point spread function of 
a lens determines resolution. (A) When a 
point source of light is brought to a focus 
by a lens system, diffraction effects mean 
that, instead of being imaged as a point, 
it is blurred in all dimensions. (B) In the 
plane of the image, the distribution of light 
approximates a Gaussian distribution, 
whose width at half-maximum height under 
ideal conditions is about 200 nm. (C) Two 
point sources that are about 200 nm apart 
can still just be distinguished as separate 
objects in the image, but if they are any 
nearer than that, their images will overlap 
and not be resolvable. 


To get around the diffraction limit, the other two superresolution techniques 
exploit aspects of the point spread function, a property of the optical system 
mentioned earlier. The point spread function is the distribution of light intensity within 
the three-dimensional, blurred image that is formed when a single point source 
of light is brought to a focus with a lens. Instead of being identical to the point 
source, the image has an intensity distribution that is approximately described 
by a Gaussian distribution, which in turn determines the resolution of the lens 
system (Figure 9-36). Two points that are closer than the width at half-maximum 
height of this distribution will become hard to resolve because their images 
overlap too much (see Figure 9-36C). 

In fluorescence microscopy, the excitation light is focused to a spot on the 
specimen by the objective lens, which then captures the photons emitted by any 
fluorescent molecule that the beam has raised from a ground state to an excited 
state. Because the excitation spot is blurred according to the point spread 
function, fluorescent molecules that are closer than about 200 nm will be imaged as 
a single blurred spot. One approach to increasing the resolution is to switch all 
the fluorescent molecules at the periphery of the blurry excitation spot back to 
their ground state, or to a state where they no longer fluoresce in the normal way, 
leaving only those at the very center to be recorded. This can be done in practice 
by adding a second, very bright laser beam that wraps around the excitation beam 
like a torus. The wavelength and intensity of this second beam are adjusted so as 
to switch the fluorescent molecules off everywhere except at the very center of 
the point spread function, a region that can be as small as 20 nm across (Figure 
9-37). The fluorescent probes used must be in a special class that is 
photoswitchable: their emission can be reversibly switched on and off with lights of different 
wavelengths. As the specimen is scanned with this arrangement of lasers, 
fluorescent molecules are switched on and off, and the small point spread function at 
each location is recorded. The diffraction limit is breached because the technique 
ensures that similar but very closely spaced molecules are in one of two different 
states, either fluorescing or dark. This approach is called STED (stimulated 
emission depletion microscopy) and various microscopes using versions of the general 
method are now in wide use. Resolutions of 20 nm have been achieved in 
biological specimens, and even higher resolution attained with nonbiological specimens 
(see Figure 9-37). 
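
The dependence of the effective spot size on the brightness of the torus-shaped beam is often summarized by the approximate relation d ≈ d₀ / √(1 + I/Iₛ), where d₀ is the diffraction-limited width, I is the peak intensity of the depletion beam, and Iₛ is the intensity at which half of the fluorescence is switched off. This relation is not given in the text above; it is included here only as a hedged sketch, and the function name and the intensity ratios tried are illustrative assumptions.

```python
import math


def sted_resolution_nm(diffraction_limit_nm, saturation_factor):
    """Approximate effective spot width for a given depletion strength.

    saturation_factor is the ratio I / I_s of the depletion-beam intensity
    to the saturation intensity of the fluorophore (dimensionless).
    """
    if saturation_factor < 0:
        raise ValueError("saturation_factor must be non-negative")
    return diffraction_limit_nm / math.sqrt(1.0 + saturation_factor)


if __name__ == "__main__":
    D0_NM = 200.0  # diffraction-limited width quoted in the text
    for factor in (0, 3, 10, 99):
        d = sted_resolution_nm(D0_NM, factor)
        print(f"I/I_s = {factor:>2} -> effective width of about {d:.0f} nm")
```

Under this approximation, narrowing a 200 nm spot to 20 nm needs a depletion beam roughly 100 times the saturation intensity, which is consistent with the description of the second beam as very bright.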

Superresolution Can Also Be Achieved Using Single-Molecule 
Localization Methods 

If a single fluorescent molecule is imaged, it appears as a circular blurry disc, but 
if sufficient photons have contributed to this image, the precise mathematical 
center of the disclike image can be determined very accurately, often to within a 
few nanometers. But the problem with a specimen that contains a large number 

Figure 9-37 Superresolution microscopy can be achieved by reducing 
the size of the point spread function. (A) The size of a normal focused 
beam of excitatory light. (B) An extremely strong superimposed laser beam, 
at a different wavelength and in the shape of a torus, depletes emitted 
fluorescence everywhere in the specimen except right in the center of the 
beam, reducing the effective width of the point spread function (C). As 
the specimen is scanned, this small point spread function can then build 
up a crisp image in a process called STED (stimulated emission depletion 
microscopy). (D) Synaptic vesicles in live cultured neurons, fluorescently 
labeled and imaged by ordinary confocal microscopy, with a resolution of 
260 nm. (E) The same vesicles imaged by STED, with a resolution of 60 
nm, which allows single vesicles to be resolved. (F) Fluorescently labeled 
replication factories in the nucleus of a cultured cell, imaged by ordinary 
confocal microscopy. (G) The same replication factories imaged by STED. 
Single, discrete replication sites can be resolved by STED that cannot be 
seen in the confocal image. (A, B, and C, from G. Donnert et al., Proc. Natl 
Acad. Sci. USA 103:11440-11445, 2006. With permission from National 
Academy of Sciences; D and E, from V. Westphal et al., Science 320:246- 
249, 2008. With permission from AAAS; F and G, from Z. Cseresnyes, 
U. Schwarz and C.M. Green, BMC Cell Biol. 10:88, 2009.) 



of adjacent fluorescent molecules, as we saw earlier, is that they each contribute 
blurry, overlapping point spread functions to the image, making the exact 
position of any one molecule impossible to resolve. Another way round this limitation 
is to arrange for only a very few, clearly separated molecules to actively fluoresce 
at any one moment. The exact position of each of these can then be computed, 
before subsequent sets of molecules are examined. 
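
The reason the center of a blurry spot can be found so precisely is statistical: each detected photon is an independent sample from the point spread function, so the uncertainty of the mean position shrinks roughly as σ/√N for N photons. The sketch below uses this simplified estimate, which ignores background light and camera pixelation (both of which make real precision somewhat worse), and checks it with a small one-dimensional simulation. The function names and the photon counts are assumptions for illustration.

```python
import math
import random

# Conversion from full width at half maximum to Gaussian standard deviation.
FWHM_TO_SIGMA = 1.0 / (2.0 * math.sqrt(2.0 * math.log(2.0)))


def localization_precision_nm(psf_fwhm_nm, n_photons):
    """Approximate standard error of the spot center (sigma / sqrt(N))."""
    if n_photons <= 0:
        raise ValueError("n_photons must be positive")
    return psf_fwhm_nm * FWHM_TO_SIGMA / math.sqrt(n_photons)


def simulate_center_estimate(true_center_nm, psf_fwhm_nm, n_photons, seed=0):
    """Draw photon positions from a Gaussian PSF and return their mean."""
    rng = random.Random(seed)
    sigma = psf_fwhm_nm * FWHM_TO_SIGMA
    positions = [rng.gauss(true_center_nm, sigma) for _ in range(n_photons)]
    return sum(positions) / n_photons


if __name__ == "__main__":
    PSF_FWHM_NM = 200.0  # width of the blurred image quoted in the text
    for n in (100, 1000, 10000):
        expected = localization_precision_nm(PSF_FWHM_NM, n)
        estimate = simulate_center_estimate(0.0, PSF_FWHM_NM, n)
        print(f"{n:>5} photons: expected precision {expected:.2f} nm, "
              f"simulated error {abs(estimate):.2f} nm")
```

With a 200 nm wide spot, about a thousand photons give a precision of a few nanometers and about ten thousand give better than one nanometer, in line with the accuracies quoted in the text.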

In practice, this can be achieved by using lasers to sequentially switch on a 
sparse subset of fluorescent molecules in a specimen containing photoactivatable 
or photoswitchable fluorescent labels. Labels are activated, for example, by 
illumination with near-ultraviolet light, which modifies a small subset of molecules 
so that they fluoresce when exposed to an excitation beam at another wavelength. 
These are then imaged before bleaching quenches their fluorescence and a new 
subset is activated. Each molecule emits a few thousand photons in response to 
the excitation before switching off, and the switching process can be repeated 
hundreds or even thousands of times, allowing the exact coordinates of a very 
large set of single molecules to be determined. The full set can be combined and 
digitally displayed as an image in which the computed location of each individual 
molecule is exactly marked (Figure 9-38). This class of methods has been 
variously termed photoactivated localization microscopy (PALM) or stochastic optical 
reconstruction microscopy (STORM). 

By switching the fluorophores off and on sequentially in different regions 
of the specimen as a function of time, all the superresolution imaging methods 
described above allow the resolution of molecules that are much closer together 
than the 200 nm diffraction limit. In STED, the locations of the molecules are 
determined by using optical methods to define exactly where their fluorescence 
will be on or off. In PALM and STORM, individual fluorescent molecules are 
switched on and off at random over a period of time, allowing their positions to 
be accurately determined. PALM and STORM techniques have depended on the 

[Figure 9-38, panel (B) labels: successive cycles of activation and bleaching allow well-separated single fluorescent molecules 
to be detected; a superresolution image of the fluorescent structure is built up as the positions of successive small 
groups of molecules are added to the map.] 


Figure 9-38 Single fluorescent molecules can be located with great accuracy. (A) Determining the exact mathematical 
center of the blurred image of a single fluorescent molecule becomes more accurate the more photons contribute to the final 
image. The point spread function described in the text dictates that the size of the molecular image is about 200 nm across, 
but in very bright specimens, the position of its center can be pinpointed to within a nanometer. (B) In this imaginary specimen, 
sparse subsets of fluorescent molecules are individually switched on briefly and then bleached. The exact positions of all these 
well-spaced molecules can be gradually built up into an image at superresolution. (C) In this portion of a cell, the microtubules 
have been fluorescently labeled and imaged at the top in a TIRF microscope (see Figure 9-32) and below, at superresolution, 
in a PALM microscope. The diameter of the microtubules in the lower panel now resembles their true size, about 25 nm, 
rather than the 250 nm in the blurred image at the top. (A, from A.L. McEvoy et al., BMC Biol. 8:106, 2010; C, courtesy of 
Carl Zeiss Ltd.) 


development of novel fluorescent probes that exhibit the appropriate switching 
behavior. All these methods are now being extended to incorporate multicolor 
imaging, three-dimensional imaging (Figure 9-39), and live-cell imaging in real 
time. Ending the long reign of the diffraction limit has certainly reinvigorated light 
microscopy and its place in cell biology research. 


Figure 9-39 Small fluorescent structures can be imaged in three 
dimensions with superresolution. (A) The image of two touching 
180-nm-diameter clathrin-coated pits on the plasma membrane of a cultured 
cell is diffraction-limited, and the individual pits cannot be distinguished in 
this conventional fluorescence image. (B) Using STORM superresolution 
microscopy, however, the pits are clearly resolvable. Not only can such pits 
be imaged using probes of different colors, but additional three-dimensional 
information can also be obtained. (C) and (D) Shown are two different 
orthogonal views of one single coated pit. The clathrin is labeled red and 
transferrin (the cargo within the pit) is labeled green. Images of this sort can 
be acquired in less than one second, making possible dynamic observations 
on living cells. These techniques depend heavily on the development of new, 
very fast-switching, and extremely bright fluorescent probes. (A and 
B, from M. Bates et al., Science 317:1749-1753, 2007; C and D, from 
S.A. Jones et al., Nat. Methods 8:499-508, 2011. With permission from 
Macmillan Publishers Ltd.) 

Summary 

Many light-microscope techniques are available for observing cells. Cells that have 
been fixed and stained can be studied in a conventional light microscope, whereas 
antibodies coupled to fluorescent dyes can be used to locate specific molecules in 
cells in a fluorescence microscope. Living cells can be seen with phase-contrast, 
differential-interference-contrast, dark-field, or bright-field microscopes. All forms 
of light microscopy are facilitated by digital image-processing techniques, which 
enhance sensitivity and refine the image. Confocal microscopy and image 
deconvolution both provide thin optical sections and can be used to reconstruct 
three-dimensional images. 

Techniques are now available for detecting, measuring, and following almost 
any desired molecule in a living cell. Fluorescent indicator dyes can be introduced 
to measure the concentrations of specific ions in individual cells or in different parts 
of a cell. Virtually any protein of interest can be genetically engineered as a 
fluorescent fusion protein, and then imaged in living cells by fluorescence microscopy. The 
dynamic behavior and interactions of many molecules can be followed in living 
cells by variations on the use of fluorescent protein tags, in some cases at the level of 
single molecules. Various superresolution techniques can circumvent the diffraction 
limit and resolve molecules separated by distances as small as 20 nm. 

LOOKING AT CELLS AND MOLECULES IN THE 
ELECTRON MICROSCOPE 

Light microscopy is limited in the fineness of detail that it can reveal. Microscopes 
using other types of radiation (in particular, electron microscopes) can resolve 
much smaller structures than is possible with visible light. This higher resolution 
comes at a cost: specimen preparation for electron microscopy is complex and 
it is harder to be sure that what we see in the image corresponds precisely to the 
original living structure. It is possible, however, to use very rapid freezing to 
preserve structures faithfully for electron microscopy. Digital image analysis can be 
used to reconstruct three-dimensional objects by combining information either 
from many individual particles or from multiple tilted views of a single object. 
Together, these approaches extend the resolution and scope of electron 
microscopy to the point at which we can faithfully image the structures of individual 
macromolecules and the complexes they form. 

The Electron Microscope Resolves the Fine Structure of the Cell 

The formal relationship between the diffraction limit to resolution and the 
wavelength of the illuminating radiation (see Figure 9-6) holds true for any form of 
radiation, whether it is a beam of light or a beam of electrons. With electrons, 
however, the limit of resolution is very small. The wavelength of an electron decreases 
as its velocity increases. In an electron microscope with an accelerating voltage 
of 100,000 V, the wavelength of an electron is 0.004 nm. In theory, the resolution 
of such a microscope should be about 0.002 nm, which is 100,000 times that of 
the light microscope. Because the aberrations of an electron lens are considerably 
harder to correct than those of a glass lens, however, the practical resolving power 
of modern electron microscopes is, even with careful image processing to correct 
for lens aberrations, about 0.05 nm (0.5 Å) (Figure 9-40). This is because only the 
very center of the electron lenses can be used, and the effective numerical 
aperture is tiny. Furthermore, problems of specimen preparation, contrast, and 
radiation damage have generally limited the normal effective resolution for biological 
objects to 1 nm (10 Å). This is nonetheless about 200 times better than the 
resolution of the light microscope. Moreover, the performance of electron microscopes 
is improved by electron illumination sources called field emission guns. These 
very bright and coherent sources substantially improve the resolution achieved. 
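
The wavelength quoted for 100,000 V can be checked with the relativistic de Broglie relation, λ = h / √(2 m e V (1 + eV / 2mc²)). The sketch below is a minimal calculation using standard values for the physical constants; the function name and the list of voltages tried are illustrative choices, and the "half a wavelength" figure simply mirrors the theoretical estimate given in the text.

```python
import math

# Physical constants (SI units).
PLANCK_H = 6.62607015e-34         # Planck constant, J s
ELECTRON_MASS = 9.1093837e-31     # electron rest mass, kg
ELECTRON_CHARGE = 1.602176634e-19 # elementary charge, C
LIGHT_SPEED = 2.99792458e8        # speed of light, m/s


def electron_wavelength_nm(accelerating_voltage):
    """Relativistic de Broglie wavelength (nm) of an accelerated electron."""
    if accelerating_voltage <= 0:
        raise ValueError("accelerating_voltage must be positive")
    energy = ELECTRON_CHARGE * accelerating_voltage  # kinetic energy, J
    rest_energy = ELECTRON_MASS * LIGHT_SPEED ** 2   # rest energy, J
    momentum = math.sqrt(
        2.0 * ELECTRON_MASS * energy * (1.0 + energy / (2.0 * rest_energy))
    )
    return PLANCK_H / momentum * 1e9


if __name__ == "__main__":
    for volts in (10_000, 100_000, 300_000):
        wavelength = electron_wavelength_nm(volts)
        print(f"{volts:>7} V: wavelength {wavelength:.4f} nm, "
              f"half-wavelength {wavelength / 2:.4f} nm")
```

At 100,000 V this gives a wavelength of about 0.0037 nm, which rounds to the 0.004 nm quoted above, and it shows the wavelength shortening as the accelerating voltage (and hence the electron velocity) rises.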

In overall design, the transmission electron microscope (TEM) is similar to a 
light microscope, although it is much larger and “upside down” (Figure 9-41). 



Figure 9-40 The resolution of the 
electron microscope. This transmission 
electron micrograph of a monolayer of 
graphene resolves the individual carbon 
atoms as bright spots in a hexagonal 
lattice. Graphene is a single isolated atomic 
plane of graphite and forms the basis of 
carbon nanotubes. The distance between 
adjacent bonded carbon atoms is 
0.14 nm (1.4 Å). Such resolution can only 
be obtained in a specially built transmission 
electron microscope in which all lens 
aberrations are carefully corrected, and 
with optimal specimens; it cannot be 
achieved with most conventional biological 
specimens. (From A. Dato et al., Chem. 
Commun. 40:6095-6097, 2009. With 
permission from The Royal Society of 
Chemistry.) 

Figure 9-41 The principal features of 
a light microscope and a transmission 
electron microscope. These drawings 
emphasize the similarities of overall 
design. Whereas the lenses in the light 
microscope are made of glass, those in the 
electron microscope are magnetic coils. 
The electron microscope requires that the 
specimen be placed in a vacuum. The inset 
shows a transmission electron microscope 
in use. (Photograph courtesy of JEOL Ltd.) 


The source of illumination is a filament or cathode that emits electrons at the top 
of a cylindrical column about 2 m high. Since electrons are scattered by collisions 
with air molecules, air must first be pumped out of the column to create a 
vacuum. The electrons are then accelerated from the filament by a nearby anode and 
allowed to pass through a tiny hole to form an electron beam that travels down the 
column. Magnetic coils placed at intervals along the column focus the electron 
beam, just as glass lenses focus the light in a light microscope. The specimen is put 
into the vacuum, through an airlock, into the path of the electron beam. As in light 
microscopy, the specimen is usually stained, in this case with electron-dense 
material. Some of the electrons passing through the specimen are scattered by 
structures stained with the electron-dense material; the remainder are focused 
to form an image, in a manner analogous to the way an image is formed in a light 
microscope. The image can be observed on a phosphorescent screen or recorded 
with a high-resolution digital camera. Because the scattered electrons are lost 
from the beam, the dense regions of the specimen show up in the image as areas 
of reduced electron flux, which look dark. 

Biological Specimens Require Special Preparation for Electron 
Microscopy 

In the early days of its application to biological materials, the electron microscope 
revealed many previously unimagined structures in cells. But before these 
discoveries could be made, electron microscopists had to develop new procedures for 
embedding, cutting, and staining tissues. 

Since the specimen is exposed to a very high vacuum in the electron 
microscope, living tissue is usually killed and preserved by fixation, first with 
glutaraldehyde, which covalently cross-links protein molecules to their neighbors, and 
then with osmium tetroxide, which binds to and stabilizes lipid bilayers as well as 
proteins (Figure 9-42). Because electrons have very limited penetrating power, 
the fixed tissues normally have to be cut into extremely thin sections (25-100 nm 
thick, about 1/200 the thickness of a single cell) before they are viewed. This is 
achieved by dehydrating the specimen, permeating it with a monomeric resin 
that polymerizes to form a solid block of plastic, then cutting the block with a fine 
glass or diamond knife on a special microtome. The resulting thin sections, free of 
water and other volatile solvents, are supported on a small metal grid for viewing 
in the microscope (Figure 9-43). 

Figure 9-42 Two common chemical 
fixatives used for electron microscopy 
(glutaraldehyde and osmium tetroxide). 
The two reactive aldehyde groups of 
glutaraldehyde enable it to cross-link 
various types of molecules, forming 
covalent bonds between them. Osmium 
tetroxide forms cross-linked complexes 
with many organic compounds, and in the 
process becomes reduced. This reaction is 
especially useful for fixing cell membranes, 
since the C=C double bonds present 
in many fatty acids react with osmium 
tetroxide. 

The steps required to prepare biological material for electron microscopy are 
challenging. How can we be sure that the image of the fixed, dehydrated, 
resin-embedded specimen bears any relation to the delicate, aqueous biological system 
present in the living cell? The best current approaches to this problem depend on 
rapid freezing. If an aqueous system is cooled fast enough and to a low enough 
temperature, the water and other components in it do not have time to rearrange 
themselves or crystallize into ice. Instead, the water is supercooled into a rigid but 
noncrystalline state (a “glass”) called vitreous ice. This state can be achieved by 
slamming the specimen onto a polished copper block cooled by liquid helium, by 
plunging it into or spraying it with a jet of a coolant such as liquid propane, or by 
cooling it at high pressure. 

Some rapidly frozen specimens can be examined directly in the electron 
microscope using a special cooled specimen holder. In other cases, the frozen block can 
be fractured to reveal interior cell surfaces, or the surrounding ice can be sublimed 
away to expose external surfaces. However, we often want to examine thin 
sections. A compromise is therefore to rapid-freeze the tissue, replace the water with 
organic solvents, embed the tissue in plastic resin, and finally cut sections and 
stain. Although technically still difficult, this approach stabilizes and preserves the 
tissue in a condition very close to its original living state (Figure 9-44). 

Image clarity in an electron micrograph depends upon having a range of 
contrasting electron densities within the specimen. Electron density in turn depends 
on the atomic number of the atoms that are present: the higher the atomic 
number, the more electrons are scattered and the darker that part of the image. 
Biological tissues are composed mainly of atoms of very low atomic number 
(primarily carbon, oxygen, nitrogen, and hydrogen). To make them visible, tissues 
are usually impregnated (before or after sectioning) with the salts of heavy metals 
such as uranium, lead, and osmium. The degree of impregnation, or “staining,” 
with these salts will vary for different cell constituents. Lipids, for example, tend 
to stain darkly after osmium fixation, revealing the location of cell membranes. 
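
The contrast argument above can be made concrete by comparing atomic numbers. The short Python sketch below lists the standard atomic numbers of the main elements of biological tissue and of the three heavy metals named in the text. The dictionary and function names are illustrative only, and the unweighted mean is a crude summary that ignores real elemental abundances; it does not attempt to model electron scattering quantitatively.

```python
# Atomic numbers (Z) of the light elements that dominate biological tissue
# and of the heavy metals used as electron stains (standard values).
TISSUE_ELEMENTS = {"H": 1, "C": 6, "N": 7, "O": 8}
STAIN_ELEMENTS = {"Os": 76, "Pb": 82, "U": 92}


def mean_atomic_number(elements):
    """Unweighted mean Z; illustrative only, ignores real abundances."""
    return sum(elements.values()) / len(elements)


tissue_z = mean_atomic_number(TISSUE_ELEMENTS)
stain_z = mean_atomic_number(STAIN_ELEMENTS)

print(f"mean Z of tissue elements: {tissue_z:.1f}")
print(f"mean Z of stain elements:  {stain_z:.1f}")
print("heaviest tissue atom:", max(TISSUE_ELEMENTS.values()))
print("lightest stain atom: ", min(STAIN_ELEMENTS.values()))
```

Even the lightest of these stain atoms has an atomic number several times that of oxygen, which is why regions that bind stain scatter many more electrons than unstained tissue.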


Figure 9-43 The metal grid that supports the thin sections of a specimen in a 
transmission electron microscope. The copper grid is covered with a carbon 
and/or plastic film. 


Specific Macromolecules Can Be Localized by Immunogold 
Electron Microscopy 

We have seen how antibodies can be used in conjunction with fluorescence 
microscopy to localize specific macromolecules. An analogous method, immunogold 
electron microscopy, can be used in the electron microscope. The usual 
procedure is to incubate a thin section first with a specific primary antibody, and then 
with a secondary antibody to which a colloidal gold particle has been attached. 
The gold particle is electron-dense and can be seen as a black dot in the electron 
microscope (Figure 9-45). Different antibodies can be conjugated to gold particles 
of different sizes, so that multiple proteins can be localized in a single sample. 

Figure 9-44 Thin section of a cell. This thin section is of a yeast cell that has been 
very rapidly frozen and the vitreous ice replaced by organic solvents and then by 
plastic resin. The nucleus, mitochondria, cell wall, Golgi stacks, and ribosomes can all 
be readily seen in a state that is presumed to be as lifelike as possible. (Courtesy of 
Andrew Staehelin.) 

Figure 9-45 Localizing proteins in electron microscopy. Immunogold 
electron microscopy is used here to find the specific location of four different protein 
components within the spindle pole body of yeast. At the top is a thin section of a 
yeast mitotic spindle showing the spindle microtubules that cross the nucleus and 
connect at each end to spindle pole bodies embedded in the nuclear envelope. 
A diagram of the components of a single spindle pole body is shown below. On 
separate sections, antibodies against four different proteins of the spindle pole 
body are used, together with colloidal gold particles (black dots), to reveal where 
within the complex structure each protein is located. (Courtesy of John Kilmartin.) 

A complication for immunogold labeling is that the antibodies and colloidal 
gold particles do not penetrate into the resin used for embedding; therefore, they 
detect antigens only at the surface of the section. This means that the method's 
sensitivity is low, since antigen molecules in the deeper parts of the section are 
not detected. Furthermore, we may get a false impression regarding which 
structures contain the antigen and which do not. One solution is to label the specimen 
before embedding it in plastic, when cells and tissues are still fully accessible to 
labeling reagents. Extremely small gold particles, about 1 nm in diameter, work 
best for this procedure. Such small gold particles are usually not easily visible in 
the final sections, so additional silver or gold is nucleated around the tiny 1 nm 
gold particles in a chemical process very much like photographic development. 

Different Views of a Single Object Can Be Combined to Give a 
Three-Dimensional Reconstruction 

Thin sections often fail to convey the three-dimensional arrangement of cellular 
components viewed in a TEM, and the image can be very misleading: a linear 
structure such as a microtubule may appear in section as a pointlike object, for 
example, and a section through protruding parts of a single irregularly shaped 
solid body may give the appearance of two or more separate objects (Figure 9-46). 
The third dimension can be reconstructed from serial sections, but this is a lengthy 
and tedious process. Even thin sections, however, have a significant depth 
compared with the resolution of the electron microscope, so the TEM image can also 
be misleading in an opposite way, through the superimposition of objects that lie 
at different depths. 

Because of the large depth of field of electron microscopes, all the parts of the 
three-dimensional specimen are in focus, and the resulting image is a projection 
(a superimposition of layers) of the structure along the viewing direction. The lost 
information in the third dimension can be recovered if we have views of the same 
specimen from many different directions. The computational methods for this 
technique are widely used in medical CT scans. In a CT scan, the imaging 
equipment is moved around the patient to generate the different views. In 
electron-microscope (EM) tomography, the specimen holder is tilted in the microscope, 
which achieves the same result. In this way, we can arrive at a three-dimensional 
reconstruction, in a chosen standard orientation, by combining different views 
of a single object. Each individual view will be very noisy, but by combining them 
in three dimensions and taking an average, the noise can be largely eliminated. 
Starting with thick plastic sections of embedded material, three-dimensional 
reconstructions, or tomograms, are used extensively to describe the detailed 
anatomy of specific regions of the cell, such as the Golgi apparatus (Figure 9-47) or 
the cytoskeleton. Increasingly, microscopists are also applying EM tomography 
to unstained frozen, hydrated sections, and even to rapidly frozen whole cells 
or organelles (Figure 9-48). Electron microscopy now provides a robust bridge 
between the scale of the single molecule and that of the whole cell. 
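
Why a projection loses information, and why views from several directions recover it, can be seen in a toy example. The Python sketch below is an illustration under strong simplifying assumptions (a 2 × 2 grid of density values standing in for a specimen, and three hand-written projection functions with hypothetical names); it is not a tomographic reconstruction algorithm. It shows two different objects whose vertical and horizontal projections are identical and that only a tilted (diagonal) view can tell apart.

```python
# Two different toy "specimens": 1 = dense material, 0 = empty space.
object_a = [
    [1, 0],
    [0, 1],
]
object_b = [
    [0, 1],
    [1, 0],
]


def project_down(obj):
    """Sum each column: the view along the vertical direction."""
    return [sum(col) for col in zip(*obj)]


def project_across(obj):
    """Sum each row: the view along the horizontal direction."""
    return [sum(row) for row in obj]


def project_diagonal(obj):
    """Sum along one diagonal direction: a 'tilted' view."""
    n = len(obj)
    sums = [0] * (2 * n - 1)
    for r, row in enumerate(obj):
        for c, value in enumerate(row):
            sums[r - c + n - 1] += value
    return sums


for name, obj in (("A", object_a), ("B", object_b)):
    print(name, project_down(obj), project_across(obj), project_diagonal(obj))
```

The first two projections are the same for both objects, so neither view alone (nor both together) identifies the structure; the tilted view differs. Real tomography applies the same idea with many tilt angles and a full three-dimensional volume.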

Images of Surfaces Can Be Obtained by Scanning Electron 
Microscopy 

A scanning electron microscope (SEM) directly produces an image of the 
three-dimensional structure of the surface of a specimen. The SEM is usually 
smaller, simpler, and cheaper than a transmission electron microscope. Whereas 
the TEM uses the electrons that have passed through the specimen to form an 



250 nm 



Figure 9-46 A three-dimensional reconstruction from serial sections. 
Single thin sections in the electron microscope sometimes give misleading 
impressions. In this example, most sections through a cell containing a branched 
mitochondrion seem to contain two or three separate mitochondria (compare 
Figure 9-44). Sections 4 and 7, moreover, might be interpreted as showing a 
mitochondrion in the process of dividing. The true three-dimensional shape can 
be reconstructed from a complete set of serial sections. 


Figure 9-47 Electron-microscope (EM) tomography. Samples that have been 
rapidly frozen, and then freeze-substituted and embedded in plastic, preserve their 
structure in a condition that is very close to their original living state (Movie 9.2). 
This example shows the three-dimensional structure of the Golgi apparatus from 
a rat kidney cell. Several thick sections (250 nm) of the cell were tilted in a 
high-voltage electron microscope, along two different axes, and about 160 different 
views recorded. The digital data allow individual thin slices of the complete 
three-dimensional data set, or tomogram, to be viewed; for example, the serial slices, 
each only 4 nm thick, are shown in (A) and (B). Very little changes from one slice to 
the next, but using the full data set, and manually color-coding the membranes (B), 
one can obtain a full three-dimensional reconstruction, at a resolution of about 
7 nm, of the complete Golgi complex and its associated vesicles (C). (From M.S. 
Ladinsky et al., J. Cell Biol. 144:1135-1149, 1999. With permission from the authors.) 
image, the SEM uses electrons that are scattered or emitted from the specimen's 
surface. The specimen to be examined is fixed, dried, and coated with a thin layer 
of heavy metal. Alternatively, it can be rapidly frozen, and then transferred to a 
cooled specimen stage for direct examination in the microscope. Often an entire 
plant part or small animal can be put into the microscope with very little 
preparation (Figure 9-49). The specimen is scanned with a very narrow beam of electrons. 
The quantity of electrons scattered or emitted as this primary beam bombards 
each successive point of the metallic surface is measured and used to control the 
intensity of a second beam, which moves in synchrony with the primary beam 
and forms an image on a computer screen. Eventually a highly enlarged image of 
the surface as a whole is built up (Figure 9-50). 

The SEM technique provides great depth of field; moreover, since the amount 
of electron scattering depends on the angle of the surface relative to the beam, 
the image has highlights and shadows that give it a three-dimensional 
appearance (see Figure 9-49 and Figure 9-51). Only surface features can be examined, 
however, and in most forms of SEM, the resolution attainable is not very high 
(about 10 nm, with an effective magnification of up to 20,000 times). As a result, 
the technique is usually used to study whole cells and tissues rather than 
subcellular organelles (see Movie 21.3). Very-high-resolution SEMs have, however, been 
developed with a bright coherent field-emission gun as the electron source. This 
type of SEM can produce images that rival the resolution possible with a TEM 
(Figure 9-52). 
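
The pairing of a 10 nm resolution with an effective magnification of about 20,000 times can be checked with simple arithmetic: multiplying the smallest resolvable detail by the magnification gives the size of that detail in the final image. The Python sketch below does the calculation; the function name is illustrative, and the comparison with a resolving limit of roughly 0.2 mm for the unaided eye is an added assumption rather than a statement made in this passage.

```python
NM_PER_MM = 1_000_000


def magnified_size_mm(detail_nm, magnification):
    """Size in the final image (mm) of a detail of the given size (nm)."""
    return detail_nm * magnification / NM_PER_MM


size = magnified_size_mm(10, 20_000)
print(f"A 10 nm detail magnified 20,000 times spans {size} mm in the image")
```

Under that assumption, magnifying beyond this point would enlarge the image without revealing any finer detail, which is the sense in which the magnification is "effective."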

Negative Staining and Cryoelectron Microscopy Both Allow 
Macromolecules to Be Viewed at High Resolution 

If they are shadowed with a heavy metal to provide contrast, isolated 
macromolecules such as DNA or large proteins can be visualized readily in the electron 
microscope, but negative staining allows finer detail to be seen. In this 
technique, the molecules are supported on a thin film of carbon and mixed with a 
solution of a heavy-metal salt such as uranyl acetate. After the sample has dried, 
a very thin film of metal salt covers the carbon film everywhere except where it 
has been excluded by the presence of an adsorbed macromolecule. Because the 
macromolecule allows electrons to pass through it much more readily than does 
the surrounding heavy-metal stain, a reverse or negative image of the molecule is 


Figure 9-48 Combining cryoelectron-microscope tomography and 
single-particle reconstruction. Small, unfixed, rapidly frozen specimens can be examined 
while still frozen. In this example, the small nuclei of the amoeba Dictyostelium 
were gently isolated and then very rapidly frozen before a series of angled views 
were recorded with the aid of a tilting microscope stage. These digital views 
are combined by EM tomography to produce a three-dimensional tomogram. 
Two thin digital slices (10 nm) through this tomogram show (A) top views and 
(B) side views of individual nuclear pores (white arrows). (C) In the three-dimensional 
model, a surface rendering of the pores (blue) is seen embedded in the nuclear 
envelope (yellow). From a series of tomograms it was possible to extract data 
sets for nearly 300 separate nuclear pores, whose structures could then be averaged 
using the techniques of single-particle reconstruction. The surface-rendered view 
of one of these reconstructed pores is shown (D) from the nuclear face and (E) in 
cross section (compare with Figure 12-8). The pore complex is colored blue and 
the nuclear basket brown. (From M. Beck et al., Science 306:1387-1390, 2004. 
With permission from AAAS.) 



1 mm 


Figure 9-49 A developing wheat flower, or spike. This delicate flower spike was 
rapidly frozen, coated with a thin metal film, and examined in the frozen state with an 
SEM. This low-magnification micrograph demonstrates the large depth of focus of 
an SEM. (Courtesy of Kim Findlay.) 
created. Negative staining is especially useful for viewing large macromolecular 
aggregates such as viruses or ribosomes, and for seeing the subunit structure of 
protein filaments (Figure 9-53). 

Shadowing and negative staining can provide high-contrast surface views of 
small macromolecular assemblies, but the size of the smallest metal particles in 
the shadow or stain used limits the resolution of both techniques. An alternative 
that allows us to visualize directly at high resolution even the interior features 
of three-dimensional structures such as viruses and organelles is cryoelectron 
microscopy, in which rapid freezing to form vitreous ice is again the key. A very 
thin (about 100 nm) film of an aqueous suspension of virus or purified 
macromolecular complex is prepared on a microscope grid and is then rapidly frozen by 

Figure 9-50 The scanning electron microscope. In an SEM, the specimen is 
scanned by a beam of electrons brought to a focus on the specimen by the 
electromagnetic coils that act as lenses. The detector measures the quantity of 
electrons scattered or emitted as the beam bombards each successive point on the 
surface of the specimen and controls the intensity of successive points in an image 
built up on a screen. The SEM creates striking images of three-dimensional 
objects with great depth of focus and a resolution between 3 nm and 20 nm 
depending on the instrument. (Photograph courtesy of Andrew Davies.) 


Figure 9-51 Scanning electron microscopy. (A) A scanning electron 
micrograph of the stereocilia projecting from a hair cell in the inner ear of a bullfrog. 
For comparison, the same structure is shown by (B) 
differential-interference-contrast light microscopy (Movie 9.3) and (C) thin-section transmission electron 
microscopy. (Courtesy of Richard Jacobs and James Hudspeth.) 

being plunged into a coolant. A special sample holder keeps this hydrated 
specimen at -160°C in the vacuum of the microscope, where it can be viewed directly 
without fixation, staining, or drying. Unlike negative staining, in which what we 
see is the envelope of stain exclusion around the particle, hydrated cryoelectron 
microscopy produces an image from the macromolecular structure itself. 
However, the contrast in this image is very low, and to extract the maximum amount 
of structural information, special image-processing techniques must be used, as 
we describe next. 


Multiple Images Can Be Combined to Increase Resolution 

As we saw earlier (p. 532), noise is important in light microscopy at low light levels, 
but it is a particularly severe problem for electron microscopy of unstained 
macromolecules. A protein molecule can tolerate a dose of only a few tens of electrons 
per square nanometer without damage, and this dose is orders of magnitude 
below what is needed to define an image at atomic resolution. 

The solution is to obtain images of many identical molecules (perhaps tens 
of thousands of individual images) and combine them to produce an averaged 
image, revealing structural details that are hidden by the noise in the original 
images. This procedure is called single-particle reconstruction. Before 
combining all the individual images, however, they must be aligned with each other. 
Sometimes it is possible to induce proteins and complexes to form crystalline 
arrays, in which each molecule is held in the same orientation in a regular lattice. 
In this case, the alignment problem is easily solved, and several protein structures 
have been determined at atomic resolution by this type of electron 
crystallography. In principle, however, crystalline arrays are not absolutely required. With the 
help of a computer, the digital images of randomly distributed and unaligned 
molecules can be processed and combined to yield high-resolution reconstructions 
(see Movie 13.1). Although structures that have some intrinsic symmetry make 
the task of alignment easier and more accurate, this technique has also been used 
for objects like ribosomes, with no symmetry. Figure 9-54 shows the structure of 

Figure 9-52 The nuclear pore. Rapidly frozen nuclear envelopes were imaged in 
a high-resolution SEM, equipped with a field emission gun as the electron source. 
These views of each side of a nuclear pore represent the limit of resolution of the SEM 
(compare with Figure 12-8). (Courtesy of Martin Goldberg and Terry Allen.) 


WHAT WE DON'T KNOW 


• We know in detail about many cell processes, such as DNA replication 
and transcription and RNA translation, but will we ever be able to visualize 
such rapid molecular processes in action in cells? 

• Will we ever be able to image intracellular structures at the resolution 
of the electron microscope in living cells? 

• How can we improve crystallization and single-particle cryoelectron 
microscopy techniques to obtain high-resolution structures of all important 
membrane channels and transporters? What new concepts might these 
structures reveal? 


Figure 9-53 Negatively stained actin filaments. In this transmission electron 
micrograph, each filament is about 8 nm in diameter and is seen, on close inspection, 
to be composed of a helical chain of globular actin molecules. (Courtesy of 
Roger Craig.) 

the protein capsid inside a human immunodeficiency virus (HIV) that has been 
determined at high resolution by the combination of many particles, multiple 
views, and molecular modeling. 
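
The gain from averaging many images can be illustrated numerically. The Python sketch below is a minimal simulation under stated assumptions: every image is taken to be already perfectly aligned, a single pixel stands in for the whole image, and the noise is modeled as independent Gaussian noise with a fixed standard deviation. The function names, the noise level, and the image counts are all hypothetical choices made for illustration. Under these assumptions the residual noise in the average should fall roughly as one over the square root of the number of images combined.

```python
import math
import random
import statistics


def averaged_pixel(true_value, noise_sd, n_images, rng):
    """Average one pixel over n_images noisy, pre-aligned images."""
    total = sum(true_value + rng.gauss(0.0, noise_sd) for _ in range(n_images))
    return total / n_images


def residual_noise(noise_sd, n_images, trials, rng):
    """Empirical standard deviation of the averaged pixel over many trials."""
    averages = [averaged_pixel(0.0, noise_sd, n_images, rng) for _ in range(trials)]
    return statistics.pstdev(averages)


rng = random.Random(0)   # fixed seed so the run is repeatable
noise_sd = 10.0          # assumed noise level of a single image
results = {}
for n_images in (1, 25, 400):
    measured = residual_noise(noise_sd, n_images, trials=400, rng=rng)
    expected = noise_sd / math.sqrt(n_images)
    results[n_images] = (measured, expected)
    print(f"{n_images:4d} images: measured {measured:.2f}, expected {expected:.2f}")
```

Because the improvement grows only with the square root of the number of images, very large data sets are needed when the signal in each image is weak, which is consistent with the tens of thousands of images mentioned above.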

A resolution of 0.3 nm has been achieved by electron microscopy, enough 
to begin to see the internal atomic arrangements in a protein and to rival x-ray 
crystallography in resolution. Although electron microscopy is unlikely to 
supersede x-ray crystallography (discussed in Chapter 8) as a method for 
macromolecular structure determination, it has some very clear advantages. First, it does not 
absolutely require crystalline specimens. Second, it can deal with extremely large 
complexes: structures that may be too large or too variable to crystallize 
satisfactorily. Third, it allows the rapid analysis of different conformations of protein 
complexes. 

The analysis of large and complex macromolecular structures is helped 
considerably if the atomic structure of one or more of the subunits is known, for example 
from x-ray crystallography. Molecular models can then be mathematically “fitted” 
into the envelope of the structure determined at lower resolution using the 
electron microscope (see Figures 16-16D and 16-46). Figure 9-55 shows the structure 
of a ribosome with the location of a bound release factor displayed in this way (see 
also Figure 6-72). 

Summary 

Discovering the detailed structure of membranes and organelles requires the higher 
resolution attainable in a transmission electron microscope. Specific 
macromolecules can be localized after being labeled with colloidal gold linked to antibodies. 
Three-dimensional views of the surfaces of cells and tissues are obtained by scanning 
electron microscopy. The shapes of isolated molecules can be readily determined by 
electron microscopy techniques involving fast freezing or negative staining. 
Electron tomography and single-particle reconstruction use computational 
manipulations of data obtained from multiple images and multiple viewing angles to 
produce detailed reconstructions of macromolecules and molecular complexes. The 
resolution obtained with these methods means that atomic structures of individual 



Figure 9-54 Single-particle reconstruction. The structure of a 
complete human immunodeficiency virus (HIV) capsid has been determined by a 
combination of cryoelectron microscopy, protein structure determination, and 
modeling. (A) A single 4 nm slice from an EM tomographic model (see also Figure 
9-48) of an intact HIV particle with its membrane outer envelope and its internal, 
irregularly shaped protein capsid that houses its RNA genome. (B) Electron 
microscopy of capsid subunits that have self-assembled into a helical tube can be 
used to derive an electron-density map at a resolution of 8 nm, in which details of 
the hexamers are clearly visible. (C) Using the known atomic coordinates of a single 
subunit of the hexamer, the structure has been modeled into the 
electron-density map from (B). (D) A molecular reconstruction of the entire HIV capsid, 
based on the detailed structures shown in (A) and (C). This capsid contains 216 
hexamers (blue) and 12 pentamers (yellow). (Adapted from G. Zhao et al., Nature 
497:643-646, 2013. With permission from Macmillan Publishers Ltd. C, PDB code: 
3J34.) 


Figure 9-55 Single-particle reconstruction and molecular model 
fitting. Bacterial ribosomes, with and without the release factor required for 
peptide release from the ribosome, were used to derive high-resolution, 
three-dimensional cryoelectron microscopy maps at a resolution of better than 1 
nm. Images of nearly 20,000 separate ribosomes preserved in ice were used to 
produce single-particle reconstructions. 
(A) The 30S ribosomal subunit (yellow) and the 50S subunit (blue) can be distinguished 
from the additional electron density that can be attributed to the release factor RF2 
(purple). (B) The known molecular structure of RF2 modeled into the electron density 
from (A). (From U.B.S. Rawat et al., Nature 421:87-90, 2003. With permission from 
Macmillan Publishers Ltd.) 

macromolecules can often be “fitted” to the images derived by electron microscopy. 
In this way, the TEM is increasingly able to bridge the gap between structures 
discovered by x-ray crystallography and those discovered with the light microscope. 


PROBLEMS 

Which statements are true? Explain why or why not. 

9-1 Because the DNA double helix is only 2 nm wide 
(well below the limit of resolution of the light microscope), 
it is impossible to see chromosomes in living cells without 
special stains. 


9-7 Antibodies that bind to specific proteins are important 
tools for defining the locations of molecules in cells. 
The sensitivity of the primary antibody (the antibody that 
reacts with the target molecule) is often enhanced by 
using labeled secondary antibodies that bind to it. What 
are the advantages and disadvantages of using secondary 
antibodies that carry fluorescent tags versus those that 
carry bound enzymes? 


9-2 A fluorescent molecule, having absorbed a single 
photon of light at one wavelength, always emits it at a longer 
wavelength. 


Discuss the following problems. 


9-3 The diagrams in Figure Q9-1 show the paths of 
light rays passing through a specimen with a dry lens and 
with an oil-immersion lens. Offer an explanation for why 
oil-immersion lenses should give better resolution. Air, 
glass, and oil have refractive indices of 1.00, 1.51, and 1.51, 
respectively. 


Figure Q9-1 Paths of light rays through dry and oil-immersion lenses 
(Problem 9-3). The red circle at the origin of the light rays is the 
specimen. (Panels: DRY LENS and OIL-IMMERSION LENS. Labels: air, objective 
lens, cover slip, slide.) 


9-8 Figure Q9-3 shows a series of modified fluorescent 
proteins that emit light in a range of colors. How do you 
suppose the exact same chromophore can fluoresce at so 
many different wavelengths? 




Figure Q9-3 A rainbow of colors produced by modified fluorescent 
proteins (Problem 9-8). (Courtesy of Nathan Shaner, Paul Steinbach 
and Roger Tsien.) 

9-9 Consider a fluorescent detector designed to report 
the cellular location of active protein tyrosine kinases. A 
blue (cyan) fluorescent protein (CFP) and a yellow 
fluorescent protein (YFP) were fused to either end of a hybrid 
protein domain. The hybrid protein segment consisted of 
a substrate peptide recognized by the Abl protein 
tyrosine kinase and a phosphotyrosine-binding domain 
(Figure Q9-4A). Stimulation of the CFP domain does not 
cause emission by the YFP domain when the domains are 


9-4 Figure Q9-2 shows a diagram of the human eye. The 
refractive indices of the components in the light path are: 
cornea 1.38, aqueous humor 1.33, crystalline lens 1.41, and 
vitreous humor 1.38. Where does the main refraction (the 
main focusing) occur? What role do you suppose the lens 
plays? 



Figure Q9-2 Diagram 
of the human eye 
(Problem 9-4). 


9-5 Why do humans see so poorly under water? And why 
do goggles help? 

9-6 Explain the difference between resolution and 
magnification. 


Figure Q9-4 Fluorescent reporter protein designed to detect tyrosine 
phosphorylation (Problem 9-9). (A) Domain structure of reporter protein. 
Four domains are indicated: CFP, YFP, tyrosine kinase substrate peptide, 
and a phosphotyrosine-binding domain. (B) FRET assay. YFP/CFP 
is normalized to 1.0 at time zero. The reporter was incubated in the 
presence (or absence) of Abl and ATP for the indicated times. Arrow 
indicates time of addition of a tyrosine phosphatase. (From A.Y. Ting, 
K.H. Kain, R.L. Klemke and R.Y. Tsien, Proc. Natl Acad. Sci. USA 
98:15003-15008, 2001. With permission from National Academy of 
Sciences.) (Panels: (A) REPORTER and (B) FRET. Labels: 434 nm, 476 nm, 
substrate peptide, phosphotyrosine-binding protein.) 
separated. When the CFP and YFP domains are brought 
close together, however, fluorescence resonance energy 
transfer (FRET) allows excitation of CFP to stimulate 
emission by YFP. FRET shows up experimentally as an increase 
in the ratio of emission at 526 nm versus 476 nm (YFP/ 
CFP) when CFP is excited by 434 nm light. 


Incubation of the reporter protein with Abl protein 
tyrosine kinase in the presence of ATP gave an increase 
in YFP/CFP emission (Figure Q9-4B). In the absence of 
ATP or the Abl protein, no FRET occurred. FRET was also 
eliminated by addition of a tyrosine phosphatase (Figure 
Q9-4B). Describe as best you can how the reporter protein 
detects active Abl protein tyrosine kinase. 

INTERNAL ORGANIZATION OF THE CELL 

CHAPTER 10 


Cell membranes are crucial to the life of the cell. The plasma membrane encloses 
the cell, defines its boundaries, and maintains the essential differences between 
the cytosol and the extracellular environment. Inside eukaryotic cells, the 
membranes of the nucleus, endoplasmic reticulum, Golgi apparatus, mitochondria, 
and other membrane-enclosed organelles maintain the characteristic differences 
between the contents of each organelle and the cytosol. Ion gradients across 
membranes, established by the activities of specialized membrane proteins, can 
be used to synthesize ATP, to drive the transport of selected solutes across the 
membrane, or, as in nerve and muscle cells, to produce and transmit electrical 
signals. In all cells, the plasma membrane also contains proteins that act as 
sensors of external signals, allowing the cell to change its behavior in response to 
environmental cues, including signals from other cells; these protein sensors, or 
receptors, transfer information, rather than molecules, across the membrane. 

Despite their differing functions, all biological membranes have a common 
general structure: each is a very thin film of lipid and protein molecules, held 
together mainly by noncovalent interactions (Figure 10-1). Cell membranes 


IN THIS CHAPTER 

THE LIPID BILAYER 
MEMBRANE PROTEINS 



Figure 10-1 Two views of a cell membrane. (A) An electron 
micrograph of a segment of the plasma membrane of a 
human red blood cell seen in cross section, showing its bilayer 
structure. (B) A three-dimensional schematic view of a cell 
membrane and the general disposition of its lipid and protein 
constituents. (A, courtesy of Daniel S. Friend.) 

are dynamic, fluid structures, and most of their molecules move about in the 
plane of the membrane. The lipid molecules are arranged as a continuous 
double layer about 5 nm thick. This lipid bilayer provides the basic fluid structure of 
the membrane and serves as a relatively impermeable barrier to the passage of 
most water-soluble molecules. Most membrane proteins span the lipid bilayer and 
mediate nearly all of the other functions of the membrane, including the 
transport of specific molecules across it, and the catalysis of membrane-associated 
reactions such as ATP synthesis. In the plasma membrane, some transmembrane 
proteins serve as structural links that connect the cytoskeleton through the lipid 
bilayer to either the extracellular matrix or an adjacent cell, while others serve as 
receptors to detect and transduce chemical signals in the cell's environment. It 
takes many kinds of membrane proteins to enable a cell to function and interact 
with its environment, and it is estimated that about 30% of the proteins encoded 
in an animal's genome are membrane proteins. 

In this chapter, we consider the structure and organization of the two main 
constituents of biological membranes: the lipids and the proteins. Although 
we focus mainly on the plasma membrane, most concepts discussed apply to 
the various internal membranes of eukaryotic cells as well. The functions of cell 
membranes are considered in later chapters: their role in energy conversion and 
ATP synthesis, for example, is discussed in Chapter 14; their role in the 
transmembrane transport of small molecules in Chapter 11; and their roles in cell signaling 
and cell adhesion in Chapters 15 and 19, respectively. In Chapters 12 and 13, we 
discuss the internal membranes of the cell and the protein traffic through and 
between them. 


THE LIPID BILAYER 

The lipid bilayer provides the basic structure for all cell membranes. It is easily 
seen by electron microscopy, and its bilayer structure is attributable exclusively 
to the special properties of the lipid molecules, which assemble spontaneously 
into bilayers even under simple artificial conditions. In this section, we discuss 
the different types of lipid molecules found in cell membranes and the general 
properties of lipid bilayers. 

Phosphoglycerides, Sphingolipids, and Sterols Are the Major 
Lipids in Cell Membranes 

Lipid molecules constitute about 50% of the mass of most animal cell membranes, 
nearly all of the remainder being protein. There are approximately 5 × 10⁶ lipid 
molecules in a 1 μm × 1 μm area of lipid bilayer, or about 10⁹ lipid molecules in 
the plasma membrane of a small animal cell. All of the lipid molecules in cell 
membranes are amphiphilic; that is, they have a hydrophilic (“water-loving”) or 
polar end and a hydrophobic (“water-fearing”) or nonpolar end. 
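
The two lipid counts quoted above can be cross-checked with a little arithmetic. The sketch below derives the average area occupied by one lipid molecule and the plasma membrane area implied for a small animal cell. It assumes that the lipids are split evenly between the two monolayers, which is an assumption made here for illustration and not a figure from the text.

```python
# Figures quoted in the text.
LIPIDS_PER_UM2_OF_BILAYER = 5e6   # lipid molecules in a 1 um x 1 um patch of bilayer
LIPIDS_PER_SMALL_CELL = 1e9       # lipid molecules in the plasma membrane of a small animal cell

NM2_PER_UM2 = 1e6                 # 1 um = 1000 nm, so 1 um^2 = 1e6 nm^2

# Assumption (not from the text): each monolayer holds half of the lipids.
lipids_per_um2_of_monolayer = LIPIDS_PER_UM2_OF_BILAYER / 2

# Average membrane area occupied by one lipid molecule within its monolayer.
area_per_lipid_nm2 = NM2_PER_UM2 / lipids_per_um2_of_monolayer

# Plasma membrane area implied by the per-cell lipid count.
implied_membrane_area_um2 = LIPIDS_PER_SMALL_CELL / LIPIDS_PER_UM2_OF_BILAYER

print(f"Area per lipid molecule: {area_per_lipid_nm2:.2f} nm^2")
print(f"Implied plasma membrane area: {implied_membrane_area_um2:.0f} um^2")
```

Under these assumptions each lipid occupies roughly 0.4 nm² of its monolayer, and the per-cell count corresponds to a plasma membrane of about 200 μm².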

The most abundant membrane lipids are the phospholipids. These have a 
polar head group containing a phosphate group and two hydrophobic hydrocarbon 
tails. In animal, plant, and bacterial cells, the tails are usually fatty acids, and 
they can differ in length (they normally contain between 14 and 24 carbon atoms). 
One tail typically has one or more cis-double bonds (that is, it is unsaturated), 
while the other tail does not (that is, it is saturated). As shown in Figure 10-2, 
each cis-double bond creates a kink in the tail. Differences in the length and 
saturation of the fatty acid tails influence how phospholipid molecules pack against 
one another, thereby affecting the fluidity of the membrane, as we discuss later. 

The main phospholipids in most animal cell membranes are the phosphoglycerides, 
which have a three-carbon glycerol backbone (see Figure 10-2). Two 
long-chain fatty acids are linked through ester bonds to adjacent carbon atoms 
of the glycerol, and the third carbon atom of the glycerol is attached to a phosphate 
group, which in turn is linked to one of several types of head group. By 
combining several different fatty acids and head groups, cells make many different 
phosphoglycerides. Phosphatidylethanolamine, phosphatidylserine, and 
phosphatidylcholine are the most abundant ones in mammalian cell membranes 
(Figure 10-3A-C). 

Another important class of phospholipids are the sphingolipids, which are built 
from sphingosine rather than glycerol (Figure 10-3D-E). Sphingosine is a long acyl 
chain with an amino group (NH₂) and two hydroxyl groups (OH) at one end. In 
sphingomyelin, the most common sphingolipid, a fatty acid tail is attached to the 
amino group, and a phosphocholine group is attached to the terminal hydroxyl 
group. Together, the phospholipids phosphatidylcholine, phosphatidylethanolamine, 
phosphatidylserine, and sphingomyelin constitute more than half the 
mass of lipid in most mammalian cell membranes (see Table 10-1, p. 571). 

Figure 10-2 The parts of a typical 
phospholipid molecule. This example 
is a phosphatidylcholine, represented (A) 
schematically, (B) by a formula, (C) as a 
space-filling model (Movie 10.1), and 
(D) as a symbol. 

Figure 10-3 Four major phospholipids in 
mammalian plasma membranes. Different 
head groups are represented by different 
colors in the symbols. The lipid molecules 
shown in (A-C) are phosphoglycerides, 
which are derived from glycerol. The 
molecule in (D) is sphingomyelin, which 
is derived from sphingosine (E) and is 
therefore a sphingolipid. Note that only 
phosphatidylserine carries a net negative 
charge, the importance of which we discuss 
later; the other three are electrically neutral at 
physiological pH, carrying one positive and 
one negative charge. 

Figure 10-4 The structure of cholesterol. 
Cholesterol is represented (A) by a formula, 
(B) by a schematic drawing, and (C) as a 
space-filling model. 


In addition to phospholipids, the lipid bilayers in many cell membranes contain 
glycolipids and cholesterol. Glycolipids resemble sphingolipids, but, instead 
of a phosphate-linked head group, they have sugars attached. We discuss glycolipids 
later. Eukaryotic plasma membranes contain especially large amounts of 
cholesterol, up to one molecule for every phospholipid molecule. Cholesterol 
is a sterol. It contains a rigid ring structure, to which is attached a single polar 
hydroxyl group and a short nonpolar hydrocarbon chain (Figure 10-4). The cholesterol 
molecules orient themselves in the bilayer with their hydroxyl group close 
to the polar head groups of adjacent phospholipid molecules (Figure 10-5). 


Phospholipids Spontaneously Form Bilayers 

The shape and amphiphilic nature of the phospholipid molecules cause them to 
form bilayers spontaneously in aqueous environments. As discussed in Chapter 
2, hydrophilic molecules dissolve readily in water because they contain charged 
groups or uncharged polar groups that can form either favorable electrostatic 
interactions or hydrogen bonds with water molecules (Figure 10-6A). Hydrophobic 
molecules, by contrast, are insoluble in water because all, or almost all, 
of their atoms are uncharged and nonpolar and therefore cannot form energetically 
favorable interactions with water molecules. If dispersed in water, they force 
the adjacent water molecules to reorganize into icelike cages that surround the 
hydrophobic molecule (Figure 10-6B). Because these cage structures are more 
ordered than the surrounding water, their formation increases the free energy. 
This free-energy cost is minimized, however, if the hydrophobic molecules (or 
the hydrophobic portions of amphiphilic molecules) cluster together so that the 
smallest number of water molecules is affected. 

When amphiphilic molecules are exposed to an aqueous environment, they 
behave as you would expect from the above discussion. They spontaneously 
aggregate to bury their hydrophobic tails in the interior, where they are shielded 
from the water, and they expose their hydrophilic heads to water. Depending 
on their shape, they can do this in either of two ways: they can form spherical 
micelles, with the tails inward, or they can form double-layered sheets, or bilayers, 
with the hydrophobic tails sandwiched between the hydrophilic head groups 
(Figure 10-7). 
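
The dependence of the aggregate on molecular shape is often summarized, in the surfactant literature rather than in this chapter, by a dimensionless packing parameter p = v/(a·l), where v is the volume of the hydrophobic tails, a is the area of the head group, and l is the tail length. The sketch below applies the usual rule-of-thumb thresholds. The function names and the numerical inputs are hypothetical, chosen only to represent a cone-shaped and a cylinder-shaped molecule.

```python
def packing_parameter(tail_volume_nm3, head_area_nm2, tail_length_nm):
    """Dimensionless shape factor p = v / (a * l) of an amphiphilic molecule."""
    return tail_volume_nm3 / (head_area_nm2 * tail_length_nm)


def predicted_aggregate(p):
    """Rule-of-thumb mapping from packing parameter to aggregate type.

    The thresholds come from the standard packing argument used for
    surfactants; they are not given in the text.
    """
    if p <= 1 / 3:
        return "spherical micelle"    # cone-shaped molecule
    if p <= 1 / 2:
        return "cylindrical micelle"  # truncated cone
    if p <= 1:
        return "bilayer"              # roughly cylindrical molecule
    return "inverted phase"           # tails wider than the head group


# Hypothetical dimensions, for illustration only.
cone_shaped = packing_parameter(tail_volume_nm3=0.3, head_area_nm2=0.7, tail_length_nm=1.5)
cylinder_shaped = packing_parameter(tail_volume_nm3=0.9, head_area_nm2=0.6, tail_length_nm=1.8)

print(f"cone-shaped:     p = {cone_shaped:.2f} -> {predicted_aggregate(cone_shaped)}")
print(f"cylinder-shaped: p = {cylinder_shaped:.2f} -> {predicted_aggregate(cylinder_shaped)}")
```

A second hydrocarbon tail roughly doubles the tail volume without enlarging the head group much, which is one way to see why two-tailed phospholipids behave as cylinders and favor bilayers.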

The same forces that drive phospholipids to form bilayers also provide a 
self-sealing property. A small tear in the bilayer creates a free edge with water; 
because this is energetically unfavorable, the lipids tend to rearrange spontaneously 
to eliminate the free edge. (In eukaryotic plasma membranes, the fusion 
of intracellular vesicles repairs larger tears.) The prohibition of free edges has a 
profound consequence: the only way for a bilayer to avoid having edges is by closing 
in on itself and forming a sealed compartment (Figure 10-8). This remarkable 
behavior, fundamental to the creation of a living cell, follows directly from the 
shape and amphiphilic nature of the phospholipid molecule. 

Figure 10-5 Cholesterol in a lipid 
bilayer. Schematic drawing (to scale) of a 
cholesterol molecule interacting with two 
phospholipid molecules in one monolayer 
of a lipid bilayer. 

A lipid bilayer also has other characteristics that make it an ideal structure for 
cell membranes. One of the most important of these is its fluidity, which is crucial 
to many membrane functions (Movie 10.2). 

The Lipid Bilayer Is a Two-dimensional Fluid 

Around 1970, researchers first recognized that individual lipid molecules are able 
to diffuse freely within the plane of a lipid bilayer. The initial demonstration came 
from studies of synthetic (artificial) lipid bilayers, which can be made in the form 
of spherical vesicles, called liposomes (Figure 10-9), or in the form of planar 
bilayers formed across a hole in a partition between two aqueous compartments 
or on a solid support. 

Figure 10-6 How hydrophilic and 
hydrophobic molecules interact differently 
with water. (A) Because acetone is polar, 
it can form hydrogen bonds (red) and 
favorable electrostatic interactions (yellow) 
with water molecules, which are also polar. 
Thus, acetone readily dissolves in water. 
(B) By contrast, 2-methylpropane is entirely 
hydrophobic. Because it cannot form 
favorable interactions with water, it forces 
adjacent water molecules to reorganize into 
icelike cage structures, which increases the 
free energy. This compound is therefore 
virtually insoluble in water. The symbol 
δ⁻ indicates a partial negative charge, and 
δ⁺ indicates a partial positive charge. Polar 
atoms are shown in color and nonpolar 
groups are shown in gray. 

Figure 10-7 Packing arrangements of amphiphilic molecules in an aqueous environment. 
(A) These molecules spontaneously form micelles or bilayers in water, depending on their shape. 
Cone-shaped amphiphilic molecules (above) form micelles, whereas cylinder-shaped amphiphilic 
molecules such as phospholipids (below) form bilayers. (B) A micelle and a lipid bilayer seen in 
cross section. Note that micelles of amphiphilic molecules are thought to be much more irregular 
than drawn here (see Figure 10-26C). 

Figure 10-8 The spontaneous closure of 
a phospholipid bilayer to form a sealed 
compartment. The closed structure is 
stable because it avoids the exposure of 
the hydrophobic hydrocarbon tails to water, 
which would be energetically unfavorable. 

Figure 10-9 Liposomes. (A) An electron micrograph of unfixed, unstained, 
synthetic phospholipid vesicles (liposomes) in water, which have been 
rapidly frozen at liquid-nitrogen temperature. (B) A drawing of a small 
spherical liposome seen in cross section. Liposomes are commonly used as 
model membranes in experimental studies, especially to study incorporated 
membrane proteins. (A, from P. Frederik and D. Hubert, Methods Enzymol. 
391:431-448, 2005. With permission from Elsevier.) 

Various techniques have been used to measure the motion of individual lipid 
molecules and their components. One can construct a lipid molecule, for example, 
with a fluorescent dye or a small gold particle attached to its polar head group 
and follow the diffusion of even individual molecules in a membrane. Alternatively, 
one can modify a lipid head group to carry a “spin label,” such as a nitroxide 
group (=N-O); this contains an unpaired electron whose spin creates a paramagnetic 
signal that can be detected by electron spin resonance (ESR) spectroscopy, 
the principles of which are similar to those of nuclear magnetic resonance (NMR), 
discussed in Chapter 8. The motion and orientation of a spin-labeled lipid in a 
bilayer can be deduced from the ESR spectrum. Such studies show that phospholipid 
molecules in synthetic bilayers very rarely migrate from the monolayer (also 
called a leaflet) on one side to that on the other. This process, known as “flip-flop,” 
occurs on a time scale of hours for any individual molecule, although cholesterol 
is an exception to this rule and can flip-flop rapidly. In contrast, lipid molecules 
rapidly exchange places with their neighbors within a monolayer (~10⁷ times per 
second). This gives rise to a rapid lateral diffusion, with a diffusion coefficient (D) 
of about 10⁻⁸ cm²/sec, which means that an average lipid molecule diffuses the 
length of a large bacterial cell (~2 μm) in about 1 second. These studies have also 
shown that individual lipid molecules rotate very rapidly about their long axis and 
have flexible hydrocarbon chains. Computer simulations show that lipid molecules 
in synthetic bilayers are very disordered, presenting an irregular surface of 
variously spaced and oriented head groups to the water phase on either side of the 
bilayer (Figure 10-10). 
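
The diffusion figures quoted in this paragraph are consistent with the standard relation for diffusion in two dimensions, ⟨r²⟩ = 4Dt, which the text does not state explicitly and is assumed here. The sketch below uses it to recover the "about 1 second" estimate, and it also compares the rate of neighbor exchange with flip-flop. The flip-flop time of one hour is an illustrative value; the text says only "hours".

```python
import math

# Figures quoted in the text.
D_CM2_PER_SEC = 1e-8             # lateral diffusion coefficient of a lipid
BACTERIUM_LENGTH_UM = 2.0        # length of a large bacterial cell
NEIGHBOR_EXCHANGES_PER_SEC = 1e7

# Assumption (illustrative): one flip-flop per molecule per hour.
FLIP_FLOP_TIME_SEC = 3600.0

UM_PER_CM = 1e4


def time_to_diffuse_sec(distance_um, d_cm2_per_sec):
    """Time at which the 2-D root-mean-square displacement equals distance_um."""
    distance_cm = distance_um / UM_PER_CM
    return distance_cm ** 2 / (4 * d_cm2_per_sec)


def rms_displacement_um(time_sec, d_cm2_per_sec):
    """2-D root-mean-square displacement after time_sec, from <r^2> = 4Dt."""
    return math.sqrt(4 * d_cm2_per_sec * time_sec) * UM_PER_CM


crossing_time = time_to_diffuse_sec(BACTERIUM_LENGTH_UM, D_CM2_PER_SEC)
exchanges_per_flip_flop = NEIGHBOR_EXCHANGES_PER_SEC * FLIP_FLOP_TIME_SEC

print(f"Time to diffuse {BACTERIUM_LENGTH_UM} um: {crossing_time:.2f} s")
print(f"RMS displacement after 1 s: {rms_displacement_um(1.0, D_CM2_PER_SEC):.2f} um")
print(f"Neighbor exchanges per flip-flop: {exchanges_per_flip_flop:.1e}")
```

With these inputs a lipid covers 2 μm in about one second, and it swaps places with its neighbors on the order of 10¹⁰ times in the interval it would take to cross once to the other monolayer.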

Similar mobility studies on labeled lipid molecules in isolated biological 
membranes and in living cells give results similar to those in synthetic bilayers. 
They demonstrate that the lipid component of a biological membrane is a two-dimensional 
liquid in which the constituent molecules are free to move laterally. As 
in synthetic bilayers, individual phospholipid molecules are normally confined 
to their own monolayer. This confinement creates a problem for their synthesis. 
Phospholipid molecules are manufactured in only one monolayer of a membrane, 
mainly in the cytosolic monolayer of the endoplasmic reticulum membrane. If 
none of these newly made molecules could migrate reasonably promptly to the 
noncytosolic monolayer, new lipid bilayer could not be made. The problem is 
solved by a special class of membrane proteins called phospholipid translocators, 
or flippases, which catalyze the rapid flip-flop of phospholipids from one monolayer 
to the other, as discussed in Chapter 12. 

Despite the fluidity of the lipid bilayer, liposomes do not fuse spontaneously 
with one another when suspended in water. Fusion does not occur because the 
polar lipid head groups bind water molecules that need to be displaced for the 
bilayers of two different liposomes to fuse. The hydration shell that keeps liposomes 
apart also insulates the many internal membranes in a eukaryotic cell 
and prevents their uncontrolled fusion, thereby maintaining the compartmental 
integrity of membrane-enclosed organelles. All cell membrane fusion events 
are catalyzed by tightly regulated fusion proteins, which force appropriate membranes 
into tight proximity, squeezing out the water layer that keeps the bilayers 
apart, as we discuss in Chapter 13. 

Figure 10-10 The mobility of 
phospholipid molecules in an artificial 
lipid bilayer. (A) Starting with a model of 100 
phosphatidylcholine molecules arranged 
in a regular bilayer, a computer calculated 
the position of every atom after 300 
picoseconds of simulated time. From these 
theoretical calculations, a model of the lipid 
bilayer emerges that accounts for almost all 
of the measurable properties of a synthetic 
lipid bilayer, including its thickness, 
number of lipid molecules per membrane 
area, depth of water penetration, and 
unevenness of the two surfaces. Note that 
the tails in one monolayer can interact with 
those in the other monolayer, if the tails 
are long enough. (B) The different motions 
of a lipid molecule in a bilayer. (A, based 
on S.W. Chiu et al., Biophys. J. 69:1230-1245, 
1995. With permission from the 
Biophysical Society.) 

The Fluidity of a Lipid Bilayer Depends on Its Composition 

The fluidity of cell membranes has to be precisely regulated. Certain membrane 
transport processes and enzyme activities, for example, cease when the bilayer 
viscosity is experimentally increased beyond a threshold level. 

The fluidity of a lipid bilayer depends on both its composition and its temperature, 
as is readily demonstrated in studies of synthetic lipid bilayers. A synthetic 
bilayer made from a single type of phospholipid changes from a liquid state to a 
two-dimensional rigid crystalline (or gel) state at a characteristic temperature. This 
change of state is called a phase transition, and the temperature at which it occurs 
is lower (that is, the membrane becomes more difficult to freeze) if the hydrocarbon 
chains are short or have double bonds. A shorter chain length reduces the 
tendency of the hydrocarbon tails to interact with one another, in both the same 
and opposite monolayer, and cis-double bonds produce kinks in the chains that 
make them more difficult to pack together, so that the membrane remains fluid at 
lower temperatures (Figure 10-11). Bacteria, yeasts, and other organisms whose 
temperature fluctuates with that of their environment adjust the fatty acid composition 
of their membrane lipids to maintain a relatively constant fluidity. As the 
temperature falls, for instance, the cells of those organisms synthesize fatty acids 
with more cis-double bonds, thereby avoiding the decrease in bilayer fluidity that 
would otherwise result from the temperature drop. 
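
The effect of chain length and cis-double bonds on the phase transition can be illustrated with a small lookup. The transition temperatures below are approximate values commonly quoted for pure synthetic phosphatidylcholine bilayers. They are not taken from the text and should be checked against a primary source before being relied upon. The lipid abbreviations and the function name are likewise introduced here only for illustration.

```python
# Approximate main transition temperatures (degrees C) of single-lipid
# phosphatidylcholine bilayers. Illustrative values, not from the text.
APPROX_TRANSITION_TEMP_C = {
    "DMPC": 24,   # two saturated 14-carbon tails
    "DPPC": 41,   # two saturated 16-carbon tails
    "DSPC": 55,   # two saturated 18-carbon tails
    "DOPC": -17,  # two 18-carbon tails, each with one cis-double bond
}


def phase_at(lipid, temperature_c):
    """Return 'fluid' above the transition temperature and 'gel' at or below it."""
    return "fluid" if temperature_c > APPROX_TRANSITION_TEMP_C[lipid] else "gel"


for lipid, transition in APPROX_TRANSITION_TEMP_C.items():
    print(f"{lipid}: transition near {transition} C, {phase_at(lipid, 37)} at 37 C")
```

Among the saturated lipids the transition temperature falls as the tails get shorter, and introducing one cis-double bond per 18-carbon tail (DOPC versus DSPC) lowers it far more, which matches the qualitative trend described in the paragraph.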

Cholesterol modulates the properties of lipid bilayers. When mixed with phospholipids, 
it enhances the permeability-barrier properties of the lipid bilayer. 
Cholesterol inserts into the bilayer with its hydroxyl group close to the polar head 
groups of the phospholipids, so that its rigid, platelike steroid rings interact with, 
and partly immobilize, those regions of the hydrocarbon chains closest to the 
polar head groups (see Figure 10-5 and Movie 10.3). By decreasing the mobility of 
the first few CH₂ groups of the chains of the phospholipid molecules, cholesterol 
makes the lipid bilayer less deformable in this region and thereby decreases the 
permeability of the bilayer to small water-soluble molecules. Although cholesterol 
tightens the packing of the lipids in a bilayer, it does not make membranes 
any less fluid. At the high concentrations found in most eukaryotic plasma membranes, 
cholesterol also prevents the hydrocarbon chains from coming together 
and crystallizing. 

Table 10-1 compares the lipid compositions of several biological membranes. 
Note that bacterial plasma membranes are often composed of one main type 
of phospholipid and contain no cholesterol. In archaea, lipids usually contain 
20-25-carbon-long prenyl chains instead of fatty acids; prenyl and fatty acid 
chains are similarly hydrophobic and flexible (see Figure 10-20F); in thermophilic 
archaea, the longest lipid chains span both leaflets, making the membrane 
particularly stable to heat. Thus, lipid bilayers can be built from molecules with 
similar features but different molecular designs. The plasma membranes of most 
eukaryotic cells are more varied than those of prokaryotes and archaea, not only 
in containing large amounts of cholesterol but also in containing a mixture of different 
phospholipids. 

Figure 10-11 The influence of cis-double 
bonds in hydrocarbon chains. 
The double bonds make it more difficult to 
pack the chains together, thereby making 
the lipid bilayer more difficult to freeze. In 
addition, because the hydrocarbon chains 
of unsaturated lipids are more spread 
apart, lipid bilayers containing them are 
thinner than bilayers formed exclusively 
from saturated lipids. 

Table 10-1 Approximate Lipid Compositions of Different Cell Membranes 

Percentage of total lipid by weight 

Lipid                      Liver cell  Red blood    Myelin  Mitochondrion     Endoplasmic  E. coli
                           plasma      cell plasma          (inner and outer  reticulum    bacterium
                           membrane    membrane             membranes)
Cholesterol                17          23           22      3                 6            0
Phosphatidylethanolamine   7           18           15      28                17           70
Phosphatidylserine         4           7            9       2                 5            trace
Phosphatidylcholine        24          17           10      44                40           0
Sphingomyelin              19          18           8       0                 5            0
Glycolipids                7           3            28      trace             trace        0
Others                     22          14           8       23                27           30
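
The Table 10-1 values can be used to check the earlier statement that phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine, and sphingomyelin together make up more than half the lipid mass in most mammalian cell membranes. The sketch below transcribes the table and sums those four rows for each membrane. Entries listed as "trace" are treated as zero, which is a simplification made here.

```python
MEMBRANES = [
    "Liver cell plasma membrane",
    "Red blood cell plasma membrane",
    "Myelin",
    "Mitochondrion (inner and outer membranes)",
    "Endoplasmic reticulum",
    "E. coli bacterium",
]

# Percentage of total lipid by weight, one value per membrane above.
# "trace" entries in the table are entered as 0 (simplification).
PERCENT_BY_WEIGHT = {
    "Cholesterol":              [17, 23, 22, 3, 6, 0],
    "Phosphatidylethanolamine": [7, 18, 15, 28, 17, 70],
    "Phosphatidylserine":       [4, 7, 9, 2, 5, 0],
    "Phosphatidylcholine":      [24, 17, 10, 44, 40, 0],
    "Sphingomyelin":            [19, 18, 8, 0, 5, 0],
    "Glycolipids":              [7, 3, 28, 0, 0, 0],
    "Others":                   [22, 14, 8, 23, 27, 30],
}

MAJOR_PHOSPHOLIPIDS = [
    "Phosphatidylcholine",
    "Phosphatidylethanolamine",
    "Phosphatidylserine",
    "Sphingomyelin",
]

# Combined share of the four major phospholipids in each membrane.
major_share = {
    membrane: sum(PERCENT_BY_WEIGHT[lipid][column] for lipid in MAJOR_PHOSPHOLIPIDS)
    for column, membrane in enumerate(MEMBRANES)
}

# Only the first five columns are mammalian membranes.
for membrane in MEMBRANES[:5]:
    print(f"{membrane}: {major_share[membrane]}% of lipid mass")
```

The sums come to 54% for the liver cell plasma membrane, 60% for the red blood cell, 42% for myelin, 74% for the mitochondrion, and 67% for the endoplasmic reticulum, so myelin is the one listed mammalian membrane that falls below half.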

Analysis of membrane lipids by mass spectrometry has revealed that the lipid 
composition of a typical eukaryotic cell membrane is much more complex than 
originally thought. These membranes contain a bewildering variety of perhaps 
500-2000 different lipid species, with even the simple plasma membrane of a red 
blood cell containing well over 150. While some of this complexity reflects the 
combinatorial variation in head groups, hydrocarbon chain lengths, and desaturation 
of the major phospholipid classes, some membranes also contain many 
structurally distinct minor lipids, at least some of which have important functions. 
The inositol phospholipids, for example, are present in small quantities in animal 
cell membranes and have crucial functions in guiding membrane traffic and in 
cell signaling (discussed in Chapters 13 and 15, respectively). Their local synthesis 
and destruction are regulated by a large number of enzymes, which create both 
small intracellular signaling molecules and lipid docking sites on membranes that 
recruit specific proteins from the cytosol, as we discuss later. 
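
A rough count shows how quickly combinatorial variation alone produces hundreds of species. The sketch below combines three head groups with one saturated and one unsaturated tail, following the pattern described earlier in the chapter. Restricting the tails to even carbon numbers between 14 and 24, and allowing one to three cis-double bonds, are simplifying assumptions made here; the resulting total is illustrative rather than a measured figure.

```python
from itertools import product

# Head groups of the three most abundant phosphoglycerides named in the text.
HEAD_GROUPS = ["choline", "ethanolamine", "serine"]

# Tails contain between 14 and 24 carbons (from the text); limiting them to
# even numbers is an assumption made for this sketch.
TAIL_LENGTHS = list(range(14, 25, 2))

# Hypothetical range of cis-double bonds in the unsaturated tail.
CIS_DOUBLE_BONDS = [1, 2, 3]

saturated_tails = [(length, 0) for length in TAIL_LENGTHS]
unsaturated_tails = list(product(TAIL_LENGTHS, CIS_DOUBLE_BONDS))

# One saturated and one unsaturated tail per molecule.
species = list(product(HEAD_GROUPS, saturated_tails, unsaturated_tails))

print(f"{len(saturated_tails)} saturated tails x {len(unsaturated_tails)} unsaturated tails "
      f"x {len(HEAD_GROUPS)} head groups = {len(species)} species")
```

Even this restricted scheme yields 324 distinct phosphoglycerides, before sphingolipids, glycolipids, or minor lipids are counted.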


Despite Their Fluidity, Lipid Bilayers Can Form Domains of Different 
Compositions 

Because a lipid bilayer is a two-dimensional fluid, we might expect most types 
of lipid molecules in it to be well mixed and randomly distributed in their own 
monolayer. The van der Waals attractive forces between neighboring hydrocarbon 
tails are not selective enough to hold groups of phospholipid molecules together. 
With certain lipid mixtures in artificial bilayers, however, one can observe phase 
segregations in which specific lipids come together in separate domains 
(Figure 10-12). 

There has been a long debate among cell biologists about whether the lipid 
molecules in the plasma membrane of living cells similarly segregate into specialized 
domains, called lipid rafts. Although many lipids and membrane proteins 
are not distributed uniformly, large-scale lipid phase segregations are rarely 
seen in living cell membranes. Instead, specific membrane proteins and lipids are 
seen to concentrate in a more temporary, dynamic fashion facilitated by protein-protein 
interactions that allow the transient formation of specialized membrane 
regions (Figure 10-13). Such clusters can be tiny nanoclusters on a scale of a few 
molecules, or larger assemblies that can be seen with electron microscopy, such 
as the caveolae involved in endocytosis (discussed in Chapter 13). The tendency 
of mixtures of lipids to undergo phase partitioning, as seen in artificial bilayers 
(see Figure 10-12), may help create rafts in living cell membranes, organizing 
and concentrating membrane proteins either for transport in membrane vesicles 
(discussed in Chapter 13) or for working together in protein assemblies, such 
as when they convert extracellular signals into intracellular ones (discussed in 
Chapter 15). 

Figure 10-12 Lateral phase separation 
in artificial lipid bilayers. (A) Giant 
liposomes produced from a 1:1 mixture 
of phosphatidylcholine and sphingomyelin 
form uniform bilayers. (B) By contrast, 
liposomes produced from a 1:1:1 mixture 
of phosphatidylcholine, sphingomyelin, 
and cholesterol form bilayers with two 
separate phases. The liposomes are 
stained with trace concentrations of a 
fluorescent dye that preferentially partitions 
into one of the two phases. The average 
size of the domains formed in these giant 
artificial liposomes is much larger than that 
expected in cell membranes, where “lipid 
rafts” (see text) may be as small as a few 
nanometers in diameter. (A, from N. Kahya 
et al., J. Struct. Biol. 147:77-89, 2004. 
With permission from Elsevier; B, courtesy 
of Petra Schwille.) 


Figure 10-13 A model of a raft domain. 
Weak protein-protein, protein-lipid, and 
lipid-lipid interactions reinforce one another 
to partition the interacting components into 
raft domains. Cholesterol, sphingolipids, 
glycolipids, glycosylphosphatidylinositol 
(GPI)-anchored proteins, and some 
transmembrane proteins are enriched 
in these domains. Note that because of 
their composition, raft domains have an 
increased membrane thickness. We discuss 
glycolipids, GPI-anchored proteins, and 
oligosaccharide linkers later. (Adapted 
from D. Lingwood and K. Simons, Science 
327:46-50, 2010.) 


Lipid Droplets Are Surrounded by a Phospholipid Monolayer 

Most cells store an excess of lipids in lipid droplets, from where they can be 
retrieved as building blocks for membrane synthesis or as a food source. Fat 
cells, or adipocytes, are specialized for lipid storage. They contain a giant lipid 
droplet that fills up most of their cytoplasm. Most other cells have many smaller 
lipid droplets, the number and size varying with the cell's metabolic state. Fatty 
acids can be liberated from lipid droplets on demand and exported to other cells 
through the bloodstream. Lipid droplets store neutral lipids, such as triacylglycerols 
and cholesterol esters, which are synthesized from fatty acids and cholesterol 
by enzymes in the endoplasmic reticulum membrane. Because these lipids do not 
contain hydrophilic head groups, they are exclusively hydrophobic molecules, 
and therefore aggregate into three-dimensional droplets rather than into bilayers. 

Lipid droplets are unique organelles in that they are surrounded by a single 
monolayer of phospholipids, which contains a large variety of proteins. Some of 
the proteins are enzymes involved in lipid metabolism, but the functions of most 
are unknown. Lipid droplets form rapidly when cells are exposed to high concentrations 
of fatty acids. They are thought to form from discrete regions of the 
endoplasmic reticulum membrane where many enzymes of lipid metabolism are 
concentrated. Figure 10-14 shows one model of how lipid droplets may form and 
acquire their surrounding monolayer of phospholipids and proteins. 


The Asymmetry of the Lipid Bilayer Is Functionally Important 

The lipid compositions of the two monolayers of the lipid bilayer in many membranes 
are strikingly different. In the human red blood cell (erythrocyte) membrane, 
for example, almost all of the phospholipid molecules that have choline, 
(CH₃)₃N⁺CH₂CH₂OH, in their head group (phosphatidylcholine and 
sphingomyelin) are in the outer monolayer, whereas almost all that contain a 
terminal primary amino group (phosphatidylethanolamine and phosphatidylserine) 
are in the inner monolayer (Figure 10-15). Because the negatively charged 
phosphatidylserine is located in the inner monolayer, there is a significant difference 
in charge between the two halves of the bilayer. We discuss in Chapter 12 
how membrane-bound phospholipid translocators generate and maintain lipid 
asymmetry. 

Figure 10-14 A model for the formation of 
lipid droplets. Neutral lipids are deposited 
between the two monolayers of the 
endoplasmic reticulum membrane. There, 
they aggregate into a three-dimensional 
droplet, which buds and pinches off from 
the endoplasmic reticulum membrane as 
a unique organelle, surrounded by a single 
monolayer of phospholipids and associated 
proteins. (Adapted from S. Martin and R.G. 
Parton, Nat. Rev. Mol. Cell Biol. 7:373-378, 
2006. With permission from Macmillan 
Publishers Ltd.) 

Figure 10-15 The asymmetrical 
distribution of phospholipids and 
glycolipids in the lipid bilayer of human 
red blood cells. The colors used for 
the phospholipid head groups are those 
introduced in Figure 10-3. In addition, 
glycolipids are drawn with hexagonal 
polar head groups (blue). Cholesterol (not 
shown) is distributed roughly equally in both 
monolayers. 

Lipid asymmetry is functionally important, especially in converting extracellular 
signals into intracellular ones (discussed in Chapter 15). Many cytosolic 
proteins bind to specific lipid head groups found in the cytosolic monolayer of 
the lipid bilayer. The enzyme protein kinase C (PKC), for example, which is activated 
in response to various extracellular signals, binds to the cytosolic face of the 
plasma membrane, where phosphatidylserine is concentrated, and requires this 
negatively charged phospholipid for its activity. 

In other cases, specific lipid head groups must first be modified to create protein-binding 
sites at a particular time and place. One example is phosphatidylinositol 
(PI), one of the minor phospholipids that are concentrated in the cytosolic 
monolayer of cell membranes (see Figure 13-10A-C). Various lipid kinases can 
add phosphate groups at distinct positions on the inositol ring, creating binding 
sites that recruit specific proteins from the cytosol to the membrane. An important 
example of such a lipid kinase is phosphoinositide 3-kinase (PI 3-kinase), which is 
activated in response to extracellular signals and helps to recruit specific intracellular 
signaling proteins to the cytosolic face of the plasma membrane (see Figure 
15-53). Similar lipid kinases phosphorylate inositol phospholipids in intracellular 
membranes and thereby help to recruit proteins that guide membrane transport. 

Phospholipids in the plasma membrane are used in yet another way to convert 
extracellular signals into intracellular ones. The plasma membrane contains 
various phospholipases that are activated by extracellular signals to cleave specific 
phospholipid molecules, generating fragments of these molecules that act 
as short-lived intracellular mediators. Phospholipase C, for example, cleaves an 
inositol phospholipid in the cytosolic monolayer of the plasma membrane to generate 
two fragments, one of which remains in the membrane and helps activate 
protein kinase C, while the other is released into the cytosol and stimulates the 
release of Ca²⁺ from the endoplasmic reticulum (see Figure 15-28). 

Animals exploit the phospholipid asymmetry of their plasma membranes to 
distinguish between live and dead cells. When animal cells undergo apoptosis (a 
form of programmed cell death, discussed in Chapter 18), phosphatidylserine, 
which is normally confined to the cytosolic (or inner) monolayer of the plasma 
membrane lipid bilayer, rapidly translocates to the extracellular (or outer) monolayer. 
The phosphatidylserine exposed on the cell surface signals neighboring 
cells, such as macrophages, to phagocytose the dead cell and digest it. The translocation 
of the phosphatidylserine in apoptotic cells is thought to occur by two 
mechanisms: 

1. The phospholipid translocator that normally transports this lipid from the 
outer monolayer to the inner monolayer is inactivated. 

2. A “scramblase” that transfers phospholipids nonspecifically in both directions 
between the two monolayers is activated. 

Glycolipids Are Found on the Surface of All Eukaryotic 
Plasma Membranes 

Sugar-containing lipid molecules called glycolipids have the most extreme asymmetry 
in their membrane distribution: these molecules, whether in the plasma 
membrane or in intracellular membranes, are found exclusively in the monolayer 
facing away from the cytosol. In animal cells, they are made from sphingosine, just 
like sphingomyelin (see Figure 10-3). These intriguing molecules tend to self-associate, 
partly through hydrogen bonds between their sugars and partly through 
van der Waals forces between their long and straight hydrocarbon chains, which 
causes them to partition preferentially into lipid raft phases (see Figure 10-13). 
The asymmetric distribution of glycolipids in the bilayer results from the addition 
of sugar groups to the lipid molecules in the lumen of the Golgi apparatus. 
Thus, the compartment in which they are manufactured is topologically equivalent 
to the exterior of the cell (discussed in Chapter 12). As they are delivered 
to the plasma membrane, the sugar groups are exposed at the cell surface (see 
Figure 10-15), where they have important roles in interactions of the cell with its 
surroundings. 

Glycolipids probably occur in all eukaryotic cell plasma membranes, where 
they generally constitute about 5% of the lipid molecules in the outer monolayer. 
They are also found in some intracellular membranes. The most complex of the 
glycolipids, the gangliosides, contain oligosaccharides with one or more sialic 
acid moieties, which give gangliosides a net negative charge (Figure 10-16). The 
most abundant of the more than 40 different gangliosides that have been identified 
are in the plasma membrane of nerve cells, where gangliosides constitute 
5-10% of the total lipid mass; they are also found in much smaller quantities in 
other cell types. 

Hints as to the functions of glycolipids come from their localization. In the 
plasma membrane of epithelial cells, for example, glycolipids are confined to the 
exposed apical surface, where they may help to protect the membrane against 
the harsh conditions frequently found there (such as low pH and high concentrations 
of degradative enzymes). Charged glycolipids, such as gangliosides, may 
be important because of their electrical effects: their presence alters the electrical 
field across the membrane and the concentrations of ions (especially Ca²⁺) at 
the membrane surface. Glycolipids also function in cell-recognition processes, 



Figure 10-16 Glycolipid molecules. 
Panels: (A) galactocerebroside; (B) GM1 ganglioside; (C) a sialic acid (NANA). 

(A) Galactocerebroside is called a neutral 
glycolipid because the sugar that forms its 
head group is uncharged. (B) A ganglioside 
always contains one or more negatively 
charged sialic acid moieties. There are 
various types of sialic acid; in human cells, 
it is mostly N-acetylneuraminic acid (NANA), 
whose structure is shown in (C). 
Whereas in bacteria and plants almost all 
glycolipids are derived from glycerol, as are 
most phospholipids, in animal cells almost 
all glycolipids are based on sphingosine, as 
is the case for sphingomyelin (see Figure 
10-3). Gal = galactose; Glc = glucose; 
GalNAc = N-acetylgalactosamine; these 
three sugars are uncharged. 
in which membrane-bound carbohydrate-binding proteins (lectins) bind to the 
sugar groups on both glycolipids and glycoproteins in the process of cell-cell 
adhesion (discussed in Chapter 19). Mutant mice that are deficient in all of their 
complex gangliosides show abnormalities in the nervous system, including axonal 
degeneration and reduced myelination. 

Some glycolipids provide entry points for certain bacterial toxins and viruses. 
The ganglioside GM1 (see Figure 10-16), for example, acts as a cell-surface receptor 
for the bacterial toxin that causes the debilitating diarrhea of cholera. Cholera 
toxin binds to and enters only those cells that have GM1 on their surface, including 
intestinal epithelial cells. Its entry into a cell leads to a prolonged increase in the 
concentration of intracellular cyclic AMP (discussed in Chapter 15), which in turn 
causes a large efflux of Cl⁻, leading to the secretion of Na⁺, K⁺, HCO₃⁻, and water 
into the intestine. Polyomaviruses also enter the cell after binding initially to 
gangliosides. 

Summary 

Biological membranes consist of a continuous double layer of lipid molecules in 
which membrane proteins are embedded. This lipid bilayer is fluid, with individual 
lipid molecules able to diffuse rapidly within their own monolayer. The membrane 
lipid molecules are amphiphilic. When placed in water, they assemble spontaneously 
into bilayers, which form sealed compartments. 

Although cell membranes can contain hundreds of different lipid species, the 
plasma membrane in animal cells contains three major classes—phospholipids, 
cholesterol, and glycolipids. Because of their different backbone structure, 
phospholipids fall into two subclasses: phosphoglycerides and sphingolipids. The lipid 
compositions of the inner and outer monolayers are different, reflecting the different 
functions of the two faces of a cell membrane. Different mixtures of lipids are found 
in the membranes of cells of different types, as well as in the various membranes of 
a single eukaryotic cell. Inositol phospholipids are a minor class of phospholipids, 
which in the cytosolic leaflet of the plasma membrane lipid bilayer play an important 
part in cell signaling: in response to extracellular signals, specific lipid kinases 
phosphorylate the head groups of these lipids to form docking sites for cytosolic 
signaling proteins, whereas specific phospholipases cleave certain inositol 
phospholipids to generate small intracellular signaling molecules. 

MEMBRANE PROTEINS 

Although the lipid bilayer provides the basic structure of biological membranes, 
the membrane proteins perform most of the membrane's specific tasks and 
therefore give each type of cell membrane its characteristic functional properties. 
Accordingly, the amounts and types of proteins in a membrane are highly variable. 
In the myelin membrane, which serves mainly as electrical insulation for 
nerve-cell axons, less than 25% of the membrane mass is protein. By contrast, in 
the membranes involved in ATP production (such as the internal membranes of 
mitochondria and chloroplasts), approximately 75% is protein. A typical plasma 
membrane is somewhere in between, with protein accounting for about half of its 
mass. Because lipid molecules are small compared with protein molecules, however, 
there are always many more lipid molecules than protein molecules in cell 
membranes: about 50 lipid molecules for each protein molecule in cell membranes 
that are 50% protein by mass. Membrane proteins vary widely in structure 
and in the way they associate with the lipid bilayer, which reflects their diverse 
functions. 
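
The figure of about 50 lipids per protein follows from simple arithmetic once typical molecular masses are assumed. The sketch below is illustrative only: the masses (roughly 1,000 Da per lipid and 50,000 Da per protein) are assumed round numbers, not values given in the text, and the function name is hypothetical.

```python
def lipids_per_protein(protein_mass_fraction,
                       lipid_mass_da=1_000.0,      # assumed average lipid mass
                       protein_mass_da=50_000.0):  # assumed average protein mass
    """Molar ratio of lipid to protein molecules for a given protein mass fraction."""
    if not 0.0 < protein_mass_fraction < 1.0:
        raise ValueError("protein_mass_fraction must be strictly between 0 and 1")
    lipid_mass_fraction = 1.0 - protein_mass_fraction
    # Moles are proportional to mass divided by molecular mass.
    moles_lipid = lipid_mass_fraction / lipid_mass_da
    moles_protein = protein_mass_fraction / protein_mass_da
    return moles_lipid / moles_protein


for label, fraction in [("25% protein (myelin-like upper bound)", 0.25),
                        ("50% protein (typical plasma membrane)", 0.50),
                        ("75% protein (ATP-producing membrane)", 0.75)]:
    print(f"{label}: about {lipids_per_protein(fraction):.0f} lipids per protein")
```

Under these assumptions, even a membrane that is three-quarters protein by mass still contains many more lipid molecules than protein molecules.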

Membrane Proteins Can Be Associated with the Lipid Bilayer in 
Various Ways 

Figure 10-17 shows the different ways in which proteins can associate with the 
membrane. Like their lipid neighbors, membrane proteins are amphiphilic, 
having hydrophobic and hydrophilic regions. Many membrane proteins extend 
through the lipid bilayer, and hence are called transmembrane proteins, with 
part of their mass on either side (Figure 10-17, examples 1, 2, and 3). Their 
hydrophobic regions pass through the membrane and interact with the hydrophobic 
tails of the lipid molecules in the interior of the bilayer, where they are sequestered 
away from water. Their hydrophilic regions are exposed to water on either 
side of the membrane. The covalent attachment of a fatty acid chain that inserts 
into the cytosolic monolayer of the lipid bilayer increases the hydrophobicity of 
some of these transmembrane proteins (see Figure 10-17, example 1). 

Other membrane proteins are located entirely in the cytosol and are attached 
to the cytosolic monolayer of the lipid bilayer, either by an amphiphilic α helix 
exposed on the surface of the protein (Figure 10-17, example 4) or by one or more 
covalently attached lipid chains (Figure 10-17, example 5). Yet other membrane 
proteins are entirely exposed at the external cell surface, being attached to the lipid 
bilayer only by a covalent linkage (via a specific oligosaccharide) to a lipid anchor 
in the outer monolayer of the plasma membrane (Figure 10-17, example 6). 

The lipid-linked proteins in example 5 in Figure 10-17 are made as soluble 
proteins in the cytosol and are subsequently anchored to the membrane by the 
covalent attachment of the lipid group. The proteins in example 6, however, are made 
as single-pass membrane proteins in the endoplasmic reticulum (ER). While 
still in the ER, the transmembrane segment of the protein is cleaved off and a 
glycosylphosphatidylinositol (GPI) anchor is added, leaving the protein bound 
to the noncytosolic surface of the ER membrane solely by this anchor (discussed 
in Chapter 12); transport vesicles eventually deliver the protein to the plasma 
membrane (discussed in Chapter 13). 

By contrast to these examples, membrane-associated proteins do not extend 
into the hydrophobic interior of the lipid bilayer at all; they are instead bound to 
either face of the membrane by noncovalent interactions with other membrane 
proteins (Figure 10-17, examples 7 and 8). Many of the proteins of this type can 
be released from the membrane by relatively gentle extraction procedures, such 
as exposure to solutions of very high or low ionic strength or of extreme pH, which 
interfere with protein-protein interactions but leave the lipid bilayer intact; these 
proteins are often referred to as peripheral membrane proteins. Transmembrane 
proteins and many proteins held in the bilayer by lipid groups or hydrophobic 
polypeptide regions that insert into the hydrophobic core of the lipid bilayer cannot 
be released in these ways. 

Lipid Anchors Control the Membrane Localization of Some 
Signaling Proteins 

How a membrane protein is associated with the lipid bilayer reflects the function 
of the protein. Only transmembrane proteins can function on both sides of 


Figure 10-17 Various ways in which 
proteins associate with the lipid bilayer. 

Most membrane proteins are thought to 
extend across the bilayer as (1) a single 
α helix, (2) as multiple α helices, or (3) 
as a rolled-up β sheet (a β barrel). Some 
of these “single-pass” and “multipass” 
proteins have a covalently attached 
fatty acid chain inserted in the cytosolic 
lipid monolayer (1). Other membrane 
proteins are exposed at only one side 
of the membrane. (4) Some of these are 
anchored to the cytosolic surface by an 
amphiphilic α helix that partitions into the 
cytosolic monolayer of the lipid bilayer 
through the hydrophobic face of the helix. 
(5) Others are attached to the bilayer solely 
by a covalently bound lipid chain—either 
a fatty acid chain or a prenyl group (see 
Figure 10-18)—in the cytosolic monolayer 
or, (6) via an oligosaccharide linker, to 
phosphatidylinositol in the noncytosolic 
monolayer—called a GPI anchor. (7, 8) 
Finally, membrane-associated proteins 
are attached to the membrane only 
by noncovalent interactions with other 
membrane proteins. The way in which 
the structure in (5) is formed is illustrated 
in Figure 10-18, while the way in which 
the GPI anchor shown in (6) is formed is 
illustrated in Figure 12-52. The details 
of how membrane proteins become 
associated with the lipid bilayer are 
discussed in Chapter 12. 
the bilayer or transport molecules across it. Cell-surface receptors, for example, 
are usually transmembrane proteins that bind signal molecules in the extracellular 
space and generate different intracellular signals on the opposite side of the 
plasma membrane. To transfer small hydrophilic molecules across a membrane, 
a membrane transport protein must provide a path for the molecules to cross the 
hydrophobic permeability barrier of the lipid bilayer; the molecular architecture 
of multipass transmembrane proteins (Figure 10-17, examples 2 and 3) is ideally 
suited for this task, as we discuss in Chapter 11. 

Proteins that function on only one side of the lipid bilayer, by contrast, are 
often associated exclusively with either the lipid monolayer or a protein domain 
on that side. Some intracellular signaling proteins, for example, that help relay 
extracellular signals into the cell interior are bound to the cytosolic half of the 
plasma membrane by one or more covalently attached lipid groups, which can 
be fatty acid chains or prenyl groups (Figure 10-18). In some cases, myristic acid, 
a saturated 14-carbon fatty acid, is added to the N-terminal amino group of the 
protein during its synthesis on a ribosome. All members of the Src family of 
cytoplasmic protein tyrosine kinases (discussed in Chapter 15) are myristoylated in 
this way. Membrane attachment through a single lipid anchor is not very strong, 
however, and a second lipid group is often added to anchor proteins more firmly 
to a membrane. For most Src kinases, the second lipid modification is the attachment 
of palmitic acid, a saturated 16-carbon fatty acid, to a cysteine side chain of 
the protein. This modification occurs in response to an extracellular signal and 
helps recruit the kinases to the plasma membrane. When the signaling pathway is 
turned off, the palmitic acid is removed, allowing the kinase to return to the cytosol. 
Other intracellular signaling proteins, such as the Ras family small GTPases 
(discussed in Chapter 15), use a combination of prenyl group and palmitic acid 
attachment to recruit the proteins to the plasma membrane. 

Many proteins attach to membranes transiently. Some are classical peripheral 
membrane proteins that associate with membranes by regulated protein-protein 
interactions. Others undergo a transition from soluble to membrane protein 
by a conformational change that exposes a hydrophobic peptide or covalently 
attached lipid anchor. Many of the small GTPases of the Rab protein family that 
regulate intracellular membrane traffic (discussed in Chapter 13), for example, 
switch depending on the nucleotide that is bound to the protein. In their GDP-bound 
state they are soluble and free in the cytosol, whereas in their GTP-bound 
state their lipid anchor is exposed and tethers them to membranes. They are 


Figure 10-18 Membrane protein 
attachment by a fatty acid chain or a 
prenyl group. The covalent attachment 
of either type of lipid can help localize a 
water-soluble protein to a membrane after 
its synthesis in the cytosol. (A) A fatty acid 
chain (myristic acid) is attached via an 
amide linkage to an N-terminal glycine. 

(B) A fatty acid chain (palmitic acid) 
is attached via a thioester linkage to 
a cysteine. (C) A prenyl chain (either 
farnesyl or a longer geranylgeranyl chain) 
is attached via a thioether linkage to a 
cysteine residue that is initially located four 
residues from the protein’s C-terminus. 
After prenylation, the terminal three 
amino acids are cleaved off, and the new 
C-terminus is methylated before insertion 
of the anchor into the membrane (not 
shown). The structures of the lipid anchors 
are shown below: (D) a myristoyl anchor 
(derived from a 14-carbon saturated 
fatty acid chain), (E) a palmitoyl anchor 
(a 16-carbon saturated fatty acid chain), 
and (F) a farnesyl anchor (a 15-carbon 
unsaturated hydrocarbon chain). 
Figure 10-19 A segment of a membrane-spanning polypeptide chain 
crossing the lipid bilayer as an α helix. Only the α-carbon backbone 
of the polypeptide chain is shown, with the hydrophobic amino acids in 
green and yellow. The polypeptide segment shown is part of the bacterial 
photosynthetic reaction center, the structure of which was determined by 
x-ray diffraction. (Based on data from J. Deisenhofer et al., Nature 318:618-624, 
1985, and H. Michel et al., EMBO J. 5:1149-1158, 1986.) 


membrane proteins at one moment and soluble proteins at the next. Such highly 
dynamic interactions greatly expand the repertoire of membrane functions. 
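
The nucleotide-dependent switch described above for Rab-type GTPases can be pictured as a two-state machine. The following is a deliberately minimal toy model, not a description of any real software or of the detailed biochemistry; the class and method names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SmallGTPase:
    """Toy two-state model of a nucleotide-dependent membrane switch."""
    nucleotide: str = "GDP"  # either "GDP" or "GTP"

    @property
    def anchor_exposed(self) -> bool:
        # In this toy model the lipid anchor is exposed only when GTP is bound.
        return self.nucleotide == "GTP"

    @property
    def location(self) -> str:
        return "membrane" if self.anchor_exposed else "cytosol"

    def exchange_gdp_for_gtp(self) -> None:
        self.nucleotide = "GTP"

    def hydrolyze_gtp(self) -> None:
        self.nucleotide = "GDP"


rab = SmallGTPase()
print(rab.nucleotide, "->", rab.location)
rab.exchange_gdp_for_gtp()
print(rab.nucleotide, "->", rab.location)
rab.hydrolyze_gtp()
print(rab.nucleotide, "->", rab.location)
```

The same protein object is a soluble protein in one state and a membrane protein in the other, which is the point the text makes about transient attachment.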

In Most Transmembrane Proteins, the Polypeptide Chain Crosses 
the Lipid Bilayer in an α-Helical Conformation 

A transmembrane protein always has a unique orientation in the membrane. This 
reflects both the asymmetric manner in which it is inserted into the lipid bilayer 
in the ER during its biosynthesis (discussed in Chapter 12) and the different 
functions of its cytosolic and noncytosolic domains. These domains are separated by 
the membrane-spanning segments of the polypeptide chain, which contact the 
hydrophobic environment of the lipid bilayer and are composed largely of amino 
acids with nonpolar side chains. Because the peptide bonds themselves are polar 
and because water is absent, all peptide bonds in the bilayer are driven to form 
hydrogen bonds with one another. The hydrogen-bonding between peptide 
bonds is maximized if the polypeptide chain forms a regular α helix as it crosses 
the bilayer, and this is how most membrane-spanning segments of polypeptide 
chains traverse the bilayer (Figure 10-19). 

In single-pass transmembrane proteins, the polypeptide chain crosses only 
once (see Figure 10-17, example 1), whereas in multipass transmembrane 
proteins, the polypeptide chain crosses multiple times (see Figure 10-17, example 2). 
An alternative way for the peptide bonds in the lipid bilayer to satisfy their 
hydrogen-bonding requirements is for multiple transmembrane strands of a 
polypeptide chain to be arranged as a β sheet that is rolled up into a cylinder (a so-called 
β barrel; see Figure 10-17, example 3). This protein architecture is seen in the 
porin proteins that we discuss later. 
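
A rough geometric estimate shows how many residues each conformation needs to cross the bilayer. The sketch below assumes a hydrophobic core about 30 Å thick and uses the standard rise per residue of an α helix (1.5 Å) and an approximate value for an extended β strand (about 3.3 Å). The core thickness is an assumed round figure, and the estimate ignores any tilt of the helix or strand relative to the membrane, so real segments can be longer.

```python
import math

HYDROPHOBIC_CORE_ANGSTROM = 30.0  # assumed round figure for the bilayer core

# Approximate distance advanced along the chain axis per residue, in angstroms.
RISE_PER_RESIDUE = {
    "alpha helix": 1.5,
    "beta strand": 3.3,
}


def residues_to_span(conformation, thickness=HYDROPHOBIC_CORE_ANGSTROM):
    """Minimum whole number of residues needed to span the given thickness."""
    return math.ceil(thickness / RISE_PER_RESIDUE[conformation])


for conformation in RISE_PER_RESIDUE:
    print(f"{conformation}: at least {residues_to_span(conformation)} residues")
```

With these numbers an α helix needs about 20 residues and an extended strand about 10, so a strand crosses the bilayer with roughly half as many residues as a helix.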

Progress in the x-ray crystallography of membrane proteins has enabled the 
determination of the three-dimensional structure of many of them. The structures 
confirm that it is often possible to predict from the protein's amino acid sequence 
which parts of the polypeptide chain extend across the lipid bilayer. Segments 
containing about 20-30 amino acids, with a high degree of hydrophobicity, are 
long enough to span a lipid bilayer as an α helix, and they can often be identified 
in hydropathy plots (Figure 10-20). From such plots, it is estimated that about 30% 

Figure 10-20 Using hydropathy plots to 
localize potential α-helical membrane-spanning 
segments in a polypeptide 
chain. Panels: (A) glycophorin; (B) bacteriorhodopsin. 
The free energy needed to transfer 
successive segments of a polypeptide 
chain from a nonpolar solvent to water is 
calculated from the amino acid composition 
of each segment using data obtained from 
model compounds. This calculation is 
made for segments of a fixed size (usually 
around 10-20 amino acids), beginning with 
each successive amino acid in the chain. 
The “hydropathy index” of the segment 
is plotted on the Y axis as a function of 
its location in the chain. A positive value 
indicates that free energy is required for 
transfer to water (i.e., the segment is 
hydrophobic), and the value assigned is 
an index of the amount of energy needed. 
Peaks in the hydropathy index appear at 
the positions of hydrophobic segments 
in the amino acid sequence. (A and B) 
Hydropathy plots for two membrane 
proteins that are discussed later in this 
chapter. Glycophorin (A) has a single 
membrane-spanning α helix, and one 
corresponding peak in the hydropathy 
plot. Bacteriorhodopsin (B) has seven 
membrane-spanning α helices and seven 
corresponding peaks in the hydropathy 
plot. (A, adapted from D. Eisenberg, Annu. 
Rev. Biochem. 53:595-624, 1984. With 
permission from Annual Reviews.) 











of an organism's proteins are transmembrane proteins, emphasizing their importance. 
Hydropathy plots cannot identify the membrane-spanning segments of a 
β barrel, as 10 amino acids or fewer are sufficient to traverse a lipid bilayer as an 
extended β strand and only every other amino acid side chain is hydrophobic. 
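
The sliding-window calculation behind a hydropathy plot can be sketched in a few lines. This example is a sketch under stated assumptions rather than the method used for Figure 10-20: it substitutes the widely used Kyte-Doolittle scale for the transfer free energies described in the figure, uses a 19-residue window and a threshold of 1.6 (conventional choices for that scale), and runs on a synthetic sequence that is not a real protein. The function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (positive values are hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=19):
    """Return (start_index, mean hydropathy) for every window of fixed size."""
    sequence = sequence.upper()
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in sequence]
    running = sum(scores[:window])
    profile = [(0, running / window)]
    for start in range(1, len(scores) - window + 1):
        # Slide the window by one residue: add the new one, drop the old one.
        running += scores[start + window - 1] - scores[start - 1]
        profile.append((start, running / window))
    return profile


def candidate_segments(profile, window, threshold=1.6):
    """Merge consecutive above-threshold windows into (start, end) ranges, end exclusive."""
    segments, current = [], None
    for start, score in profile:
        if score >= threshold:
            if current is None:
                current = [start, start + window]
            else:
                current[1] = start + window
        elif current is not None:
            segments.append(tuple(current))
            current = None
    if current is not None:
        segments.append(tuple(current))
    return segments


# Synthetic sequence (NOT a real protein): polar flanks around a hydrophobic stretch.
seq = "MSDKTEQRNSGE" + "LLIVALLAVIGLLVFAILLVA" + "RKDESNQTPEGK"
profile = hydropathy_profile(seq, window=19)
print(candidate_segments(profile, window=19))
```

Because the score is averaged over a window sized for an α helix, a short β strand in which only alternate side chains are hydrophobic produces no comparable peak, which is why this approach misses β barrels.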

The strong drive to maximize hydrogen-bonding in the absence of water 
means that a polypeptide chain that enters the lipid bilayer is likely to pass entirely 
through it before changing direction, since chain bending requires a loss of regular 
hydrogen-bonding interactions. But multipass transmembrane proteins can 
also contain regions that fold into the membrane from either side, squeezing into 
spaces between transmembrane α helices without contacting the hydrophobic 
core of the lipid bilayer. Because such regions interact only with other polypeptide 
regions, they do not need to maximize hydrogen-bonding; they can therefore 
have a variety of secondary structures, including helices that extend only part way 
across the lipid bilayer (Figure 10-21). Such regions are important for the function 
of some membrane proteins, including water channel and ion channel proteins, 
in which the regions contribute to the walls of the pores traversing the membrane 
and confer substrate specificity on the channels, as we discuss in Chapter 
11. These regions cannot be identified in hydropathy plots and are only revealed 
by x-ray crystallography or electron crystallography (a technique similar to x-ray 
diffraction but performed on two-dimensional arrays of proteins) of the protein's 
three-dimensional structure. 

Transmembrane α Helices Often Interact with One Another 

The transmembrane α helices of many single-pass membrane proteins do not 
contribute to the folding of the protein domains on either side of the membrane. 
As a consequence, it is often possible to engineer cells to produce just the cytosolic 
or extracellular domains of these proteins as water-soluble molecules. This 
approach has been invaluable for studying the structure and function of these 
domains, especially the domains of transmembrane receptor proteins (discussed 
in Chapter 15). A transmembrane α helix, even in a single-pass membrane protein, 
however, often does more than just anchor the protein to the lipid bilayer. 
Many single-pass membrane proteins form homo- or heterodimers that are held 
together by noncovalent, but strong and highly specific, interactions between the 
two transmembrane α helices; the sequence of the hydrophobic amino acids of 
these helices contains the information that directs the protein-protein interaction. 

Similarly, the transmembrane α helices in multipass membrane proteins 
occupy specific positions in the folded protein structure that are determined by 
interactions between the neighboring helices. These interactions are crucial for 
the structure and function of the many channels and transporters that move 
molecules across cell membranes. 

In these proteins, neighboring transmembrane helices in the folded structure 
of the protein shield many of the other transmembrane helices from the membrane 
lipids. Why, then, are these shielded helices nevertheless composed primarily 
of hydrophobic amino acids? The answer lies in the way in which multipass 
proteins are integrated into the membrane during their biosynthesis. As we 
discuss in Chapter 12, transmembrane α helices are inserted into the lipid bilayer 
sequentially by a protein translocator. After leaving the translocator, each helix 
is transiently surrounded by lipids, which requires that the helix be hydrophobic. 
It is only as the protein folds up into its final structure that contacts are made 
between adjacent helices, and protein-protein contacts replace some of the 
protein-lipid contacts (Figure 10-22). 



Figure 10-21 Two short α helices in the 
aquaporin water channel, each of which 
spans only halfway through the lipid 
bilayer. In the plasma membrane, four 
monomers, one of which is shown here, 
form a tetramer. Each monomer has a 
hydrophilic pore at its center, which allows 
water molecules to cross the membrane 
in single file (see Figure 11-20 and 
Movie 11.6). The two short colored 
helices are buried at an interface formed 
by protein-protein interactions. The 
mechanism by which the channel allows 
the passage of water molecules is 
discussed in more detail in Chapter 11. 


Some β Barrels Form Large Channels 

Multipass membrane proteins that have their transmembrane segments arranged 
as β barrels rather than as α helices are comparatively rigid and therefore tend 
to form crystals readily when isolated. Thus, some of them were among the first 
Figure 10-22 Steps in the folding of a 
multipass transmembrane protein. 

When a newly synthesized transmembrane 
α helix is released into the lipid bilayer, it is 
initially surrounded by lipid molecules. As 
the protein folds, contacts between the 
helices displace some of the lipid molecules 
surrounding the helices. 


multipass membrane protein structures to be determined by x-ray crystallography. 
The number of β strands in a β barrel varies widely, from as few as 8 strands 
to as many as 22 (Figure 10-23). 

β-barrel proteins are abundant in the outer membranes of bacteria, mitochondria, 
and chloroplasts. Some are pore-forming proteins, which create water-filled 
channels that allow selected small hydrophilic molecules to cross the membrane. 
The porins are well-studied examples (example 3 in Figure 10-23C). Many porin 
barrels are formed from a 16-strand, antiparallel β sheet rolled up into a cylindrical 
structure. Polar amino acid side chains line the aqueous channel on the inside, 
while nonpolar side chains project from the outside of the barrel to interact with 
the hydrophobic core of the lipid bilayer. Loops of the polypeptide chain often 
protrude into the lumen of the channel, narrowing it so that only certain solutes 
can pass. Some porins are therefore highly selective: maltoporin, for example, 
preferentially allows maltose and maltose oligomers to cross the outer membrane 
of E. coli. 

The FepA protein is a more complex example of a β-barrel transport protein 
(Figure 10-23D). It transports iron ions across the bacterial outer membrane. It 
is constructed from 22 β strands, and a large globular domain completely fills the 
inside of the barrel. Iron ions bind to this domain, which by an unknown mechanism 
moves or changes its conformation to transfer the iron across the membrane. 

Not all β-barrel proteins are transport proteins. Some form smaller barrels that 
are completely filled by amino acid side chains that project into the center of the 
barrel. These proteins function as receptors or enzymes (Figure 10-23A and B); 
the barrel serves as a rigid anchor, which holds the protein in the membrane and 
orients the cytosolic loops that form binding sites for specific intracellular 
molecules. 

Most multipass membrane proteins in eukaryotic cells and in the bacterial 
plasma membrane are constructed from transmembrane α helices. The helices 


Figure 10-23 β barrels formed from 
different numbers of β strands. 

(A) The E. coli OmpA protein serves as a 
receptor for a bacterial virus. (B) The E. coli 
OMPLA protein is an enzyme (a lipase) that 
hydrolyzes lipid molecules. The amino acids 
that catalyze the enzymatic reaction (shown 
in red) protrude from the outside surface of 
the barrel. (C) A porin from the bacterium 
Rhodobacter capsulatus forms a water-filled 
pore across the outer membrane. 
The diameter of the channel is restricted 
by loops (shown in blue) that protrude into 
the channel. (D) The E. coli FepA protein 
transports iron ions. The inside of the barrel 
is completely filled by a globular protein 
domain (shown in blue) that contains an 
iron-binding site (not shown). 
Figure 10-24 A single-pass transmembrane protein. Note that the 
polypeptide chain traverses the lipid bilayer as a right-handed α helix and that 
the oligosaccharide chains and disulfide bonds are all on the noncytosolic 
surface of the membrane. The sulfhydryl groups in the cytosolic domain 
of the protein do not normally form disulfide bonds because the reducing 
environment in the cytosol maintains these groups in their reduced (-SH) 
form. 

can slide against each other, allowing conformational changes in the protein that 
can open and shut ion channels, transport solutes, or transduce extracellular 
signals into intracellular ones. In β-barrel proteins, by contrast, hydrogen bonds 
bind each β strand rigidly to its neighbors, making conformational changes within 
the wall of the barrel unlikely. 

Many Membrane Proteins Are Glycosylated 

Most transmembrane proteins in animal cells are glycosylated. As in glycolipids, 
the sugar residues are added in the lumen of the ER and the Golgi apparatus 
(discussed in Chapters 12 and 13). For this reason, the oligosaccharide chains 
are always present on the noncytosolic side of the membrane. Another important 
difference between proteins (or parts of proteins) on the two sides of the membrane 
results from the reducing environment of the cytosol. This environment 
decreases the likelihood that intrachain or interchain disulfide (S-S) bonds will 
form between cysteines on the cytosolic side of membranes. These bonds form 
on the noncytosolic side, where they can help stabilize either the folded structure 
of the polypeptide chain or its association with other polypeptide chains (Figure 
10-24). 

Because the extracellular parts of most plasma membrane proteins are glycosylated, 
carbohydrates extensively coat the surface of all eukaryotic cells. These 
carbohydrates occur as oligosaccharide chains covalently bound to membrane 
proteins (glycoproteins) and lipids (glycolipids). They also occur as the polysaccharide 
chains of integral membrane proteoglycan molecules. Proteoglycans, 
which consist of long polysaccharide chains linked covalently to a protein core, 
are found mainly outside the cell, as part of the extracellular matrix (discussed in 
Chapter 19). But, for some proteoglycans, the protein core either extends across 
the lipid bilayer or is attached to the bilayer by a glycosylphosphatidylinositol 
(GPI) anchor. 

The terms cell coat or glycocalyx are sometimes used to describe the carbohydrate-rich 
zone on the cell surface. This carbohydrate layer can be visualized by 
various stains, such as ruthenium red (Figure 10-25A), as well as by its affinity for 
carbohydrate-binding proteins called lectins, which can be labeled with a fluorescent 
dye or some other visible marker. Although most of the sugar groups are 
attached to intrinsic plasma membrane molecules, the carbohydrate layer also 
contains both glycoproteins and proteoglycans that have been secreted into the 
extracellular space and then adsorbed onto the cell surface (Figure 10-25B). Many 
of these adsorbed macromolecules are components of the extracellular matrix, so 
that the boundary between the plasma membrane and the extracellular matrix is 
often not sharply defined. One of the many functions of the carbohydrate layer is 
to protect cells against mechanical and chemical damage; it also keeps various 
other cells at a distance, preventing unwanted cell-cell interactions. 

The oligosaccharide side chains of glycoproteins and glycolipids are enormously 
diverse in their arrangement of sugars. Although they usually contain 
fewer than 15 sugars, the chains are often branched, and the sugars can be bonded 
together by various kinds of covalent linkages, unlike the amino acids in a 
polypeptide chain, which are all linked by identical peptide bonds. Even three sugars 
can be put together to form hundreds of different trisaccharides. Both the diversity 
and the exposed position of the oligosaccharides on the cell surface make them 
especially well suited to function in specific cell-recognition processes. As we 
discuss in Chapter 19, plasma-membrane-bound lectins that recognize specific 
oligosaccharides on cell-surface glycolipids and glycoproteins mediate a variety of 
Figure 10-25 The carbohydrate layer 
on the cell surface. (A) This electron 
micrograph of the surface of a lymphocyte 
stained with ruthenium red emphasizes the 
thick carbohydrate-rich layer surrounding 
the cell. (B) The carbohydrate layer is made 
up of the oligosaccharide side chains of 
membrane glycolipids and membrane 
glycoproteins and the polysaccharide 
chains on membrane proteoglycans. 
In addition, adsorbed glycoproteins, 
and adsorbed proteoglycans (not 
shown), contribute to the carbohydrate 
layer in many cells. Note that all of the 
carbohydrate is on the extracellular surface 
of the membrane. (A, courtesy of Audrey 
M. Glauert and G.M.W. Cook.) 



transient cell-cell adhesion processes, including those occurring in lymphocyte 
recirculation and inflammatory responses (see Figure 19-28). 
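
The claim that three sugars can form hundreds of trisaccharides can be checked with a simple count. The sketch below is a simplified model with loudly stated assumptions: three different, hypothetical hexopyranoses labeled A, B, and C; each glycosidic bond runs from the anomeric carbon (C1) of one sugar, in either the α or β configuration, to one of four free hydroxyls (C2, C3, C4, or C6) of the next; and the anomeric state of the free reducing end is ignored. Real sugars such as GalNAc, which lacks a C2 hydroxyl, would change the numbers.

```python
from itertools import permutations

ACCEPTOR_POSITIONS = (2, 3, 4, 6)  # free hydroxyls on a hexopyranose besides C1
ANOMERS = ("alpha", "beta")


def count_linear(sugars):
    """Linear chains: every ordering, and every position and anomer for each bond."""
    linkages_per_bond = len(ACCEPTOR_POSITIONS) * len(ANOMERS)
    n_orders = sum(1 for _ in permutations(sugars))
    return n_orders * linkages_per_bond ** (len(sugars) - 1)


def count_branched(sugars):
    """Branched trisaccharides: one core sugar carries the other two on different hydroxyls."""
    total = 0
    for core in sugars:
        donors = [s for s in sugars if s != core]
        # Ordered choice of distinct hydroxyls, one for each distinct donor sugar.
        position_choices = sum(1 for _ in permutations(ACCEPTOR_POSITIONS, len(donors)))
        total += position_choices * len(ANOMERS) ** len(donors)
    return total


sugars = ("A", "B", "C")  # three hypothetical, distinct hexoses
linear = count_linear(sugars)
branched = count_branched(sugars)
print("linear:", linear, "branched:", branched, "total:", linear + branched)
print("tripeptides from three different amino acids:", sum(1 for _ in permutations(sugars)))
```

Under these assumptions the same three building blocks that yield only six linear tripeptides yield several hundred distinct trisaccharides, which illustrates why oligosaccharides are so well suited to specific recognition.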

Membrane Proteins Can Be Solubilized and Purified in Detergents 

In general, only agents that disrupt hydrophobic associations and destroy the
lipid bilayer can solubilize membrane proteins. The most useful of these for the
membrane biochemist are detergents, which are small amphiphilic molecules of
variable structure (Movie 10.4). Detergents are much more soluble in water than
lipids. Their polar (hydrophilic) ends can be either charged (ionic), as in sodium
dodecyl sulfate (SDS), or uncharged (nonionic), as in octylglucoside and Triton
(Figure 10-26A). At low concentration, detergents are monomeric in solution,
but when their concentration is increased above a threshold, called the critical
micelle concentration (CMC), they aggregate to form micelles (Figure 10-26B-D).
Above the CMC, detergent molecules rapidly diffuse in and out of micelles, keeping
the concentration of monomer in the solution constant, no matter how many
micelles are present. Both the CMC and the average number of detergent molecules
in a micelle are characteristic properties of each detergent, but they also
depend on the temperature, pH, and salt concentration. Detergent solutions are
therefore complex systems and are difficult to study.
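
The statement that the monomer concentration stays constant above the CMC can be made concrete with a simple two-state approximation, sketched here in Python. The function name, the perfectly sharp threshold, and the numerical values are illustrative assumptions, not measured properties of any particular detergent; real detergents show a more gradual transition, and both the CMC and the number of molecules per micelle shift with temperature, pH, and salt concentration.

```python
def partition_detergent(total_mM, cmc_mM, aggregation_number):
    """Split a total detergent concentration into monomers and micelles.

    Idealized assumption: below the CMC every molecule is a monomer; above it,
    the monomer concentration is pinned at the CMC and the excess is micellar.
    Returns (monomer mM, detergent held in micelles mM, micelle particles mM).
    """
    monomer_mM = min(total_mM, cmc_mM)
    micellar_mM = max(total_mM - cmc_mM, 0.0)
    micelle_mM = micellar_mM / aggregation_number
    return monomer_mM, micellar_mM, micelle_mM


# Hypothetical detergent: CMC = 1 mM, 50 molecules per micelle.
for total in (0.5, 1.0, 3.0, 10.0):
    monomer, micellar, micelles = partition_detergent(total, 1.0, 50)
    print(f"total {total:5.1f} mM -> monomer {monomer:.1f} mM, "
          f"micelles {micelles:.3f} mM")
```

In this idealization, adding more detergent above the CMC changes only the number of micelles, which is why the monomer curve in Figure 10-26B is flat beyond the threshold.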

When mixed with membranes, the hydrophobic ends of detergents bind to the
hydrophobic regions of the membrane proteins, where they displace lipid molecules
with a collar of detergent molecules. Since the other end of the detergent
molecule is polar, this binding tends to bring the membrane proteins into solution
as detergent-protein complexes (Figure 10-27). Usually, some lipid molecules
also remain attached to the protein.

Figure 10-26 The structure and function of detergents. (A) Three commonly used detergents are sodium dodecyl sulfate
(SDS), an anionic detergent, and Triton X-100 and β-octylglucoside, two nonionic detergents. Triton X-100 is a mixture of
compounds in which the region in brackets is repeated between 9 and 10 times. The hydrophobic portion of each detergent is
shown in yellow, and the hydrophilic portion is shown in orange. (B) At low concentration, detergent molecules are monomeric
in solution. As their concentration is increased beyond the critical micelle concentration (CMC), some of the detergent molecules
form micelles. Note that the concentration of detergent monomer stays constant above the CMC. (C) Because they have both
polar and nonpolar ends, detergent molecules are amphiphilic; and because they are cone-shaped, they form micelles rather
than bilayers (see Figure 10-7). Detergent micelles are thought to have irregular shapes, and, due to packing constraints,
the hydrophobic tails are partially exposed to water. (D) The space-filling model shows the structure of a micelle composed
of 20 β-octylglucoside molecules, predicted by molecular dynamics calculations. The head groups are shown in red and the
hydrophobic tails in gray. (B, adapted from G. Gunnarsson, B. Jönsson and H. Wennerström, J. Phys. Chem. 84:3114-3121,
1980; C, from S. Bogusz, R.M. Venable and R.W. Pastor, J. Phys. Chem. B 104:5462-5470, 2000.)

Strong ionic detergents, such as SDS, can solubilize even the most hydrophobic
membrane proteins. This allows the proteins to be analyzed by SDS
polyacrylamide-gel electrophoresis (discussed in Chapter 8), a procedure that has
revolutionized the study of proteins. Such strong detergents, however, unfold (denature)
proteins by binding to their internal "hydrophobic cores," thereby rendering the
proteins inactive and unusable for functional studies. Nonetheless, proteins can
be readily separated and purified in their SDS-denatured form. In some cases,
removal of the SDS allows the purified protein to renature, with recovery of
functional activity.

Many membrane proteins can be solubilized and then purified in an active
form by the use of mild detergents. These detergents cover the hydrophobic
regions on membrane-spanning segments that become exposed after lipid
removal but do not unfold the protein. If the detergent concentration of a solution
of solubilized membrane proteins is reduced (by dilution, for example), membrane
proteins do not remain soluble. In the presence of an excess of phospholipid
molecules in such a solution, however, membrane proteins incorporate into
small liposomes that form spontaneously. In this way, functionally active membrane
protein systems can be reconstituted from purified components, providing
a powerful means of analyzing the activities of membrane transporters, ion channels,
signaling receptors, and so on (Figure 10-28). Such functional reconstitution,
for example, provided proof for the hypothesis that the enzymes that make
ATP (ATP synthases) use H+ gradients in mitochondrial, chloroplast, and bacterial
membranes to produce ATP.

Figure 10-27 Solubilizing a membrane
protein with a mild nonionic detergent.
The detergent disrupts the lipid bilayer
and brings the protein into solution as
protein-lipid-detergent complexes. The
phospholipids in the membrane are also
solubilized by the detergent, as
lipid-detergent micelles.

Figure 10-28 The use of mild nonionic
detergents for solubilizing, purifying,
and reconstituting functional membrane
protein systems. In this example,
functional Na+-K+ pump molecules are
purified and incorporated into phospholipid
vesicles. This pump is present in the
plasma membrane of most animal cells,
where it uses the energy of ATP hydrolysis
to pump Na+ out of the cell and K+ in, as
discussed in Chapter 11.

Figure 10-29 Model of a membrane
protein reconstituted into a
nanodisc. When detergent is removed
from a solution containing a multipass
membrane protein, lipids, and a protein
subunit of the high-density lipoprotein
(HDL), the membrane protein becomes
embedded in a small patch of lipid
bilayer, which is surrounded by a belt of
the HDL protein. In such nanodiscs, the
hydrophobic edges of the bilayer patch are
shielded by the protein belt, which renders
the assembly water-soluble.

Membrane proteins can also be reconstituted from detergent solution into
nanodiscs, which are small, uniformly sized patches of membrane that are
surrounded by a belt of protein, which covers the exposed edge of the bilayer to keep
the patch in solution (Figure 10-29). The belt is derived from high-density
lipoproteins (HDL), which keep lipids soluble for transport in the blood. In nanodiscs
the membrane protein of interest can be studied in its native lipid environment
and is experimentally accessible from both sides of the bilayer, which is useful,
for example, for ligand-binding experiments. Proteins contained in nanodiscs can
also be analyzed by single-particle electron microscopy techniques to determine
their structure. By this rapidly improving technique (discussed in Chapter 9), the
structure of a membrane protein can be determined to high resolution without
requiring the protein of interest to crystallize into a regular lattice, which is
often hard to achieve for membrane proteins.

Detergents have also played a crucial part in the purification and crystallization
of membrane proteins. The development of new detergents and new expression
systems that produce large quantities of membrane proteins from cDNA
clones has led to a rapid increase in the number of three-dimensional structures
of membrane proteins and protein complexes that are known, although they are
still few compared to the known structures of water-soluble proteins and protein
complexes.

Bacteriorhodopsin Is a Light-driven Proton (H+) Pump That
Traverses the Lipid Bilayer as Seven α Helices

In Chapter 11, we consider how multipass transmembrane proteins mediate
the selective transport of small hydrophilic molecules across cell membranes.
But a detailed understanding of how such a membrane transport protein works
requires precise information about its three-dimensional structure in the bilayer.
Bacteriorhodopsin was the first membrane transport protein whose structure was
determined, and it has remained the prototype of many multipass membrane
proteins with a similar structure.

The "purple membrane" of the archaeon Halobacterium salinarum is a
specialized patch in the plasma membrane that contains a single species of protein
molecule, bacteriorhodopsin (Figure 10-30A). The protein functions as
a light-activated H+ pump that transfers H+ out of the archaeal cell. Because
the bacteriorhodopsin molecules are tightly packed and arranged as a planar
two-dimensional crystal (Figure 10-30B and C), it was possible to determine
their three-dimensional structure by combining electron microscopy and electron
diffraction analysis (a procedure called electron crystallography, which we
mentioned earlier). This method has provided the first structural views of many
membrane proteins that were found to be difficult to crystallize from detergent
solutions. For bacteriorhodopsin, the structure was later confirmed and extended
to very high resolution by x-ray crystallography.

Each bacteriorhodopsin molecule is folded into seven closely packed
transmembrane α helices and contains a single light-absorbing group, or chromophore
(in this case, retinal), which gives the protein its purple color. Retinal is vitamin A
in its aldehyde form and is identical to the chromophore found in rhodopsin of
the photoreceptor cells of the vertebrate eye (discussed in Chapter 15). Retinal is
covalently linked to a lysine side chain of the bacteriorhodopsin protein. When
activated by a single photon of light, the excited chromophore changes its shape
and causes a series of small conformational changes in the protein, resulting in
the transfer of one H+ from the inside to the outside of the cell (Figure 10-31A). In
bright light, each bacteriorhodopsin molecule can pump several hundred protons
per second. The light-driven proton transfer establishes an H+ gradient across
the plasma membrane, which in turn drives the production of ATP by a second
protein in the cell's plasma membrane. The energy stored in the H+ gradient also
drives other energy-requiring processes in the cell. Thus, bacteriorhodopsin
converts solar energy into an H+ gradient, which provides energy to the archaeal cell.

The high-resolution crystal structure of bacteriorhodopsin reveals many
lipid molecules bound in specific places on the protein surface (Figure 10-31B).
Interactions with specific lipids are thought to help stabilize many membrane
proteins, which work best and sometimes crystallize more readily if some of the
lipids remain bound during detergent extraction, or if specific lipids are added
back to the proteins in detergent solutions. The specificity of these lipid-protein
interactions helps explain why eukaryotic membranes contain such a variety of
lipids, with head groups that differ in size, shape, and charge. We can think of the
membrane lipids as constituting a two-dimensional solvent for the proteins in the
membrane, just as water constitutes a three-dimensional solvent for proteins in
an aqueous solution: some membrane proteins can function only in the presence
of specific lipid head groups, just as many enzymes in aqueous solution require a
particular ion for activity.

Figure 10-30 Patches of purple
membrane, which contain
bacteriorhodopsin in the archaeon
Halobacterium salinarum. (A) These
archaea live in saltwater pools, where
they are exposed to sunlight. They have
evolved a variety of light-activated proteins,
including bacteriorhodopsin, which is a
light-activated H+ pump in the plasma
membrane. (B) The bacteriorhodopsin
molecules in the purple membrane patches
are tightly packed into two-dimensional
crystalline arrays. (C) Details of the
molecular surface visualized by atomic
force microscopy. With this technique,
individual bacteriorhodopsin molecules can
be seen. (D) Outline of the approximate
location of the bacteriorhodopsin monomer
and the individual α helices in the image
shown in (C). (B-C, courtesy of Dieter
Oesterhelt; D, PDB code: 2BRD.)

Figure 10-31 The three-dimensional
structure of a bacteriorhodopsin
molecule. (Movie 10.5) (A) The
polypeptide chain crosses the lipid bilayer
seven times as α helices. The location of
the retinal chromophore (purple) and the
probable pathway taken by H+ during the
light-activated pumping cycle are shown.
The first and key step is the passing of an
H+ from the chromophore to the side chain
of aspartic acid 85 (red, located next to the
chromophore) that occurs upon absorption
of a photon by the chromophore.
Subsequently, other H+ transfers (in the
numerical order indicated and utilizing the
hydrophilic amino acid side chains that line
a path through the membrane) complete
the pumping cycle and return the enzyme
to its starting state. Color code: glutamic
acid (orange), aspartic acid (red), arginine
(blue). (B) The high-resolution crystal
structure of bacteriorhodopsin shows
many lipid molecules (yellow with red head
groups) that are tightly bound to specific
places on the surface of the protein.
(A, adapted from H. Luecke et al., Science
286:255-261, 1999. With permission from
AAAS; B, from H. Luecke et al., J. Mol.
Biol. 291:899-911, 1999. With permission
from Academic Press.)

Bacteriorhodopsin is a member of a large superfamily of membrane proteins
with similar structures but different functions. For example, rhodopsin in rod cells
of the vertebrate retina and many cell-surface receptor proteins that bind
extracellular signal molecules are also built from seven transmembrane α helices. These
proteins function as signal transducers rather than as transporters: each responds
to an extracellular signal by activating a GTP-binding protein (G protein) inside the
cell and they are therefore called G-protein-coupled receptors (GPCRs), as we
discuss in Chapter 15 (see Figure 15-6B). Although the structures of
bacteriorhodopsins and GPCRs are strikingly similar, they show no sequence similarity and thus
probably belong to two evolutionarily distant branches of an ancient protein family.
A related class of membrane proteins, the channelrhodopsins that green algae use
to detect light, form ion channels when they absorb a photon. When engineered so
that they are expressed in animal brains, these proteins have become invaluable
tools in neurobiology because they allow specific neurons to be stimulated
experimentally by shining light on them, as we discuss in Chapter 11 (Figure 11-32).

Membrane Proteins Often Function as Large Complexes 

Many membrane proteins function as part of multicomponent complexes,
several of which have been studied by x-ray crystallography. One is a bacterial
photosynthetic reaction center, which was the first membrane protein complex to be
crystallized and analyzed by x-ray diffraction. In Chapter 14, we discuss how such
photosynthetic complexes function to capture light energy and use it to pump
H+ across the membrane. Many of the membrane protein complexes involved in
photosynthesis, proton pumping, and electron transport are even larger than the
photosynthetic reaction center. The enormous photosystem II complex from
cyanobacteria, for example, contains 19 protein subunits and well over 60
transmembrane helices (see Figure 14-49). Membrane proteins are often arranged in large
complexes, not only for harvesting various forms of energy, but also for
transducing extracellular signals into intracellular ones (discussed in Chapter 15).

Many Membrane Proteins Diffuse in the Plane of the Membrane 

Like most membrane lipids, membrane proteins do not tumble (flip-flop) across
the lipid bilayer, but they do rotate about an axis perpendicular to the plane of the
bilayer (rotational diffusion). In addition, many membrane proteins are able to
move laterally within the membrane (lateral diffusion). An experiment in which
mouse cells were artificially fused with human cells to produce hybrid cells
(heterokaryons) provided the first direct evidence that some plasma membrane
proteins are mobile in the plane of the membrane. Two differently labeled antibodies
were used to distinguish selected mouse and human plasma membrane proteins.
Although at first the mouse and human proteins were confined to their own
halves of the newly formed heterokaryon, the two sets of proteins diffused and
mixed over the entire cell surface in about half an hour (Figure 10-32).

The lateral diffusion rates of membrane proteins can be measured by using
the technique of fluorescence recovery after photobleaching (FRAP). The method
usually involves marking the membrane protein of interest with a specific
fluorescent group. This can be done either with a fluorescent ligand such as a
fluorophore-labeled antibody that binds to the protein or with recombinant DNA
technology to express the protein fused to a fluorescent protein such as green
fluorescent protein (GFP) (discussed in Chapter 9). The fluorescent group is then
bleached in a small area of membrane by a laser beam, and the time taken for
adjacent membrane proteins carrying unbleached ligand or GFP to diffuse into
the bleached area is measured (Figure 10-33). From FRAP measurements, we can
estimate the diffusion coefficient for the marked cell-surface protein. The values
of the diffusion coefficients for different membrane proteins in different cells are
highly variable, because interactions with other proteins impede the diffusion
of the proteins to varying degrees. Measurements of proteins that are minimally
impeded in this way indicate that cell membranes have a viscosity comparable to
that of olive oil.
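
A rough version of such an estimate can be sketched as follows. The sketch assumes a circular, uniformly bleached spot, free two-dimensional diffusion, and a recovery that has reached its plateau by the last time point; under those assumptions a commonly used approximation (attributed to Soumpasis) is D ≈ 0.224 r²/t½, where r is the radius of the bleached spot and t½ is the half-time of recovery. The function names and the sample values are hypothetical, and real analyses also correct for incomplete recovery and for bleaching during imaging.

```python
def recovery_half_time(times, intensities):
    """Time at which fluorescence has recovered halfway from the first
    post-bleach value to the final plateau (linear interpolation).

    Assumes the last intensity value represents the plateau.
    """
    start, plateau = intensities[0], intensities[-1]
    half = start + 0.5 * (plateau - start)
    for i in range(len(times) - 1):
        f1, f2 = intensities[i], intensities[i + 1]
        if f1 <= half <= f2 and f2 > f1:
            fraction = (half - f1) / (f2 - f1)
            return times[i] + fraction * (times[i + 1] - times[i])
    raise ValueError("recovery curve never crosses its half-maximum")


def diffusion_coefficient(spot_radius_um, half_time_s):
    """Approximate D (in um^2/s) for a uniform circular bleach spot."""
    return 0.224 * spot_radius_um ** 2 / half_time_s


# Hypothetical recovery curve, sampled once per second after the bleach.
times = [0, 1, 2, 3, 4]
intensities = [0.2, 0.4, 0.6, 0.7, 0.8]
t_half = recovery_half_time(times, intensities)
print(t_half, diffusion_coefficient(1.0, t_half))
```

The inverse dependence on t½ is the quantitative form of the statement in Figure 10-33 that a greater diffusion coefficient gives a faster recovery.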

One drawback to the FRAP technique is that it monitors the movement of
large populations of molecules in a relatively large area of membrane; one cannot
follow individual protein molecules. If a protein fails to migrate into a bleached
area, for example, one cannot tell whether the molecule is truly immobile or just
restricted in its movement to a very small region of membrane, perhaps by
cytoskeletal proteins. Single-particle tracking techniques overcome this problem by
labeling individual membrane molecules with antibodies coupled to fluorescent
dyes or tiny gold particles and tracking their movement by video microscopy.
Using single-particle tracking, one can record the diffusion path of a single
membrane protein molecule over time. Results from all of these techniques indicate
that plasma membrane proteins differ widely in their diffusion characteristics, as
we now discuss.
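
One common way to analyze a recorded single-particle path is to compute its mean squared displacement (MSD) as a function of time lag. For free diffusion in two dimensions, MSD(t) = 4Dt, whereas for a molecule confined to a small region the MSD levels off at long lags. The sketch below assumes positions sampled at a fixed frame interval; the function names are ours, chosen for illustration.

```python
def mean_squared_displacement(track, lag):
    """Average squared distance between positions `lag` frames apart.

    `track` is a list of (x, y) positions sampled at a fixed frame interval.
    """
    if lag < 1:
        raise ValueError("lag must be at least 1 frame")
    pairs = zip(track, track[lag:])
    squared = [(x2 - x1) ** 2 + (y2 - y1) ** 2 for (x1, y1), (x2, y2) in pairs]
    if not squared:
        raise ValueError("lag is longer than the track")
    return sum(squared) / len(squared)


def diffusion_coefficient_2d(track, lag, frame_interval_s):
    """Estimate D from MSD = 4 * D * t; valid only for free 2D diffusion."""
    msd = mean_squared_displacement(track, lag)
    return msd / (4.0 * lag * frame_interval_s)
```

Comparing the MSD at short and long lags is one way to tell a truly immobile molecule from one that moves freely within a small region, the distinction that FRAP alone cannot make.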


Figure 10-32 An experiment 
demonstrating the diffusion of proteins 
in the plasma membrane of mouse- 
human hybrid cells. In this experiment, 
a mouse and a human cell were fused to 
create a hybrid cell, which was then stained 
with two fluorescently labeled antibodies. 
One antibody (labeled with a green dye) 
detects mouse plasma membrane proteins, 
the other antibody (labeled with a red dye) 
detects human plasma membrane proteins. 
When cells were stained immediately 
after fusion, mouse and human plasma 
membrane proteins are still found in the 
membrane domains originating from the 
mouse and human cell, respectively. After a 
short time, however, the plasma membrane 
proteins diffuse over the entire cell surface 
and completely intermix. (From L.D. Frye
and M. Edidin, J. Cell Sci. 7:319-335,
1970. With permission from The Company
of Biologists.)

Figure 10-33 Measuring the rate
of lateral diffusion of a membrane
protein by fluorescence recovery after
photobleaching. A specific protein of
interest can be expressed as a fusion
protein with green fluorescent protein
(GFP), which is intrinsically fluorescent.
The fluorescent molecules are bleached
in a small area using a laser beam. The
fluorescence intensity recovers as the
bleached molecules diffuse away and
unbleached molecules diffuse into the
irradiated area (shown here in side and
top views). The diffusion coefficient
is calculated from a graph of the rate
of recovery: the greater the diffusion
coefficient of the membrane protein, the
faster the recovery (Movie 10.6).

Cells Can Confine Proteins and Lipids to Specific Domains Within
a Membrane

The recognition that biological membranes are two-dimensional fluids was a
major advance in understanding membrane structure and function. It has become
clear, however, that the picture of a membrane as a lipid sea in which all proteins
float freely is greatly oversimplified. Most cells confine membrane proteins to
specific regions in a continuous lipid bilayer. We have already discussed how
bacteriorhodopsin molecules in the purple membrane of Halobacterium assemble
into large two-dimensional crystals, in which individual protein molecules are
relatively fixed in relationship to one another (see Figure 10-30). ATP synthase
complexes in the inner mitochondrial membrane also associate into long double
rows, as we discuss in Chapter 14 (see Figure 14-32). Large aggregates of this kind
diffuse very slowly.

In epithelial cells, such as those that line the gut or the tubules of the kidney,
certain plasma membrane enzymes and transport proteins are confined to the
apical surface of the cells, whereas others are confined to the basal and lateral
surfaces (Figure 10-34). This asymmetric distribution of membrane proteins is
often essential for the function of the epithelium, as we discuss in Chapter 11 (see
Figure 11-11). The lipid compositions of these two membrane domains are also
different, demonstrating that epithelial cells can prevent the diffusion of lipid as
well as protein molecules between the domains. The barriers set up by a specific
type of intercellular junction (called a tight junction, discussed in Chapter 19;
see Figure 19-18) maintain the separation of both protein and lipid molecules.
Clearly, the membrane proteins that form these intercellular junctions cannot be
allowed to diffuse laterally in the interacting membranes.

A cell can also create membrane domains without using intercellular
junctions. As we already discussed, regulated protein-protein interactions in
membranes are thought to create nanoscale raft domains that function in signaling
and membrane trafficking. A more extreme example is seen in the mammalian
spermatozoon, a single cell that consists of several structurally and functionally
distinct parts covered by a continuous plasma membrane. When a sperm cell is
examined by immunofluorescence microscopy with a variety of antibodies, each
of which reacts with a specific cell-surface molecule, the plasma membrane is
found to consist of at least three distinct domains (Figure 10-35). Some of the
membrane molecules are able to diffuse freely within the confines of their own
domain. The molecular nature of the "fence" that prevents the molecules from
leaving their domain is not known. Many other cells have similar membrane
fences that confine membrane protein diffusion to certain membrane domains.
The plasma membrane of nerve cells, for example, contains a domain enclosing
the cell body and dendrites, and another enclosing the axon; it is thought that a
belt of actin filaments tightly associated with the plasma membrane at the
cell-body-axon junction forms part of the barrier.

Figure 10-34 How membrane molecules can be restricted to a particular membrane domain.
In this drawing of an epithelial cell, protein A (in the apical domain of the plasma membrane)
and protein B (in the basal and lateral domains) can diffuse laterally in their own domains but are
prevented from entering the other domain, at least partly by the specialized cell-cell junction called
a tight junction. Lipid molecules in the outer (extracellular) monolayer of the plasma membrane
are likewise unable to diffuse between the two domains; lipids in the inner (cytosolic) monolayer,
however, are able to do so (not shown). The basal lamina is a thin mat of extracellular matrix that
separates epithelial sheets from other tissues (discussed in Chapter 19).

Figure 10-35 Three domains in the
plasma membrane of a guinea pig
sperm. (A) A drawing of a guinea
pig sperm. (B-D) In the three pairs of
micrographs, phase-contrast micrographs
are on the left, and the same cell is shown
with cell-surface immunofluorescence
staining on the right. Different monoclonal
antibodies selectively label cell-surface
molecules on (B) the anterior head,
(C) the posterior head, and (D) the tail.
(Micrographs courtesy of Selena Carroll
and Diana Myles.)

Figure 10-36 shows four common ways of immobilizing specific membrane
proteins through protein-protein interactions.





The Cortical Cytoskeleton Gives Membranes Mechanical Strength 
and Restricts Membrane Protein Diffusion 

As shown in Figure 10-36B and C, a common way in which a cell restricts the
lateral mobility of specific membrane proteins is to tether them to macromolecular
assemblies on either side of the membrane. The characteristic biconcave shape of
a red blood cell (Figure 10-37), for example, results from interactions of its plasma
membrane proteins with an underlying cytoskeleton, which consists mainly of a
meshwork of the filamentous protein spectrin. Spectrin is a long, thin, flexible
rod about 100 nm in length. As the principal component of the red cell
cytoskeleton, it maintains the structural integrity and shape of the plasma membrane,
which is the red cell's only membrane, as the cell has no nucleus or other
organelles. The spectrin cytoskeleton is riveted to the membrane through various
membrane proteins. The final result is a deformable, netlike meshwork that covers
the entire cytosolic surface of the red cell membrane (Figure 10-38). This
spectrin-based cytoskeleton enables the red cell to withstand the stress on its
membrane as it is forced through narrow capillaries. Mice and humans with genetic
abnormalities in spectrin are anemic and have red cells that are spherical (instead
of concave) and fragile; the severity of the anemia increases with the degree of
spectrin deficiency.




Figure 10-36 Four ways of restricting
the lateral mobility of specific plasma
membrane proteins. (A) The proteins can
self-assemble into large aggregates (as
seen for bacteriorhodopsin in the purple
membrane of Halobacterium salinarum);
they can be tethered by interactions with
assemblies of macromolecules (B) outside
or (C) inside the cell; or (D) they can interact
with proteins on the surface of another cell.

Figure 10-37 A scanning electron
micrograph of human red blood cells.
The cells have a biconcave shape and
lack a nucleus and other organelles
(Movie 10.7). (Courtesy of Bernadette
Chailley.)


An analogous but much more elaborate and highly dynamic cytoskeletal
network exists beneath the plasma membrane of most other cells in our body. This
network, which constitutes the cortex of the cell, is rich in actin filaments, which
are attached to the plasma membrane in numerous ways. The dynamic
remodeling of the cortical actin network provides a driving force for many essential
cell functions, including cell movement, endocytosis, and the formation of
transient, mobile plasma membrane structures such as filopodia and lamellipodia
(discussed in Chapter 16). The cortex of nucleated cells also contains proteins that
are structurally homologous to spectrin and the other components of the red cell
cytoskeleton. We discuss the cortical cytoskeleton in nucleated cells and its
interactions with the plasma membrane in Chapter 16.

Figure 10-38 The spectrin-based cytoskeleton on the cytosolic side of the human
red blood cell plasma membrane. (A) The arrangement shown in the drawing has been
deduced mainly from studies on the interactions of purified proteins in vitro. Spectrin
heterodimers (enlarged in the drawing on the right) are linked together into a netlike
meshwork by "junctional complexes" (enlarged in the drawing on the left). Each spectrin
heterodimer consists of two antiparallel, loosely intertwined, flexible polypeptide chains called
α and β. The two spectrin chains are attached noncovalently to each other at multiple points,
including at both ends. Both the α and β chains are composed largely of repeating domains.
Two spectrin heterodimers join end-to-end to form tetramers.
The junctional complexes are composed of short actin filaments (containing 13 actin
monomers) and these proteins: band 4.1, adducin, and a tropomyosin molecule that
probably determines the length of the actin filaments. The cytoskeleton is linked to the
membrane through two transmembrane proteins: a multipass protein called band 3 and a
single-pass protein called glycophorin. The spectrin tetramers bind to some band 3 proteins
via ankyrin molecules, and to glycophorin and band 3 (not shown) via band 4.1 proteins.
(B) The electron micrograph shows the cytoskeleton on the cytosolic side of a red
blood cell membrane after fixation and negative staining. The spectrin meshwork has been
purposely stretched out to allow the details of its structure to be seen. In a normal cell, the
meshwork shown would be much more crowded and occupy only about one-tenth of this
area. (B, courtesy of T. Byers and D. Branton, Proc. Natl Acad. Sci. USA 82:6153-6157,
1985. With permission from The National Academy of Sciences.)

The cortical cytoskeletal network restricts the diffusion not only of the plasma
membrane proteins that are directly anchored to it. Because the cytoskeletal
filaments are often closely apposed to the cytosolic surface of the plasma membrane,
they can form mechanical barriers that obstruct the free diffusion of proteins in
the membrane. These barriers partition the membrane into small domains, or
corrals (Figure 10-39A), which can be either permanent, as in the sperm (see
Figure 10-35), or transient. The barriers can be detected when the diffusion of
individual membrane proteins is followed by high-speed, single-particle tracking.
The proteins diffuse rapidly but are confined within an individual corral (Figure
10-39B); occasionally, however, thermal motions cause a few cortical filaments
to detach transiently from the membrane, allowing the protein to escape into an
adjacent corral.
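
The confined-then-hopping behavior described above can be imitated with a toy random walk. Everything in this sketch is a simplifying assumption rather than a model of any real membrane: the protein moves on a square lattice, the corrals are identical square tiles, and each attempt to cross a corral boundary succeeds with a single fixed probability that stands in for the transient detachment of cortical filaments.

```python
import random


def simulate_hop_diffusion(steps, corral_size, hop_probability, seed=0):
    """Random walk on a lattice tiled by square corrals.

    A step that would cross a corral boundary is allowed only with
    probability `hop_probability`; otherwise the particle stays put.
    Returns the list of visited (x, y) lattice positions.
    """
    rng = random.Random(seed)
    x = y = corral_size // 2          # start inside corral (0, 0)
    path = [(x, y)]
    moves = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    for _ in range(steps):
        dx, dy = rng.choice(moves)
        nx, ny = x + dx, y + dy
        old_corral = (x // corral_size, y // corral_size)
        new_corral = (nx // corral_size, ny // corral_size)
        if new_corral != old_corral and rng.random() >= hop_probability:
            nx, ny = x, y             # blocked by the barrier
        x, y = nx, ny
        path.append((x, y))
    return path


def corrals_visited(path, corral_size):
    """Set of corral indices that the path passed through."""
    return {(x // corral_size, y // corral_size) for x, y in path}
```

With `hop_probability` set to 0 the particle never leaves its starting corral, and with a value of 1 the barriers vanish and the walk is unrestricted; small intermediate values give long periods of confinement punctuated by occasional escapes, qualitatively like the trace in Figure 10-39B.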

The extent to which a transmembrane protein is confined within a corral depends
on its association with other proteins and the size of its cytoplasmic domain;
proteins with a large cytosolic domain will have a harder time passing through
cytoskeletal barriers. When a cell-surface receptor binds its extracellular signal
molecules, for example, large protein complexes build up on the cytosolic domain of
the receptor, making it more difficult for the receptor to escape from its corral. It is
thought that corralling helps concentrate such signaling complexes, increasing the
speed and efficiency of the signaling process (discussed in Chapter 15).


Figure 10-39 Corralling plasma 
membrane proteins by cortical 
cytoskeletal filaments. (A) The filaments 
are thought to provide diffusion barriers 
that divide the membrane into small 
domains, or corrals. (B) High-speed, 
single-particle tracking was used to follow
the paths of single fluorescently labeled
membrane proteins of one type over time.
The trace shows that the individual protein
molecules (the movement of each shown
in a different color) diffuse within a tightly
delimited membrane domain and only 
infrequently escape into a neighboring 
domain. (Adapted from A. Kusumi et 
al., Annu. Rev. Biophys. Biomol. Struct. 
34:351-378, 2005. With permission from 
Annual Reviews.) 


Membrane-bending Proteins Deform Bilayers 

Cell membranes assume many different shapes, as illustrated by the elaborate
and varied structures of cell-surface protrusions and membrane-enclosed
organelles in eukaryotic cells. Flat sheets, narrow tubules, round vesicles, fenestrated
sheets, and pitta bread-shaped cisternae are all part of the repertoire: often, a
variety of shapes will be present in different regions of the same continuous bilayer.
Membrane shape is controlled dynamically, as many essential cell processes,
including vesicle budding, cell movement, and cell division, require elaborate
transient membrane deformations. In many cases, membrane shape is
influenced by dynamic pushing and pulling forces exerted by cytoskeletal or
extracellular structures (as we discuss in Chapters 13 and 16). A crucial part in producing
these deformations is played by membrane-bending proteins, which control
local membrane curvature. Often, cytoskeletal dynamics and
membrane-bending-protein forces work together. Membrane-bending proteins attach to specific
membrane regions as needed and act by one or more of three principal
mechanisms:

1. Some insert hydrophobic protein domains or attached lipid anchors into
one of the leaflets of a lipid bilayer. Increasing the area of only one leaflet
causes the membrane to bend (Figure 10-40B). The proteins that shape the
convoluted network of narrow ER tubules are thought to work in this way.

2. Some membrane-bending proteins form rigid scaffolds that deform the
membrane or stabilize an already bent membrane (Figure 10-40C). The
coat proteins that shape the budding vesicles in intracellular transport fall
into this class.

3. Some membrane-bending proteins cause particular membrane lipids
to cluster together, thereby inducing membrane curvature. The ability of
a lipid to induce positive or negative membrane curvature is determined
by the relative cross-sectional areas of its head group and its hydrocarbon
tails. For example, the large head group of phosphoinositides makes these
lipid molecules wedge-shaped, and their accumulation in a domain of one
leaflet of a bilayer therefore induces positive curvature (Figure 10-40D). By
contrast, phospholipases that remove lipid head groups produce inversely
shaped lipid molecules that induce negative curvature.

Figure 10-40 Three ways in which membrane-bending proteins shape membranes.
Lipid bilayers are blue and proteins are green. (A) Bilayer without protein
bound. (B) A hydrophobic region of the protein can insert as a wedge into one
monolayer to pry lipid head groups apart. Such regions can either be
amphiphilic helices as shown or hydrophobic hairpins. (C) The curved surface of
the protein can bind to lipid head groups and deform the membrane or stabilize
its curvature. (D) A protein can bind to and cluster lipids that have large
head groups and thereby bend the membrane. (Adapted from W.A. Prinz and J.E.
Hinshaw, Crit. Rev. Biochem. Mol. Biol. 44:278-291, 2009.)

Often, different membrane-bending proteins collaborate to achieve a particular 
curvature, as in shaping a budding transport vesicle, as we discuss in Chapter 13. 


Summary 

Whereas the lipid bilayer determines the basic structure of biological
membranes, proteins are responsible for most membrane functions, serving as
specific receptors, enzymes, transporters, and so on. Transmembrane proteins
extend across the lipid bilayer. Some of these membrane proteins are
single-pass proteins, in which the polypeptide chain crosses the bilayer as a
single α helix. Others are multipass proteins, in which the polypeptide chain
crosses the bilayer multiple times, either as a series of α helices or as a
β sheet rolled up into the shape of a barrel. All proteins responsible for the
transport of ions and other small water-soluble molecules through the membrane
are multipass proteins. Some membrane proteins do not span the bilayer but
instead are attached to either side of the membrane: some are attached to the
cytosolic side by an amphipathic α helix on the protein surface or by the
covalent attachment of one or more lipid chains, others are attached to the
noncytosolic side by a GPI anchor. Some membrane-associated proteins are bound
by noncovalent interactions with transmembrane proteins. In the plasma membrane
of all eukaryotic cells, most of the proteins exposed on the cell surface and
some of the lipid molecules in the outer lipid monolayer have oligosaccharide
chains covalently attached to them. Like the lipid molecules in the bilayer,
many membrane proteins are able to diffuse rapidly in the plane of the
membrane. However, cells have ways of immobilizing specific membrane proteins,
as well as ways of confining both membrane protein and lipid molecules to
particular domains in a continuous lipid bilayer. The dynamic association of
membrane-bending proteins confers on membranes their characteristic
three-dimensional shapes.


WHAT WE DON'T KNOW


• Given the highly complex lipid composition of cell membranes, what are the
variations within different organelle membranes in an animal cell? What are the
functional consequences of these differences, and what are the roles of the
minor lipid species?

• Is the biophysical tendency of lipids to partition into separate phases
within a lipid bilayer functionally utilized in cell membranes? If so, how is
it regulated and what membrane functions does it control?

• How commonly do specific lipid molecules associate with membrane proteins to
regulate their function?

• Given that the structure of only a tiny fraction of all membrane proteins has
been determined, what new principles of membrane protein structure remain to be
discovered?


PROBLEMS


Which statements are true? Explain why or why not. 

10-1 Although lipid molecules are free to diffuse in the plane of the bilayer,
they cannot flip-flop across the bilayer unless enzyme catalysts called
phospholipid translocators are present in the membrane.

10-2 Whereas all the carbohydrate in the plasma membrane faces outward on the
external surface of the cell, all the carbohydrate on internal membranes faces
toward the cytosol.

10-3 Although membrane domains with different protein compositions are well
known, there are at present no examples of membrane domains that differ in
lipid composition.

Discuss the following problems. 

10-4 When a lipid bilayer is torn, why does it not seal 
itself by forming a “hemi-micelle” cap at the edges, as 
shown in Figure Q10-1? 


Figure Q10-1 A torn lipid bilayer sealed with a hypothetical “hemi-micelle”
cap (Problem 10-4). [Diagram labels: tear in bilayer; seal with hemi-micelle
cap.]

10-5 Margarine is made from vegetable oil by a chemical process. Do you suppose
this process converts saturated fatty acids to unsaturated ones, or vice versa?
Explain your answer.

10-6 If a lipid raft is typically 70 nm in diameter and each lipid molecule has
a diameter of 0.5 nm, about how many lipid molecules would there be in a lipid
raft composed entirely of lipid? At a ratio of 50 lipid molecules per protein
molecule (50% protein by mass), how many proteins would be in a typical raft?
(Neglect the loss of lipid from the raft that would be required to accommodate
the protein.)

10-7 Monomeric single-pass transmembrane proteins span a membrane with a single
α helix that has characteristic chemical properties in the region of the
bilayer. Which of the three 20-amino-acid sequences listed below is the most
likely candidate for such a transmembrane segment? Explain the reasons for your
choice. (See back of book for one-letter amino acid code; FAMILY VW is a
convenient mnemonic for hydrophobic amino acids.)

A. ITLIYFGVMAGVIGTILLIS

B. ITPIYFGPMAGVIGTPLLIS

C. ITEIYFGRMAGVIGTDLLIS

10-8 You are studying the binding of proteins to the cytoplasmic face of
cultured neuroblastoma cells and have found a method that gives a good yield of
inside-out vesicles from the plasma membrane. Unfortunately, your preparations
are contaminated with variable amounts of right-side-out vesicles. Nothing you
have tried avoids this problem. A friend suggests that you pass your vesicles
over an affinity column made of lectin coupled to solid beads. What is the
point of your friend's suggestion?

10-9 Glycophorin, a protein in the plasma membrane of the red blood cell,
normally exists as a homodimer that is held together entirely by interactions
between its transmembrane domains. Since transmembrane domains are hydrophobic,
how is it that they can associate with one another so specifically?

10-10 Three mechanisms by which membrane-binding proteins bend a membrane are
illustrated in Figure Q10-2A, B, and C. As shown, each of these cytosolic
membrane-bending proteins would induce an invagination of the plasma membrane.
Could similar kinds of cytosolic proteins induce a protrusion of the plasma
membrane (Figure Q10-2D)? Which ones? Explain how they might work.



Figure Q10-2 Bending of the plasma membrane by cytosolic proteins
(Problem 10-10). (A) Insertion of a protein “finger” into the cytosolic
leaflet of the membrane. (B) Binding of lipids to the curved surface of
a membrane-binding protein. (C) Binding of membrane proteins to
membrane lipids with large head groups. (D) A segment of the plasma
membrane showing a protrusion.


CHAPTER 11

Membrane Transport of Small Molecules and the Electrical Properties of Membranes

Because of its hydrophobic interior, the lipid bilayer of cell membranes
restricts the passage of most polar molecules. This barrier function allows the
cell to maintain concentrations of solutes in its cytosol that differ from
those in the extracellular fluid and in each of the intracellular
membrane-enclosed compartments. To benefit from this barrier, however, cells
have had to evolve ways of transferring specific water-soluble molecules and
ions across their membranes in order to ingest essential nutrients, excrete
metabolic waste products, and regulate intracellular ion concentrations. Cells
use specialized membrane transport proteins to accomplish this goal. The
importance of such small molecule transport is reflected in the large number of
genes in all organisms that code for the transmembrane transport proteins
involved, which make up 15-30% of the membrane proteins in all cells. Some
mammalian cells, such as nerve and kidney cells, devote up to two-thirds of
their total metabolic energy consumption to such transport processes.

Cells can also transfer macromolecules and even large particles across their
membranes, but the mechanisms involved in most of these cases differ from
those used for transferring small molecules, and they are discussed in Chapters
12 and 13.

We begin this chapter by describing some general principles of how small
water-soluble molecules traverse cell membranes. We then consider, in turn, the
two main classes of membrane proteins that mediate this transmembrane traffic:
transporters, which undergo sequential conformational changes to transport
specific small molecules across membranes, and channels, which form narrow
pores, allowing passive transmembrane movement, primarily of water and small
inorganic ions. Transporters can be coupled to a source of energy to catalyze
active transport, which together with selective passive permeability, creates
large differences in the composition of the cytosol compared with that of
either the extracellular fluid (Table 11-1) or the fluid within
membrane-enclosed organelles. By generating inorganic ion-concentration
differences across the lipid bilayer, cell membranes can store potential energy
in the form of electrochemical gradients, which drive various transport
processes, convey electrical signals in electrically excitable cells, and (in
mitochondria, chloroplasts, and bacteria) make most of the cell's ATP. We focus
our discussion mainly on transport across the plasma membrane, but similar
mechanisms operate across the other membranes of the eukaryotic cell, as
discussed in later chapters.

In the last part of the chapter, we concentrate mainly on the functions of ion
channels in neurons (nerve cells). In these cells, channel proteins perform at
their highest level of sophistication, enabling networks of neurons to carry
out all the astonishing feats your brain is capable of.

PRINCIPLES OF MEMBRANE TRANSPORT

We begin this section by describing the permeability properties of protein-free,
synthetic lipid bilayers. We then introduce some of the terms used to describe the
various forms of membrane transport and some strategies for characterizing the
proteins and processes involved.

Table 11-1 A Comparison of Inorganic Ion Concentrations Inside and Outside
a Typical Mammalian Cell*

Component   Cytoplasmic concentration (mM)     Extracellular concentration (mM)

Cations
  Na+       5-15                               145
  K+        140                                5
  Mg2+      0.5                                1-2
  Ca2+      10^-4                              1-2
  H+        7 × 10^-5 (10^-7.2 M or pH 7.2)    4 × 10^-5 (10^-7.4 M or pH 7.4)

Anions
  Cl-       5-15                               110

*The cell must contain equal quantities of positive and negative charges (that
is, it must be electrically neutral). Thus, in addition to Cl-, the cell
contains many other anions not listed in this table; in fact, most cell
constituents are negatively charged (HCO3-, PO4^3-, nucleic acids, metabolites
carrying phosphate and carboxyl groups, etc.). The concentrations of Ca2+ and
Mg2+ given are for the free ions: although there is a total of about 20 mM Mg2+
and 1-2 mM Ca2+ in cells, both ions are mostly bound to other substances (such
as proteins, free nucleotides, RNA, etc.) and, for Ca2+, stored within various
organelles.


Protein-Free Lipid Bilayers Are Impermeable to Ions

Given enough time, virtually any molecule will diffuse across a protein-free
lipid bilayer down its concentration gradient. The rate of diffusion, however,
varies enormously, depending partly on the size of the molecule but mostly on
its relative hydrophobicity (solubility in oil). In general, the smaller the
molecule and the more hydrophobic, or nonpolar, it is, the more easily it will
diffuse across a lipid bilayer. Small nonpolar molecules, such as O2 and CO2,
readily dissolve in lipid bilayers and therefore diffuse rapidly across them.
Small uncharged polar molecules, such as water or urea, also diffuse across a
bilayer, albeit much more slowly (Figure 11-1 and see Movie 10.3). By contrast,
lipid bilayers are essentially impermeable to charged molecules (ions), no
matter how small: the charge and high degree of hydration of such molecules
prevent them from entering the hydrocarbon phase of the bilayer (Figure 11-2).

There Are Two Main Classes of Membrane Transport Proteins:
Transporters and Channels

Like synthetic lipid bilayers, cell membranes allow small nonpolar molecules to
permeate by diffusion. Cell membranes, however, also have to allow the passage
of various polar molecules, such as ions, sugars, amino acids, nucleotides,
water, and many cell metabolites that cross synthetic lipid bilayers only very
slowly. Special membrane transport proteins transfer such solutes across cell
membranes. These proteins occur in many forms and in all types of biological
membranes. Each protein often transports only a specific molecular species or
sometimes a class of molecules (such as ions, sugars, or amino acids). Studies
in the 1950s found that bacteria with a single-gene mutation were unable to
transport sugars across their plasma membrane, thereby demonstrating the
specificity of membrane transport proteins. We now know that humans with
similar mutations suffer from various inherited diseases that hinder the
transport of a specific solute or solute class in the kidney, intestine, or
other cell type. Individuals with the inherited disease cystinuria, for
example, cannot transport certain amino acids (including cystine, the
disulfide-linked dimer of cysteine) from either the urine or the intestine into
the blood; the resulting accumulation of cystine in the urine leads to the
formation of cystine stones in the kidneys.

Figure 11-1 The relative permeability of a synthetic lipid bilayer to different
classes of molecules. The smaller the molecule and, more importantly, the less
strongly it associates with water, the more rapidly the molecule diffuses
across the bilayer.

Figure 11-2 Permeability coefficients for the passage of various molecules
through synthetic lipid bilayers. The rate of flow of a solute across the
bilayer is directly proportional to the difference in its concentration on the
two sides of the membrane. Multiplying this concentration difference (in
mol/cm^3) by the permeability coefficient (in cm/sec) gives the flow of solute
in moles per second per square centimeter of bilayer. A concentration
difference of tryptophan of 10^-4 mol/cm^3 (10^-4 mol/10^-3 L = 0.1 M), for
example, would cause a flow of 10^-4 mol/cm^3 × 10^-7 cm/sec = 10^-11 mol/sec
through 1 cm^2 of bilayer, or 6 × 10^4 molecules/sec through 1 μm^2 of bilayer.
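
The flux arithmetic quoted in the caption of Figure 11-2 can be checked with a few lines of Python. This is only a minimal sketch: the function names are illustrative and do not come from the text, while the tryptophan permeability coefficient (10^-7 cm/sec) and concentration difference (10^-4 mol/cm^3) are the values given in the caption.

```python
AVOGADRO = 6.022e23  # molecules per mole


def solute_flux(permeability_cm_per_s, delta_c_mol_per_cm3):
    """Flow of solute in mol per second per cm^2 of bilayer.

    Flux = permeability coefficient x concentration difference.
    """
    return permeability_cm_per_s * delta_c_mol_per_cm3


def flux_in_molecules_per_um2(flux_mol_per_s_cm2):
    """Convert mol/(sec * cm^2) into molecules/(sec * um^2).

    1 cm^2 equals 1e8 um^2.
    """
    return flux_mol_per_s_cm2 * AVOGADRO / 1e8


# Tryptophan example from the Figure 11-2 caption.
tryptophan_flux = solute_flux(1e-7, 1e-4)
print(tryptophan_flux)                             # about 1e-11 mol/sec per cm^2
print(flux_in_molecules_per_um2(tryptophan_flux))  # about 6e4 molecules/sec per um^2
```

Because the flux is simply proportional to the permeability coefficient, the same two functions can be reused for any other solute once its coefficient is known.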

All membrane transport proteins that have been studied in detail are multipass
transmembrane proteins; that is, their polypeptide chains traverse the lipid
bilayer multiple times. By forming a protein-lined pathway across the membrane,
these proteins enable specific hydrophilic solutes to cross the membrane
without coming into direct contact with the hydrophobic interior of the lipid
bilayer.

Transporters and channels are the two major classes of membrane transport
proteins (Figure 11-3). Transporters (also called carriers, or permeases) bind
the specific solute to be transported and undergo a series of conformational
changes that alternately expose solute-binding sites on one side of the
membrane and then on the other to transfer the solute across it. Channels, by
contrast, interact with the solute to be transported much more weakly. They
form continuous pores that extend across the lipid bilayer. When open, these
pores allow specific solutes (such as inorganic ions of appropriate size and
charge and in some cases small molecules, including water, glycerol, and
ammonia) to pass through them and thereby cross the membrane. Not surprisingly,
transport through channels occurs at a much faster rate than transport mediated
by transporters. Although water can slowly diffuse across synthetic lipid
bilayers, cells use dedicated channel proteins (called water channels, or
aquaporins) that greatly increase the permeability of their membranes to water,
as we discuss later.

Active Transport Is Mediated by Transporters Coupled to an 
Energy Source 

All channels and many transporters allow solutes to cross the membrane only
passively (“downhill”), a process called passive transport. In the case of
transport of a single uncharged molecule, the difference in the concentration
on the two sides of the membrane (its concentration gradient) drives passive
transport and determines its direction (Figure 11-4A). If the solute carries a
net charge, however, both its concentration gradient and the electrical
potential difference across the membrane, the membrane potential, influence its
transport. The concentration gradient and the electrical gradient combine to
form a net driving force, the electrochemical gradient, for each charged solute
(Figure 11-4B). We discuss electrochemical gradients in more detail later and
in Chapter 14. In fact, almost all plasma membranes have an electrical
potential (i.e., a voltage) across them, with the inside usually negative with
respect to the outside. This potential favors the entry of positively charged
ions into the cell but opposes the entry of negatively charged ions (see
Figure 11-4B); it also opposes the efflux of positively charged ions.
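
To make the idea of a combined driving force concrete, the sketch below adds the chemical and electrical terms for an ion entering a cell. It is an illustration under stated assumptions, not a calculation from the text: it uses the standard thermodynamic expression ΔG = RT ln(c_in/c_out) + zFV, which this section does not write out; the concentrations come from Table 11-1 (with 10 mM taken as a midpoint for cytosolic Na+), and the membrane potential of -70 mV and the temperature of 37°C are assumed typical values. The function name is hypothetical.

```python
import math

R = 8.314    # gas constant, J/(mol*K)
F = 96485.0  # Faraday constant, C/mol
T = 310.0    # assumed temperature (37 degrees C), in kelvin


def inward_free_energy(c_in_mM, c_out_mM, z, v_membrane_volts):
    """Free-energy change (J/mol) for moving an ion from outside to inside.

    Negative values mean that entry is favorable (downhill); positive values
    mean that entry is uphill, so the ion tends to leave the cell instead.
    v_membrane_volts is the inside potential relative to the outside.
    """
    chemical = R * T * math.log(c_in_mM / c_out_mM)  # concentration gradient
    electrical = z * F * v_membrane_volts            # membrane potential
    return chemical + electrical


V_ASSUMED = -0.070  # assumed membrane potential, inside negative

# Na+: both gradients point inward, so they add.
print(inward_free_energy(10, 145, +1, V_ASSUMED))
# K+: the concentration gradient points outward, the potential pulls inward.
print(inward_free_energy(140, 5, +1, V_ASSUMED))
```

With these assumed numbers the two terms reinforce each other for Na+ and largely cancel for K+, which corresponds to the two situations described for Figure 11-4B.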


[Figure 11-2, permeability scale: coefficients run from high permeability
(about 10^2 cm/sec) to low permeability (about 10^-14 cm/sec); labeled
molecules are O2, H2O, urea, glycerol, tryptophan, glucose, and Na+.]


Figure 11-3 Transporters and channel proteins. (A) A transporter alternates
between two conformations, so that the solute-binding site is sequentially
accessible on one side of the bilayer and then on the other. (B) In contrast, a
channel protein forms a pore across the bilayer through which specific solutes
can passively diffuse.


Figure 11-4 Different forms of membrane transport and the influence of the
membrane. (A) Passive transport down a concentration gradient (or an
electrochemical gradient; see B below) occurs spontaneously, by diffusion,
either through the lipid bilayer directly or through channels or passive
transporters. By contrast, active transport requires an input of metabolic
energy and is always mediated by transporters that pump the solute against its
concentration or electrochemical gradient. (B) The electrochemical gradient of
a charged solute (an ion) affects its transport. This gradient combines the
membrane potential and the concentration gradient of the solute. The electrical
and chemical gradients can work additively to increase the driving force on an
ion across the membrane (middle) or can work against each other (right).
[Panel B labels: concentration gradient (with no membrane potential);
electrochemical gradient with a membrane potential.]


As shown in Figure 11-4A, in addition to passive transport, cells need to be
able to actively pump certain solutes across the membrane “uphill,” against
their electrochemical gradients. Such active transport is mediated by
transporters whose pumping activity is directional because it is tightly
coupled to a source of metabolic energy, such as an ion gradient or ATP
hydrolysis, as discussed later. Transmembrane movement of small molecules
mediated by transporters can be either active or passive, whereas that mediated
by channels is always passive (see Figure 11-4A).

Summary 

Lipid bilayers are virtually impermeable to most polar molecules. To transport
small water-soluble molecules into or out of cells or intracellular
membrane-enclosed compartments, cell membranes contain various membrane
transport proteins, each of which is responsible for transferring a particular
solute or class of solutes across the membrane. There are two classes of
membrane transport proteins: transporters and channels. Both form protein
pathways across the lipid bilayer. Whereas transmembrane movement mediated by
transporters can be either active or passive, solute flow through channel
proteins is always passive. Both active and passive ion transport is influenced
by the ion's concentration gradient and the membrane potential, that is, its
electrochemical gradient.

TRANSPORTERS AND ACTIVE MEMBRANE 
TRANSPORT 

The process by which a transporter transfers a solute molecule across the lipid
bilayer resembles an enzyme-substrate reaction, and in many ways transporters
behave like enzymes. By contrast to ordinary enzyme-substrate reactions,
however, the transporter does not modify the transported solute but instead
delivers it unchanged to the other side of the membrane.

Each type of transporter has one or more specific binding sites for its solute
(substrate). It transfers the solute across the lipid bilayer by undergoing
reversible conformational changes that alternately expose the solute-binding
site first on one side of the membrane and then on the other, but never on both
sides at the same time. The transition occurs through an intermediate state in
which the solute is inaccessible, or occluded, from either side of the membrane
(Figure 11-5). When the transporter is saturated (that is, when all
solute-binding sites are occupied), the rate of transport is maximal. This
rate, referred to as Vmax (V for velocity), is characteristic of the specific
carrier. Vmax measures the rate at which the carrier can flip between its
conformational states. In addition, each transporter has a characteristic
affinity for its solute, reflected in the Km of the reaction, which is equal to
the concentration of solute when the transport rate is half its maximum value
(Figure 11-6). As with enzymes, the binding of solute can be blocked by either
competitive inhibitors (which compete for the same binding site and may or may
not be transported) or noncompetitive inhibitors (which bind elsewhere and
alter the structure of the transporter).
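
The saturation behavior described above has the same hyperbolic form as simple enzyme kinetics. The sketch below assumes the standard Michaelis-Menten expression, rate = Vmax × [S] / (Km + [S]), which matches the definitions of Vmax and Km given in the text but is not written out there; the function name and the numerical values are purely illustrative.

```python
def transport_rate(solute_conc, v_max, k_m):
    """Transporter-mediated transport rate for a single solute.

    Assumes Michaelis-Menten-type kinetics: the rate rises with solute
    concentration and levels off at v_max once the binding sites saturate.
    solute_conc and k_m must be in the same concentration units.
    """
    return v_max * solute_conc / (k_m + solute_conc)


# Hypothetical transporter: v_max = 100 (arbitrary rate units), k_m = 2 mM.
V_MAX = 100.0
K_M = 2.0

for conc in (0.2, 2.0, 20.0, 200.0):
    print(conc, transport_rate(conc, V_MAX, K_M))
```

When the solute concentration equals Km the function returns exactly half of Vmax, and at concentrations far above Km it approaches Vmax, in contrast to the straight-line dependence expected for simple diffusion.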

As we discuss shortly, it requires only a relatively minor modification of the
model shown in Figure 11-5 to link a transporter to a source of energy in order
to pump a solute uphill against its electrochemical gradient. Cells carry out
such active transport in three main ways (Figure 11-7):

1. Coupled transporters harness the energy stored in concentration gradients 
to couple the uphill transport of one soluté across the membrane to the 
downhill transport of another. 

2. ATP-driven pumps couple uphill transport to the hydrolysis of ATP. 

3. Light- or redox-driven pumps, which are known in bacteria, archaea,
mitochondria, and chloroplasts, couple uphill transport to an input of energy
from light, as with bacteriorhodopsin (discussed in Chapter 10), or from a
redox reaction, as with cytochrome c oxidase (discussed in Chapter 14).

Amino acid sequence and three-dimensional structure comparisons suggest
that, in many cases, there are strong similarities in structure between
transporters that mediate active transport and those that mediate passive
transport. Some bacterial transporters, for example, that use the energy stored
in the H+ gradient across the plasma membrane to drive the active uptake of
various sugars are structurally similar to the transporters that mediate
passive glucose transport into most animal cells. This suggests an evolutionary
relationship between various transporters. Given the importance of small
metabolites and sugars as energy sources, it is not surprising that the
superfamily of transporters is an ancient one.

We begin our discussion of active membrane transport by considering a class
of coupled transporters that are driven by ion concentration gradients. These
proteins have a crucial role in the transport of small metabolites across
membranes in all cells. We then discuss ATP-driven pumps, including the Na+-K+
pump that is found in the plasma membrane of most animal cells. Examples of the
third class of active transport (light- or redox-driven pumps) are discussed in
Chapter 14.

Active Transport Can Be Driven by Ion-Concentration Gradients 

Some transporters simply passively mediate the movement of a single solute from
one side of the membrane to the other at a rate determined by their Vmax and
Km; they are called uniporters. Others function as coupled transporters, in
which the transfer of one solute strictly depends on the transport of a second.
Coupled transport involves either the simultaneous transfer of a second solute
in the same direction, performed by symporters (also called co-transporters),
or the transfer of a second solute in the opposite direction, performed by
antiporters (also called exchangers) (Figure 11-8).


Figure 11-5 A model of how a conformational change in a transporter mediates
the passive movement of a solute. The transporter is shown in three
conformational states: in the outward-open state, the binding sites for solute
are exposed on the outside; in the occluded state, the same sites are not
accessible from either side; and in the inward-open state, the sites are
exposed on the inside. The transitions between the states occur randomly. They
are completely reversible and do not depend on whether the solute-binding site
is occupied. Therefore, if the solute concentration is higher on the outside of
the bilayer, more solute binds to the transporter in the outward-open
conformation than in the inward-open conformation, and there is a net transport
of solute down its concentration gradient (or, if the solute is an ion, down
its electrochemical gradient).


Figure 11-6 The kinetics of simple diffusion compared with transporter-mediated
diffusion. Whereas the rate of diffusion and channel-mediated transport is
directly proportional to the solute concentration (within the physical limits
imposed by total surface area or total channels available), the rate of
transporter-mediated diffusion reaches a maximum (Vmax) when the transporter is
saturated. The solute concentration when the transport rate is at half its
maximal value approximates the binding constant (Km) of the transporter for the
solute and is analogous to the Km of an enzyme for its substrate. The graph
applies to a transporter moving a single solute; the kinetics of coupled
transport of two or more solutes is more complex and exhibits cooperative
behavior. [Graph axis: concentration of transported molecule, with Km marked.]


Figure 11-7 Three ways of driving active transport. The actively transported
molecule is shown in orange, and the energy source is shown in red.
Redox-driven active transport is discussed in Chapter 14 (see Figures 14-18 and
14-19).

The tight coupling between the transfer of two solutes allows the coupled
transporters to harvest the energy stored in the electrochemical gradient of
one solute, typically an inorganic ion, to transport the other. In this way,
the free energy released during the movement of an inorganic ion down an
electrochemical gradient is used as the driving force to pump other solutes
uphill, against their electrochemical gradient. This strategy can work in
either direction; some coupled transporters function as symporters, others as
antiporters. In the plasma membrane of animal cells, Na+ is the usual
co-transported ion because its electrochemical gradient provides a large
driving force for the active transport of a second molecule. The Na+ that
enters the cell during coupled transport is subsequently pumped out by an
ATP-driven Na+-K+ pump in the plasma membrane (as we discuss later), which, by
maintaining the Na+ gradient, indirectly drives the coupled transport. Such
ion-driven coupled transporters as just described are said to mediate secondary
active transport. In contrast, ATP-driven pumps are said to mediate primary
active transport because in these the free energy of ATP hydrolysis is used to
directly drive the transport of a solute against its concentration gradient.
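
The energy bookkeeping behind secondary active transport can be sketched numerically. The example below estimates the largest concentration ratio that an ideal Na+-driven symporter could build up for an uncharged solute. It rests on several assumptions that are not taken from the text: the standard expression ΔG = RT ln(c_in/c_out) + zFV for the Na+ electrochemical gradient, a membrane potential of -70 mV, a temperature of 37°C, a cytosolic Na+ concentration of 10 mM (the midpoint of the range in Table 11-1), and perfect coupling with a hypothetical stoichiometry given by na_per_solute. The function name is illustrative.

```python
import math

R = 8.314    # gas constant, J/(mol*K)
F = 96485.0  # Faraday constant, C/mol
T = 310.0    # assumed temperature (37 degrees C), in kelvin


def max_accumulation_ratio(na_in_mM, na_out_mM, v_membrane_volts, na_per_solute=1):
    """Upper limit on [solute]in / [solute]out for an ideal Na+ symporter.

    At equilibrium, the free energy released by na_per_solute Na+ ions moving
    inward just balances the cost of moving one uncharged solute molecule
    uphill: RT ln(ratio) = -na_per_solute * dG_Na.
    """
    dg_na = R * T * math.log(na_in_mM / na_out_mM) + F * v_membrane_volts
    return math.exp(-na_per_solute * dg_na / (R * T))


# One Na+ per solute molecule, with the assumed values described above.
print(max_accumulation_ratio(10, 145, -0.070))
# Two Na+ per solute molecule squares the attainable ratio.
print(max_accumulation_ratio(10, 145, -0.070, na_per_solute=2))
```

Under these assumptions a one-to-one symporter could in principle concentrate an uncharged solute roughly 200-fold; a real transporter working away from equilibrium would achieve less.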

Intestinal and kidney epithelial cells contain a variety of symporters that are
driven by the Na⁺ gradient across the plasma membrane. Each Na⁺-driven
symporter is specific for importing a small group of related sugars or amino acids
into the cell. Because the Na⁺ tends to move into the cell down its electrochemical
gradient, the sugar or amino acid is, in a sense, "dragged" into the cell with
it. The greater the electrochemical gradient for Na⁺, the more solute is pumped
into the cell (Figure 11-9). Neurotransmitters (released by nerve cells to signal at
synapses, as we discuss later) are taken up again by Na⁺ symporters after their
release. These neurotransmitter transporters are important drug targets: stimulants
such as cocaine, as well as antidepressants, inhibit them and thereby prolong
signaling by the neurotransmitters, which are not cleared efficiently.

Figure 11-8 This schematic diagram shows transporters functioning as
uniporters, symporters, and antiporters (Movie 11.1).

Figure 11-9 Mechanism of glucose transport fueled by a Na⁺ gradient. As in the
model shown in Figure 11-5, the transporter alternates between inward-open and
outward-open states via an occluded intermediate state. Binding of Na⁺ and glucose
is cooperative; that is, the binding of either solute increases the protein's affinity
for the other. Since the Na⁺ concentration is much higher in the extracellular space
than in the cytosol, glucose is more likely to bind to the transporter in the
outward-facing state. The transition to the occluded state occurs only when both Na⁺
and glucose are bound; their precise interactions in the solute-binding sites slightly
stabilize the occluded state and thereby make this transition energetically favorable.
Stochastic fluctuations caused by thermal energy drive the transporter randomly into
the inward-open or outward-open conformation. If it opens outwardly, nothing is
achieved, and the process starts all over. However, whenever it opens inwardly, Na⁺
dissociates quickly in the low-Na⁺-concentration environment of the cytosol. Glucose
dissociation is likewise enhanced when Na⁺ is lost, because of cooperativity in binding
of the two solutes. The overall result is the net transport of both Na⁺ and glucose
into the cell. Because the occluded state is not formed when only one of the solutes
is bound, the transporter switches conformation only when it is fully occupied or
fully empty, thereby assuring strict coupling of the transport of Na⁺ and glucose.
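
The strict coupling described for the Na+-driven glucose symporter (Figure 11-9) can be explored with a toy simulation. The sketch below is a deliberately simplified Monte Carlo model, not the authors' model: it ignores cooperative binding and the occluded intermediate, and simply lets each binding site equilibrate with whichever side the transporter faces, allowing a conformational switch only when the transporter is fully occupied or fully empty. All concentrations, dissociation constants, the switching probability, and the function name are assumptions chosen for illustration.

```python
import random

def simulate_symporter(steps, na_out=145.0, na_in=12.0,
                       glc_out=1.0, glc_in=3.0,
                       k_na=50.0, k_glc=1.0, seed=0):
    """Toy alternating-access model.

    Returns the net number of Na+ ions and glucose molecules carried
    inward (negative values would mean net outward transport).
    """
    rng = random.Random(seed)
    facing = "out"
    net_na = net_glc = 0
    for _ in range(steps):
        na_conc = na_out if facing == "out" else na_in
        glc_conc = glc_out if facing == "out" else glc_in
        # Each site equilibrates with the solution it currently faces.
        na_bound = rng.random() < na_conc / (na_conc + k_na)
        glc_bound = rng.random() < glc_conc / (glc_conc + k_glc)
        # Strict coupling: switch only when fully occupied or fully empty.
        if na_bound == glc_bound and rng.random() < 0.5:
            if na_bound:
                direction = 1 if facing == "out" else -1
                net_na += direction
                net_glc += direction
            facing = "in" if facing == "out" else "out"
    return net_na, net_glc

net_na, net_glc = simulate_symporter(100_000)
print(f"Net Na+ moved inward: {net_na}, net glucose moved inward: {net_glc}")
```

In this sketch glucose is set three times more concentrated inside than outside, yet the steeper Na+ gradient still produces net inward movement of both solutes, and the two counts are always equal because the transporter never switches when only one site is filled.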

Despite their great variety, transporters share structural features that can
explain how they function and how they evolved. Transporters are typically
built from bundles of 10 or more α helices that span the membrane. Solute- and
ion-binding sites are located midway through the membrane, where some helices
are broken or distorted and amino acid side chains and polypeptide backbone
atoms form ion- and solute-binding sites. In the inward-open and outward-open
conformations, these binding sites are accessible by passageways from one side of
the membrane but not the other. In switching between the two conformations, the
transporter protein transiently adopts an occluded conformation, in which both
passageways are closed; this prevents the driving ion and the transported solute
from crossing the membrane unaccompanied, which would deplete the cell's
energy store to no purpose. Because only transporters with both types of binding
sites appropriately filled change their conformation, tight coupling between ion
and solute transport is assured.

Like enzymes, transporters can work in the reverse direction if ion and solute
gradients are appropriately adjusted experimentally. This chemical symmetry is
mirrored in their physical structure. Crystallographic analyses have revealed that
transporters are built from inverted repeats: the packing of the transmembrane α
helices in one half of the helix bundle is structurally similar to the packing in the
other half, but the two halves are inverted in the membrane relative to each other.
Transporters are therefore said to be pseudosymmetric, and the passageways
that open and close on either side of the membrane have closely similar geometries,
allowing alternating access to the ion- and solute-binding sites in the center
(Figure 11-10). It is thought that the two halves evolved by gene duplication of a
smaller ancestor protein.

Some other types of important membrane transport proteins are also built from
inverted repeats. Examples even include channel proteins such as the aquaporin
water channel (discussed later) and the Sec61 channel through which nascent
polypeptides move into the endoplasmic reticulum (discussed in Chapter 12). It
is thought that these channels evolved from coupled transporters, in which the
gating functions were lost, allowing them to open toward both sides of the membrane
simultaneously to provide a continuous path across the membrane.

In bacteria, yeasts, and plants, as well as in many membrane-enclosed organelles
of animal cells, most ion-driven active transport systems depend on H⁺
rather than Na⁺ gradients, reflecting the predominance of H⁺ pumps in these
membranes. An electrochemical H⁺ gradient across the bacterial plasma membrane,
for example, drives the inward active transport of many sugars and amino
acids.


Figure 11-10 Transporters are built from inverted repeats. (A) LeuT, a bacterial
leucine/Na⁺ symporter related to human neurotransmitter transporters, such as the
serotonin transporter, is shown. The core of the transporter is built from two bundles,
each composed of five α helices (blue and yellow). The helices shown in gray
differ among members of this transporter family and are thought to play regulatory
roles, which are specific to a particular transporter. (B) Both core helix bundles are
packed in a similar arrangement (shown as a hand, with the broken helix as the
thumb), but the second bundle is inverted with respect to the first. The transporter's
structural pseudosymmetry reflects its functional symmetry: the transporter can
work in either direction, depending on the direction of the ion gradient. (Adapted from
K.R. Vinothkumar and R. Henderson, Q. Rev. Biophys. 43:65-158, 2010. With
permission from Cambridge University Press. PDB code: 3F3E.)


Transporters in the Plasma Membrane Regulate Cytosolic pH 

Most proteins operate optimally at a particular pH. Lysosomal enzymes, for 
example, function best at the low pH (~5) found in lysosomes, whereas cytosolic 
enzymes function best at the close-to-neutral pH (~7.2) found in the cytosol. It 
is therefore crucial that cells control the pH of their intracellular compartments. 

Most cells have one or more types of Na⁺-driven antiporters in their plasma
membrane that help to maintain the cytosolic pH at about 7.2. These transporters
use the energy stored in the Na⁺ gradient to pump out excess H⁺, which either
leaks in or is produced in the cell by acid-forming reactions. Two mechanisms are
used: either H⁺ is directly transported out of the cell or HCO₃⁻ is brought into the
cell to neutralize H⁺ in the cytosol (according to the reaction HCO₃⁻ + H⁺ → H₂O +
CO₂). One of the antiporters that uses the first mechanism is a Na⁺-H⁺ exchanger,
which couples an influx of Na⁺ to an efflux of H⁺. Another, which uses a combination
of the two mechanisms, is a Na⁺-driven Cl⁻-HCO₃⁻ exchanger that couples
an influx of Na⁺ and HCO₃⁻ to an efflux of Cl⁻ and H⁺ (so that NaHCO₃ comes
in and HCl goes out). The Na⁺-driven Cl⁻-HCO₃⁻ exchanger is twice as effective
as the Na⁺-H⁺ exchanger: it pumps out one H⁺ and neutralizes another for each
Na⁺ that enters the cell. If HCO₃⁻ is available, as is usually the case, this antiporter
is the most important transporter regulating the cytosolic pH. The pH inside the
cell regulates both exchangers; when the pH in the cytosol falls, both exchangers
increase their activity.
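
The claim that the Na+-driven Cl--HCO3- exchanger is twice as effective as the Na+-H+ exchanger follows from simple bookkeeping, sketched below. The class and field names are hypothetical; the stoichiometries are those stated in the text (one H+ exported per Na+ for both exchangers, plus one HCO3- imported per Na+ for the second).

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Exchanger:
    name: str
    h_exported_per_na: int      # H+ moved out of the cell per Na+ entering
    hco3_imported_per_na: int   # HCO3- brought into the cell per Na+ entering

    def acid_removed_per_na(self) -> int:
        # Each imported HCO3- neutralizes one cytosolic H+
        # (HCO3- + H+ -> H2O + CO2), so it counts like an exported H+.
        return self.h_exported_per_na + self.hco3_imported_per_na

na_h = Exchanger("Na+-H+ exchanger", 1, 0)
na_cl_hco3 = Exchanger("Na+-driven Cl--HCO3- exchanger", 1, 1)

for exchanger in (na_h, na_cl_hco3):
    print(f"{exchanger.name}: {exchanger.acid_removed_per_na()} "
          "acid equivalent(s) removed per Na+")
```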

A Na⁺-independent Cl⁻-HCO₃⁻ exchanger adjusts the cytosolic pH in the
reverse direction. As with the Na⁺-dependent transporters, pH regulates the
Na⁺-independent Cl⁻-HCO₃⁻ exchanger, but the exchanger's activity increases as the
cytosol becomes too alkaline. The movement of HCO₃⁻ in this case is normally
out of the cell, down its electrochemical gradient, which decreases the pH of the
cytosol. A Na⁺-independent Cl⁻-HCO₃⁻ exchanger in the membrane of red blood
cells (called band 3 protein; see Figure 10-38) facilitates the quick discharge of
CO₂ (as HCO₃⁻) as the cells pass through capillaries in the lung.

The intracellular pH is not entirely regulated by transporters in the plasma
membrane: ATP-driven H⁺ pumps are used to control the pH of many intracellular
compartments. As discussed in Chapter 13, H⁺ pumps maintain the low pH in
lysosomes, as well as in endosomes and secretory vesicles. These H⁺ pumps use
the energy of ATP hydrolysis to pump H⁺ into these organelles from the cytosol.

An Asymmetric Distribution of Transporters in Epithelial Cells
Underlies the Transcellular Transport of Solutes

In epithelial cells, such as those that absorb nutrients from the gut, transporters
are distributed nonuniformly in the plasma membrane and thereby contribute
to the transcellular transport of absorbed solutes. By the actions of the transporters
in these cells, solutes are moved across the epithelial cell layer into the
extracellular fluid from where they pass into the blood. As shown in Figure 11-11,
Na⁺-linked symporters located in the apical (absorptive) domain of the plasma
membrane actively transport nutrients into the cell, building up substantial
concentration gradients for these solutes across the plasma membrane. Uniporters
in the basal and lateral (basolateral) domains allow the nutrients to leave the cell
passively down these concentration gradients.

In many of these epithelial cells, the plasma membrane area is greatly increased
by the formation of thousands of microvilli, which extend as thin, fingerlike
projections from the apical surface of each cell. Such microvilli can increase the total
absorptive area of a cell as much as 25-fold, thereby enhancing its transport
capabilities.

As we have seen, ion gradients have a crucial role in driving many essential
transport processes in cells. Ion pumps that use the energy of ATP hydrolysis
establish and maintain these gradients, as we discuss next.

Figure 11-11 Transcellular transport. The transcellular transport of glucose across
an intestinal epithelial cell depends on the nonuniform distribution of transporters in
the cell's plasma membrane. The process shown here results in the transport of
glucose from the intestinal lumen to the extracellular fluid (from where it passes
into the blood). Glucose is pumped into the cell through the apical domain of the
membrane by a Na⁺-powered glucose symporter. Glucose passes out of the
cell (down its concentration gradient) by passive movement through a glucose
uniporter in the basal and lateral membrane domains. The Na⁺ gradient driving the
glucose symport is maintained by the Na⁺-K⁺ pump in the basal and lateral
plasma membrane domains, which keeps the internal concentration of Na⁺ low
(Movie 11.2). Adjacent cells are connected by impermeable tight junctions, which
have a dual function in the transport process illustrated: they prevent solutes
from crossing the epithelium between cells, allowing a concentration gradient
of glucose to be maintained across the cell sheet (see Figure 19-18). They also
serve as diffusion barriers (fences) within the plasma membrane, which help confine
the various transporters to their respective membrane domains (see Figure 10-34).

There Are Three Classes of ATP-Driven Pumps

ATP-driven pumps are often called transport ATPases because they hydrolyze ATP
to ADP and phosphate and use the energy released to pump ions or other solutes
across a membrane. There are three principal classes of ATP-driven pumps
(Figure 11-12), and representatives of each are found in all prokaryotic and
eukaryotic cells.

1. P-type pumps are structurally and functionally related multipass transmembrane
proteins. They are called "P-type" because they phosphorylate
themselves during the pumping cycle. This class includes many of the ion
pumps that are responsible for setting up and maintaining gradients of
Na⁺, K⁺, H⁺, and Ca²⁺ across cell membranes.

2. ABC transporters (ATP-Binding Cassette transporters) differ structurally
from P-type ATPases and primarily pump small molecules across cell
membranes.

3. V-type pumps are turbine-like protein machines, constructed from multiple
different subunits. The V-type proton pump transfers H⁺ into organelles
such as lysosomes, synaptic vesicles, and plant or yeast vacuoles (V = vacuolar),
to acidify the interior of these organelles (see Figure 13-37).

Structurally related to the V-type pumps is a distinct family of F-type ATPases,
more commonly called ATP synthases because they normally work in reverse:
instead of using ATP hydrolysis to drive H⁺ transport, they use the H⁺ gradient
across the membrane to drive the synthesis of ATP from ADP and phosphate (see
Figure 14-30). ATP synthases are found in the plasma membrane of bacteria, the
inner membrane of mitochondria, and the thylakoid membrane of chloroplasts.
The H⁺ gradient is generated either during the electron-transport steps of oxidative
phosphorylation (in aerobic bacteria and mitochondria), during photosynthesis
(in chloroplasts), or by the light-driven H⁺ pump (bacteriorhodopsin) in
Halobacterium. We discuss some of these proteins in detail in Chapter 14.

For the remainder of this section, we focus on P-type pumps and ABC transporters.


Figure 11-12 Three types of ATP-driven pumps. Like any enzyme, all ATP-driven
pumps can work in either direction, depending on the electrochemical
gradients of their solutes and the ATP/ADP ratio. When the ATP/ADP ratio is high, they
hydrolyze ATP; when the ATP/ADP ratio is low, they can synthesize ATP. The F-type
ATPase in mitochondria normally works in this "reverse" mode to make most of the
cell's ATP.


A P-type ATPase Pumps Ca²⁺ into the Sarcoplasmic Reticulum
in Muscle Cells

Eukaryotic cells maintain very low concentrations of free Ca²⁺ in their cytosol
(~10⁻⁷ M) in the face of a very much higher extracellular Ca²⁺ concentration
(~10⁻³ M). Therefore, even a small influx of Ca²⁺ significantly increases the
concentration of free Ca²⁺ in the cytosol, and the flow of Ca²⁺ down its steep
concentration gradient in response to extracellular signals is one means of transmitting
these signals rapidly across the plasma membrane (discussed in Chapter 15). It is thus
important that the cell maintains a steep Ca²⁺ gradient across its plasma membrane.
Ca²⁺ transporters that actively pump Ca²⁺ out of the cell help maintain the
gradient. One of these is a P-type Ca²⁺ ATPase; the other is an antiporter (called
a Na⁺-Ca²⁺ exchanger) that is driven by the Na⁺ electrochemical gradient
(discussed in Chapter 15).
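
To give a feel for how steep this Ca2+ gradient is, the sketch below computes the concentration ratio from the two values quoted above and the corresponding equilibrium (Nernst) potential for Ca2+. The Nernst relation and the temperature of 310 K are assumptions brought in for illustration; they are not introduced in this part of the text, and the function name is hypothetical.

```python
import math

R = 8.314      # gas constant, J/(mol*K)
F = 96485.0    # Faraday constant, C/mol

def nernst_potential(c_out, c_in, z, temp_k=310.0):
    """Equilibrium potential (volts, inside relative to outside) for an ion."""
    return (R * temp_k) / (z * F) * math.log(c_out / c_in)

ca_out = 1e-3   # extracellular free Ca2+, mol/L (from the text)
ca_in = 1e-7    # cytosolic free Ca2+, mol/L (from the text)

gradient = ca_out / ca_in
e_ca = nernst_potential(ca_out, ca_in, z=2)
print(f"Ca2+ concentration ratio (outside/inside): {gradient:.0f}")
print(f"Ca2+ equilibrium potential: {e_ca * 1000:.0f} mV")
```

A strongly positive equilibrium potential, set against the negative resting potential of a typical cell, means that both the concentration difference and the membrane potential drive Ca2+ inward whenever a pathway opens.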

The Ca²⁺ pump, or Ca²⁺ ATPase, in the sarcoplasmic reticulum (SR) membrane
of skeletal muscle cells is a well-understood P-type transport ATPase. The
SR is a specialized type of endoplasmic reticulum that forms a network of tubular
sacs in the muscle cell cytoplasm, and it serves as an intracellular store of Ca²⁺.
When an action potential depolarizes the muscle cell plasma membrane, Ca²⁺ is
released into the cytosol from the SR through Ca²⁺-release channels, stimulating
the muscle to contract (discussed in Chapters 15 and 16). The Ca²⁺ pump, which
accounts for about 90% of the membrane protein of the SR, moves Ca²⁺ from the
cytosol back into the SR. The endoplasmic reticulum of nonmuscle cells contains
a similar Ca²⁺ pump, but in smaller quantities.

Enzymatic studies and analyses of the three-dimensional structures of transport
intermediates of the SR Ca²⁺ pump and related pumps have revealed the
molecular mechanism of P-type transport ATPases in great detail. They all have
similar structures, containing 10 transmembrane α helices connected to three
cytosolic domains (Figure 11-13). In the Ca²⁺ pump, amino acid side chains
protruding from the transmembrane helices form two centrally positioned binding
sites for Ca²⁺. As shown in Figure 11-14, in the pump's ATP-bound
nonphosphorylated state, these binding sites are accessible only from the cytosolic
side of the SR membrane. Ca²⁺ binding triggers a series of conformational changes
that close the passageway to the cytosol and activate a phosphotransfer reaction in
which the terminal phosphate of the ATP is transferred to an aspartate that is highly
conserved among all P-type ATPases. The ADP then dissociates and is replaced with
a fresh ATP, causing another conformational change that opens a passageway to
the SR lumen through which the two Ca²⁺ ions exit. They are replaced by two H⁺
ions and a water molecule that stabilize the empty Ca²⁺-binding sites and close
the passageway to the SR lumen. Hydrolysis of the labile phosphoryl-aspartate
bond returns the pump to the initial conformation, and the cycle starts again. The
transient self-phosphorylation of the pump during its cycle is an essential
characteristic of all P-type pumps.

The Plasma Membrane Na⁺-K⁺ Pump Establishes Na⁺ and K⁺
Gradients Across the Plasma Membrane

The concentration of K⁺ is typically 10-30 times higher inside cells than outside,
whereas the reverse is true of Na⁺ (see Table 11-1, p. 598). A Na⁺-K⁺ pump, or
Na⁺-K⁺ ATPase, found in the plasma membrane of virtually all animal cells maintains
these concentration differences. Like the Ca²⁺ pump, the Na⁺-K⁺ pump belongs
to the family of P-type ATPases and operates as an ATP-driven antiporter, actively
pumping Na⁺ out of the cell against its steep electrochemical gradient and pumping
K⁺ in (Figure 11-15).

Figure 11-13 The structure of the sarcoplasmic reticulum Ca²⁺ pump.
The ribbon model (left), derived from x-ray crystallographic analyses, shows the pump
in its phosphorylated, ATP-bound state. The three globular cytosolic domains of
the pump (the nucleotide-binding domain, dark green; the activator domain, blue;
and the phosphorylation domain, red), also shown schematically on the right, change
conformation dramatically during the pumping cycle. These changes in turn alter
the arrangement of the transmembrane helices, which allows the Ca²⁺ to be
released from its binding cavity into the SR lumen (Movie 11.3). (PDB code: 3B9B.)

Figure 11-14 The pumping cycle of the sarcoplasmic reticulum Ca²⁺ pump.
Ion pumping proceeds by a series of stepwise conformational changes in
which movements of the pump's three cytosolic domains [the nucleotide-binding
domain (N), the phosphorylation domain (P), and the activator domain (A)] are
mechanically coupled to movements of the transmembrane α helices. Helix movement
opens and closes passageways through which Ca²⁺ enters from the cytosol and
binds to the two centrally located Ca²⁺-binding sites. The two Ca²⁺ then exit
into the SR lumen and are replaced by two H⁺, which are transported in the
opposite direction. The Ca²⁺-dependent phosphorylation and H⁺-dependent
dephosphorylation of aspartic acid are universally conserved steps in the reaction
cycle of all P-type pumps: they cause the conformational transitions to occur in an
orderly manner, enabling the proteins to do useful work. (Adapted from C. Toyoshima
et al., Nature 432:361-368, 2004 and J.V. Moller et al., Q. Rev. Biophys. 43:501-566,
2010.)

We mentioned earlier that the Na⁺ gradient produced by the Na⁺-K⁺ pump
drives the transport of most nutrients into animal cells and also has a crucial role
in regulating cytosolic pH. A typical animal cell devotes almost one-third of its
energy to fueling this pump, and the pump consumes even more energy in nerve
cells and in cells that are dedicated to transport processes, such as those forming
kidney tubules.

Since the Na⁺-K⁺ pump drives three positively charged ions out of the cell for
every two it pumps in, it is electrogenic: it drives a net electric current across the
membrane, tending to create an electrical potential, with the cell's inside being
negative relative to the outside. This electrogenic effect of the pump, however,
seldom directly contributes more than 10% to the membrane potential. The remaining
90%, as we discuss later, depends only indirectly on the Na⁺-K⁺ pump.
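
The electrogenic character of the pump is a matter of charge bookkeeping, which the sketch below makes explicit. The 3 Na+ : 2 K+ stoichiometry comes from the text; the number of pumps per cell and the turnover rate are purely hypothetical values used to show how a net current would be estimated, and the function names are illustrative.

```python
ELEMENTARY_CHARGE = 1.602176634e-19  # coulombs per monovalent ion

NA_OUT_PER_CYCLE = 3  # Na+ pumped out per ATP hydrolyzed (from the text)
K_IN_PER_CYCLE = 2    # K+ pumped in per ATP hydrolyzed (from the text)

def net_charges_exported_per_cycle() -> int:
    """Net positive charges leaving the cell in one pump cycle."""
    return NA_OUT_PER_CYCLE - K_IN_PER_CYCLE

def pump_current(pumps_per_cell: float, cycles_per_second: float) -> float:
    """Net outward current (amperes) carried by all pumps in one cell."""
    charges_per_second = (pumps_per_cell * cycles_per_second
                          * net_charges_exported_per_cycle())
    return charges_per_second * ELEMENTARY_CHARGE

# Hypothetical values chosen only for illustration.
current = pump_current(pumps_per_cell=1e6, cycles_per_second=100.0)
print(f"Net charges exported per cycle: {net_charges_exported_per_cycle()}")
print(f"Net outward pump current: {current * 1e12:.1f} pA")
```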



Figure 11-15 The function of the Na⁺-K⁺ pump. This P-type ATPase
actively pumps Na⁺ out of and K⁺ into a cell against their electrochemical gradients.
It is structurally closely related to the Ca²⁺ ATPase but differs in its selectivity for
ions: for every molecule of ATP hydrolyzed by the pump, three Na⁺ are pumped out
and two K⁺ are pumped in. As in the Ca²⁺ pump, an aspartate is phosphorylated
and dephosphorylated during the pumping cycle (Movie 11.4).

ABC Transporters Constitute the Largest Family of Membrane
Transport Proteins

The last type of transport ATPase that we discuss is the family of the ABC
transporters, so named because each member contains two highly conserved ATPase
domains, or ATP-Binding "Cassettes," on the cytosolic side of the membrane.
ATP binding brings together the two ATPase domains, and ATP hydrolysis leads
to their dissociation (Figure 11-16). These movements of the cytosolic domains
are transmitted to the transmembrane segments, driving cycles of conformational
changes that alternately expose solute-binding sites on one side of the membrane
and then on the other, as we have seen for other transporters. In this way, ABC
transporters harvest the energy released upon ATP binding and hydrolysis to
drive transport of solutes across the bilayer. The transport is directional toward
inside or toward outside, depending on the particular conformational change in
the solute-binding site that is linked to ATP hydrolysis (see Figure 11-16).

ABC transporters constitute the largest family of membrane transport proteins
and are of great clinical importance. The first of these proteins to be characterized
was found in bacteria. We have already mentioned that the plasma membranes
of all bacteria contain transporters that use the H⁺ gradient across the membrane
to actively transport a variety of nutrients into the cell. In addition, bacteria use
ABC transporters to import certain small molecules. In bacteria such as E. coli
that have double membranes (Figure 11-17), the ABC transporters are located in
the inner membrane, and an auxiliary mechanism operates to capture the nutrients
and deliver them to the transporters (Figure 11-18).

In E. coli, 78 genes (an amazing 5% of the bacterium's genes) encode ABC
transporters, and animal genomes encode an even larger number. Although each
transporter is thought to be specific for a particular molecule or class of molecules,
the variety of substrates transported by this superfamily is great and includes
inorganic ions, amino acids, mono- and polysaccharides, peptides, lipids, drugs, and,
in some cases, even proteins that can be larger than the transporter itself.

Figure 11-16 Small-molecule transport by typical ABC transporters. ABC
transporters consist of multiple domains. Typically, two hydrophobic domains,
each built of six membrane-spanning α helices, together form the translocation
pathway and provide substrate specificity. Two ATPase domains protrude into the
cytosol. In some cases, the two halves of the transporter are formed by a single
polypeptide, whereas in other cases they are formed by two or more separate
polypeptides that assemble into a similar structure. Without ATP bound, the
transporter exposes a substrate-binding site on one side of the membrane. ATP
binding induces a conformational change that exposes the substrate-binding site
on the opposite side; ATP hydrolysis followed by ADP dissociation returns the
transporter to its original conformation. Most individual ABC transporters are
unidirectional. (A) A bacterial ABC transporter. Both importing and
exporting ABC transporters are found in bacteria; an ABC importer is shown in this
cartoon. The crystal structure of a bacterial ABC transporter is shown in Figure 3-76.
(B) A eukaryotic ABC transporter. In eukaryotes, most ABC transporters
export substances from the cytosol to the extracellular space, from the
cytosol to a membrane-bound intracellular compartment such as the endoplasmic
reticulum, or from the mitochondrial matrix to the cytosol.

The first eukaryotic ABC transporters identified were discovered because of
their ability to pump hydrophobic drugs out of the cytosol. One of these transporters
is the multidrug resistance (MDR) protein, also called P-glycoprotein. It is
present at elevated levels in many human cancer cells and makes the cells
simultaneously resistant to a variety of chemically unrelated cytotoxic drugs that are
widely used in cancer chemotherapy. Treatment with any one of these drugs can
result in the selective survival and overgrowth of those cancer cells that express an
especially large amount of the MDR transporter. These cells pump drugs out of the
cell very efficiently and are therefore relatively resistant to the drugs' toxic effects
(Movie 11.5). Selection for cancer cells with resistance to one drug can thereby
lead to resistance to a wide variety of anticancer drugs. Some studies indicate that
up to 40% of human cancers develop multidrug resistance, making it a major hurdle
in the battle against cancer.

A related and equally sinister phenomenon occurs in the protist Plasmodium
falciparum, which causes malaria. More than 200 million people are infected
worldwide with this parasite, which remains a major cause of human death,
killing almost a million people every year. The development of resistance to the
antimalarial drug chloroquine has hampered the control of malaria. The resistant
P. falciparum have amplified a gene encoding an ABC transporter that pumps out
the chloroquine.


Figure 11-17 A small section of the double membrane of an E. coli
bacterium. The inner membrane is the cell's plasma membrane. Between
the inner and outer membranes is a highly porous, rigid peptidoglycan layer,
composed of protein and polysaccharide that constitute the bacterial cell wall. It is
attached to lipoprotein molecules in the outer membrane and fills the periplasmic
space (only a little of the peptidoglycan layer is shown). This space also contains
a variety of soluble protein molecules. The dashed threads (shown in green) at the
top represent the polysaccharide chains of the special lipopolysaccharide molecules
that form the external monolayer of the outer membrane; for clarity, only a few
of these chains are shown. Bacteria with double membranes are called Gram-negative
because they do not retain the dark blue dye used in Gram staining.
Bacteria with single membranes (but thicker peptidoglycan cell walls), such as
staphylococci and streptococci, retain the blue dye and are therefore called
Gram-positive; their single membrane is analogous to the inner (plasma) membrane
of Gram-negative bacteria.

Figure 11-18 The auxiliary transport system associated with transport
ATPases in bacteria with double membranes. The solute diffuses through
channel proteins (porins) in the outer membrane and binds to a periplasmic
substrate-binding protein that delivers it to the ABC transporter, which pumps it across
the plasma membrane. The peptidoglycan is omitted for simplicity; its porous structure
allows the substrate-binding proteins and water-soluble solutes to move through it by
diffusion.

In most vertebrate cells, an ABC transporter in the endoplasmic reticulum
(ER) membrane (named transporter associated with antigen processing, or TAP
transporter) actively pumps a wide variety of peptides from the cytosol into the
ER lumen. These peptides are produced by protein degradation in proteasomes
(discussed in Chapter 6). They are carried from the ER to the cell surface, where
they are displayed for scrutiny by cytotoxic T lymphocytes, which kill the cell if the
peptides are derived from a virus or other microorganism lurking in the cytosol of
an infected cell (discussed in Chapter 24).

Yet another member of the ABC transporter family is the cystic fibrosis
transmembrane conductance regulator protein (CFTR), which was discovered through
studies of the common genetic disease cystic fibrosis. This disease is caused by
a mutation in the gene encoding CFTR, a Cl⁻ transport protein in the plasma
membrane of epithelial cells. CFTR regulates ion concentrations in the extracellular
fluid, especially in the lung. One in 27 Caucasians carries a gene encoding
a mutant form of this protein; in 1 in 2900, both copies of the gene are mutated,
causing the disease. In contrast to other ABC transporters, ATP binding and
hydrolysis in the CFTR protein do not drive the transport process. Instead, they
control the opening and closing of a continuous channel, which provides a passive
conduit for Cl⁻ to move down its electrochemical gradient. Thus, some ABC
proteins can function as transporters and others as gated channels.
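
The two frequencies quoted for cystic fibrosis (1 carrier in 27, 1 affected individual in about 2900) are consistent with simple recessive inheritance, as the short check below suggests. It assumes random mating and that a child of two carriers has a one-in-four chance of inheriting two mutant copies; these are textbook genetic assumptions, not statements from this section, and the function name is hypothetical.

```python
from fractions import Fraction

def affected_frequency(carrier_frequency: Fraction) -> Fraction:
    """Expected fraction of births with two mutant copies of the gene."""
    both_parents_carriers = carrier_frequency ** 2
    # Two carrier parents pass on both mutant copies one time in four.
    return both_parents_carriers * Fraction(1, 4)

carrier = Fraction(1, 27)  # carrier frequency quoted in the text
affected = affected_frequency(carrier)
print(f"Expected frequency of affected individuals: 1 in {affected.denominator}")
```

The result, 1 in 2916, rounds to the figure of 1 in 2900 given above.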

Summary 

Transporters bind specific solutes and transfer them across the lipid bilayer by
undergoing conformational changes that alternately expose the solute-binding site
on one side of the membrane and then on the other. Some transporters move a single
solute "downhill," whereas others can act as pumps to move a solute "uphill"
against its electrochemical gradient, using energy provided by ATP hydrolysis, by a
downhill flow of another solute (such as Na⁺ or H⁺), or by light to drive the requisite
series of conformational changes in an orderly manner. Transporters belong to a
small number of protein families. Each family evolved from a common ancestral
protein, and its members all operate by a similar mechanism. The family of P-type
transport ATPases, which includes Ca²⁺ and Na⁺-K⁺ pumps, is an important example;
each of these ATPases sequentially phosphorylates and dephosphorylates itself
during the pumping cycle. The superfamily of ABC transporters is the largest family
of membrane transport proteins and is especially important clinically. It includes
proteins that are responsible for cystic fibrosis, for drug resistance in both cancer
cells and malaria-causing parasites, and for pumping pathogen-derived peptides
into the ER for cytotoxic lymphocytes to recognize on the surface of infected cells.

CHANNELS AND THE ELECTRICAL PROPERTIES 
OF MEMBRANES 

Unlike transporters, channels form pores across membranes. One class of channel
proteins found in virtually all animals forms gap junctions between adjacent
cells; each plasma membrane contributes equally to the formation of the channel,
which connects the cytoplasm of the two cells. These channels are discussed
in Chapter 19 and will not be considered further here. Both gap junctions and
porins, the channels in the outer membranes of bacteria, mitochondria, and
chloroplasts (discussed in Chapter 10), have relatively large and permissive pores, and
it would be disastrous if they directly connected the inside of a cell to an
extracellular space. Indeed, many bacterial toxins do exactly that to kill other cells
(discussed in Chapter 24).

In contrast, most channels in the plasma membrane of animal and plant cells
that connect the cytosol to the cell exterior necessarily have narrow, highly selective
pores that can open and close rapidly. Because these proteins are concerned
specifically with inorganic ion transport, they are referred to as ion channels. For
transport efficiency, ion channels have an advantage over transporters, in that
they can pass up to 100 million ions through one open channel each second, a
rate 10⁵ times greater than the fastest rate of transport mediated by any known
transporter. As discussed earlier, however, channels cannot be coupled to an
energy source to perform active transport, so the transport they mediate is always
passive (downhill). Thus, the function of ion channels is to allow specific inorganic
ions (primarily Na⁺, K⁺, Ca²⁺, or Cl⁻) to diffuse rapidly down their electrochemical
gradients across the lipid bilayer. In this section, we will see that the ability
to control ion fluxes through these channels is essential for many cell functions.
Nerve cells (neurons), in particular, have made a specialty of using ion channels,
and we will consider how they use many different ion channels to receive, conduct,
and transmit signals. Before we discuss ion channels, however, we briefly
consider the aquaporin water channels that we mentioned earlier.


Aquaporins Are Permeable to Water But Impermeable to Ions

Because cells are mostly water (typically ~70% by weight), water movement across
cell membranes is fundamentally important for life. Cells also contain a high
concentration of solutes, including numerous negatively charged organic molecules
that are confined inside the cell (the so-called fixed anions) and their
accompanying cations that are required for charge balance. This creates an osmotic
gradient, which mostly is balanced by an opposite osmotic gradient due to a high
concentration of inorganic ions (chiefly Na+ and Cl-) in the extracellular fluid.
The small remaining osmotic force tends to "pull" water into the cell, causing it to
swell until the forces are balanced. Because all biological membranes are
moderately permeable to water (see Figure 11-2), cell volume equilibrates in minutes or
less in response to an osmotic gradient. For most animal cells, however, osmosis
has only a minor role in regulating cell volume. This is because most of the
cytoplasm is in a gel-like state and resists large changes in its volume in response to
changes in osmolarity.

In addition to the direct diffusion of water across the lipid bilayer, some
prokaryotic and eukaryotic cells have water channels, or aquaporins, embedded in
their plasma membrane to allow water to move more rapidly. Aquaporins are
particularly abundant in animal cells that must transport water at high rates, such as
the epithelial cells of the kidney or exocrine cells that must transport or secrete
large volumes of fluids, respectively (Figure 11-19).

Figure 11-19 The role of aquaporins in fluid secretion. Cells lining the ducts of
exocrine glands (as found, for example, in the pancreas and liver, and in mammary,
sweat, and salivary glands) secrete large volumes of body fluids. These cells are
organized into epithelial sheets in which their apical plasma membrane faces the
lumen of the duct. Ion pumps and channels situated in the basolateral and apical
plasma membrane move ions (mostly Na+ and Cl-) into the ductal lumen,
creating an osmotic gradient between the surrounding tissue and the duct. Water
molecules rapidly follow the osmotic gradient through aquaporins that are
present in high concentrations in both the apical and basolateral membranes.

Aquaporins must solve a problem that is opposite to that facing ion channels.
To avoid disrupting ion gradients across membranes, they have to allow the rapid
passage of water molecules while completely blocking the passage of ions. The
three-dimensional structure of an aquaporin reveals how it achieves this
remarkable selectivity. The channels have a narrow pore that allows water molecules to
traverse the membrane in single file, following the path of carbonyl oxygens that
line one side of the pore (Figure 11-20A and B). Hydrophobic amino acids line
the other side of the pore. The pore is too narrow for any hydrated ion to enter, and
the energy cost of dehydrating an ion would be enormous because the hydrophobic
wall of the pore cannot interact with a dehydrated ion to compensate for the
loss of water. This design readily explains why the aquaporins cannot conduct K+,
Na+, Ca2+, or Cl- ions. These channels are also impermeable to H+, which is mainly
present in cells as H3O+. These hydronium ions diffuse through water extremely
rapidly, using a molecular relay mechanism that requires the making and breaking
of hydrogen bonds between adjacent water molecules (Figure 11-20C).
Aquaporins contain two strategically placed asparagines, which bind to the oxygen atom
of the central water molecule in the line of water molecules traversing the pore,
imposing a bipolarity on the entire column of water molecules (Figure 11-20C and
D). This makes it impossible for the "making and breaking" sequence of hydrogen
bonds (shown in Figure 11-20C) to get past the central asparagine-bonded water
molecule. Because both valences of this central oxygen are unavailable for
hydrogen-bonding, the central water molecule cannot participate in an H+ relay, and
the pore is therefore impermeable to H+.

We now turn to ion channels, the subject of the rest of the chapter. 


Figure 11-20 The structure of aquaporins. (A) A ribbon diagram of an
aquaporin monomer. In the membrane, aquaporins form tetramers, with each
monomer containing an aqueous pore in its center (not shown). Each individual
aquaporin channel passes about 10^9 water molecules per second. (B) A
longitudinal cross section through one aquaporin monomer, in the plane of the
central pore. One face of the pore is lined with hydrophilic amino acids, which
provide transient hydrogen bonds to water molecules; these bonds help line up the
transiting water molecules in a single row and orient them as they traverse the
pore. (C and D) A model explaining why aquaporins are impermeable to H+.
(C) In water, H+ diffuses extremely rapidly by being relayed from one water molecule
to the next. (D) Carbonyl groups (C=O) lining the hydrophilic face of the pore align
water molecules, and two strategically placed asparagines in the center help
tether a central water molecule such that both valences on its oxygen are occupied.
This arrangement bipolarizes the entire line of water molecules, with each water
molecule acting as a hydrogen-bond acceptor from its inner neighbor (Movie
11.6). (A and B, adapted from R.M. Stroud et al., Curr. Opin. Struct. Biol. 13:424-431,
2003. With permission from Elsevier.)


Ion Channels Are Ion-Selective and Fluctuate Between Open and
Closed States


Two important properties distinguish ion channels from aqueous pores. First,
they show ion selectivity, permitting some inorganic ions to pass, but not others.
This suggests that their pores must be narrow enough in places to force
permeating ions into intimate contact with the walls of the channel so that only ions of
appropriate size and charge can pass. The permeating ions have to shed most or
all of their associated water molecules to pass, often in single file, through the
narrowest part of the channel, which is called the selectivity filter; this limits their rate
of passage (Figure 11-21). Thus, as the ion concentration increases, the flux of the
ion through a channel increases proportionally but then levels off (saturates) at a
maximum rate.
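
The saturation described above can be illustrated with a small numerical sketch. It is not taken from the text: it assumes, purely for illustration, that flux follows a hyperbolic (single-binding-site) form, and the names `j_max` and `k_half` and their default values are hypothetical. The maximum rate is set to the 100 million ions per second quoted earlier in this section.

```python
def channel_flux(conc_mM, j_max=1.0e8, k_half=100.0):
    """Ion flux (ions per second) through one open channel.

    Assumed illustrative model: flux = j_max * C / (k_half + C).
    j_max  -- maximum rate in ions/s (text: up to 100 million ions/s)
    k_half -- hypothetical concentration (mM) giving half-maximal flux
    """
    return j_max * conc_mM / (k_half + conc_mM)


if __name__ == "__main__":
    # At low concentration the flux rises almost proportionally;
    # at high concentration it approaches j_max.
    for c in (1, 10, 100, 1000, 10000):
        print(f"{c:>6} mM -> {channel_flux(c):.3e} ions/s")
```

In this sketch, doubling a low concentration nearly doubles the flux, whereas at concentrations far above `k_half` the flux barely changes, which is the qualitative behavior the paragraph describes.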


[Figure 11-21 labels: lipid bilayer; selectivity filter; gate; CLOSED; OPEN]

Figure 11-21 A typical ion channel, which fluctuates between closed and
open conformations. The ion channel shown here in cross section forms a pore
across the lipid bilayer only in the "open" conformational state. The pore narrows
to atomic dimensions in one region (the selectivity filter), where the ion selectivity of
the channel is largely determined. Another region of the channel forms the gate.
Figure 11-22 The gating of ion channels. This schematic drawing shows several kinds of stimuli
that open ion channels. Mechanically gated channels often have cytoplasmic extensions (not
shown) that link the channel to the cytoskeleton.


The second important distinction between ion channels and aqueous pores is
that ion channels are not continuously open. Instead, they are gated, which allows
them to open briefly and then close again. Moreover, with prolonged (chemical or
electrical) stimulation, most ion channels go into a closed "desensitized," or
"inactivated," state, in which they are refractory to further opening until the stimulus
has been removed, as we discuss later. In most cases, the gate opens in response
to a specific stimulus. As shown in Figure 11-22, the main types of stimuli that are
known to cause ion channels to open are a change in the voltage across the
membrane (voltage-gated channels), a mechanical stress (mechanically gated
channels), or the binding of a ligand (ligand-gated channels). The ligand can be either
an extracellular mediator, specifically a neurotransmitter (transmitter-gated
channels), or an intracellular mediator such as an ion (ion-gated channels) or
a nucleotide (nucleotide-gated channels). In addition, protein phosphorylation
and dephosphorylation regulate the activity of many ion channels; this type of
channel regulation is discussed, together with nucleotide-gated ion channels, in
Chapter 15.
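
One common way to picture a gated channel that flickers between closed and open conformations is a two-state random process. The sketch below is an illustration only, not a model given in the text: the rate constants `k_open` and `k_close` are hypothetical, and a stimulus is imagined simply as something that raises `k_open`.

```python
import random


def simulate_gating(k_open, k_close, dt, n_steps, seed=0):
    """Simulate one channel flipping between closed (0) and open (1).

    k_open  -- assumed closed-to-open rate (per ms)
    k_close -- assumed open-to-closed rate (per ms)
    dt      -- time step (ms); choose it so that rate * dt is much less than 1
    Returns the fraction of time steps spent in the open state.
    """
    rng = random.Random(seed)
    state = 0  # start closed
    open_steps = 0
    for _ in range(n_steps):
        if state == 0 and rng.random() < k_open * dt:
            state = 1
        elif state == 1 and rng.random() < k_close * dt:
            state = 0
        open_steps += state
    return open_steps / n_steps


def steady_state_open_probability(k_open, k_close):
    """Long-run open probability of the two-state scheme."""
    return k_open / (k_open + k_close)


if __name__ == "__main__":
    # Hypothetical rates: "resting" versus "stimulated" (larger k_open).
    for k_open in (0.05, 0.5, 5.0):
        simulated = simulate_gating(k_open, 1.5, dt=0.01, n_steps=200_000)
        expected = steady_state_open_probability(k_open, 1.5)
        print(f"k_open={k_open}: simulated {simulated:.3f}, expected {expected:.3f}")
```

The sketch deliberately omits the desensitized or inactivated state mentioned above; representing it would require a third state that the channel enters during prolonged stimulation.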

More than 100 types of ion channels have been identified thus far, and new
ones are still being discovered, each characterized by the ions it conducts, the
mechanism by which it is gated, and its abundance and localization in the cell and
in specific cells. Ion channels are responsible for the electrical excitability of
muscle cells, and they mediate most forms of electrical signaling in the nervous
system. A single neuron typically contains 10 or more kinds of ion channels, located
in different domains of its plasma membrane. But ion channels are not restricted
to electrically excitable cells. They are present in all animal cells and are found in
plant cells and microorganisms: they propagate the leaf-closing response of the
mimosa plant, for example (Movie 11.7), and allow the single-celled Paramecium
to reverse direction after a collision.

Ion channels that are permeable mainly to K+ are found in the plasma
membrane of almost all cells. An important subset of K+ channels opens even in an
unstimulated or "resting" cell, and hence these are called K+ leak channels.
Although this term applies to many different K+ channels, depending on the cell
type, they serve a common purpose: by making the plasma membrane much
more permeable to K+ than to other ions, they have a crucial role in maintaining
the membrane potential across all plasma membranes, as we discuss next.

The Membrane Potential in Animal Cells Depends Mainly on K+
Leak Channels and the K+ Gradient Across the Plasma Membrane

A membrane potential arises when there is a difference in the electrical charge
on the two sides of a membrane, due to a slight excess of positive ions over
negative ones on one side and a slight deficit on the other. Such charge differences
can result both from active electrogenic pumping (see p. 608) and from passive
ion diffusion. As we discuss in Chapter 14, electrogenic H+ pumps in the
mitochondrial inner membrane generate most of the membrane potential across this
membrane. Electrogenic pumps also generate most of the electrical potential
across the plasma membrane in plants and fungi. In typical animal cells, however,
passive ion movements make the largest contribution to the electrical potential
across the plasma membrane.

As explained earlier, due to the action of the Na+-K+ pump, there is little Na+
inside the cell, and other intracellular inorganic cations have to be plentiful
enough to balance the charge carried by the cell's fixed anions, the negatively
charged organic molecules that are confined inside the cell. This balancing role
is performed largely by K+, which is actively pumped into the cell by the
Na+-K+ pump and can also move freely in or out through the K+ leak channels in the
plasma membrane. Because of the presence of these channels, K+ comes almost
to equilibrium, where an electrical force exerted by an excess of negative charges
attracting K+ into the cell balances the tendency of K+ to leak out down its
concentration gradient. The membrane potential (of the plasma membrane) is the
manifestation of this electrical force, and we can calculate its equilibrium value
from the steepness of the K+ concentration gradient. The following argument may
help to make this clear.

Suppose that initially there is no voltage gradient across the plasma membrane
(the membrane potential is zero) but the concentration of K+ is high inside the cell
and low outside. K+ will tend to leave the cell through the K+ leak channels, driven
by its concentration gradient. As K+ begins to move out, each ion leaves behind
an unbalanced negative charge, thereby creating an electrical field, or membrane
potential, which will tend to oppose the further efflux of K+. The net efflux of K+
halts when the membrane potential reaches a value at which this electrical
driving force on K+ exactly balances the effect of its concentration gradient; that is,
when the electrochemical gradient for K+ is zero. Although Cl- ions also
equilibrate across the membrane, the membrane potential keeps most of these ions out
of the cell because their charge is negative.

The equilibrium condition, in which there is no net flow of ions across the
plasma membrane, defines the resting membrane potential for this idealized cell.
A simple but very important formula, the Nernst equation, quantifies the
equilibrium condition and, as explained in Panel 11-1, makes it possible to calculate
the theoretical resting membrane potential if we know the ratio of internal and
external ion concentrations. As the plasma membrane of a real cell is not
exclusively permeable to K+ and Cl-, however, the actual resting membrane potential
is usually not exactly equal to that predicted by the Nernst equation for K+ or Cl-.
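
As a quick numerical check of this idea, the Nernst equation from Panel 11-1, V = (RT/zF) ln(Co/Ci), can be evaluated directly. The sketch below is illustrative: the function name `nernst_potential_mV` is our own, the physical constants are standard values, and the K+ concentrations (5 mM outside, 140 mM inside) are the "typical cell" values quoted in Panel 11-1.

```python
import math

R = 8.314    # gas constant, J mol^-1 K^-1
F = 96485.0  # Faraday's constant, J V^-1 mol^-1


def nernst_potential_mV(c_out, c_in, z=1, temp_celsius=37.0):
    """Equilibrium potential (inside minus outside) in millivolts.

    c_out, c_in -- outside and inside concentrations (same units)
    z           -- valence of the ion (+1 for K+ or Na+, -1 for Cl-)
    """
    temp_kelvin = temp_celsius + 273.15
    return 1000.0 * (R * temp_kelvin) / (z * F) * math.log(c_out / c_in)


if __name__ == "__main__":
    # K+ in a typical cell: roughly -89 mV at 37 degrees C.
    print(f"V_K = {nernst_potential_mV(5.0, 140.0):.1f} mV")
    # A tenfold outside/inside ratio of a univalent cation: about +61.5 mV.
    print(f"tenfold ratio = {nernst_potential_mV(10.0, 1.0):.1f} mV")
```

Because only the ratio of concentrations enters the logarithm, the two concentrations can be given in any common unit.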

The Resting Potential Decays Only Slowly When the
Na+-K+ Pump Is Stopped

Movement of only a minute number of inorganic ions across the plasma
membrane through ion channels suffices to set up the membrane potential. Thus, we
can think of the membrane potential as arising from movements of charge that
leave ion concentrations practically unaffected and result in only a very slight
discrepancy in the number of positive and negative ions on the two sides of the
membrane (Figure 11-23). Moreover, these movements of charge are generally
rapid, taking only a few milliseconds or less.
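
A back-of-the-envelope calculation shows how small these ion movements are. The sketch below uses the figure quoted in Figure 11-23 (about 1 microcoulomb of charge per square centimeter shifts the potential by roughly 1 V) and treats the cell as a sphere; the function names, the 10-μm diameter, and the 140 mM K+ concentration are illustrative assumptions, and the result is only an order-of-magnitude estimate.

```python
import math

ELEMENTARY_CHARGE = 1.602e-19  # coulombs per monovalent ion
AVOGADRO = 6.022e23            # ions per mole
CHARGE_PER_CM2_PER_VOLT = 1.0e-6  # C per cm^2 per volt (from Figure 11-23)


def ions_per_um2(delta_v):
    """Monovalent ions crossing 1 um^2 of membrane to shift the potential by delta_v volts."""
    ions_per_cm2 = CHARGE_PER_CM2_PER_VOLT * delta_v / ELEMENTARY_CHARGE
    return ions_per_cm2 / 1.0e8  # 1 cm^2 = 1e8 um^2


def fraction_of_cell_ions(delta_v, diameter_um, conc_mM):
    """Ions moved, as a fraction of all such ions in a spherical cell."""
    area_um2 = math.pi * diameter_um ** 2
    volume_litres = (math.pi * diameter_um ** 3 / 6.0) * 1.0e-15  # 1 um^3 = 1e-15 L
    ions_moved = ions_per_um2(delta_v) * area_um2
    ions_total = conc_mM * 1.0e-3 * volume_litres * AVOGADRO
    return ions_moved / ions_total


if __name__ == "__main__":
    print(f"ions per um^2 for 100 mV: {ions_per_um2(0.1):.0f}")
    print(f"fraction of K+ moved: {fraction_of_cell_ions(0.1, 10.0, 140.0):.1e}")
```

With these assumptions, roughly 6000 ions per square micrometer suffice for a 100 mV shift, and the ions moved amount to only a few parts in 100,000 of the cell's K+, so the bulk concentration is essentially unchanged.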

Consider the change in the membrane potential in a real cell after the sudden
inactivation of the Na+-K+ pump. A slight drop in the membrane potential occurs
immediately. This is because the pump is electrogenic and, when active, makes a
PANEL 11-1: The Derivation of the Nernst Equation

THE NERNST EQUATION AND ION FLOW

The flow of any inorganic ion through a membrane channel is driven by the
electrochemical gradient for that ion. This gradient represents the combination of two
influences: the voltage gradient and the concentration gradient of the ion across the
membrane. When these two influences just balance each other, the electrochemical
gradient for the ion is zero, and there is no net flow of the ion through the channel.
The voltage gradient (membrane potential) at which this equilibrium is reached is
called the equilibrium potential for the ion. It can be calculated from an equation
that will be derived below, called the Nernst equation.

The Nernst equation is

$$V = \frac{RT}{zF} \ln \frac{C_o}{C_i}$$

where

$V$ = the equilibrium potential in volts (internal potential minus external potential)
$C_o$ and $C_i$ = outside and inside concentrations of the ion, respectively
$R$ = the gas constant (8.3 J mol^-1 K^-1)
$T$ = the absolute temperature (K)
$F$ = Faraday's constant (9.6 × 10^4 J V^-1 mol^-1)
$z$ = the valence (charge) of the ion
$\ln$ = logarithm to the base e

The Nernst equation is derived as follows:

A molecule in solution (a solute) tends to move from a region of high concentration
to a region of low concentration simply due to the random movement of molecules,
which results in their equilibrium. Consequently, movement down a concentration
gradient is accompanied by a favorable free-energy change ($\Delta G < 0$), whereas
movement up a concentration gradient is accompanied by an unfavorable free-energy
change ($\Delta G > 0$). (Free energy is introduced in Chapter 2 and discussed in the
context of redox reactions in Panel 14-1, p. 765.)

The free-energy change per mole of solute moved across the plasma membrane
($\Delta G_{conc}$) is equal to $-RT \ln (C_o / C_i)$.

If the solute is an ion, moving it into a cell across a membrane whose inside is at a
voltage $V$ relative to the outside will cause an additional free-energy change (per
mole of solute moved) of $\Delta G_{volt} = zFV$.

At the point where the concentration and voltage gradients just balance,

$$\Delta G_{conc} + \Delta G_{volt} = 0$$

and the ion distribution is at equilibrium across the membrane.

Thus,

$$zFV - RT \ln \frac{C_o}{C_i} = 0$$

and, therefore,

$$V = \frac{RT}{zF} \ln \frac{C_o}{C_i}$$

or, using the constant that converts natural logarithms to base 10,

$$V = 2.3 \, \frac{RT}{zF} \log_{10} \frac{C_o}{C_i}$$

For a univalent cation,

$$2.3 \, \frac{RT}{F} = 58 \text{ mV at } 20^\circ\text{C and } 61.5 \text{ mV at } 37^\circ\text{C}.$$

Thus, for such an ion at 37°C,

$V = +61.5$ mV for $C_o / C_i = 10$,

whereas

$V = 0$ for $C_o / C_i = 1$.

The K+ equilibrium potential ($V_K$), for example, is
$61.5 \log_{10} ([K^+]_o / [K^+]_i)$ millivolts
(-89 mV for a typical cell, where $[K^+]_o$ = 5 mM and $[K^+]_i$ = 140 mM).

At $V_K$, there is no net flow of K+ across the membrane.

Similarly, when the membrane potential has a value of
$61.5 \log_{10} ([Na^+]_o / [Na^+]_i)$, the Na+ equilibrium potential ($V_{Na}$),
there is no net flow of Na+.

For any particular membrane potential, $V_M$, the net force tending to drive a
particular type of ion out of the cell is proportional to the difference between $V_M$
and the equilibrium potential for the ion: hence,

for K+ it is $V_M - V_K$
and for Na+ it is $V_M - V_{Na}$.

When there is a voltage gradient across the membrane, the ions responsible for it
(the positive ions on one side and the negative ions on the other) are concentrated in
thin layers on either side of the membrane because of the attraction between positive
and negative electric charges. The number of ions that go to form the layer of charge
adjacent to the membrane is minute compared with the total number inside the cell.
For example, the movement of 6000 Na+ ions across 1 μm^2 of membrane will carry
sufficient charge to shift the membrane potential by about 100 mV.

Because there are about 3 × 10^7 Na+ ions in a typical cell (1 μm^3 of bulk
cytoplasm), such a movement of charge will generally have a negligible effect on the
ion concentration gradients across the membrane.
[Figure 11-23 panel labels: exact balance of charges on each side of the membrane;
membrane potential = 0. A few of the positive ions (red) cross the membrane from
right to left, leaving their negative counterions (red) behind; this sets up a
nonzero membrane potential.]


small direct contribution to the membrane potential by pumping out three Na+
for every two K+ that it pumps in (see Figure 11-15). However, switching off the
pump does not abolish the major component of the resting potential, which is
generated by the K+ equilibrium mechanism just described. This component of
the membrane potential persists as long as the Na+ concentration inside the cell
stays low and the K+ ion concentration high, typically for many minutes. But the
plasma membrane is somewhat permeable to all small ions, including Na+.
Therefore, without the Na+-K+ pump, the ion gradients set up by the pump will
eventually run down, and the membrane potential established by diffusion through the
K+ leak channels will fall as well. As Na+ enters, the cell eventually comes to a new
resting state where Na+, K+, and Cl- are all at equilibrium across the membrane.
The membrane potential in this state is much less than it was in the normal cell
with an active Na+-K+ pump.

The resting potential of an animal cell varies between -20 mV and -120 mV,
depending on the organism and cell type. Although the K+ gradient always has a
major influence on this potential, the gradients of other ions (and the
disequilibrating effects of ion pumps) also have a significant effect: the more permeable
the membrane for a given ion, the more strongly the membrane potential tends to
be driven toward the equilibrium value for that ion. Consequently, changes in a
membrane's permeability to ions can cause significant changes in the membrane
potential. This is one of the key principles relating the electrical excitability of cells
to the activities of ion channels.
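
The dependence of the membrane potential on relative permeabilities can be made concrete with the Goldman-Hodgkin-Katz (GHK) voltage equation. This equation is not given in the text above; it is brought in here as a standard way to weight each ion's gradient by its permeability. In the sketch below, the K+ concentrations are the Panel 11-1 values, whereas the Na+ and Cl- concentrations and all relative permeabilities are hypothetical values chosen only for illustration.

```python
import math

R = 8.314    # gas constant, J mol^-1 K^-1
F = 96485.0  # Faraday's constant, J V^-1 mol^-1

# Concentrations in mM. K+ values follow Panel 11-1; Na+ and Cl- are assumed.
CONC_OUT = {"K": 5.0, "Na": 145.0, "Cl": 110.0}
CONC_IN = {"K": 140.0, "Na": 10.0, "Cl": 10.0}


def ghk_potential_mV(perm, conc_out=CONC_OUT, conc_in=CONC_IN, temp_celsius=37.0):
    """Membrane potential (inside minus outside) from the GHK voltage equation.

    perm -- relative permeabilities keyed by "K", "Na", "Cl" (hypothetical values).
    Cl- is an anion, so its inside and outside concentrations swap places.
    """
    temp_kelvin = temp_celsius + 273.15
    numerator = (perm["K"] * conc_out["K"] + perm["Na"] * conc_out["Na"]
                 + perm["Cl"] * conc_in["Cl"])
    denominator = (perm["K"] * conc_in["K"] + perm["Na"] * conc_in["Na"]
                   + perm["Cl"] * conc_out["Cl"])
    return 1000.0 * (R * temp_kelvin / F) * math.log(numerator / denominator)


if __name__ == "__main__":
    cases = {
        "K+ only": {"K": 1.0, "Na": 0.0, "Cl": 0.0},
        "mostly K+": {"K": 1.0, "Na": 0.05, "Cl": 0.45},
        "Na+ channels open": {"K": 1.0, "Na": 20.0, "Cl": 0.45},
    }
    for label, perm in cases.items():
        print(f"{label}: {ghk_potential_mV(perm):.1f} mV")
```

When only K+ is permeant, the result reduces to the Nernst potential for K+; raising the Na+ permeability pulls the potential toward the Na+ equilibrium value, which is the principle stated in the paragraph above.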

To understand how ion channels select their ions and how they open and
close, we need to know their atomic structure. The first ion channel to be
crystallized and studied by x-ray diffraction was a bacterial K+ channel. The details of its
structure revolutionized our understanding of ion channels.

The Three-Dimensional Structure of a Bacterial K+ Channel Shows
How an Ion Channel Can Work

Scientists were puzzled by the remarkable ability of ion channels to combine
exquisite ion selectivity with a high conductance. K+ leak channels, for example,
conduct K+ 10,000-fold faster than Na+, yet the two ions are both featureless
spheres and have similar diameters (0.133 nm and 0.095 nm, respectively). A single
amino acid substitution in the pore of an animal cell K+ channel can result in
a loss of ion selectivity and cell death. We cannot explain the normal K+ selectivity
by pore size, because Na+ is smaller than K+. Moreover, the high conductance rate
is incompatible with the channel's having selective, high-affinity K+-binding sites,
as the binding of K+ ions to such sites would greatly slow their passage.

The puzzle was solved when the structure of a bacterial K+ channel was
determined by x-ray crystallography. The channel is made from four identical
transmembrane subunits, which together form a central pore through the membrane.
Each subunit contributes two transmembrane α helices, which are tilted outward
in the membrane and together form a cone, with its wide end facing the outside of
the cell where K+ ions exit from the channel (Figure 11-24). The polypeptide chain
that connects the two transmembrane helices forms a short α helix (the pore helix)
and a crucial loop that protrudes into the wide section of the cone to form the
selectivity filter. The selectivity loops from the four subunits form a short, rigid,
narrow pore, which is lined by the carbonyl oxygen atoms of their polypeptide
backbones. Because the selectivity loops of all known K+ channels have similar
amino acid sequences, it is likely that they form a closely similar structure.

Figure 11-23 The ionic basis of a membrane potential. A small flow of
inorganic ions through an ion channel carries sufficient charge to cause a large
change in the membrane potential. The ions that give rise to the membrane
potential lie in a thin (< 1 nm) surface layer close to the membrane, held there by
their electrical attraction to their oppositely charged counterparts (counterions) on the
other side of the membrane. For a typical cell, 1 microcoulomb of charge (6 × 10^12
monovalent ions) per square centimeter of membrane, transferred from one side of
the membrane to the other, changes the membrane potential by roughly 1 V. This
means, for example, that in a spherical cell of diameter 10 μm, the number of K+ ions
that have to flow out to alter the membrane potential by 100 mV is only about
1/100,000 of the total number of K+ ions in the cytosol. This amount is so minute that
the intracellular K+ concentration remains virtually unchanged.

Figure 11-24 The structure of a bacterial K+ channel. (A) The transmembrane α helices from only two of the four identical
subunits are shown. From the cytosolic side, the pore (schematically shaded in blue) opens up into a vestibule in the middle of
the membrane. The pore vestibule facilitates transport by allowing the K+ ions to remain hydrated even though they are more
than halfway across the membrane. The narrow selectivity filter of the pore links the vestibule to the outside of the cell. Carbonyl
oxygens line the walls of the selectivity filter and form transient binding sites for dehydrated K+ ions. Two K+ ions occupy different
sites in the selectivity filter, while a third K+ ion is located in the center of the vestibule, where it is stabilized by electrical interactions
with the more negatively charged ends of the pore helices. The ends of the four short "pore helices" (only two of which are shown)
point precisely toward the center of the vestibule, thereby guiding K+ ions into the selectivity filter (Movie 11.8). (B) Peptide bonds
have an electric dipole, with more negative charge accumulated at the oxygen of the C=O bond and at the nitrogen of the
N-H bond. In an α helix, hydrogen bonds (red) align the dipoles. As a consequence, every α helix has an electric dipole along its
axis, resulting from summation of the dipoles of the individual peptide bonds, with a more negatively charged C-terminal end (δ-)
and a more positively charged N-terminal end (δ+). (A, adapted from D.A. Doyle et al., Science 280:69-77, 1998.)

The structure of the selectivity filter explains the ion selectivity of the channel.
A K+ ion must lose almost all of its bound water molecules to enter the filter, where
it interacts instead with the carbonyl oxygens lining the filter; the oxygens are
rigidly spaced at the exact distance to accommodate a K+ ion. A Na+ ion, in contrast,
cannot enter the filter because the carbonyl oxygens are too far away from the
smaller Na+ ion to compensate for the energy expense associated with the loss of
water molecules required for entry (Figure 11-25).

Structural studies of K+ channels and other ion channels have also indicated
some general principles of how these channels open and close. The gating involves
movement of the helices in the membrane so that they either obstruct or open the
path for ion movement. Depending on the particular type of channel, helices tilt,
rotate, or bend during gating. The structure of a closed K+ channel shows that by
tilting the inner helices, the pore constricts like a diaphragm at its cytosolic end
(Figure 11-26). Bulky hydrophobic amino acid side chains block the small
opening that remains, preventing the entry of ions.

Many other ion channels operate on similar principles: the channel's gating
helices are allosterically coupled to domains that form the ion-conducting
pathway; and a conformational change in the gate (in response, say, to ligand binding
or altered membrane potential) brings about conformational change in the
conducting pathway, either opening it or blocking it off.

Figure 11-25 K+ specificity of the selectivity filter in a K+ channel. The
drawings show K+ and Na+ ions (A) in the vestibule and (B) in the selectivity filter of
the pore, viewed in cross section. In the vestibule, the ions are hydrated. In the
selectivity filter, they have lost their water, and the carbonyl oxygens are placed to
accommodate a dehydrated K+ ion. The dehydration of the K+ ion requires energy,
which is precisely balanced by the energy regained by the interaction of the ion with
all of the carbonyl oxygens that serve as surrogate water molecules. Because the
Na+ ion is too small to interact with the oxygens, it can enter the selectivity filter
only at a great energetic expense. The filter therefore selects K+ ions with high
specificity. (A, adapted from Y. Zhou et al., Nature 414:43-48, 2001. With permission
from Macmillan Publishers Ltd.)


Mechanosensitive Channels Protect Bacterial Cells Against
Extreme Osmotic Pressures

All organisms, from single-cell bacteria to multicellular animals and plants, must
sense and respond to mechanical forces in their external environment (such as
sound, touch, pressure, shear forces, and gravity) and in their internal
environment (such as osmotic pressure and membrane bending). Numerous proteins are
known to be capable of responding to such mechanical forces, and a large subset
of those proteins has been identified as possible mechanosensitive channels, but
very few of the candidate proteins have been shown directly to be mechanically
activated ion channels. One reason for this dearth in our knowledge is that most
such channels are extremely rare. Auditory hair cells in the human cochlea, for
example, contain extraordinarily sensitive mechanically gated ion channels, but
each of the approximately 15,000 individual hair cells is thought to have a total of
only 50-100 of them (Movie 11.9). Additional difficulties arise because the gating
mechanisms of many mechanosensitive channel types require the channels to be
embedded in complex architectures that require attachment to the extracellular
matrix or to the cytoskeleton and are difficult to reconstitute in the test tube. The
study of mechanosensitive receptors is a field of active investigation.

A well-studied class of mechanosensitive channels is found in the bacterial
plasma membrane. These channels open in response to mechanical
stretching of the lipid bilayer in which they are embedded. When a bacterium
experiences a low-ionic-strength external environment (hypotonic conditions), such as
rainwater, the cell swells as water seeps in due to an increase in the osmotic
pressure. If the pressure rises to dangerous levels, the cell opens mechanosensitive
channels that allow small molecules to leak out. Bacteria that are experimentally
placed in fresh water can rapidly lose more than 95% of their small molecules in
this manner, including amino acids, sugars, and potassium ions. However, they
keep their macromolecules safely inside and thus can recover quickly after
environmental conditions return to normal.

[Figure 11-26 labels: helix; CLOSED; OPEN]

Figure 11-26 A model for the gating of a bacterial K+ channel. The channel
is viewed in cross section. To adopt the closed conformation, the four inner
transmembrane helices that line the pore on the cytosolic side of the selectivity filter
(see Figure 11-24) rearrange to close the cytosolic entrance to the channel.
(Adapted from E. Perozo et al., Science 285:73-78, 1999.)

Figure 11-27 The structure of mechanosensitive channels. The crystal
structures of MscS in its (A) closed and (B) open conformation are shown. The
side views (lower panels) show the entire protein, including the large intracellular
domain. The face views (upper panels) show the transmembrane domains only.
The open structure occupies more area in the lipid bilayer and is energetically favored
when a membrane is stretched. This may explain why the MscS channel opens as
pressure builds up inside the cell. (PDB codes: 2OAU, 2VV5.)

Mechanical gating has been demonstrated using biophysical techniques in
which force is exerted on pure lipid bilayers containing the bacterial
mechanosensitive channels; for example, by applying suction with a micropipette. Such
measurements demonstrate that the cell has several different channels that open
at different levels of pressure. The mechanosensitive channel of small
conductance, called the MscS channel, opens at low and moderate pressures (Figure
11-27). It is composed of seven identical subunits, which in the open state form a
pore about 1.3 nm in diameter, just big enough to pass ions and small molecules.
Large cytoplasmic domains limit the size of molecules that can reach the pore.
The mechanosensitive channel of large conductance, called the MscL channel,
opens to over 3 nm in diameter when the pressure gets so high that the cell might
burst.
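
To get a feel for the difference between the two pore sizes quoted above, the
following minimal sketch compares their cross-sectional areas. It assumes an
idealized circular pore, which is a simplification introduced here and not a
claim about the real channel geometry; the function name is hypothetical.

```python
import math

def pore_area_nm2(diameter_nm: float) -> float:
    """Cross-sectional area (nm^2) of an idealized circular pore."""
    radius_nm = diameter_nm / 2.0
    return math.pi * radius_nm ** 2

# Diameters taken from the text: about 1.3 nm for MscS, over 3 nm for MscL.
mscs_area = pore_area_nm2(1.3)
mscl_area = pore_area_nm2(3.0)
area_ratio = mscl_area / mscs_area

print(f"MscS open pore: {mscs_area:.2f} nm^2")
print(f"MscL open pore: {mscl_area:.2f} nm^2")
print(f"MscL/MscS area ratio: {area_ratio:.1f}")
```

Under this assumption the MscL pore offers roughly five times the open area of
the MscS pore, consistent with its role as the emergency valve of last resort.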

The Function of a Neuron Depends on Its Elongated Structure

The cells that make the most sophisticated use of channels are neurons. Before
discussing how they do so, we digress briefly to describe how a typical neuron is
organized.

The fundamental task of a neuron, or nerve cell, is to receive, conduct, and
transmit signals. To perform these functions, neurons are often extremely
elongated. In humans, for example, a single neuron extending from the spinal cord to
a muscle in the foot may be as long as 1 meter. Every neuron consists of a cell body
(containing the nucleus) with a number of thin processes radiating outward from
it. Usually one long axon conducts signals away from the cell body toward distant
targets, and several shorter, branching dendrites extend from the cell body like
antennae, providing an enlarged surface area to receive signals from the axons
of other neurons (Figure 11-28), although the cell body itself also receives such
signals. A typical axon divides at its far end into many branches, passing on its
message to many target cells simultaneously. Likewise, the extent of branching
of the dendrites can be very great, in some cases sufficient to receive as many as
100,000 inputs on a single neuron.

[Figure labels: axon (less than 1 mm to more than 1 m in length); terminal branches of axon]

Figure 11-28 A typical vertebrate
neuron. The arrows indicate the
direction in which signals are conveyed.
The single axon conducts signals
away from the cell body, while the
multiple dendrites (and the cell body)
receive signals from the axons of other
neurons. The axon terminals end on the
dendrites or cell body of other neurons
or on other cell types, such as muscle
or gland cells.

Despite the varied significance of the signals carried by different classes of
neurons, the form of the signal is always the same, consisting of changes in the
electrical potential across the neuron's plasma membrane. The signal spreads
because an electrical disturbance produced in one part of the membrane spreads
to other parts, although the disturbance becomes weaker with increasing distance
from its source, unless the neuron expends energy to amplify it as it travels. Over
short distances, this attenuation is unimportant; in fact, many small neurons
conduct their signals passively, without amplification. For long-distance
communication, however, such passive spread is inadequate. Thus, larger neurons employ
an active signaling mechanism, which is one of their most striking features. An
electrical stimulus that exceeds a certain threshold strength triggers an explosion
of electrical activity that propagates rapidly along the neuron's plasma membrane
and is sustained by automatic amplification all along the way. This traveling wave
of electrical excitation, known as an action potential, or nerve impulse, can carry
a message without attenuation from one end of a neuron to the other at speeds of
100 meters per second or more. Action potentials are the direct consequence of
the properties of voltage-gated cation channels, as we now discuss.
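
The contrast between passive spread and active propagation can be made
concrete with a small sketch. It assumes that a passively spreading disturbance
decays exponentially with distance, governed by a length constant; both the
exponential form and the 1 mm length constant are illustrative assumptions
introduced here, not values from the text. The conduction-time calculation uses
only the 1 meter axon and the 100 meters per second quoted above.

```python
import math

def passive_amplitude_mv(initial_mv: float, distance_mm: float,
                         length_constant_mm: float) -> float:
    """Amplitude left after passive spread over `distance_mm`.

    Assumes simple exponential decay; the length constant is a
    hypothetical parameter, not a value given in the text.
    """
    return initial_mv * math.exp(-distance_mm / length_constant_mm)

def conduction_time_ms(axon_length_m: float, speed_m_per_s: float) -> float:
    """Time for an action potential to travel the length of an axon."""
    return 1000.0 * axon_length_m / speed_m_per_s

ASSUMED_LENGTH_CONSTANT_MM = 1.0  # illustrative only

for distance_mm in (0.1, 1.0, 10.0):
    left = passive_amplitude_mv(10.0, distance_mm, ASSUMED_LENGTH_CONSTANT_MM)
    print(f"{distance_mm:5.1f} mm: {left:.4f} mV left of an initial 10 mV")

# A 1 m axon conducting at 100 m/s, as described in the text.
print(conduction_time_ms(1.0, 100.0), "ms to traverse the axon")
```

With these assumed numbers the passive signal is essentially gone after a few
millimeters, whereas an action potential covers the full meter in about ten
milliseconds with no loss of amplitude.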

Voltage-Gated Cation Channels Generate Action Potentials in 
Electrically Excitable Cells 

The plasma membrane of all electrically excitable cells (not only neurons, but
also muscle, endocrine, and egg cells) contains voltage-gated cation channels,
which are responsible for generating the action potentials. An action potential
is triggered by a depolarization of the plasma membrane, that is, by a shift in
the membrane potential to a less negative value inside. (We shall see later how
the action of a neurotransmitter causes depolarization.) In nerve and skeletal
muscle cells, a stimulus that causes sufficient depolarization promptly opens the
voltage-gated Na+ channels, allowing a small amount of Na+ to enter the cell down
its electrochemical gradient. The influx of positive charge depolarizes the
membrane further, thereby opening more Na+ channels, which admit more Na+ ions,
causing still further depolarization. This self-amplification process (an example of
positive feedback, discussed in Chapters 8 and 15) continues until, within a
fraction of a millisecond, the electrical potential in the local region of membrane has
shifted from its resting value of about -70 mV (in squid giant axon; about -40 mV
in human) to almost as far as the Na+ equilibrium potential of about +50 mV (see
Panel 11-1, p. 616). At this point, when the net electrochemical driving force for
the flow of Na+ is almost zero, the cell would come to a new resting state, with all of
its Na+ channels permanently open, if the open conformation of the channel were
stable. Two mechanisms act in concert to save the cell from such a permanent
electrical spasm: the Na+ channels automatically inactivate and voltage-gated K+
channels open to restore the membrane potential to its initial negative value.

Figure 11-29 Structural models of
voltage-gated Na+ channels. (A) The
channel in animal cells is built from a
single polypeptide chain that contains
four homologous domains. Each domain
contains two transmembrane α helices
(green) that surround the central
ion-conducting pore. They are separated by
sequences (blue) that form the selectivity
filter. Four additional α helices (gray
and red) in each domain constitute the
voltage sensor. The S4 helices (red) are
unique in that they contain an abundance
of positively charged arginines. An
inactivation gate that is part of a flexible
loop connecting the third and fourth
domains acts as a plug that obstructs the
pore in the channel's inactivated state, as
shown in Figure 11-30. (B) Side and top
views of a homologous bacterial channel
protein showing its arrangement within the
membrane. (C) A cross section of the pore
domain of the channel shown in (B) shows
lateral portals, through which the central
cavity is accessible from the hydrophobic
core of the lipid bilayer. In the crystals, lipid
acyl chains were found to intrude into the
pore. These lateral portals are large enough
to allow entry of small, hydrophobic,
pore-blocking drugs that are commonly used as
anesthetics and block ion conductance.
(PDB code: 3RVZ.)
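
The equilibrium potentials mentioned in the text can be estimated with the
standard Nernst equation. The sketch below is illustrative only: the ion
concentrations are assumed, textbook-style values for a squid giant axon and the
temperature is assumed to be 20 °C; none of these numbers comes from this
passage, and the function name is hypothetical.

```python
import math

GAS_CONSTANT = 8.314462618      # J / (mol K)
FARADAY_CONSTANT = 96485.33212  # C / mol

def nernst_mv(conc_out_mm: float, conc_in_mm: float,
              valence: int = 1, temp_kelvin: float = 293.15) -> float:
    """Equilibrium potential (inside relative to outside), in millivolts."""
    volts = (GAS_CONSTANT * temp_kelvin) / (valence * FARADAY_CONSTANT) \
        * math.log(conc_out_mm / conc_in_mm)
    return 1000.0 * volts

# Assumed concentrations in mM (outside, inside); illustrative values only.
e_na = nernst_mv(conc_out_mm=440.0, conc_in_mm=50.0)
e_k = nernst_mv(conc_out_mm=20.0, conc_in_mm=400.0)

print(f"Na+ equilibrium potential: {e_na:+.1f} mV")
print(f"K+  equilibrium potential: {e_k:+.1f} mV")
```

With these assumed concentrations the Na+ equilibrium potential comes out in
the neighborhood of the +50 mV quoted in the text, and the K+ equilibrium
potential lies near the resting potential, which is why opening Na+ channels
depolarizes the membrane and opening K+ channels restores it.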

The Na+ channel is built from a single polypeptide chain that contains four
structurally very similar domains. It is thought that these domains evolved by
gene duplication followed by fusion into a single large gene (Figure 11-29A). In
bacteria, in fact, the Na+ channel is a tetramer of four identical polypeptide chains,
supporting this evolutionary idea.

Each domain contributes to the central channel, which is very similar to the K+
channel. Each domain also contains a voltage sensor that is characterized by an
unusual transmembrane helix, S4, that contains many positively charged amino
acids. As the membrane depolarizes, the S4 helices experience an electrostatic
pulling force that attracts them to the now negatively charged extracellular side of
the plasma membrane. The resulting conformational change opens the channel.
The structure of a bacterial voltage-gated Na+ channel provides insights into how the
structural elements are arranged in the membrane (Figure 11-29B and C).

The Na+ channels also have an automatic inactivating mechanism, which
causes the channels to reclose rapidly even though the membrane is still
depolarized (see Figure 11-30). The Na+ channels remain in this inactivated state, unable
to reopen, until after the membrane potential has returned to its initial negative
value. The time necessary for a sufficient number of Na+ channels to recover from
inactivation to support a new action potential, termed the refractory period, limits
the repetitive firing rate of a neuron. The cycle from initial stimulus to the return
to the original resting state takes a few milliseconds or less. The Na+ channel can
therefore exist in three distinct states (closed, open, and inactivated), which
contribute to the rise and fall of the action potential (Figure 11-30).

Figure 11-30 Na+ channels and an action potential. (A) An action potential is triggered by a brief pulse of current, which
(B) partially depolarizes the membrane, as shown in the plot of membrane potential versus time. The green curve shows how
the membrane potential would have simply relaxed back to the resting value after the initial depolarizing stimulus if there had
been no voltage-gated Na+ channels in the membrane. The red curve shows the course of the action potential that is caused
by the opening and subsequent inactivation of voltage-gated Na+ channels. The states of the Na+ channels are indicated
in (B). The membrane cannot fire a second action potential until the Na+ channels have returned from the inactivated to the
closed conformation; until then, the membrane is refractory to stimulation. (C) The three states of the Na+ channel. When the
membrane is at rest (highly polarized), the closed conformation of the channel has the lowest free energy and is therefore most
stable; when the membrane is depolarized, the energy of the open conformation is lower, so the channel has a high probability
of opening. But the free energy of the inactivated conformation is lower still; therefore, after a randomly variable period spent in
the open state, the channel becomes inactivated. Thus, the open conformation corresponds to a metastable state that can exist
only transiently when the membrane depolarizes (Movie 11.10).
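
The three-state cycle described above can be summarized as a small state
machine. This is a deliberately simplified, deterministic sketch: real channels
switch states stochastically and spend a randomly variable time in the open
state, and the names used here (NaChannelState, next_state, run) are
hypothetical labels chosen for illustration.

```python
from enum import Enum

class NaChannelState(Enum):
    CLOSED = "closed"
    OPEN = "open"
    INACTIVATED = "inactivated"

def next_state(state: NaChannelState, depolarized: bool) -> NaChannelState:
    """Advance one simplified time step.

    Depolarized membrane: closed -> open -> inactivated (and stays there).
    Repolarized membrane: any state returns to closed, ready to fire again.
    """
    if depolarized:
        if state is NaChannelState.CLOSED:
            return NaChannelState.OPEN
        return NaChannelState.INACTIVATED
    return NaChannelState.CLOSED

def run(depolarized_steps):
    """Return the sequence of states, starting from the closed state."""
    state = NaChannelState.CLOSED
    history = [state]
    for depolarized in depolarized_steps:
        state = next_state(state, depolarized)
        history.append(state)
    return history

# Membrane stays depolarized for three steps, repolarizes, then is stimulated again.
trace = run([True, True, True, False, True])
print([s.value for s in trace])
```

The sketch captures the key point of the refractory period: while the membrane
remains depolarized the channel sits in the inactivated state and cannot reopen;
only after repolarization returns it to the closed state can a new stimulus open
it again.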

This description of an action potential applies only to a small patch of plasma
membrane. The self-amplifying depolarization of the patch, however, is sufficient
to depolarize neighboring regions of membrane, which then go through the same
cycle. In this way, the action potential sweeps like a wave from the initial site of
depolarization over the entire plasma membrane, as shown in Figure 11-31.

The Use of Channelrhodopsins Has Revolutionized the Study of 
Neural Circuits 

Channelrhodopsins are photosensitive ion channels that open in response to
light. They evolved as sensory receptors in photosynthetic green algae to allow
the algae to swim toward light. The structure of channelrhodopsin closely
resembles that of bacteriorhodopsin (see Figure 10-31). It contains a covalently bound
retinal group that absorbs light and undergoes an isomerization reaction, which
triggers a conformational change in the protein, opening an ion channel in the
plasma membrane. In contrast to bacteriorhodopsin, which is a light-driven
proton pump, channelrhodopsin is a light-gated cation channel.

Using genetic engineering techniques, channelrhodopsin can be expressed
in virtually any cell type in vertebrates and invertebrates. Researchers first
introduced the gene into cultured neurons and showed that flashing light could
now activate the channelrhodopsin and induce the neurons to fire action
potentials. Because the frequency of the light flashes determined the frequency of the
action potentials, one can control the frequency of neuronal firing with
millisecond precision.

Next, neurobiologists used the approach to activate specific neurons in the
brain of experimental animals. Using a tiny fiber optic cable implanted near the
relevant brain region, they could flash light to specifically activate the
channelrhodopsin-containing neurons to fire action potentials. One group of researchers
expressed channelrhodopsin in a subset of mouse neurons thought to be involved
in aggression: when these cells were activated by light, the mouse immediately
attacked anything in its environment, including other mice or even an inflated
rubber glove (Figure 11-32); when the light was switched off, the neurons fell
silent and the mouse's behavior returned to normal.

Figure 11-31 The propagation of an action potential along an axon. (A) The
voltages that would be recorded from a set of intracellular electrodes placed at
intervals along the axon. (B) The changes in the Na+ channels and the current flows
(curved red arrows) that give rise to a traveling action potential. The region of the
axon with a depolarized membrane is shaded in blue. Note that once an action
potential has started to progress, it has to continue in the same direction, traveling
only away from the site of depolarization, because Na+-channel inactivation
prevents the depolarization from spreading backward.

Since these pioneering studies, researchers have engineered additional
light-responsive ion channels and transporters, including some that can rapidly
inactivate specific neurons. It is therefore now possible to transiently activate or
inhibit specific neurons in the brains of awake animals with remarkable spatial
and temporal precision. In this way, the rapidly expanding new field of
optogenetics is revolutionizing neurobiology, allowing neuroscientists to analyze the
neurons and circuits underlying even the most complex behaviors in
experimental animals, including nonhuman primates.

Figure 11-32 Optogenetic control of
aggression neurons in a living mouse.
A gene encoding channelrhodopsin was
introduced into a subpopulation of neurons
in the hypothalamus of a mouse. When the
neurons were exposed to flashing blue light
using a tiny, implanted fiber optic cable,
the channelrhodopsin channels opened,
depolarizing and activating the cells.
When the light was switched on, the
mouse immediately became aggressive
and attacked the inflated rubber glove;
when the light was switched off, its
behavior immediately returned to normal
(Movie 11.11). (From D. Lin et al., Nature
470:221-226, 2011. With permission from
Macmillan Publishers Ltd.)

Myelination Increases the Speed and Efficiency of Action Potential 
Propagation in Nerve Cells 

The axons of many vertebrate neurons are insulated by a myelin sheath, which
greatly increases the rate at which an axon can conduct an action potential. The
importance of myelination is dramatically demonstrated by the demyelinating
disease multiple sclerosis, in which the immune system destroys myelin sheaths in
some regions of the central nervous system; in the affected regions, nerve impulse
propagation greatly slows or even fails, often with devastating neurological
consequences.

Myelin is formed by specialized non-neuronal supporting cells called glial
cells. Schwann cells are the glial cells that myelinate axons in peripheral nerves,
and oligodendrocytes do so in the central nervous system. These myelinating
glial cells wrap layer upon layer of their own plasma membrane in a tight spiral
around the axon (Figure 11-33A and B), thereby insulating the axonal membrane
so that little current can leak across it. The myelin sheath is interrupted at regularly
spaced nodes of Ranvier, where almost all the Na+ channels in the axon are
concentrated (Figure 11-33C). This arrangement allows an action potential to
propagate along a myelinated axon by jumping from node to node, a process called
saltatory conduction. This type of conduction has two main advantages: action
potentials travel very much faster, and metabolic energy is conserved because the
active excitation is confined to the small regions of axonal plasma membrane at
nodes of Ranvier.


Figure 11-33 Myelination.
(A) A myelinated axon from a peripheral
nerve. Each Schwann cell wraps its
plasma membrane concentrically around
the axon to form a segment of myelin
sheath about 1 mm long. For clarity,
the membrane layers of the myelin are
shown less compacted than they are
in reality (see part B). (B) An electron
micrograph of a nerve in the leg of a
young rat. Two Schwann cells can
be seen: one near the bottom is just
beginning to myelinate its axon; the
one above it has formed an almost
mature myelin sheath. (C) Fluorescence
micrograph and diagram of individual
myelinated axons teased apart in a rat
optic nerve, showing the confinement
of the voltage-gated Na+ channels
(green) in the axonal membrane at the
node of Ranvier. A protein called Caspr
(red) marks the junctions where the
myelinating glial cell plasma membrane
tightly abuts the axon on either side of
the node. Voltage-gated K+ channels
(blue) localize to regions in the axon
plasma membrane well away from the
node. (B, from Cedric S. Raine, in Myelin
[P. Morell, ed.]. New York: Plenum, 1976;
C, from M.N. Rasband and P. Shrager,
J. Physiol. 525:63-73, 2000. With
permission from Blackwell Publishing.)

Patch-Clamp Recording Indicates That Individual Ion Channels
Open in an All-or-Nothing Fashion

Neuron and skeletal muscle cell plasma membranes contain many thousands of
voltage-gated Na+ channels, and the current crossing the membrane is the sum
of the currents flowing through all of these. An intracellular microelectrode can
record this aggregate current, as shown in Figure 11-31A. Remarkably, however,
it is also possible to record current flowing through individual channels.
Patch-clamp recording, developed in the 1970s and 1980s, revolutionized the study of
ion channels and made it possible to examine transport through a single
channel in a small patch of membrane covering the mouth of a micropipette (Figure
11-34). With this simple but powerful technique, one can study the detailed
properties of ion channels in all sorts of cell types. This work led to the discovery that
even cells that are not electrically excitable usually have a variety of ion channels
in their plasma membrane. Many of these cells, such as yeasts, are too small to be
investigated by the traditional electrophysiologist's method of impalement with
an intracellular microelectrode.

Patch-clamp recording indicates that individual ion channels open in an
all-or-nothing fashion. For example, a voltage-gated Na+ channel opens and closes
at random, but when open, the channel always has the same large conductance,
allowing more than 1000 ions to pass per millisecond (Figure 11-35). Therefore,
the aggregate current crossing the membrane of an entire cell does not indicate
the degree to which a typical individual channel is open but rather the total
number of channels in its membrane that are open at any one time.
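
The idea that a smooth aggregate current is just a count of all-or-nothing
channels can be sketched with a toy simulation. Everything about the kinetics
here is an assumption made for illustration: each channel is assumed to open
once after an exponentially distributed delay and to stay open for an
exponentially distributed time before inactivating, and the mean times are
invented. The only number taken from the text is the figure of 1000 ions per
millisecond, used as a lower bound on the single-channel current.

```python
import random

ELEMENTARY_CHARGE_C = 1.602176634e-19  # charge of one monovalent ion

def ions_per_ms_to_pa(ions_per_ms: float) -> float:
    """Convert a flux of monovalent ions (per millisecond) to picoamperes."""
    amperes = ions_per_ms * ELEMENTARY_CHARGE_C / 1e-3
    return amperes * 1e12

def open_channel_counts(n_channels, times_ms, seed=0,
                        mean_latency_ms=0.5, mean_open_ms=0.7):
    """Number of open channels at each time after depolarization at t = 0.

    Each channel opens once, fully, then inactivates (assumed toy kinetics).
    """
    rng = random.Random(seed)
    intervals = []
    for _ in range(n_channels):
        t_open = rng.expovariate(1.0 / mean_latency_ms)
        t_close = t_open + rng.expovariate(1.0 / mean_open_ms)
        intervals.append((t_open, t_close))
    return [sum(1 for start, end in intervals if start <= t < end)
            for t in times_ms]

unitary_pa = ions_per_ms_to_pa(1000)           # lower bound from the text
times = [0.1 * k for k in range(50)]           # 0 to 4.9 ms
counts = open_channel_counts(144, times)       # 144 channels, as in Figure 11-35C
aggregate_pa = [n * unitary_pa for n in counts]

print(f"single-channel current: at least {unitary_pa:.2f} pA")
print(f"peak aggregate current: {max(aggregate_pa):.2f} pA")
```

At every instant the aggregate current is an integer multiple of the
single-channel current; its time course reflects only how many channels happen
to be open, not how far any one of them is open.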

Some simple physical principles allow us to refine our understanding of
voltage-gating from the perspective of a single Na+ channel. The interior of the resting
neuron or muscle cell is at an electrical potential about 40-100 mV more negative
than the external medium. Although this potential difference seems small, it exists
across a plasma membrane only about 5 nm thick, so that the resulting voltage
gradient is about 100,000 V/cm. Charged proteins in the membrane such as Na+
channels are thus subjected to a very large electrical field that can profoundly
affect their conformation. Each conformation can "flip" to another conformation if
given a sufficient jolt by the random thermal movements of the surroundings, and
it is the relative stability of the closed, open, and inactivated conformations against
flipping that is altered by changes in the membrane potential (see Figure 11-30C).
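
The figure of about 100,000 V/cm follows directly from dividing the membrane
potential by the membrane thickness. The short sketch below checks the
arithmetic for the 40-100 mV range and 5 nm thickness given in the text; the
function name is a hypothetical label, and the field is treated as uniform
across the membrane, which is a simplification.

```python
def field_v_per_cm(potential_mv: float, thickness_nm: float) -> float:
    """Average electric field across a membrane, assuming a uniform field."""
    volts = potential_mv * 1e-3
    centimeters = thickness_nm * 1e-7   # 1 nm = 1e-7 cm
    return volts / centimeters

for potential_mv in (40.0, 50.0, 100.0):
    field = field_v_per_cm(potential_mv, thickness_nm=5.0)
    print(f"{potential_mv:5.0f} mV across 5 nm -> {field:,.0f} V/cm")
```

A resting potential of 50 mV gives almost exactly 100,000 V/cm, and the full
40-100 mV range spans roughly 80,000 to 200,000 V/cm.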

Voltage-Gated Cation Channels Are Evolutionarily and Structurally 
Related 

Na+ channels are not the only kind of voltage-gated cation channel that can
generate an action potential. The action potentials in some muscle, egg, and
endocrine cells, for example, depend on voltage-gated Ca2+ channels rather than on
Na+ channels.

Figure 11-34 The technique of
patch-clamp recording. Because of
the extremely tight seal between the
micropipette and the membrane, current
can enter or leave the micropipette only
by passing through the ion channels in the
patch of membrane covering its tip. The
term clamp is used because an electronic
device is employed to maintain, or "clamp,"
the membrane potential at a set value
while recording the ionic current through
individual channels. The current through
these channels can be recorded with
the patch still attached to the rest of the
cell, as in (A), or detached, as in (B). The
advantage of the detached patch is that
it is easy to alter the composition of the
solution on either side of the membrane to
test the effect of various solutes on channel
behavior. A detached patch can also be
produced with the opposite orientation,
so that the cytoplasmic surface of the
membrane faces the inside of the pipette.

There is a surprising amount of structural and functional diversity within each
of the different classes of voltage-gated cation channels, generated both by
multiple genes and by the alternative splicing of RNA transcripts produced from the
same gene. Nonetheless, the amino acid sequences of the known voltage-gated
Na+, K+, and Ca2+ channels show striking similarities, demonstrating that they all
belong to a large superfamily of evolutionarily and structurally related proteins
and share many of the design principles. Whereas the single-celled yeast
S. cerevisiae contains a single gene that codes for a voltage-gated K+ channel, the genome
of the worm C. elegans contains 68 genes that encode different but related K+
channels. This complexity indicates that even a simple nervous system made up
of only 302 neurons uses a large number of different ion channels to compute its
responses.

Humans who inherit mutant genes encoding ion channels can suffer from a
variety of nerve, muscle, brain, or heart diseases, depending on the cells in which the
channel encoded by the mutant gene normally functions. Mutations in genes
that encode voltage-gated Na+ channels in skeletal muscle cells, for example, can
cause myotonia, a condition in which there is a delay in muscle relaxation after
voluntary contraction, causing painful muscle spasms. In some cases, this occurs
because the abnormal channels fail to inactivate normally; as a result, Na+ entry
persists after an action potential finishes and repeatedly reinitiates membrane
depolarization and muscle contraction. Similarly, mutations that affect Na+ or K+
channels in the brain can cause epilepsy, in which excessive synchronized firing of
large groups of neurons causes epileptic seizures (convulsions, or fits).

The particular combination of ion channels conducting Na+, K+, and Ca2+
that is expressed in a neuron largely determines how the cell fires repetitive
sequences of action potentials. Some nerve cells can repeat action potentials up
to 300 times per second; other neurons fire short bursts of action potentials
separated by periods of silence; while others rarely fire more than one action potential
at a time. There is a remarkable diversity of neurons in the brain.

Different Neuron Types Display Characteristic Stable Firing 
Properties 

It is estimated that the human brain contains about 10^11 neurons and 10^14
synaptic connections. To make matters more complex, neural circuitry is continuously
sculpted in response to experience, modified as we learn and store memories,
and irreversibly altered by the gradual loss of neurons and their connections as
we age. How can a system so complex be subject to such change and yet
continue to function stably? One emerging theory suggests that individual neurons
are self-tuning devices, constantly adjusting the expression of ion channels and
neurotransmitter receptors in order to maintain a stable function. How might this
work?

Neurons can be categorized into functionally different types, based in part on
their propensity to fire action potentials and their pattern of firing. For example,
some neurons fire action potentials at high frequencies, while others fire rarely.
The firing properties of each neuron type are determined to a large extent by the
ion channels that the cell expresses. The number of ion channels in a neuron's
membrane is not fixed: as conditions change, a neuron can modify the
numbers of depolarizing (Na+ and Ca2+) and hyperpolarizing (K+) channels and keep
their proportions adjusted so as to maintain its characteristic firing behavior, a
remarkable example of homeostatic control. The molecular mechanisms involved
remain an important mystery.

Transmitter-Gated Ion Channels Convert Chemical Signals into
Electrical Ones at Chemical Synapses

Neuronal signals are transmitted from cell to cell at specialized sites of contact
known as synapses. The usual mechanism of transmission is indirect. The cells are
electrically isolated from one another, the presynaptic cell being separated from
the postsynaptic cell by a narrow synaptic cleft. When an action potential arrives
at the presynaptic site, the depolarization of the membrane opens voltage-gated
Ca2+ channels that are clustered in the presynaptic membrane. Ca2+ influx
triggers the release into the cleft of small signal molecules known as
neurotransmitters, which are stored in membrane-enclosed synaptic vesicles and released by
exocytosis (discussed in Chapter 13). The neurotransmitter diffuses rapidly across
the synaptic cleft and provokes an electrical change in the postsynaptic cell by
binding to and opening transmitter-gated ion channels (Figure 11-36). After the
neurotransmitter has been secreted, it is rapidly removed: it is either destroyed
by specific enzymes in the synaptic cleft or taken up by the presynaptic nerve
terminal or by surrounding glial cells. Reuptake is mediated by a variety of
Na+-dependent neurotransmitter symporters (see Figure 11-8); in this way,
neurotransmitters are recycled, allowing cells to keep up with high rates of release. Rapid
removal ensures both spatial and temporal precision of signaling at a synapse.
It decreases the chances that the neurotransmitter will influence neighboring
cells, and it clears the synaptic cleft before the next pulse of neurotransmitter is
released, so that the timing of repeated, rapid signaling events can be accurately
communicated to the postsynaptic cell. As we shall see, signaling via such
chemical synapses is far more versatile and adaptable than direct electrical coupling via
gap junctions at electrical synapses (discussed in Chapter 19), which are also used
by neurons but to a much smaller extent.

Figure 11-35 Patch-clamp
measurements for a single
voltage-gated Na+ channel. A tiny patch of
plasma membrane was detached from
an embryonic rat muscle cell, as in Figure
11-34. (A) The membrane was depolarized
by an abrupt shift of potential from -90 to
about -40 mV. (B) Three current records
from three experiments performed on
the same patch of membrane. Each
major current step in (B) represents the
opening and closing of a single channel.
A comparison of the three records shows
that, whereas the durations of channel
opening and closing vary greatly, the rate
at which current flows through an open
channel (its conductance) is practically
constant. The minor fluctuations in
the current records arise largely from
electrical noise in the recording apparatus.
Current flowing into the cell, measured
in picoamperes (pA), is shown as a
downward deflection of the curve. By
convention, the electrical potential on the
outside of the cell is defined as zero.
(C) The sum of the currents measured in
144 repetitions of the same experiment.
This aggregate current is equivalent to the
usual Na+ current that would be observed
flowing through a relatively large region
of membrane containing 144 channels.
A comparison of (B) and (C) reveals that
the time course of the aggregate current
reflects the probability that any individual
channel will be in the open state; this
probability decreases with time as the
channels in the depolarized membrane
adopt their inactivated conformation.
(Data from J. Patlak and R. Horn,
J. Gen. Physiol. 79:333-351, 1982.
With permission from The Rockefeller
University Press.)

Figure 11-36 A chemical synapse. (A) When an action potential reaches
the nerve terminal in a presynaptic cell, it stimulates the terminal to release
its neurotransmitter. The neurotransmitter molecules are contained in
synaptic vesicles and are released to the cell exterior when the vesicles
fuse with the plasma membrane of the nerve terminal. The released
neurotransmitter binds to and opens the transmitter-gated ion channels
concentrated in the plasma membrane of the postsynaptic target cell at
the synapse. The resulting ion flows alter the membrane potential of the
postsynaptic membrane, thereby transmitting a signal from the excited nerve
(Movie 11.12). (B) A thin-section electron micrograph of two nerve terminal
synapses on a dendrite of a postsynaptic cell. (B, courtesy of Cedric Raine.)

Transmitter-gated ion channels, also called ionotropic receptors, are built
for rapidly converting extracellular chemical signals into electrical signals at
chemical synapses. The channels are concentrated in a specialized region of the
postsynaptic plasma membrane at the synapse and open transiently in response
to the binding of neurotransmitter molecules, thereby producing a brief
permeability change in the membrane (see Figure 11-36A). Unlike the voltage-gated
channels responsible for action potentials, transmitter-gated channels are
relatively insensitive to the membrane potential and therefore cannot by themselves
produce a self-amplifying excitation. Instead, they produce local permeability
increases, and hence changes of membrane potential, that are graded according
to the amount of neurotransmitter released at the synapse and how long it persists
there. Only if the summation of small depolarizations at this site opens sufficient
numbers of nearby voltage-gated cation channels can an action potential be
triggered. This may require the opening of transmitter-gated ion channels at
numerous synapses in close proximity on the target nerve cell.
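
The requirement that many small, graded depolarizations add up before an
action potential fires can be sketched as a simple threshold test. The model
below assumes that the depolarizations sum linearly, and the resting potential,
threshold, and size of each synaptic depolarization are assumed illustrative
values, not figures from the text; the function name is hypothetical.

```python
def triggers_action_potential(resting_mv: float, threshold_mv: float,
                              depolarizations_mv) -> bool:
    """Return True if the summed graded depolarizations reach threshold.

    Assumes simple linear summation of simultaneous synaptic inputs.
    """
    membrane_mv = resting_mv + sum(depolarizations_mv)
    return membrane_mv >= threshold_mv

ASSUMED_RESTING_MV = -70.0     # illustrative
ASSUMED_THRESHOLD_MV = -55.0   # illustrative
ASSUMED_INPUT_MV = 0.5         # depolarization per active synapse, illustrative

few_synapses = [ASSUMED_INPUT_MV] * 10    # 10 active synapses: +5 mV in total
many_synapses = [ASSUMED_INPUT_MV] * 40   # 40 active synapses: +20 mV in total

fires_with_few = triggers_action_potential(
    ASSUMED_RESTING_MV, ASSUMED_THRESHOLD_MV, few_synapses)
fires_with_many = triggers_action_potential(
    ASSUMED_RESTING_MV, ASSUMED_THRESHOLD_MV, many_synapses)

print("10 synapses fire the cell:", fires_with_few)
print("40 synapses fire the cell:", fires_with_many)
```

With these assumed numbers a handful of active synapses leaves the membrane
below threshold, whereas the combined input of many nearby synapses is enough
to open the voltage-gated channels and trigger an action potential.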

Chemical Synapses Can Be Excitatory or Inhibitory 

Transmitter-gated ion channels differ from one another in several important ways.
First, as receptors, they have highly selective binding sites for the neurotransmitter
that is released from the presynaptic nerve terminal. Second, as channels, they
are selective in the type of ions that they let pass across the plasma membrane;
this determines the nature of the postsynaptic response. Excitatory neurotransmitters
open cation channels, causing an influx of Na⁺, and in many cases Ca²⁺,
that depolarizes the postsynaptic membrane toward the threshold potential for
firing an action potential. Inhibitory neurotransmitters, by contrast, open either
Cl⁻ channels or K⁺ channels, and this suppresses firing by making it harder for
excitatory neurotransmitters to depolarize the postsynaptic membrane. Many
transmitters can be either excitatory or inhibitory, depending on where they are
released, what receptors they bind to, and the ionic conditions that they encounter.
Acetylcholine, for example, can either excite or inhibit, depending on the type
of acetylcholine receptors it binds to. Usually, however, acetylcholine, glutamate,
and serotonin are used as excitatory transmitters, and γ-aminobutyric acid (GABA)
and glycine are used as inhibitory transmitters. Glutamate, for instance, mediates
most of the excitatory signaling in the vertebrate brain.

We have already discussed how the opening of Na⁺ or Ca²⁺ channels depolarizes
a membrane. The opening of K⁺ channels has the opposite effect because
the K⁺ concentration gradient is in the opposite direction: high concentration
inside the cell, low outside. Opening K⁺ channels tends to keep the cell close to
the equilibrium potential for K⁺, which, as we discussed earlier, is normally close
to the resting membrane potential because at rest K⁺ channels are the main type
of channel that is open. When additional K⁺ channels open, it becomes harder to
drive the cell away from the resting state. We can understand the effect of opening
Cl⁻ channels similarly. The concentration of Cl⁻ is much higher outside the
cell than inside (see Table 11-1, p. 598), but the membrane potential opposes its
influx. In fact, for many neurons, the equilibrium potential for Cl⁻ is close to the
resting potential, or even more negative. For this reason, opening Cl⁻ channels
tends to buffer the membrane potential; as the membrane starts to depolarize,
more negatively charged Cl⁻ ions enter the cell and counteract the depolarization.
Thus, the opening of Cl⁻ channels makes it more difficult to depolarize the
membrane and hence to excite the cell. Some powerful toxins act by blocking the
action of inhibitory neurotransmitters: strychnine, for example, binds to glycine
receptors and prevents their inhibitory action, causing muscle spasms, convulsions,
and death.
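
The equilibrium potentials discussed above can be estimated with the Nernst equation. The following is a minimal sketch only: the ion concentrations are illustrative values assumed for a typical mammalian cell (they are not quoted from Table 11-1), the temperature is assumed to be 37°C, and the function name `nernst_mv` is hypothetical.

```python
import math

R = 8.314      # gas constant, J/(mol*K)
F = 96485.0    # Faraday constant, C/mol
T = 310.0      # assumed temperature in kelvin (37 degrees C)

def nernst_mv(valence, conc_out_mM, conc_in_mM, temp_K=T):
    """Equilibrium (Nernst) potential in millivolts, inside relative to outside."""
    return 1000.0 * (R * temp_K) / (valence * F) * math.log(conc_out_mM / conc_in_mM)

# Assumed, illustrative concentrations in mM: (valence, outside, inside)
ASSUMED_IONS = {
    "K+":  (+1, 5.0, 140.0),
    "Na+": (+1, 145.0, 10.0),
    "Cl-": (-1, 110.0, 10.0),
}

for name, (z, outside, inside) in ASSUMED_IONS.items():
    print(f"{name}: E = {nernst_mv(z, outside, inside):.0f} mV")
```

With these assumed values, the K⁺ and Cl⁻ equilibrium potentials both come out strongly negative, near a typical resting potential, whereas the Na⁺ equilibrium potential is strongly positive. This is consistent with the argument that opening K⁺ or Cl⁻ channels holds the membrane near rest, while opening Na⁺ channels depolarizes it.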

However, not all chemical signaling in the nervous system operates through
these ionotropic ligand-gated ion channels. In fact, most neurotransmitter molecules
that are secreted by nerve terminals, including a large variety of neuropeptides,
bind to metabotropic receptors, which regulate ion channels only
indirectly through the action of small intracellular signal molecules (discussed
in Chapter 15). All neurotransmitter receptors fall into one or other of these two
major classes, ionotropic or metabotropic, on the basis of their signaling mechanisms:

1. Ionotropic receptors are ion channels and feature at fast chemical synapses.
Acetylcholine, glycine, glutamate, and GABA all act on transmitter-gated
ion channels, mediating excitatory or inhibitory signaling that is
generally immediate, simple, and brief.

2. Metabotropic receptors are G-protein-coupled receptors (discussed in
Chapter 15) that bind to all other neurotransmitters (and, confusingly, also
acetylcholine, glutamate, and GABA). Signaling mediated by ligand binding
to metabotropic receptors tends to be far slower and more complex
than that at ionotropic receptors, and longer-lasting in its consequences.

The Acetylcholine Receptors at the Neuromuscular Junction Are
Excitatory Transmitter-Gated Cation Channels

A well-studied example of a transmitter-gated ion channel is the acetylcholine
receptor of skeletal muscle cells. This channel is opened transiently by acetylcholine
released from the nerve terminal at a neuromuscular junction, the specialized
chemical synapse between a motor neuron and a skeletal muscle cell (Figure
11-37). This synapse has been intensively investigated because it is readily accessible
to electrophysiological study, unlike most of the synapses in the central nervous
system, that is, the brain and spinal cord in vertebrates. Moreover, the acetylcholine
receptors are densely packed in the muscle cell plasma membrane at
a neuromuscular junction (about 20,000 such receptors per μm²), with relatively
few receptors elsewhere in the same membrane.

The receptors are composed of five transmembrane polypeptides, two of one
kind and three others, encoded by four separate genes (Figure 11-38A). The four
genes are strikingly similar in sequence, implying that they evolved from a single
ancestral gene. The two identical polypeptides in the pentamer each contribute
one acetylcholine-binding site. When two acetylcholine molecules bind to the
pentameric complex, they induce a conformational change that opens the channel.
With ligand bound, the channel still flickers between open and closed states,
but now it has a 90% probability of being open. This state continues, with acetylcholine
binding and unbinding, until hydrolysis of the free acetylcholine by
the enzyme acetylcholinesterase lowers its concentration at the neuromuscular
junction sufficiently. Once freed of its bound neurotransmitter, the acetylcholine
receptor reverts to its initial resting state. If the presence of acetylcholine persists
for a prolonged time as a result of excessive nerve stimulation, the channel inactivates
(desensitizes). Normally, the acetylcholine is rapidly hydrolyzed and the channel closes
within about 1 millisecond, well before significant desensitization occurs. Desensitization
would occur after about 20 milliseconds in the continued presence of
acetylcholine.

Figure 11-37 A low-magnification scanning electron micrograph of a
neuromuscular junction in a frog. The termination of a single axon on a skeletal
muscle cell is shown. (From J. Desaki and Y. Uehara, J. Neurocytol. 10:101-110,
1981. With permission from Kluwer Academic Publishers.)

The five subunits of the acetylcholine receptor are arranged in a ring, forming
a water-filled transmembrane channel that consists of a narrow pore through
the lipid bilayer, which widens into vestibules at both ends. Acetylcholine binding
opens the channel by causing the helices that line the pore to rotate outward, thus
disrupting a ring of hydrophobic amino acids that blocks ion flow in the closed
state. Clusters of negatively charged amino acids at either end of the pore help to
exclude negative ions and encourage any positive ion of diameter less than 0.65
nm to pass through (Figure 11-38B). The normal through-traffic consists chiefly
of Na⁺ and K⁺, together with some Ca²⁺. Thus, unlike voltage-gated cation channels,
such as the K⁺ channel discussed earlier, there is little selectivity among cations,
and the relative contributions of the different cations to the current through
the channel depend chiefly on their concentrations and on the electrochemical
driving forces. When the muscle cell membrane is at its resting potential, the net
driving force for K⁺ is near zero, since the voltage gradient nearly balances the K⁺
concentration gradient across the membrane (see Panel 11-1, p. 616). For Na⁺,
in contrast, the voltage gradient and the concentration gradient both act in the
same direction to drive the ion into the cell. (The same is true for Ca²⁺, but the
extracellular concentration of Ca²⁺ is so much lower than that of Na⁺ that Ca²⁺
makes only a small contribution to the total inward current.) Therefore, the opening
of the acetylcholine-receptor channels leads to a large net influx of Na⁺ (a
peak rate of about 30,000 ions per channel each millisecond). This influx causes a
membrane depolarization that signals the muscle to contract, as discussed below.
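
As a rough check on the scale of this influx, the peak rate of about 30,000 ions per channel each millisecond can be converted into an electrical current. The sketch below assumes that every ion carries a single positive charge; the function name `ion_flux_to_picoamps` is hypothetical and introduced only for illustration.

```python
ELEMENTARY_CHARGE_C = 1.602176634e-19  # charge carried by one monovalent ion, in coulombs

def ion_flux_to_picoamps(ions_per_ms, valence=1):
    """Convert an ion flux (ions per millisecond) into a current in picoamperes."""
    ions_per_second = ions_per_ms * 1000.0
    amperes = ions_per_second * valence * ELEMENTARY_CHARGE_C
    return amperes * 1e12

# Peak Na+ influx through one open acetylcholine-receptor channel (figure from the text)
peak_current_pA = ion_flux_to_picoamps(30_000)
print(f"{peak_current_pA:.1f} pA")  # prints "4.8 pA"
```

A single open channel therefore carries a current of only a few picoamperes; the depolarization of the muscle membrane depends on the many densely packed receptors at the junction opening together.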


Figure 11-38 A model for the structure of the skeletal muscle acetylcholine
receptor. (A) Five homologous subunits (α, α, β, γ, δ) combine to form a
transmembrane pore. Both of the α subunits contribute an acetylcholine-binding
site nestled between adjoining subunits. (B) The pore is lined by a ring
of five transmembrane α helices, one contributed by each subunit (just the
two α subunits are shown). In its closed conformation, the pore is occluded
by the hydrophobic side chains of five leucines (green), one from each α helix,
which form a gate near the middle of the lipid bilayer. When acetylcholine binds to
both α subunits, the channel undergoes a conformational change that opens
the gate by an outward rotation of the helices containing the occluding leucines.
Negatively charged side chains (indicated by the minus signs) at either end of the pore
ensure that only positively charged ions pass through the channel. (PDB code:
2BG9.)


Neurons Contain Many Types of Transmitter-Gated Channels 

The ion channels that open directly in response to the neurotransmitters acetylcholine,
serotonin, GABA, and glycine contain subunits that are structurally similar
and probably form transmembrane pores in the same way as the ionotropic
acetylcholine receptor, even though they have distinct neurotransmitter-binding
specificities and ion selectivities. These channels are all built from homologous
polypeptide subunits, which assemble as a pentamer. Glutamate-gated ion channels
are an exception, in that they are constructed from a distinct family of subunits
and form tetramers resembling the K⁺ channels discussed earlier (see Figure
11-24A).

For each class of transmitter-gated ion channel, there are alternative forms of
each type of subunit, which may be encoded by distinct genes or else generated
by alternative RNA splicing of a single gene product. The subunits assemble in
different combinations to form an extremely diverse set of distinct channel subtypes,
with different ligand affinities, different channel conductances, different
rates of opening and closing, and different sensitivities to drugs and toxins. Some
vertebrate neurons, for example, have acetylcholine-gated ion channels that differ
from those of muscle cells in that they are formed from two subunits of one
type and three of another; but there are at least nine genes coding for different
versions of the first type of subunit and at least three coding for different versions
of the second. Subsets of such neurons performing different functions in the brain
express different combinations of the genes for these subunits. In principle, and
already to some extent in practice, it is possible to design drugs targeted against
these narrowly defined subsets, thereby specifically influencing particular brain
functions.


Many Psychoactive Drugs Act at Synapses 

Transmitter-gated ion channels have for a long time been important drug targets.
A surgeon, for example, can relax muscles for the duration of an operation
by blocking the acetylcholine receptors on skeletal muscle cells with curare, a
plant-derived drug that was originally used by South American Indians to make
poison arrows. Most drugs used to treat insomnia, anxiety, depression, and
schizophrenia exert their effects at chemical synapses, and many of these act
by binding to transmitter-gated channels. Barbiturates, tranquilizers such as
Valium, and sleeping pills such as Ambien, for example, bind to GABA receptors,
potentiating the inhibitory action of GABA by allowing lower concentrations of
this neurotransmitter to open Cl⁻ channels. Our increasing understanding of the
molecular biology of ion channels should allow us to design a new generation of
psychoactive drugs that will act still more selectively to alleviate the miseries of
mental illness.

In addition to ion channels, many other components of the synaptic signaling
machinery are potential targets for psychoactive drugs. As mentioned earlier, after
release into the synaptic cleft, many neurotransmitters are cleared by reuptake
mechanisms mediated by Na⁺-driven symports. Inhibiting such transporters prolongs
the effect of the neurotransmitter, thereby strengthening synaptic transmission.
Many antidepressant drugs, including Prozac, inhibit the reuptake of serotonin;
others inhibit the reuptake of both serotonin and norepinephrine.

Ion channels are the basic molecular units from which neuronal devices for 
signaling and computation are built. To provide a glimpse of how sophisticated 
these devices can be, we consider several examples that demonstrate how the 
coordinated activities of groups of ion channels allow you to move, feel, and 
remember. 

Neuromuscular Transmission Involves the Sequential Activation of 
Five Different Sets of Ion Channels 

The following process, in which a nerve impulse stimulates a muscle cell to contract,
illustrates the importance of ion channels to electrically excitable cells. This
apparently simple response requires the sequential activation of at least five different
sets of ion channels, all within a few milliseconds (Figure 11-39).

Figure 11-39 The system of ion channels at a neuromuscular junction.
These gated ion channels are essential for the stimulation of muscle contraction by a
nerve impulse. The various channels are numbered in the sequence in which they
are activated, as described in the text. (Panels: resting neuromuscular junction;
activated neuromuscular junction.)

1. The process is initiated when a nerve impulse reaches the nerve terminal
and depolarizes the plasma membrane of the terminal. The depolarization
transiently opens voltage-gated Ca²⁺ channels in this presynaptic membrane.
As the Ca²⁺ concentration outside cells is more than 1000 times
greater than the free Ca²⁺ concentration inside, Ca²⁺ flows into the nerve
terminal. The increase in Ca²⁺ concentration in the cytosol of the nerve
terminal triggers the local release of acetylcholine by exocytosis into the
synaptic cleft.

2. The released acetylcholine binds to acetylcholine receptors in the muscle
cell plasma membrane, transiently opening the cation channels associated
with them. The resulting influx of Na⁺ causes a local membrane depolarization.

3. The local depolarization opens voltage-gated Na⁺ channels in this membrane,
allowing more Na⁺ to enter, which further depolarizes the membrane.
This, in turn, opens neighboring voltage-gated Na⁺ channels and
results in a self-propagating depolarization (an action potential) that
spreads to involve the entire plasma membrane (see Figure 11-31).

4. The generalized depolarization of the muscle cell plasma membrane activates
voltage-gated Ca²⁺ channels in the transverse tubules (T tubules,
discussed in Chapter 16) of this membrane.

5. This in turn causes Ca²⁺-release channels in an adjacent region of the sarcoplasmic
reticulum (SR) membrane to open transiently and release Ca²⁺
stored in the SR into the cytosol. The T-tubule and SR membranes are
closely apposed with the two types of channel joined together in a specialized
structure, in which activation of the voltage-sensitive Ca²⁺ channel in
the T-tubule plasma membrane causes a channel conformational change
that is mechanically transmitted to the Ca²⁺-release channel in the SR
membrane, opening it and allowing Ca²⁺ to flow from the SR lumen into
the cytoplasm (see Figure 16-35). The sudden increase in the cytosolic Ca²⁺
concentration causes the myofibrils in the muscle cell to contract.

Whereas the initiation of muscle contraction by a motor neuron is complex, an
even more sophisticated interplay of ion channels is required for a neuron to integrate
a large number of input signals at its synapses and compute an appropriate
output, as we now discuss.

Single Neurons Are Complex Computation Devices 

In the central nervous system, a single neuron can receive inputs from thousands
of other neurons, and it can in turn form synapses with many thousands of other
cells. Several thousand nerve terminals, for example, make synapses on an average
motor neuron in the spinal cord, almost completely covering its cell body and
dendrites (Figure 11-40). Some of these synapses transmit signals from the brain
or spinal cord; others bring sensory information from muscles or from the skin.
The motor neuron must combine the information received from all these sources
and react, either by firing action potentials along its axon or by remaining quiet.

Of the many synapses on a neuron, some tend to excite it, while others inhibit
it. Neurotransmitter released at an excitatory synapse causes a small depolarization
in the postsynaptic membrane called an excitatory postsynaptic potential
(excitatory PSP), whereas neurotransmitter released at an inhibitory synapse
generally causes a small hyperpolarization called an inhibitory PSP. The plasma
membrane of the dendrites and cell body of most neurons contains a relatively
low density of voltage-gated Na⁺ channels, and so an individual excitatory PSP is
generally too small to trigger an action potential. Instead, each incoming signal
initiates a local PSP, which decreases with distance from the site of the synapse. If
signals arrive simultaneously at several synapses in the same region of the dendritic
tree, the total PSP in that neighborhood will be roughly the sum of the individual
PSPs, with inhibitory PSPs making a negative contribution to the total. The
PSPs from each neighborhood spread passively and converge on the cell body.
For long-distance transmission, the combined magnitude of the PSP is then translated,
or encoded, into the frequency of firing of action potentials: the greater the
stimulation (depolarization), the higher the frequency of action potentials.
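
The summation and encoding described above can be illustrated with a toy calculation. This is a sketch under stated assumptions, not a physiological model: the exponential decay of each PSP with distance, the length constant, the threshold, and the linear rate code are all assumed for illustration, and the function names `combined_psp_mv` and `firing_rate_hz` are hypothetical.

```python
import math

def combined_psp_mv(inputs, length_constant_um=200.0):
    """Sum the PSPs that reach the initial segment.

    inputs: list of (amplitude_mV, distance_um) pairs. Excitatory PSPs are
    positive and inhibitory PSPs are negative. Each PSP is attenuated by an
    assumed exponential decay with distance from its synapse.
    """
    return sum(amp * math.exp(-dist / length_constant_um) for amp, dist in inputs)

def firing_rate_hz(psp_mv, threshold_mv=10.0, gain_hz_per_mv=5.0):
    """Assumed rate code: silent below threshold, then linear in the combined PSP."""
    return max(0.0, (psp_mv - threshold_mv) * gain_hz_per_mv)

# Illustrative inputs: 2 mV excitatory and -1.5 mV inhibitory PSPs, 50 um away
weak_input = [(2.0, 50.0)] * 3 + [(-1.5, 50.0)] * 2
strong_input = [(2.0, 50.0)] * 12 + [(-1.5, 50.0)] * 2

for label, inputs in (("weak", weak_input), ("strong", strong_input)):
    psp = combined_psp_mv(inputs)
    print(f"{label}: combined PSP = {psp:.1f} mV, rate = {firing_rate_hz(psp):.1f} Hz")
```

In this sketch the weak input stays below the assumed threshold and produces no firing, whereas the strong input crosses it and produces a firing rate that grows with the combined PSP. Adding inhibitory inputs lowers the total.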
Neuronal Computation Requires a Combination of at Least Three
Kinds of K⁺ Channels

The intensity of stimulation that a neuron receives is encoded by that neuron into
action potential frequency for long-distance transmission. The encoding takes
place at a specialized region of the axonal membrane known as the initial segment,
or axon hillock, at the junction of the axon and the cell body (see Figure
11-40). This membrane is rich in voltage-gated Na⁺ channels; but it also contains
at least four other classes of ion channels (three selective for K⁺ and one selective
for Ca²⁺), all of which contribute to the axon hillock's encoding function. The
three varieties of K⁺ channels have different properties; we shall refer to them as
delayed, rapidly inactivating, and Ca²⁺-activated K⁺ channels.

To understand the need for multiple types of channels, consider first what
would happen if the only voltage-gated ion channels present in the nerve cell
were the Na⁺ channels. Below a certain threshold level of synaptic stimulation,
the depolarization of the initial-segment membrane would be insufficient to
trigger an action potential. With gradually increasing stimulation, the threshold
would be crossed, the Na⁺ channels would open, and an action potential would
fire. The action potential would be terminated by inactivation of the Na⁺ channels.
Before another action potential could fire, these channels would have to recover
from their inactivation. But that would require a return of the membrane voltage
to a very negative value, which would not occur as long as the strong depolarizing
stimulus (from PSPs) was maintained. An additional channel type is needed,
therefore, to repolarize the membrane after each action potential to prepare the
cell to fire again.

The delayed K⁺ channels perform this task, as discussed previously in relation
to the propagation of the action potential (see Figure 11-31). They are voltage-gated,
but because of their slower kinetics they open only during the falling
phase of the action potential, when the Na⁺ channels are inactive. Their opening
permits an efflux of K⁺ that drives the membrane back toward the K⁺ equilibrium
potential, which is so negative that the Na⁺ channels rapidly recover from their
inactivated state. Repolarization of the membrane also closes the delayed K⁺
channels. The initial segment is now reset so that the depolarizing stimulus from
synaptic inputs can fire another action potential. In this way, sustained stimulation
of the dendrites and cell body leads to repetitive firing of the axon.

Figure 11-40 A motor neuron in the spinal cord. (A) Many thousands of nerve
terminals synapse on the cell body and dendrites. These deliver signals from other
parts of the organism to control the firing of action potentials along the single axon of
this large cell. (B) Fluorescence micrograph showing a nerve cell body and its dendrites
stained with a fluorescent antibody that recognizes a cytoskeletal protein (green)
that is not present in axons. Thousands of axon terminals (red) from other nerve
cells (not visible) make synapses on the cell body and dendrites; the terminals are
stained with a fluorescent antibody that recognizes a protein in synaptic vesicles.
(B, courtesy of Olaf Mundigl and Pietro de Camilli.)

Repetitive firing in itself, however, is not enough. The frequency of firing has
to reflect the intensity of stimulation, and a simple system of Na⁺ channels and
delayed K⁺ channels is inadequate for this purpose. Below a certain threshold
level of steady stimulation, the cell will not fire at all; above that threshold level,
it will abruptly begin to fire at a relatively rapid rate. The rapidly inactivating K⁺
channels solve the problem. These, too, are voltage-gated and open when the
membrane is depolarized, but their specific voltage sensitivity and kinetics of
inactivation are such that they act to reduce the rate of firing at levels of stimulation
that are only just above the threshold required for firing. Thus, they remove
the discontinuity in the relationship between the firing rate and the intensity of
stimulation. The result is a firing rate that is proportional to the strength of the
depolarizing stimulus over a very broad range (Figure 11-41).

The process of encoding is usually further modulated by the two other types
of ion channels in the initial segment that were mentioned earlier: voltage-gated
Ca²⁺ channels and Ca²⁺-activated K⁺ channels. They act together to decrease the
response of the cell to an unchanging, prolonged stimulation, a process called
adaptation. These Ca²⁺ channels are similar to the Ca²⁺ channels that mediate the
release of neurotransmitter from presynaptic axon terminals; they open when an
action potential fires, transiently allowing Ca²⁺ into the axon cytosol at the initial
segment.

The Ca²⁺-activated K⁺ channel opens in response to a raised concentration
of Ca²⁺ at the channel's cytoplasmic face (Figure 11-42). Prolonged, strong depolarizing
stimuli will trigger a long train of action potentials, each of which permits
a brief influx of Ca²⁺ through the voltage-gated Ca²⁺ channels, so that local cytosolic
Ca²⁺ concentration gradually builds up to a level high enough to open the
Ca²⁺-activated K⁺ channels. Because the resulting increased permeability of the
membrane to K⁺ makes the membrane harder to depolarize, the delay between
one action potential and the next is increased. In this way, a neuron that is stimulated
continuously for a prolonged period becomes gradually less responsive to
the constant stimulus.
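
The logic of this adaptation mechanism can be sketched with a toy integrate-and-fire model. Everything in it is an assumption made for illustration: the units are arbitrary, the "membrane" simply relaxes toward the net drive, each spike adds a fixed increment of Ca²⁺, and the opposing K⁺ current is taken to be proportional to the accumulated Ca²⁺. The names `simulate_spike_times` and `interspike_intervals` are hypothetical.

```python
def simulate_spike_times(drive=1.5, ca_per_spike=0.1, duration_ms=500.0,
                         dt_ms=0.1, tau_membrane_ms=10.0, tau_ca_ms=200.0,
                         threshold=1.0):
    """Toy integrate-and-fire neuron with Ca2+-dependent adaptation.

    Returns the spike times (ms) produced by a constant depolarizing drive.
    """
    v = 0.0    # membrane depolarization, arbitrary units
    ca = 0.0   # cytosolic Ca2+ near the channels, arbitrary units
    spikes = []
    for step in range(int(round(duration_ms / dt_ms))):
        k_current = ca  # assumed: outward K+ current proportional to Ca2+
        v += dt_ms * (drive - k_current - v) / tau_membrane_ms
        ca -= dt_ms * ca / tau_ca_ms  # Ca2+ is slowly cleared
        if v >= threshold:
            spikes.append(step * dt_ms)
            v = 0.0             # repolarization after the action potential
            ca += ca_per_spike  # brief Ca2+ influx with each action potential
    return spikes

def interspike_intervals(spikes):
    """Delays between successive action potentials."""
    return [later - earlier for earlier, later in zip(spikes, spikes[1:])]

adapting = interspike_intervals(simulate_spike_times())
non_adapting = interspike_intervals(simulate_spike_times(ca_per_spike=0.0))
print("first and last interval with adaptation:", adapting[0], adapting[-1])
print("first and last interval without adaptation:", non_adapting[0], non_adapting[-1])
```

With the Ca²⁺ increment set to zero the model fires at a steady rate, whereas with Ca²⁺ accumulation the intervals between action potentials lengthen during the constant stimulus, which is the qualitative behavior described above.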

Such adaptation, which can also occur by other mechanisms, allows a neuron
(indeed, the nervous system generally) to react sensitively to change, even
against a high background level of steady stimulation. It is one of the computational
strategies that help us, for example, to feel a light touch on the shoulder
and yet ignore the constant pressure of our clothing. We discuss adaptation as a
general feature in cell signaling processes in more detail in Chapter 15.

Other neurons do different computations, reacting to their synaptic inputs in
myriad ways, reflecting the different assortments of ion channels in their membrane.
There are several hundred genes that code for ion channels in the human
genome, with over 150 encoding voltage-gated channels alone. Further complexity
is introduced by alternative splicing of RNA transcripts and assembling channel
subunits in different combinations. Moreover, ion channels are selectively
localized to different sites in the plasma membrane of a neuron. Some K⁺ and
Ca²⁺ channels are concentrated in the dendrites and participate in processing the
input that a neuron receives. As we have seen, other ion channels are located at
the axon's initial segment, where they control action potential firing; and some
ligand-gated channels are distributed over the cell body and, depending on their
ligand occupancy, modulate the cell's general sensitivity to synaptic inputs. The
multiplicity of ion channels and their locations evidently allows each of the many
types of neurons to tune the electrical behavior to the particular tasks they perform.

Figure 11-41 The magnitude of the combined postsynaptic potential (PSP)
is reflected in the frequency of firing of action potentials. The mix of excitatory
and inhibitory PSPs produces a combined PSP at the initial segment. A comparison of
(A) and (B) shows how the firing frequency of an axon increases with an increase in the
combined PSP, while (C) summarizes the general relationship.

One of the crucial properties of the nervous system is its ability to learn and
remember. This property depends in part on the ability of individual synapses to
strengthen or weaken depending on their use, a process called synaptic plasticity.
We end this chapter by considering a remarkable type of ion channel that
has a special role in some forms of synaptic plasticity. It is located at many excitatory
synapses in the central nervous system, where it is gated by both voltage
and the excitatory neurotransmitter glutamate. It is also the site of action of the
psychoactive drug phencyclidine, or angel dust.

Long-Term Potentiation (LTP) in the Mammalian Hippocampus
Depends on Ca²⁺ Entry Through NMDA-Receptor Channels

Practically all animals can learn, but mammals seem to learn exceptionally well
(or so we like to think). In a mammal's brain, the region called the hippocampus
has a special role in learning. When it is destroyed on both sides of the brain, the
ability to form new memories is largely lost, although previous long-established
memories remain. Some synapses in the hippocampus show a striking form of
synaptic plasticity with repeated use: whereas occasional single action potentials
in the presynaptic cells leave no lasting trace, a short burst of repetitive firing
causes long-term potentiation (LTP), such that subsequent single action potentials
in the presynaptic cells evoke a greatly enhanced response in the postsynaptic
cells. The effect lasts hours, days, or weeks, according to the number and intensity
of the bursts of repetitive firing. Only the synapses that were activated exhibit LTP;
synapses that have remained quiet on the same postsynaptic cell are not affected.
However, while the cell is receiving a burst of repetitive stimulation via one set of
synapses, if a single action potential is delivered at another synapse on its surface,
that latter synapse also will undergo LTP, even though a single action potential
delivered there at another time would leave no such lasting trace.

Figure 11-42 Structure of a Ca²⁺-activated K⁺ channel. The channel
contains four identical subunits (which are shown in different colors for clarity).
It is both voltage- and Ca²⁺-gated. The structure shown is a composite of the
cytosolic and membrane portions of the channel that were separately crystallized.
(PDB codes: 2R99, 1LNQ.)

Figure 11-43 The structure of the AMPA receptor. This ionotropic glutamate
receptor (named after the glutamate analog α-Amino 3-hydroxy 5-Methyl 4-isoxazole
Propionic Acid) is the most common mediator of fast, excitatory synaptic
transmission in the central nervous system (CNS). (PDB code: 3KG2.)

The underlying rule in such events seems to be that LTP occurs on any occasion
when a presynaptic cell fires (once or more) at a time when the postsynaptic
membrane is strongly depolarized (either through recent repetitive firing of the
same presynaptic cell or by other means). This rule reflects the behavior of a particular
class of ion channels in the postsynaptic membrane. Glutamate is the main
excitatory neurotransmitter in the mammalian central nervous system, and glutamate-gated
ion channels are the most common of all transmitter-gated channels
in the brain. In the hippocampus, as elsewhere, most of the depolarizing current
responsible for excitatory PSPs is carried by glutamate-gated ion channels called
AMPA receptors, which operate in the standard way (Figure 11-43). But the current
has, in addition, a second and more intriguing component, which is mediated
by a separate subclass of glutamate-gated ion channels known as NMDA receptors,
so named because they are selectively activated by the artificial glutamate
analog N-methyl-D-aspartate. The NMDA-receptor channels are doubly gated,
opening only when two conditions are satisfied simultaneously: glutamate must
be bound to the receptor, and the membrane must be strongly depolarized. The
second condition is required for releasing the Mg²⁺ that normally blocks the resting
channel. This means that NMDA receptors are normally activated only when
AMPA receptors are activated as well and depolarize the membrane. The NMDA
receptors are critical for LTP. When they are selectively blocked with a specific
inhibitor or inactivated genetically, LTP does not occur, even though ordinary
synaptic transmission continues, indicating the importance of NMDA receptors
for LTP induction. Such animals exhibit specific deficits in their learning abilities
but behave almost normally otherwise.
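
The double gating of the NMDA receptor amounts to a logical AND of two conditions, which is what allows the channel to detect coincident presynaptic and postsynaptic activity. The sketch below makes this explicit. It is a deliberate simplification: the all-or-none voltage threshold for relief of the Mg²⁺ block (here -40 mV) is an assumed, illustrative value, and the function names `ampa_open` and `nmda_open` are hypothetical.

```python
def ampa_open(glutamate_bound):
    """AMPA receptors are gated by glutamate alone."""
    return glutamate_bound

def nmda_open(glutamate_bound, membrane_mv, mg_release_mv=-40.0):
    """NMDA receptors open only if glutamate is bound AND the membrane is
    depolarized enough to expel the blocking Mg2+ (threshold is assumed)."""
    mg_block_relieved = membrane_mv >= mg_release_mv
    return glutamate_bound and mg_block_relieved

# Glutamate bound?, membrane potential (mV)
cases = [
    (True, -70.0),   # single presynaptic spike at a resting membrane
    (True, -20.0),   # presynaptic spike while the membrane is depolarized
    (False, -20.0),  # depolarization without presynaptic activity
]
for glutamate, voltage in cases:
    print(glutamate, voltage, "AMPA:", ampa_open(glutamate),
          "NMDA:", nmda_open(glutamate, voltage))
```

Only the second case, in which glutamate release coincides with strong depolarization, opens the NMDA channel in this sketch. This corresponds to the rule that LTP occurs when a presynaptic cell fires while the postsynaptic membrane is strongly depolarized.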

How do NMDA receptors mediate LTP? The answer is that these channels,
when open, are highly permeable to Ca²⁺, which acts as an intracellular signal in
the postsynaptic cell, triggering a cascade of changes that are responsible for LTP.
Thus, LTP is prevented when Ca²⁺ levels are held artificially low in the postsynaptic
cell by injecting the Ca²⁺ chelator EGTA into it, and LTP can be induced by artificially
raising intracellular Ca²⁺ levels in the cell. Among the long-term changes
that increase the sensitivity of the postsynaptic cell to glutamate is the insertion of
new AMPA receptors into the plasma membrane (Figure 11-44). In some forms
of LTP, changes occur in the presynaptic cell as well, so that it releases more glutamate
than normal when it is activated subsequently.

If synapses were capable only of LTP, they would quickly become saturated
and thus be of limited value as an information-storage device. In fact, they also
exhibit long-term depression (LTD), with the long-term effect of reducing the
number of AMPA receptors in the postsynaptic membrane. This feat is
accomplished by degrading AMPA receptors after their selective endocytosis.
Surprisingly, LTD also requires NMDA receptor activation and a rise in Ca²⁺. How does
Ca²⁺ trigger opposite effects at the same synapse? It turns out that this
bidirectional control of synaptic strength depends on the magnitude of the rise in Ca²⁺:
high Ca²⁺ levels activate protein kinases and LTP, whereas modest Ca²⁺ levels
activate protein phosphatases and LTD.

There is evidence that NMDA receptors have an important role in synaptic
plasticity and learning in other parts of the brain, as well as in the hippocampus.
Moreover, they have a crucial role in adjusting the anatomical pattern of
synaptic connections in the light of experience during the development of the nervous
system.

Thus, neurotransmitters released at synapses, besides relaying transient
electrical signals, can also alter concentrations of intracellular mediators that bring
about lasting changes in the efficacy of synaptic transmission. However, it is still
uncertain how these changes endure for weeks, months, or a lifetime in the face
of the normal turnover of cell constituents.



[Figure 11-44 panel label: increased Ca²⁺ in the cytosol induces the postsynaptic
cell to insert new AMPA receptors in the plasma membrane, increasing the cell's
sensitivity to glutamate]

Figure 11-44 The signaling events in long-term potentiation. Although not shown,
transmission-enhancing changes can also occur in the presynaptic nerve terminals in
LTP, which may be induced by retrograde signals from the postsynaptic cell.


Summary 

Ion channels form aqueous pores across the lipid bilayer and allow inorganic ions
of appropriate size and charge to cross the membrane down their electrochemical
gradients at rates about 1000 times greater than those achieved by any known
transporter. The channels are "gated" and usually open transiently in response to
a specific perturbation in the membrane, such as a change in membrane potential
(voltage-gated channels), or the binding of a neurotransmitter to the channel
(transmitter-gated channels).

K⁺-selective leak channels have an important role in determining the resting
membrane potential across the plasma membrane in most animal cells.
Voltage-gated cation channels are responsible for the amplification and
propagation of action potentials in electrically excitable cells, such as neurons and
skeletal muscle cells. Transmitter-gated ion channels convert chemical signals to
electrical signals at chemical synapses. Excitatory neurotransmitters, such as
acetylcholine and glutamate, open transmitter-gated cation channels and thereby
depolarize the postsynaptic membrane toward the threshold level for firing an action
potential. Inhibitory neurotransmitters, such as GABA and glycine, open
transmitter-gated Cl⁻ or K⁺ channels and thereby suppress firing by keeping the
postsynaptic membrane polarized. A subclass of glutamate-gated ion channels, called
NMDA-receptor channels, is highly permeable to Ca²⁺, which can trigger the long-term
changes in synapse efficacy (synaptic plasticity), such as LTP and LTD, that are
thought to be involved in some forms of learning and memory.

Ion channels work together in complex ways to control the behavior of
electrically excitable cells. A typical neuron, for example, receives thousands of
excitatory and inhibitory inputs, which combine by spatial and temporal summation to
produce a combined postsynaptic potential (PSP) at the initial segment of its axon.
The magnitude of the PSP is translated into the rate of firing of action potentials
by a mixture of cation channels in the initial segment membrane.


WHAT WE DON'T KNOW


• How do individual neurons establish
and maintain their characteristic
intrinsic firing properties?

• Even organisms with very simple
nervous systems have dozens of
different K⁺ channels. Why is it
important to have so many?

• Why do cells that are not electrically
active contain voltage-gated ion
channels?

• How are memories stored for so
many years in the human brain?


PROBLEMS 

Which statements are true? Explain why or why not. 

11-1 Transport by transporters can be either active or 
passive, whereas transport by channels is always passive. 

11-2 Transporters saturate at high concentrations of
the transported molecule when all their binding sites are
occupied; channels, on the other hand, do not bind the
ions they transport and thus the flux of ions through a
channel does not saturate.

11-3 The membrane potential arises from movements
of charge that leave ion concentrations practically
unaffected, causing only a very slight discrepancy in the
number of positive and negative ions on the two sides of the
membrane.

Discuss the following problems. 

11-4 Order Ca²⁺, CO₂, ethanol, glucose, RNA, and H₂O
according to their ability to diffuse through a lipid bilayer,
beginning with the one that crosses the bilayer most
readily. Explain your order.

11-5 How is it possible for some molecules to be at
equilibrium across a biological membrane and yet not be
at the same concentration on both sides?

11-6 Ion transporters are "linked" together, not
physically, but as a consequence of their actions. For example,
cells can raise their intracellular pH, when it becomes too
acidic, by exchanging external Na⁺ for internal H⁺, using
a Na⁺-H⁺ antiporter. The change in internal Na⁺ is then
redressed using the Na⁺-K⁺ pump.

A. Can these two transporters, operating together,
normalize both the H⁺ and the Na⁺ concentrations inside
the cell?

B. Does the linked action of these two pumps cause
imbalances in either the K⁺ concentration or the
membrane potential? Why or why not?

11-7 Microvilli increase the surface area of intestinal
cells, providing more efficient absorption of nutrients.
Microvilli are shown in profile and cross section in Figure
Q11-1. From the dimensions given in the figure, estimate
the increase in surface area that microvilli provide (for
the portion of the plasma membrane in contact with the
lumen of the gut) relative to the corresponding surface of a
cell with a "flat" plasma membrane.

[Figure Q11-1 scale bars: 1 µm and 0.1 µm]

Figure Q11-1 Microvilli of intestinal epithelial cells in profile and cross
section (Problem 11-7). (Left panel, from Rippel Electron Microscope
Facility, Dartmouth College; right panel, from David Burgess.)

11-8 According to Newton's laws of motion, an ion
exposed to an electric field in a vacuum would experience
a constant acceleration from the electric driving force, just
as a falling body in a vacuum constantly accelerates due to
gravity. In water, however, an ion moves at constant
velocity in an electric field. Why do you suppose that is?

Figure Q11-2 A "ball" tethered by a "chain" to a voltage-gated K⁺
channel (Problem 11-9).


11-9 In a subset of voltage-gated K⁺ channels, the
N-terminus of each subunit acts like a tethered ball that
occludes the cytoplasmic end of the pore soon after it
opens, thereby inactivating the channel. This
"ball-and-chain" model for the rapid inactivation of voltage-gated
K⁺ channels has been elegantly supported for the shaker
K⁺ channel from Drosophila melanogaster. (The shaker
K⁺ channel in Drosophila is named after a mutant form
that causes excitable behavior; even anesthetized flies
keep twitching.) Deletion of the N-terminal amino acids
from the normal shaker channel gives rise to a channel
that opens in response to membrane depolarization, but
stays open instead of rapidly closing as the normal
channel does. A peptide (MAAVAGLYGLGEDRQHRKKQ) that
corresponds to the deleted N-terminus can inactivate the
open channel at 100 µM.

Is the concentration of free peptide (100 µM) that
is required to inactivate the defective K⁺ channel anywhere
near the local concentration of the tethered ball on a
normal channel? Assume that the tethered ball can explore a
hemisphere [volume = (2/3)πr³] with a radius of 21.4 nm,
which is the length of the polypeptide "chain" (Figure
Q11-2). Calculate the concentration for one ball in this
hemisphere. How does that value compare with the
concentration of free peptide needed to inactivate the
channel?
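
The arithmetic behind Problem 11-9 can be sketched in a few lines of Python. This is only an illustrative outline under the problem's own assumptions (one ball confined to a hemisphere of radius 21.4 nm); the function name `tethered_ball_concentration_molar` and the rounded Avogadro constant are choices made here, not part of the text.

```python
import math

AVOGADRO = 6.022e23  # molecules per mole (rounded value, assumed here)


def tethered_ball_concentration_molar(radius_nm: float, n_molecules: int = 1) -> float:
    """Molar concentration of n_molecules confined to a hemisphere of radius_nm."""
    radius_dm = radius_nm * 1e-8  # 1 nm = 1e-9 m = 1e-8 dm
    # Hemisphere volume (2/3)*pi*r^3; a cubic decimeter is exactly one liter.
    volume_liters = (2.0 / 3.0) * math.pi * radius_dm ** 3
    moles = n_molecules / AVOGADRO
    return moles / volume_liters


if __name__ == "__main__":
    conc = tethered_ball_concentration_molar(21.4)
    print(f"local concentration of one tethered ball: {conc * 1e6:.0f} micromolar")
```

The printed value can then be set beside the free-peptide concentration quoted in the problem to judge whether the two are of the same order.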

11-10 The giant axon of the squid (Figure Q11-3)
occupies a unique position in the history of our understanding
of cell membrane potentials and nerve action. When an
electrode is stuck into an intact giant axon, the membrane
potential registers -70 mV. When the axon, suspended in a
bath of seawater, is stimulated to conduct a nerve impulse,
the membrane potential changes transiently from -70 mV
to +40 mV.



Table Q11-1 Ionic composition of seawater and of the
cytosol in the squid giant axon (Problem 11-10).

Ion      Cytosol    Seawater
Na⁺      65 mM      430 mM
K⁺       344 mM     9 mM


For univalent ions and at 20°C (293 K), the Nernst equation
reduces to

$V = 58\ \mathrm{mV} \times \log_{10}(C_o/C_i)$

where $C_o$ and $C_i$ are the concentrations outside and inside,
respectively.

Using this equation, calculate the potential across
the resting membrane (1) assuming that it is due solely to
K⁺ and (2) assuming that it is due solely to Na⁺. (The Na⁺
and K⁺ concentrations in the axon cytosol and in
seawater are given in Table Q11-1.) Which calculation is closer
to the measured resting potential? Which calculation is
closer to the measured action potential? Explain why these
assumptions approximate the measured resting and action
potentials.
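
As a hedged illustration of how the reduced Nernst equation in Problem 11-10 is applied, the sketch below evaluates it for the K⁺ and Na⁺ concentrations listed in the problem's table. The function name `nernst_mv` and the dictionary layout are illustrative choices, not part of the text; the logarithm is taken to base 10, which is what the 58 mV prefactor assumes.

```python
import math


def nernst_mv(c_out_mM: float, c_in_mM: float, slope_mv: float = 58.0) -> float:
    """Equilibrium potential (mV) for a univalent cation at 20 degrees C."""
    return slope_mv * math.log10(c_out_mM / c_in_mM)


# Concentrations (mM) from the problem's table: cytosol = inside, seawater = outside.
SQUID_AXON = {
    "K+": {"inside": 344.0, "outside": 9.0},
    "Na+": {"inside": 65.0, "outside": 430.0},
}

if __name__ == "__main__":
    for ion, conc in SQUID_AXON.items():
        v = nernst_mv(conc["outside"], conc["inside"])
        print(f"{ion}: {v:+.1f} mV")
```

Comparing the two printed values with the measured -70 mV and +40 mV is then the interpretive part of the problem.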

11-11 Acetylcholine-gated cation channels at the
neuromuscular junction open in response to acetylcholine
released by the nerve terminal and allow Na⁺ ions to enter
the muscle cell, which causes membrane depolarization
and ultimately leads to muscle contraction.

A. Patch-clamp measurements show that young rat
muscles have cation channels that respond to
acetylcholine (Figure Q11-4). How many kinds of channel are there?
How can you tell?

B. For each kind of channel, calculate the number of
ions that enter in one millisecond. (One ampere is a
current of one coulomb per second; one pA equals 10⁻¹²
ampere. An ion with a single charge such as Na⁺ carries a
charge of 1.6 × 10⁻¹⁹ coulomb.)
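
The unit conversion in part B can be sketched as follows. The helper name `ions_per_interval` is hypothetical, and the 1 pA used in the demonstration is an arbitrary illustrative current, not a value read from Figure Q11-4; the actual single-channel currents have to be taken from the figure.

```python
ELEMENTARY_CHARGE_C = 1.6e-19  # coulombs per singly charged ion, as given in the problem


def ions_per_interval(current_pA: float, interval_ms: float = 1.0) -> float:
    """Number of univalent ions carried by a steady current during the interval."""
    current_A = current_pA * 1e-12               # 1 pA = 1e-12 coulomb per second
    charge_C = current_A * (interval_ms * 1e-3)  # charge moved during the interval
    return charge_C / ELEMENTARY_CHARGE_C


if __name__ == "__main__":
    # Illustrative current only; substitute the amplitudes measured from the traces.
    print(f"{ions_per_interval(1.0):.0f} ions per millisecond at 1 pA")
```

Because the relation is linear, the count for any measured amplitude is simply that amplitude (in pA) times the 1 pA result.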


[Figure Q11-4: patch-clamp current traces; scale bars 2 pA and 40 msec]

Figure Q11-4 Patch-clamp measurements of acetylcholine-gated
cation channels in young rat muscle (Problem 11-11).


Figure Q11-3 The squid Loligo (Problem 11-10). This squid is about
15 cm in length.


CHAPTER 12

Intracellular Compartments and Protein Sorting


Unlike a bacterium, which generally consists of a single intracellular
compartment surrounded by a plasma membrane, a eukaryotic cell is elaborately
subdivided into functionally distinct, membrane-enclosed compartments. Each
compartment, or organelle, contains its own characteristic set of enzymes and
other specialized molecules, and complex distribution systems transport specific
products from one compartment to another. To understand the eukaryotic cell, it
is essential to know how the cell creates and maintains these compartments, what
occurs in each of them, and how molecules move between them.

Proteins confer upon each compartment its characteristic structural and
functional properties. They catalyze the reactions that occur there and selectively
transport small molecules into and out of the compartment. For
membrane-enclosed organelles in the cytoplasm, proteins also serve as organelle-specific
surface markers that direct new deliveries of proteins and lipids to the appropriate
organelle.

An animal cell contains about 10 billion (10¹⁰) protein molecules of perhaps
10,000 kinds, and the synthesis of almost all of them begins in the cytosol, the
space of the cytoplasm outside the membrane-enclosed organelles. Each newly
synthesized protein is then delivered specifically to the organelle that requires it.
The intracellular transport of proteins is the central theme of both this chapter
and the next. By tracing the protein traffic from one compartment to another, one
can begin to make sense of the otherwise bewildering maze of intracellular
membranes.


IN THIS CHAPTER 

THE COMPARTMENTALIZATION 
OF CELLS 

THE TRANSPORT OF 
MOLECULES BETWEEN THE 
NUCLEUS AND THE CYTOSOL 

THE TRANSPORT OF PROTEINS 
INTO MITOCHONDRIA AND 
CHLOROPLASTS 

PEROXISOMES 

THE ENDOPLASMIC RETICULUM 


THE COMPARTMENTALIZATION OF CELLS 

In this brief overview of the compartments of the cell and the relationships
between them, we organize the organelles conceptually into a small number of
discrete families, discuss how proteins are directed to specific organelles, and
explain how proteins cross organelle membranes.

All Eukaryotic Cells Have the Same Basic Set of
Membrane-enclosed Organelles

Many vital biochemical processes take place in membranes or on their surfaces.
Membrane-bound enzymes, for example, catalyze lipid metabolism; and oxidative
phosphorylation and photosynthesis both require a membrane to couple the
transport of H⁺ to the synthesis of ATP. In addition to providing increased
membrane area to host biochemical reactions, intracellular membrane systems form
enclosed compartments that are separate from the cytosol, thus creating
functionally specialized aqueous spaces within the cell. In these spaces, subsets of
molecules (proteins, reactants, ions) are concentrated to optimize the biochemical
reactions in which they participate. Because the lipid bilayer of cell membranes is
impermeable to most hydrophilic molecules, the membrane of an organelle must
contain membrane transport proteins to import and export specific metabolites.
Each organelle membrane must also have a mechanism for importing, and
incorporating into the organelle, the specific proteins that make the organelle unique.

Figure 12-1 The major intracellular compartments of an animal cell. The
cytosol (gray), endoplasmic reticulum, Golgi apparatus, nucleus, mitochondrion,
endosome, lysosome, and peroxisome are distinct compartments isolated from the
rest of the cell by at least one selectively permeable membrane (see Movie 9.2).

[Figure 12-1 labels: endoplasmic reticulum with membrane-bound polyribosomes,
mitochondrion, Golgi apparatus, endosome, free ribosomes, lysosome, cytosol,
peroxisome, nucleus, plasma membrane; scale bar 15 µm]


Figure 12-1 illustrates the major intracellular compartments common to
eukaryotic cells. The nucleus contains the genome (aside from mitochondrial and
chloroplast DNA), and it is the principal site of DNA and RNA synthesis. The
surrounding cytoplasm consists of the cytosol and the cytoplasmic organelles
suspended in it. The cytosol constitutes a little more than half the total volume of
the cell, and it is the main site of protein synthesis and degradation. It also
performs most of the cell's intermediary metabolism, that is, the many reactions
that degrade some small molecules and synthesize others to provide the building
blocks for macromolecules (discussed in Chapter 2).

About half the total area of membrane in a eukaryotic cell encloses the
labyrinthine spaces of the endoplasmic reticulum (ER). The rough ER has many
ribosomes bound to its cytosolic surface. Ribosomes are organelles that are not
membrane-enclosed; they synthesize both soluble and integral membrane proteins,
most of which are destined either for secretion to the cell exterior or for other
organelles. We shall see that, whereas proteins are transported into other
membrane-enclosed organelles only after their synthesis is complete, they are
transported into the ER as they are synthesized. This explains why the ER membrane is
unique in having ribosomes tethered to it. The ER also produces most of the lipid
for the rest of the cell and functions as a store for Ca²⁺ ions. Regions of the ER that
lack bound ribosomes are called smooth ER. The ER sends many of its proteins
and lipids to the Golgi apparatus, which often consists of organized stacks of
disc-like compartments called Golgi cisternae. The Golgi apparatus receives lipids and
proteins from the ER and dispatches them to various destinations, usually
covalently modifying them en route.

Mitochondria and chloroplasts generate most of the ATP that cells use to drive
reactions requiring an input of free energy; chloroplasts are a specialized version
of plastids (present in plants, algae, and some protozoa), which can also have
other functions, such as the storage of food or pigment molecules. Lysosomes
contain digestive enzymes that degrade defunct intracellular organelles, as well as
macromolecules and particles taken in from outside the cell by endocytosis. On
the way to lysosomes, endocytosed material must first pass through a series of
organelles called endosomes. Finally, peroxisomes are small vesicular
compartments that contain enzymes used in various oxidative reactions.

In general, each membrane-enclosed organelle performs the same set of basic
functions in all cell types. But to serve the specialized functions of cells, these
organelles vary in abundance and can have additional properties that differ from
cell type to cell type.

On average, the membrane-enclosed compartments together occupy nearly
half the volume of a cell (Table 12-1), and a large amount of intracellular
membrane is required to make them. In liver and pancreatic cells, for example, the
endoplasmic reticulum has a total membrane surface area that is, respectively,
25 times and 12 times that of the plasma membrane (Table 12-2). The
membrane-enclosed organelles are packed tightly in the cytoplasm, and, in terms of
area and mass, the plasma membrane is only a minor membrane in most
eukaryotic cells (Figure 12-2).
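
The figures quoted in this paragraph can be checked against the entries of Tables 12-1 and 12-2 with a short calculation. The sketch below is only an illustration: the dictionary names are invented here, and the "<1" entry for pancreatic smooth ER is represented by its upper bound of 1, which is an assumption.

```python
# Percentage of total cell membrane, copied from Table 12-2.
# Assumption: the "<1" entry for pancreatic smooth ER is entered as its upper bound, 1.
MEMBRANE_PERCENT = {
    "hepatocyte": {"plasma": 2.0, "rough_er": 35.0, "smooth_er": 16.0},
    "pancreatic_exocrine": {"plasma": 5.0, "rough_er": 60.0, "smooth_er": 1.0},
}

# Percentage of total cell volume in a hepatocyte, copied from Table 12-1.
VOLUME_PERCENT = {
    "cytosol": 54, "mitochondria": 22, "rough_er": 9, "smooth_er_and_golgi": 6,
    "nucleus": 6, "peroxisomes": 1, "lysosomes": 1, "endosomes": 1,
}


def er_to_plasma_ratio(cell_type: str) -> float:
    """Total ER membrane area relative to plasma membrane area."""
    row = MEMBRANE_PERCENT[cell_type]
    return (row["rough_er"] + row["smooth_er"]) / row["plasma"]


def organelle_volume_percent() -> int:
    """Share of cell volume in membrane-enclosed compartments (all but cytosol)."""
    return sum(v for name, v in VOLUME_PERCENT.items() if name != "cytosol")


if __name__ == "__main__":
    for cell in MEMBRANE_PERCENT:
        print(f"{cell}: ER/plasma membrane ratio ~ {er_to_plasma_ratio(cell):.1f}")
    print(f"membrane-enclosed compartments: {organelle_volume_percent()}% of volume")
```

Under these assumptions the ratios come out close to the "25 times and 12 times" stated above, and the non-cytosolic compartments sum to a little under half of the cell volume.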

The abundance and shape of membrane-enclosed organelles are regulated to
meet the needs of the cell. This is particularly apparent in cells that are highly
specialized and therefore disproportionately rely on specific organelles. Plasma cells,
for example, which secrete their own weight every day in antibody molecules into
the bloodstream, contain vastly amplified amounts of rough ER, which is found
in large, flat sheets. Cells that specialize in lipid synthesis also expand their ER,
but in this case the organelle forms a network of convoluted tubules. Moreover,
membrane-enclosed organelles are often found in characteristic positions in the
cytoplasm. In most cells, for example, the Golgi apparatus is located close to the
nucleus, whereas the network of ER tubules extends from the nucleus throughout
the entire cytosol. These characteristic distributions depend on interactions of the
organelles with the cytoskeleton. The localization of both the ER and the Golgi
apparatus, for instance, depends on an intact microtubule array; if the
microtubules are experimentally depolymerized with a drug, the Golgi apparatus
fragments and disperses throughout the cell, and the ER network collapses toward the
cell center (discussed in Chapter 16). The size, shape, composition, and location
are all important and regulated features of these organelles that ultimately
contribute to the organelle's function.

Evolutionary Origins May Help Explain the Topological 
Relationships of Organelles 

To understand the relationships between the compartments of the cell, it is
helpful to consider how they might have evolved. The precursors of the first eukaryotic
cells are thought to have been relatively simple cells that, like most bacterial and


Table 12-1 Relative Volumes Occupied by the Major Intracellular Compartments
in a Liver Cell (Hepatocyte)

Intracellular compartment                    Percentage of total cell volume
Cytosol                                      54
Mitochondria                                 22
Rough ER cisternae                           9
Smooth ER cisternae plus Golgi cisternae     6
Nucleus                                      6
Peroxisomes                                  1
Lysosomes                                    1
Endosomes                                    1

Table 12-2 Relative Amounts of Membrane Types in Two Kinds of Eukaryotic Cells

                                Percentage of total cell membrane
Membrane type                   Liver hepatocyte*    Pancreatic exocrine cell*
Plasma membrane                 2                    5
Rough ER membrane               35                   60
Smooth ER membrane              16                   <1
Golgi apparatus membrane        7                    10
Mitochondria
  Outer membrane                7                    4
  Inner membrane                32                   17
Nucleus
  Inner membrane                0.2                  0.7
Secretory vesicle membrane      Not determined       3
Lysosome membrane               0.4                  Not determined
Peroxisome membrane             0.4                  Not determined
Endosome membrane               0.4                  Not determined

*These two cells are of very different sizes: the average hepatocyte has a volume
of about 5000 µm³ compared with 1000 µm³ for the pancreatic exocrine cell. Total
cell membrane areas are estimated at about 110,000 µm² and 13,000 µm², respectively.


Figure 12-2 An electron micrograph of part of a liver cell seen in cross
section. Examples of most of the major intracellular organelles are indicated.
(Courtesy of Daniel S. Friend.)

[Figure 12-2 labels: rough endoplasmic reticulum, nucleus, lysosomes,
mitochondrion, peroxisome; scale bar 5 µm]


archaeal cells, have a plasma membrane but no internal membranes. The plasma
membrane in such cells provides all membrane-dependent functions, including
the pumping of ions, ATP synthesis, protein secretion, and lipid synthesis.
Typical present-day eukaryotic cells are 10-30 times larger in linear dimension and
1000-10,000 times greater in volume than a typical bacterium such as E. coli. The
profusion of internal membranes can be regarded, in part, as an adaptation to
this increase in size: the eukaryotic cell has a much smaller ratio of surface area to
volume, and its plasma membrane therefore presumably has too small an area to
sustain the many vital functions that membranes perform. The extensive internal
membrane systems of a eukaryotic cell alleviate this problem.
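
The surface-to-volume argument can be made concrete with a simple scaling sketch. It assumes that the cell keeps the same shape as it grows, so that area scales with the square and volume with the cube of the linear dimension; this geometric idealization and the function name `scaling_factors` are assumptions made here for illustration.

```python
def scaling_factors(linear_factor: float) -> dict:
    """How area, volume, and their ratio change when every length grows by linear_factor.

    Assumes the shape is unchanged, so area ~ L^2 and volume ~ L^3.
    """
    return {
        "area": linear_factor ** 2,
        "volume": linear_factor ** 3,
        "area_to_volume": 1.0 / linear_factor,
    }


if __name__ == "__main__":
    for factor in (10, 30):
        s = scaling_factors(factor)
        print(
            f"{factor}x longer: area x{s['area']:.0f}, volume x{s['volume']:.0f}, "
            f"surface-to-volume ratio x{s['area_to_volume']:.3f}"
        )
```

A cell ten times larger in linear dimension thus has a thousandfold greater volume but only a hundredfold greater surface, leaving one-tenth as much plasma membrane per unit volume. The ranges quoted in the text are order-of-magnitude estimates, so the cube of the upper linear factor need not match the upper volume figure exactly.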

The evolution of internal membranes evidently went hand-in-hand with the
specialization of membrane function. A hypothetical scheme for how the first
eukaryotic cells, with a nucleus and ER, might have evolved by the invagination
and pinching off of the plasma membrane of an ancestral cell is illustrated in
Figure 12-3. This process would create membrane-enclosed organelles with an
interior or lumen that is topologically equivalent to the exterior of the cell. We
shall see that this topological relationship holds for all of the organelles involved
in the secretory and endocytic pathways, including the ER, Golgi apparatus,
endosomes, lysosomes, and peroxisomes. We can therefore think of all of these
organelles as members of the same topologically equivalent compartment. As we
discuss in detail in the next chapter, their interiors communicate extensively with
one another and with the outside of the cell via transport vesicles, which bud off
from one organelle and fuse with another (Figure 12-4).

As described in Chapter 14, mitochondria and plastids differ from the other
membrane-enclosed organelles because they contain their own genomes. The
nature of these genomes, and the close resemblance of the proteins in these
organelles to those in some present-day bacteria, strongly suggest that
mitochondria and plastids evolved from bacteria that were engulfed by other cells
with which they initially lived in symbiosis (see Figures 1-29 and 1-31): the inner
membrane of mitochondria and plastids presumably corresponds to the original
plasma membrane of the bacterium, while the lumen of these organelles evolved
from the bacterial cytosol. Like the bacteria from which they were derived,
both mitochondria and plastids are enclosed by a double membrane and they
remain isolated from the extensive vesicular traffic that connects the interiors of
most of the other membrane-enclosed organelles to each other and to the outside
of the cell.

Figure 12-3 One suggested pathway for the evolution of the eukaryotic
cell and its internal membranes. As discussed in Chapter 1, there is evidence
that the nuclear genome of a eukaryotic cell evolved from an ancient archaeon.
For example, clear homologs of actin, tubulin, histones, and the nuclear DNA
replication system are found in archaea, but not in bacteria. Thus, it is now thought
that the first eukaryotic cells arose when an ancient anaerobic archaeon joined
forces with an aerobic bacterium roughly 1.6 billion years ago. As indicated, the
nuclear envelope may have originated from an invagination of the plasma membrane
of this ancient archaeon, an invagination that protected its chromosome while still
allowing access of the DNA to the cytosol (as required for DNA to direct protein
synthesis). This envelope may have later pinched off completely from the plasma
membrane, so as to produce a separate nuclear compartment surrounded by a
double membrane. Because this double membrane is penetrated by nuclear pore
complexes, the nuclear compartment is topologically equivalent to the cytosol. In
contrast, the lumen of the ER is continuous with the space between the inner and
outer nuclear membranes, and it is topologically equivalent to the extracellular
space (see Figure 12-4). (Adapted from J. Martijn and T.J.G. Ettema, Biochem. Soc.
Trans. 41:451-457, 2013.)


The evolutionary schemes just described group the intracellular
compartments in eukaryotic cells into four distinct families: (1) the nucleus and the
cytosol, which communicate with each other through nuclear pore complexes and are
thus topologically continuous (although functionally distinct); (2) all organelles
that function in the secretory and endocytic pathways, including the ER, Golgi
apparatus, endosomes, and lysosomes, the numerous classes of transport
intermediates such as transport vesicles that move between them, and peroxisomes;
(3) the mitochondria; and (4) the plastids (in plants only).


Proteins Can Move Between Compartments in Different Ways 

The synthesis of all proteins begins on ribosomes in the cytosol, except for the few
that are synthesized on the ribosomes of mitochondria and plastids. Their
subsequent fate depends on their amino acid sequence, which can contain sorting
signals that direct their delivery to locations outside the cytosol or to organelle
surfaces. Some proteins do not have a sorting signal and consequently remain in
the cytosol as permanent residents. Many others, however, have specific sorting
signals that direct their transport from the cytosol into the nucleus, the ER,
mitochondria, plastids, or peroxisomes; sorting signals can also direct the transport of
proteins from the ER to other destinations in the cell.
To understand the general principles by which sorting signals operate, it is
important to distinguish three fundamentally different ways by which proteins
move from one compartment to another. These three mechanisms are described
below, and the transport steps at which they operate are outlined in Figure 12-5.
We discuss the first two mechanisms (gated transport and transmembrane
transport) in this chapter, and the third (vesicular transport, green arrows in Figure
12-5) in Chapter 13.

1. In gated transport, proteins and RNA molecules move between the cytosol
and the nucleus through nuclear pore complexes in the nuclear envelope.
The nuclear pore complexes function as selective gates that support the
active transport of specific macromolecules and macromolecular
assemblies between the two topologically equivalent spaces, although they also
allow free diffusion of smaller molecules.

2. In protein translocation, transmembrane protein translocators directly
transport specific proteins across a membrane from the cytosol into a
space that is topologically distinct. The transported protein molecule
usually must unfold to snake through the translocator. The initial transport
of selected proteins from the cytosol into the ER lumen or mitochondria,
for example, occurs in this way. Integral membrane proteins often use the
same translocators but translocate only partially across the membrane, so
that the protein becomes embedded in the lipid bilayer.

3. In vesicular transport, membrane-enclosed transport intermediates— 
which may be small, spherical transport vesicles or larger, irregularly 
shaped organelle fragments—ferry pro teins from one topologically équiva¬ 
lent compartment to another. The transport vesicles and fragments become 
loaded with a cargo of molécules derived from the lumen of one compart¬ 
ment as they bud and pinch off from its membrane; they discharge their 
cargo into a second compartment by fusing with the membrane enclos- 
ing that compartment (Figure 12-6). The transfer of soluble proteins from 
the ER to the Golgi apparatus, for example, occurs in this way. Because the 

Figure 12-5 A simplified “roadmap” of protein traffic within a 
eukaryotic cell. Proteins can move from one compartment to another 
by gated transport (red), protein translocation (blue), or vesicular transport 
(green). The sorting signais that direct a given protein’s movement through 
the System, and thereby détermine its eventual location in the cell, are 
contained in each protein’s amino acid sequence. The journey begins with the 
synthesis of a protein on a ribosome in the cytosol and, for many proteins, 
terminâtes when the protein reaches its final destination. Other proteins 
shuttle back and forth between the nucléus and cytosol. At each intermediate 
station (boxes), a decision is made as to whether the protein is to be retained 
in that compartment or transported further. A sorting signal may direct either 
rétention in or exit from a compartment. 

We shall refer to this figure often as a guide in this chapter and the next, 
highlighting in color the particular pathway being discussed. 


Figure 12-4 Topologically équivalent 
compartments in the secretory and 
endocytic pathways in a eukaryotic 
cell. Compartments are said to be 
topologically équivalent if they can 
communicate with one another, in the 
sense that molécules can get from one 
to the other without having to cross a 
membrane. Topologically équivalent spaces 
are shown in red. (A) Molécules can be 
carried from one compartment to another 
topologically équivalent compartment 
by vesicles that bud from one and fuse 
with the other. (B) In principle, cycles of 
membrane budding and fusion permit 
the lumen of any of the organelles shown 
to communicate with any other and with 
the cell exterior by means of transport 
vesicles. Blue arrows indicate the 
extensive outbound and inbound vesicular 
traffic (discussed in Chapter 13). Some 
organelles, most notably mitochondria and 
(in plant cells) plastids, do not take part in 
this communication and are isolated from 
the vesicular traffic between organelles 
shown here. 

Figure 12-6 Vesicle budding and fusion during vesicular transport. 
Transport vesicles bud from one compartment (donor) and fuse with another 
topologically equivalent (target) compartment. In the process, soluble 
components (red dots) are transferred from lumen to lumen. Note that 
membrane is also transferred and that the original orientation of both proteins 
and lipids in the donor compartment membrane is preserved in the target 
compartment membrane. Thus, membrane proteins retain their asymmetric 
orientation, with the same domains always facing the cytosol. 


transported proteins do not cross a membrane, vesicular transport can 
move proteins only between compartments that are topologically equivalent 
(see Figure 12-4). 

Each mode of protein transfer is usually guided by sorting signals in the 
transported protein, which are recognized by complementary sorting receptors. If a 
large protein is to be imported into the nucleus, for example, it must possess a 
sorting signal that receptor proteins recognize to guide it through the nuclear pore 
complex. If a protein is to be transferred directly across a membrane, it must 
possess a sorting signal that the translocator recognizes. Likewise, if a protein is to be 
loaded into a certain type of vesicle or retained in certain organelles, a 
complementary receptor in the appropriate membrane must recognize its sorting signal. 

Signal Sequences and Sorting Receptors Direct Proteins to the 
Correct Cell Address 

Most protein sorting signals involved in transmembrane transport reside in 
a stretch of amino acid sequence, typically 15-60 residues long. Such signal 
sequences are often found at the N-terminus of the polypeptide chain, and in 
many cases specialized signal peptidases remove the signal sequence from the 
finished protein once the sorting process is complete. Signal sequences can also 
be internal stretches of amino acids, which remain part of the protein. Such 
signals are used in gated transport into the nucleus. Sorting signals can also be 
composed of multiple internal amino acid sequences that form a specific 
three-dimensional arrangement of atoms on the protein's surface; such signal patches are 
sometimes used for nuclear import and in vesicular transport. 

Each signal sequence specifies a particular destination in the cell. Proteins 
destined for initial transfer to the ER usually have a signal sequence at their 
N-terminus that characteristically includes a sequence composed of about 5-10 
hydrophobic amino acids. Many of these proteins will in turn pass from the ER to 
the Golgi apparatus, but those with a specific signal sequence of four amino acids 
at their C-terminus are recognized as ER residents and are returned to the ER. 
Proteins destined for mitochondria have signal sequences of yet another type, in 
which positively charged amino acids alternate with hydrophobic ones. Finally, 
many proteins destined for peroxisomes have a signal sequence of three 
characteristic amino acids at their C-terminus. 
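
The rules in the paragraph above can be turned into a toy classifier. The sketch below is only an illustration under stated assumptions: sequences are written in one-letter amino acid code, the function names, the hydrophobic residue set, the 30-residue N-terminal window, and the lysine/arginine cluster threshold are all hypothetical choices, and mitochondrial and plastid signals are deliberately left out because the text gives no simple rule for them. It is not a validated prediction method.

```python
# Toy classifier for some of the sorting signals described in the text.
# Every threshold and name below is an illustrative assumption.

HYDROPHOBIC = set("AVLIMFW")   # assumed set of hydrophobic residues
BASIC = set("KR")              # lysine and arginine (positively charged)


def longest_hydrophobic_run(seq: str) -> int:
    """Length of the longest uninterrupted stretch of hydrophobic residues."""
    best = run = 0
    for aa in seq:
        run = run + 1 if aa in HYDROPHOBIC else 0
        best = max(best, run)
    return best


def has_basic_cluster(seq: str, window: int = 5, min_basic: int = 4) -> bool:
    """True if some window of residues consists mostly of Lys/Arg."""
    return any(
        sum(aa in BASIC for aa in seq[i:i + window]) >= min_basic
        for i in range(max(1, len(seq) - window + 1))
    )


def guess_destination(seq: str) -> str:
    """Apply the rules in a fixed, assumed priority order."""
    seq = seq.upper()
    if seq.endswith("KDEL"):                      # four C-terminal residues
        return "return to ER"
    if seq.endswith("SKL"):                       # three C-terminal residues
        return "peroxisome"
    if longest_hydrophobic_run(seq[:30]) >= 5:    # N-terminal hydrophobic stretch
        return "ER"
    if has_basic_cluster(seq):                    # Lys/Arg-rich patch
        return "nucleus"
    return "no signal recognized by this sketch"


if __name__ == "__main__":
    # Sequences transcribed from Table 12-3 into one-letter code.
    print(guess_destination("MMSFVSLLLVGILFWATEAEQLTKCEVFQ"))
    print(guess_destination("PPKKKRKV"))
```

Because real signals are recognized by their physical properties rather than by exact sequence, a rule-based lookup like this will misclassify many proteins; it only makes the logic of "one signal, one destination" concrete.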

Table 12-3 presents some specific signal sequences. Experiments in which the 
peptide is transferred from one protein to another by genetic engineering 
techniques have demonstrated the importance of each of these signal sequences for 
protein targeting. Placing the N-terminal ER signal sequence at the beginning 
of a cytosolic protein, for example, redirects the protein to the ER; removing or 
mutating the signal sequence of an ER protein causes its retention in the cytosol. 
Signal sequences are therefore both necessary and sufficient for protein targeting. 
Even though their amino acid sequences can vary greatly, the signal sequences of 
proteins having the same destination are functionally interchangeable; physical 
properties, such as hydrophobicity, often seem to be more important in the 
signal-recognition process than the exact amino acid sequence. 

Signal sequences are recognized by complementary sorting receptors that 
guide proteins to their appropriate destination, where the receptors unload their 
cargo. The receptors function catalytically: after completing one round of 
targeting, they return to their point of origin to be reused. Most sorting receptors 
recognize classes of proteins rather than an individual protein species. They can 
therefore be viewed as public transportation systems, dedicated to delivering 
numerous different components to their correct location in the cell. 

Table 12-3 Some Typical Signal Sequences 

Function of signal sequence    Example of signal sequence 

Import into nucleus            -Pro-Pro-Lys-Lys-Lys-Arg-Lys-Val- 

Export from nucleus            -Met-Glu-Glu-Leu-Ser-Gln-Ala-Leu-Ala-Ser-Ser-Phe- 

Import into mitochondria       +H3N-Met-Leu-Ser-Leu-Arg-Gln-Ser-Ile-Arg-Phe-Phe-Lys-Pro-Ala-Thr- 
                               Arg-Thr-Leu-Cys-Ser-Ser-Arg-Tyr-Leu-Leu- 

Import into plastid            +H3N-Met-Val-Ala-Met-Ala-Met-Ala- -Leu-Gln-Ser-Ser-Met-Ser-Ser- 
                               Leu-Ser-Leu-Ser-Ser-Asn-Ser-Phe-Leu-Gly-Gln-Pro-Leu-Ser-Pro-Ile- 
                               Thr-Leu-Ser-Pro-Phe-Leu-Gln-Gly- 

Import into peroxisomes        -Ser-Lys-Leu-COO- 

Import into ER                 +H3N-Met-Met-Ser-Phe-Val-Ser-Leu-Leu-Leu-Val-Gly-Ile-Leu-Phe-Trp- 
                               Ala-Thr-Glu-Ala-Glu-Gln-Leu-Thr-Lys-Cys-Glu-Val-Phe-Gln- 

Return to ER                   -Lys-Asp-Glu-Leu-COO- 

Some characteristic features of the different classes of signal sequences are highlighted in color. Where they are known to be important for 
the function of the signal sequence, positively charged amino acids are shown in red and negatively charged amino acids are shown in green. 
Similarly, important hydrophobic amino acids are shown in orange and important hydroxylated amino acids are shown in blue. +H3N indicates the 
N-terminus of a protein; COO- indicates the C-terminus. 

Most Organelles Cannot Be Constructed De Novo: They Require 
Information in the Organelle Itself 

When a cell reproduces by division, it has to duplicate its organelles, in addition to 
its chromosomes. In general, cells do this by incorporating new molecules into the 
existing organelles, thereby enlarging them; the enlarged organelles then divide 
and are distributed to the two daughter cells. Thus, each daughter cell inherits a 
complete set of specialized cell membranes from its mother. This inheritance is 
essential because a cell could not make such membranes from scratch. If the ER 
were completely removed from a cell, for example, how could the cell reconstruct 
it? As we discuss later, the membrane proteins that define the ER and perform 
many of its functions are themselves products of the ER. A new ER could not be 
made without an existing ER or, at least, a membrane that specifically contains 
the protein translocators required to import selected proteins into the ER from the 
cytosol (including the ER-specific translocators themselves). The same is true for 
mitochondria and plastids. 

Thus, it seems that the information required to construct an organelle does not 
reside exclusively in the DNA that specifies the organelle's proteins. Information 
in the form of at least one distinct protein that preexists in the organelle 
membrane is also required, and this information is passed from parent cell to daughter 
cells in the form of the organelle itself. Presumably, such information is essential 
for the propagation of the cell's compartmental organization, just as the 
information in DNA is essential for the propagation of the cell's nucleotide and amino acid 
sequences. 

As we discuss in more detail in Chapter 13, however, the ER buds off a 
constant stream of transport vesicles that incorporate only a subset of ER proteins and 
therefore have a composition different from the ER itself. Similarly, the plasma 
membrane constantly buds off various types of specialized endocytic vesicles. 
Thus, some organelles can form from other organelles and do not have to be 
inherited at cell division. 

Summary 

Eukaryotic cells contain intracellular membrane-enclosed organelles that make up 
nearly half the cell's total volume. The main ones present in all eukaryotic cells are 
the endoplasmic reticulum, Golgi apparatus, nucleus, mitochondria, lysosomes, 
endosomes, and peroxisomes; plant cells also contain plastids such as chloroplasts. 
These organelles contain distinct sets of proteins, which mediate each organelle's 
unique function. 

Each newly synthesized organelle protein must find its way from a ribosome in 
the cytosol, where the protein is made, to the organelle where it functions. It does 
so by following a specific pathway, guided by sorting signals in its amino acid 
sequence that function as either signal sequences or signal patches. Sorting signals 
are recognized by complementary sorting receptors, which deliver the protein to the 
appropriate target organelle. Proteins that function in the cytosol do not contain 
sorting signals and therefore remain there after they are synthesized. 

During cell division, organelles such as the ER and mitochondria are distributed 
to each daughter cell. These organelles contain information that is required for their 
construction, and so they cannot be made de novo. 


THE TRANSPORT OF MOLECULES BETWEEN THE 
NUCLEUS AND THE CYTOSOL 


The nuclear envelope encloses the DNA and defines the nuclear compartment. 
This envelope consists of two concentric membranes, which are penetrated by 
nuclear pore complexes (Figure 12-7). Although the inner and outer nuclear 
membranes are continuous, they maintain distinct protein compositions. The 
inner nuclear membrane contains proteins that act as binding sites for 
chromosomes and for the nuclear lamina, a protein meshwork that provides structural 
support for the nuclear envelope; the lamina also acts as an anchoring site for 
chromosomes and the cytoplasmic cytoskeleton (via protein complexes that span 
the nuclear envelope). The inner membrane is surrounded by the outer nuclear 
membrane, which is continuous with the membrane of the ER. Like the ER 
membrane (discussed later), the outer nuclear membrane is studded with ribosomes 
engaged in protein synthesis. The proteins made on these ribosomes are 
transported into the space between the inner and outer nuclear membranes (the 
perinuclear space), which is continuous with the ER lumen (see Figure 12-7). 

Bidirectional traffic occurs continuously between the cytosol and the nucleus. 
The many proteins that function in the nucleus (including histones, DNA 
polymerases, RNA polymerases, transcriptional regulators, and RNA-processing 
proteins) are selectively imported into the nuclear compartment from the cytosol, 
where they are made. At the same time, almost all RNAs (including mRNAs, 
rRNAs, tRNAs, miRNAs, and snRNAs) are synthesized in the nuclear 
compartment and then exported to the cytosol. Like the import process, the export process 
is selective; mRNAs, for example, are exported only after they have been properly 
modified by RNA-processing reactions in the nucleus. In some cases, the 
transport process is complex. Ribosomal proteins, for instance, are made in the cytosol 
and imported into the nucleus, where they assemble with newly made ribosomal 
RNA into particles. The particles are then exported to the cytosol, where they 
assemble into ribosomes. Each of these steps requires selective transport across 
the nuclear envelope. 





Nuclear Pore Complexes Perforate the Nuclear Envelope 

Large and elaborate nuclear pore complexes (NPCs) perforate the nuclear 
envelope in all eukaryotes. Each NPC is composed of a set of approximately 30 
different proteins, or nucleoporins. Reflecting the high degree of internal symmetry, 
each nucleoporin is present in multiple copies, resulting in 500-1000 protein 
molecules in the fully assembled NPC, with an estimated mass of 66 million daltons in 
yeast and 125 million daltons in vertebrates (Figure 12-8). Most nucleoporins are 
composed of repetitive protein domains of only a few different types, which have 
evolved through extensive gene duplication. Some of the scaffold nucleoporins 
(see Figure 12-8) are structurally related to vesicle coat protein complexes, such 
as clathrin and COPII coatomer (discussed in Chapter 13), which shape transport 
vesicles; and one protein is used as a common building block in both NPCs and 
vesicle coats. These similarities suggest a common evolutionary origin for NPCs 
and vesicle coats: they may derive from an early membrane-bending protein 
module that helped shape the elaborate membrane systems of eukaryotic cells, 
and in present-day cells stabilize the sharp membrane bends required to form a 
nuclear pore. 

The nuclear envelope of a typical mammalian cell contains 3000-4000 NPCs, 
although that number varies widely, from a few hundred in glial cells to almost 
20,000 in Purkinje neurons. The total traffic that passes through each NPC is 
enormous: each NPC can transport up to 1000 macromolecules per second and can 
transport in both directions at the same time. How it coordinates the bidirectional 
flow of macromolecules to avoid congestion and head-on collisions is not known. 

Each NPC contains aqueous passages, through which small water-soluble 
molecules can diffuse passively. Researchers have determined the effective size 
of these passages by injecting labeled water-soluble molecules of different sizes 
into the cytosol and then measuring their rate of diffusion into the nucleus. Small 
molecules (5000 daltons or less) diffuse in so fast that we can consider the nuclear 
envelope freely permeable to them. Large proteins, however, diffuse in much 
more slowly, and the larger a protein, the more slowly it passes through the NPC. 
Proteins larger than 60,000 daltons cannot enter by passive diffusion. This size 
cut-off to free diffusion is thought to result from the NPC structure (see Figure 
12-8). The channel nucleoporins with extensive unstructured regions form a 
disordered tangle (much like a kelp bed in the ocean) that restricts the diffusion of 
large macromolecules while allowing smaller molecules to pass. 
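
The size thresholds quoted above can be summarized as a simple lookup. This is a minimal sketch, assuming sharp cut-offs at the two masses given in the text; the function name and the three category labels are hypothetical, and in a real cell the transition is gradual rather than stepwise.

```python
# Mass thresholds taken from the text; labels and sharp boundaries are assumptions.
FREE_DIFFUSION_MAX_DA = 5_000        # "5000 daltons or less": effectively free passage
PASSIVE_DIFFUSION_MAX_DA = 60_000    # above this, no passive entry


def npc_entry_mode(mass_da: float) -> str:
    """Classify how a molecule of the given mass could cross the NPC."""
    if mass_da <= FREE_DIFFUSION_MAX_DA:
        return "free diffusion"
    if mass_da <= PASSIVE_DIFFUSION_MAX_DA:
        return "slow passive diffusion"
    return "receptor-mediated transport required"


if __name__ == "__main__":
    for mass in (500, 30_000, 150_000):
        print(mass, "->", npc_entry_mode(mass))
```

The lookup describes only what passive diffusion permits; it says nothing about which route a given protein actually takes.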

Because many cell proteins are too large to diffuse passively through the NPCs, 
the nuclear compartment and the cytosol can maintain different protein 
compositions. Mature cytosolic ribosomes, for example, are about 30 nm in diameter 
and thus cannot diffuse through the NPC, confining protein synthesis to the 
cytosol. But how does the nucleus export newly made ribosomal subunits or import 
large molecules, such as DNA polymerases and RNA polymerases, which have 
subunit molecular masses of 100,000-200,000 daltons? As we discuss next, these 
and most other transported protein and RNA molecules bind to specific receptor 
proteins that actively ferry large molecules through NPCs. Even small proteins like 
histones frequently use receptor-mediated mechanisms to cross the NPC, thereby 
increasing transport efficiency. 

Nuclear Localization Signals Direct Nuclear Proteins to the 
Nucleus 

When proteins are experimentally extracted from the nucleus and reintroduced 
into the cytosol, even the very large ones reaccumulate efficiently in the nucleus. 
Sorting signals called nuclear localization signals (NLSs) are responsible for the 
selectivity of this active nuclear import process. The signals have been precisely 
defined by using recombinant DNA technology for numerous nuclear proteins, as 
well as for proteins that enter the nucleus only transiently (Figure 12-9). In many 
nuclear proteins, the signals consist of one or two short sequences that are rich 
in the positively charged amino acids lysine and arginine (see Table 12-3, p. 648), 
with the precise sequence varying for different proteins. Other nuclear proteins 
contain different signals, some of which are not yet characterized. 

Nuclear localization signals can be located almost anywhere in the amino acid 
sequence and are thought to form loops or patches on the protein surface. Many 
function even when linked as short peptides to lysine side chains on the surface 
of a cytosolic protein, suggesting that the precise location of the signal within the 
amino acid sequence of a nuclear protein is not important. Moreover, as long 
as one of the protein subunits of a multicomponent complex displays a nuclear 
localization signal, the entire complex will be imported into the nucleus. 

Figure 12-7 The nuclear envelope. The 
double-membrane envelope is penetrated 
by pores in which nuclear pore complexes 
(not shown) are positioned. The outer 
nuclear membrane is continuous with 
the endoplasmic reticulum (ER). The 
ribosomes that are normally bound to the 
cytosolic surface of the ER membrane and 
outer nuclear membrane are not shown. 
The nuclear lamina is a fibrous protein 
meshwork underlying the inner membrane. 

One can visualize the transport of nuclear proteins through NPCs by coating 
gold particles with a nuclear localization signal, injecting the particles into the 
cytosol, and then following their fate by electron microscopy (Figure 12-10). The 
particles bind to the tentaclelike fibrils that extend from the scaffold nucleoporins 
at the rim of the NPC into the cytosol, and then proceed through the center of the 
NPC. Presumably, the unstructured regions of the nucleoporins that form a 
diffusion barrier for large molecules (mentioned earlier) are pushed away to allow the 
coated gold particles to squeeze through. 

Macromolecular transport across NPCs differs fundamentally from the 
transport of proteins across the membranes of other organelles, in that it occurs 
through a large, expandable, aqueous pore, rather than through a protein 
transporter spanning one or more lipid bilayers. For this reason, fully folded nuclear 
proteins can be transported into the nucleus through an NPC, and newly formed 
ribosomal subunits are transported out of the nucleus as an assembled particle. 
By contrast, proteins have to be extensively unfolded to be transported into most 
other organelles, as we discuss later. 

Figure 12-8 The arrangement of NPCs in the nuclear envelope. (A) In a vertebrate NPC, nucleoporins are arranged with striking eightfold 
rotational symmetry. In addition, immunoelectron microscopic studies show that the proteins that make up the central portion of the NPC are 
oriented symmetrically across the nuclear envelope, so that the nuclear and cytosolic sides look identical. The eightfold rotational and twofold 
transverse symmetry explains how such a huge structure can be formed from only about 30 different proteins: many of the nucleoporins are 
present in 8, 16, or 32 copies. Based on their approximate localization in the central portion of the NPC, nucleoporins can be classified into (1) 
transmembrane ring proteins that span the nuclear envelope and anchor the NPC to the envelope; (2) scaffold nucleoporins that form layered ring 
structures. Some scaffold nucleoporins are membrane-bending proteins that stabilize the sharp membrane curvature where the nuclear envelope 
is penetrated; and (3) channel nucleoporins that line a central pore. In addition to folded domains that anchor the proteins in specific places, many 
channel nucleoporins contain extensive unstructured regions, where the polypeptide chains are intrinsically disordered. The central pore is filled 
with a tangled mesh of these disordered domains that blocks the passive diffusion of large macromolecules. The disordered regions contain a 
large number of phenylalanine-glycine (FG) repeats. Fibrils protrude from both the cytosolic and the nuclear sides of the NPC. By contrast to the 
twofold transverse symmetry of the NPC core, the fibrils facing the cytosol and nucleus are different: on the nuclear side, the fibrils converge at 
their distal end to form a basketlike structure. The precise arrangement of individual nucleoporins in the assembled NPC is still a matter of intense 
debate, because atomic resolution analyses have been hindered by the sheer size and flexible nature of the NPC, and by difficulties in purifying 
sufficient amounts of homogeneous material. A combination of electron microscopy, computational analyses, and crystal structures of nucleoporin 
subcomplexes has been used to develop the current models of the NPC architecture. (B) A scanning electron micrograph of the nuclear side of the 
nuclear envelope of an oocyte (see also Figure 9-52). (C) An electron micrograph showing a side view of two NPCs (brackets); note that the inner 
and outer nuclear membranes are continuous at the edges of the pore. (D) An electron micrograph showing face-on views of negatively stained 
NPCs. The membrane has been removed by detergent extraction. Note that some of the NPCs contain material in their center, which is thought to 
be trapped macromolecules in transit through these NPCs. (A, adapted from A. Hoelz, E.W. Debler and G. Blobel, Annu. Rev. Biochem. 80:613- 
643, 2011. With permission from Annual Reviews; B, from M.W. Goldberg and T.D. Allen, J. Cell Biol. 119:1429-1440, 1992. With permission from 
The Rockefeller University Press; C, courtesy of Werner Franke and Ulrich Scheer; D, courtesy of Ron Milligan.) 

Nuclear Import Receptors Bind to Both Nuclear Localization 
Signals and NPC Proteins 

To initiate nuclear import, most nuclear localization signals must be recognized 
by nuclear import receptors, sometimes called importins, most of which are 
encoded by a family of related genes. Each family member encodes a receptor 
protein that can bind and transport the subset of cargo proteins containing the 
appropriate nuclear localization signal (Figure 12-11A). Nuclear import 
receptors do not always bind to nuclear proteins directly. Additional adaptor proteins 
can form a bridge between the import receptors and the nuclear localization 
signals on the proteins to be transported (Figure 12-11B). Some adaptor proteins 
are structurally related to nuclear import receptors, suggesting a common 
evolutionary origin. By using a variety of import receptors and adaptors, cells are able 
to recognize the broad repertoire of nuclear localization signals that are displayed 
on nuclear proteins. 

The import receptors are soluble cytosolic proteins that bind both to the 
nuclear localization signal on the cargo protein and to the phenylalanine-glycine 
(FG) repeats in the unstructured domains of the channel nucleoporins that 
line the central pore. FG-repeats are also found in the cytoplasmic and nuclear 
fibrils. FG-repeats in the unstructured tangle of the pore are thought to do double 
duty. They interact weakly, which gives the protein tangle gel-like properties that 
impose a permeability barrier to large macromolecules, and they serve as 
docking sites for nuclear import receptors. FG-repeats line the path through the NPCs 
taken by the import receptors and their bound cargo proteins. According to one 
model of nuclear transport, the receptor-cargo complexes move along the 
transport path by repeatedly binding, dissociating, and then re-binding to adjacent 
FG-repeat sequences. In this way, the complexes may hop from one nucleoporin 
to another to traverse the tangled interior of the NPC in a random walk. As import 
receptors bind to FG-repeats during this journey, they would disrupt interaction 
between the repeats and locally dissolve the gel phase of the protein tangle that 
fills the pore, allowing the passage of the receptor-cargo complex. Once inside the 
nucleus, the import receptors dissociate from their cargo and return to the 
cytosol. As we will see, this dissociation only occurs on the nuclear side of the NPC and 
thereby confers directionality to the import process. 
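
The random-walk model described above can be illustrated with a one-dimensional simulation. The sketch below is a deliberately crude picture under several assumptions that are not in the text: the pore is reduced to a line of `n_sites` FG-repeat binding sites, each hop moves the complex to a neighboring site with equal probability, the complex never escapes back into the cytosol, and it is removed from the walk as soon as it reaches the nuclear side (standing in for cargo release there). The function name and parameters are hypothetical.

```python
import random


def steps_to_cross(n_sites: int = 10, seed: int = 0) -> int:
    """Count the hops a receptor-cargo complex needs to traverse the pore.

    Position 0 is the cytosolic entrance; position n_sites is the nuclear exit.
    """
    rng = random.Random(seed)          # seeded so that a run is reproducible
    position, steps = 0, 0
    while position < n_sites:
        if position == 0:
            position = 1               # assumption: no escape back to the cytosol
        else:
            position += rng.choice((-1, 1))   # unbiased hop to a neighboring site
        steps += 1
    return steps


if __name__ == "__main__":
    trials = [steps_to_cross(10, seed=s) for s in range(200)]
    print("shortest crossing:", min(trials))
    print("mean crossing:", sum(trials) / len(trials))
```

Individual hops in this sketch are unbiased; net movement toward the nucleus appears only because the complex is taken out of the walk at the nuclear end, which mirrors the point that dissociation on the nuclear side is what confers directionality.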

Nuclear Export Works Like Nuclear Import, But in Reverse 

The nuclear export of large molecules, such as new ribosomal subunits and 
RNA molecules, occurs through NPCs and also depends on a selective transport 
system. The transport system relies on nuclear export signals on the 
macromolecules to be exported, as well as on complementary nuclear export receptors, 
or exportins. These receptors bind to both the export signal and NPC proteins to 
guide their cargo through the NPC to the cytosol. 

Figure 12-9 The function of a nuclear 
localization signal. Immunofluorescence 
micrographs showing the cell location 
of SV40 virus T-antigen containing or 
lacking a short sequence that serves as a 
nuclear localization signal. (A) The normal 
T-antigen protein contains the lysine-rich 
sequence indicated and is imported to its 
site of action in the nucleus, as indicated 
by immunofluorescence staining with 
antibodies against the T-antigen. 
(B) T-antigen with an altered nuclear 
localization signal (a threonine replacing 
a lysine) remains in the cytosol. (From 
D. Kalderon, B. Roberts, W. Richardson 
and A. Smith, Cell 39:499-509, 1984. With 
permission from Elsevier.) 

Figure 12-10 Visualizing active import 
through NPCs. This series of electron 
micrographs shows colloidal gold spheres 
(arrowheads) coated with peptides 
containing nuclear localization signals 
entering the nucleus through NPCs. The 
gold particles were injected into the cytosol 
of living cells, which then were fixed and 
prepared for electron microscopy at various 
times after injection. (A) Gold particles are 
first seen in proximity to the cytosolic fibrils 
of the NPCs. (B, C) They are then seen at 
the center of the NPCs, exclusively on the 
cytosolic face. (D) They then appear on 
the nuclear face. These gold particles have 
much larger diameters than the diffusion 
channels in the NPC and are imported by 
active transport. (From N. Panté and U. 
Aebi, Science 273:1729-1732, 1996. With 
permission from AAAS.) 

Many nuclear export receptors are structurally related to nuclear import 
receptors, and they are encoded by the same gene family of nuclear transport 
receptors, or karyopherins. In yeast, there are 14 genes encoding karyopherins; in 
animal cells, the number is significantly larger. It is often not possible to tell from 
their amino acid sequence alone whether a particular family member works as a 
nuclear import or nuclear export receptor. As might be expected, therefore, the 
import and export transport systems work in similar ways but in opposite 
directions: the import receptors bind their cargo molecules in the cytosol, release them 
in the nucleus, and are then exported to the cytosol for reuse, while the export 
receptors function in the opposite fashion. 


Figure 12-11 Nuclear import receptors 
(importins). (A) Different nuclear import 
receptors bind different nuclear localization 
signals and thereby different cargo 
proteins. (B) Cargo protein 4 requires an 
adaptor protein to bind to its nuclear import 
receptor. The adaptors are structurally 
related to nuclear import receptors and 
recognize nuclear localization signals 
on cargo proteins. They also contain a 
nuclear localization signal that binds them 
to an import receptor, but this signal only 
becomes exposed when they are loaded 
with a cargo protein. 


The Ran GTPase Imposes Directionality on Transport Through 
NPCs 


The import of nuclear proteins through NPCs concentrates specific proteins in the 
nucleus and thereby increases order in the cell. The cell fuels this ordering process 
by harnessing energy stored in concentration gradients of the GTP-bound form 
of the monomeric GTPase Ran, which is required for both nuclear import and 
export. 

Like other GTPases, Ran is a molecular switch that can exist in two 
conformational states, depending on whether GDP or GTP is bound (discussed in Chapter 
3). Two Ran-specific regulatory proteins trigger the conversion between the two 
states: a cytosolic GTPase-activating protein (GAP) triggers GTP hydrolysis and 
thus converts Ran-GTP to Ran-GDP, and a nuclear guanine nucleotide exchange factor (GEF) 
promotes the exchange of GDP for GTP and thus converts Ran-GDP to Ran-GTP. 
Because Ran-GAP is located in the cytosol and Ran-GEF is located in the nucleus 
where it is anchored to chromatin, the cytosol contains mainly Ran-GDP, and the 
nucleus contains mainly Ran-GTP (Figure 12-12). 





Figure 12-12 The compartmentalization 
of Ran-GDP and Ran-GTP. Localization 
of Ran-GDP in the cytosol and Ran-GTP in 
the nucleus results from the localization of 
two Ran regulatory proteins: Ran GTPase- 
activating protein (Ran-GAP) is located in 
the cytosol, and Ran guanine nucleotide 
exchange factor (Ran-GEF) binds to 
chromatin and is therefore located in the 
nucleus. 

Ran-GDP is imported into the nucleus by 
its own import receptor, which is specific 
for the GDP-bound conformation of Ran. 
The Ran-GDP receptor is structurally 
unrelated to the main family of nuclear 
transport receptors. However, it also 
binds to FG-repeats in NPC channel 
nucleoporins. 

This gradient of the two conformational forms of Ran drives nuclear transport 
in the appropriate direction. Docking of nuclear import receptors to FG-repeats 
on the cytosolic side of the NPC, for example, occurs whether or not these 
receptors are loaded with appropriate cargo. Import receptors, facilitated by FG-repeat 
binding, then enter the channel. If they reach the nuclear side of the pore 
complex, Ran-GTP binds to them, and, if the receptors arrive loaded with cargo 
molecules, the Ran-GTP binding causes the receptors to release their cargo (Figure 
12-13A). Because the Ran-GDP in the cytosol does not bind to import (or export) 
receptors, unloading occurs only on the nuclear side of the NPC. In this way, the 
nuclear localization of Ran-GTP creates the directionality of the import process. 

Having discharged its cargo in the nucleus, the empty import receptor with 
Ran-GTP bound is transported back through the pore complex to the cytosol. 
There, Ran-GAP triggers Ran-GTP to hydrolyze its bound GTP, thereby converting 
it to Ran-GDP, which dissociates from the receptor. The receptor is then ready for 
another cycle of nuclear import. 

Nuclear export occurs by a similar mechanism, except that Ran-GTP in the 
nucleus promotes cargo binding to the export receptor, rather than promoting 
cargo dissociation. Once the export receptor moves through the pore to the 
cytosol, it encounters Ran-GAP, which induces the Ran bound to the receptor to 
hydrolyze its GTP to GDP. As a result, the export receptor releases both its cargo 
and Ran-GDP in the cytosol. Free export receptors are then returned to the nucleus 
to complete the cycle (Figure 12-13B). 
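
The two transport cycles just described can be written out as a sequence of state changes. The sketch below is a bookkeeping aid only: the dictionary layout and function names are hypothetical, each step is treated as instantaneous and certain, and no kinetics or concentrations are modeled.

```python
def import_cycle(cargo: str) -> dict:
    """One round trip of a nuclear import receptor, step by step as in the text."""
    delivered = {"nucleus": [], "cytosol": []}
    receptor = {"location": "cytosol", "cargo": cargo, "ran": None}

    # 1. The loaded receptor moves through the NPC along FG-repeats.
    receptor["location"] = "nucleus"
    # 2. Ran-GTP, abundant in the nucleus, binds and the cargo is released.
    receptor["ran"] = "GTP"
    delivered["nucleus"].append(receptor["cargo"])
    receptor["cargo"] = None
    # 3. The empty receptor, still carrying Ran-GTP, returns to the cytosol.
    receptor["location"] = "cytosol"
    # 4. Ran-GAP triggers GTP hydrolysis; Ran-GDP dissociates from the receptor.
    receptor["ran"] = None
    return {"receptor": receptor, "delivered": delivered}


def export_cycle(cargo: str) -> dict:
    """One round trip of a nuclear export receptor (the mirror image)."""
    delivered = {"nucleus": [], "cytosol": []}
    receptor = {"location": "nucleus", "cargo": None, "ran": None}

    # 1. In the nucleus, Ran-GTP promotes cargo binding to the receptor.
    receptor["ran"] = "GTP"
    receptor["cargo"] = cargo
    # 2. The receptor with cargo and Ran-GTP moves through the NPC.
    receptor["location"] = "cytosol"
    # 3. Ran-GAP triggers GTP hydrolysis; cargo and Ran-GDP are both released.
    receptor["ran"] = None
    delivered["cytosol"].append(receptor["cargo"])
    receptor["cargo"] = None
    # 4. The free receptor returns to the nucleus for another round.
    receptor["location"] = "nucleus"
    return {"receptor": receptor, "delivered": delivered}


if __name__ == "__main__":
    print(import_cycle("DNA polymerase"))
    print(export_cycle("ribosomal subunit"))
```

In both functions the receptor ends where it started, empty and free of Ran, which is what allows it to be reused; the only difference between the two is whether Ran-GTP binding causes cargo release or cargo loading.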

Transport Through NPCs Can Be Regulated by Controlling Access 
to the Transport Machinery 

Some proteins contain both nuclear localization signals and nuclear export 
signals. These proteins continually shuttle back and forth between the nucleus and 
the cytosol. The relative rates of their import and export determine the 
steady-state localization of such shuttling proteins: if the rate of import exceeds the rate 
of export, a protein will be located mainly in the nucleus; conversely, if the rate of 
export exceeds the rate of import, a protein will be located mainly in the cytosol. 
Thus, changing the rate of import, export, or both, can change the location of a 
protein. 
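
The dependence of steady-state localization on the two rates can be made quantitative with a two-compartment model. The sketch below assumes, beyond what the text states, that import and export are both first-order processes with rate constants `k_import` and `k_export`, so that the nuclear fraction N obeys dN/dt = k_import(1 - N) - k_export N and settles at N = k_import / (k_import + k_export). The function name and parameters are hypothetical.

```python
def nuclear_fraction(k_import: float, k_export: float) -> float:
    """Steady-state fraction of a shuttling protein found in the nucleus.

    Assumes first-order import and export with the given rate constants.
    """
    if k_import < 0 or k_export < 0:
        raise ValueError("rate constants must be non-negative")
    total = k_import + k_export
    if total == 0:
        raise ValueError("at least one rate constant must be positive")
    return k_import / total


if __name__ == "__main__":
    print(nuclear_fraction(3.0, 1.0))   # import faster: mostly nuclear
    print(nuclear_fraction(1.0, 3.0))   # export faster: mostly cytosolic
```

Under this assumption, a protein can be relocated by changing either rate constant alone, since only their ratio sets the steady state.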


Figure 12-13 How GTP hydrolysis by Ran in the cytosol provides directionality
to nuclear transport. Movement through the NPC of loaded nuclear transport
receptors occurs along the FG-repeats displayed by certain NPC proteins. The
differential localization of Ran-GTP in the nucleus and Ran-GDP in the cytosol
provides directionality (red arrows) to both nuclear import (A) and nuclear
export (B). Ran-GAP stimulates the hydrolysis of GTP to produce Ran-GDP on the
cytosolic side of the NPC (see Figure 12-12).

Some shuttling proteins move continuously into and out of the nucleus. In
other cases, however, the transport is stringently controlled. As discussed in
Chapter 7, cells control the activity of some transcription regulators by keeping
them out of the nucleus until they are needed there (Figure 12-14). In many cases,
cells control transport by regulating nuclear localization and export signals,
turning them on or off, often by phosphorylation of amino acids close to the signal
sequences (Figure 12-15).

Other transcription regulators are bound to inhibitory cytosolic proteins that
either anchor them in the cytosol (through interactions with the cytoskeleton or
specific organelles) or mask their nuclear localization signals so that they cannot
interact with nuclear import receptors. An appropriate stimulus releases the gene
regulatory protein from its cytosolic anchor or mask, and it is then transported
into the nucleus. One important example is the latent gene regulatory protein that
controls the expression of proteins involved in cholesterol metabolism. The
protein is made and stored in an inactive form as a transmembrane protein in the ER.
When a cell is deprived of cholesterol, the protein is transported from the ER to
the Golgi apparatus where it encounters specific proteases that cleave off the
cytosolic domain, releasing it into the cytosol. This domain is then imported into
the nucleus, where it activates the transcription of genes required for both
cholesterol uptake and synthesis (Figure 12-16).

As we discuss in detail in Chapter 6, cells control the export of RNAs from the
nucleus in a similar way. snRNAs, miRNAs, and tRNAs bind to the same family
of nuclear export receptors just discussed, and they use the same Ran-GTP
gradient to fuel the transport process. By contrast, the export of mRNAs out of the
nucleus uses a different mechanism. mRNAs are exported as large assemblies,
which can be as large as 100 million daltons (see Figure 6-37) and can contain
hundreds of proteins of a few dozen different types. These mRNA
ribonucleoprotein complexes (mRNPs) first dock at the nuclear side of the NPC, where
they are extensively remodeled. Although Ran-GTP is indirectly involved in the export
(because it imports the proteins that bind to the mRNA molecules), the
translocation across the NPC is thought to be driven by ATP hydrolysis. How export
directionality is assured is unclear. It is likely that the many accessory proteins
tethered to the NPC's nuclear and cytoplasmic fibrils have important roles in
remodeling the mRNPs as they pass through the pores, in particular stripping away
nuclear proteins as the mRNPs exit on the cytosolic side of the NPC, thereby ensuring
that transport is unidirectional. Upon entry into the cytosol, these nuclear mRNP
proteins are rapidly returned to the nucleus.


Figure 12-14 The control of nuclear transport in the early Drosophila embryo.
The embryo at this stage is a syncytium, shown here in cross section, with many
nuclei in a common cytoplasm, arranged around the periphery, just beneath the
plasma membrane. The transcription regulatory protein Dorsal is produced
uniformly throughout the peripheral cytoplasm, but it can act only when inside the
nuclei. The Dorsal protein has been stained with an enzyme-coupled antibody that
yields a brown product, revealing that Dorsal is excluded from the nuclei at the
dorsal side (top) of the embryo but is concentrated in the nuclei toward the
ventral side (bottom) of the embryo. The regulated traffic of Dorsal into the
nuclei controls the differential development between the back and belly of the
animal. (Courtesy of Siegfried Roth.)


Figure 12-15 The control of nuclear import during T cell activation. The
nuclear factor of activated T cells (NF-AT) is a transcription regulatory protein
that, in the resting T cell, is found in the cytosol in a phosphorylated state.
When T cells are activated by foreign antigen (discussed in Chapter 24), the
intracellular Ca²⁺ concentration increases. In high Ca²⁺, the protein phosphatase
calcineurin binds to NF-AT and dephosphorylates it. The dephosphorylation exposes
nuclear import signals and blocks a nuclear export signal. The complex of NF-AT
and calcineurin is therefore imported into the nucleus, where NF-AT activates the
transcription of numerous genes required for T cell activation.

The response shuts off when Ca²⁺ levels decrease, releasing NF-AT from
calcineurin. Rephosphorylation of NF-AT inactivates the nuclear import signals and
re-exposes the nuclear export signal, causing NF-AT to relocate to the cytosol.
Some of the most potent immunosuppressive drugs, including cyclosporin A and
FK506, inhibit the ability of calcineurin to dephosphorylate NF-AT and thereby
block the nuclear accumulation of NF-AT and T cell activation (Movie 12.1).

During Mitosis the Nuclear Envelope Disassembles 

The nuclear lamina, located on the nuclear side of the inner nuclear membrane,
is a meshwork of interconnected protein subunits called nuclear lamins. The
lamins are a special class of intermediate filament proteins (discussed in Chapter
16) that polymerize into a two-dimensional lattice (Figure 12-17). The nuclear
lamina gives shape and stability to the nuclear envelope, to which it is anchored
by attachment to both the NPCs and transmembrane proteins of the inner nuclear
membrane. The lamina also interacts directly with chromatin, which itself
interacts with transmembrane proteins of the inner nuclear membrane. Together with
the lamina, these inner membrane proteins provide structural links between the
DNA and the nuclear envelope.

When a nucleus is dismantled during mitosis, the NPCs and nuclear lamina
disassemble and the nuclear envelope fragments. The dismantling process is at
least partly a consequence of direct phosphorylation of nucleoporins and lamins
by the cyclin-dependent protein kinase (Cdk) that is activated at the onset of
mitosis (discussed in Chapter 17). During this process, some NPC proteins become
bound to nuclear import receptors, which play an important part in the
reassembly of NPCs at the end of mitosis. Nuclear envelope membrane proteins, no
longer tethered to the pore complexes, lamina, or chromatin, disperse throughout
the ER membrane. The dynein motor protein, which moves along microtubules
(discussed in Chapter 16), actively participates in tearing the nuclear envelope off
the chromatin. Together, these processes break down the barriers that normally
separate the nucleus and cytosol, and the nuclear proteins that are not bound to
membranes or chromosomes intermix completely with the proteins of the cytosol
(Figure 12-18).

Figure 12-16 Feedback regulation of cholesterol biosynthesis. SREBP (sterol
response element binding protein), a latent transcription regulator that controls
expression of cholesterol biosynthetic enzymes, is initially synthesized as an ER
membrane protein. If there is sufficient cholesterol in the membrane, it is
anchored in the ER by interaction with another ER membrane protein, called SCAP
(SREBP cleavage activation protein), which binds cholesterol. If the cholesterol
binding site on SCAP is empty (at low cholesterol concentrations), SCAP changes
conformation and is packaged together with SREBP into transport vesicles, which
deliver their cargo to the Golgi apparatus, where two Golgi-resident proteases
cleave SREBP to free its cytosolic domain from the membrane. The cytosolic domain
then moves into the nucleus, where it binds to the promoters of genes that encode
proteins involved in cholesterol biosynthesis and activates their transcription.
In this way, more cholesterol is made when its concentration falls below a
threshold.

Figure 12-17 The nuclear lamina. An electron micrograph of a portion of the
nuclear lamina in a Xenopus oocyte prepared by freeze-drying and metal shadowing.
The lamina is formed by a regular lattice of specialized intermediate filaments.
Lamins are only present in metazoan cells. Other, yet-unknown proteins may serve
similar functions in species that lack lamins. (Courtesy of Ueli Aebi.)

Later in mitosis, the nuclear envelope reassembles on the surface of the
daughter chromosomes. In addition to its crucial role in nuclear transport, the
Ran GTPase also acts as a positional marker for chromatin during cell division,
when the nuclear and cytosolic components intermix. Because Ran-GEF remains
bound to chromatin when the nuclear envelope breaks down, Ran molecules
close to chromatin are mainly in their GTP-bound conformation. By contrast, Ran
molecules further away have a high likelihood of encountering Ran-GAP, which
is distributed throughout the cytosol; these Ran molecules are mainly in their
GDP-bound conformation. As a result, the chromosomes in mitotic cells are
surrounded by a cloud of Ran-GTP. Ran-GTP releases the NPC proteins in proximity
to the chromosomes from nuclear import receptors. The free NPC proteins attach
to the chromosome surface, where they assemble into new NPCs. At the same
time, inner nuclear membrane proteins and dephosphorylated lamins bind again
to chromatin. ER membranes wrap around groups of chromosomes until they
form a sealed nuclear envelope (Movie 12.2). During this process, the NPCs start
actively re-importing proteins that contain nuclear localization signals. Because
the nuclear envelope is initially closely applied to the surface of the chromosomes,
the newly formed nucleus excludes all proteins except those initially bound to the
mitotic chromosomes and those that are selectively imported through NPCs. In
this way, all other large proteins, including ribosomes, are kept out of the newly
assembled nucleus.

As we discuss in Chapter 17, the cloud of Ran-GTP surrounding chromatin is 
also important in assembling the mitotic spindle in a dividing cell. 


Figure 12-18 The breakdown and re-formation of the nuclear envelope and
lamina during mitosis. Phosphorylation of the lamins triggers the disassembly
of the nuclear lamina, which causes the nuclear envelope to break up.
Dephosphorylation of the lamins reverses the process. An analogous phosphorylation
and dephosphorylation cycle occurs for some nucleoporins and proteins of the
inner nuclear membrane, and some of these dephosphorylations are also involved in
the reassembly process. As indicated, the nuclear envelope initially re-forms
around individual decondensing daughter chromosomes. Eventually, as
decondensation progresses, these structures fuse to form a single complete nucleus.

Mitotic breakdown of the nuclear envelope occurs in all metazoan cells.
However, in many other species, such as yeasts, the nuclear envelope remains intact
during mitosis, and the nucleus divides by fission.


Summary

The nuclear envelope consists of an inner and an outer nuclear membrane that are
continuous with each other and with the ER membrane, and the space between the
inner and outer nuclear membrane is continuous with the ER lumen. RNA
molecules, which are made in the nucleus, and ribosomal subunits, which are assembled
there, are exported to the cytosol; in contrast, all the proteins that function in the
nucleus are synthesized in the cytosol and are then imported. The extensive traffic of
materials between the nucleus and cytosol occurs through nuclear pore complexes
(NPCs), which provide a direct passageway across the nuclear envelope. Small
molecules diffuse passively through the NPCs, but large macromolecules have to be
actively transported.

Proteins containing nuclear localization signals are actively transported into
the nucleus through NPCs, while proteins containing nuclear export signals are
transported out of the nucleus to the cytosol. Some proteins, including the nuclear
import and export receptors, continually shuttle between the cytosol and nucleus.
The monomeric GTPase Ran provides both the free energy and the directionality for
nuclear transport. Cells regulate the transport of nuclear proteins and RNA
molecules through the NPCs by controlling the access of these molecules to the transport
machinery. Newly transcribed messenger RNA and ribosomal RNA are exported
from the nucleus as parts of large ribonucleoprotein complexes. Because nuclear
localization signals are not removed, nuclear proteins can be imported repeatedly,
as is required each time that the nucleus reassembles after mitosis.


THE TRANSPORT OF PROTEINS INTO 
MITOCHONDRIA AND CHLOROPLASTS 


Mitochondria and chloroplasts (a specialized form of plastids in green algae
and plant cells) are double-membrane-enclosed organelles. They specialize in
ATP synthesis, using energy derived from electron transport and oxidative
phosphorylation in mitochondria and from photosynthesis in chloroplasts (discussed
in Chapter 14). Although both organelles contain their own DNA, ribosomes,
and other components required for protein synthesis, most of their proteins are
encoded in the cell nucleus and imported from the cytosol. Each imported
protein must reach the particular organelle subcompartment in which it functions.

There are different subcompartments in mitochondria (Figure 12-19A): the
internal matrix space and the intermembrane space, which is continuous with
the cristae space. These compartments are formed by the two concentric
mitochondrial membranes: the inner membrane, which encloses the matrix space
and forms extensive invaginations called cristae, and the outer membrane,
which is in contact with the cytosol. Protein complexes provide boundaries at the
junctions where the cristae invaginate and divide the inner membrane into two
domains: one inner membrane domain surrounds the cristae space, and the other
domain abuts the outer membrane. Chloroplasts also have an outer and inner
membrane, which enclose an intermembrane space, and the stroma, which is the
chloroplast equivalent of the mitochondrial matrix space (Figure 12-19B). They
have an additional subcompartment, the thylakoid space, which is surrounded by
the thylakoid membrane. The thylakoid membrane derives from the inner
membrane during plastid development and is pinched off to become discontinuous
with it. Each of the subcompartments in mitochondria and chloroplasts contains
a distinct set of proteins.

New mitochondria and chloroplasts are produced by the growth of preexisting
organelles, followed by fission (discussed in Chapter 14). The growth depends
mainly on the import of proteins from the cytosol. The imported proteins must
be transported across a number of membranes in succession and end up in the
appropriate place. The process of protein movement across membranes is called
protein translocation. This section explains how it occurs.


Figure 12-19 The subcompartments of mitochondria and chloroplasts. In
contrast to the cristae of mitochondria (A), the thylakoids of chloroplasts (B) are
not connected to the inner membrane and therefore form a sealed compartment
with a separate internal space.

Translocation into Mitochondria Depends on Signal Sequences
and Protein Translocators

Proteins imported into mitochondria are usually taken up from the cytosol within
seconds or minutes of their release from ribosomes. Thus, in contrast to protein
translocation into the ER, which often takes place simultaneously with translation
by a ribosome docked on the rough ER membrane (described later),
mitochondrial proteins are first fully synthesized as mitochondrial precursor proteins in
the cytosol and then translocated into mitochondria by a post-translational
mechanism. One or more signal sequences direct all mitochondrial precursor proteins
to their appropriate mitochondrial subcompartment. Many proteins entering the
matrix space contain a signal sequence at their N-terminus that a signal
peptidase rapidly removes after import. Other imported proteins, including all outer
membrane and many inner membrane and intermembrane space proteins, have
internal signal sequences that are not removed. The signal sequences are both
necessary and sufficient for the import and correct localization of the proteins:
when genetic engineering techniques are used to link these signals to a cytosolic
protein, the signals direct the protein to the correct mitochondrial
subcompartment.

The signal sequences that direct precursor proteins into the mitochondrial
matrix space are best understood. They all form an amphiphilic α helix, in which
positively charged residues cluster on one side of the helix, while uncharged
hydrophobic residues cluster on the opposite side. Specific receptor proteins that
initiate protein translocation recognize this configuration rather than the precise
amino acid sequence of the signal sequence (Figure 12-20).
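
Why a receptor can recognize the configuration rather than the exact sequence
can be illustrated with a helical-wheel calculation. The sketch below relies on
the standard α-helix geometry of 3.6 residues per turn (100° per residue); the
crude +1/-1 scoring of residues and both 18-residue peptides are assumptions
made purely for illustration and are not real presequences.

```python
import cmath
import math

DEGREES_PER_RESIDUE = 100.0  # alpha helix: 3.6 residues per turn

# Deliberately crude scoring (an assumption of this sketch):
# +1 for hydrophobic residues, -1 for positively charged ones, 0 otherwise.
HYDROPHOBIC = set("AVLIMFW")
POSITIVE = set("KR")


def residue_score(aa: str) -> float:
    if aa in HYDROPHOBIC:
        return 1.0
    if aa in POSITIVE:
        return -1.0
    return 0.0


def wheel_angle(index: int) -> float:
    """Angular position (degrees) of residue `index`, viewed down the helix axis."""
    return (index * DEGREES_PER_RESIDUE) % 360.0


def amphiphilic_moment(sequence: str) -> float:
    """Length of the score-weighted vector sum around the wheel, per residue.

    Near 0: hydrophobic and charged residues are spread evenly around the helix.
    Larger: the two classes are segregated onto opposite faces.
    """
    if not sequence:
        raise ValueError("sequence must not be empty")
    total = sum(
        residue_score(aa) * cmath.exp(1j * math.radians(wheel_angle(i)))
        for i, aa in enumerate(sequence)
    )
    return abs(total) / len(sequence)


# Hypothetical 18-residue peptides, invented for this example.
segregated = "LKRLLKRLLKLLRKLLRK"  # charges on one face, leucines on the other
uniform = "L" * 18                 # no charged face at all

print(f"segregated: {amphiphilic_moment(segregated):.2f}")
print(f"uniform:    {amphiphilic_moment(uniform):.2f}")
```

Many different sequences give a high score as long as charged and hydrophobic
residues recur with the helical periodicity, which is consistent with recognition
of a shape rather than a fixed string of amino acids.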

Multisubunit protein complexes that function as protein translocators
mediate protein movement across mitochondrial membranes. The TOM complex
transfers proteins across the outer membrane, and two TIM complexes (TIM23
and TIM22) transfer proteins across the inner membrane (Figure 12-21). These
complexes contain some components that act as receptors for mitochondrial
precursor proteins, and other components that form the translocation channels.

Figure 12-20 A signal sequence for mitochondrial protein import. Cytochrome
oxidase is a large multiprotein complex located in the inner mitochondrial
membrane, where it functions as the terminal enzyme in the electron-transport
chain (discussed in Chapter 14). (A) The first 18 amino acids of the precursor to
subunit IV of this enzyme serve as a signal sequence for import of the subunit
into the mitochondrion. (B) When the signal sequence is folded as an α helix, the
positively charged amino acids (red) are clustered on one face of the helix, while
the nonpolar ones (green) are clustered primarily on the opposite face. Uncharged
polar amino acids are shaded orange; nitrogen atoms on the side chains of Arg and
Gln are colored blue. Signal sequences that direct proteins into the matrix space
always have the potential to form such an amphiphilic α helix, which is recognized
by specific receptor proteins on the mitochondrial surface. (C) The structure of a
signal sequence (of alcohol dehydrogenase, another mitochondrial matrix enzyme),
bound to an import receptor (gray), as determined by nuclear magnetic resonance.
The amphiphilic α helix binds with its hydrophobic face to a hydrophobic groove
in the receptor (PDB code: 1OM2).

Figure 12-21 The protein translocators in the mitochondrial membranes. The
TOM, TIM, SAM, and OXA complexes are multimeric membrane protein assemblies
that catalyze protein translocation across mitochondrial membranes. The protein
components of the TIM22 and TIM23 complexes that line the import channel are
structurally related, suggesting a common evolutionary origin of both TIM
complexes. On the matrix side, the TIM23 complex is bound to a multimeric protein
complex containing mitochondrial hsp70, which acts as an import ATPase, using ATP
hydrolysis to pull proteins through the pore. In animal cells, subtle variations
exist in the subunit composition of the translocator complexes to adapt the
mitochondrial import machinery to the particular needs of specialized cell types.
SAM = Sorting and Assembly Machinery; OXA = cytochrome OXidase Activity;
TIM = Translocator of the Inner Mitochondrial membrane; TOM = Translocator of the
Outer Membrane.

The TOM complex is required for the import of all nucleus-encoded
mitochondrial proteins. It initially transports their signal sequences into the
intermembrane space and helps to insert transmembrane proteins into the outer membrane.
β-barrel proteins, which are particularly abundant in the outer membrane, are
then passed on to an additional translocator, the SAM complex, which helps them
to fold properly in the outer membrane. The TIM23 complex transports some
soluble proteins into the matrix space and helps to insert transmembrane proteins
into the inner membrane. The TIM22 complex mediates the insertion of a
subclass of inner membrane proteins, including the transporter that moves ADP, ATP,
and phosphate in and out of mitochondria. Yet another protein translocator in
the inner mitochondrial membrane, the OXA complex, mediates the insertion of
those inner membrane proteins that are synthesized within mitochondria. It also
helps to insert some imported inner membrane proteins that are initially
transported into the matrix space by the other complexes.
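
The division of labor among the translocators described above can be gathered
into a small lookup table. The roles are taken from the text; the dictionary
layout and the helper `complexes_in` are illustrative choices made for this
sketch, not an established data format.

```python
# Roles as summarized in the text; the structure itself is an illustrative choice.
TRANSLOCATORS = {
    "TOM": {
        "membrane": "outer",
        "role": "imports all nucleus-encoded mitochondrial proteins; "
                "helps insert transmembrane proteins into the outer membrane",
    },
    "SAM": {
        "membrane": "outer",
        "role": "helps beta-barrel proteins fold properly in the outer membrane",
    },
    "TIM23": {
        "membrane": "inner",
        "role": "transports some soluble proteins into the matrix space; "
                "helps insert transmembrane proteins into the inner membrane",
    },
    "TIM22": {
        "membrane": "inner",
        "role": "inserts a subclass of inner membrane proteins, "
                "including the ADP/ATP/phosphate transporter",
    },
    "OXA": {
        "membrane": "inner",
        "role": "inserts inner membrane proteins made within mitochondria "
                "and some imported ones first delivered to the matrix",
    },
}


def complexes_in(membrane: str) -> list[str]:
    """Names of the translocator complexes located in the given membrane."""
    return sorted(
        name for name, info in TRANSLOCATORS.items() if info["membrane"] == membrane
    )


for membrane in ("outer", "inner"):
    for name in complexes_in(membrane):
        print(f"{membrane} membrane, {name}: {TRANSLOCATORS[name]['role']}")
```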

Mitochondrial Precursor Proteins Are Imported as Unfolded 
Polypeptide Chains 

We have learned almost everything we know about the molecular mechanism of
protein import into mitochondria from analyses of cell-free, reconstituted
translocation systems, in which purified mitochondria in a test tube import
radiolabeled mitochondrial precursor proteins. By changing the conditions in the test
tube, it is possible to establish the biochemical requirements for the import.

Mitochondrial precursor proteins do not fold into their native structures after
they are synthesized; instead, they remain unfolded in the cytosol through
interactions with other proteins. Some of these interacting proteins are general
chaperone proteins of the hsp70 family (discussed in Chapter 6), whereas others are
dedicated to mitochondrial precursor proteins and bind directly to their signal
sequences. All the interacting proteins help to prevent the precursor proteins
from aggregating or folding up spontaneously before they engage with the TOM
complex in the outer mitochondrial membrane. As a first step in the import
process, the import receptors of the TOM complex bind the signal sequence of the
mitochondrial precursor protein. The interacting proteins are then stripped off,
and the unfolded polypeptide chain is fed, signal sequence first, into the
translocation channel.

In principle, a protein could reach the mitochondrial matrix space by either
crossing the two membranes all at once or crossing one at a time. One can
distinguish between these possibilities by cooling a cell-free mitochondrial import
system to arrest the proteins at an intermediate step in the translocation process.
The result is that the arrested proteins no longer contain their N-terminal signal
sequence, indicating that the N-terminus must be in the matrix space where the
signal peptidase is located, but the bulk of the protein can still be attacked from
outside the mitochondria by externally added proteolytic enzymes. Clearly, the
precursor proteins can pass through both mitochondrial membranes at once to
enter the matrix space (Figure 12-22). The TOM complex first transports the
signal sequence across the outer membrane to the intermembrane space, where it
binds to a TIM complex, opening the channel in the complex. The polypeptide
chain is then either translocated into the matrix space or inserted into the inner
membrane.

Figure 12-22 Protein import by mitochondria. The N-terminal signal sequence of
the mitochondrial precursor protein is recognized by receptors of the TOM
complex. The protein is then translocated through the TIM23 complex so that it
transiently spans both mitochondrial membranes (Movie 12.3). The signal sequence
is cleaved off by a signal peptidase in the matrix space to form the mature
protein. The free signal sequence is then rapidly degraded (not shown).


Although the TOM and TIM complexes usually work together to translocate
precursor proteins across both membranes at the same time, they can work
independently. In isolated outer membranes, for example, the TOM complex can
translocate the signal sequence of precursor proteins across the membrane.
Similarly, if the outer membrane is experimentally disrupted in isolated
mitochondria, the exposed TIM23 complex can efficiently import precursor proteins into
the matrix space.

ATP Hydrolysis and a Membrane Potential Drive Protein Import 
Into the Matrix Space 

Directional transport requires energy, which in most biological systems is
supplied by ATP hydrolysis. ATP hydrolysis fuels mitochondrial protein import at
two discrete sites, one outside the mitochondria and one in the matrix space. In
addition, protein import requires another energy source, which is the membrane
potential across the inner mitochondrial membrane (Figure 12-23).

The first requirement for energy occurs at the initial stage of the translocation
process, when the unfolded precursor protein, associated with chaperone
proteins, interacts with the import receptors of the TOM complex. As discussed in
Chapter 6, the binding and release of newly synthesized polypeptides from the
chaperone proteins requires ATP hydrolysis.


Figure 12-23 The role of energy in protein import into the mitochondrial matrix space. (1) Bound cytosolic hsp70
chaperone is released from the precursor protein in a step that depends on ATP hydrolysis. After initial insertion of the signal
sequence and of adjacent portions of the polypeptide chain into the TOM complex translocation channel, the signal sequence
interacts with a TIM complex. (2) The signal sequence is then translocated into the matrix space in a process that requires the
energy in the membrane potential across the inner membrane. (3) Mitochondrial hsp70, which is part of an import ATPase
complex, binds to regions of the polypeptide chain as they become exposed in the matrix space, pulling the protein through the
translocation channel, using the energy of ATP hydrolysis.

Once the signal sequence has passed through the TOM complex and is bound
to a TIM complex, further translocation through the TIM translocation channel
requires the membrane potential, which is the electrical component of the
electrochemical H⁺ gradient across the inner membrane (see Figure 11-4). Pumping
of H⁺ from the matrix space to the intermembrane space, driven by electron
transport processes in the inner membrane (discussed in Chapter 14), maintains the
electrochemical gradient. The energy in the electrochemical H⁺ gradient across
the inner membrane therefore not only powers most of the cell's ATP synthesis,
but it also drives the translocation of the positively charged signal sequences
through the TIM complexes by electrophoresis.
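
The size of this electrophoretic driving force can be estimated from the
relation ΔG = zFΔψ, where z is the net charge being moved, F is the Faraday
constant, and Δψ is the potential of the destination relative to the origin. The
sketch below is a back-of-the-envelope estimate only: the value of 0.15 V for the
membrane potential (matrix negative) is an assumed round figure, not one given in
this section, and the net charges are illustrative.

```python
FARADAY = 96485.33212  # Faraday constant, in coulombs per mole


def electrical_free_energy(charge: int, delta_psi_volts: float) -> float:
    """Free-energy change (J/mol) for moving a net charge across a potential.

    delta_psi_volts is the electrical potential of the destination
    compartment minus that of the starting compartment.
    """
    return charge * FARADAY * delta_psi_volts


# Assumption for illustration: the matrix is 0.15 V negative relative to
# the intermembrane space.
ASSUMED_DELTA_PSI = -0.15

for net_charge in (1, 2, 3):
    dg = electrical_free_energy(net_charge, ASSUMED_DELTA_PSI)
    print(f"net charge +{net_charge}: {dg / 1000:.1f} kJ/mol")
```

A negative value means that movement of the positive charge toward the matrix is
energetically favorable, and the effect grows with the number of positive charges
carried by the signal sequence.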

Mitochondrial hsp70 also plays a crucial part in the import process.
Mitochondria containing mutant forms of the protein fail to import precursor proteins.
The mitochondrial hsp70 is part of a multisubunit protein assembly that is bound
to the matrix side of the TIM23 complex and acts as a motor to pull the precursor
protein into the matrix space. Like its cytosolic cousin, mitochondrial hsp70 has a
high affinity for unfolded polypeptide chains, and it binds tightly to an imported
protein chain as soon as the chain emerges from the TIM translocator in the
matrix space. The hsp70 then undergoes a conformational change and releases
the protein chain in an ATP-dependent step, exerting a ratcheting/pulling force
on the protein being imported. This energy-driven cycle of binding and
subsequent release provides the final driving force needed to complete protein import
after a protein has initially inserted into the TIM23 complex (see Figure 12-23).
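
The ratcheting idea can be illustrated with a toy random walk. This is only a
sketch under strong simplifying assumptions: the chain moves one residue forward
or backward per step with equal probability, it cannot slip back out of the
channel entirely, and a bound chaperone is modeled as a point past which the
chain cannot slide backward. The function name, the spacing of binding sites, and
all numbers are hypothetical, and any active pulling by hsp70 is ignored.

```python
import random


def import_steps(chain_length, ratchet, site_spacing=5, max_steps=100_000, seed=0):
    """Random-walk sketch of a chain sliding through a translocation channel.

    `position` is the number of residues already in the matrix. With
    `ratchet=True`, each time another `site_spacing` residues have entered,
    a bound chaperone blocks backsliding past that point. Returns the number
    of steps needed to import the whole chain, or None if `max_steps` is
    reached first.
    """
    rng = random.Random(seed)
    position = 0
    floor = 0  # the chain cannot slide back past this point
    for step in range(1, max_steps + 1):
        position += rng.choice((-1, 1))
        position = max(position, floor)
        if ratchet:
            floor = max(floor, (position // site_spacing) * site_spacing)
        if position >= chain_length:
            return step
    return None


def mean_steps(chain_length, ratchet, trials=20):
    """Average import time over several seeds, ignoring runs that time out."""
    runs = [import_steps(chain_length, ratchet, seed=s) for s in range(trials)]
    finished = [r for r in runs if r is not None]
    return sum(finished) / len(finished) if finished else None


print("mean steps without ratchet:", mean_steps(60, ratchet=False))
print("mean steps with ratchet:   ", mean_steps(60, ratchet=True))
```

In this simplified model, blocking backward movement is enough to turn undirected
thermal motion into net import, which is the essence of a ratchet mechanism.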

After the initial interaction with mitochondrial hsp70, many imported matrix 
proteins are passed on to another chaperone protein, mitochondrial hsp60. As 
discussed in Chapter 6, hsp60 helps the unfolded polypeptide chain to fold by 
binding and releasing it through cycles of ATP hydrolysis. 

Bacteria and Mitochondria Use Similar Mechanisms to Insert 
Porins into their Outer Membrane 

The outer mitochondrial membrane, like the outer membrane of Gram-negative
bacteria (see Figure 11-17), contains abundant pore-forming β-barrel proteins
called porins, and it is thus freely permeable to inorganic ions and metabolites
(but not to most proteins). In contrast to other outer membrane proteins, which
are anchored in the membrane through transmembrane α-helical regions, the
TOM complex cannot integrate porins into the lipid bilayer. Instead, porins are
first transported unfolded into the intermembrane space, where they transiently
bind specialized chaperone proteins, which keep the porins from aggregating
(Figure 12-24A). They then bind to the SAM complex in the outer membrane,
which both inserts them into the outer membrane and helps them fold properly.

Figure 12-24 Integration of porins into the outer mitochondrial and bacterial
membranes. (A) After translocation through the TOM complex in the outer
mitochondrial membrane, β-barrel proteins bind to chaperones in the intermembrane
space. The SAM complex then inserts the unfolded polypeptide chain into the outer
membrane and helps the chain fold. (B) A structurally related BAM complex in
the outer membrane of Gram-negative bacteria catalyzes β-barrel protein insertion
and folding (see Figure 11-17).

One of the central subunits of the SAM complex is homologous to a bacterial
outer membrane protein that helps insert β-barrel proteins into the bacterial outer
membrane from the periplasmic space (the equivalent of the intermembrane
space in mitochondria) (Figure 12-24B). This conserved pathway for inserting
β-barrel proteins further underscores the endosymbiotic origin of mitochondria.

Transport Into the Inner Mitochondrial Membrane and 
Intermembrane Space Occurs Via Several Routes 

The same mechanism that transports proteins into the matrix space using the
TOM and TIM23 translocators (see Figure 12-22) also mediates the initial
translocation of many proteins that are destined for the inner mitochondrial
membrane or the intermembrane space. In the most common translocation route,
only the N-terminal signal sequence of the transported protein actually enters the
matrix space (Figure 12-25A). A hydrophobic amino acid sequence, strategically
placed after the N-terminal signal sequence, acts as a stop-transfer sequence,
preventing further translocation across the inner membrane. The remainder of the
protein then crosses the outer membrane through the TOM complex into the
intermembrane space; the signal sequence is cleaved off in the matrix, and the
hydrophobic sequence, released from TIM23, remains anchored in the inner membrane.

Figure 12-25 Protein import from the cytosol into the inner mitochondrial membrane and intermembrane space. (A) The N-terminal
signal sequence (red) initiates import into the matrix space (see Figure 12-22). A hydrophobic sequence (blue) that follows the matrix-targeting
signal sequence binds to the TIM23 translocator (orange) in the inner membrane and stops translocation. The remainder of the protein is then
pulled into the intermembrane space through the TOM translocator in the outer membrane, and the hydrophobic sequence is released into the
inner membrane, anchoring the protein there. (B) A second route for protein integration into the inner membrane first delivers the protein completely
into the matrix space. Cleavage of the signal sequence (red) used for the initial translocation unmasks an adjacent hydrophobic signal sequence
(blue) at the new N-terminus. This signal then directs the protein into the inner membrane, using the same OXA-dependent pathway that inserts
proteins that are encoded by the mitochondrial genome and translated in the matrix space. (C) Some soluble proteins of the intermembrane space
also use the pathways shown in (A) and (B) before they are released into the intermembrane space by a second signal peptidase, which has its
active site in the intermembrane space and removes the hydrophobic signal sequence. (D) Some soluble intermembrane-space proteins become
oxidized by the Mia40 protein (Mia = mitochondrial intermembrane space assembly) during import. Mia40 forms a covalent intermediate through an
intermolecular disulfide bond, which helps pull the transported protein through the TOM complex. Mia40 becomes reduced in the process, and then
is reoxidized by the electron transport chain, so that it can catalyze the next round of import. (E) Multipass inner membrane proteins that function
as metabolite transporters contain internal signal sequences and snake through the TOM complex as a loop. They then bind to the chaperones in
the intermembrane space, which guide the proteins to the TIM22 complex. The TIM22 complex is specialized for the insertion of multipass inner
membrane proteins.

In another transport route to the inner membrane or intermembrane space, 
the TIM23 complex initially translocates the entire protein into the matrix space 
(Figure 12-25B). A matrix signal peptidase then removes the N-terminal signal 
sequence, exposing a hydrophobic sequence at the new N-terminus. This signal
sequence guides the protein to the OXA complex, which inserts the protein into 
the inner membrane. As mentioned earlier, the OXA complex is primarily used 
to insert proteins that are encoded and translated in the mitochondrion into the 
inner membrane, and only a few imported proteins use this pathway. Translocators
that are closely related to the OXA complex are found in the plasma membrane
of bacteria and in the thylakoid membrane of chloroplasts, where they
insert membrane proteins by a similar mechanism. 

Many proteins that use these pathways to the inner membrane remain
anchored there through their hydrophobic signal sequence (see Figure 12-25A,B).
Others, however, are released into the intermembrane space by a protease that
removes the membrane anchor (Figure 12-25C). Many of these cleaved proteins
remain attached to the outer surface of the inner membrane as peripheral subunits
of protein complexes that also contain transmembrane proteins.

Certain intermembrane-space proteins that contain cysteine motifs are
imported by yet another route. These proteins form a transient covalent disulfide
bond to the Mia40 protein (Figure 12-25D). The imported proteins are
then released in an oxidized form containing intrachain disulfide bonds. Mia40
becomes reduced in the process, and is then reoxidized by passing electrons to
the electron transport chain in the inner mitochondrial membrane. In this way,
the energy stored in the redox potential in the mitochondrial electron transport
chain is tapped to drive protein import.

Mitochondria are the principal sites of ATP synthesis in the cell, but they also
contain many metabolic enzymes, such as those of the citric acid cycle. Thus, in
addition to proteins, mitochondria must also transport small metabolites across
their membranes. While the outer membrane contains porins, which make the
membrane freely permeable to such small molecules, the inner membrane does
not. Instead, a family of metabolite-specific transporters transfers a vast number
of small molecules across the inner membrane. In yeast cells, these transporters
comprise a family of 35 different proteins, the most abundant of which transport
ATP, ADP, and phosphate. These are multipass transmembrane proteins, which
do not have cleavable signal sequences at their N-termini but instead contain
internal signal sequences. They cross the TOM complex in the outer membrane,
and intermembrane-space chaperones guide them to the TIM22 complex, which
inserts them into the inner membrane by a process that requires the membrane
potential, but not mitochondrial hsp70 or ATP (Figure 12-25E). An energetically
favorable partitioning of the hydrophobic transmembrane regions into the inner
membrane is likely to drive this process.
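
As an informal summary of the routes described in this section, the following Python sketch maps a few features of a precursor protein to the corresponding panel of Figure 12-25. The `Precursor` flags and the `import_route` function are hypothetical names introduced here for illustration only; real proteins are not sorted by such a simple rule table, and the sketch merely restates the text.

```python
from dataclasses import dataclass


@dataclass
class Precursor:
    """Hypothetical feature flags for a mitochondrial precursor protein."""
    n_terminal_signal: bool = False          # cleavable matrix-targeting signal
    stop_transfer: bool = False              # hydrophobic sequence that arrests in TIM23
    second_hydrophobic_signal: bool = False  # unmasked after cleavage in the matrix
    cysteine_motifs: bool = False            # forms a disulfide bond with Mia40
    internal_signals: bool = False           # multipass metabolite transporters


def import_route(p: Precursor) -> str:
    """Return the route (panel of Figure 12-25) suggested by the features."""
    if p.n_terminal_signal and p.stop_transfer:
        return "A: TOM/TIM23 with stop-transfer"
    if p.n_terminal_signal and p.second_hydrophobic_signal:
        return "B: TOM/TIM23 into the matrix, then OXA"
    if p.cysteine_motifs:
        return "D: TOM with Mia40"
    if p.internal_signals:
        return "E: TOM, then TIM22"
    if p.n_terminal_signal:
        return "matrix: TOM/TIM23"
    return "no mitochondrial route predicted"
```

Route C is not listed separately because, as described above, it is route A or B followed by removal of the membrane anchor in the intermembrane space.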

Two Signal Sequences Direct Proteins to the Thylakoid Membrane 
in Chloroplasts 

Protein transport into chloroplasts resembles transport into mitochondria.
Both processes occur post-translationally, use separate translocation complexes
in each membrane, require energy, and use amphiphilic N-terminal signal
sequences that are removed after use. With the exception of some of the chaperone
molecules, however, the protein components that form the translocation
complexes differ. Moreover, whereas mitochondria harness the electrochemical
H⁺ gradient across their inner membrane to drive transport, chloroplasts, which
have an electrochemical H⁺ gradient across their thylakoid membrane but not
their inner membrane, use GTP and ATP hydrolysis to power import across their
double membrane. The functional similarities may thus result from convergent
evolution, reflecting the common requirements for translocation across a double
membrane.

Figure 12-26 Translocation of chloroplast precursor proteins into the thylakoid
space. (A) The precursor protein contains an N-terminal chloroplast signal
sequence (red), followed immediately by a thylakoid signal sequence (brown).
The chloroplast signal sequence initiates translocation into the stroma by a
mechanism similar to that used for the translocation of mitochondrial precursor
proteins into the matrix space, although the translocator complexes, TOC and TIC,
are different. The signal sequence is then cleaved off, unmasking the thylakoid
signal sequence, which initiates translocation across the thylakoid membrane.
(B) Translocation into the thylakoid space or thylakoid membrane can occur by any
one of at least four routes: (1) a Sec pathway, so called because it uses components
that are homologs of Sec proteins, which mediate protein translocation across the
bacterial plasma membrane (discussed later); (2) an SRP-like pathway, so called
because it uses a chloroplast homolog of the signal-recognition particle, or
SRP (discussed later); (3) a TAT (twin arginine translocation) pathway, so called
because two arginines are critical in the signal sequences that direct proteins into
this pathway, which depends on the H⁺ gradient across the thylakoid membrane;
and (4) a spontaneous insertion pathway that seems not to require any protein
translocator.

Although the signal sequences for import into chloroplasts superficially 
resemble those for import into mitochondria, the same plant cells have both
mitochondria and chloroplasts, so proteins must partition appropriately between
the two organelles. In plants, for example, a bacterial enzyme can be directed 
specifically to mitochondria if it is experimentally joined to an N-terminal signal 
sequence of a mitochondrial protein; the same enzyme joined to an N-terminal 
signal sequence of a chloroplast protein ends up in chloroplasts. Thus, the import 
receptors on each organelle distinguish between the different signal sequences. 

Chloroplasts have an extra membrane-enclosed compartment, the thylakoid.
Many chloroplast proteins, including the protein subunits of the photosynthetic
system and of the ATP synthase (discussed in Chapter 14), are located in the
thylakoid membrane. Like the precursors of some mitochondrial proteins, the
precursors of these proteins are translocated from the cytosol to their final destination
in two steps. First, they pass across the double membrane into the matrix space
(called the stroma in chloroplasts), and then they either integrate into the
thylakoid membrane or translocate into the thylakoid space (Figure 12-26A). The
precursors of these proteins have a hydrophobic thylakoid signal sequence following
the N-terminal chloroplast signal sequence. After the N-terminal signal sequence
has been used to import the protein into the stroma, a stromal signal peptidase
removes it, unmasking the thylakoid signal sequence that initiates transport
across the thylakoid membrane. There are at least four routes by which proteins
cross or become integrated into the thylakoid membrane, distinguished by their
need for different stromal chaperones and energy sources (Figure 12-26B).

Summary 

Although mitochondria and chloroplasts have their own genetic systems, they
produce only a small proportion of their own proteins. Instead, the two organelles
import most of their proteins from the cytosol, using similar mechanisms. In both
cases, proteins are transported in an unfolded state across both outer and inner
membranes simultaneously into the matrix space or stroma. Both ATP hydrolysis
and a membrane potential across the inner membrane drive translocation into
mitochondria, whereas GTP and ATP hydrolysis drive translocation into chloroplasts.
Chaperone proteins of the cytosolic hsp70 family maintain the precursor
proteins in an unfolded state, and a second set of hsp70 proteins in the matrix space
or stroma pulls the polypeptide chain into the organelle. Only proteins that contain
a specific signal sequence are translocated. The signal sequence can either be
located at the N-terminus and cleaved off after import or be internal and retained.
Transport into the inner membrane sometimes uses a second, hydrophobic signal
sequence that is unmasked when the first signal sequence is removed. In chloroplasts,
import from the stroma into the thylakoid can occur by several routes,
distinguished by the chaperones and energy source used.


PEROXISOMES 


Peroxisomes differ from mitochondria and chloroplasts in many ways. Most 
notably, they are surrounded by only a single membrane, and they do not contain 
DNA or ribosomes. Thus, because peroxisomes lack a genome, all of their proteins
are encoded in the nucleus. Peroxisomes acquire most of these proteins by
selective import from the cytosol, although some of them enter the peroxisome
membrane via the ER. 

Because we do not discuss peroxisomes elsewhere, we shall digress to consider 
some of the functions of this diverse family of organelles, before discussing their 
biosynthesis. Virtually all eukaryotic cells have peroxisomes. They contain oxidative
enzymes, such as catalase and urate oxidase, at such high concentrations that,
in some cells, the peroxisomes stand out in electron micrographs because of the
presence of a crystalloid protein core (Figure 12-27). 

Like mitochondria, peroxisomes are major sites of oxygen utilization. One
hypothesis is that peroxisomes are a vestige of an ancient organelle that performed
all the oxygen metabolism in the primitive ancestors of eukaryotic cells.
When the oxygen produced by photosynthetic bacteria first accumulated in the
atmosphere, it would have been highly toxic to most cells. Peroxisomes might
have lowered the intracellular concentration of oxygen, while also exploiting its
chemical reactivity to perform useful oxidation reactions. According to this view,
the later development of mitochondria rendered peroxisomes largely obsolete
because many of the same biochemical reactions (which had formerly been carried
out in peroxisomes without producing energy) were now coupled to ATP
formation by means of oxidative phosphorylation. The oxidation reactions performed
by peroxisomes in present-day cells could therefore partly be those whose
functions were not taken over by mitochondria.


Peroxisomes Use Molecular Oxygen and Hydrogen Peroxide to 
Perform Oxidation Reactions 

Peroxisomes are so named because they usually contain one or more enzymes 
that use molecular oxygen to remove hydrogen atoms from specific organic substrates
(designated here as R) in an oxidation reaction that produces hydrogen
peroxide (H₂O₂):

RH₂ + O₂ → R + H₂O₂

Catalase uses the H₂O₂ generated by other enzymes in the organelle to oxidize
a variety of other substrates (including formic acid, formaldehyde, and alcohol)
by the “peroxidation” reaction: H₂O₂ + R'H₂ → R' + 2H₂O. This type of oxidation
reaction is particularly important in liver and kidney cells, where the peroxisomes
detoxify various harmful molecules that enter the bloodstream. About 25% of the
ethanol we drink is oxidized to acetaldehyde in this way. In addition, when excess
H₂O₂ accumulates in the cell, catalase converts it to H₂O through the reaction

2H₂O₂ → 2H₂O + O₂
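
The three reactions above can be checked for atom balance with a short Python sketch. The helper names (`atoms`, `side`, `is_balanced`) are illustrative, the parser handles only simple formulas without brackets, and the generic substrate R is treated as if it were a single element symbol.

```python
import re
from collections import Counter


def atoms(formula: str) -> Counter:
    """Count atoms in a simple formula such as 'H2O2' (no brackets)."""
    counts = Counter()
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] += int(number) if number else 1
    return counts


def side(terms) -> Counter:
    """Sum atoms over (coefficient, formula) pairs on one side of a reaction."""
    total = Counter()
    for coefficient, formula in terms:
        for element, n in atoms(formula).items():
            total[element] += coefficient * n
    return total


def is_balanced(reactants, products) -> bool:
    """True if both sides contain the same number of each atom."""
    return side(reactants) == side(products)


# Oxidase reaction: RH2 + O2 -> R + H2O2
oxidase = is_balanced([(1, "RH2"), (1, "O2")], [(1, "R"), (1, "H2O2")])
# Peroxidation by catalase: H2O2 + RH2 -> R + 2 H2O
peroxidation = is_balanced([(1, "H2O2"), (1, "RH2")], [(1, "R"), (2, "H2O")])
# Disposal of excess peroxide: 2 H2O2 -> 2 H2O + O2
disposal = is_balanced([(2, "H2O2")], [(2, "H2O"), (1, "O2")])
```

All three flags evaluate to true, whereas an unbalanced variant such as H2O2 -> H2O + O2 does not pass the check.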

A major function of the oxidation reactions performed in peroxisomes is the 
breakdown of fatty acid molecules. The process, called β oxidation, shortens the
alkyl chains of fatty acids sequentially in blocks of two carbon atoms at a time,
thereby converting the fatty acids to acetyl CoA. The peroxisomes then export the
acetyl CoA to the cytosol for use in biosynthetic reactions. In mammalian cells, β
oxidation occurs in both mitochondria and peroxisomes; in yeast and plant cells,
however, this essential reaction occurs exclusively in peroxisomes. 
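
Because each round of β oxidation removes two carbon atoms, the number of rounds and of acetyl CoA molecules follows directly from the chain length. The sketch below assumes a saturated fatty acid with an even number of carbons that is broken down completely; the function name is illustrative.

```python
def beta_oxidation_products(n_carbons: int) -> tuple[int, int]:
    """Return (cycles, acetyl_coa) for a saturated, even-chain fatty acid."""
    if n_carbons < 4 or n_carbons % 2:
        raise ValueError("expects an even number of carbons, at least 4")
    acetyl_coa = n_carbons // 2
    cycles = acetyl_coa - 1  # the final cycle releases two acetyl CoA
    return cycles, acetyl_coa


# Palmitic acid (16 carbons): 7 cycles yield 8 acetyl CoA.
palmitate = beta_oxidation_products(16)
```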

An essential biosynthetic function of animal peroxisomes is to catalyze the 
first reactions in the formation of plasmalogens, which are the most abundant 
class of phospholipids in myelin (Figure 12-28). Plasmalogen deficiencies cause 
profound abnormalities in the myelination of nerve-cell axons, which is one reason
why many peroxisomal disorders lead to neurological disease.

Peroxisomes are unusually diverse organelles, and even in the various cell 
types of a single organism they may contain different sets of enzymes. They also 
adapt remarkably to changing conditions. Yeasts grown on sugar, for example, 
have few small peroxisomes. But when some yeasts are grown on methanol,
numerous large peroxisomes are formed that oxidize methanol; and when grown
on fatty acids, they develop numerous large peroxisomes that break down fatty
acids to acetyl CoA by β oxidation.

Peroxisomes are also important in plants. Two types of plant peroxisomes have
been studied extensively. One is present in leaves, where it participates in
photorespiration (discussed in Chapter 14) (Figure 12-29A). The other type of
peroxisome is present in germinating seeds, where it converts the fatty acids stored in
seed lipids into the sugars needed for the growth of the young plant. Because this
conversion of fats to sugars is accomplished by a series of reactions known as the
glyoxylate cycle, these peroxisomes are also called glyoxysomes (Figure 12-29B).
In the glyoxylate cycle, two molecules of acetyl CoA produced by fatty acid breakdown
in the peroxisome are used to make succinic acid, which then leaves the
peroxisome and is converted into glucose in the cytosol. The glyoxylate cycle does
not occur in animal cells, and animals are therefore unable to convert the fatty
acids in fats into carbohydrates.



Figure 12-27 An electron micrograph of three peroxisomes in a rat liver
cell. The paracrystalline, electron-dense inclusions are composed primarily of the
enzyme urate oxidase. Scale bar: 200 nm. (Courtesy of Daniel S. Friend.)

A Short Signal Sequence Directs the Import of Proteins into
Peroxisomes

A specific sequence of three amino acids (Ser-Lys-Leu) located at the C-terminus
of many peroxisomal proteins functions as an import signal (see Table 12-3,
p. 648). Other peroxisomal proteins contain a signal sequence near the N-terminus.
If either sequence is attached to a cytosolic protein, the protein is imported
into peroxisomes. The import signals are first recognized by soluble receptor
proteins in the cytosol. Numerous distinct proteins, called peroxins, participate in
the import process, which is driven by ATP hydrolysis. A complex of at least six
different peroxins forms a protein translocator in the peroxisome membrane. Even
oligomeric proteins do not have to unfold to be imported. To allow the passage
of such compactly folded cargo molecules, the pore formed by the transporter is
thought to be dynamic in its dimensions, adapting in size to the particular cargo
molecules to be transported. In this respect, the mechanism differs from that used
by mitochondria and chloroplasts. One soluble import receptor, the peroxin Pex5,
recognizes the C-terminal peroxisomal import signal. It accompanies its cargo all
the way into peroxisomes and, after cargo release, cycles back to the cytosol. After
delivering its cargo to the peroxisome lumen, Pex5 undergoes ubiquitylation. This
modification is required to release Pex5 back into the cytosol, where the ubiquitin
is removed. An ATPase composed of Pex1 and Pex6 harnesses the energy of ATP
hydrolysis to help release Pex5 from peroxisomes.

Figure 12-28 The structure of a plasmalogen. Plasmalogens are very
abundant in the myelin sheaths that insulate the axons of nerve cells. They
make up some 80-90% of the myelin membrane phospholipids. In addition to
an ethanolamine head group and a long-chain fatty acid attached to the same
glycerol phosphate backbone used for phospholipids, plasmalogens contain
an unusual fatty alcohol that is attached through an ether linkage (bottom left).

Figure 12-29 Electron micrographs of two types of peroxisomes found in plant cells. (A) A peroxisome with a
paracrystalline core in a tobacco leaf mesophyll cell. Its close association with chloroplasts is thought to facilitate the
exchange of materials between these organelles during photorespiration. The vacuole in plant cells is equivalent to the
lysosome in animal cells. (B) Peroxisomes in a fat-storing cotyledon cell of a tomato seed 4 days after germination. Here the
peroxisomes (glyoxysomes) are associated with the lipid droplets that store fat, reflecting their central role in fat mobilization and
gluconeogenesis during seed germination. (A, from S.E. Frederick and E.H. Newcomb, J. Cell Biol. 43:343-353, 1969. With
permission from The Rockefeller Press; B, from W.P. Wergin, P.J. Gruber and E.H. Newcomb, J. Ultrastruct. Res. 30:533-557,
1970. With permission from Academic Press.)

The importance of this import process and of peroxisomes is demonstrated 
by the inherited human disease Zellweger syndrome, in which a defect in importing
proteins into peroxisomes leads to a profound peroxisomal deficiency. These
individuals, whose cells contain “empty” peroxisomes, have severe abnormalities
in their brain, liver, and kidneys, and they die soon after birth. A mutation in the 
gene encoding peroxin Pex5 causes one form of the disease. A defect in Pex7, the 
receptor for the N-terminal import signal, causes a milder peroxisomal disease. 
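
The C-terminal import signal recognized by Pex5 is simple enough to test for in a protein sequence written in one-letter amino acid code. The following Python sketch checks only for the exact Ser-Lys-Leu (SKL) tripeptide named in this section; the function name is illustrative, and the N-terminal signal recognized by Pex7 is not handled.

```python
def has_c_terminal_skl(sequence: str) -> bool:
    """True if a one-letter protein sequence ends in Ser-Lys-Leu (SKL)."""
    return sequence.strip().upper().endswith("SKL")


# Hypothetical sequences, used only to exercise the function.
tagged = has_c_terminal_skl("MAGTWQESKL")
untagged = has_c_terminal_skl("MSKLAGTWQE")  # SKL is present but not C-terminal
```

The second example illustrates why position matters: the tripeptide acts as a signal only at the C-terminus.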

It has long been debated whether new peroxisomes arise from preexisting ones
by organelle growth and fission (as mentioned earlier for mitochondria and
plastids) or whether they derive as a specialized compartment from the endoplasmic
reticulum (ER). Aspects of both views are true (Figure 12-30). Most peroxisomal
membrane proteins are made in the cytosol and insert into the membrane of
preexisting peroxisomes, but others are first integrated into the ER membrane,
where they are packaged into specialized peroxisomal precursor vesicles. New
precursor vesicles may then fuse with one another and begin importing additional
peroxisomal proteins, using their own protein import machinery to grow
into mature peroxisomes, which can undergo cycles of growth and fission.

Figure 12-30 A model that explains how peroxisomes proliferate and how
new peroxisomes arise. Peroxisomal precursor vesicles bud from the ER.
At least two peroxisomal membrane proteins, Pex3 and Pex15, follow this
route. The machinery that drives the budding reaction and that selects only
peroxisomal proteins for packaging into these vesicles depends on Pex19 and
other cytosolic proteins that are still unknown. Peroxisomal precursor vesicles
may then fuse with one another or with preexisting peroxisomes. The peroxisome
membrane contains import receptors and protein translocators that are required
for the import of peroxisomal proteins made on cytosolic ribosomes, including
new copies of the import receptors and translocator components. Presumably,
the lipids required for growth are also imported, although some may derive
directly from the ER in the membrane of peroxisomal precursor vesicles.

Summary 

Peroxisomes are specialized for carrying out oxidation reactions using molecular 
oxygen. They generate hydrogen peroxide, which they employ for oxidative purposes,
and contain catalase to destroy the excess. Like mitochondria and plastids,
peroxisomes are self-replicating organelles. Because they do not contain DNA or
ribosomes, however, all of their proteins are encoded in the cell nucleus. Some of
these proteins are conveyed to peroxisomes via peroxisomal precursor vesicles that
bud from the ER, but most are synthesized in the cytosol and directly imported. A
specific sequence of three amino acids near the C-terminus of many of the latter
proteins functions as a peroxisomal import signal. The mechanism of protein import
differs from that of mitochondria and chloroplasts, in that even oligomeric proteins 
are imported from the cytosol without unfolding. 


THE ENDOPLASMIC RETICULUM 


All eukaryotic cells have an endoplasmic reticulum (ER). Its membrane typically
constitutes more than half of the total membrane of an average animal cell
(see Table 12-2, p. 643). The ER is organized into a netlike labyrinth of branching
tubules and flattened sacs that extends throughout the cytosol (Figure 12-31 and
Movie 12.4). The tubules and sacs interconnect, and their membrane is continuous
with the outer nuclear membrane; the compartment that they enclose therefore
is also continuous with the space between the inner and outer nuclear membranes.
Thus, the ER and nuclear membranes form a continuous sheet enclosing
a single internal space, called the ER lumen or the ER cisternal space, which often
occupies more than 10% of the total cell volume (see Table 12-1, p. 643).

As mentioned at the beginning of this chapter, the ER has a central role in both
lipid and protein biosynthesis, and it also serves as an intracellular Ca²⁺ store
that is used in many cell signaling responses (discussed in Chapter 15). The ER
membrane is the site of production of all the transmembrane proteins and lipids
for most of the cell's organelles, including the ER itself, the Golgi apparatus,
lysosomes, endosomes, secretory vesicles, and the plasma membrane. The ER
membrane is also the site at which most of the lipids for mitochondrial and
peroxisomal membranes are made. In addition, almost all of the proteins that will be
secreted to the cell exterior, plus those destined for the lumen of the ER, Golgi
apparatus, or lysosomes, are initially delivered to the ER lumen.






Figure 12-31 Fluorescent micrographs of 
the endoplasmic reticulum. (A) An animal
cell in tissue culture that was genetically 
engineered to express an ER membrane 
protein fused to a fluorescent protein. The 
ER extends as a network of tubules and 
sheets throughout the entire cytosol, so that 
all regions of the cytosol are close to some
portion of the ER membrane. The outer 
nuclear membrane, which is continuous 
with the ER, is also stained. (B) Part of 
an ER network in a living plant cell that 
was genetically engineered to express a 
fluorescent protein in the ER. (A, courtesy of 
Patrick Chitwood and Gia Voeltz; B, courtesy 
of Petra Boevink and Chris Hawes.) 
The ER Is Structurally and Functionally Diverse

While the various functions of the ER are essential to every cell, their relative
importance varies greatly between individual cell types. To meet different functional
demands, distinct regions of the ER become highly specialized. We observe
such functional specialization as dramatic changes in ER structure, and different
cell types can therefore possess characteristically different types of ER membrane.
One of the most remarkable ER specializations is the rough ER.

Mammalian cells begin to import most proteins into the ER before complete
synthesis of the polypeptide chain; that is, import is a co-translational process
(Figure 12-32A). In contrast, the import of proteins into mitochondria, chloroplasts,
nuclei, and peroxisomes is a post-translational process (Figure 12-32B). In
co-translational transport, the ribosome that is synthesizing the protein is attached
directly to the ER membrane, enabling one end of the protein to be translocated
into the ER while the rest of the polypeptide chain is being synthesized. These
membrane-bound ribosomes coat the surface of the ER, creating regions termed
rough endoplasmic reticulum, or rough ER; regions of ER that lack bound ribosomes
are called smooth endoplasmic reticulum, or smooth ER (Figure 12-33).

Most cells have scanty regions of smooth ER, and the ER is often partly smooth
and partly rough. Areas of smooth ER from which transport vesicles carrying
newly synthesized proteins and lipids bud off for transport to the Golgi apparatus
are called transitional ER. In certain specialized cells, the smooth ER is abundant
and has additional functions. It is prominent, for example, in cells that specialize
in lipid metabolism, such as cells that synthesize steroid hormones from cholesterol;
the expanded smooth ER accommodates the enzymes that make cholesterol
and modify it to form the hormones (see Figure 12-33B).

The main cell type in the liver, the hepatocyte, also has a substantial amount
of smooth ER. It is the principal site of production of lipoprotein particles, which
carry lipids via the bloodstream to other parts of the body. The enzymes that
synthesize the lipid components of the particles are located in the membrane of
the smooth ER, which also contains enzymes that catalyze a series of reactions
to detoxify both lipid-soluble drugs and various harmful compounds produced
by metabolism. The most extensively studied of these detoxification reactions are
carried out by the cytochrome P450 family of enzymes, which catalyze a series of
reactions in which water-insoluble drugs or metabolites that would otherwise
accumulate to toxic levels in cell membranes are rendered sufficiently water-soluble
to leave the cell and be excreted in the urine. Because the rough ER alone cannot
house enough of these and other necessary enzymes, a substantial portion of
the membrane in a hepatocyte normally consists of smooth ER (see Table 12-2).

Another crucially important function of the ER in most eukaryotic cells is to
sequester Ca²⁺ from the cytosol. The release of Ca²⁺ into the cytosol from the ER,
and its subsequent reuptake, occurs in many rapid responses to extracellular
signals, as discussed in Chapter 15. A Ca²⁺ pump transports Ca²⁺ from the cytosol
into the ER lumen. A high concentration of Ca²⁺-binding proteins in the ER
facilitates Ca²⁺ storage. In some cell types, and perhaps in most, specific regions of
the ER are specialized for Ca²⁺ storage. Muscle cells have an abundant, modified
smooth ER called the sarcoplasmic reticulum. The release and reuptake of Ca²⁺ by
the sarcoplasmic reticulum trigger myofibril contraction and relaxation,
respectively, during each round of muscle contraction (discussed in Chapter 16).

Figure 12-32 Co-translational and post-translational protein translocation.
(A) Ribosomes bind to the ER membrane during co-translational translocation. (B) By
contrast, cytosolic ribosomes complete the synthesis of a protein and release it prior
to post-translational translocation. In both cases, the protein is directed to the ER by
an ER signal sequence (red and orange).

Figure 12-33 The rough and smooth ER. (A) An electron micrograph of the rough ER in a pancreatic exocrine
cell that makes and secretes large amounts of digestive enzymes every day. The cytosol is filled with closely packed sheets
of ER membrane that is studded with ribosomes. At the top left is a portion of the nucleus and its nuclear envelope; note that
the outer nuclear membrane, which is continuous with the ER, is also studded with ribosomes. (B) Abundant smooth ER in a
steroid-hormone-secreting cell. This electron micrograph is of a testosterone-secreting Leydig cell in the human testis.
(C) A three-dimensional reconstruction of a region of smooth ER and rough ER in a liver cell. The rough ER forms oriented
stacks of flattened cisternae, each having a lumenal space 20-30 nm wide. The smooth ER membrane is connected to
these cisternae and forms a fine network of tubules 30-60 nm in diameter. The ER lumen is colored green. (D) A tomographic
reconstruction of a portion of the ER network in a yeast cell. Membrane-bound ribosomes (tiny dark spheres) are seen in both
flat sheets and tubular regions of irregular diameter, demonstrating that the ribosomes bind to ER membranes of different
curvature in these cells. (A, courtesy of Lelio Orci; B, courtesy of Daniel S. Friend; C, after R.V. Krstic, Ultrastructure of the
Mammalian Cell. New York: Springer-Verlag, 1979; D, from M. West et al., J. Cell Biol. 193:333-346, 2011. With permission
from Rockefeller University Press.)

To study the functions and biochemistry of the ER, it is necessary to isolate
it. This may seem to be a hopeless task because the ER is intricately interleaved
with other components of the cytoplasm. Fortunately, when tissues or cells are
disrupted by homogenization, the ER breaks into fragments, which reseal to
form small (~100-200 nm in diameter) closed vesicles called microsomes.
Microsomes are relatively easy to purify. To the biochemist, microsomes represent
small authentic versions of the ER, still capable of protein translocation, protein
glycosylation (discussed later), Ca²⁺ uptake and release, and lipid synthesis.
Microsomes derived from rough ER are studded with ribosomes and are called rough
microsomes. The ribosomes are always found on the outside surface, so the interior
of the microsome is biochemically equivalent to the lumen of the ER (Figure
12-34A).

Many vesicles similar in size to rough microsomes, but lacking attached ribosomes, 
are also found in cell homogenates. Such smooth microsomes are derived 
in part from smooth portions of the ER and in part from vesiculated fragments of 
the plasma membrane, Golgi apparatus, endosomes, and mitochondria (the ratio 
depending on the tissue). Thus, whereas rough microsomes are clearly derived 
from rough portions of ER, it is not easy to separate smooth microsomes derived 
from different organelles. The smooth microsomes prepared from liver or muscle 
cells are an exception. Because of the unusually large quantities of smooth ER or 
sarcoplasmic reticulum, respectively, most of the smooth microsomes in homogenates 
of these tissues are derived from the smooth ER or sarcoplasmic reticulum. 
The ribosomes attached to rough microsomes make them more dense than 
smooth microsomes. As a result, we can use equilibrium centrifugation to separate 
the rough and smooth microsomes (Figure 12-34B). Microsomes have been 
invaluable in elucidating the molecular aspects of ER function, as we discuss next. 


Figure 12-34 The isolation of purified 
rough and smooth microsomes 
from the ER. (A) A thin section electron 
micrograph of the purified rough ER 
fraction shows an abundance of ribosome- 
studded vesicles. (B) When sedimented to 
equilibrium through a gradient of sucrose, 
the two types of microsomes separate from 
each other on the basis of their different 
densities. Note that the smooth fraction will 
also contain non-ER-derived material. 

(A, courtesy of George Palade.) 


Signal Sequences Were First Discovered in Proteins Imported into 
the Rough ER 

The ER captures selected proteins from the cytosol as they are being synthesized. 
These proteins are of two types: transmembrane proteins, which are only partly 
translocated across the ER membrane and become embedded in it, and water-soluble 
proteins, which are fully translocated across the ER membrane and are 
released into the ER lumen. Some of the transmembrane proteins function in the 
ER, but many are destined to reside in the plasma membrane or the membrane of 
another organelle. The water-soluble proteins are destined either for secretion or 
for residence in the lumen of the ER or of another organelle. All of these proteins, 
regardless of their subsequent fate, are directed to the ER membrane by an ER 
signal sequence, which initiates their translocation by a common mechanism. 

Signal sequences (and the signal sequence strategy of protein sorting) were 
first discovered in the early 1970s in secreted proteins that are translocated across 
the ER membrane as a first step toward their eventual discharge from the cell. 
In the key experiment, the mRNA encoding a secreted protein was translated by 
ribosomes in vitro. When microsomes were omitted from this cell-free system, 
the protein synthesized was slightly larger than the normal secreted protein. In 
the presence of microsomes derived from the rough ER, however, a protein of the 
correct size was produced. According to the signal hypothesis, the size difference 
reflects the initial presence of a signal sequence that directs the secreted protein 
Figure 12-35 The signal hypothesis. 

A simplified view of protein translocation 
across the ER membrane, as originally 
proposed. When the ER signal sequence 
emerges from the ribosome, it directs 
the ribosome to a translocator on 
the ER membrane that forms a pore 
in the membrane through which the 
polypeptide is translocated. A signal 
peptidase is closely associated with 
the translocator and clips off the signal 
sequence during translation, and the 
mature protein is released into the lumen 
of the ER immediately after its synthesis 
is completed. The translocator is closed 
until the ribosome has bound, so that the 
permeability barrier of the ER membrane is 
maintained at all times. 


to the ER membrane and is then cleaved off by a signal peptidase in the ER membrane 
before the polypeptide chain has been completed (Figure 12-35). Cell-free 
systems in which proteins are imported into microsomes have provided powerful 
procedures for identifying, purifying, and studying the various components of the 
molecular machinery responsible for the ER import process. 

A Signal-Recognition Particle (SRP) Directs the ER Signal 
Sequence to a Specific Receptor in the Rough ER Membrane 

The ER signal sequence is guided to the ER membrane by at least two components: 
a signal-recognition particle (SRP), which cycles between the ER membrane 
and the cytosol and binds to the signal sequence, and an SRP receptor in 
the ER membrane. The SRP is a large complex; in animal cells, it consists of six 
different polypeptide chains bound to a single small RNA molecule. While the 
SRP and SRP receptor have fewer subunits in bacteria, homologs are present in 
all cells, indicating that this protein-targeting mechanism arose early in evolution 
and has been conserved. 

ER signal sequences vary greatly in amino acid sequence, but each has eight or 
more nonpolar amino acids at its center (see Table 12-3, p. 648). How can the SRP 
bind specifically to so many different sequences? The answer has come from the 
crystal structure of the SRP protein, which shows that the signal-sequence-binding 
site is a large hydrophobic pocket lined by methionines. Because methionines 
have unbranched, flexible side chains, the pocket is sufficiently plastic to accommodate 
hydrophobic signal sequences of different sequences, sizes, and shapes. 
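
The "eight or more nonpolar amino acids" criterion can be illustrated with a minimal sketch that finds the longest uninterrupted run of nonpolar residues in a sequence. The residue set, the function names, and both test sequences are assumptions made for illustration; they are not taken from Table 12-3, and real signal-sequence recognition by the SRP is far less rigid than a simple run count.

```python
# Assumption: one common textbook grouping of nonpolar side chains
# (one-letter codes). Other classifications differ slightly.
NONPOLAR = set("AGVLIPFMWC")


def longest_nonpolar_run(seq):
    """Return (start_index, length) of the longest run of nonpolar residues."""
    best_start, best_len = 0, 0
    run_start, run_len = 0, 0
    for i, aa in enumerate(seq.upper()):
        if aa in NONPOLAR:
            if run_len == 0:
                run_start = i          # a new run begins here
            run_len += 1
            if run_len > best_len:
                best_start, best_len = run_start, run_len
        else:
            run_len = 0                # a polar or charged residue ends the run
    return best_start, best_len


def has_hydrophobic_core(seq, min_run=8):
    """True if the sequence contains at least `min_run` consecutive nonpolar residues."""
    return longest_nonpolar_run(seq)[1] >= min_run


# Hypothetical sequences, invented for this example only.
signal_like = "MKTSLLLVGILFWATEAEQ"
cytosolic_like = "MSDEKRTQNSDEEKR"
```

With these made-up inputs, `has_hydrophobic_core(signal_like)` is true and `has_hydrophobic_core(cytosolic_like)` is false.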

The SRP is a rodlike structure, which wraps around the large ribosomal subunit, 
with one end binding to the ER signal sequence as it emerges from the ribosome 
as part of the newly made polypeptide chain; the other end blocks the elongation 
factor binding site at the interface between the large and small ribosomal subunits 
(Figure 12-36). This block halts protein synthesis as soon as the signal peptide has 
emerged from the ribosome. The transient pause presumably gives the ribosome 
enough time to bind to the ER membrane before completion of the polypeptide 
chain, thereby ensuring that the protein is not released into the cytosol. This safety 
Figure 12-36 The signal-recognition particle (SRP). (A) A mammalian SRP is a rodlike 
ribonucleoprotein complex containing six protein subunits (brown) and one RNA molecule (blue). 
The SRP RNA forms a backbone that links the protein domain containing the signal-sequence- 
binding pocket to the domain responsible for pausing translation. Crystal structures of various 
SRP pieces from different species are assembled here into a composite model to approximate the 
structure of a complete SRP. (B) The three-dimensional outline of the SRP bound to a ribosome 
was determined by cryoelectron microscopy. SRP binds to the large ribosomal subunit so that its 
signal-sequence-binding pocket is positioned near the growing polypeptide chain exit site, and its 
translational pause domain is positioned at the interface between the ribosomal subunits, where it 
interferes with elongation factor binding. (C) As a signal sequence emerges from the ribosome and 
binds to the SRP, a conformational change in the SRP exposes a binding site for the SRP receptor. 
(B, adapted from M. Halic et al., Nature 427:808-814, 2004. With permission from Macmillan 
Publishers Ltd.) 


device may be especially important for secreted and lysosomal hydrolases, which 
could wreak havoc in the cytosol; cells that secrete large amounts of hydrolases, 
however, take the added precaution of having high concentrations of hydrolase 
inhibitors in their cytosol. The pause also ensures that large portions of a protein 
that could fold into a compact structure are not made before reaching the translocator 
in the ER membrane. Thus, in contrast to the post-translational import of 
proteins into mitochondria and chloroplasts, chaperone proteins are not required 
to keep the protein unfolded. 

When a signal sequence binds, SRP exposes a binding site for the SRP receptor 
(see Figure 12-36B,C), which is a transmembrane protein complex in the rough 
ER membrane. The binding of the SRP to its receptor brings the SRP-ribosome 
complex to an unoccupied protein translocator in the same membrane. The SRP 
and SRP receptor are then released, and the translocator transfers the growing 
polypeptide chain across the membrane (Figure 12-37). 

This co-translational transfer process creates two spatially separate populations 
of ribosomes in the cytosol. Membrane-bound ribosomes, attached to the 
Figure 12-37 How ER signal sequences 
and SRP direct ribosomes to the ER 
membrane. The SRP and its receptor 
act in concert. The SRP binds to both 
the exposed ER signal sequence and 
the ribosome, thereby inducing a pause 
in translation. The SRP receptor in the 
ER membrane, which in animal cells is 
composed of two different polypeptide 
chains, binds the SRP-ribosome complex 
and directs it to the translocator. In a 
poorly understood reaction, the SRP and 
SRP receptor are then released, leaving 
the ribosome bound to the translocator 
in the ER membrane. The translocator 
then inserts the polypeptide chain into 
the membrane and transfers it across 
the lipid bilayer. Because one of the SRP 
proteins and both chains of the SRP 
receptor contain GTP-binding domains, 
it is thought that conformational changes 
that occur during cycles of GTP binding 
and hydrolysis (discussed in Chapter 
15) ensure that SRP release occurs only 
after the ribosome has become properly 
engaged with the translocator in the ER 
membrane. The translocator is closed 
until the ribosome has bound, so that the 
permeability barrier of the ER membrane is 
maintained at all times. 
cytosolic side of the ER membrane, are engaged in the synthesis of proteins that 
are being concurrently translocated into the ER. Free ribosomes, unattached to 
any membrane, synthesize all other proteins encoded by the nuclear genome. 
Membrane-bound and free ribosomes are structurally and functionally identical. 
They differ only in the proteins they are making at any given time. 

Since many ribosomes can bind to a single mRNA molecule, a polyribosome 
is usually formed. If the mRNA encodes a protein with an ER signal sequence, 
the polyribosome becomes attached to the ER membrane, directed there by the 
signal sequences on multiple growing polypeptide chains. The individual ribosomes 
associated with such an mRNA molecule can return to the cytosol when 
they finish translation and intermix with the pool of free ribosomes. The mRNA 
itself, however, remains attached to the ER membrane by a changing population 
of ribosomes, each transiently held at the membrane by the translocator (Figure 
12-38). 

The Polypeptide Chain Passes Through an Aqueous Channel in 
the Translocator 

It had long been debated whether polypeptide chains are transferred across the 
ER membrane in direct contact with the lipid bilayer or through a channel in a 
protein translocator. The debate ended with the identification of the translocator, 
which was shown to form a water-filled channel in the membrane through 


[Figure 12-38 labels: (A) mRNA encoding a cytosolic protein remains free in the cytosol (free polyribosome in cytosol); mRNA encoding a protein targeted to the ER remains membrane-bound (ER membrane polyribosome). (B) Polyribosome rosette; scale bar, 400 nm.] 


Figure 12-38 Free and membrane-bound polyribosomes. (A) A common pool of ribosomes synthesizes the proteins that 
stay in the cytosol and those that are transported into the ER. The ER signal sequence on a newly formed polypeptide chain 
binds to SRP, which directs the translating ribosome to the ER membrane. The mRNA molecule remains permanently bound 
to the ER as part of a polyribosome, while the ribosomes that move along it are recycled; at the end of each round of protein 
synthesis, the ribosomal subunits are released and rejoin the common pool in the cytosol. (B) A thin section electron micrograph 
of polyribosomes attached to the ER membrane. The plane of section in some places cuts through the ER roughly parallel to 
the membrane, giving a face-on view of the rosettelike pattern of the polyribosomes. (B, courtesy of George Palade.) 
Figure 12-39 Structure of the Sec61 complex. (A) A side view (left) and a top view (right, seen 
from the cytosol) of the structure of the Sec61 complex of the archaeon Methanococcus jannaschii. 
The Sec61 α subunit has an inverted repeat structure (see Figure 11-10) and is shown in blue and 
beige to indicate this pseudo-symmetry; the two smaller β and γ subunits are shown in gray. In 
the side view, some helices in front have been omitted to make the inside of the pore visible. The 
yellow short helix is thought to form a plug that seals the pore when the translocator is closed. To 
open, the complex rearranges itself to move the plug helix out of the way, as indicated by the red 
arrow. A ring of hydrophobic amino acid side chains is thought to form a tight-fitting diaphragm 
around the translocating polypeptide chain to prevent leaks of other molecules across the membrane. 
The pore of the Sec61 complex can also open sideways at a lateral seam. (B) Models of the closed 
and open states of the translocator are shown in top view, illustrating how a signal sequence (or a 
transmembrane segment) could be released into the lipid bilayer after opening of the seam. (PDB 
codes: 1RH5 and 1RHZ.) 


which the polypeptide chain passes. The core of the translocator, called the Sec61 
complex, is built from three subunits that are highly conserved from bacteria to 
eukaryotic cells. The structure of the Sec61 complex suggests that α helices contributed 
by the largest subunit surround a central channel through which the 
polypeptide chain traverses the membrane (Figure 12-39). The channel is gated 
by a short α helix that is thought to keep the translocator closed when it is idle and 
to move aside when it is engaged in passing a polypeptide chain. According to 
this view, the pore is a dynamic gated channel that opens only transiently when a 
polypeptide chain traverses the membrane. In an idle translocator, it is important 
to keep the channel closed, so that the membrane remains impermeable to ions, 
such as Ca²⁺, which otherwise would leak out of the ER. As a polypeptide chain is 
translocating, a ring of hydrophobic amino acid side chains is thought to provide 
a flexible seal to prevent ion leaks. 

The structure of the Sec61 complex suggests that the pore can also open along 
a seam on its side. Indeed, some structures of the translocator show it locked in an 
open-seam conformation. This opening allows a translocating peptide chain lateral 
access into the hydrophobic core of the membrane, a process that is important 
both for the release of a cleaved signal peptide into the membrane (see Figure 
12-35) and for the integration of transmembrane proteins into the bilayer, as we 
discuss later. 
Figure 12-40 A ribosome (green) bound to the ER protein translocator 
(blue). (A) A side-view reconstruction of the complex from electron 
microscopic images. (B) A view of the translocator seen from the ER lumen. 
The translocator contains Sec61, accessory proteins, and detergent used in 
the preparation. Domains of accessory proteins extend across the membrane 
and form the lumenal bulge. (C) A schematic drawing of a membrane-bound 
ribosome attached to the translocator, indicating the location of the tunnel 
in the large ribosomal subunit through which the growing polypeptide chain 
exits from the ribosome. The mRNA (not shown) would be located between 
the small and large ribosomal subunits. (Adapted from J.F. Ménétret et al., J. 
Mol. Biol. 348:445-457, 2005. With permission from Academic Press.) 


In eukaryotic cells, four Sec61 complexes form a large translocator assembly 
that can be visualized on ER-bound ribosomes after detergent solubilization of 
the ER membrane (Figure 12-40). It is likely that this assembly includes other 
membrane complexes that associate with the translocator, such as enzymes that 
modify the growing polypeptide chain, including oligosaccharide transferase and 
the signal peptidase. The assembly of a translocator with these accessory components 
is called the translocon. 

Translocation Across the ER Membrane Does Not Always Require 
Ongoing Polypeptide Chain Elongation 

As we have seen, translocation of proteins into mitochondria, chloroplasts, and 
peroxisomes occurs post-translationally, after the protein has been made and 
released into the cytosol, whereas translocation across the ER membrane usually 
occurs during translation (co-translationally). This explains why ribosomes are 
bound to the ER but not to other organelles. 

Some completely synthesized proteins, however, are imported into the ER, 
demonstrating that translocation does not always require ongoing translation. 
Post-translational protein translocation is especially common across the yeast 
ER membrane and the bacterial plasma membrane (which is thought to be evolutionarily 
related to the ER). To function in post-translational translocation, the 
ER translocator needs accessory proteins that feed the polypeptide chain into the 
pore and drive translocation (Figure 12-41). In bacteria, a translocation motor 
protein, the SecA ATPase, attaches to the cytosolic side of the translocator, where it 
undergoes cyclic conformational changes driven by ATP hydrolysis. Each time an 
ATP is hydrolyzed, a portion of the SecA protein inserts into the pore of the translocator, 
pushing a short segment of the passenger protein with it. As a result of this 
ratchet mechanism, the SecA ATPase progressively pushes the polypeptide chain 
of the transported protein across the membrane. 
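
As a rough back-of-the-envelope illustration of this ratchet, the legend to Figure 12-41 gives about 20 amino acids pushed through the pore per SecA cycle. The sketch below estimates how many cycles a chain of a given length would need. The function name is hypothetical, the step size is only the approximate figure from the legend, and treating every cycle as a full, identical step is a simplifying assumption.

```python
import math


def estimated_seca_cycles(protein_length, residues_per_cycle=20):
    """Estimate the number of SecA push cycles needed to translocate a chain.

    Assumption: each cycle moves a fixed number of residues (about 20,
    per the Figure 12-41 legend); the real step size is approximate.
    """
    if protein_length < 0 or residues_per_cycle <= 0:
        raise ValueError("length must be >= 0 and step size must be > 0")
    return math.ceil(protein_length / residues_per_cycle)
```

Under these assumptions a 300-residue protein would need 15 cycles, and a 310-residue protein 16.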

Eukaryotic cells use a different set of accessory proteins that associate with the 
Sec61 complex. These proteins span the ER membrane and use a small domain on 
the lumenal side of the ER membrane to deposit an hsp70-like chaperone protein 
(called BiP, for binding protein) onto the polypeptide chain as it emerges from the 
pore into the ER lumen. ATP-dependent cycles of BiP binding and release drive 
unidirectional translocation, as described earlier for the mitochondrial hsp70 
proteins that pull proteins across mitochondrial membranes. 

Proteins that are transported into the ER by a post-translational mechanism 
are first released into the cytosol, where they bind to chaperone proteins to prevent 
folding, as discussed earlier for proteins destined for mitochondria and chloroplasts. 
In Single-Pass Transmembrane Proteins, a Single Internal ER 
Signal Sequence Remains in the Lipid Bilayer as a Membrane-Spanning α Helix 

The ER signal sequence in the growing polypeptide chain is thought to trigger the 
opening of the pore in the Sec61 protein translocator: after the signal sequence is 
released from the SRP and the growing chain has reached a sufficient length, the 
Figure 12-41 Three ways in which protein translocation can be driven through structurally similar translocators. 

(A) Co-translational translocation. The ribosome is brought to the membrane by the SRP and SRP receptor and then engages 
with the Sec61 protein translocator. The growing polypeptide chain is threaded across the membrane as it is made. No 
additional energy is needed, as the only path available to the growing chain is to cross the membrane. (B) Post-translational 
translocation in eukaryotic cells requires an additional complex composed of Sec62, Sec63, Sec71, and Sec72 proteins, 
which is attached to the Sec61 translocator and deposits BiP molecules onto the translocating chain as it emerges from the 
translocator in the lumen of the ER. ATP-driven cycles of BiP binding and release pull the protein into the lumen, a mechanism 
that closely resembles the mechanism of mitochondrial import in Figure 12-23. (C) Post-translational translocation in bacteria. 
The completed polypeptide chain is fed from the cytosolic side into the bacterial homolog of the Sec61 complex (called the 
SecY complex in bacteria) in the plasma membrane by the SecA ATPase. ATP hydrolysis-driven conformational changes drive 
a pistonlike motion in SecA, each cycle pushing about 20 amino acids of the protein chain through the pore of the translocator. 
The Sec pathway used for protein translocation across the thylakoid membrane in chloroplasts uses a similar mechanism (see 
Figure 12-26B). 

Whereas the Sec61 translocator, SRP, and SRP receptor are found in all organisms, SecA is found exclusively in bacteria, and 
the Sec62, 63, 71, 72 complex is found exclusively in eukaryotic cells. (Adapted from P. Walter and A.E. Johnson, Annu. Rev. 
Cell Biol. 10:87-119, 1994. With permission from Annual Reviews.) 


signal sequence binds to a specific site inside the pore itself, thereby opening the 
pore. An ER signal sequence is therefore recognized twice: first by an SRP in the 
cytosol and then by a binding site in the pore of the protein translocator, where it 
serves as a start-transfer signal (or start-transfer peptide) that opens the pore (for 
example, see Figure 12-35 for how this works for a soluble protein). Dual recognition 
may help ensure that only appropriate proteins enter the lumen of the ER. 

While bound in the translocation pore, a signal sequence is in contact not only 
with the Sec61 complex, which forms the walls of the pore, but also, along the lateral 
seam, with the hydrophobic core of the lipid bilayer. This was shown in chemical 
cross-linking experiments in which the signal sequence and the hydrocarbon 
chains of lipids were covalently linked together. When the nascent polypeptide 
chain grew long enough, the ER signal peptidase cleaved off the signal sequence 
and released it from the pore into the membrane, where it was rapidly degraded 
to amino acids by other proteases in the ER membrane. To release the signal 
sequence into the membrane, the translocator opens laterally along the seam (see 
Figures 12-35 and 12-39). The translocator is therefore gated in two directions: 
it opens to form a pore across the membrane to let the hydrophilic portions of 
proteins cross the lipid bilayer, and it opens laterally within the membrane to let 
hydrophobic portions of proteins partition into the lipid bilayer. Lateral gating of 
the pore is an essential step during the integration of transmembrane proteins. 

The integration of membrane proteins requires that some parts of the polypeptide 
chain be translocated across the lipid bilayer whereas others are not. 
Despite this additional complexity, all modes of insertion of membrane proteins 
are simply variants of the sequence of events just described for transferring a soluble 
protein into the lumen of the ER. We begin by describing the three ways in 
Figure 12-42 How a single-pass 
transmembrane protein with a cleaved 
ER signal sequence is integrated into 
the ER membrane. In this protein, the 
co-translational translocation process 
is initiated by an N-terminal ER signal 
sequence (red) that functions as a start- 
transfer signal, opening the translocator as 
in Figure 12-35. In addition to this start- 
transfer sequence, however, the protein 
also contains a stop-transfer sequence 
(orange); when this sequence enters the 
translocator and interacts with a binding 
site within the pore, the translocator opens 
at the seam and discharges the protein 
laterally into the lipid bilayer, where the 
stop-transfer sequence remains to anchor 
the protein in the membrane. (In this 
figure and the two figures that follow, the 
ribosomes have been omitted for clarity.) 


which single-pass transmembrane proteins (see Figure 10-17) become inserted 
into the ER membrane. 

In the simplest case, an N-terminal signal sequence initiates translocation, just 
as for a soluble protein, but an additional hydrophobic segment in the polypeptide 
chain stops the transfer process before the entire polypeptide chain is translocated. 
This stop-transfer signal anchors the protein in the membrane after the 
ER signal sequence (the start-transfer signal) has been cleaved off and released 
from the translocator (Figure 12-42). The lateral gating mechanism transfers 
the stop-transfer sequence into the bilayer, where it remains as a single α-helical 
membrane-spanning segment, with the N-terminus of the protein on the lumenal 
side of the membrane and the C-terminus on the cytosolic side. 

In the other two cases, the signal sequence is internal, rather than at the N-terminal 
end of the protein. As for an N-terminal ER signal sequence, the SRP binds 
to an internal signal sequence by recognizing its hydrophobic α-helical features. 
The SRP brings the ribosome making the protein to the ER membrane, and the 
ER signal sequence then serves as a start-transfer signal that initiates the protein's 
translocation. After release from the translocator, the internal start-transfer 
sequence remains in the lipid bilayer as a single membrane-spanning α helix. 

Internal start-transfer sequences can bind to the translocation apparatus in 
either of two orientations; this in turn determines which protein segment (the one 
preceding or the one following the start-transfer sequence) is moved across the 
membrane into the ER lumen. In one case, the resulting membrane protein has its 
C-terminus on the lumenal side (pathway A in Figure 12-43), while in the other, it 
has its N-terminus on the lumenal side (pathway B in Figure 12-43). The orientation 
of the start-transfer sequence depends on the distribution of nearby charged 
amino acids, as described in the figure legend. 
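
The charge rule stated in the legend to Figure 12-43 can be written as a short sketch: count the positively charged residues immediately before and immediately after the hydrophobic core, and pick the pathway accordingly. The function name, the choice to count only lysine and arginine, the five-residue flanking window, and the test sequences are all assumptions made for illustration; the legend says only "immediately preceding" and "immediately following."

```python
# Assumption: only Lys (K) and Arg (R) are counted as positively charged.
POSITIVE = set("KR")


def predict_pathway(seq, core_start, core_end, flank=5):
    """Predict the insertion pathway for an internal start-transfer sequence.

    seq        -- protein sequence, one-letter codes
    core_start -- index of the first residue of the hydrophobic core
    core_end   -- index one past the last residue of the core
    flank      -- residues inspected on each side (an assumed window size)

    Returns "A" (C-terminus in the ER lumen), "B" (N-terminus in the
    ER lumen), or None when the flanking charges are equal and the
    rule gives no answer.
    """
    before = seq[max(0, core_start - flank):core_start]
    after = seq[core_end:core_end + flank]
    n_before = sum(aa in POSITIVE for aa in before.upper())
    n_after = sum(aa in POSITIVE for aa in after.upper())
    if n_before > n_after:
        return "A"   # more positive charge preceding the core
    if n_after > n_before:
        return "B"   # more positive charge following the core
    return None


# Hypothetical sequences: an 18-leucine core with different flanks.
core = "L" * 18
seq_a = "MSKRKR" + core + "DESTQ"   # positive residues precede the core
seq_b = "MSDETQ" + core + "KRKRS"   # positive residues follow the core
```

For these invented sequences the core spans indices 6 to 24, so `predict_pathway(seq_a, 6, 24)` returns "A" and `predict_pathway(seq_b, 6, 24)` returns "B".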

Combinations of Start-Transfer and Stop-Transfer Signals 
Determine the Topology of Multipass Transmembrane Proteins 

In multipass transmembrane proteins, the polypeptide chain passes back and 
forth repeatedly across the lipid bilayer as hydrophobic α helices (see Figure 
10-17). It is thought that an internal signal sequence serves as a start-transfer 
signal in these proteins to initiate translocation, which continues until the translocator 
encounters a stop-transfer sequence; in double-pass transmembrane 
proteins, for example, the polypeptide can then be released into the bilayer 
Figure 12-43 Integration of a single- 
pass transmembrane protein with 
an internal signal sequence into the 
ER membrane. An internal ER signal 
sequence that functions as a start-transfer 
signal can bind to the translocator in one 
of two ways, leading to a membrane 
protein that has either its C-terminus 
(pathway A) or its N-terminus (pathway 
B) in the ER lumen. Proteins are directed 
into either pathway by features in the 
polypeptide chain flanking the internal 
start-transfer sequence: if there are 
more positively charged amino acids 
immediately preceding the hydrophobic 
core of the start-transfer sequence than 
there are following it, the membrane 
protein is inserted into the translocator 
in the orientation shown in pathway 
A, whereas if there are more positively 
charged amino acids immediately following 
the hydrophobic core of the start-transfer 
sequence than there are preceding it, 
the membrane protein is inserted into 
the translocator in the orientation shown 
in pathway B. Because translocation 
cannot start before a start-transfer 
sequence appears outside the ribosome, 
translocation of the N-terminal portion of 
the protein shown in (B) can occur only 
after this portion has been fully synthesized. 

Note that there are two ways to insert a 
single-pass membrane-spanning protein 
whose N-terminus is located in the ER 
lumen: that shown in Figure 12-42 and that 
shown here in (B). 


(Figure 12-44). In more complex multipass proteins, in which many hydrophobic 
α helices span the bilayer, a second start-transfer sequence reinitiates 
translocation further down the polypeptide chain until the next stop-transfer 
sequence causes polypeptide release, and so on for subsequent start-transfer and 
stop-transfer sequences (Figure 12-45 and Movie 12.5). 

Hydrophobic start-transfer and stop-transfer signal sequences both act to fix 
the topology of the protein in the membrane by locking themselves into the membrane 
as membrane-spanning α helices; and they can do this in either orientation. 
Whether a given hydrophobic signal sequence functions as a start-transfer 
or stop-transfer sequence must depend on its location in a polypeptide chain, 
since its function can be switched by changing its location in the protein by using 
recombinant DNA techniques. Thus, the distinction between start-transfer and 
stop-transfer sequences results mostly from their relative order in the growing 
polypeptide chain. It seems that the SRP begins scanning an unfolded polypeptide 
chain for hydrophobic segments at its N-terminus and proceeds toward the 
C-terminus, in the direction that the protein is synthesized. By recognizing the 
first appropriate hydrophobic segment to emerge from the ribosome, the SRP 
sets the “reading frame” for membrane integration: after the SRP initiates translocation, 
the translocator recognizes the next appropriate hydrophobic segment 
in the direction of transfer as a stop-transfer sequence, causing the region of the 
polypeptide chain in between to be threaded across the membrane. A similar 
Figure 12-44 Integration of a double- 
pass transmembrane protein with an 
internal signal sequence into the ER 
membrane. In this protein, an internal ER 
signal sequence acts as a start-transfer 
signal (as in Figure 12-43) and initiates 
the transfer of the C-terminal part of the 
protein. At some point after a stop-transfer 
sequence has entered the translocator, 
the translocator discharges the sequence 
laterally into the membrane. 


scanning process continues until all of the hydrophobic regions in the protein 
have been inserted into the membrane as transmembrane α helices. 
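
The N-to-C scanning logic described above can be sketched as a simple hydropathy scan: compute a sliding-window hydropathy profile, collect the regions that exceed a threshold, and label them alternately as start-transfer and stop-transfer sequences in the order they appear. This is an illustration only. The Kyte-Doolittle hydropathy scale, the 19-residue window, the 1.6 threshold, and all function names are assumptions brought in for the example rather than details from the text, and strict alternation ignores cases such as a cleaved N-terminal signal sequence.

```python
# Kyte-Doolittle hydropathy values (an assumed choice of scale).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=19):
    """Mean hydropathy in a window centered on each residue.

    Windows are truncated at the ends of the sequence. Residues
    outside the 20 standard amino acids raise KeyError.
    """
    scores = [KD[aa] for aa in seq.upper()]
    half = window // 2
    profile = []
    for i in range(len(scores)):
        lo, hi = max(0, i - half), min(len(scores), i + half + 1)
        profile.append(sum(scores[lo:hi]) / (hi - lo))
    return profile


def hydrophobic_segments(seq, window=19, threshold=1.6):
    """Return (start, end) index pairs where the profile is >= threshold.

    The pairs mark window centers, so they approximate the middle of
    each hydrophobic stretch rather than its exact boundaries.
    """
    segments, start = [], None
    for i, value in enumerate(hydropathy_profile(seq, window)):
        if value >= threshold and start is None:
            start = i
        elif value < threshold and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(seq)))
    return segments


def assign_signals(segments):
    """Label segments alternately, N-terminus first: start, stop, start, ..."""
    roles = ("start-transfer", "stop-transfer")
    return [(seg, roles[k % 2]) for k, seg in enumerate(segments)]


# Hypothetical double-pass protein: two 20-leucine stretches in a
# strongly hydrophilic (aspartate) background.
toy = "D" * 30 + "L" * 20 + "D" * 30 + "L" * 20 + "D" * 30
```

For this toy sequence the scan finds two hydrophobic segments, the first labeled start-transfer and the second stop-transfer, which matches the double-pass arrangement of Figure 12-44.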

Because membrane proteins are always inserted from the cytosolic side of the 
ER in this programmed manner, all copies of the same polypeptide chain will have 
the same orientation in the lipid bilayer. This generates an asymmetrical ER membrane 
in which the protein domains exposed on one side are different from those 
exposed on the other side. This asymmetry is maintained during the many membrane 
budding and fusion events that transport the proteins made in the ER to 
other cell membranes (discussed in Chapter 13). Thus, the way in which a newly 
synthesized protein is inserted into the ER membrane determines the orientation 
of the protein in all of the other membranes as well. 

When proteins are extracted with detergent from a membrane and then reconstituted 
into artificial lipid vesicles, a random mixture of right-side-out and inside- 
out protein orientations usually results. Thus, the protein asymmetry observed in 
cell membranes seems not to be an inherent property of the proteins, but instead 
results solely from the process by which proteins are inserted into the ER membrane 
from the cytosol. 
Figure 12-45 The insertion of the 
multipass membrane protein rhodopsin 
into the ER membrane. Rhodopsin is the 
light-sensitive protein in rod photoreceptor 
cells in the mammalian retina (discussed 
in Chapter 15). (A) A hydropathy plot 
(see Figure 10-20) identifies seven short 
hydrophobic regions in rhodopsin. (B) The 
hydrophobic region nearest the N-terminus 
serves as a start-transfer sequence that 
causes the preceding N-terminal portion 
of the protein to pass across the ER 
membrane. Subsequent hydrophobic 
sequences function in alternation as start- 
transfer and stop-transfer sequences. The 
green arrows indicate the paired start and 
stop signals inserted into the translocator. 
(C) The final integrated rhodopsin has its 
N-terminus located in the ER lumen and its 
C-terminus located in the cytosol. The blue 
hexagons represent covalently attached 
oligosaccharides. 
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ER Tail-anchored Proteins Are Integrated into the ER Membrane 
by a Spécial Mechanism 

Many important membrane proteins are anchored in the membrane by a
C-terminal transmembrane, hydrophobic α helix. These ER tail-anchored proteins
include a large number of SNARE protein subunits that guide vesicular traffic
(discussed in Chapter 13). When such a tail-anchored protein inserts into the ER
membrane from the cytosol, only a few amino acids that follow the transmembrane
α helix on its C-terminal side are translocated into the ER lumen, while
most of the protein remains in the cytosol. Because of the unique position of the
transmembrane α helix in the protein sequence, translation terminates while the
C-terminal amino acids that will form the transmembrane α helix have not yet
emerged from the ribosome exit tunnel. Recognition by SRP is therefore not
possible. It was long thought that these proteins are released from the ribosome and
the hydrophobic C-terminal tail spontaneously partitions into the ER membrane.
Such a mechanism could not explain, however, why ER tail-anchored proteins
insert into the ER membrane selectively and not also into all other membranes
in the cell. It is now clear that a specialized targeting machinery is involved that
is fueled by ATP hydrolysis (Figure 12-46). Although the components and details
differ, this post-translational targeting mechanism is conceptually similar to
SRP-dependent protein targeting (see Figure 12-37).
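
The reasoning about why SRP cannot recognize a tail anchor can be written as a simple positional rule. The sketch below is only an illustration under stated assumptions: the tunnel capacity `TUNNEL_RESIDUES` is an assumed round number that does not come from the text, and the function `targeting_route` is hypothetical. It ignores every other determinant of targeting.

```python
# Assumed, illustrative value: the number of residues hidden inside the
# ribosome exit tunnel. It is not given in the text.
TUNNEL_RESIDUES = 35


def targeting_route(protein_length, helix_end):
    """Guess the ER targeting route from where the hydrophobic helix ends.

    helix_end is the 1-based position of the last residue of the helix.
    """
    residues_after_helix = protein_length - helix_end
    if residues_after_helix < TUNNEL_RESIDUES:
        # Translation terminates before the helix has left the tunnel,
        # so SRP never sees it on a translating ribosome.
        return "post-translational"
    return "co-translational (SRP)"


print(targeting_route(protein_length=120, helix_end=115))  # tail anchor
print(targeting_route(protein_length=400, helix_end=60))   # internal helix
```

The point of the rule is the comparison, not the particular threshold: a helix followed by only a few residues is still buried in the ribosome when translation ends.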

Not all tail-anchored proteins are inserted into the ER, however. Some
proteins contain a C-terminal membrane anchor that contains additional sorting
information that directs the protein to mitochondria or peroxisomes. How these
proteins are sorted there remains unknown.

Translocated Polypeptide Chains Fold and Assemble in the Lumen 
of the Rough ER 

Many of the proteins in the lumen of the ER are in transit, en route to other
destinations; others, however, normally reside there and are present at high
concentrations. These ER resident proteins contain an ER retention signal of four amino
acids at their C-terminus that is responsible for retaining the protein in the ER (see
Table 12-3, p. 648; discussed in Chapter 13). Some of these proteins function as
catalysts that help the many proteins that are translocated into the ER lumen to
fold and assemble correctly.

One important ER resident protein is protein disulfide isomerase (PDI), which
catalyzes the oxidation of free sulfhydryl (SH) groups on cysteines to form
disulfide (S-S) bonds. Almost all cysteines in protein domains exposed to either the
extracellular space or the lumen of organelles in the secretory and endocytic
pathways are disulfide-bonded. By contrast, disulfide bonds form only very rarely
in domains exposed to the cytosol, because of the reducing environment there.

Figure 12-46 The insertion mechanism for tail-anchored proteins. In this
post-translational pathway for the insertion of tail-anchored ER membrane proteins, a
soluble pre-targeting complex captures the hydrophobic C-terminal α helix after
it emerges from the ribosomal exit tunnel and loads it onto the Get3 ATPase. The
resulting complex is targeted to the ER membrane by interaction with the
Get1-Get2 receptor complex that functions as a membrane protein insertion machine.
After Get3 hydrolyzes its bound ATP, the tail-anchored protein is released from
the receptor and inserted into the ER membrane. ADP release and renewed ATP
binding recycle Get3 back to the cytosol.

Another ER resident protein is the chaperone protein BiP. We have already
discussed how BiP pulls proteins post-translationally into the ER through the Sec61
ER translocator. Like other chaperones (discussed in Chapter 13), BiP recognizes
incorrectly folded proteins, as well as protein subunits that have not yet
assembled into their final oligomeric complexes. It does so by binding to exposed amino
acid sequences that would normally be buried in the interior of correctly folded
or assembled polypeptide chains. An example of a BiP-binding site is a stretch
of alternating hydrophobic and hydrophilic amino acids that would normally be
buried in a β sheet with its hydrophobic side oriented towards the hydrophobic
core of the folded protein. The bound BiP both prevents the protein from
aggregating and helps keep it in the ER (and thus out of the Golgi apparatus and later
parts of the secretory pathway). Like some other members of the hsp70 family of
chaperone proteins, which bind unfolded proteins and facilitate their import into
mitochondria and chloroplasts, BiP hydrolyzes ATP to shuttle between high- and
low-affinity binding states, which allow it to hold on to and let go of its substrate
proteins in a dynamic cycle.

Most Proteins Synthesized in the Rough ER Are Glycosylated by
the Addition of a Common N-Linked Oligosaccharide

The covalent addition of oligosaccharides to proteins is one of the major
biosynthetic functions of the ER. About half of the soluble and membrane-bound
proteins that are processed in the ER (including those destined for transport to
the Golgi apparatus, lysosomes, plasma membrane, or extracellular space) are
glycoproteins that are modified in this way. Many proteins in the cytosol and
nucleus are also glycosylated, but not with oligosaccharides: they carry a much
simpler sugar modification, in which a single N-acetylglucosamine group is
added to a serine or threonine of the protein.

During the most common form of protein glycosylation in the ER, a
preformed precursor oligosaccharide (composed of N-acetylglucosamine, mannose,
and glucose, and containing a total of 14 sugars) is transferred en bloc to proteins.
Because this oligosaccharide is transferred to the side-chain NH2 group of an
asparagine in the protein, it is said to be N-linked or asparagine-linked (Figure
12-47A). The transfer is catalyzed by a membrane-bound enzyme complex, an
oligosaccharyl transferase, which has its active site exposed on the lumenal side
of the ER membrane; this explains why cytosolic proteins are not glycosylated
in this way. A special lipid molecule called dolichol anchors the precursor
oligosaccharide in the ER membrane. The precursor oligosaccharide is transferred
to the target asparagine in a single enzymatic step immediately after that amino
acid has reached the ER lumen during protein translocation. The precursor
oligosaccharide is linked to the dolichol lipid by a high-energy pyrophosphate bond,
which provides the activation energy that drives the glycosylation reaction (Figure
12-47B). One copy of oligosaccharyl transferase is associated with each protein
translocator, allowing it to scan and glycosylate the incoming polypeptide chains
efficiently.

Figure 12-47 N-linked protein glycosylation in the rough ER. (A) Almost
as soon as a polypeptide chain enters the ER lumen, it is glycosylated on target
asparagine amino acids. The precursor oligosaccharide (shown in color) is attached
only to asparagines in the sequences Asn-X-Ser and Asn-X-Thr (where X is any amino
acid except proline). These sequences occur much less frequently in glycoproteins
than in nonglycosylated cytosolic proteins. Evidently there has been selective
pressure against these sequences during protein evolution, presumably because
glycosylation at too many sites would interfere with protein folding. The five sugars
in the gray box form the core region of this oligosaccharide. For many glycoproteins,
only the core sugars survive the extensive oligosaccharide trimming that takes place
in the Golgi apparatus. (B) The precursor oligosaccharide is transferred from a
dolichol lipid anchor to the asparagine as an intact unit in a reaction catalyzed by a
transmembrane oligosaccharyl transferase enzyme. One copy of this enzyme is
associated with each protein translocator in the ER membrane. (The translocator is
not shown.) Oligosaccharyl transferase contains 13 transmembrane α helices and
a large ER lumenal domain that contains its substrate-binding sites. The asparagine
binds a tunnel that penetrates the enzyme interior. There, the amino group of the
asparagine is twisted out of the plane that stabilizes the otherwise poorly reactive
amide bond, activating it for reaction with the dolichol-oligosaccharide. The structure
shown is of a prokaryotic homolog that closely resembles the catalytic subunit of
the eukaryotic oligosaccharyl transferase. (PDB code: 3RCE.)
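
The sequence rule for N-linked glycosylation (Asn-X-Ser or Asn-X-Thr, where X is any amino acid except proline) is simple enough to scan for directly. The following Python sketch lists candidate asparagines in a one-letter protein sequence. The function name `find_sequons` and the example sequence are made up for illustration, and a match is only a candidate site: the rule says where the oligosaccharide can be attached, not that every such site is used.

```python
import re

# Asn, then any residue except Pro, then Ser or Thr. The lookahead lets
# overlapping sites (for example N-N-S-S) all be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein):
    """Return 1-based positions of asparagines in Asn-X-Ser/Thr sequences."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


print(find_sequons("MNGSANPTNKT"))  # prints [2, 9]
```

In the example, the asparagine at position 6 is skipped because it is followed by proline.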

The precursor oligosaccharide is built up sugar by sugar on the
membrane-bound dolichol lipid and is then transferred to a protein. The sugars are
first activated in the cytosol by the formation of nucleotide (UDP or GDP)-sugar
intermediates, which then donate their sugar (directly or indirectly) to the lipid in
an orderly sequence. Part way through this process, the lipid-linked
oligosaccharide is flipped, with the help of a transporter, from the cytosolic to the lumenal
side of the ER membrane (Figure 12-48).

All of the diversity of the N-linked oligosaccharide structures on mature
glycoproteins results from the later modification of the original precursor
oligosaccharide. While still in the ER, three glucoses (see Figure 12-47) and one mannose are
quickly removed from the oligosaccharides of most glycoproteins. We shall return
to the importance of glucose trimming shortly. This oligosaccharide “trimming,”
or “processing,” continues in the Golgi apparatus, as we discuss in Chapter 13.

The N-linked oligosaccharides are by far the most common oligosaccharides,
being found on 90% of all glycoproteins. Less frequently, oligosaccharides are
linked to the hydroxyl group on the side chain of a serine, threonine, or
hydroxylysine amino acid. A first sugar of these O-linked oligosaccharides is added in the
ER and the oligosaccharide is then further extended in the Golgi apparatus (see
Figure 13-32).

Figure 12-48 Synthesis of the lipid-linked precursor oligosaccharide
in the rough ER membrane. The oligosaccharide is assembled sugar by
sugar onto the carrier lipid dolichol (a polyisoprenoid; see Panel 2-5, pp. 98-99).
Dolichol is long and very hydrophobic: its 22 five-carbon units can span the thickness
of a lipid bilayer more than three times, so that the attached oligosaccharide is firmly
anchored in the membrane. The first sugar is linked to dolichol by a pyrophosphate
bridge. This high-energy bond activates the oligosaccharide for its eventual transfer
from the lipid to an asparagine side chain of a growing polypeptide on the lumenal
side of the rough ER. As indicated, the synthesis of the oligosaccharide starts on
the cytosolic side of the ER membrane and continues on the lumenal face after
the (Man)5(GlcNAc)2 lipid intermediate is flipped across the bilayer by a transporter
(which is not shown). All the subsequent glycosyl transfer reactions on the lumenal
side of the ER involve transfers from dolichol-P-glucose and
dolichol-P-mannose; these activated, lipid-linked monosaccharides are synthesized from
dolichol phosphate and UDP-glucose or GDP-mannose (as appropriate) on the
cytosolic side of the ER and are then flipped across the ER membrane. GlcNAc
= N-acetylglucosamine; Man = mannose; Glc = glucose.

Oligosaccharides Are Used as Tags to Mark the State of Protein
Folding

It has long been debated why glycosylation is such a common modification of
proteins that enter the ER. One particularly puzzling observation has been that some
proteins require N-linked glycosylation for proper folding in the ER, yet the precise
location of the oligosaccharides attached to the protein's surface does not seem to
matter. A clue to the role of glycosylation in protein folding came from studies of
two ER chaperone proteins, which are called calnexin and calreticulin because
they require Ca2+ for their activities. These chaperones are carbohydrate-binding
proteins, or lectins, which bind to oligosaccharides on incompletely folded
proteins and retain them in the ER. Like other chaperones, they prevent incompletely
folded proteins from irreversibly aggregating. Both calnexin and calreticulin also
promote the association of incompletely folded proteins with another ER
chaperone, which binds to cysteines that have not yet formed disulfide bonds.

Calnexin and calreticulin recognize N-linked oligosaccharides that contain
a single terminal glucose, and they therefore bind proteins only after two of the
three glucoses on the precursor oligosaccharide have been removed during
glucose trimming by ER glucosidases. When the third glucose has been removed, the
glycoprotein dissociates from its chaperone and can leave the ER.

How, then, do calnexin and calreticulin distinguish properly folded from
incompletely folded proteins? The answer lies in yet another ER enzyme, a
glucosyl transferase that keeps adding a glucose to those oligosaccharides that have
lost their last glucose. It adds the glucose, however, only to oligosaccharides that
are attached to unfolded proteins. Thus, an unfolded protein undergoes
continuous cycles of glucose trimming (by glucosidase) and glucose addition (by
glucosyl transferase), maintaining an affinity for calnexin and calreticulin until it has
achieved its fully folded state (Figure 12-49).
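
The trimming and re-addition cycle behaves like a small loop with a single exit condition, which the following Python sketch makes explicit. It is a toy model under stated assumptions: folding is treated as complete after a fixed number of rounds, and the parameter `cycles_needed_to_fold` is a hypothetical stand-in for folding time rather than anything measured.

```python
def calnexin_cycle(cycles_needed_to_fold):
    """Count chaperone-binding rounds before a glycoprotein may leave the ER."""
    glucoses = 3          # the precursor oligosaccharide carries three glucoses
    glucoses -= 2         # ER glucosidases trim two, leaving one terminal glucose
    rounds = 0
    while glucoses == 1:  # a single terminal glucose is what the lectins bind
        rounds += 1       # bound to calnexin or calreticulin, retained in the ER
        glucoses -= 1     # glucosidase removes the last glucose: protein released
        if rounds < cycles_needed_to_fold:
            glucoses += 1  # still unfolded: glucosyl transferase adds a glucose back
    return rounds         # no glucose left and folded: free to leave the ER


print(calnexin_cycle(4))  # prints 4
```

The only decision in the loop is the one made by the glucosyl transferase, which is why that enzyme acts as the folding sensor in this scheme.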

Improperly Folded Proteins Are Exported from the ER and 
Degraded in the Cytosol 

Despite all the help from chaperones, many protein molecules (more than 80%
for some proteins) translocated into the ER fail to achieve their properly folded
or oligomeric state. Such proteins are exported from the ER back into the
cytosol, where they are degraded in proteasomes (discussed in Chapter 6). In many
ways, the mechanism of retrotranslocation is similar to other post-translational
modes of translocation. For example, as in translocation into mitochondria or
chloroplasts, chaperone proteins are necessary to keep the polypeptide chain in
an unfolded state prior to and during translocation. Similarly, a source of energy is
required to provide directionality to the transport and to pull the protein into the
cytosol. Finally, a translocator is necessary.

Figure 12-49 The role of N-linked glycosylation in ER protein folding.
The ER-membrane-bound chaperone protein calnexin binds to incompletely
folded proteins containing one terminal glucose on N-linked oligosaccharides,
trapping the protein in the ER. Removal of the terminal glucose by a glucosidase
releases the protein from calnexin. A glucosyl transferase is the crucial enzyme
that determines whether the protein is folded properly or not: if the protein is still
incompletely folded, the enzyme transfers a new glucose from UDP-glucose to
the N-linked oligosaccharide, renewing the protein's affinity for calnexin and
retaining it in the ER. The cycle repeats until the protein has folded completely.
Calreticulin functions similarly, except that it is a soluble ER resident protein. Another
ER chaperone, ERp57 (not shown), collaborates with calnexin and calreticulin
in retaining an incompletely folded protein in the ER. ERp57 recognizes free sulfhydryl
groups, which are a sign of incomplete disulfide bond formation.

Selecting proteins from the ER for degradation is a challenging process:
misfolded proteins or unassembled protein subunits should be degraded, but folding
intermediates of newly made proteins should not. Help in making this distinction
comes from the N-linked oligosaccharides, which serve as timers that measure
how long a protein has spent in the ER. The slow trimming of a particular
mannose on the core oligosaccharide tree by an enzyme (a mannosidase) in the ER
creates a new oligosaccharide structure that ER-lumenal lectins of the
retrotranslocation apparatus recognize. Proteins that fold and exit from the ER faster than
the mannosidase can remove its target mannose therefore escape degradation.
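
The timer idea amounts to a race between two events: folding and exit on one side, mannose trimming on the other. The Python sketch below estimates how often a protein wins that race. The kinetic model is an assumption made purely for illustration: both events are treated as independent processes with exponentially distributed waiting times, and the rate constants `k_fold` and `k_trim` are hypothetical parameters, not measured values.

```python
import random


def escapes_degradation(fold_time, trim_time):
    """A protein escapes if it folds and exits before its mannose is trimmed."""
    return fold_time < trim_time


def escape_fraction(k_fold, k_trim, n=100_000, seed=0):
    """Monte Carlo estimate of the fraction of proteins that escape.

    Under the exponential assumption the exact answer is
    k_fold / (k_fold + k_trim), which the estimate should approach.
    """
    rng = random.Random(seed)
    wins = sum(
        escapes_degradation(rng.expovariate(k_fold), rng.expovariate(k_trim))
        for _ in range(n)
    )
    return wins / n


print(escape_fraction(k_fold=3.0, k_trim=1.0))  # close to 0.75
```

In this picture a slow mannosidase (small `k_trim`) gives newly made proteins a generous window in which to fold, while persistently misfolded proteins eventually lose the race.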

In addition to the lectins in the ER that recognize the oligosaccharides,
chaperones and protein disulfide isomerases (enzymes mentioned earlier that catalyze
the formation and breakage of S-S bonds) associate with the proteins that must be
degraded. The chaperones prevent the unfolded proteins from aggregating, and
the disulfide isomerases break disulfide bonds that may have formed incorrectly,
so that a linear polypeptide chain can be translocated back into the cytosol.

Multiple translocator complexes move different proteins from the ER
membrane or lumen into the cytosol. A common feature is that they each contain an
E3 ubiquitin ligase enzyme, which attaches polyubiquitin tags to the unfolded
proteins as they emerge into the cytosol, marking them for destruction. Fueled
by the energy derived from ATP hydrolysis, a hexameric ATPase of the family of
AAA-ATPases (see Figure 6-85) pulls the unfolded protein through the
translocator into the cytosol. An N-glycanase removes its oligosaccharide chains en bloc.
Guided by its ubiquitin tag, the deglycosylated polypeptide is rapidly fed into
proteasomes, where it is degraded (Figure 12-50).

Misfolded Proteins in the ER Activate an Unfolded Protein 
Response 

Cells carefully monitor the amount of misfolded protein in various
compartments. An accumulation of misfolded proteins in the cytosol, for example, triggers
a heat-shock response (discussed in Chapter 6), which stimulates the transcription
of genes encoding cytosolic chaperones that help to refold the proteins. Similarly,
an accumulation of misfolded proteins in the ER triggers an unfolded protein
response, which includes an increased transcription of genes encoding proteins
involved in retrotranslocation and protein degradation in the cytosol, ER
chaperones, and many other proteins that help to increase the protein-folding capacity
of the ER.

Figure 12-50 The export and degradation of misfolded ER proteins.
Misfolded soluble proteins in the ER lumen are recognized and targeted to a
translocator complex in the ER membrane. They first interact in the ER lumen with
chaperones, disulfide isomerases, and lectins. They are then exported into
the cytosol through the translocator. In the cytosol, they are ubiquitylated,
deglycosylated, and degraded in proteasomes. Misfolded membrane
proteins follow a similar pathway but use a different translocator.

How do misfolded proteins in the ER signal to the nucleus? There are three
parallel pathways that execute the unfolded protein response (Figure 12-51A). The
first pathway, which was initially discovered in yeast cells, is particularly
remarkable. Misfolded proteins in the ER activate a transmembrane protein kinase in the
ER, called IRE1, which causes the kinase to oligomerize and phosphorylate itself.
(Some cell-surface receptor kinases in the plasma membrane are activated in a
similar way, as discussed in Chapter 15.) The oligomerization and
autophosphorylation of IRE1 activate an endoribonuclease domain in the cytosolic portion
of the same molecule, which cleaves a specific cytosolic mRNA molecule at two
positions, excising an intron. (This is a unique exception to the rule that introns
are spliced out while the RNA is still in the nucleus.) The separated exons are then
joined by an RNA ligase, generating a spliced mRNA, which is translated to
produce an active transcription regulatory protein. This protein activates the
transcription of genes encoding the proteins that help mediate the unfolded protein
response (Figure 12-51B).

Figure 12-51 The unfolded protein response. (A) By three parallel intracellular
signaling pathways, the accumulation of misfolded proteins in the ER lumen signals
to the nucleus to activate the transcription of genes that encode proteins that help
the cell cope with misfolded proteins in the ER. (B) Regulated RNA splicing is a
key regulatory switch in pathway 1 of the unfolded protein response (Movie 12.6).
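
The IRE1 splicing step (cleave the mRNA at two positions, discard the intron, ligate the exons) maps directly onto string slicing. The Python sketch below shows only that bookkeeping. The function name `ire1_splice`, the cut positions, and the short sequence are invented for illustration and do not correspond to any real mRNA.

```python
def ire1_splice(mrna, cut1, cut2):
    """Excise the segment between two cleavage positions and join the exons.

    cut1 and cut2 are 0-based positions; mrna[cut1:cut2] is the intron.
    Returns (spliced_mrna, intron).
    """
    if not 0 <= cut1 <= cut2 <= len(mrna):
        raise ValueError("cut positions must lie within the mRNA, cut1 <= cut2")
    exon1, intron, exon2 = mrna[:cut1], mrna[cut1:cut2], mrna[cut2:]
    return exon1 + exon2, intron   # the join stands in for the RNA ligase


# Made-up sequence: upper case marks exons, lower case marks the intron.
spliced, intron = ire1_splice("AUGGCCuuuuGGAUAA", 6, 10)
print(spliced)  # prints AUGGCCGGAUAA
print(intron)   # prints uuuu
```

Removing the intron changes what follows the first exon, which is how the spliced mRNA can encode an active regulator that the unspliced form does not.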

Misfolded proteins also activate a second transmembrane kinase in the ER,
PERK, which inhibits a translation initiation factor by phosphorylating it, thereby
reducing the production of new proteins throughout the cell. One consequence
of the reduction in protein synthesis is to reduce the flux of proteins into the ER,
thereby reducing the load of proteins that need to be folded there. Some
proteins, however, are preferentially translated when translation initiation factors are
scarce (discussed in Chapter 7, p. 424), and one of these is a transcription
regulator that helps activate the transcription of the genes encoding proteins active in
the unfolded protein response.

Finally, a third transcription regulator, ATF6, is initially synthesized as a
transmembrane ER protein. Because it is embedded in the ER membrane, it cannot
activate the transcription of genes in the nucleus. When misfolded proteins
accumulate in the ER, however, the ATF6 protein is transported to the Golgi
apparatus, where it encounters proteases that cleave off its cytosolic domain, which can
now migrate to the nucleus and help activate the transcription of genes encoding
proteins involved in the unfolded protein response. (This mechanism is similar
to that described in Figure 12-16 for activation of the transcription regulator that
controls cholesterol biosynthesis.) The relative importance of each of these three
pathways in the unfolded protein response differs in different cell types, enabling
each cell type to tailor the unfolded protein response to its particular needs.

Some Membrane Proteins Acquire a Covalently Attached 
Glycosylphosphatidylinositol (GPI) Anchor 

As discussed in Chapter 10, several cytosolic enzymes catalyze the covalent
addition of a single fatty acid chain or prenyl group to selected proteins. The attached
lipids help direct and attach these proteins to cell membranes. A related process
is catalyzed by ER enzymes that covalently attach a
glycosylphosphatidylinositol (GPI) anchor to the C-terminus of some membrane proteins destined for the
plasma membrane. This linkage forms in the lumen of the ER, where, at the same
time, the transmembrane segment of the protein is cleaved off (Figure 12-52). A
large number of plasma membrane proteins are modified in this way. Since they
are attached to the exterior of the plasma membrane only by their GPI anchors,
they can in principle be released from cells in soluble form in response to signals
that activate a specific phospholipase in the plasma membrane. Trypanosome
parasites, for example, use this mechanism to shed their coat of GPI-anchored
surface proteins when attacked by the immune system. GPI anchors may also be
used to direct plasma membrane proteins into lipid rafts and thus segregate the
proteins from other membrane proteins (see Figure 10-13).

Figure 12-52 The attachment of a GPI anchor to a protein in the ER.
GPI-anchored proteins are targeted to the ER membrane by an N-terminal signal
sequence (not shown), which is removed (see Figure 12-42). Immediately after
the completion of protein synthesis, the precursor protein remains anchored in the
ER membrane by a hydrophobic C-terminal sequence of 15-20 amino acids; the rest of
the protein is in the ER lumen. Within less than a minute, an enzyme in the ER cuts
the protein free from its membrane-bound C-terminus and simultaneously attaches
the new C-terminus to an amino group on a preassembled GPI intermediate. The
sugar chain contains an inositol attached to the lipid from which the GPI anchor derives
its name. It is followed by a glucosamine and three mannoses. The terminal
mannose links to a phosphoethanolamine that provides the amino group to attach
the protein. The signal that specifies this modification is contained within the
hydrophobic C-terminal sequence and a few amino acids adjacent to it on the
lumenal side of the ER membrane; if this signal is added to other proteins, they too
become modified in this way. Because of the covalently linked lipid anchor, the
protein remains membrane-bound, with all of its amino acids exposed initially on the
lumenal side of the ER and eventually on the exterior of the plasma membrane.


Figure 12-53 The synthesis of 
phosphatidylcholine. As illustrated, this 
phospholipid is synthesized from glycerol 
3-phosphate, cytidine-diphosphocholine 
(CDP-choline), and fatty acids delivered 
to the ER by a cytosolic fatty acid binding 
protein. 


The ER Assembles Most Lipid Bilayers 

The ER membrane is the site of synthesis of nearly all of the cell's major classes
of lipids, including both phospholipids and cholesterol, required for the
production of new cell membranes. The major phospholipid made is
phosphatidylcholine, which can be formed in three steps from choline, two fatty acids, and glycerol
phosphate (Figure 12-53). Each step is catalyzed by enzymes in the ER
membrane, which have their active sites facing the cytosol, where all of the required
metabolites are found. Thus, phospholipid synthesis occurs exclusively in the
cytosolic leaflet of the ER membrane. Because fatty acids are not soluble in water,
they are shepherded from their sites of synthesis to the ER by a fatty acid binding
protein in the cytosol. After arrival in the ER membrane and activation with CoA,
acyl transferases successively add two fatty acids to glycerol phosphate to produce
phosphatidic acid. Phosphatidic acid is sufficiently water-insoluble to remain in
the lipid bilayer; it cannot be extracted from the bilayer by the fatty acid binding
proteins. It is therefore this first step that enlarges the ER lipid bilayer. The later
steps determine the head group of a newly formed lipid molecule and therefore
the chemical nature of the bilayer, but they do not result in net membrane growth.
The two other major membrane phospholipids, phosphatidylethanolamine and
phosphatidylserine (see Figure 10-3), as well as the minor phospholipid
phosphatidylinositol (PI), are all synthesized in this way.

Because phospholipid synthesis takes place in the cytosolic leaflet of the ER
lipid bilayer, there needs to be a mechanism that transfers some of the newly
formed phospholipid molecules to the lumenal leaflet of the bilayer. In synthetic
lipid bilayers, lipids do not “flip-flop” in this way (see Figure 10-10). In the ER,
however, phospholipids equilibrate across the membrane within minutes, which
is almost 100,000 times faster than can be accounted for by spontaneous
“flip-flop.” This rapid trans-bilayer movement is mediated by a poorly characterized
phospholipid translocator called a scramblase, which nonselectively equilibrates
phospholipids between the two leaflets of the lipid bilayer (Figure 12-54). Thus,
the different types of phospholipids are thought to be equally distributed between
the two leaflets of the ER membrane.

Figure 12-54 The role of phospholipid translocators in lipid bilayer synthesis.
(A) Because new lipid molecules are added only to the cytosolic half of the ER
membrane bilayer and lipid molecules do not flip spontaneously from one
monolayer to the other, a transmembrane phospholipid translocator (called a
scramblase) is required to transfer lipid molecules from the cytosolic half to
the lumenal half so that the membrane grows as a bilayer. The scramblase is not
specific for particular phospholipid head groups and therefore equilibrates the
different phospholipids between the two monolayers. (B) Fueled by ATP hydrolysis, a
head-group-specific flippase in the plasma membrane actively flips phosphatidylserine
and phosphatidylethanolamine directionally from the extracellular to the cytosolic leaflet,
creating the characteristically asymmetric lipid bilayer of the plasma membrane of
animal cells (see Figure 10-15).

The plasma membrane contains a different type of phospholipid translocator
that belongs to the family of P-type pumps (discussed in Chapter 11). These
flippases specifically recognize those phospholipids that contain free amino groups
in their head groups (phosphatidylserine and phosphatidylethanolamine; see
Figure 10-3) and transfer them from the extracellular to the cytosolic leaflet,
using the energy of ATP hydrolysis. The plasma membrane therefore has a highly
asymmetric phospholipid composition, which is actively maintained by the
flippases (see Figure 10-15). The plasma membrane also contains a scramblase but,
in contrast to the ER scramblase, which is always active, the plasma membrane
enzyme is regulated and only activated in some situations, such as in apoptosis
and in activated platelets, where it acts to abolish the lipid bilayer asymmetry; the
resulting exposure of phosphatidylserine on the surface of apoptotic cells serves
as a signal for phagocytic cells to ingest and degrade the dead cell.
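
The contrast between a nonselective scramblase and a directional flippase can be illustrated with a two-compartment toy model. The Python sketch below tracks the fraction of one lipid species (phosphatidylserine, say) in the cytosolic leaflet. Everything quantitative here is an assumption for illustration: the first-order rate laws, the rate values, the time step, and the function name `cytosolic_fraction` are all hypothetical.

```python
def cytosolic_fraction(steps, scramblase_rate=0.0, flippase_rate=0.0,
                       start=0.5, dt=0.01):
    """Fraction of a lipid species in the cytosolic leaflet after `steps` steps.

    Simple Euler integration of two first-order fluxes:
    - scramblase: bidirectional and nonselective, so its net flux pushes
      the two leaflets toward equal amounts;
    - flippase: ATP-driven and one-way, from the noncytosolic (extracellular
      or lumenal) leaflet to the cytosolic leaflet.
    """
    cyto = start
    for _ in range(steps):
        other = 1.0 - cyto
        scramble = scramblase_rate * (other - cyto)
        flip = flippase_rate * other
        cyto += dt * (scramble + flip)
    return cyto


# ER-like: lipid made in the cytosolic leaflet, scramblase only.
print(cytosolic_fraction(2000, scramblase_rate=1.0, start=1.0))
# Plasma-membrane-like: flippase only.
print(cytosolic_fraction(2000, flippase_rate=1.0))
# Scramblase switched on as well, as described for apoptotic cells.
print(cytosolic_fraction(2000, scramblase_rate=1.0, flippase_rate=1.0))
```

With the scramblase alone the species ends up evenly split, with the flippase alone it accumulates in the cytosolic leaflet, and with both active the asymmetry is reduced, leaving part of the lipid exposed on the outer leaflet.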

The ER also produces cholesterol and ceramide (Figure 12-55). Ceramide is
made by condensing the amino acid serine with a fatty acid to form the amino
alcohol sphingosine (see Figure 10-3); a second fatty acid is then covalently added
to form ceramide. The ceramide is exported to the Golgi apparatus, where it serves
as a precursor for the synthesis of two types of lipids: oligosaccharide chains are
added to form glycosphingolipids (glycolipids; see Figure 10-16), and
phosphocholine head groups are transferred from phosphatidylcholine to other ceramide
molecules to form sphingomyelin (discussed in Chapter 10). Thus, both
glycolipids and sphingomyelin are produced relatively late in the process of membrane
synthesis. Because they are produced by enzymes that have their active sites
exposed to the Golgi lumen, they are found exclusively in the noncytosolic leaflet
of the lipid bilayers that contain them.

As discussed in Chapter 13, the plasma membrane and the membranes of the
Golgi apparatus, lysosomes, and endosomes all form part of a membrane system
that communicates with the ER by means of transport vesicles, which transfer
both proteins and lipids. Mitochondria and plastids, however, do not belong to
this system, and they therefore require different mechanisms to import proteins
and lipids for growth. We have already seen that they import most of their proteins
from the cytosol. Although mitochondria modify some of the lipids they import,
they do not synthesize lipids de novo; instead, their lipids have to be imported
from the ER, either directly or indirectly by way of other cell membranes. In either
case, special mechanisms are required for the transfer.

The details of how lipid distribution between different membranes is catalyzed 
and regulated are not known. Water-soluble carrier proteins called phospholipid 
exchange proteins (or phospholipid transfer proteins) are thought to transfer 
individual phospholipid molecules between membranes, functioning much like fatty 
acid binding proteins that shepherd fatty acids through the cytosol (see Figure 
12-54). In addition, mitochondria are often seen in close juxtaposition to ER 
membranes in electron micrographs, and specific junction complexes have been 
identified that hold the ER and outer mitochondrial membrane in close proximity. 
It is thought that these junction complexes provide specific contact-dependent 
lipid transfer mechanisms that operate between these adjacent membranes. 

Summary 

The extensive ER network serves as a factory for the production of almost all of the 
cell's lipids. In addition, a major portion of the cell's protein synthesis occurs on the 
cytosolic surface of the rough ER: virtually all proteins destined for secretion or for 
the ER itself, the Golgi apparatus, the lysosomes, the endosomes, and the plasma 
membrane are first imported into the ER from the cytosol. In the ER lumen, the 
proteins fold and oligomerize, disulfide bonds are formed, and N-linked 
oligosaccharides are added. The pattern of N-linked glycosylation is used to indicate the 
extent of protein folding, so that proteins leave the ER only when they are properly 
folded. Proteins that do not fold or oligomerize correctly are translocated back 
into the cytosol, where they are deglycosylated, polyubiquitylated, and degraded in 
proteasomes. If misfolded proteins accumulate in excess in the ER, they trigger an 
unfolded protein response, which activates appropriate genes in the nucleus to help 
the ER cope. 

Only proteins that carry a special ER signal sequence are imported into the ER. 
The signal sequence is recognized by a signal-recognition particle (SRP), which 
binds both the growing polypeptide chain and the ribosome and directs them to a 
receptor protein on the cytosolic surface of the rough ER membrane. This binding 
to the ER membrane initiates the translocation process that threads a loop of 
polypeptide chain across the ER membrane through the hydrophilic pore of a protein 
translocator. 

Soluble proteins (destined for the ER lumen, for secretion, or for transfer to the 
lumen of other organelles) pass completely into the ER lumen. Transmembrane 
proteins destined for the ER or for other cell membranes are translocated part 
way across the ER membrane and remain anchored there by one or more 
membrane-spanning α-helical segments in their polypeptide chains. These hydrophobic 
portions of the protein can act either as start-transfer or stop-transfer signals 
during the translocation process. When a polypeptide contains multiple, alternating 
start-transfer and stop-transfer signals, it will pass back and forth across the 
bilayer multiple times as a multipass transmembrane protein. 

The asymmetry of protein insertion and glycosylation in the ER establishes the 
sidedness of the membranes of all the other organelles that the ER supplies with 
membrane proteins. 


WHAT WE DON'T KNOW 


• How do nuclear import receptors 
negotiate the tangled gel-like interior of 
a nuclear pore complex so efficiently? 

• Is the nuclear pore complex a 
rigid structure or can it expand and 
contract, depending on the cargo 
transported? 

• Sequence comparisons show that 
signal sequences for an individual 
protein such as insulin are quite 
conserved across species, much 
more so than would be expected 
from our current understanding that 
all that matters for their function are 
general structural features such as 
hydrophobicity. What other functions 
might signal sequences have that 
could account for their evolutionary 
sequence conservation? 

• How are polyribosomes on the 
endoplasmic reticulum membrane 
arranged so that the next initiating 
ribosome will find an unoccupied 
translocator? 

• Why does the signal-recognition 
particle have an indispensable RNA 
subunit? 

PROBLEMS 

Which statements are true? Explain why or why not. 

12-1 Like the lumen of the ER, the interior of the nucleus 
is topologically equivalent to the outside of the cell. 

12-2 ER-bound and free ribosomes, which are structur- 
ally and functionally identical, differ only in the proteins 
they happen to be making at a particular time. 

12-3 To avoid the inevitable collisions that would occur 
if two-way traffic through a single pore were allowed, 
nuclear pore complexes are specialized so that some 
mediate import while others mediate export. 

12-4 Peroxisomes are found in only a few specialized 
types of eukaryotic cell. 

Discuss the following problems. 

12-5 What is the fate of a protein with no sorting signal? 

12-6 The rough ER is the site of synthesis of many classes 
of membrane proteins. Some of these proteins remain in 
the ER, whereas others are sorted to compartments such 
as the Golgi apparatus, lysosomes, and the plasma 
membrane. One measure of the difficulty of the sorting 
problem is the degree of "purification" that must be achieved 
during transport from the ER. Are proteins bound for the 
plasma membrane common or rare among all ER 
membrane proteins? 

A few simple considerations allow one to answer 
this question. In a typical growing cell that is dividing once 
every 24 hours, the equivalent of one new plasma 
membrane must transit the ER every day. If the ER membrane 
is 20 times the area of a plasma membrane, what is the 
ratio of plasma membrane proteins to other membrane 
proteins in the ER? (Assume that all proteins on their way 
to the plasma membrane remain in the ER for 30 minutes 
on average before exiting, and that the ratio of proteins to 
lipids in the ER and plasma membranes is the same.) 
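
One way to set up the estimate asked for in Problem 12-6 is sketched below in Python. It is a back-of-the-envelope calculation rather than a definitive answer: the function name and parameter names are illustrative, and the sketch assumes, as the problem states, a steady flow of one plasma-membrane equivalent through the ER every 24 hours, a 30-minute average residence time in the ER, and the same protein-to-lipid ratio in both membranes.

```python
def pm_to_other_ratio(division_time_h=24.0, er_residence_h=0.5, er_to_pm_area=20.0):
    """Ratio of plasma-membrane-bound proteins to all other membrane
    proteins in the ER at steady state (areas in plasma-membrane equivalents)."""
    # Amount of plasma-membrane protein in transit through the ER at any instant,
    # expressed as a fraction of one plasma membrane
    in_transit = er_residence_h / division_time_h
    # The rest of the ER membrane holds all other membrane proteins
    other = er_to_pm_area - in_transit
    return in_transit / other


ratio = pm_to_other_ratio()
print(f"about 1 plasma membrane protein per {1 / ratio:.0f} other ER membrane proteins")
```

Under these assumptions, proteins in transit to the plasma membrane make up roughly one part in a thousand of the ER membrane proteins, which gives a sense of how much purification the sorting process must achieve.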

12-7 Before nuclear pore complexes were well understood, 
it was unclear whether nuclear proteins diffused 
passively into the nucleus and accumulated there by binding 
to residents of the nucleus such as chromosomes, or 
whether they were actively imported and accumulated 
regardless of their affinity for nuclear components. 

A classic experiment that addressed this problem 
used several forms of radioactive nucleoplasmin, 
which is a large pentameric protein involved in chromatin 
assembly. In this experiment, either the intact protein or 
the nucleoplasmin heads, tails, or heads with a single tail 
were injected into the cytoplasm of a frog oocyte or into 
the nucleus (Figure Q12-1). All forms of nucleoplasmin, 
except heads, accumulated in the nucleus when injected 
into the cytoplasm, and all forms were retained in the 
nucleus when injected there. 

A. What portion of the nucleoplasmin molecule is 
responsible for localization in the nucleus? 


Figure Q12-1 Cellular location of injected nucleoplasmin and nucleoplasmin 
components (Problem 12-7). Schematic diagrams of autoradiographs show the 
cytoplasm and nucleus with the location of nucleoplasmin indicated by the red areas. 


B. How do these experiments distinguish between 
active transport, in which a nuclear localization signal 
triggers transport by the nuclear pore complex, and passive 
diffusion, in which a binding site for a nuclear component 
allows accumulation in the nucleus? 

12-8 Assuming that 32 million histone octamers are 
required to package the human genome, how many 
histone molecules must be transported per second per 
nuclear pore complex in cells whose nuclei contain 3000 
nuclear pores and are dividing once per day? 
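
The arithmetic behind Problem 12-8 can be sketched in a few lines of Python. The function and parameter names are illustrative, and the sketch makes one simplifying assumption that is not stated in the problem: histone import is spread evenly over the entire 24-hour division cycle. If import were concentrated in part of the cycle, the required rate per pore would be correspondingly higher.

```python
def histones_per_second_per_pore(octamers=32e6, histones_per_octamer=8,
                                 pores=3000, division_time_s=24 * 60 * 60):
    """Average import rate of histone molecules through one nuclear pore."""
    # Each octamer contains eight individual histone molecules
    total_histones = octamers * histones_per_octamer
    # Spread the total evenly over one division cycle and over all pores
    return total_histones / (division_time_s * pores)


rate = histones_per_second_per_pore()
print(f"{rate:.2f} histone molecules per second per pore")
```

With the numbers given in the problem, this works out to about one histone molecule per second through each nuclear pore complex.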

12-9 The nuclear pore complex (NPC) creates a barrier 
to the free exchange of molecules between the nucleus and 
cytosol, but in a way that remains mysterious. In yeast, for 
example, the central pore of the NPC has a diameter of 35 
nm and is 30 nm long, which is somewhat smaller than its 
vertebrate counterpart. Even so, it is large enough to 
accommodate virtually all components of the cytosol. Yet the pore 
allows passive diffusion of molecules only up to about 40 
kD; entry of anything larger requires help from a nuclear 
import receptor. Selective permeability is controlled by 
protein components of the NPC that have unstructured, polar 
tails extending into the central pore. These tails are 
characterized by periodic repeats of the hydrophobic amino acids 
phenylalanine (F) and glycine (G). 

At high enough concentration (~50 mM), the 
FG-repeat domains of these proteins can form a gel, with 
a meshwork of interactions between the hydrophobic FG 
repeats (Figure Q12-2A). These gels allow passive 
diffusion of small molecules, but they prevent entry of larger 
proteins such as the fluorescent protein mCherry fused 
to maltose binding protein (MBP) (Figure Q12-2B). (The 
fusion to MBP makes mCherry too large to enter the 
nucleus by passive diffusion.) However, if the nuclear 
import receptor, importin, is fused to a similar protein, 
MBP-GFP, the importin-MBP-GFP fusion readily enters 
the gel (Figure Q12-2B). 

complex. They were initially identified using a genetic 
trick. The yeast Ura3 gene, whose product is an enzyme 
that is normally located in the cytosol where it is essential 
for synthesis of uracil, was modified so that the protein 
carried an import signal for the mitochondrial matrix. 
A population of cells carrying the modified Ura3 gene in 
place of the normal gene was then grown in the absence 
of uracil. Most cells died, but the rare cells that grew were 
shown to be defective for mitochondrial import. Explain 
how this selection identifies cells with defects in 
components required for import into the mitochondrial matrix. 
Why don't normal cells with the modified Ura3 gene grow 
in the absence of uracil? Why do cells that are defective for 
mitochondrial import grow in the absence of uracil? 

12-11 If the enzyme dihydrofolate reductase (DHFR), 
which is normally located in the cytosol, is engineered to 
carry a mitochondrial targeting sequence at its N-terminus, 
it is efficiently imported into mitochondria. If the modified 
DHFR is first incubated with methotrexate, which binds 
tightly to the active site, the enzyme remains in the cytosol. 
How do you suppose that the binding of methotrexate 
interferes with mitochondrial import? 

12-12 Why do mitochondria need a special translocator 
to import proteins across the outer membrane, when the 
membrane already has large pores formed by porins? 


Figure Q12-2 FG-repeat gel and influx of proteins into the nucleus 
(Problem 12-9). (A) Cartoon of the meshwork (gel) formed by pairwise 
interactions between hydrophobic FG repeats. For FG-repeats 
separated by 17 amino acids, as is typical, the mesh formed by 
extended amino acid side chains would correspond to about 4 nm on 
a side, which would be large enough to account for the characteristic 
passive diffusion of proteins through nuclear pores. (B) Diffusion of 
MBP-mCherry and importin-MBP-GFP into a gel of FG-repeats. In each 
group, the solution is shown at left and the gel at right. The bright areas 
indicate regions that contain the fluorescent proteins. 

A. FG-repeats only form gels in vitro at relatively high 
concentration (50 mM). Is this concentration reasonable 
for FG repeats in the NPC core? In yeast, there are about 
5000 FG-repeats in each NPC. Given the dimensions of the 
yeast nuclear pore (35 nm diameter and 30 nm length), 
calculate the concentration of FG-repeats in the cylindrical 
volume of the pore. Is this concentration comparable 
to the one used in vitro? 

B. A second question is whether the diffusion of 
importin-MBP-GFP through the FG-repeat gel is fast 
enough to account for the efficient flow of materials 
between the nucléus and cytosol. From experiments of 
the type shown in Figure Q12-2B, the diffusion coefficient 
(D) of importin-MBP-GFP through the FG-repeat gel was 
determined to be about 0.1 µm²/s. The equation for diffusion 
is t = x²/2D, where t is time and x is distance. Calculate 
the time it would take importin-MBP-GFP to diffuse 
through a yeast nuclear pore (30 nm) if the pore consisted 
of a gel of FG-repeats. Does this time seem fast enough for 
the needs of a eukaryotic cell? 
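
Both calculations in Problem 12-9 reduce to unit conversions, sketched below in Python. The helper names and constants are illustrative rather than part of the problem. The sketch treats the pore as a simple cylinder, reads the diffusion coefficient as 0.1 µm²/s, and uses the one-dimensional relation t = x²/2D quoted in part B.

```python
import math

AVOGADRO = 6.022e23      # molecules per mole
LITERS_PER_NM3 = 1e-24   # 1 nm^3 = 1e-27 m^3 = 1e-24 L


def fg_repeat_concentration_mM(n_repeats=5000, diameter_nm=35.0, length_nm=30.0):
    """Concentration of FG-repeats in a cylindrical pore, in millimolar."""
    volume_nm3 = math.pi * (diameter_nm / 2) ** 2 * length_nm
    moles = n_repeats / AVOGADRO
    molar = moles / (volume_nm3 * LITERS_PER_NM3)
    return molar * 1000.0


def diffusion_time_s(distance_um=0.030, d_um2_per_s=0.1):
    """Time to diffuse a given distance, from t = x^2 / (2 D)."""
    return distance_um ** 2 / (2 * d_um2_per_s)


conc = fg_repeat_concentration_mM()
t = diffusion_time_s()
print(f"FG-repeat concentration in the pore: {conc:.0f} mM")
print(f"Diffusion time across 30 nm: {t * 1000:.1f} ms")
```

On these assumptions the FG-repeat concentration in the pore comes out at a few hundred millimolar, well above the roughly 50 mM needed for gel formation in vitro, and the transit time is a few milliseconds.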

12-10 Components of the TIM complexes, the multi- 
subunit protein translocators in the mitochondrial inner 
membrane, are much less abundant than those of the TOM 


12-13 Examine the multipass transmembrane protein 
shown in Figure Q12-3. What would you predict would 
be the effect of converting the first hydrophobic transmembrane 
segment to a hydrophilic segment? Sketch the 
arrangement of the modified protein in the ER membrane. 





Figure Q12-3 Arrangement of a multipass transmembrane protein 
in the ER membrane (Problem 12-13). Blue hexagons represent 
covalently attached oligosaccharides. The positions of positively and 
negatively charged amino acids flanking the second transmembrane 
segment are shown. 


12-14 All new phospholipids are added to the cytosolic 
leaflet of the ER membrane, yet the ER membrane has a 
symmetrical distribution of different phospholipids in its 
two leaflets. By contrast, the plasma membrane, which 
receives all its membrane components ultimately from the 
ER, has a very asymmetrical distribution of phospholipids 
in the two leaflets of its lipid bilayer. How is the symmetry 
generated in the ER membrane, and how is the asymmetry 
generated and maintained in the plasma membrane? 

Every cell must eat, communicate with the world around it, and quickly respond 
to changes in its environment. To help accomplish these tasks, cells continually 
adjust the composition of their plasma membrane and internal compartments in 
rapid response to need. They use an elaborate internal membrane system to add 
and remove cell-surface proteins, such as receptors, ion channels, and 
transporters (Figure 13-1). Through the process of exocytosis, the secretory pathway 
delivers newly synthesized proteins, carbohydrates, and lipids either to the plasma 
membrane or the extracellular space. By the converse process of endocytosis, cells 
remove plasma membrane components and deliver them to internal 
compartments called endosomes, from where they can be recycled to the same or different 
regions of the plasma membrane or be delivered to lysosomes for degradation. 
Cells also use endocytosis to capture important nutrients, such as vitamins, 
cholesterol, and iron; these are taken up together with the macromolecules to which 
they bind and are then moved on to endosomes and lysosomes, from where they 
can be transported into the cytosol for use in various biosynthetic processes. 

The interior space, or lumen, of each membrane-enclosed compartment 
along the secretory and endocytic pathways is equivalent to the lumen of most 
other membrane-enclosed compartments and to the cell exterior, in the sense 
that proteins can travel in this space without having to cross a membrane as 
they are passed from one compartment to another by means of numerous 
membrane-enclosed transport containers. These containers are formed from the donor 
compartment and are either small, spherical vesicles, larger irregular vesicles, or 
tubules. We shall use the term transport vesicle to apply to all forms of these 
containers. 

Within a eukaryotic cell, transport vesicles continually bud off from one 
membrane and fuse with another, carrying membrane components and soluble 
lumenal molecules, which are referred to as cargo (Figure 13-2). This vesicular traffic 
flows along highly organized, directional routes, which allow the cell to secrete, 
eat, and remodel its plasma membrane and organelles. The secretory pathway 
leads outward from the endoplasmic reticulum (ER) toward the Golgi apparatus 
and cell surface, with a side route leading to lysosomes, while the endocytic 
pathway leads inward from the plasma membrane. In each case, retrieval pathways 


IN THIS CHAPTER 

THE MOLECULAR MECHANISMS 
OF MEMBRANE TRANSPORT 
AND THE MAINTENANCE OF 
COMPARTMENTAL DIVERSITY 

Figure 13-1 Exocytosis and endocytosis. (A) In exocytosis, a transport 
vesicle fuses with the plasma membrane. Its content is released into the 
extracellular space, while the vesicle membrane (red) becomes continuous with 
the plasma membrane. (B) In endocytosis, a plasma membrane patch (red) is 
internalized, forming a transport vesicle. Its content derives from the 
extracellular space. 

balance the flow of membrane between compartments in the opposite direction, 
bringing membrane and selected proteins back to the compartment of origin 
(Figure 13-3). 

To perform its function, each transport vesicle that buds from a compartment 
must be selective. It must take up only the appropriate molecules and must fuse 
only with the appropriate target membrane. A vesicle carrying cargo from the ER 
to the Golgi apparatus, for example, must exclude most proteins that are to stay 
in the ER, and it must fuse only with the Golgi apparatus and not with any other 
organelle. 

We begin this chapter by considering the molecular mechanisms of budding 
and fusion that underlie all vesicle transport. We then discuss the fundamental 
problem of how, in the face of this transport, the cell maintains the molecular and 


Figure 13-2 Vesicle transport. Transport 
vesicles bud off from one compartment 
and fuse with another. As they do so, they 
carry material as cargo from the lumen 
(the space within a membrane-enclosed 
compartment) and membrane of the donor 
compartment to the lumen and membrane 
of the target compartment, as shown. 



Figure 13-3 A “road-map” of the secretory and endocytic pathways. (A) In this schematic roadmap, which was introduced in Chapter 12, the 
endocytic and secretory pathways are illustrated with green and red arrows, respectively. In addition, blue arrows denote retrieval pathways for the 
backflow of selected components. (B) The compartments of the eukaryotic cell involved in vesicle transport. The lumen of each membrane-enclosed 
compartment is topologically equivalent to the outside of the cell. All compartments shown communicate with one another and the outside of the 
cell by means of transport vesicles. In the secretory pathway (red arrows), protein molecules are transported from the ER to the plasma membrane 
or (via endosomes) to lysosomes. In the endocytic pathway (green arrows), molecules are ingested in endocytic vesicles derived from the plasma 
membrane and delivered to early endosomes and then (via late endosomes) to lysosomes. Many endocytosed molecules are retrieved from early 
endosomes and returned (some via recycling endosomes) to the cell surface for reuse; similarly, some molecules are retrieved from the early and 
late endosomes and returned to the Golgi apparatus, and some are retrieved from the Golgi apparatus and returned to the ER. All of these retrieval 
pathways are shown with blue arrows, as in part (A). 

functional differences between its compartments. Finally, we consider the 
function of the Golgi apparatus, lysosomes, secretory vesicles, and endosomes, as we 
trace the pathways that connect these organelles. 

THE MOLECULAR MECHANISMS OF MEMBRANE 
TRANSPORT AND THE MAINTENANCE OF 
COMPARTMENTAL DIVERSITY 

Vesicle transport mediates a continuous exchange of components between the ten 
or more chemically distinct, membrane-enclosed compartments that collectively 
comprise the secretory and endocytic pathways. With this massive exchange, how 
can each compartment maintain its special identity? To answer this question, we 
must first consider what defines the character of a compartment. Above all, it is 
the composition of the enclosing membrane: molecular markers displayed on the 
cytosolic surface of the membrane serve as guidance cues for incoming traffic to 
ensure that transport vesicles fuse only with the correct compartment. Many of 
these membrane markers, however, are found on more than one compartment, 
and it is the specific combination of marker molecules that gives each 
compartment its molecular address. 

How are these membrane markers kept at high concentration on one 
compartment and at low concentration on another? To answer this question, we need to 
consider how patches of membrane, enriched or depleted in specific membrane 
components, bud off from one compartment and transfer to another. 

We begin by discussing how cells segregate proteins into separate membrane 
domains by assembling a special protein coat on the membrane's cytosolic face. 
We consider how coats form, what they are made of, and how they are used to 
extract specific cargo components from a membrane and compartment lumen for 
delivery to another compartment. Finally, we discuss how transport vesicles dock 
at the appropriate target membrane and then fuse with it to deliver their cargo. 

There Are Various Types of Coated Vesicles 

Most transport vesicles form from specialized, coated regions of membranes. 
They bud off as coated vesicles, which have a distinctive cage of proteins covering 
their cytosolic surface. Before the vesicles fuse with a target membrane, they 
discard their coat, as is required for the two cytosolic membrane surfaces to interact 
directly and fuse. 

The coat performs two main functions that are reflected in a common two-layered 
structure. First, an inner coat layer concentrates specific membrane proteins 
in a specialized patch, which then gives rise to the vesicle membrane. In this way, 
the inner layer selects the appropriate membrane molecules for transport. 
Second, an outer coat layer assembles into a curved, basketlike lattice that deforms 
the membrane patch and thereby shapes the vesicle. 

There are three well-characterized types of coated vesicles, distinguished by 
their major coat proteins: clathrin-coated, COPI-coated, and COPII-coated 
(Figure 13-4). Each type is used for different transport steps. Clathrin-coated vesicles, 
for example, mediate transport from the Golgi apparatus and from the plasma 
membrane, whereas COPI- and COPII-coated vesicles most commonly mediate 
transport from the ER and from the Golgi cisternae (Figure 13-5). There is, 
however, much more variety in coated vesicles and their functions than this short list 
suggests. As we discuss below, there are several types of clathrin-coated vesicles, 
each specialized for a different transport step, and the COPI- and COPII-coated 
vesicles may be similarly diverse. 

The Assembly of a Clathrin Coat Drives Vesicle Formation 

Clathrin-coated vesicles, the first coated vesicles to be identified, transport 
material from the plasma membrane and between endosomal and Golgi 
compartments. COPI-coated vesicles and COPII-coated vesicles transport material early 

Figure 13-4 Electron micrographs of clathrin-coated, COPI-coated, and 
COPII-coated vesicles. All are shown in electron micrographs at the same scale. 
(A) Clathrin-coated vesicles. (B) COPI-coated vesicles and Golgi cisternae (red 
arrows) from a cell-free system in which COPI-coated vesicles bud in the test tube. 
(C) COPII-coated vesicles. (A and B, from L. Orci, B. Glick and J. Rothman, Cell 
46:171-184, 1986. With permission from Elsevier; C, courtesy of Charles Barlowe 
and Lelio Orci.) 


in the secretory pathway: COPI-coated vesicles bud from Golgi compartments, 
and COPII-coated vesicles bud from the ER (see Figure 13-5). We discuss 
clathrin-coated vesicles first, as they provide a good example of how vesicles form. 

The major protein component of clathrin-coated vesicles is clathrin itself, 
which forms the outer layer of the coat. Each clathrin subunit consists of three 
large and three small polypeptide chains that together form a three-legged 
structure called a triskelion (Figure 13-6A,B). Clathrin triskelions assemble into a 
basketlike framework of hexagons and pentagons to form coated pits (buds) on the 
cytosolic surface of membranes (Figure 13-7). Under appropriate conditions, 
isolated triskelions spontaneously self-assemble into typical polyhedral cages 
in a test tube, even in the absence of the membrane vesicles that these baskets 
normally enclose (Figure 13-6C,D). Thus, the clathrin triskelions determine the 
geometry of the clathrin cage (Figure 13-6E). 

Adaptor Proteins Select Cargo into Clathrin-Coated Vesicles 

Adaptor proteins, another major coat component in clathrin-coated vesicles, 
form a discrete inner layer of the coat, positioned between the clathrin cage and 
the membrane. They bind the clathrin coat to the membrane and trap various 
transmembrane proteins, including transmembrane receptors that capture 
soluble cargo molecules inside the vesicle (so-called cargo receptors). In this way, the 
adaptor proteins select a specific set of transmembrane proteins, together with 
the soluble proteins that interact with them, and package them into each newly 
formed clathrin-coated transport vesicle (Figure 13-8). 



Figure 13-5 Use of different coats for 
different steps in vesicle traffic. Different 
coat proteins select different cargo and 
shape the transport vesicles that mediate the 
various steps in the secretory and endocytic 
pathways. When the same coats function 
in different places in the cell, they usually 
incorporate different coat protein subunits 
that modify their properties (not shown). Many 
differentiated cells have additional pathways 
beside those shown here, including a sorting 
pathway from the trans Golgi network to 
the apical surface of epithelial cells and a 
specialized recycling pathway for proteins 
of synaptic vesicles in the nerve terminals of 
neurons (see Figure 11-36). The arrows are 
colored as in Figure 13-3. 

Figure 13-6 The structure of a clathrin coat. (A) Electron micrograph of a clathrin triskelion shadowed with platinum. 
(B) Each triskelion is composed of three clathrin heavy chains and three clathrin light chains, as shown in the diagram. (C and 
D) A cryoelectron micrograph taken of a clathrin coat composed of 36 triskelions organized in a network of 12 pentagons 
and 8 hexagons, with some heavy chains (C) and light chains (D) highlighted (Movie 13.1). The light chains link to the actin 
cytoskeleton, which helps generate force for membrane budding and vesicle movement, and their phosphorylation regulates 
clathrin coat assembly. The interwoven legs of the clathrin triskelions form an outer shell from which the N-terminal domains 
of the triskelions protrude inward. These domains bind to the adaptor proteins shown in Figure 13-8. The coat shown was 
assembled biochemically from pure clathrin triskelions and is too small to enclose a membrane vesicle. (E) Images of clathrin- 
coated vesicles isolated from bovine brain. The clathrin coats are constructed in a similar but less regular way, from pentagons, 
a larger number of hexagons, and sometimes heptagons, resembling the architecture of deformed soccer balls. The structures 
were determined by cryoelectron microscopy and tomographic reconstruction. (A, from E. Ungewickell and D. Branton, Nature 
289:420-422, 1981; C and D, from A. Fotin et al., Nature 432:573-579, 2004. All with permission from Macmillan Publishers 
Ltd; E, from Y. Cheng et al., J. Mol. Biol. 365:892-899, 2007. With permission from Elsevier.) 


There are several types of adaptor proteins. The best characterized have four 
different protein subunits; others are single-chain proteins. Each type of adaptor 
protein is specific for a different set of cargo receptors. Clathrin-coated vesicles 
budding from different membranes use different adaptor proteins and thus 
package different receptors and cargo molecules. 

The assembly of adaptor proteins on the membrane is tightly controlled, in 
part by the cooperative interaction of the adaptor proteins with other 
components of the coat. The adaptor protein AP2 serves as a well-understood example. 
When it binds to a specific phosphorylated phosphatidylinositol lipid (a 
phosphoinositide), it alters its conformation, exposing binding sites for cargo receptors in 
the membrane. The simultaneous binding to the cargo receptors and lipid head 
groups greatly enhances the binding of AP2 to the membrane (Figure 13-9). 

Because several requirements must be met simultaneously to stably bind AP2 
proteins to a membrane, the proteins act as coincidence detectors that only 
assemble at the right time and place. Upon binding, they induce membrane curvature, 
which makes the binding of additional AP2 proteins in its proximity more likely. 
The cooperative assembly of the AP2 coat layer then is further amplified by 
clathrin binding, which leads to the formation and budding of a transport vesicle. 

Adaptor proteins found in other coats also bind to phosphoinositides, which 
not only have a major role in directing when and where coats assemble in the cell, 
but also are used much more widely as molecular markers of compartment 
identity. This helps to control vesicular traffic, as we now discuss. 

Figure 13-7 Clathrin-coated pits and vesicles. This rapid-freeze, deep-etch 
electron micrograph shows numerous clathrin-coated pits and vesicles 
on the inner surface of the plasma membrane of cultured fibroblasts. The 
cells were rapidly frozen in liquid helium, fractured, and deep-etched to 
expose the cytoplasmic surface of the plasma membrane. (Courtesy of John 
Heuser.) 

Phosphoinositides Mark Organelles and Membrane Domains 

Although inositol phospholipids typically comprise less than 10% of the total 
phospholipids in a membrane, they have important regulatory functions. They 
can undergo rapid cycles of phosphorylation and dephosphorylation at the 3', 4', 
and 5' positions of their inositol sugar head groups to produce various types of 
phosphoinositides (phosphatidylinositol phosphates, or PIPs). The 
interconversion of phosphatidylinositol (PI) and PIPs is highly compartmentalized: different 
organelles in the endocytic and secretory pathways have distinct sets of PI and PIP 
kinases and PIP phosphatases (Figure 13-10). The distribution, regulation, and 
local balance of these enzymes determine the steady-state distribution of each 
PIP species. As a consequence, the distribution of PIPs varies from organelle to 
organelle, and often within a continuous membrane from one region to another, 
thereby defining specialized membrane domains. 

Many proteins involved at different steps in vesicle transport contain domains 
that bind with high specificity to the head groups of particular PIPs, distinguishing 
one phosphorylated form from another (see Figure 13-10 E and F). Local control 
of the PI and PIP kinases and PIP phosphatases can therefore be used to rapidly 
control the binding of proteins to a membrane or membrane domain. The production 
of a particular type of PIP recruits proteins containing matching PIP-binding 
domains. The PIP-binding proteins then help regulate vesicle formation and other 
steps in the control of vesicle traffic (Figure 13-11). The same strategy is widely 
used to recruit specific intracellular signaling proteins to the plasma membrane 
in response to extracellular signals (discussed in Chapter 15). 




Figure 13-8 The assembly and disassembly of a clathrin coat. The
assembly of the coat introduces curvature into the membrane, which leads
in turn to the formation of a coated bud (called a coated pit if it is in the
plasma membrane). The adaptor proteins bind both clathrin triskelions and
membrane-bound cargo receptors, thereby mediating the selective recruitment
of both membrane and soluble cargo molecules into the vesicle. Other
membrane-bending and fission proteins are recruited to the neck of the budding
vesicle, where sharp membrane curvature is introduced. The coat is rapidly lost
shortly after the vesicle buds off.


Figure 13-9 Lipid-induced conformation switching of AP2. The AP2 adaptor
protein complex has four subunits (α, β2, μ2, and σ2). Upon interaction with the
phosphoinositide PI(4,5)P2 (see Figure 13-10) in the cytosolic leaflet of the plasma
membrane, AP2 rearranges so that binding sites for cargo receptors become exposed.
Each AP2 complex binds four PI(4,5)P2 molecules (for clarity, only one is shown).
In the open AP2 complex, the μ2 and σ2 subunits bind the cytosolic tails of cargo
receptors that display the appropriate endocytosis signals. These signals consist
of short amino acid sequence motifs. When AP2 binds tightly to the membrane, it
induces curvature, which favors the binding of additional AP2 complexes in the
vicinity.

Membrane-Bending Proteins Help Deform the Membrane During
Vesicle Formation

The forces generated by clathrin coat assembly alone are not sufficient to
shape and pinch off a vesicle from the membrane. Other membrane-bending
and force-generating proteins participate at every stage of the process.
Membrane-bending proteins that contain crescent-shaped domains, called BAR
domains, bind to and impose their shape on the underlying membrane via
electrostatic interactions with the lipid head groups (Figure 13-12; also see Figure
10-40). Such BAR-domain proteins are thought to help AP2 nucleate
clathrin-mediated endocytosis by shaping the plasma membrane to allow a clathrin-coated
bud to form. Some of these proteins also contain amphiphilic helices that induce
membrane curvature after being inserted as wedges into the cytoplasmic leaflet
of the membrane. Other BAR-domain proteins are important in shaping the neck
of a budding vesicle, where stabilization of sharp membrane bends is essential.
Finally, the clathrin machinery nucleates the local assembly of actin filaments
that introduce tension to help pinch off and propel the forming vesicle away from
the membrane.

Cytoplasmic Proteins Regulate the Pinching-Off and Uncoating of 
Coated Vesicles 

As a clathrin-coated bud grows, soluble cytoplasmic proteins, including dynamin,
assemble at the neck of each bud (Figure 13-13). Dynamin contains a
PI(4,5)P2-binding domain, which tethers the protein to the membrane, and a GTPase
domain, which regulates the rate at which vesicles pinch off from the membrane.


Figure 13-11 The intracellular location of phosphoinositides. Different
types of PIPs are located in different membranes and membrane domains,
where they are often associated with specific vesicle transport events. The
membrane of secretory vesicles, for example, contains PI(4)P. When the
vesicles fuse with the plasma membrane, a PI 5-kinase that is localized
there converts the PI(4)P into PI(4,5)P2. The PI(4,5)P2, in turn, helps recruit
adaptor proteins, which initiate the formation of a clathrin-coated pit, as the
first step in clathrin-mediated endocytosis. Once the clathrin-coated vesicle
buds off from the plasma membrane, a PI(5)P phosphatase hydrolyzes
PI(4,5)P2, which weakens the binding of the adaptor proteins, promoting
vesicle uncoating. We discuss phagocytosis and the distinction between
regulated and constitutive exocytosis later in the chapter. (Modified from
M.A. de Matteis and A. Godi, Nat. Cell Biol. 6:487-492, 2004. With
permission from Macmillan Publishers Ltd.)
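
The sequence of conversions described in this legend can be followed with a
small Python sketch. It is only an illustration: each phosphoinositide is
represented by the set of its phosphorylated ring positions, the function names
(pip_name, kinase, phosphatase) are hypothetical helpers invented for this
example, and the sketch assumes that the phosphatase acting after budding
removes the phosphate at the 5 position.

```python
def pip_name(positions):
    """Name a phosphoinositide from its phosphorylated ring positions.

    Ring positions go in parentheses, followed by the number of phosphate
    groups when there is more than one, e.g. {4, 5} -> "PI(4,5)P2".
    """
    if not positions:
        return "PI"
    ring = ",".join(str(p) for p in sorted(positions))
    count = len(positions)
    return f"PI({ring})P" + (str(count) if count > 1 else "")


def kinase(positions, site):
    """Add a phosphate at `site`, as a PI or PIP kinase would."""
    if site not in (3, 4, 5):
        raise ValueError("only the 3, 4, and 5 positions are modified")
    return frozenset(positions) | {site}


def phosphatase(positions, site):
    """Remove the phosphate at `site`, as a PIP phosphatase would."""
    return frozenset(positions) - {site}


# The membrane of a secretory vesicle contains PI(4)P.
vesicle = frozenset({4})
# After fusion, a PI 5-kinase at the plasma membrane produces PI(4,5)P2.
plasma_membrane = kinase(vesicle, 5)
# After the coated vesicle buds off, the 5-phosphate is removed (assumption).
uncoated = phosphatase(plasma_membrane, 5)

# prints: PI(4)P -> PI(4,5)P2 -> PI(4)P
print(pip_name(vesicle), "->", pip_name(plasma_membrane), "->", pip_name(uncoated))
```

Representing a PIP as a set of positions keeps the naming rule and the
enzyme reactions separate, so other species such as PI(3,4,5)P3 need no
extra code.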


Figure 13-10 Phosphatidylinositol (PI) and phosphoinositides (PIPs).
(A, B) The structure of PI shows the free hydroxyl groups in the inositol
sugar that can in principle be modified. (C) Phosphorylation of one, two, or three
of the hydroxyl groups on PI by PI and PIP kinases produces a variety of PIP
species. They are named according to the ring position (in parentheses) and the
number of phosphate groups (subscript) added to PI. PI(3,4)P2 is shown. (D) Animal
cells have several PI and PIP kinases and a similar number of PIP phosphatases,
which are localized to different organelles, where they are regulated to catalyze the
production of particular PIPs. The red and green arrows show the kinase and
phosphatase reactions, respectively. (E, F) Phosphoinositide head groups
are recognized by protein domains that discriminate between the different forms.
In this way, select groups of proteins containing such domains are recruited
to regions of membrane in which these phosphoinositides are present. PI(3)P and
PI(4,5)P2 are shown. (D, modified from M.A. de Matteis and A. Godi, Nat. Cell Biol.
6:487-492, 2004. With permission from Macmillan Publishers Ltd.)



KEY: PI(3)P, PI(4)P, PI(4,5)P2, PI(3,5)P2, PI(3,4,5)P3




































The pinching-off process brings the two noncytosolic leaflets of the membrane
into close proximity and fuses them, sealing off the forming vesicle (see Figure
13-2). To perform this task, dynamin recruits other proteins to the neck of the
bud. Together with dynamin, they help bend the patch of membrane, either by directly
distorting the bilayer structure, or by changing its lipid composition through the
recruitment of lipid-modifying enzymes, or by both mechanisms.

Once released from the membrane, the vesicle rapidly loses its clathrin coat. A
PIP phosphatase that is co-packaged into clathrin-coated vesicles depletes
PI(4,5)P2 from the membrane, which weakens the binding of the adaptor proteins. In
addition, an hsp70 chaperone protein (see Figure 6-80) functions as an uncoating
ATPase, using the energy of ATP hydrolysis to peel off the clathrin coat. Auxilin,
another vesicle protein, is thought to activate the ATPase. The release of the coat,
however, must not happen prematurely, so additional control mechanisms must
somehow prevent the clathrin from being removed before it has formed a complete
vesicle (discussed below).


Figure 13-12 The structure of BAR domains. BAR-domain proteins are diverse
and enable many membrane-bending processes in the cell. BAR domains are
built from coiled coils that dimerize into modules with a positively charged inner
surface, which preferentially interacts with negatively charged lipid head groups
to bend membranes. Local membrane deformations caused by BAR-domain
proteins facilitate the binding of additional BAR-domain proteins, thereby generating
a positive feedback cycle for curvature propagation. Individual BAR-domain
proteins contain a distinctive curvature and often have additional features that adapt
them to their specific tasks: some have short amphiphilic helices that cause further
membrane deformation by wedge insertion; others are flanked by PIP-binding domains
that direct them to membranes enriched in cognate phosphoinositides.


Figure 13-13 The role of dynamin in pinching off clathrin-coated vesicles.
(A) Multiple dynamin molecules assemble into a spiral around the neck of the
forming bud. The dynamin spiral is thought to recruit other proteins to the bud
neck, which, together with dynamin, destabilize the interacting lipid bilayers so
that the noncytoplasmic leaflets flow together. The newly formed vesicle then
pinches off from the membrane. Specific mutations in dynamin can either enhance
or block the pinching-off process. (B) Dynamin was discovered as the protein
defective in the shibire mutant of Drosophila. These mutant flies become paralyzed
because clathrin-mediated endocytosis stops, and the synaptic vesicle membrane
fails to recycle, blocking neurotransmitter release. Deeply invaginated
clathrin-coated pits form in the nerve endings of the fly's nerve cells, with a belt of
mutant dynamin assembled around the neck, as shown in this thin-section electron
micrograph (scale bar, 200 nm). The pinching-off process fails because the required
membrane fusion does not take place. (C, D) A model of how conformational changes
in the GTPase domains of membrane-assembled dynamin may power a conformational
change that constricts the neck of the bud. A single dynamin molecule is shown in
orange in D. (B, from J.H. Koenig and K. Ikeda, J. Neurosci. 9:3844-3860, 1989. With
permission from the Society of Neuroscience; C and D, adapted from M.G.J. Ford,
S. Jenni and J. Nunnari, Nature 477:561-566, 2011. With permission from Macmillan
Publishers.)

Monomeric GTPases Control Coat Assembly

To balance the vesicle traffic to and from a compartment, coat proteins must
assemble only when and where they are needed. While local production of PIPs
plays a major part in regulating the assembly of clathrin coats on the plasma
membrane and Golgi apparatus, cells superimpose additional ways of regulating
coat formation. Coat-recruitment GTPases, for example, control the assembly
of clathrin coats on endosomes and the COPI and COPII coats on Golgi and ER
membranes.

Many steps in vesicle transport depend on a variety of GTP-binding proteins
that control both the spatial and temporal aspects of vesicle formation and
fusion. As discussed in Chapter 3, GTP-binding proteins regulate most processes
in eukaryotic cells. They act as molecular switches, which flip between an active
state with GTP bound and an inactive state with GDP bound. Two classes of proteins
regulate the flipping: guanine nucleotide exchange factors (GEFs) activate
the proteins by catalyzing the exchange of GDP for GTP, and GTPase-activating
proteins (GAPs) inactivate the proteins by triggering the hydrolysis of the bound
GTP to GDP (see Figures 3-68 and 15-7). Although both monomeric GTP-binding
proteins (monomeric GTPases) and trimeric GTP-binding proteins (G proteins)
have important roles in vesicle transport, the roles of the monomeric GTPases are
better understood, and we focus on them here.
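
The switch behavior described above can be written down as a two-state model.
The following Python sketch is a deliberate simplification offered only as an
illustration: the class GTPase and the functions apply_gef and apply_gap are
hypothetical names chosen for this example, and the model ignores rates,
concentrations, and membrane binding.

```python
from dataclasses import dataclass


@dataclass
class GTPase:
    """A monomeric GTPase modeled as a two-state molecular switch."""
    name: str
    nucleotide: str = "GDP"  # starts in the inactive, GDP-bound state

    @property
    def active(self):
        """The switch is on only while GTP is bound."""
        return self.nucleotide == "GTP"


def apply_gef(protein):
    """A GEF catalyzes the exchange of bound GDP for GTP (switch on)."""
    if protein.nucleotide == "GDP":
        protein.nucleotide = "GTP"
    return protein


def apply_gap(protein):
    """A GAP triggers hydrolysis of the bound GTP to GDP (switch off)."""
    if protein.nucleotide == "GTP":
        protein.nucleotide = "GDP"
    return protein


switch = GTPase("example monomeric GTPase")
states = [switch.active]      # inactive before any regulator acts
apply_gef(switch)
states.append(switch.active)  # active after GEF-catalyzed exchange
apply_gap(switch)
states.append(switch.active)  # inactive again after GAP-stimulated hydrolysis

print(states)  # [False, True, False]
```

Keeping the GEF and GAP as separate functions mirrors the point of the text:
the GTPase itself only stores a state, and the regulators decide when it flips.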

Coat-recruitment GTPases are members of a family of monomeric GTPases.
They include the ARF proteins, which are responsible for the assembly of both
COPI and clathrin coats at Golgi membranes, and the Sar1 protein,
which is responsible for the assembly of COPII coats at the ER membrane.
Coat-recruitment GTPases are usually found in high concentration in the cytosol
in an inactive, GDP-bound state. When a COPII-coated vesicle is to bud from
the ER membrane, for example, a specific Sar1-GEF embedded in the ER membrane
binds to cytosolic Sar1, causing the Sar1 to release its GDP and bind GTP
in its place. (Recall that GTP is present in much higher concentration in the
cytosol than GDP and therefore will spontaneously bind after GDP is released.) In its
GTP-bound state, the Sar1 protein exposes an amphiphilic helix, which inserts
into the cytoplasmic leaflet of the lipid bilayer of the ER membrane. The tightly
bound Sar1 now recruits adaptor coat protein subunits to the ER membrane to
initiate budding (Figure 13-14). Other GEFs and coat-recruitment GTPases operate
in a similar way on other membranes.

The coat-recruitment GTPases also have a role in coat disassembly. The hydrolysis
of bound GTP to GDP causes the GTPase to change its conformation so that
its hydrophobic tail pops out of the membrane, causing the vesicle's coat to
disassemble. Although it is not known what triggers the GTP hydrolysis, it has been
proposed that the GTPases work like timers, which hydrolyze GTP at slow but
predictable rates, to ensure that vesicle formation is synchronized with the
requirements of the moment. COPII coats accelerate GTP hydrolysis by Sar1, and a fully
formed vesicle will be produced only when bud formation occurs faster than the
timed disassembly process; otherwise, disassembly will be triggered before a
vesicle pinches off, and the process will have to start again, perhaps at a more
appropriate time and place. Once a vesicle pinches off, GTP hydrolysis releases
Sar1, but the sealed coat is sufficiently stabilized through many cooperative
interactions, including binding to the cargo receptors in the membrane, that it may
stay on the vesicle until the vesicle docks at a target membrane. There, a kinase
phosphorylates the coat proteins, which completes coat disassembly and readies
the vesicle for fusion.

Clathrin- and COPI-coated vesicles, by contrast, shed their coat soon after
they pinch off. For COPI vesicles, the curvature of the vesicle membrane serves
as a trigger to begin uncoating. An ARF-GAP is recruited to the COPI coat as it
assembles. It interacts with the membrane, and senses the lipid packing density.
It becomes activated when the curvature of the membrane approaches that of a
transport vesicle. It then inactivates ARF, causing the coat to disassemble.

Figure 13-14 Formation of a COPII-coated vesicle. (A) Inactive, soluble
Sar1-GDP binds to a Sar1-GEF in the ER membrane, causing the Sar1 to release
its GDP and bind GTP. A GTP-triggered conformational change in Sar1 exposes
an amphiphilic helix, which inserts into the cytoplasmic leaflet of the ER membrane,
initiating membrane bending (which is not shown). (B) GTP-bound Sar1 binds
to a complex of two COPII adaptor coat proteins, called Sec23 and Sec24, which
form the inner coat. Sec24 has several different binding sites for the cytosolic
tails of cargo receptors. The entire surface of the complex that attaches to the
membrane is gently curved, matching the diameter of COPII-coated vesicles.
(C) A complex of two additional COPII coat proteins, called Sec13 and Sec31,
forms the outer shell of the coat. Like clathrin, they can assemble on their own
into symmetrical cages with appropriate dimensions to enclose a COPII-coated
vesicle. (D) Membrane-bound, active Sar1-GTP recruits COPII adaptor proteins
to the membrane. They select certain transmembrane proteins and cause the
membrane to deform. The adaptor proteins then recruit the outer coat proteins, which
help form a bud. A subsequent membrane fusion event pinches off the coated vesicle.
Other coated vesicles are thought to form in a similar way. (C, modified from
S.M. Stagg et al., Nature 439:234-238, 2006. With permission from Macmillan
Publishers Ltd.)


Not All Transport Vesicles Are Spherical

Although vesicle-budding is similar at various locations in the cell, each cell
membrane poses its own special challenges. The plasma membrane, for example,
is comparatively flat and stiff, owing to its cholesterol-rich lipid composition
and underlying actin-rich cortex. Thus, the coordinated action of clathrin coats
and membrane-bending proteins has to produce sufficient force to introduce
curvature, especially at the neck of the bud where sharp bends are required for
the pinching-off processes. In contrast, vesicle-budding from many intracellular
membranes occurs preferentially at regions where the membranes are already
curved, such as the rims of the Golgi cisternae or ends of membrane tubules. In
these places, the primary function of the coats is to capture the appropriate cargo
proteins rather than to deform the membrane.

Transport vesicles also occur in various sizes and shapes. Diverse COPII vesicles
are required for the transport of large cargo molecules. Collagen, for example,
is assembled in the ER as 300-nm-long, stiff procollagen rods that then are
secreted from the cell where they are cleaved by proteases to collagen, which is
embedded into the extracellular matrix (discussed in Chapter 19). Procollagen
rods do not fit into the 60-80 nm COPII vesicles normally observed. To circumvent
this problem, the procollagen cargo molecules bind to transmembrane packaging
proteins in the ER, which control the assembly of the COPII coat components
(Figure 13-15). These events drive the local assembly of much larger COPII vesicles
that accommodate the oversized cargo. Human mutations in genes encoding
such packaging proteins result in collagen defects with severe consequences, such
as skeletal abnormalities and other developmental defects. Similar mechanisms
must regulate the sizes of vesicles required to secrete other large macromolecular
complexes, including the lipoprotein particles that transport lipids out of cells.

Many other vesicle budding events likewise involve variations of common
mechanisms. When living cells are genetically engineered to express fluorescent
membrane components, the endosomes and trans Golgi network are seen
in a fluorescence microscope to continually send out long tubules. Coat proteins
assemble onto the membrane tubules and help recruit specific cargo. The tubules
then either regress or pinch off with the help of dynamin-like proteins to form
transport vesicles of different sizes and shapes.

Tubules have a higher surface-to-volume ratio than the larger organelles from
which they form. They are therefore relatively enriched in membrane proteins
compared with soluble cargo proteins. As we discuss later, this property of tubules
is an important feature for sorting proteins in endosomes.
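
The geometric argument can be checked with a short calculation. The Python
sketch below treats the parent organelle as a sphere and the tubule as an open
cylinder; both shapes and all dimensions are assumptions chosen for
illustration and are not values given in the text.

```python
import math


def sphere_surface_to_volume(radius_nm):
    """Surface-to-volume ratio of a sphere, which simplifies to 3/r."""
    area = 4 * math.pi * radius_nm ** 2
    volume = (4 / 3) * math.pi * radius_nm ** 3
    return area / volume


def tubule_surface_to_volume(radius_nm, length_nm):
    """Surface-to-volume ratio of an open cylinder (end caps ignored): 2/r."""
    area = 2 * math.pi * radius_nm * length_nm
    volume = math.pi * radius_nm ** 2 * length_nm
    return area / volume


# Illustrative dimensions only (assumed, not taken from the text).
organelle = sphere_surface_to_volume(radius_nm=250)
tubule = tubule_surface_to_volume(radius_nm=25, length_nm=1000)

print(f"organelle: {organelle:.3f} per nm, tubule: {tubule:.3f} per nm")
print(f"membrane relative to lumen, tubule vs organelle: {tubule / organelle:.1f}x")
```

Because the ratio depends only on the inverse of the radius for both shapes, a
narrow tubule carries far more membrane per unit of enclosed volume than a
wide organelle, whatever its length.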


Figure 13-15 Packaging of procollagen into large tubular COPII-coated vesicles.
The cartoons show models for two COPII coat assembly modes. The models are
based on cryoelectron tomography images of reconstituted COPII vesicles.
On a spherical membrane (left), the Sec23/24 inner coat proteins assemble in
patches that anchor the Sec13/31 outer coat protein cage. The Sec13/31 rods
assemble a cage of triangles, squares, and pentagons. When procollagen needs
to be packaged (right), special packaging proteins sense the cargo and modify the
coat assembly process. This interaction recruits the COPII inner coat protein Sec24
and locally enhances the rate with which Sar1 cycles on and off the membrane
(not shown). In addition, a monoubiquitin is added to the Sec31 protein, changing
the assembly properties of the outer cage. Sec23/24 proteins arrange in larger
arrays and Sec13/31 arrange in a regular lattice of diamond shapes. As the result,
a large tubular vesicle is formed that can accommodate the large cargo molecules.
The packaging proteins are not part of the budding vesicle but remain in the
ER. (Modified from G. Zanetti et al., eLife 2:e00951, 2013.)


Rab Proteins Guide Transport Vesicles to Their Target Membrane 

To ensure an orderly flow of vesicle traffic, transport vesicles must be highly
accurate in recognizing the correct target membrane with which to fuse. Because of the
diversity and crowding of membrane systems in the cytoplasm, a vesicle is likely
to encounter many potential target membranes before it finds the correct one.
Specificity in targeting is ensured because all transport vesicles display surface
markers that identify them according to their origin and type of cargo, and target
membranes display complementary receptors that recognize the appropriate
markers. This crucial process occurs in two steps. First, Rab proteins and Rab
effectors direct the vesicle to specific spots on the correct target membrane. Second,
SNARE proteins and SNARE regulators mediate the fusion of the lipid bilayers.

Rab proteins play a central part in the specificity of vesicle transport. Like the
coat-recruitment GTPases discussed earlier (see Figure 13-14), Rab proteins are
also monomeric GTPases. With over 60 known members, the Rab subfamily is the
largest of the monomeric GTPase subfamilies. Each Rab protein is associated with
one or more membrane-enclosed organelles of the secretory or endocytic pathways,
and each of these organelles has at least one Rab protein on its cytosolic surface
(Table 13-1). Their highly selective distribution on these membrane systems
makes Rab proteins ideal molecular markers for identifying each membrane type
and guiding vesicle traffic between them. Rab proteins can function on transport
vesicles, on target membranes, or both.

Like the coat-recruitment GTPases, Rab proteins cycle between a membrane
and the cytosol and regulate the reversible assembly of protein complexes on the
membrane. In their GDP-bound state, they are inactive and bound to another
protein (Rab-GDP dissociation inhibitor, or GDI) that keeps them soluble in the
cytosol; in their GTP-bound state, they are active and tightly associated with the
membrane of an organelle or transport vesicle. Membrane-bound Rab-GEFs
activate Rab proteins on both transport vesicle and target membranes; for some
membrane fusion events, activated Rab molecules are required on both sides
of the reaction. Once in the GTP-bound state and membrane-bound through a
now-exposed lipid anchor, Rab proteins bind to other proteins, called Rab
effectors, which are the downstream mediators of vesicle transport, membrane
tethering, and membrane fusion (Figure 13-16). The rate of GTP hydrolysis sets the
concentration of active Rab and, consequently, the concentration of its effectors
on the membrane.

TABLE 13-1 Subcellular Locations of Some Rab Proteins

Protein       Organelle
Rab1          ER and Golgi complex
Rab2          cis Golgi network
Rab3A         Synaptic vesicles, secretory vesicles
Rab4/Rab11    Recycling endosomes
Rab5          Early endosomes, plasma membrane, clathrin-coated vesicles
Rab6          Medial and trans Golgi
Rab7          Late endosomes
Rab8          Cilia
Rab9          Late endosomes, trans Golgi

In contrast to the highly conserved structure of Rab proteins, the structures
and functions of Rab effectors vary greatly, and the same Rab proteins can often
bind to many different effectors. Some Rab effectors are motor proteins that
propel vesicles along actin filaments or microtubules to their target membrane.
Others are tethering proteins, some of which have long, threadlike domains that serve
as "fishing lines" that can extend to link two membranes more than 200 nm apart;
other tethering proteins are large protein complexes that link two membranes
that are closer together and interact with a wide variety of other proteins that
facilitate the membrane fusion step. The tethering complex that docks COPII-coated
vesicles, for example, contains a protein kinase that phosphorylates the coat
proteins to complete the uncoating process. Coupling uncoating to vesicle delivery
helps to ensure directionality of the transport process and fusion with the proper
membrane. Rab effectors can also interact with SNAREs to couple membrane
tethering to fusion (see Figure 13-16).

Figure 13-16 Tethering of a transport vesicle to a target membrane. Rab
effector proteins interact with active Rab proteins (Rab-GTPs, yellow) located on
the target membrane, vesicle membrane, or both, to establish the first connection
between the two membranes that are going to fuse. In the example shown here,
the Rab effector is a filamentous tethering protein (dark green). Next, SNARE proteins
on the two membranes (red and blue) pair, docking the vesicle to the target membrane
and catalyzing the fusion of the two apposed lipid bilayers. During docking and
fusion, a Rab-GAP (not shown) induces the Rab protein to hydrolyze its bound GTP to
GDP, causing the Rab to dissociate from the membrane and return to the cytosol
as Rab-GDP, where it is bound by a GDI protein that keeps the Rab soluble and
inactive.

The assembly of Rab proteins and their effectors on a membrane is cooperative
and results in the formation of large, specialized membrane patches. Rab5,
for example, assembles on endosomes and mediates the capture of endocytic
vesicles arriving from the plasma membrane. The experimental depletion of
Rab5 causes disappearance of the entire endosomal and lysosomal membrane
system, highlighting the crucial role of Rab proteins in organelle biogenesis and
maintenance.

A Rab5 domain concentrates tethering proteins that catch incoming vesicles.
Its assembly on endosomal membranes begins when a Rab5-GDP/GDI complex
encounters a Rab-GEF. GDI is released and Rab5-GDP is converted to Rab5-GTP.
Active Rab5-GTP becomes anchored to the membrane and recruits more
Rab5-GEF to the endosome, thereby stimulating the recruitment of more Rab5 to the
same site. In addition, active Rab5 activates a PI 3-kinase, which locally converts
PI to PI(3)P, which in turn binds some of the Rab effectors including tethering
proteins and stabilizes their local membrane attachment (Figure 13-17). This type of
positive feedback greatly amplifies the assembly process and helps to establish
functionally distinct membrane domains within a continuous membrane.

The endosomal membrane provides a striking example of how different Rab
proteins and their effectors help to create multiple specialized membrane domains,
each fulfilling a particular set of functions. Thus, while the Rab5 membrane domain
receives incoming endocytic vesicles from the plasma membrane, distinct Rab11
and Rab4 domains in the same membrane organize the budding of recycling vesicles
that return proteins from the endosome to the plasma membrane.

Rab Cascades Can Change the Identity of an Organelle 

A Rab domain can be disassembled and replaced by a different Rab domain,
changing the identity of an organelle. Such ordered recruitment of sequentially
acting Rab proteins is called a Rab cascade. Over time, for example, Rab5 domains
are replaced by Rab7 domains on endosomal membranes. This converts an early
endosome, marked by Rab5, into a late endosome, marked by Rab7. Because the
set of Rab effectors recruited by Rab7 is different from that recruited by Rab5,
this change reprograms the compartment: as we discuss later, it alters the membrane
dynamics, including the incoming and outgoing traffic, and repositions
the organelle away from the plasma membrane toward the cell interior. All of the
cargo contained in the early endosome that has not been recycled to the plasma
membrane is now part of a late endosome. This process is also referred to as
endosome maturation. The self-amplifying nature of the Rab domains renders the
process of endosome maturation unidirectional and irreversible (Figure 13-18).
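
The logic of such a hand-over can be replayed as a short sequence of events.
The Python sketch below follows the generic scheme of Figure 13-18, in which
RabA recruits a GEF for RabB and RabB recruits a GAP for RabA; for endosome
maturation, RabA would correspond to Rab5 and RabB to Rab7. The function names
are hypothetical, and the model is a discrete caricature that ignores rates
and the gradual growth of each domain.

```python
def domain_identity(active):
    """Describe the membrane by the Rab proteins currently active on it."""
    return "+".join(sorted(active)) if active else "none"


def rab_cascade():
    """Replay a generic two-step Rab cascade as discrete events."""
    active = set()
    history = []

    # A RabA-GEF is activated locally, so a RabA domain assembles.
    active.add("RabA")
    history.append(domain_identity(active))

    # One effector of active RabA is a GEF for RabB, so RabB is recruited.
    if "RabA" in active:
        active.add("RabB")
    history.append(domain_identity(active))

    # One effector of active RabB is a GAP for RabA, so RabA is inactivated
    # and the RabA domain disassembles.
    if "RabB" in active:
        active.discard("RabA")
    history.append(domain_identity(active))

    return history


print(rab_cascade())  # ['RabA', 'RabA+RabB', 'RabB']
```

Nothing in the final state can restore RabA, because the only active Rab now
recruits a GAP for it; this is the sense in which the replacement runs in one
direction.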


Figure 13-17 The formation of a Rab5 domain on the endosome membrane.
A Rab5-GEF on the endosome membrane binds a Rab5 protein and induces it to
exchange GDP for GTP. GDI is lost and GTP binding alters the conformation of
the Rab protein, exposing an amphiphilic helix and a covalently attached lipid group,
which together anchor the Rab5-GTP to the membrane. Active Rab5 activates
PI 3-kinase, which converts PI into PI(3)P. PI(3)P and active Rab5 together bind a
variety of Rab effector proteins that contain PI(3)P-binding sites, including filamentous
tethering proteins that catch incoming clathrin-coated endocytic vesicles from
the plasma membrane. With the help of another effector, active Rab5 also recruits
more Rab5-GEF, further enhancing the assembly of the Rab5 domain on the
membrane.

Controlled cycles of GTP hydrolysis and GDP-GTP exchange dynamically
regulate the size and activity of such Rab domains. Unlike SNAREs, which are
integral membrane proteins, the GDP/GTP cycle, coupled to the membrane/cytosol
translocation cycle, endows the Rab machinery with the ability to
undergo assembly and disassembly on the membrane. (Adapted from M. Zerial
and H. McBride, Nat. Rev. Mol. Cell Biol. 2:107-117, 2001. With permission from
Macmillan Publishers Ltd.)

SNAREs Mediate Membrane Fusion

Once a transport vesicle has been tethered to its target membrane, it unloads its
cargo by membrane fusion. Membrane fusion requires bringing the lipid bilayers
of two membranes to within 1.5 nm of each other so that they can merge. When
the membranes are in such close apposition, lipids can flow from one bilayer to
the other. For this close approach, water must be displaced from the hydrophilic
surface of the membrane, a process that is highly energetically unfavorable and
requires specialized fusion proteins that overcome this energy barrier. We have
already discussed the role of dynamin in a related task during the pinching-off of
clathrin-coated vesicles (see Figure 13-13).

The SNARE proteins (also called SNAREs, for short) catalyze the membrane
fusion reactions in vesicle transport. There are at least 35 different SNAREs in an
animal cell, each associated with a particular organelle in the secretory or
endocytic pathways. These transmembrane proteins exist as complementary sets, with
v-SNAREs usually found on vesicle membranes and t-SNAREs usually found on
target membranes (see Figure 13-16). A v-SNARE is a single polypeptide chain,
whereas a t-SNARE is usually composed of three proteins. The v-SNAREs and
t-SNAREs have characteristic helical domains, and when a v-SNARE interacts with
a t-SNARE, the helical domains of one wrap around the helical domains of the
other to form a very stable four-helix bundle. The resulting trans-SNARE complex
locks the two membranes together. Biochemical membrane fusion assays with all
different SNARE combinations show that v- and t-SNARE pairing is highly specific.
The SNAREs thus provide an additional layer of specificity in the transport process
by helping to ensure that vesicles fuse only with the correct target membrane.

The trans-SNARE complexes catalyze membrane fusion by using the energy
that is freed when the interacting helices wrap around each other to pull the
membrane faces together, simultaneously squeezing out water molecules from
the interface (Figure 13-19). When liposomes containing purified v-SNAREs are
mixed with liposomes containing complementary t-SNAREs, their membranes
fuse, albeit slowly. In the cell, other proteins recruited to the fusion site,
presumably Rab effectors, cooperate with SNAREs to accelerate fusion. Fusion does not
always follow immediately after v-SNAREs and t-SNAREs pair. As we discuss later,
in the process of regulated exocytosis, fusion is delayed until secretion is triggered
by a specific extracellular signal.

Rab proteins, which can regulate the availability of SNARE proteins, exert an 
additional layer of control. t-SNAREs in target membranes are often associated 
with inhibitory proteins that must be released before the t-SNARE can function. 
Rab proteins and their effectors trigger the release of such SNARE inhibitory 


Figure 13-19 A model for how SNARE proteins may catalyze membrane 
fusion. Bilayer fusion occurs in multiple steps. A tight pairing between 
v- and t-SNAREs forces lipid bilayers into close apposition and expels 
water molecules from the interface. Lipid molecules in the two interacting 
(cytosolic) leaflets of the bilayers then flow between the membranes to form 
a connecting stalk. Lipids of the two noncytosolic leaflets then contact each 
other, forming a new bilayer, which widens the fusion zone (hemifusion, or 
half-fusion). Rupture of the new bilayer completes the fusion reaction. 


Figure 13-18 A model for a generic 
Rab cascade. The local activation of a 
RabA-GEF leads to assembly of a RabA 
domain on the membrane. Active RabA 
recruits its effector proteins, one of which 
is a GEF for RabB. The RabB-GEF then 
recruits RabB to the membrane, which in 
turn begins to recruit its effectors, among 
them a GAP for RabA. The RabA-GAP 
activates RabA GTP hydrolysis, leading 
to the inactivation of RabA and the 
disassembly of the RabA domain as the 
RabB domain grows. In this way, the RabA 
domain is irreversibly replaced by the RabB 
domain. In principle, this sequence can 
be continued by the recruitment of a next GEF 
by RabB. (Adapted from A.H. Hutagalung 
and P.J. Novick, Physiol. Rev. 91:119-149, 
2011. With permission from The American 
Physiological Society.) 
proteins. In this way, SNARE proteins are concentrated and activated in the correct 
location on the membrane, where tethering proteins capture incoming vesicles. 
Rab proteins thus speed up the process by which appropriate SNARE proteins 
in two membranes find each other. 

For vesicle transport to operate normally, transport vesicles must incorporate 
the appropriate SNARE and Rab proteins. Not surprisingly, therefore, many 
transport vesicles will form only if they incorporate the appropriate complement 
of SNARE and Rab proteins in their membrane. How this crucial control process 
operates during vesicle budding remains a mystery. 

Interacting SNAREs Need to Be Pried Apart Before They Can 
Function Again 

Most SNARE proteins in cells have already participated in multiple rounds of 
vesicle transport and are sometimes present in a membrane as stable complexes 
with partner SNAREs. The complexes have to disassemble before the SNAREs can 
mediate new rounds of transport. A crucial protein called NSF cycles between 
membranes and the cytosol and catalyzes the disassembly process. NSF is a 
hexameric ATPase of the family of AAA-ATPases (see Figure 6-85) that uses the 
energy of ATP hydrolysis to unravel the intimate interactions between the helical 
domains of paired SNARE proteins (Figure 13-20). The requirement for NSF-mediated 
reactivation of SNAREs by SNARE complex disassembly helps prevent 
membranes from fusing indiscriminately: if the t-SNAREs in a target membrane 
were always active, any membrane containing an appropriate v-SNARE might 
fuse whenever the two membranes made contact. It is not known how the activity 
of NSF is controlled so that the SNARE machinery is activated at the right time and 
place. It is also not known how v-SNAREs are selectively retrieved and returned to 
their compartment of origin so that they can be reused in newly formed transport 
vesicles. 

Membrane fusion is important in other processes besides vesicle transport. 
The plasma membranes of a sperm and an egg fuse during fertilization, and myoblasts 
fuse with one another during the development of multinucleate muscle 
fibers (discussed in Chapter 22). Likewise, the ER network and mitochondria fuse 
and fragment in a dynamic way (discussed in Chapters 12 and 14). All cell membrane 
fusions require special proteins and are tightly regulated to ensure that only 
appropriate membranes fuse. The controls are crucial for maintaining both the 
identity of cells and the individuality of each type of intracellular compartment. 

The membrane fusions catalyzed by viral fusion proteins are well understood. 
These proteins have a crucial role in permitting the entry of enveloped viruses 
(which have a lipid-bilayer-based membrane coat) into the cells that they infect 
(discussed in Chapters 5 and 23). For example, viruses such as the human immunodeficiency 
virus (HIV), which causes AIDS, bind to cell-surface receptors and 
then fuse with the plasma membrane of the target cell (Figure 13-21). This fusion 
event allows the viral nucleic acid inside the nucleocapsid to enter the cytosol, 
where it replicates. Other viruses, such as the influenza virus, first enter the cell 
by receptor-mediated endocytosis (discussed later) and are delivered to endosomes; 
the low pH in endosomes activates a fusion protein in the viral envelope 
that catalyzes the fusion of the viral and endosomal membranes, releasing the 
viral nucleic acid into the cytosol. Viral fusion proteins and SNAREs promote lipid 
bilayer fusion in similar ways. 


(Figure 13-20 labels: trans-SNARE; NSF; docking; fusion; SNARE dissociation) 

Figure 13-20 Dissociation of SNARE 
pairs by NSF after a membrane fusion 
cycle. After a v-SNARE and t-SNARE have 
mediated the fusion of a transport vesicle 
with a target membrane, NSF binds to 
the SNARE complex and, with the help of 
accessory proteins, hydrolyzes ATP to pry 
the SNAREs apart. 

Figure 13-21 The entry of enveloped 
viruses into cells. Electron micrographs 
showing how HIV enters a cell by fusing 
its membrane with the plasma membrane 
of the cell. (From B.S. Stein et al., Cell 
49:659-668, 1987. With permission from 
Elsevier.) 


Summary 

Directed and selective transport of particular membrane components from one 
membrane-enclosed compartment to another in a eukaryotic cell maintains the 
differences between those compartments. Transport vesicles, which can be spherical, 
tubular, or irregularly shaped, bud from specialized coated regions of the donor 
membrane. The assembly of the coat helps to collect specific membrane and soluble 
cargo molecules for transport and to drive the formation of the vesicle. 

There are various types of coated vesicles. The best characterized are clathrin-coated 
vesicles, which mediate transport from the plasma membrane and the 
trans Golgi network, and COPI- and COPII-coated vesicles, which mediate transport 
between Golgi cisternae and between the ER and the Golgi apparatus, respectively. 
Coats have a common two-layered structure: an inner layer formed of adaptor 
proteins links the outer layer (or cage) to the vesicle membrane and also traps 
specific cargo molecules for packaging into the vesicle. The coat is shed before the 
vesicle fuses with its appropriate target membrane. 

Local synthesis of specific phosphoinositides creates binding sites that trigger 
clathrin coat assembly and vesicle budding. In addition, monomeric GTPases help 
regulate various steps in vesicle transport, including both vesicle budding and 
docking. The coat-recruitment GTPases, including Sar1 and the ARF proteins, regulate 
coat assembly and disassembly. A large family of Rab proteins functions as 
vesicle-targeting GTPases. Rab proteins are recruited to both forming transport 
vesicles and target membranes. The assembly and disassembly of Rab proteins 
and their effectors in specialized membrane domains are dynamically controlled 
by GTP binding and hydrolysis. Active Rab proteins recruit Rab effectors, such as 
motor proteins, which transport vesicles along actin filaments or microtubules, and 
filamentous tethering proteins, which help ensure that the vesicles deliver their contents 
only to the appropriate target membrane. Complementary v-SNARE proteins 
on transport vesicles and t-SNARE proteins on the target membrane form stable 
trans-SNARE complexes, which force the two membranes into close apposition so 
that their lipid bilayers can fuse. 

TRANSPORT FROM THE ER THROUGH THE 
GOLGI APPARATUS 

As discussed in Chapter 12, newly synthesized proteins cross the ER membrane 
from the cytosol to enter the secretory pathway. During their subsequent transport, 
from the ER to the Golgi apparatus and from the Golgi apparatus to the cell surface 
and elsewhere, these proteins are successively modified as they pass through a 
series of compartments. Transfer from one compartment to the next involves a 
delicate balance between forward and backward (retrieval) transport pathways. 
Some transport vesicles select cargo molecules and move them to the next compartment 
in the pathway, while others retrieve escaped proteins and return them 
to a previous compartment where they normally function. Thus, the pathway from 
the ER to the cell surface consists of many sorting steps, which continuously select 
membrane and soluble lumenal proteins for packaging and transport. 

In this section, we focus mainly on the Golgi apparatus (also called the Golgi 
complex). It is a major site of carbohydrate synthesis, as well as a sorting and dispatching 
station for products of the ER. The cell makes many polysaccharides in 
the Golgi apparatus, including the pectin and hemicellulose of the cell wall in 
plants and most of the glycosaminoglycans of the extracellular matrix in animals 
(discussed in Chapter 19). The Golgi apparatus also lies on the exit route from the 
ER, and a large proportion of the carbohydrates that it makes are attached as oligosaccharide 
side chains to the many proteins and lipids that the ER sends to it. A 
subset of these oligosaccharide groups serve as tags to direct specific proteins into 
vesicles that then transport them to lysosomes. But most proteins and lipids, once 
they have acquired their appropriate oligosaccharides in the Golgi apparatus, are 
recognized in other ways for targeting into the transport vesicles going to other 
destinations. 

Proteins Leave the ER in COPII-Coated Transport Vesicles 

To initiate their journey along the secretory pathway, proteins that have entered 
the ER and are destined for the Golgi apparatus or beyond are first packaged into 
COPII-coated transport vesicles. These vesicles bud from specialized regions of 
the ER called ER exit sites, whose membrane lacks bound ribosomes. Most animal 
cells have ER exit sites dispersed throughout the ER network. 

Entry into vesicles that leave the ER can be a selective process or can happen by 
default. Many membrane proteins are actively recruited into such vesicles, where 
they become concentrated. These cargo membrane proteins display exit (transport) 
signals on their cytosolic surface that adaptor proteins of the inner COPII 
coat recognize (Figure 13-22); some of these components act as cargo receptors 
and are recycled back to the ER after they have delivered their cargo to the Golgi 
apparatus. Soluble cargo proteins in the ER lumen, by contrast, have exit signals 
that attach them to transmembrane cargo receptors. Proteins without exit signals 
can also enter transport vesicles, including protein molecules that normally function 
in the ER (so-called ER resident proteins), some of which slowly leak out of 
the ER and are delivered to the Golgi apparatus. Different cargo proteins enter the 
transport vesicles with substantially different rates and efficiencies, which may 
result from differences in their folding and oligomerization efficiencies and kinetics, 
as well as the factors already discussed. The exit step from the ER is a major 
checkpoint at which quality control is exerted on the proteins that a cell secretes 
or displays on its surface, as we discussed in Chapter 12. 

The exit signals that direct soluble proteins out of the ER for transport to the 
Golgi apparatus and beyond are not well understood. Some transmembrane 
proteins that serve as cargo receptors for packaging some secretory proteins into 
COPII-coated vesicles are lectins that bind to oligosaccharides on the secreted 


Figure 13-22 The recruitment of 
membrane and soluble cargo molecules 
into ER transport vesicles. Membrane 
proteins are packaged into budding 
transport vesicles through interactions 
of exit signals on their cytosolic tails with 
adaptor proteins of the inner COPII coat. 
Some of these membrane proteins function 
as cargo receptors, binding soluble 
proteins in the ER lumen and helping to 
package them into vesicles. Other proteins 
may enter the vesicle by bulk flow. A typical 
50 nm transport vesicle contains about 200 
membrane proteins, which can be of many 
different types. As indicated, unfolded or 
incompletely assembled proteins are bound 
to chaperones and transiently retained in 
the ER compartment. 
proteins. One such lectin, for example, binds to mannose on two secreted 
blood-clotting factors (Factor V and Factor VIII), thereby packaging the proteins 
into transport vesicles in the ER; its role in protein transport was identified 
because humans who lack it owing to an inherited mutation have lowered serum 
levels of Factors V and VIII, and they therefore bleed excessively. 

Only Proteins That Are Properly Folded and Assembled Can Leave 
the ER 

To exit from the ER, proteins must be properly folded and, if they are subunits of 
multiprotein complexes, they need to be completely assembled. Those that are 
misfolded or incompletely assembled transiently remain in the ER, where they 
are bound to chaperone proteins (discussed in Chapter 6) such as BiP or calnexin. 
The chaperones may cover up the exit signais or somehow anchor the proteins 
in the ER. Such failed proteins are eventually transported back into the cytosol, 
where they are degraded by proteasomes (discussed in Chapters 6 and 12). This 
quality-control step prevents the onward transport of misfolded or misassembled 
proteins that could potentially interfere with the functions of normal proteins. 
Such failures are surprisingly common. More than 90% of the newly synthesized 
subunits of the T cell receptor (discussed in Chapter 24) and of the acetylcholine 
receptor (discussed in Chapter 11), for example, are normally degraded without 
ever reaching the cell surface where they function. Thus, cells must make a large 
excess of some protein molecules to produce a select few that fold, assemble, and 
function properly. 
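
The size of the excess follows from simple arithmetic, sketched below. The function name and the idea of a single "success fraction" are assumptions made for illustration. If more than 90% of subunits are degraded, the success fraction is below 0.1, so more than ten subunits must be synthesized for each one that reaches the cell surface.

```python
def copies_needed(functional_copies, success_fraction):
    """Average number of subunits to synthesize for a target number of functional ones.

    success_fraction is the share of newly made subunits that fold, assemble,
    and escape degradation (a simplifying assumption: one fixed fraction).
    """
    if not 0 < success_fraction <= 1:
        raise ValueError("success_fraction must be in (0, 1]")
    return functional_copies / success_fraction


# A success fraction of 0.1 corresponds to 90% degradation, the lower bound in the text.
subunits_per_functional_copy = copies_needed(1, 0.1)
```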

Sometimes, however, there are drawbacks to the stringent quality-control 
mechanism. The predominant mutations that cause cystic fibrosis, a common 
inherited disease, result in the production of a slightly misfolded form of a plasma 
membrane protein important for Cl⁻ transport. Although the mutant protein 
would function normally if it reached the plasma membrane, it is retained in the 
ER and then is degraded by cytosolic proteasomes. This devastating disease thus 
results not because the mutation inactivates the protein but because the active 
protein is discarded before it reaches the plasma membrane. 


Vesicular Tubular Clusters Mediate Transport from the ER to the 
Golgi Apparatus 

After transport vesicles have budded from ER exit sites and have shed their coat, 
they begin to fuse with one another. The fusion of membranes from the same 
compartment is called homotypic fusion, to distinguish it from heterotypic fusion, 
in which a membrane from one compartment fuses with the membrane of a different 
compartment. As with heterotypic fusion, homotypic fusion requires a set 
of matching SNAREs. In this case, however, the interaction is symmetrical, with 
both membranes contributing v-SNAREs and t-SNAREs (Figure 13-23). 

The structures formed when ER-derived vesicles fuse with one another are 
called vesicular tubular clusters, because they have a convoluted appearance in 





Figure 13-23 Homotypic membrane 
fusion. In step 1, NSF pries apart identical 
pairs of v-SNAREs and t-SNAREs in both 
membranes (see Figure 13-20). In steps 
2 and 3, the separated matching SNAREs 
on adjacent identical membranes interact, 
which leads to membrane fusion and the 
formation of one continuous compartment. 
Subsequently, the compartment grows 
by further homotypic fusion with vesicles 
from the same kind of membrane, 
displaying matching SNAREs. Homotypic 
fusion occurs when ER-derived transport 
vesicles fuse with one another, but also 
when endosomes fuse to generate larger 
endosomes. Rab proteins help regulate the 
extent of homotypic fusion and hence the 
size of a cell’s compartments (not shown). 




Figure 13-24 Vesicular tubular clusters. (A) An electron micrograph of vesicular tubular 
clusters forming around an exit site. Many of the vesicle-like structures seen in the micrograph 
are cross sections of tubules that extend above and below the plane of this thin section and are 
interconnected. (B) Vesicular tubular clusters move along microtubules to carry proteins from the 
ER to the Golgi apparatus. COPI coats mediate the budding of vesicles that return to the 
ER from these clusters (and from the Golgi apparatus). (A, courtesy of William Balch.) 


the electron microscope (Figure 13-24A). These clusters constitute a compartment 
that is separate from the ER and lacks many of the proteins that function 
in the ER. They are generated continually and function as transport containers 
that bring material from the ER to the Golgi apparatus. The clusters move quickly 
along microtubules to the Golgi apparatus with which they fuse (Figure 13-24B 
and Movie 13.2). 

As soon as vesicular tubular clusters form, they begin to bud off transport vesicles 
of their own. Unlike the COPII-coated vesicles that bud from the ER, these 
vesicles are COPI-coated (see Figure 13-24A). COPI-coated vesicles are unique in 
that the components that make up the inner and outer coat layers are recruited as 
a preassembled complex, called coatomer. They function as a retrieval pathway, 
carrying back ER resident proteins that have escaped, as well as proteins such 
as cargo receptors and SNAREs that participated in the ER budding and vesicle 
fusion reactions. This retrieval process demonstrates the exquisite control mechanisms 
that regulate coat assembly reactions. The COPI coat assembly begins only 
seconds after the COPII coats have been shed, and it remains a mystery how this 
switch in coat assembly is controlled. 

The retrieval (or retrograde) transport continues as the vesicular tubular clusters 
move toward the Golgi apparatus. Thus, the clusters continuously mature, 
gradually changing their composition as selected proteins are returned to the ER. 
The retrieval continues from the Golgi apparatus, after the vesicular tubular clusters 
have delivered their cargo. 

The Retrieval Pathway to the ER Uses Sorting Signals 

The retrieval pathway for returning escaped proteins back to the ER depends on 
ER retrieval signals. Resident ER membrane proteins, for example, contain signals 
that bind directly to COPI coats and are thus packaged into COPI-coated transport 
vesicles for retrograde delivery to the ER. The best-characterized retrieval signal 
of this type consists of two lysines, followed by any two other amino acids, at the 
extreme C-terminal end of the ER membrane protein. It is called a KKXX sequence, 
based on the single-letter amino acid code. 
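
Because the signal is defined purely by position and residue identity, it can be expressed as a simple pattern over the single-letter code. The sketch below is illustrative only: the function name and the example sequences are hypothetical, and a pattern match says nothing about whether a real protein is actually retrieved.

```python
import re

# KKXX: two lysines followed by any two residues at the extreme C-terminus.
# \Z anchors the match to the very end of the sequence string.
KKXX_PATTERN = re.compile(r"KK[A-Z]{2}\Z")


def has_kkxx_signal(protein_sequence):
    """Return True if the sequence (single-letter code) ends in a KKXX motif."""
    return KKXX_PATTERN.search(protein_sequence.upper()) is not None
```

A made-up sequence such as `MAAAKKLE` matches, whereas `MKKLEAAA` does not, because the two lysines are not at the C-terminal end.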

Soluble ER resident proteins, such as BiP, also contain a short ER retrieval signal 
at their C-terminal end, but it is different: it consists of a Lys-Asp-Glu-Leu or 
a similar sequence. If this signal (called the KDEL sequence) is removed from BiP 
by genetic engineering, the protein is slowly secreted from the cell. If the signal 
is transferred to a protein that is normally secreted, the protein is now efficiently 
returned to the ER, where it accumulates. 
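
The two genetic-engineering experiments just described can be mimicked on sequence strings. This is a deliberately simplified sketch: the helper names and sequences are hypothetical, only the exact KDEL tetrapeptide is recognized (not the "similar" variants the text mentions), and the predicted fate ignores all other retention mechanisms.

```python
KDEL = "KDEL"


def has_kdel_signal(sequence):
    """True if the sequence (single-letter code) ends with the KDEL tetrapeptide."""
    return sequence.upper().endswith(KDEL)


def remove_kdel(sequence):
    """Model deleting the signal, as in the engineered BiP experiment."""
    return sequence[:-len(KDEL)] if has_kdel_signal(sequence) else sequence


def append_kdel(sequence):
    """Model transferring the signal onto a normally secreted protein."""
    return sequence if has_kdel_signal(sequence) else sequence + KDEL


def predicted_fate(sequence):
    """Crude prediction for a soluble protein that has left the ER."""
    return "retrieved to ER" if has_kdel_signal(sequence) else "secreted"
```

With a made-up resident protein `MAAAKDEL`, `predicted_fate(remove_kdel("MAAAKDEL"))` gives `"secreted"`, and adding the tag to a made-up secreted protein reverses the outcome.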

Unlike the retrieval signals on ER membrane proteins, which can interact 
directly with the COPI coat, soluble ER resident proteins must bind to specialized 
receptor proteins such as the KDEL receptor, a multipass transmembrane 
protein that binds to the KDEL sequence and packages any protein displaying it 
into COPI-coated retrograde transport vesicles (Figure 13-25). To accomplish this 
task, the KDEL receptor itself must cycle between the ER and the Golgi apparatus, 
and its affinity for the KDEL sequence must differ in these two compartments. 
The receptor must have a high affinity for the KDEL sequence in vesicular tubular 
clusters and the Golgi apparatus, so as to capture escaped, soluble ER resident 
proteins that are present there at low concentration. It must have a low affinity for 
the KDEL sequence in the ER, however, to unload its cargo in spite of the very high 
concentration of KDEL-containing soluble resident proteins in the ER. 

How does the affinity of the KDEL receptor change depending on the compartment 
in which it resides? The answer is likely related to the lower pH in the Golgi 
compartments, which is regulated by H⁺ pumps. As we discuss later, pH-sensitive 
protein-protein interactions form the basis for many of the protein sorting steps 
in the cell. 
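
Why the affinity has to change between compartments can be seen with a simple one-site equilibrium binding model. Everything numerical in this sketch is an assumption chosen for illustration: the concentrations and dissociation constants are invented, not measured values, and real receptor behavior also depends on kinetics that the model ignores.

```python
def fraction_bound(ligand_conc_molar, kd_molar):
    """Equilibrium occupancy of a receptor with a single binding site."""
    return ligand_conc_molar / (ligand_conc_molar + kd_molar)


# Hypothetical numbers (molar), chosen only to illustrate the logic.
GOLGI = {"kdel_protein_conc": 1e-6, "kd": 1e-7}  # little ligand, tight binding
ER = {"kdel_protein_conc": 1e-3, "kd": 1e-2}     # much ligand, weak binding

golgi_occupancy = fraction_bound(GOLGI["kdel_protein_conc"], GOLGI["kd"])
er_occupancy = fraction_bound(ER["kdel_protein_conc"], ER["kd"])

# If the receptor kept its Golgi affinity in the ER, it would stay saturated
# and could not unload its cargo.
er_occupancy_if_affinity_unchanged = fraction_bound(
    ER["kdel_protein_conc"], GOLGI["kd"]
)
```

Under these assumed values the receptor is mostly occupied in the Golgi and mostly empty in the ER, but it would remain almost fully occupied in the ER if its affinity did not drop.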

Most membrane proteins that function at the interface between the ER and 
Golgi apparatus, including v- and t-SNAREs and some cargo receptors, also enter 
the retrieval pathway back to the ER. 

Many Proteins Are Selectively Retained in the Compartments in 
Which They Function 

The KDEL retrieval pathway only partly explains how ER resident proteins are 
maintained in the ER. As mentioned, cells that express genetically modified 
ER resident proteins, from which the KDEL sequence has been experimentally 
removed, secrete these proteins. But the rate of secretion is much slower than for a 
normal secretory protein. It seems that a mechanism that is independent of their 
KDEL signal normally retains ER resident proteins and that only those proteins 
that escape this retention mechanism are captured and returned via the KDEL 
receptor. A suggested retention mechanism is that ER resident proteins bind to 
one another, thus forming complexes that are too big to enter transport vesicles 
efficiently. Because ER resident proteins are present in the ER at very high concentrations 
(estimated to be millimolar), relatively low-affinity interactions would 
suffice to retain most of the proteins in such complexes. 
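
As a rough illustration of why weak interactions can suffice, assume a simple one-site equilibrium in which a resident protein binds partners present at free concentration C with dissociation constant K_d. Both the model and the value of K_d below are assumptions made for the estimate, not measurements.

```latex
f_{\text{bound}} = \frac{C}{C + K_d}, \qquad
C = 1\ \text{mM},\quad K_d = 0.1\ \text{mM}
\;\Rightarrow\;
f_{\text{bound}} = \frac{1}{1.1} \approx 0.91
```

With the same assumed K_d but a partner concentration of only 1 µM, the bound fraction would fall to about 1%, which is why the very high concentration in the ER matters.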

Aggregation of proteins that function in the same compartment is a general 
mechanism that compartments use to organize and retain their resident proteins. 
Golgi enzymes that function together, for example, also bind to each other and are 
thereby restrained from entering transport vesicles leaving the Golgi apparatus. 


Figure 13-25 Retrieval of soluble ER 
resident proteins. ER resident proteins 
that escape from the ER are returned by 
vesicle transport. (A) The KDEL receptor 
present in both vesicular tubular clusters 
and the Golgi apparatus captures the 
soluble ER resident proteins and carries 
them in COPI-coated transport vesicles 
back to the ER. (Recall that the COPI- 
coated vesicles shed their coats as soon as 
they are formed.) Upon binding its ligands 
in the tubular cluster or Golgi, the KDEL 
receptor may change conformation, so 
as to facilitate its recruitment into budding 
COPI-coated vesicles. (B) The retrieval 
of ER proteins begins in vesicular tubular 
clusters and continues from later parts of 
the Golgi apparatus. In the environment of 
the ER, the ER resident proteins dissociate 
from the KDEL receptor, which is then 
returned to the Golgi apparatus for reuse. 
We discuss the different compartments of 
the Golgi apparatus shortly. 

Figure 13-26 The Golgi apparatus. 
(A) Three-dimensional reconstruction 
from electron micrographs of the Golgi 
apparatus in a secretory animal cell. The 
cis face of the Golgi stack is that closest 
to the ER. (B) A thin-section electron 
micrograph of an animal cell. In plant 
cells, the Golgi apparatus is generally 
more distinct and more clearly separated 
from other intracellular membranes than 
in animal cells. (A, redrawn from 
A. Rambourg and Y. Clermont, Eur. J. Cell 
Biol. 51:189-200, 1990. With permission 
from Wissenschaftliche Verlagsgesellschaft; 
B, courtesy of Brij J. Gupta.) 


The Golgi Apparatus Consists of an Ordered Series of 
Compartments 

Because it could be selectively visualized by silver stains, the Golgi apparatus was 
one of the first organelles described by early light microscopists. It consists of a 
collection of flattened, membrane-enclosed compartments called cisternae, that 
somewhat resemble a stack of pita breads. Each Golgi stack typically consists of 
four to six cisternae (Figure 13-26), although some unicellular flagellates can have 
more than 20. In animal cells, tubular connections between corresponding cisternae 
link many stacks, thus forming a single complex, which is usually located near 
the cell nucleus and close to the centrosome (Figure 13-27A). This localization 
depends on microtubules. If microtubules are experimentally depolymerized, the 
Golgi apparatus reorganizes into individual stacks that are found throughout the 
cytoplasm, adjacent to ER exit sites. Some cells, including most plant cells, have 
hundreds of individual Golgi stacks dispersed throughout the cytoplasm where 
they are typically found adjacent to ER exit sites (Figure 13-27B). 



Figure 13-27 Localization of the Golgi 
apparatus in animal and plant cells. 
(A) The Golgi apparatus in a cultured 
fibroblast stained with a fluorescent 
antibody that recognizes a Golgi resident 
protein (bright orange). The Golgi apparatus 
is polarized, facing the direction in which 
the cell was crawling before fixation. 
(B) The Golgi apparatus in a plant cell that 
is expressing a fusion protein consisting 
of a resident Golgi enzyme fused to green 
fluorescent protein. (A, courtesy of John 
Henley and Mark McNiven; B, courtesy of 
Chris Hawes.) 

During their passage through the Golgi apparatus, transported molecules 
undergo an ordered series of covalent modifications. Each Golgi stack has two 
distinct faces: a cis face (or entry face) and a trans face (or exit face). Both cis and 
trans faces are closely associated with special compartments, each composed of a 
network of interconnected tubular and cisternal structures: the cis Golgi network 
(CGN) and the trans Golgi network (TGN), respectively. The CGN is a collection 
of fused vesicular tubular clusters arriving from the ER. Proteins and lipids enter 
the cis Golgi network and exit from the trans Golgi network, bound for the cell 
surface or another compartment. Both networks are important for protein sorting: 
proteins entering the CGN can either move onward in the Golgi apparatus or 
be returned to the ER. Similarly, proteins exiting from the TGN move onward and 
are sorted according to their next destination: endosomes, secretory vesicles, or 
the cell surface. They also can be returned to an earlier compartment. Some membrane 
proteins are retained in the part of the Golgi apparatus where they function. 

As described in Chapter 12, a single species of N-linked oligosaccharide is 
attached en bloc to many proteins in the ER and then trimmed while the protein 
is still in the ER. The oligosaccharide intermediates created by the trimming 
reactions serve to help proteins fold and to help transport misfolded proteins to 
the cytosol for degradation in proteasomes. Thus, they play an important role in 
controlling the quality of proteins exiting from the ER. Once these ER functions 
have been fulfilled, the cell reutilizes the oligosaccharides for new functions. This 
begins in the Golgi apparatus, which generates the heterogeneous oligosaccharide 
structures seen in mature proteins. After arrival in the CGN, proteins enter 
the first of the Golgi processing compartments (the cis Golgi cisternae). They then 
move to the next compartment (the medial cisternae) and finally to the trans 
cisternae, where glycosylation is completed. The lumen of the trans cisternae is 
thought to be continuous with the TGN, the place where proteins are segregated 
into different transport packages and dispatched to their final destinations. 

The oligosaccharide processing steps occur in an organized sequence in the 
Golgi stack, with each cisterna containing a characteristic mixture of processing 
enzymes. Proteins are modified in successive stages as they move from cisterna 
to cisterna across the stack, so that the stack forms a multistage processing unit. 

Investigators discovered the functional differences between the cis, medial, 
and trans subdivisions of the Golgi apparatus by localizing the enzymes involved 
in processing N-linked oligosaccharides in distinct regions of the organelle, both 
by physical fractionation of the organelle and by labeling the enzymes in electron 
microscope sections with antibodies (Figure 13-28). The removal of mannose 
and the addition of N-acetylglucosamine, for example, occur in the cis and medial 
cisternae, while the addition of galactose and sialic acid occurs in the trans cisterna 
and trans Golgi network. Figure 13-29 summarizes the functional compartmentalization 
of the Golgi apparatus. 
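
The idea of the stack as a multistage processing unit can be sketched as an ordered pipeline. The model below uses only the example steps named in the text and groups them into the two stages the text distinguishes; the function and variable names are hypothetical, and the strict stage boundaries are a simplification, since enzyme distributions are graded rather than sharply compartmentalized.

```python
# Ordered stages and the example processing steps assigned to each.
GOLGI_STAGES = [
    ("cis and medial cisternae",
     ("remove mannose", "add N-acetylglucosamine")),
    ("trans cisterna and trans Golgi network",
     ("add galactose", "add sialic acid")),
]


def process_through_golgi(protein_name):
    """Return the ordered log of modifications as a protein crosses the stack."""
    log = []
    for compartment, steps in GOLGI_STAGES:
        for step in steps:
            log.append(f"{protein_name}: {step} ({compartment})")
    return log
```

Calling `process_through_golgi("cargo")` returns four entries in order, which reflects the point that later modifications happen only after the protein has passed through the earlier compartments.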

Oligosaccharide Chains Are Processed in the Golgi Apparatus 

Whereas the ER lumen is full of soluble lumenal resident proteins and enzymes, 
the resident proteins in the Golgi apparatus are all membrane bound, as the enzymatic 
reactions apparently occur entirely on membrane surfaces. All of the Golgi 
glycosidases and glycosyl transferases, for example, are single-pass transmembrane 
proteins, many of which are organized in multienzyme complexes. 


Figure 13-28 Molecular compartmentalization of the Golgi apparatus. 
A series of electron micrographs shows the Golgi apparatus (A) unstained, 
(B) stained with osmium, which preferentially labels the cisternae of the 
cis compartment, and (C and D) stained to reveal the location of specific 
enzymes. Nucleoside diphosphatase is found in the trans Golgi cisternae 
(C), while acid phosphatase is found in the trans Golgi network (D). Note that 
usually more than one cisterna is stained. The enzymes are therefore thought 
to be highly enriched rather than precisely localized to a specific cisterna. 
(Courtesy of Daniel S. Friend.) 

Figure 13-29 Oligosaccharide 
processing in Golgi compartments. 
The localization of each processing step 
shown was determined by a combination 
of techniques, including biochemical 
subfractionation of the Golgi apparatus 
membranes and electron microscopy after 
staining with antibodies specific for some 
of the processing enzymes. Processing 
enzymes are not restricted to a particular 
cisterna; instead, their distribution is graded 
across the stack, such that early-acting 
enzymes are present mostly in the cis Golgi 
cisternae and later-acting enzymes are 
mostly in the trans Golgi cisternae. Man, 
mannose; GlcNAc, N-acetylglucosamine; 
Gal, galactose; NANA, N-acetylneuraminic 
acid (sialic acid). 


Two broad classes of N-linked oligosaccharides, the complex oligosaccharides 
and the high-mannose oligosaccharides, are attached to mammalian glycoproteins. 
Sometimes, both types are attached (in different places) to the same 
polypeptide chain. Complex oligosaccharides are generated when the original 
N-linked oligosaccharide added in the ER is trimmed and further sugars are 
added; by contrast, high-mannose oligosaccharides are trimmed but have no new 
sugars added to them in the Golgi apparatus (Figure 13-30). The sialic acids in the 

Figure 13-30 The two main classes of asparagine-linked (N-linked) oligosaccharides found
in mature mammalian glycoproteins. (A) Both complex oligosaccharides and high-mannose
oligosaccharides share a common core region derived from the original N-linked oligosaccharide
added in the ER (see Figure 12-50) and typically containing two N-acetylglucosamines (GlcNAc)
and three mannoses (Man). (B) Each complex oligosaccharide consists of a core region, together
with a terminal region that contains a variable number of copies of a special trisaccharide unit
(N-acetylglucosamine-galactose-sialic acid) linked to the core mannoses. Frequently, the terminal
region is truncated and contains only GlcNAc and galactose (Gal) or just GlcNAc. In addition,
a fucose may be added, usually to the core GlcNAc attached to the asparagine (Asn). Thus,
although the steps of processing and subsequent sugar addition are rigidly ordered, complex
oligosaccharides can be heterogeneous. Moreover, although the complex oligosaccharide
shown has three terminal branches, two and four branches are also common, depending on the
glycoprotein and the cell in which it is made. (C) High-mannose oligosaccharides are not trimmed
back all the way to the core region and contain additional mannoses. Hybrid oligosaccharides with
one Man branch and one GlcNAc and Gal branch are also found (not shown).

The three amino acids indicated in (A) constitute the sequence recognized by the oligosaccharyl
transferase enzyme that adds the initial oligosaccharide to the protein. Ser, serine; Thr, threonine;
X, any amino acid, except proline.

Figure 13-31 Oligosaccharide processing in the ER and the Golgi apparatus. The processing pathway is highly ordered,
so that each step shown depends on the previous one. Step 1: Processing begins in the ER with the removal of the glucoses
from the oligosaccharide initially transferred to the protein. Then a mannosidase in the ER membrane removes a specific
mannose. The remaining steps occur in the Golgi stack. Step 2: Golgi mannosidase I removes three more mannoses. Step
3: N-acetylglucosamine transferase I then adds an N-acetylglucosamine. Step 4: Mannosidase II then removes two additional
mannoses. This yields the final core of three mannoses that is present in a complex oligosaccharide. At this stage, the bond
between the two N-acetylglucosamines in the core becomes resistant to attack by a highly specific endoglycosidase (Endo
H). Since all later structures in the pathway are also Endo H-resistant, treatment with this enzyme is widely used to distinguish
complex from high-mannose oligosaccharides. Step 5: Finally, as shown in Figure 13-30, additional N-acetylglucosamines,
galactoses, and sialic acids are added. These final steps in the synthesis of a complex oligosaccharide occur in the cisternal
compartments of the Golgi apparatus: three types of glycosyl transferase enzymes act sequentially, using sugar substrates
that have been activated by linkage to the indicated nucleotide; the membranes of the Golgi cisternae contain specific carrier
proteins that allow each sugar nucleotide to enter in exchange for the nucleoside phosphates that are released after the sugar is
attached to the protein on the lumenal face.

Note that, as a biosynthetic organelle, the Golgi apparatus differs from the ER: all sugars in the Golgi are assembled inside the
lumen from sugar nucleotides, whereas in the ER, the N-linked precursor oligosaccharide is assembled partly in the cytosol and
partly in the lumen, and most lumenal reactions use dolichol-linked sugars as their substrates (see Figure 12-51).

complex oligosaccharides are of special importance because they bear a negative
charge. Whether a given oligosaccharide remains high-mannose or is processed
depends largely on its position in the protein. If the oligosaccharide is accessible
to the processing enzymes in the Golgi apparatus, it is likely to be converted to a
complex form; if it is inaccessible because its sugars are tightly held to the
protein's surface, it is likely to remain in a high-mannose form. The processing that
generates complex oligosaccharide chains follows the highly ordered pathway
shown in Figure 13-31.
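
The ordered trimming and addition steps summarized in Figure 13-31 can be tallied as simple bookkeeping. The sketch below is illustrative only. It assumes the standard composition of the 14-sugar precursor (3 glucoses, 9 mannoses, and 2 N-acetylglucosamines), and the names `PRECURSOR`, `STEPS`, and `apply_steps` are hypothetical labels chosen for this example, not terms from the text.

```python
from collections import Counter

# Assumed composition of the 14-sugar precursor transferred to proteins in the ER.
PRECURSOR = Counter({"Glc": 3, "Man": 9, "GlcNAc": 2})

# Each step: (description, sugar affected, change in count), following Figure 13-31.
STEPS = [
    ("ER: glucoses removed", "Glc", -3),
    ("ER: mannosidase removes one mannose", "Man", -1),
    ("Golgi: mannosidase I removes three mannoses", "Man", -3),
    ("Golgi: GlcNAc transferase I adds one GlcNAc", "GlcNAc", +1),
    ("Golgi: mannosidase II removes two mannoses", "Man", -2),
]


def apply_steps(start, steps):
    """Return a list of (description, composition) after each step, in order."""
    current = dict(start)
    history = []
    for description, sugar, delta in steps:
        current[sugar] += delta
        if current[sugar] < 0:
            raise ValueError(f"cannot remove more {sugar} than is present")
        history.append((description, dict(current)))
    return history


history = apply_steps(PRECURSOR, STEPS)
for description, composition in history:
    print(f"{description}: {composition}")
```

Under these assumptions the tally ends with three mannoses and three N-acetylglucosamines, which matches the three-mannose core described for complex oligosaccharides before the terminal sugars are added in step 5.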

Beyond these commonalities in oligosaccharide processing that are shared
among most cells, the products of the carbohydrate modifications carried out in
the Golgi apparatus are highly complex and have given rise to a new field of study
called glycobiology. The human genome, for example, encodes hundreds of different
Golgi glycosyl transferases and many glycosidases, which are expressed differently
from one cell type to another, resulting in a variety of glycosylated forms
of a given protein or lipid in different cell types and at varying stages of
differentiation, depending on the spectrum of enzymes expressed by the cell. The
complexity of modifications is not limited to N-linked oligosaccharides but also
occurs on O-linked sugars, as we discuss next.

Proteoglycans Are Assembled in the Golgi Apparatus 

In addition to the N-linked oligosaccharide alterations made to proteins as they
pass through the Golgi cisternae en route from the ER to their final destinations,
many proteins are also modified in the Golgi apparatus in other ways. Some proteins
have sugars added to the hydroxyl groups of selected serines or threonines,
or, in some cases (such as collagens), to hydroxylated proline and lysine side
chains. This O-linked glycosylation (Figure 13-32), like the extension of N-linked
oligosaccharide chains, is catalyzed by a series of glycosyl transferase enzymes
that use the sugar nucleotides in the lumen of the Golgi apparatus to add sugars
to a protein one at a time. Usually, N-acetylgalactosamine is added first, followed
by a variable number of additional sugars, ranging from just a few to 10 or more.

The Golgi apparatus confers the heaviest O-linked glycosylation of all on
mucins, the glycoproteins in mucus secretions, and on proteoglycan core proteins,
which it modifies to produce proteoglycans. As discussed in Chapter 19,
this process involves the polymerization of one or more glycosaminoglycan chains
(long, unbranched polymers composed of repeating disaccharide units; see
Figure 19-35) onto serines on a core protein. Many proteoglycans are secreted and
become components of the extracellular matrix, while others remain anchored
to the extracellular face of the plasma membrane. Still others form a major
component of slimy materials, such as the mucus that is secreted to form a protective
coating on the surface of many epithelia.

The sugars incorporated into glycosaminoglycans are heavily sulfated in the
Golgi apparatus immediately after these polymers are made, thus adding a
significant portion of their characteristically large negative charge. Some tyrosines
in proteins also become sulfated shortly before they exit from the Golgi apparatus.
In both cases, the sulfation depends on the sulfate donor
3'-phosphoadenosine-5'-phosphosulfate (PAPS) (Figure 13-33), which is transported
from the cytosol into the lumen of the trans Golgi network.

What Is the Purpose of Glycosylation? 

There is an important difference between the construction of an oligosaccharide
and the synthesis of other macromolecules such as DNA, RNA, and protein.
Whereas nucleic acids and proteins are copied from a template in a repeated
series of identical steps using the same enzyme or set of enzymes, complex
carbohydrates require a different enzyme at each step, each product being recognized
as the exclusive substrate for the next enzyme in the series. The vast abundance
of glycoproteins and the complicated pathways that have evolved to synthesize
them emphasize that the oligosaccharides on glycoproteins and glycosphingolipids
have very important functions.

N-linked glycosylation, for example, is prevalent in all eukaryotes, including
yeasts. N-linked oligosaccharides also occur in a very similar form in archaeal
cell wall proteins, suggesting that the whole machinery required for their
synthesis is evolutionarily ancient. N-linked glycosylation promotes protein folding in
two ways. First, it has a direct role in making folding intermediates more soluble,
thereby preventing their aggregation. Second, the sequential modifications of the
N-linked oligosaccharide establish a “glyco-code” that marks the progression of


Figure 13-32 N- and O-linked 
glycosylation. In each case, only the single 
sugar group that is directly attached to the 
protein chain is shown. 

Figure 13-33 The structure of PAPS.

protein folding and mediates the binding of the protein to chaperones (discussed
in Chapter 12) and lectins, for example in guiding ER-to-Golgi transport. As we
discuss later, lectins also participate in protein sorting in the trans Golgi network.

Because chains of sugars have limited flexibility, even a small N-linked
oligosaccharide protruding from the surface of a glycoprotein (Figure 13-34) can
limit the approach of other macromolecules to the protein surface. In this way,
for example, the presence of oligosaccharides tends to make a glycoprotein more
resistant to digestion by proteolytic enzymes. It may be that the oligosaccharides
on cell-surface proteins originally provided an ancestral cell with a protective
coat; compared to the rigid bacterial cell wall, such a sugar coat has the advantage
that it leaves the cell with the freedom to change shape and move.

The sugar chains have since become modified to serve other purposes as
well. The mucus coat of lung and intestinal cells, for example, protects against
many pathogens. The recognition of sugar chains by lectins in the extracellular
space is important in many developmental processes and in cell-cell recognition:
selectins, for example, are transmembrane lectins that function in cell-cell
adhesion during blood cell migration, as discussed in Chapter 19. The presence
of oligosaccharides may modify a protein's antigenic and functional properties,
making glycosylation an important factor in the production of proteins for
pharmaceutical purposes.

Glycosylation can also have important regulatory roles. Signaling through
the cell-surface signaling receptor Notch, for example, is an important factor in
determining the cell's fate in development (discussed in Chapter 21). Notch is a
transmembrane protein that is O-glycosylated by addition of a single fucose to
some serines, threonines, and hydroxylysines. Some cell types express an additional
glycosyl transferase that adds an N-acetylglucosamine to each of these
fucoses in the Golgi apparatus. This addition changes the specificity of Notch for
the cell-surface signal proteins that activate it.

Transport Through the Golgi Apparatus May Occur by Cisternal 
Maturation 

It is still uncertain how the Golgi apparatus achieves and maintains its polarized
structure and how molecules move from one cisterna to another, and it is likely
that more than one mechanism is involved in each case. One hypothesis, called
the cisternal maturation model, views the Golgi cisternae as dynamic structures
that mature from early to late by acquiring and then losing specific Golgi-resident
proteins. According to this view, new cis cisternae continually form as vesicular
tubular clusters arrive from the ER and progressively mature to become a medial
cisterna and then a trans cisterna (Figure 13-35A). A cisterna therefore moves
through the Golgi stack with cargo in its lumen. Retrograde transport of the Golgi
enzymes by budding COPI-coated vesicles explains their characteristic distribution.
As we discuss later, when a cisterna finally moves forward to become part of
the trans Golgi network, various types of coated vesicles bud off it until this
network disappears, to be replaced by a maturing cisterna just behind. At the same
time, other transport vesicles are continually retrieving membrane from post-Golgi
compartments and returning it to the trans Golgi network.

The cisternal maturation model is supported by studies using Golgi enzymes
from different cisternae that were fluorescently labeled with different colors. Such
studies, performed in yeast cells where Golgi cisternae are not stacked, reveal that
individual Golgi cisternae change their color, thereby demonstrating that they
change their complement of resident enzymes as they mature, even though they
are not stacked. In further support of the model, electron microscopic observations
found that large structures such as procollagen rods in fibroblasts and scales
in certain algae move progressively through the Golgi stack.

An alternative view holds that Golgi cisternae are long-lived structures that
retain their characteristic set of Golgi-resident proteins firmly in place, and cargo
proteins are transported from one cisterna to the next by transport vesicles
(Figure 13-35B). According to this vesicle transport model, retrograde flow of vesicles

Figure 13-34 The three-dimensional structure of a small N-linked
oligosaccharide. The structure was determined by x-ray crystallographic
analysis of a glycoprotein. This oligosaccharide contains only 6 sugars,
whereas there are 14 sugars in the N-linked oligosaccharide that is initially
transferred to proteins in the ER (see Figure 12-47).
(A) A backbone model showing all atoms except hydrogens; only the asparagine
of the protein is shown. (B) A space-filling model, with the asparagine and
sugars indicated using the same color scheme as in (A).
(B, courtesy of Richard Feldmann.)

Figure 13-35 Two possible models explaining the organization of the Golgi apparatus and how proteins move through it.
It is likely that the transport through the Golgi apparatus in the forward direction (red arrows) involves elements of both models.
(A) According to the cisternal maturation model, each Golgi cisterna matures as it migrates outward through the stack. At each
stage, the Golgi resident proteins that are carried forward in a maturing cisterna are moved backward to an earlier compartment
in COPI-coated vesicles. When a newly formed cisterna moves to a medial position, for example, “leftover” cis Golgi enzymes
would be extracted and transported retrogradely to a new cis cisterna behind. Likewise, the medial enzymes would be received
by retrograde transport from the cisternae just ahead. In this way, a cis cisterna would mature to a medial and then trans cisterna
as it moves outward. (B) In the vesicle transport model, Golgi cisternae are static compartments, which contain a characteristic
complement of resident enzymes. The passing of molecules from cis to trans through the Golgi is accomplished by
forward-moving transport vesicles, which bud from one cisterna and fuse with the next in a cis-to-trans direction.


retrieves escaped ER and Golgi proteins and returns them to upstream compartments.
Directional flow could be achieved because forward-moving cargo molecules
are selectively packaged into forward-moving vesicles. Although both
forward- and backward-moving vesicles would likely be COPI-coated, the coats
may contain different adaptor proteins that confer selectivity on the packaging of
cargo molecules. Alternatively, transport vesicles shuttling between Golgi cisternae
might not be directional at all, transporting cargo randomly back and forth;
directional flow would then occur because of the continual input to the cis cisterna
and output from the trans cisterna.

The vesicle transport model is supported by experiments that show that cargo
molecules are present in small COPI-coated vesicles and that these vesicles can
deliver them to Golgi cisternae over large distances. In addition, when
experimentally aggregated membrane proteins are introduced into Golgi cisternae,
they can be observed staying in place, while soluble cargo, even if present as large
aggregates, traverses the Golgi at normal rates.

It is likely that aspects of both models are true. A stable core of long-lasting
cisternae might exist in the center of each Golgi cisterna, while regions at the rim
may undergo continuous maturation, perhaps utilizing Rab cascades that change
their identity. As matured pieces of the cisternae are formed, they might break
off and fuse with downstream cisternae by homotypic fusion mechanisms, taking
large cargo molecules with them. In addition, small COPI-coated vesicles might
transport small cargo in the forward direction and retrieve escaped Golgi enzymes
and return them to their appropriate upstream cisternae.

Golgi Matrix Proteins Help Organize the Stack 

The unique architecture of the Golgi apparatus depends on both the microtubule
cytoskeleton, as already mentioned, and cytoplasmic Golgi matrix proteins, which
form a scaffold between adjacent cisternae and give the Golgi stack its structural
integrity. Some of the matrix proteins, called golgins, form long tethers composed
of stiff coiled-coil domains with interspersed hinge regions. Golgins form a forest
of tentacles that can extend 100-400 nm from the surface of the Golgi stack. They
are thought to help retain Golgi transport vesicles close to the organelle through
interactions with Rab proteins (Figure 13-36). When the cell prepares to divide,
mitotic protein kinases phosphorylate the Golgi matrix proteins, causing the Golgi



Figure 13-36 A model of golgin function. 

Filamentous golgins anchored to Golgi 
membranes capture transport vesicles 
by binding to Rab proteins on the vesicle 
surface. 

apparatus to fragment and disperse throughout the cytosol. The Golgi fragments
are then distributed evenly to the two daughter cells, where the matrix proteins
are dephosphorylated, leading to the reassembly of the Golgi stack. Similarly,
during apoptosis, proteolytic cleavage of golgins by caspases (discussed in
Chapter 18) fragments the Golgi apparatus as the cell self-destructs.


Summary 

Correctly folded and assembled proteins in the ER are packaged into COPII-coated
transport vesicles that pinch off from the ER membrane. Shortly thereafter, the
vesicles shed their coat and fuse with one another to form vesicular tubular clusters. In
animal cells, the clusters then move on microtubule tracks to the Golgi apparatus,
where they fuse with one another to form the cis Golgi network. Any resident ER
proteins that escape from the ER are returned there from the vesicular tubular
clusters and Golgi apparatus by retrograde transport in COPI-coated vesicles.

The Golgi apparatus, unlike the ER, contains many sugar nucleotides, which
glycosyl transferase enzymes use to glycosylate lipid and protein molecules as they
pass through the Golgi apparatus. The mannoses on the N-linked oligosaccharides
that are added to proteins in the ER are often initially removed, and further sugars
are added. Moreover, the Golgi apparatus is the site where O-linked glycosylation
occurs and where glycosaminoglycan chains are added to core proteins to form
proteoglycans. Sulfation of the sugars in proteoglycans and of selected tyrosines on
proteins also occurs in a late Golgi compartment.

The Golgi apparatus modifies the many proteins and lipids that it receives from
the ER and then distributes them to the plasma membrane, lysosomes, and secretory
vesicles. The Golgi apparatus is a polarized organelle, consisting of one or more
stacks of disc-shaped cisternae. Each stack is organized as a series of at least three
functionally distinct compartments, termed cis, medial, and trans cisternae. The
cis and trans cisternae are each connected to special sorting stations, called the cis
Golgi network and the trans Golgi network, respectively. Proteins and lipids move
through the Golgi stack in the cis-to-trans direction. This movement may occur by
vesicle transport, by progressive maturation of the cis cisternae as they migrate
continuously through the stack, or, most likely, by a combination of these two
mechanisms. Continual retrograde vesicle transport from downstream to more upstream
cisternae is thought to keep the enzymes concentrated in the cisternae where they
are needed. The finished new proteins end up in the trans Golgi network, which
packages them in transport vesicles and dispatches them to their specific
destinations in the cell.


TRANSPORT FROM THE TRANS GOLGI NETWORK 
TO LYSOSOMES 

The trans Golgi network sorts all of the proteins that pass through the Golgi
apparatus (except those that are retained there as permanent residents) according to
their final destination. The sorting mechanism is especially well understood for
those proteins destined for the lumen of lysosomes, and in this section we
consider this selective transport process. We begin with a brief account of lysosome
structure and function.

Lysosomes Are the Principal Sites of Intracellular Digestion 

Lysosomes are membrane-enclosed organelles filled with soluble hydrolytic
enzymes that digest macromolecules. Lysosomes contain about 40 types of
hydrolytic enzymes, including proteases, nucleases, glycosidases, lipases,
phospholipases, phosphatases, and sulfatases. All are acid hydrolases; that is,
hydrolases that work best at acidic pH. For optimal activity, they need to be activated
by proteolytic cleavage, which also requires an acid environment. The lysosome
provides this acidity, maintaining an interior pH of about 4.5-5.0. By this
arrangement, the contents of the cytosol are doubly protected against attack by the cell's
own digestive system: the membrane of the lysosome keeps the digestive enzymes
out of the cytosol, but, even if they leak out, they can do little damage at the
cytosolic pH of about 7.2.
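
The size of the pH difference can be made concrete with a little arithmetic. Because pH is the negative base-10 logarithm of the H+ concentration, the sketch below estimates how much more concentrated H+ is in the lysosome lumen than in the cytosol. It uses only the approximate pH values quoted above; the function and constant names are hypothetical and chosen for illustration.

```python
def h_concentration(ph):
    """Molar H+ concentration for a given pH, using pH = -log10([H+])."""
    return 10.0 ** (-ph)


CYTOSOL_PH = 7.2                 # approximate cytosolic pH quoted in the text
LYSOSOME_PH_RANGE = (4.5, 5.0)   # approximate lysosomal pH range quoted in the text

for lysosome_ph in LYSOSOME_PH_RANGE:
    # Ratio of lumenal to cytosolic H+ concentration at this lysosomal pH.
    ratio = h_concentration(lysosome_ph) / h_concentration(CYTOSOL_PH)
    print(f"lysosome pH {lysosome_ph}: about {ratio:.0f}-fold more H+ than the cytosol")
```

With these values the lumen holds roughly 150 to 500 times more H+ than the cytosol, which gives a sense of how far from their acidic optimum the hydrolases would be if they leaked out.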

Like all other membrane-enclosed organelles, the lysosome not only contains
a unique collection of enzymes, but also has a unique surrounding membrane.
Most of the lysosome membrane proteins, for example, are highly glycosylated,
which helps to protect them from the lysosome proteases in the lumen. Transport
proteins in the lysosome membrane carry the final products of the digestion of
macromolecules (such as amino acids, sugars, and nucleotides) to the cytosol,
where the cell can either reuse or excrete them.

A vacuolar H+ ATPase in the lysosome membrane uses the energy of ATP
hydrolysis to pump H+ into the lysosome, thereby maintaining the lumen at its
acidic pH (Figure 13-37). The lysosome H+ pump belongs to the family of V-type
ATPases and has a similar architecture to the mitochondrial and chloroplast ATP
synthases (F-type ATPases), which convert the energy stored in H+ gradients into
ATP (see Figure 11-12). By contrast to these enzymes, however, the vacuolar H+
ATPase exclusively works in reverse, pumping H+ into the organelle. Similar or
identical V-type ATPases acidify all endocytic and exocytic organelles, including
lysosomes, endosomes, some compartments of the Golgi apparatus, and many
transport and secretory vesicles. In addition to providing a low-pH environment
that is suitable for reactions occurring in the organelle lumen, the H+ gradient
provides a source of energy that drives the transport of small metabolites across
the organelle membrane.

Lysosomes Are Heterogeneous 

Lysosomes are found in all eukaryotic cells. They were initially discovered by the
biochemical fractionation of cell extracts; only later were they seen clearly in the
electron microscope. Although lysosomes are extraordinarily diverse in shape and
size, staining them with specific antibodies shows that they are members of a single
family of organelles. They can also be identified by histochemical techniques that
reveal which organelles contain acid hydrolase (Figure 13-38).

The heterogeneous morphology of lysosomes contrasts with the relatively uniform
structures of many other cell organelles. The diversity reflects the wide variety
of digestive functions that acid hydrolases mediate, including the breakdown of
intra- and extracellular debris, the destruction of phagocytosed microorganisms,
and the production of nutrients for the cell. Their morphological diversity,
however, also reflects the way lysosomes form. Late endosomes containing material
received from both the plasma membrane by endocytosis and newly synthesized
lysosomal hydrolases fuse with preexisting lysosomes to form structures that are
sometimes referred to as endolysosomes, which then fuse with one another
(Figure 13-39). When the majority of the endocytosed material within an endolysosome
has been digested so that only resistant or slowly digestible residues remain,
these organelles become “classical” lysosomes. These are relatively dense, round,
and small, but they can enter the cycle again by fusing with late endosomes or
endolysosomes. Thus, there is no real distinction between endolysosomes and
lysosomes: they are the same except that they are in different stages of a maturation
cycle. For this reason, lysosomes are sometimes viewed as a heterogeneous
collection of distinct organelles, the common feature of which is a high content of

Figure 13-38 Histochemical visualization of lysosomes. These electron
micrographs show two sections of a cell stained to reveal the location of
acid phosphatase, a marker enzyme for lysosomes. The larger membrane-enclosed
organelles, containing dense precipitates of lead phosphate, are
lysosomes. Their diverse morphology reflects variations in the amount and
nature of the material they are digesting. The precipitates are produced when
tissue fixed with glutaraldehyde (to fix the enzyme in place) is incubated with
a phosphatase substrate in the presence of lead ions. Red arrows in the top
panel indicate two small vesicles thought to be carrying acid hydrolases from
the Golgi apparatus. (Courtesy of Daniel S. Friend.)

Figure 13-37 Lysosomes. The acid hydrolases are hydrolytic enzymes that
are active under acidic conditions. An H+ ATPase in the membrane pumps H+ into
the lysosome, maintaining its lumen at an acidic pH.


















724 Chapter 13: Intracellular Membrane Traffic 



Figure 13-39 A model for lysosome maturation. Late endosomes fuse with
preexisting lysosomes (bottom arrow) or preexisting endolysosomes (top arrow).
Endolysosomes eventually mature into lysosomes as hydrolases complete the
digestion of their contents, which can include intralumenal vesicles.
Lysosomes also fuse with phagosomes, as we discuss later.
[Figure label: hydrolytically active, low-pH compartments]


hydrolytic enzymes. It is especially hard to apply a narrower definition than this in
plant cells, as we discuss next.


Plant and Fungal Vacuoles Are Remarkably Versatile Lysosomes 

Most plant and fungal cells (including yeasts) contain one or several very large,
fluid-filled vesicles called vacuoles. They typically occupy more than 30% of the
cell volume, and as much as 90% in some cell types (Figure 13-40). Vacuoles are
related to animal cell lysosomes and contain a variety of hydrolytic enzymes, but
their functions are remarkably diverse. The plant vacuole can act as a storage
organelle for both nutrients and waste products, as a degradative compartment,
as an economical way of increasing cell size, and as a controller of turgor pressure
(the osmotic pressure that pushes outward on the cell wall and keeps the plant
from wilting) (Figure 13-41). The same cell may have different vacuoles with
distinct functions, such as digestion and storage.

The vacuole is important as a homeostatic device, enabling plant cells to
withstand wide variations in their environment. When the pH in the environment
drops, for example, the flux of H+ into the cytosol is balanced, at least in part, by
an increased transport of H+ into the vacuole, which tends to keep the pH in the
cytosol constant. Similarly, many plant cells maintain an almost constant turgor
pressure despite large changes in the tonicity of the fluid in their immediate
environment. They do so by changing the osmotic pressure of the cytosol and
vacuole, in part by the controlled breakdown and resynthesis of polymers such as
polyphosphate in the vacuole, and in part by altering the transport rates of sugars,

Figure 13-40 The plant cell vacuole.
(A) A confocal image of cells from an Arabidopsis embryo that is expressing an
aquaporin-YFP (yellow fluorescent protein) fusion protein in its tonoplast, or
vacuole membrane (green); the cell walls have been false-colored orange. Each
cell contains several large vacuoles. (B) This electron micrograph of cells in a
young tobacco leaf shows the cytosol as a thin layer, containing chloroplasts,
pressed against the cell wall by the enormous vacuole.
(A, courtesy of C. Carroll and L. Frigerio, based on S. Gattolin et al., Mol. Plant
4:180-189, 2011. With permission from Oxford University Press; B, courtesy of
J. Burgess.)

amino acids, and other metabolites across the plasma membrane and the vacuolar
membrane. The turgor pressure regulates the activities of distinct transporters
in each membrane to control these fluxes.

Humans often harvest substances stored in plant vacuoles, from rubber to
opium to the flavoring of garlic. Many stored products have a metabolic function.
Proteins, for example, can be preserved for years in the vacuoles of the storage
cells of many seeds, such as those of peas and beans. When the seeds germinate,
these proteins are hydrolyzed, and the resulting amino acids provide a food supply
for the developing embryo. Anthocyanin pigments stored in vacuoles color
the petals of many flowers so as to attract pollinating insects, while noxious
molecules released from vacuoles when a plant is eaten or damaged provide a defense
against predators.


Figure 13-41 The role of the vacuole in controlling the size of plant cells.
A plant cell can achieve a large increase in volume without increasing the
volume of the cytosol. Localized weakening of the cell wall orients a
turgor-driven cell enlargement that accompanies the uptake of water
into an expanding vacuole. The cytosol is eventually confined to a thin
peripheral layer, which is connected to the nuclear region by strands of
cytosol stabilized by bundles of actin filaments (not shown).


Multiple Pathways Deliver Materials to Lysosomes 

Lysosomes are meeting places where several streams of intracellular traffic
converge. A route that leads outward from the ER via the Golgi apparatus delivers
most of the lysosome's digestive enzymes, while at least four paths from different
sources feed substances into lysosomes for digestion.

The best studied of these dégradation paths is the one followed by macromol- 
ecules talcen up from extracellular fluid by endocytosis. A similar pathway found 
in phagocytic cells, such as macrophages and neutrophils in vertebrates, is ded- 
icated to the engulfment, or phagocytosis, of large particles and microorganisms 
to form phagosomes. A third pathway called macropinocytosis specializes in the 
nonspecific uptake of fluids, membrane, and particles attached to the plasma 
membrane. We will return to discuss these pathways later in the chapter. A fourth 
pathway called autophagy originates in the cytoplasm of the cell itself and is used 
to digest cytosol and worn-out organelles, as we discuss next. The four paths to 
degradation in lysosomes are illustrated in Figure 13-42.


Figure 13-42 Four pathways to degradation in lysosomes. Materials in
each pathway are derived from a different source. Note that the autophagosome has
a double membrane. In all cases, the final step is the fusion with lysosomes.

Autophagy Degrades Unwanted Proteins and Organelles

All cell types dispose of obsolete parts by a lysosome-dependent process called
autophagy, or “self-eating.” The degradation process is important during normal
cell growth and in development, where it helps restructure differentiating cells,
but also in adaptive responses to stresses such as starvation and infection. Autophagy
can remove large objects (macromolecules, large protein aggregates, and
even whole organelles) that other disposal mechanisms such as proteasomal
degradation cannot handle. Defects in autophagy may prevent cells from clearing
away invading microbes, unwanted protein aggregates, and abnormal proteins,
and thereby contribute to diseases ranging from infectious disorders to
neurodegeneration and cancer.

In the initial stages of autophagy, cytoplasmic cargo becomes surrounded by
a double membrane that assembles by the fusion of small vesicles of unknown
origin, forming an autophagosome (Figure 13-43). A few tens of different proteins
have been identified in yeast and animal cells that participate in the process,
which must be tightly regulated: either too little or too much can be deleterious.
The whole process occurs in the following sequence of steps:

1. Induction by activation of signaling molecules: Protein kinases (including
the mTOR complex 1, discussed in Chapter 15) that relay information
about the metabolic status of the cell become activated and signal to the
autophagic machinery.

2. Nucleation and extension of a delimiting membrane into a crescent-shaped
cup: Membrane vesicles, characterized by the presence of ATG9, the only
transmembrane protein involved in the process, are recruited to an assembly
site, where they nucleate autophagosome formation. ATG9 is not incorporated
into the autophagosome: a retrieval pathway must remove it from
the assembling structure.

3. Closure of the membrane cup around the target to form a sealed dou- 
ble-membrane-enclosed autophagosome. 

4. Fusion of the autophagosome with lysosomes, catalyzed by SNAREs. 

5. Digestion of the inner membrane and the lumenal contents of the auto- 
phagosome. 
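
The five steps above run in a fixed order, which can be sketched as a simple ordered sequence. This is only an illustrative model, not a description of any real software; the names `AutophagyStep`, `next_step`, and `run_autophagy` are hypothetical and chosen here for readability.

```python
from enum import IntEnum


class AutophagyStep(IntEnum):
    """The five steps listed in the text, numbered in order (names are illustrative)."""
    INDUCTION = 1
    NUCLEATION_AND_EXTENSION = 2
    CLOSURE = 3
    FUSION_WITH_LYSOSOMES = 4
    DIGESTION = 5


def next_step(step):
    """Return the step that follows `step`, or None once digestion is reached."""
    if step is AutophagyStep.DIGESTION:
        return None
    return AutophagyStep(step + 1)


def run_autophagy():
    """Walk through the steps in order and return their names."""
    order = []
    step = AutophagyStep.INDUCTION
    while step is not None:
        order.append(step.name)
        step = next_step(step)
    return order


if __name__ == "__main__":
    for name in run_autophagy():
        print(name)
```

The model captures only the ordering; it says nothing about the regulation that, as noted above, keeps the process from running too little or too much.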

Autophagy can be either nonselective or selective. In nonselective autophagy,
a bulk portion of cytoplasm is sequestered in autophagosomes. It might occur, for
example, in starvation conditions: when external nutrients are limiting, metabolites
derived from the digestion of the captured cytosol might help the cell survive.
In selective autophagy, specific cargo is packaged into autophagosomes that tend
to contain little cytosol, and their shape reflects the shape of the cargo. Selective
autophagy mediates the degradation of worn-out, or otherwise unwanted, mitochondria,
peroxisomes, ribosomes, and ER; it can also be used to destroy invading
microbes.


Figure 13-43 A model of autophagy. (A) Activation of a signaling pathway
initiates a nucleation event in the cytoplasm. A crescent of autophagosomal
membrane grows by fusion of vesicles of unknown origin and eventually fuses
to form a double-membrane-enclosed autophagosome, which sequesters
a portion of the cytoplasm. The autophagosome then fuses with lysosomes
containing acid hydrolases that digest its content. During the formation of the
autophagosome membrane, a ubiquitin-like protein becomes activated by covalent
attachment of a phosphatidylethanolamine lipid anchor. These proteins then mediate
vesicle tethering and fusion, leading to the formation of a crescent-shaped membrane
structure that assembles around its target (not shown). (B) An electron micrograph
of an autophagosome containing a mitochondrion and a peroxisome.
(B, courtesy of Daniel S. Friend, from D.W. Fawcett, A Textbook of Histology,
12th ed. New York: Chapman and Hall, 1994. With permission from Kluwer.)

The selective autophagy of worn-out or damaged mitochondria is called mitophagy.
As discussed in Chapters 12 and 14, when mitochondria function normally,
the inner mitochondrial membrane is energized by an electrochemical H+
gradient that drives ATP synthesis and the import of mitochondrial precursor proteins
and metabolites. Damaged mitochondria cannot maintain the gradient, so
protein import is blocked. As a consequence, a protein kinase called Pink1, which
is normally imported into mitochondria, is instead retained on the mitochondrial
surface, where it recruits the ubiquitin ligase Parkin from the cytosol. Parkin
ubiquitylates mitochondrial outer membrane proteins, which mark the organelle for
selective destruction in autophagosomes. Mutations in Pink1 or Parkin cause a
form of early-onset Parkinson's disease, a degenerative disorder of the central
nervous system. It is not known why the neurons that die prematurely in this disease
are particularly reliant on mitophagy.
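
The chain of cause and effect in mitophagy can be written out as a small decision sketch. It is a qualitative illustration of the logic described above, assuming a simple yes/no state for the H+ gradient; the function name `mitochondrion_fate` and the wording of the event strings are hypothetical.

```python
def mitochondrion_fate(gradient_intact):
    """Return the ordered events for a mitochondrion, given the state of its H+ gradient.

    gradient_intact: True for a normally functioning mitochondrion,
    False for a damaged one that cannot maintain the gradient.
    """
    if gradient_intact:
        # Normal case: protein import works, so Pink1 is imported
        # and is not retained on the mitochondrial surface.
        return [
            "protein import proceeds",
            "Pink1 imported into mitochondrion",
            "mitochondrion not marked for mitophagy",
        ]
    # Damaged case: each event follows from the one before it.
    return [
        "protein import blocked",
        "Pink1 retained on mitochondrial surface",
        "Parkin recruited from cytosol",
        "outer membrane proteins ubiquitylated",
        "mitochondrion marked for mitophagy",
    ]


if __name__ == "__main__":
    for intact in (True, False):
        print("gradient intact:", intact)
        for event in mitochondrion_fate(intact):
            print("  -", event)
```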


A Mannose 6-Phosphate Receptor Sorts Lysosomal Hydrolases in 
the Trans Golgi Network 

We now consider the pathway that delivers lysosomal hydrolases from the TGN to 
lysosomes. The enzymes are first delivered to endosomes in transport vesicles that 
bud from the TGN, before they move on to endolysosomes and lysosomes (see 
Figure 13-39). The vesicles that leave the TGN incorporate the lysosomal proteins 
and exclude the many other proteins being packaged into different transport vesicles
for delivery elsewhere.

How are lysosomal hydrolases recognized and selected in the TGN with the
required accuracy? In animal cells they carry a unique marker in the form of
mannose 6-phosphate (M6P) groups, which are added exclusively to the N-linked
oligosaccharides of these soluble lysosomal enzymes as they pass through the
lumen of the cis Golgi network (Figure 13-44). Transmembrane M6P receptor
proteins, which are present in the TGN, recognize the M6P groups and bind to the
lysosomal hydrolases on the lumenal side of the membrane and to adaptor proteins
in assembling clathrin coats on the cytosolic side. In this way, the receptors
help package the hydrolases into clathrin-coated vesicles that bud from the TGN
and deliver their contents to early endosomes.

The M6P receptor protein binds to M6P at pH 6.5-6.7 in the TGN lumen and
releases it at pH 6, which is the pH in the lumen of endosomes. Thus, after the
receptor is delivered, the lysosomal hydrolases dissociate from the M6P receptors,
which are retrieved into transport vesicles that bud from endosomes. These
vesicles are coated with retromer, a coat protein complex specialized for
endosome-to-TGN transport, which returns the receptors to the TGN for reuse (Figure
13-45).
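
The pH dependence of cargo binding can be sketched with the two values given above. This is a deliberately crude step model, not a binding curve: the text states only that the receptor binds M6P at pH 6.5-6.7 and releases it at pH 6, so the sketch reports any other pH above 6 as "unspecified". Treating every pH at or below 6 as "released" is an assumption made here for illustration, and the function name `receptor_state` is hypothetical.

```python
# Values taken from the text.
TGN_BINDING_PH_RANGE = (6.5, 6.7)   # receptor binds M6P in the TGN lumen
ENDOSOME_RELEASE_PH = 6.0           # receptor releases M6P in endosomes


def receptor_state(ph):
    """Classify M6P receptor behaviour at a given lumenal pH.

    Returns "bound", "released", or "unspecified" (pH values the text
    does not cover). Assumption: release also holds below pH 6.
    """
    low, high = TGN_BINDING_PH_RANGE
    if low <= ph <= high:
        return "bound"
    if ph <= ENDOSOME_RELEASE_PH:
        return "released"
    return "unspecified"


if __name__ == "__main__":
    for ph in (6.6, 6.3, 6.0):
        print(ph, receptor_state(ph))
```

The point of the sketch is that a small drop in pH between two compartments is enough to switch the receptor from carrying cargo to returning empty.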

Transport in either direction requires signals in the cytoplasmic tail of the M6P
receptor that direct this protein to the endosome or back to the TGN. These signals
are recognized by the retromer complex that recruits M6P receptors into transport
vesicles that bud from endosomes. The recycling of the M6P receptor resembles
the recycling of the KDEL receptor discussed earlier, although it differs in the type
of coated vesicles that mediate the transport.

Not all the hydrolase molecules that are tagged with M6P get to lysosomes.
Some escape the normal packaging process in the trans Golgi network and are
transported "by default" to the cell surface, where they are secreted into the
extracellular fluid. Some M6P receptors, however, also take a detour to the plasma
membrane, where they recapture the escaped lysosomal hydrolases and return
them by receptor-mediated endocytosis (discussed later) to lysosomes via early and
late endosomes. As lysosomal hydrolases require an acidic milieu to work, they
can do little harm in the extracellular fluid, which usually has a neutral pH of 7.4.

For the sorting system that segregates lysosomal hydrolases and dispatches
them to endosomes to work, the M6P groups must be added only to the appropriate
glycoproteins in the Golgi apparatus. This requires specific recognition of the
hydrolases by the Golgi enzymes responsible for adding M6P. Since all glycoproteins
leave the ER with identical N-linked oligosaccharide chains, the signal for





Figure 13-44 The structure of mannose 
6-phosphate on a lysosomal hydrolase. 

adding the M6P units to oligosaccharides must reside somewhere in the polypeptide
chain of each hydrolase. Genetic engineering experiments have revealed that
the recognition signal is a cluster of neighboring amino acids on each protein's
surface, known as a signal patch (Figure 13-46). Since most lysosomal hydrolases
contain multiple oligosaccharides, they acquire many M6P groups, providing a
high-affinity signal for the M6P receptor.

Defects in the GlcNAc Phosphotransferase Cause a Lysosomal
Storage Disease in Humans 

Genetic defects that affect one or more of the lysosomal hydrolases cause a number
of human lysosomal storage diseases. The defects result in an accumulation
of undigested substrates in lysosomes, with severe pathological consequences,
most often in the nervous system. In most cases, there is a mutation in a structural
gene that codes for an individual lysosomal hydrolase. This occurs in Hurler's
disease, for example, in which the enzyme required for the breakdown of certain
types of glycosaminoglycan chains is defective or missing. The most severe form
of lysosomal storage disease, however, is a very rare inherited metabolic disorder
called inclusion-cell disease (I-cell disease). In this condition, almost all of the
hydrolytic enzymes are missing from the lysosomes of many cell types, and their
undigested substrates accumulate in these lysosomes, which consequently form
large inclusions in the cells. The consequent pathology is complex, affecting all
organ systems, skeletal integrity, and mental development; individuals rarely live
beyond six or seven years.

I-cell disease is due to a single gene defect and, like most genetic enzyme
deficiencies, it is recessive; that is, it occurs only in individuals having two copies
of the defective gene. In patients with I-cell disease, all the hydrolases missing
from lysosomes are found in the blood: because they fail to sort properly in the
Golgi apparatus, they are secreted rather than transported to lysosomes. The
mis-sorting has been traced to a defective or missing GlcNAc phosphotransferase.
Because lysosomal enzymes are not phosphorylated in the cis Golgi network,
the M6P receptors do not segregate them into the appropriate transport vesicles
in the TGN. Instead, the lysosomal hydrolases are carried to the cell surface and
secreted.

In I-cell disease, the lysosomes in some cell types, such as hepatocytes, contain
a normal complement of lysosomal enzymes, implying that there is another


Figure 13-45 The transport of newly 
synthesized lysosomal hydrolases to 
endosomes. The sequential action of two 
enzymes in the cis and trans Golgi network 
adds mannose 6-phosphate (M6P) groups 
to the precursors of lysosomal enzymes 
(see Figure 13-46). The M6P-tagged 
hydrolases then segregate from all other
types of proteins in the TGN because 
adaptor proteins (not shown) in the clathrin 
coat bind the M6P receptors, which, in 
turn, bind the M6P-modified lysosomal 
hydrolases. The clathrin-coated vesicles 
bud off from the TGN, shed their coat, 
and fuse with early endosomes. At the 
lower pH of the endosome, the hydrolases 
dissociate from the M6P receptors, and the 
empty receptors are retrieved in retromer- 
coated vesicles to the TGN for further 
rounds of transport. In the endosomes, 
the phosphate is removed from the M6P 
attached to the hydrolases, which may 
further ensure that the hydrolases do not 
return to the TGN with the receptor. 

pathway for directing hydrolases to lysosomes that is used by some cell types
but not others. Alternative sorting receptors function in these M6P-independent
pathways. Similarly, an M6P-independent pathway in all cells sorts the membrane
proteins of lysosomes from the TGN for transport to late endosomes, and
those proteins are therefore normal in I-cell disease.

Some Lysosomes and Multivesicular Bodies Undergo Exocytosis 

Targeting of material to lysosomes is not necessarily the end of the pathway.
Lysosomal secretion of undigested content enables all cells to eliminate indigestible
debris. For most cells, this seems to be a minor pathway, used only when the cells
are stressed. Some cell types, however, contain specialized lysosomes that have
acquired the necessary machinery for fusion with the plasma membrane.
Melanocytes in the skin, for example, produce and store pigments in their lysosomes.
These pigment-containing melanosomes release their pigment into the
extracellular space of the epidermis by exocytosis. The pigment is then taken up by
keratinocytes, leading to normal skin pigmentation. In some genetic disorders,
defects in melanosome exocytosis block this transfer process, leading to forms
of hypopigmentation (albinism). Under certain conditions, multivesicular bodies
can also fuse with the plasma membrane. If that occurs, their intralumenal vesicles
are released from cells. Circulating small vesicles, also called exosomes, have
been observed in the blood and may be used to transport components between
cells, although the importance of such a mechanism of potential communication
between distant cells is unknown. Some exosomes may derive from direct vesicle
budding events at the plasma membrane, which is a topologically equivalent
process (see Figure 13-57).


Figure 13-46 The recognition of a lysosomal hydrolase. A GlcNAc
phosphotransferase recognizes lysosomal hydrolases in the Golgi apparatus. The
enzyme has separate catalytic and recognition sites. The catalytic site
binds both high-mannose N-linked oligosaccharides and UDP-GlcNAc. The
recognition site binds to a signal patch that is present only on the surface of
lysosomal hydrolases. A second enzyme cleaves off the GlcNAc, leaving the mannose
6-phosphate exposed.


Summary 

Lysosomes are specialized for the intracellular digestion of macromolecules. They
contain unique membrane proteins and a wide variety of soluble hydrolytic enzymes
that operate best at pH 5, which is the internal pH of lysosomes. An ATP-driven H+
pump in the lysosomal membrane maintains this low pH. Newly synthesized lysosomal
proteins are transported from the lumen of the ER through the Golgi apparatus;
they are then carried from the trans Golgi network to endosomes by means of
clathrin-coated transport vesicles, before moving on to lysosomes.

The lysosomal hydrolases contain N-linked oligosaccharides that are covalently
modified in a unique way in the cis Golgi so that their mannoses are phosphorylated.
These mannose 6-phosphate (M6P) groups are recognized by an M6P receptor
protein in the trans Golgi network that segregates the hydrolases and helps package
them into budding transport vesicles that deliver their contents to endosomes.
The M6P receptors shuttle back and forth between the trans Golgi network and the
endosomes. The low pH in endosomes and the removal of the phosphate from the
M6P group cause the lysosomal hydrolases to dissociate from these receptors, making
the transport of the hydrolases unidirectional. A separate transport system uses
clathrin-coated vesicles to deliver resident lysosomal membrane proteins from the
trans Golgi network to endosomes.


TRANSPORT INTO THE CELL FROM THE PLASMA 
MEMBRANE: ENDOCYTOSIS 

The routes that lead inward from the cell surface start with the process of
endocytosis, by which cells take up plasma membrane components, fluid, solutes,
macromolecules, and particulate substances. Endocytosed cargo includes
receptor-ligand complexes, a spectrum of nutrients and their carriers, extracellular
matrix components, cell debris, bacteria, viruses, and, in specialized cases, even other
cells. Through endocytosis, the cell regulates the composition of its plasma membrane
in response to changing extracellular conditions.

In endocytosis, the material to be ingested is progressively enclosed by a small
portion of the plasma membrane, which first invaginates and then pinches off
to form an endocytic vesicle containing the ingested substance or particle. Most
eukaryotic cells constantly form endocytic vesicles, a process called pinocytosis
("cell drinking"); in addition, some specialized cells contain dedicated pathways
that take up large particles on demand, a process called phagocytosis ("cell
eating"). Endocytic vesicles form at the plasma membrane by multiple mechanisms
that differ in both the molecular machinery used and how that machinery is
regulated.

Once generated at the plasma membrane, most endocytic vesicles fuse with a
common receiving compartment, the early endosome, where internalized cargo
is sorted: some cargo molecules are returned to the plasma membrane, either
directly or via a recycling endosome, and others are designated for degradation
by inclusion in a late endosome. Late endosomes form from a bulbous,
vacuolar portion of early endosomes by a process called endosome maturation.
This conversion process changes the protein composition of the endosome
membrane, patches of which invaginate and become incorporated within the
organelles as intralumenal vesicles, while the endosome itself moves from the
cell periphery to a location close to the nucleus. As an endosome matures, it
ceases to recycle material to the plasma membrane and irreversibly commits
its remaining contents to degradation: late endosomes fuse with one another
and with lysosomes to form endolysosomes, which degrade their contents, as
discussed earlier (Figure 13-47).

Each of the stages of endosome maturation—from the early endosome to the 
endolysosome—is connected through bidirectional vesicle transport pathways to 




Figure 13-47 Endosome maturation: 
the endocytic pathway from the plasma 
membrane to lysosomes. Endocytic 
vesicles fuse near the cell periphery with 
an early endosome, which is the primary 
sorting station. Tubular portions of the early 
endosome bud off vesicles that recycle 
endocytosed cargo back to the plasma 
membrane—either directly, or indirectly 
via recycling endosomes. Recycling 
endosomes can store proteins until they are 
needed. Conversion of early endosomes to 
late endosomes is accompanied by loss of 
the tubular projections. Membrane proteins 
destined for degradation are internalized
in intralumenal vesicles. The developing 
late endosome, or multivesicular body, 
moves on microtubules to the cell interior. 
Fully matured late endosomes no longer 
send vesicles to the plasma membrane, 
and they fuse with one another and 
with endolysosomes and lysosomes to 
degrade their contents. Each stage of
endosome maturation is connected via 
transport vesicles with the TGN, providing 
a continuous supply of newly synthesized 
lysosomal proteins. 

the TGN. These pathways allow insertion of newly synthesized materials, such as
lysosomal enzymes arriving from the ER, and the retrieval of components, such as
the M6P receptor, back into the early parts of the secretory pathway. We next discuss
how the cell uses and controls the various features of endocytic trafficking.

Pinocytic Vesicles Form from Coated Pits in the Plasma 
Membrane 

Virtually all eukaryotic cells continually ingest portions of their plasma membrane
in the form of small pinocytic (endocytic) vesicles. The rate at which plasma
membrane is internalized in this process of pinocytosis varies between cell types, 
but it is usually surprisingly high. A macrophage, for example, ingests 25% of its 
own volume of fluid each hour. This means that it must ingest 3% of its plasma 
membrane each minute, or 100% in about half an hour. Fibroblasts endocytose 
at a somewhat lower rate (1% of their plasma membrane per minute), whereas 
some amoebae ingest their plasma membrane even more rapidly. Since a cell's 
surface area and volume remain unchanged during this process, it is clear that the 
same amount of membrane being removed by endocytosis is being added to the 
cell surface by the converse process of exocytosis. In this sense, endocytosis and 
exocytosis are linked processes that can be considered to constitute an endocytic-
exocytic cycle. The coupling between exocytosis and endocytosis is particularly 
strict in specialized structures characterized by high membrane turnover, such as 
a nerve terminal. 
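
The turnover times quoted above follow directly from the per-minute rates, as the short calculation below shows. It uses only the rates given in the text (3% per minute for a macrophage, 1% per minute for a fibroblast); the function name `minutes_to_internalize_whole_membrane` is hypothetical.

```python
def minutes_to_internalize_whole_membrane(percent_per_minute):
    """Time needed to internalize an amount of membrane equal to 100% of the
    plasma membrane, at a constant rate given in percent per minute."""
    return 100.0 / percent_per_minute


if __name__ == "__main__":
    # Rates taken from the text.
    macrophage = minutes_to_internalize_whole_membrane(3.0)
    fibroblast = minutes_to_internalize_whole_membrane(1.0)
    print(f"macrophage: about {macrophage:.0f} minutes")
    print(f"fibroblast: about {fibroblast:.0f} minutes")
```

At 3% per minute the macrophage needs roughly 33 minutes, which is the "about half an hour" stated above; at 1% per minute a fibroblast needs 100 minutes.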

The endocytic part of the cycle often begins at clathrin-coated pits. These specialized
regions typically occupy about 2% of the total plasma membrane area. The
lifetime of a clathrin-coated pit is short: within a minute or so of being formed, it
invaginates into the cell and pinches off to form a clathrin-coated vesicle (Figure
13-48). About 2500 clathrin-coated vesicles pinch off from the plasma membrane 
of a cultured fibroblast every minute. The coated vesicles are even more transient 
than the coated pits: within seconds of being formed, they shed their coat and 
fuse with early endosomes. 
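
The vesicle count can be combined with the fibroblast uptake rate quoted earlier (1% of the plasma membrane per minute) to estimate how much membrane a single vesicle carries. This is a rough back-of-the-envelope sketch that assumes, purely for illustration, that all of the fibroblast's membrane uptake occurs through clathrin-coated vesicles, which the text does not state. The function name `membrane_fraction_per_vesicle` is hypothetical.

```python
def membrane_fraction_per_vesicle(percent_per_minute, vesicles_per_minute):
    """Fraction of the plasma membrane carried by one vesicle, assuming all
    internalized membrane is divided equally among the vesicles formed."""
    return (percent_per_minute / 100.0) / vesicles_per_minute


if __name__ == "__main__":
    # Both figures are taken from the text: 1% per minute, 2500 vesicles per minute.
    fraction = membrane_fraction_per_vesicle(1.0, 2500)
    print(f"fraction of plasma membrane per vesicle: {fraction:.1e}")
    print(f"vesicles equal to one whole membrane: {round(1 / fraction)}")
```

Under that assumption each vesicle removes about four millionths of the plasma membrane, so on the order of 250,000 vesicles would be needed to internalize an area equal to the whole cell surface.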

Not All Pinocytic Vesicles Are Clathrin-Coated

In addition to clathrin-coated pits and vesicles, cells can form other types of
pinocytic vesicles, such as caveolae (from the Latin for “little cavities”), originally
recognized by their ability to transport molecules across endothelial cells that
form the inner lining of blood vessels. Caveolae, sometimes seen in the electron
microscope as deeply invaginated flasks, are present in the plasma membrane
of most vertebrate cell types (Figure 13-49). They are thought to form from lipid
rafts in the plasma membrane (discussed in Chapter 10), which are especially rich


Figure 13-48 The formation of clathrin- 
coated vesicles from the plasma 
membrane. These electron micrographs
illustrate the probable sequence of events 
in the formation of a clathrin-coated vesicle 
from a clathrin-coated pit. The clathrin- 
coated pits and vesicles shown are larger 
than those seen in normal-sized cells; they 
are from a very large hen oocyte and they 
take up lipoprotein particles to form yolk. 
The lipoprotein particles bound to their 
membrane-bound receptors appear as 
a dense, fuzzy layer on the extracellular 
surface of the plasma membrane—which 
is the inside surface of the coated pit and 
vesicle. (Courtesy of M.M. Perry and A.B.
Gilbert, J. Cell Sci. 39:257-272, 1979.
With permission from The Company of
Biologists.)

Figure 13-49 Caveolae in the plasma membrane of a fibroblast. (A) This
electron micrograph shows a plasma membrane with a very high density of
caveolae. (B) This rapid-freeze deep-etch image demonstrates the characteristic
“cauliflower” texture of the cytosolic face of the caveolae membrane. The
characteristic texture is thought to result from aggregates of caveolins and cavins.
A clathrin-coated pit is also seen at the upper right. (Courtesy of R.G.W. Anderson,
from K.G. Rothberg et al., Cell 68:673-682, 1992. With permission from Elsevier.)

in cholesterol, glycosphingolipids, and glycosylphosphatidylinositol (GPI)-anchored
membrane proteins (see Figure 10-13). The major structural proteins in
caveolae are caveolins, a family of unusual integral membrane proteins that each
insert a hydrophobic loop into the membrane from the cytosolic side but do not
extend across the membrane. On their cytosolic side, caveolins are bound to large
protein complexes of cavin proteins, which are thought to stabilize the membrane
curvature.

In contrast to clathrin-coated and COPI- or COPII-coated vesicles, caveolae 
are usually static structures. Nonetheless, they can be induced to pinch off and 
serve as endocytic vesicles to transport cargo to early endosomes or to the plasma 
membrane on the opposite side of a polarized cell (in a process called transcyto- 
sis, which we discuss later). Some animal viruses such as SV40 and papillomavirus 
(which causes warts) enter cells in vesicles derived from caveolae. The viruses are 
first delivered to early endosomes and move from there in transport vesicles to the 
lumen of the ER. The viral genome exits across the ER membrane into the cytosol, 
from where it is imported into the nucleus to start the infection cycle. Cholera
toxin (discussed in Chapters 15 and 19) also enters the cell through caveolae and
is transported to the ER before entering the cytosol. 

Macropinocytosis is another clathrin-independent endocytic mechanism
that can be activated in practically all animal cells. In most cell types, it does
not operate continually but rather is induced for a limited time in response to
cell-surface receptor activation by specific cargoes, including growth factors,
integrin ligands, apoptotic cell remnants, and some viruses. These ligands activate a
complex signaling pathway, resulting in a change in actin dynamics and the formation
of cell-surface protrusions, called ruffles (discussed in Chapter 16). When
ruffles collapse back onto the cell, large fluid-filled endocytic vesicles form, called
macropinosomes (Figure 13-50), which transiently increase the bulk fluid uptake
of a cell by up to tenfold. Macropinocytosis is a dedicated degradative pathway:
macropinosomes acidify and then fuse with late endosomes or endolysosomes,
without recycling their cargo back to the plasma membrane.

Cells Use Receptor-Mediated Endocytosis to Import Selected 
Extracellular Macromolecules 

In most animal cells, clathrin-coated pits and vesicles provide an efficient pathway
for taking up specific macromolecules from the extracellular fluid. In this
process, called receptor-mediated endocytosis, the macromolecules bind to

Figure 13-50 Schematic representation of macropinocytosis. Cell signaling events
lead to a reprogramming of actin dynamics, which in turn triggers the formation of
cell-surface ruffles. As the ruffles collapse back onto the cell surface, they
nonspecifically trap extracellular fluid and macromolecules and particles contained
in it, forming large vacuoles, or macropinosomes, as shown.


complementary transmembrane receptor proteins, which accumulate in coated
pits, and then enter the cell as receptor-macromolecule complexes in
clathrin-coated vesicles (see Figure 13-48). Because ligands are selectively captured
by receptors, receptor-mediated endocytosis provides a selective concentrating
mechanism that increases the efficiency of internalization of particular ligands
more than a hundredfold. In this way, even minor components of the extracellular
fluid can be efficiently taken up in large amounts. A particularly well-understood
and physiologically important example is the process that mammalian cells use
to import cholesterol.

Many animal cells take up cholesterol through receptor-mediated endocytosis
and, in this way, acquire most of the cholesterol they require to make new membrane.
If the uptake is blocked, cholesterol accumulates in the blood and can contribute
to the formation in blood vessel (artery) walls of atherosclerotic plaques,
deposits of lipid and fibrous tissue that can cause strokes and heart attacks by
blocking arterial blood flow. In fact, it was a study of humans with a strong genetic
predisposition for atherosclerosis that first revealed the mechanism of
receptor-mediated endocytosis.

Most cholesterol is transported in the blood as cholesteryl esters in the form of
lipid-protein particles known as low-density lipoproteins (LDLs) (Figure 13-51).
When a cell needs cholesterol for membrane synthesis, it makes transmembrane
receptor proteins for LDL and inserts them into its plasma membrane. Once in
the plasma membrane, the LDL receptors diffuse until they associate with
clathrin-coated pits that are in the process of forming. There, an endocytosis signal
in the cytoplasmic tail of the LDL receptors binds the membrane-bound adaptor
protein AP2 after its conformation has been locally unlocked by binding to
PI(4,5)P2 on the plasma membrane. Coincidence detection, as discussed earlier,
thus imparts both efficiency and selectivity to the process (see Figure 13-9). AP2
then recruits clathrin to initiate endocytosis.

Since coated pits constantly pinch off to form coated vesicles, any LDL particles
bound to LDL receptors in the coated pits are rapidly internalized in coated
vesicles. After shedding their clathrin coats, the vesicles deliver their contents to
early endosomes. Once the LDL and LDL receptors encounter the low pH in early
endosomes, LDL is released from its receptor and is delivered via late endosomes
to lysosomes. There, the cholesteryl esters in the LDL particles are hydrolyzed to
free cholesterol, which is now available to the cell for new membrane synthesis
(Movie 13.3). If too much free cholesterol accumulates in a cell, the cell shuts off
both its own cholesterol synthesis and the synthesis of LDL receptors, so that it
ceases both to make and to take up cholesterol.

This regulated pathway for cholesterol uptake is disrupted in individuals who
inherit defective genes encoding LDL receptors. The resulting high levels of blood
cholesterol predispose these individuals to develop atherosclerosis prematurely,
and many would die at an early age of heart attacks resulting from coronary artery
disease if they were not treated with drugs such as statins that lower the level of
blood cholesterol. In some cases, the receptor is lacking altogether. In others, the
receptors are defective, in either the extracellular binding site for LDL or the

Figure 13-51 A low-density lipoprotein (LDL) particle. Each spherical particle has
a mass of 3 × 10^6 daltons. It contains a core of about 1500 cholesterol molecules
esterified to long-chain fatty acids. A lipid monolayer composed of about
800 phospholipid and 500 unesterified cholesterol molecules surrounds the core
of cholesteryl esters. A single molecule of apolipoprotein B, a 500,000-dalton beltlike
protein, organizes the particle and mediates the specific binding of LDL to
cell-surface LDL receptors.
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intracellular binding site that attaches the receptor AP2 adaptor protein in clath- 
rin-coated pits. In the latter case, normal numbers of LDL receptors are présent, 
but they fail to become localized in clathrin-coated pits. Although LDL binds to 
the surface of these mutant cells, it is not internalized, directly demonstrating the 
importance of clathrin-coated pits for the receptor-mediated endocytosis of cho¬ 
lestérol. 
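
As a rough check on the composition given in Figure 13-51, the short Python sketch below tallies the lipid molecules in one LDL particle and estimates what fraction of the particle's mass is apolipoprotein B. The constant and function names are illustrative; the numbers are the approximate values quoted in the caption.

```python
# Approximate values quoted in the Figure 13-51 caption.
PARTICLE_MASS_DA = 3e6        # total mass of one LDL particle, in daltons
APO_B_MASS_DA = 500_000       # mass of the single apolipoprotein B molecule
CHOLESTERYL_ESTERS = 1500     # esterified cholesterol molecules in the core
PHOSPHOLIPIDS = 800           # phospholipid molecules in the monolayer
FREE_CHOLESTEROL = 500        # unesterified cholesterol in the monolayer


def lipid_molecule_count():
    """Total number of lipid molecules (core plus monolayer) per particle."""
    return CHOLESTERYL_ESTERS + PHOSPHOLIPIDS + FREE_CHOLESTEROL


def protein_mass_fraction():
    """Fraction of the particle's mass contributed by apolipoprotein B."""
    return APO_B_MASS_DA / PARTICLE_MASS_DA


if __name__ == "__main__":
    print(f"lipid molecules per particle: about {lipid_molecule_count()}")
    print(f"apolipoprotein B mass fraction: about {protein_mass_fraction():.0%}")
```

On these figures, roughly one-sixth of the particle's mass is protein and the remainder is lipid.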

More than 25 distinct receptors are known to participate in receptor-mediated
endocytosis of different types of molecules. They all apparently use
clathrin-dependent internalization routes and are guided into clathrin-coated pits by
signals in their cytoplasmic tails that bind to adaptor proteins in the clathrin coat. Many
of these receptors, like the LDL receptor, enter coated pits irrespective of whether
they have bound their specific ligands. Others enter preferentially when bound
to a specific ligand, suggesting that a ligand-induced conformational change is
required for them to activate the signal sequence that guides them into the pits.
Since most plasma membrane proteins fail to become concentrated in
clathrin-coated pits, the pits serve as molecular filters, preferentially collecting certain
plasma membrane proteins (receptors) over others.

Electron-microscope studies of cultured cells exposed simultaneously to
different labeled ligands demonstrate that many kinds of receptors can cluster in the
same coated pit, whereas some other receptors cluster in different clathrin-coated
pits. The plasma membrane of one clathrin-coated pit can accommodate more
than 100 receptors of assorted varieties.


Specific Proteins Are Retrieved from Early Endosomes and
Returned to the Plasma Membrane


Early endosomes are the main sorting station in the endocytic pathway, just as
the cis and trans Golgi networks serve this function in the secretory pathway. In
the mildly acidic environment of the early endosome, many internalized receptor
proteins change their conformation and release their ligand, as already
discussed for the M6P receptors. Those endocytosed ligands that dissociate from
their receptors in the early endosome are usually doomed to destruction in
lysosomes (although cholesterol is an exception, as just discussed), along with the
other soluble contents of the endosome. Some other endocytosed ligands,
however, remain bound to their receptors, and thereby share the fate of the receptors.

In the early endosome, the LDL receptor dissociates from its ligand, LDL, and
is recycled back to the plasma membrane for reuse, leaving the discharged LDL to
be carried to lysosomes (Figure 13-52). The recycling transport vesicles bud from
long, narrow tubules that extend from the early endosomes. It is likely that the
geometry of these tubules helps the sorting process: because tubules have a large
membrane area enclosing a small volume, membrane proteins become enriched
over soluble proteins. The transport vesicles return the LDL receptor directly to
the plasma membrane.
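
The geometric argument can be made concrete with a minimal sketch that compares membrane area per unit enclosed volume for a narrow tubule and for a spherical vacuolar region. The dimensions used (a 25 nm tubule radius and a 200 nm vacuole radius) are illustrative assumptions rather than measurements from the text, and the tubule is idealized as an open cylinder with no end caps.

```python
import math


def sphere_area_to_volume(radius_nm):
    """Membrane area divided by enclosed volume for a sphere (units: 1/nm)."""
    area = 4.0 * math.pi * radius_nm ** 2
    volume = (4.0 / 3.0) * math.pi * radius_nm ** 3
    return area / volume


def tubule_area_to_volume(radius_nm, length_nm):
    """Same ratio for a cylinder, counting only the side wall as membrane."""
    area = 2.0 * math.pi * radius_nm * length_nm
    volume = math.pi * radius_nm ** 2 * length_nm
    return area / volume


if __name__ == "__main__":
    # Hypothetical dimensions, chosen only to illustrate the trend.
    tubule = tubule_area_to_volume(radius_nm=25.0, length_nm=1000.0)
    vacuole = sphere_area_to_volume(radius_nm=200.0)
    print(f"tubule  area/volume: {tubule:.3f} per nm")
    print(f"vacuole area/volume: {vacuole:.3f} per nm")
    print(f"tubule is enriched in membrane by a factor of {tubule / vacuole:.1f}")
```

The ratio works out to 2/r for the tubule and 3/R for the sphere, so the narrower the tubule, the more strongly a vesicle that buds from it favors membrane proteins over soluble lumenal contents.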

The transferrin receptor follows a recycling pathway similar to that of the LDL
receptor, but unlike the LDL receptor it also recycles its ligand. Transferrin is a soluble
protein that carries iron in the blood. Cell-surface transferrin receptors deliver
transferrin with its bound iron to early endosomes by receptor-mediated
endocytosis. The low pH in the endosome induces transferrin to release its bound iron,
but the iron-free transferrin itself (called apotransferrin) remains bound to its
receptor. The receptor-apotransferrin complex enters the tubular extensions of
the early endosome and from there is recycled back to the plasma membrane.
When the apotransferrin returns to the neutral pH of the extracellular fluid, it
dissociates from the receptor and is thereby freed to pick up more iron and begin
the cycle again. Thus, transferrin shuttles back and forth between the
extracellular fluid and early endosomes, avoiding lysosomes and delivering iron to the cell
interior, as needed for cells to grow and proliferate.

Figure 13-52 The receptor-mediated endocytosis of LDL. Note that the LDL
dissociates from its receptors in the acidic environment of the early endosome. After
a number of steps, the LDL ends up in endolysosomes and lysosomes, where it
is degraded to release free cholesterol. In contrast, the LDL receptors are returned
to the plasma membrane via transport vesicles that bud off from the tubular
region of the early endosome, as shown. For simplicity, only one LDL receptor is
shown entering the cell and returning to the plasma membrane. Whether it is occupied
or not, an LDL receptor typically makes one round trip into the cell and back to
the plasma membrane every 10 minutes, making a total of several hundred trips in its
20-hour life-span.

Plasma Membrane Signaling Receptors Are Down-Regulated by
Degradation in Lysosomes

A second pathway that endocytosed receptors can follow from endosomes is taken
by many signaling receptors, including opioid receptors and the receptor that
binds epidermal growth factor (EGF). EGF is a small, extracellular signal protein
that stimulates epidermal and various other cells to divide. Unlike LDL receptors,
EGF receptors accumulate in clathrin-coated pits only after binding their ligand,
and most do not recycle but are degraded in lysosomes, along with the ingested
EGF. EGF binding therefore first activates intracellular signaling pathways and
then leads to a decrease in the concentration of EGF receptors on the cell surface,
a process called receptor downregulation that reduces the cell's subsequent
sensitivity to EGF (see Figure 15-20).

Receptor downregulation is highly regulated. The activated receptors are first
covalently modified on the cytosolic face with the small protein ubiquitin. Unlike
polyubiquitylation, which adds a chain of ubiquitins that typically targets a
protein for degradation in proteasomes (discussed in Chapter 6), ubiquitin tagging
for sorting into the clathrin-dependent endocytic pathway adds just one or a few
single ubiquitin molecules to the protein, a process called monoubiquitylation
or multiubiquitylation, respectively. Ubiquitin-binding proteins recognize the
attached ubiquitin and help direct the modified receptors into clathrin-coated
pits. After delivery to the early endosome, other ubiquitin-binding proteins that
are part of ESCRT complexes (ESCRT = Endosome Sorting Complex Required for
Transport) recognize and sort the ubiquitylated receptors into intralumenal
vesicles, which are retained in the maturing late endosome (see Figure 13-47). In this
way, addition of ubiquitin blocks receptor recycling to the plasma membrane and
directs the receptors into the degradation pathway, as we discuss next.

Early Endosomes Mature into Late Endosomes 

The endosomal compartments can be made visible in the electron microscope
by adding a readily detectable tracer molecule, such as the enzyme peroxidase,
to the extracellular medium and allowing varying lengths of time for the cell to
endocytose the tracer. The distribution of the molecule after its uptake reveals
the sequence of events. Within a minute or so after adding the tracer, it starts to
appear in early endosomes, just beneath the plasma membrane (Figure 13-53).
By 5-15 minutes later, it has moved to late endosomes, close to the Golgi
apparatus and near the nucleus.

How early endosomes arise is not entirely clear, but their membrane and
volume are mainly derived from incoming endocytic vesicles that fuse with one
another (Movie 13.4). Early endosomes are relatively small and patrol the
cytoplasm underlying the plasma membrane in jerky back-and-forth movements
along microtubules, capturing incoming vesicles. Typically, an early endosome
receives incoming vesicles for about 10 minutes, during which time membrane
and fluid are rapidly recycled to the plasma membrane. Some of the incoming
cargo, however, accumulates over the lifetime of the early endosome, eventually
to be included in the late endosome.

Figure 13-53 Electron micrograph of an early endosome. The endosome is labeled
with endocytosed horseradish peroxidase, a widely used enzyme marker, detected
in this case by an electron-dense reaction product. Many tubular extensions protrude
from the central vacuolar space of the early endosome, which will later mature
to give rise to a late endosome. (From J. Tooze and M. Hollinshead, J. Cell Biol.
118:813-830, 1992.)

Early endosomes have tubular and vacuolar domains (see Figure 13-53). Most
of the membrane surface is in the tubules and most of the volume is in the vacuolar
domain. During endosome maturation, the two domains have different fates: the
vacuolar portions of the early endosome are retained and transformed into late
endosomes; the tubular portions shrink. Maturing endosomes, also called
multivesicular bodies, migrate along microtubules toward the cell interior, shedding
membrane tubules and vesicles that recycle material to the plasma membrane
and TGN, and receiving newly synthesized lysosomal proteins. As they
concentrate in a perinuclear region of the cell, the multivesicular bodies fuse with each
other, and eventually with endolysosomes and lysosomes (see Figure 13-47).

Many changes occur during the maturation process. (1) The endosome changes
shape and location, as the tubular domains are lost and the vacuolar domains are
thoroughly modified. (2) Rab proteins, phosphoinositide lipids, fusion machinery
(SNAREs and tethers), and microtubule motor proteins all participate in a
molecular makeover of the cytosolic face of the endosome membrane, changing the
functional characteristics of the organelle. (3) A V-type ATPase in the endosome
membrane pumps H+ from the cytosol into the endosome lumen and acidifies the
organelle. Crucially, the increasing acidity that accompanies maturation renders
lysosomal hydrolases increasingly active and influences many receptor-ligand
interactions, thereby controlling receptor loading and unloading.
(4) Intralumenal vesicles sequester endocytosed signaling receptors inside the endosome,
thus halting the receptor signaling activity. (5) Lysosome proteins are delivered
from the TGN to the maturing endosome. Most of these events occur gradually
but eventually lead to a complete transformation of the endosome into an early
endolysosome.

In addition to committing selected cargo for degradation, the maturation
process is important for lysosome maintenance. The continual delivery of lysosome
components from the TGN to maturing endosomes ensures a steady supply of
new lysosome proteins. The endocytosed materials mix in early endosomes with
newly arrived acid hydrolases. Although mild digestion may start here, many
hydrolases are synthesized and delivered as proenzymes, called zymogens, which
contain extra inhibitory domains that keep the hydrolases inactive until these
domains are proteolytically removed at later stages of endosome maturation.
Moreover, the pH in early endosomes is not low enough to activate lysosomal
hydrolases optimally. By these means, cells can retrieve membrane proteins
intact from early endosomes and recycle them back to the plasma membrane.

ESCRT Protein Complexes Mediate the Formation of Intralumenal
Vesicles in Multivesicular Bodies

As endosomes mature, patches of their membrane invaginate into the endosome
lumen and pinch off to form intralumenal vesicles. Because of their appearance in
the electron microscope, such maturing endosomes are also called multivesicular
bodies (Figure 13-54).

The multivesicular bodies carry endocytosed membrane proteins that are to
be degraded. As part of the protein-sorting process, receptors destined for
degradation, such as the occupied EGF receptors described previously, selectively
partition into the invaginating membrane of the multivesicular bodies. In this way,
both the receptors and any signaling proteins strongly bound to them are
sequestered away from the cytosol where they might otherwise continue signaling.
They also are made fully accessible to the digestive enzymes that eventually will
degrade them (Figure 13-55). In addition to endocytosed membrane proteins,
multivesicular bodies include the soluble content of early endosomes destined
for late endosomes and digestion in lysosomes.

As discussed earlier, sorting into intralumenal vesicles requires one or
multiple ubiquitin tags, which are added to the cytosolic domains of membrane
proteins. These tags initially help guide the proteins into clathrin-coated vesicles in
the plasma membrane. Once delivered to the endosomal membrane, the
ubiquitin tags are recognized again, this time by a series of cytosolic ESCRT protein
complexes (ESCRT-0, -I, -II, and -III), which bind sequentially and ultimately
mediate the sorting process into the intralumenal vesicles. Membrane
invagination into multivesicular bodies also depends on a lipid kinase that phosphorylates
phosphatidylinositol to produce PI(3)P, which serves as an additional docking site
for the ESCRT complexes; these complexes require both PI(3)P and the presence
of ubiquitylated cargo proteins to bind to the endosomal membrane. ESCRT-III
forms large multimeric assemblies on the membrane that bend the membrane
(Figure 13-56).

Figure 13-54 Electron micrograph of a multivesicular body. The large amount
of internal membrane will be delivered to the lysosome for digestion. (Courtesy of
Andrew Staehelin, from A. Driouich, A. Jauneau and L.A. Staehelin, Plant
Physiol. 113:487-492, 1997. With permission from the American Society of
Plant Biologists.)

Mutant cells compromised in ESCRT function display signaling defects. In
such cells, activated receptors cannot be down-regulated by endocytosis and
packaging into multivesicular bodies. The still-active receptors therefore mediate
prolonged signaling, which can lead to uncontrolled cell proliferation and cancer.

Processes that shape membranes often use similar machinery. Because of
strong similarities in their protein sequences, researchers think that ESCRT
complexes are evolutionarily related to components that mediate cell-membrane
deformation in cytokinesis in archaea. Similarly, the ESCRT machinery that
drives the internal budding from the endosome membrane to form intralumenal
vesicles is also used in animal cell cytokinesis and virus budding, which are
topologically equivalent, as both processes involve budding away from the cytosolic
surface of the membrane (Figure 13-57).


Recycling Endosomes Regulate Plasma Membrane Composition 

The fates of endocytosed receptors, and of any ligands remaining bound to
them, vary according to the specific type of receptor. As we discussed, most
receptors are recycled and returned to the same plasma membrane domain from
which they came; some proceed to a different domain of the plasma membrane,
thereby mediating transcytosis; and some progress to lysosomes, where they are
degraded.

Receptors on the surface of polarized epithelial cells can transfer specific
macromolecules from one extracellular space to another by transcytosis. A
newborn, for example, obtains antibodies from its mother's milk (which help protect
it against infection) by transporting them across the epithelium of its gut. The
lumen of the gut is acidic, and, at this low pH, the antibodies in the milk bind
to specific receptors on the apical (absorptive) surface of the gut epithelial cells.
The receptor-antibody complexes are internalized via clathrin-coated pits and
vesicles and are delivered to early endosomes. The complexes remain intact and
are retrieved in transport vesicles that bud from the early endosome and
subsequently fuse with the basolateral domain of the plasma membrane. On exposure
to the neutral pH of the extracellular fluid that bathes the basolateral surface of
the cells, the antibodies dissociate from their receptors and eventually enter the
baby's bloodstream.

Figure 13-55 The sequestration of endocytosed proteins into intralumenal
vesicles of multivesicular bodies. Ubiquitylated membrane proteins are
sorted into domains on the endosome membrane, which invaginate and pinch
off to form intralumenal vesicles. The ubiquitin marker is removed and returned
to the cytosol for reuse before the intralumenal vesicle closes. Eventually,
proteases and lipases in lysosomes digest all of the internal membranes.
The invagination processes are essential for complete digestion of endocytosed
membrane proteins: because the outer membrane of the multivesicular body
becomes continuous with the lysosomal membrane, which is resistant to lysosomal
hydrolases, the hydrolases could not digest the cytosolic domains of
endocytosed transmembrane proteins, such as the EGF receptor shown here,
if the protein were not localized in intralumenal vesicles.

Figure 13-56 Sorting of endocytosed membrane proteins into the intralumenal
vesicles of a multivesicular body. A series of complex binding events
passes the ubiquitylated cargo proteins sequentially from one ESCRT complex
(ESCRT-0) to the next, eventually concentrating them in membrane areas
that bud into the lumen of the endosome to form intralumenal vesicles. ESCRT-III
assembles into expansive multimeric structures and mediates invagination. The
mechanisms of how cargo molecules are shepherded into the vesicles and how the
vesicles are formed without including the ESCRT complexes themselves remain
unknown. ESCRT complexes are soluble in the cytosol, are recruited to the membrane
sequentially as needed, and are then released back into the cytosol as the
vesicle pinches off.

The transcytotic pathway from the early endosome back to the plasma
membrane is not direct. The receptors first move from the early endosome to the
recycling endosome. The variety of pathways that different receptors follow from early
endosomes implies that, in addition to binding sites for their ligands and binding
sites for coated pits, many receptors also possess sorting signals that guide them
into the appropriate transport pathway (Figure 13-58).

Cells can regulate the release of membrane proteins from recycling
endosomes, thus adjusting the flux of proteins through the transcytotic pathway
according to need. This regulation, the mechanism of which is uncertain, allows
recycling endosomes to play an important part in adjusting the concentration of
specific plasma membrane proteins. Fat cells and muscle cells, for example,
contain large intracellular pools of the glucose transporters that are responsible for
the uptake of glucose across the plasma membrane. These membrane transport
proteins are stored in specialized recycling endosomes until the hormone
insulin stimulates the cell to increase its rate of glucose uptake. In response to the
insulin signal, transport vesicles rapidly bud from the recycling endosome and
deliver large numbers of glucose transporters to the plasma membrane, thereby
greatly increasing the rate of glucose uptake into the cell (Figure 13-59). Similarly,
kidney cells regulate the insertion of aquaporins and V-ATPase into the plasma
membrane to increase water and acid excretion, respectively, both in response to
hormones.

Figure 13-57 Conserved mechanism in multivesicular body formation and
virus budding. In the two topologically equivalent processes indicated by the
arrows, ESCRT complexes (not shown) shape membranes into buds that bulge
away from the cytosol.


Specialized Phagocytic Cells Can Ingest Large Particles 

Phagocytosis is a special form of endocytosis in which a cell uses large endocytic
vesicles called phagosomes to ingest large particles such as microorganisms and
dead cells. Phagocytosis is distinct, both in purpose and mechanism, from
macropinocytosis, which we discussed earlier. In protozoa, phagocytosis is a form of
feeding: large particles taken up into phagosomes end up in lysosomes, and the
products of the subsequent digestive processes pass into the cytosol to be used as
food. However, few cells in multicellular organisms are able to ingest such large
particles efficiently. In the gut of animals, for example, extracellular processes
break down food particles, and cells import the small products of hydrolysis.



Figure 13-58 Possible fates for transmembrane receptor proteins that
have been endocytosed. Three pathways from the early endosomal compartment
in an epithelial cell are shown. Retrieved receptors are returned (1) to the same
plasma membrane domain from which they came (recycling) or (2) via a recycling
endosome to a different domain of the plasma membrane (transcytosis).
(3) Receptors that are not specifically retrieved from early or recycling endosomes
follow the pathway from the endosomal compartment to lysosomes, where they are
degraded (degradation). If the ligand that is endocytosed with its receptor stays bound
to the receptor in the acidic environment of the endosome, it shares the same fate
as the receptor; otherwise, it is delivered to lysosomes. Recycling endosomes are
a way-station on the transcytotic pathway. In the transcytosis example shown here,
an antibody Fc receptor on a gut epithelial cell binds antibody and is endocytosed,
eventually carrying the antibody to the basolateral plasma membrane. The
receptor is called an Fc receptor because it binds the Fc part of the antibody
(discussed in Chapter 24).

Figure 13-59 Storage of plasma membrane proteins in recycling
endosomes. Recycling endosomes can serve as an intracellular storage site for
specialized plasma membrane proteins that can be mobilized when needed. In
the example shown, insulin binding to the insulin receptor triggers an intracellular
signaling pathway that causes the rapid insertion of glucose transporters into the
plasma membrane of a fat or muscle cell, greatly increasing its glucose intake.


Phagocytosis is important in most animals for purposes other than nutrition,
and it is carried out mainly by specialized cells, the so-called professional phagocytes.
In mammals, two important classes of white blood cells that act as professional
phagocytes are macrophages and neutrophils (Movie 13.5). These cells develop
from hemopoietic stem cells (discussed in Chapter 22), and they ingest
invading microorganisms to defend us against infection. Macrophages also have an
important role in scavenging senescent cells and cells that have died by apoptosis
(discussed in Chapter 18). In quantitative terms, the clearance of senescent and
dead cells is by far the most important: our macrophages, for example,
phagocytose more than 10^11 senescent red blood cells in each of us every day.
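
To put this number in perspective, the following sketch converts the daily figure into an average rate per second. It takes 10^11 cells per day, the lower bound quoted above, as its input; the function name is illustrative.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def average_rate_per_second(cells_per_day):
    """Average number of cells cleared per second for a given daily total."""
    return cells_per_day / SECONDS_PER_DAY


if __name__ == "__main__":
    # Lower bound from the text: more than 10^11 senescent red cells per day.
    rate = average_rate_per_second(1e11)
    print(f"about {rate:.2e} red blood cells phagocytosed per second")
```

Averaged over the whole body, that is on the order of a million red blood cells cleared every second.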

The diameter of a phagosome is determined by the size of its ingested
particles, and those particles can be almost as large as the phagocytic cell itself
(Figure 13-60). Phagosomes fuse with lysosomes, and the ingested material is then
degraded. Indigestible substances remain in the lysosomes, forming residual
bodies that can be excreted from cells by exocytosis, as mentioned earlier. Some of the
internalized plasma membrane components never reach the lysosome, because
they are retrieved from the phagosome in transport vesicles and returned to the
plasma membrane.

Some pathogenic bacteria have evolved elaborate mechanisms to prevent
phagosome-lysosome fusion. The bacterium Legionella pneumophila, for
example, which causes Legionnaires' disease (discussed in Chapter 23), injects into
its unfortunate host a Rab-modifying enzyme that causes certain Rab proteins
to misdirect membrane traffic, thereby preventing phagosome-lysosome fusion.
The bacterium, thus spared from lysosomal degradation, remains in the modified
phagosome, growing and dividing as an intracellular pathogen, protected from
the host's adaptive immune system.

Phagocytosis is a cargo-triggered process. That is, it requires the activation of
cell-surface receptors that transmit signals to the cell interior. Thus, to be
phagocytosed, particles must first bind to the surface of the phagocyte (although not
all particles that bind are ingested). Phagocytes have a variety of cell-surface
receptors that are functionally linked to the phagocytic machinery of the cell.
The best-characterized triggers of phagocytosis are antibodies, which protect us
by binding to the surface of infectious microorganisms (pathogens) and
initiating a series of events that culminate in the invader being phagocytosed. When
antibodies initially attack a pathogen, they coat it with antibody molecules that
bind to Fc receptors on the surface of macrophages and neutrophils, activating
the receptors to induce the phagocytic cell to extend pseudopods, which engulf
the particle and fuse at their tips to form a phagosome (Figure 13-61A). Localized
actin polymerization, initiated by Rho family GTPases and their activating
Rho-GEFs (discussed in Chapters 15 and 16), shapes the pseudopods. The activated
Rho GTPases switch on the kinase activity of local PI kinases to produce PI(4,5)P2
in the membrane (see Figure 13-11), which stimulates actin polymerization. To
seal off the phagosome and complete the engulfment, actin is depolymerized by a
PI 3-kinase that converts the PI(4,5)P2 to PI(3,4,5)P3, which is required for closure
of the phagosome and may also contribute to reshaping the actin network to help
drive the invagination of the forming phagosome (Figure 13-61B). In this way,
the ordered generation and consumption of specific phosphoinositides guides
sequential steps in phagosome formation.

Figure 13-60 Phagocytosis by a macrophage. A scanning electron
micrograph of a mouse macrophage phagocytosing two chemically altered red
blood cells. The red arrows point to edges of thin processes (pseudopods) of the
macrophage that are extending as collars to engulf the red cells. (Courtesy of Jean
Paul Revel.)

Figure 13-61 A neutrophil reshaping its plasma membrane during
phagocytosis. (A) An electron micrograph of a neutrophil phagocytosing a
bacterium, which is in the process of dividing. (B) Pseudopod extension and
phagosome formation are driven by actin polymerization and reorganization, which
respond to the accumulation of specific phosphoinositides in the membrane
of the forming phagosome: PI(4,5)P2 stimulates actin polymerization, which
promotes pseudopod formation, and then PI(3,4,5)P3 depolymerizes actin filaments
at the base. (A, courtesy of Dorothy F. Bainton, Phagocytic Mechanisms in Health
and Disease. New York: Intercontinental Medical Book Corporation, 1971.)

Several other classes of receptors that promote phagocytosis have been
characterized. Some recognize complement components, which collaborate with
antibodies in targeting microbes for destruction (discussed in Chapter 24). Others
directly recognize oligosaccharides on the surface of certain pathogens. Still
others recognize cells that have died by apoptosis. Apoptotic cells lose the
asymmetric distribution of phospholipids in their plasma membrane. As a consequence,
negatively charged phosphatidylserine, which is normally confined to the
cytosolic leaflet of the lipid bilayer, is now exposed on the outside of the cell, where it
helps to trigger the phagocytosis of the dead cell.

Remarkably, macrophages will also phagocytose a variety of inanimate
particles (such as glass or latex beads and asbestos fibers), yet they do not
phagocytose live cells in their own body. The living cells display “don't-eat-me” signals in
the form of cell-surface proteins that bind to inhibiting receptors on the surface of
macrophages. The inhibitory receptors recruit tyrosine phosphatases that
antagonize the intracellular signaling events required to initiate phagocytosis, thereby
locally inhibiting the phagocytic process. Thus phagocytosis, like many other cell
processes, depends on a balance between positive signals that activate the
process and negative signals that inhibit it. Apoptotic cells are thought both to gain
“eat-me” signals (such as extracellularly exposed phosphatidylserine) and to lose
their “don't-eat-me” signals, causing them to be very rapidly phagocytosed by
macrophages.


Summary 

Cells ingest fluid, molecules, and particles by endocytosis, in which localized regions
of the plasma membrane invaginate and pinch off to form endocytic vesicles. In
most cells, endocytosis internalizes a large fraction of the plasma membrane every
hour. The cells remain the same size because most of the plasma membrane
components (proteins and lipids) that are endocytosed are continually returned to the cell
surface by exocytosis. This large-scale endocytic-exocytic cycle is mediated largely
by clathrin-coated pits and vesicles, but clathrin-independent endocytic pathways
also contribute.

While many of the endocytosed molecules are quickly recycled to the plasma
membrane, others eventually end up in lysosomes, where they are degraded. Most of
the ligands that are endocytosed with their receptors dissociate from their receptors
in the acidic environment of the endosome and eventually end up in lysosomes,
while most of the receptors are recycled via transport vesicles back to the cell surface
for reuse. Many cell-surface signaling receptors become tagged with ubiquitin when
activated by binding their extracellular ligands. Ubiquitylation guides activated
receptors into clathrin-coated pits, and they and their ligands are efficiently internalized
and delivered to early endosomes.

Early endosomes rapidly mature into late endosomes. During maturation,
patches of the endosomal membrane containing ubiquitylated receptors
invaginate and pinch off to form intralumenal vesicles. This process is mediated by ESCRT
complexes and sequesters the receptors away from the cytosol, which terminates
their signaling activity. Late endosomes migrate along microtubules toward the
interior of the cell, where they fuse with one another and with lysosomes to form
endolysosomes, where degradation occurs.

In some cases, both receptor and ligand are transferred to a different plasma 
membrane domain, causing the ligand to be released at a different surface from 
where it originated, a process called transcytosis. In some cells, endocytosed plasma 
membrane proteins and lipids can be stored in recycling endosomes, for as long as 
necessary until they are needed. 


TRANSPORT FROM THE TRANS GOLGI NETWORK TO 
THE CELL EXTERIOR: EXOCYTOSIS 


Having considered the cell's endocytic and internal digestive systems and the
various types of incoming membrane traffic that converge on lysosomes, we now
return to the Golgi apparatus and examine the secretory pathways that lead
outward to the cell exterior. Transport vesicles destined for the plasma membrane
normally leave the TGN in a steady stream as irregularly shaped tubules. The
membrane proteins and the lipids in these vesicles provide new components
for the cell's plasma membrane, while the soluble proteins inside the vesicles
are secreted to the extracellular space. The fusion of the vesicles with the plasma
membrane is called exocytosis. This is the route, for example, by which cells
secrete most of the proteoglycans and glycoproteins of the extracellular matrix, as
discussed in Chapter 19.

All cells require this constitutive secretory pathway, which operates continuously
(Movie 13.6). Specialized secretory cells, however, have a second secretory
pathway in which soluble proteins and other substances are initially stored
in secretory vesicles for later release by exocytosis. This is the regulated secretory
pathway, found mainly in cells specialized for secreting products rapidly
on demand, such as hormones, neurotransmitters, or digestive enzymes
(Figure 13-62). In this section, we consider the role of the Golgi apparatus in both
of these pathways and compare the two mechanisms of secretion.

Figure 13-62 The constitutive and regulated secretory pathways. The two
pathways diverge in the TGN. The constitutive secretory pathway operates
in all cells. Many soluble proteins are continually secreted from the cell by
this pathway, which also supplies the plasma membrane with newly synthesized
membrane lipids and proteins. Specialized secretory cells also have a regulated
secretory pathway, by which selected proteins in the TGN are diverted into
secretory vesicles, where the proteins are concentrated and stored until an
extracellular signal stimulates their secretion. The regulated secretion of small
molecules, such as histamine and neurotransmitters, occurs by a similar pathway;
these molecules are actively transported from the cytosol into preformed secretory
vesicles. There they are often bound to specific macromolecules (proteoglycans,
for histamine) so that they can be stored at high concentration without generating
an excessively high osmotic pressure.

Many Proteins and Lipids Are Carried Automatically from the
Trans Golgi Network (TGN) to the Cell Surface

A cell capable of regulated secretion must separate at least three classes of proteins
before they leave the TGN: those destined for lysosomes (via endosomes), those
destined for secretory vesicles, and those destined for immediate delivery to the
cell surface (Figure 13-63). We have already discussed how proteins destined for
lysosomes are tagged with M6P for packaging into specific departing vesicles, and
analogous signals are thought to direct secretory proteins into secretory vesicles.
The nonselective constitutive secretory pathway transports most other proteins
directly to the cell surface. Because entry into this pathway does not require a
particular signal, it is also called the default pathway. Thus, in an unpolarized cell
such as a white blood cell or a fibroblast, it seems that any protein in the lumen of
the Golgi apparatus is automatically carried by the constitutive pathway to the cell
surface unless it is specifically returned to the ER, retained as a resident protein
in the Golgi apparatus itself, or selected for the pathways that lead to regulated
secretion or to lysosomes. In polarized cells, where different products have to be
delivered to different domains of the cell surface, we shall see that the options are
more complex.
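The sorting rules for an unpolarized cell can be summarized as a short decision procedure. The sketch below is only a cartoon of the logic described above: the class name, its fields, and the destination strings are illustrative assumptions, and the rule that a secretory signal is ignored in a cell lacking a regulated pathway is a simplification rather than an established result.

```python
from dataclasses import dataclass


@dataclass
class GolgiProtein:
    """Toy description of a protein in the Golgi lumen (field names are illustrative)."""
    name: str
    has_m6p: bool = False               # mannose 6-phosphate tag
    has_secretory_signal: bool = False  # signal for the regulated pathway
    er_retrieval: bool = False          # specifically returned to the ER
    golgi_resident: bool = False        # retained in the Golgi apparatus


def sort_in_tgn(p: GolgiProtein, cell_has_regulated_pathway: bool = True) -> str:
    """Return the destination of a protein in an unpolarized cell."""
    if p.er_retrieval:
        return "ER"
    if p.golgi_resident:
        return "Golgi"
    if p.has_m6p:
        return "lysosome (via endosomes)"
    if p.has_secretory_signal and cell_has_regulated_pathway:
        return "secretory vesicle"
    # No signal is required here: this is the constitutive (default) pathway.
    return "cell surface"
```

The final `return` carries the point of the default pathway: a protein reaches the cell surface because nothing diverted it, not because it was selected.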


Secretory Vesicles Bud from the Trans Golgi Network 


Cells that are specialized for secreting some of their products rapidly on demand
concentrate and store these products in secretory vesicles (often called dense-core
secretory granules because they have dense cores when viewed in the electron
microscope). Secretory vesicles form from the TGN, and they release their contents
to the cell exterior by exocytosis in response to specific signals. The secreted
product can be either a small molecule (such as histamine or a neuropeptide) or
a protein (such as a hormone or digestive enzyme).

Proteins destined for secretory vesicles (called secretory proteins) are packaged
into appropriate vesicles in the TGN by a mechanism that involves the selective
aggregation of the secretory proteins. Clumps of aggregated, electron-dense
material can be detected by electron microscopy in the lumen of the TGN. The
signals that direct secretory proteins into such aggregates are not well defined
and may be quite diverse. When a gene encoding a secretory protein is artificially
expressed in a secretory cell that normally does not make the protein, the foreign
protein is appropriately packaged into secretory vesicles. This observation
shows that, although the proteins that an individual cell expresses and packages
in secretory vesicles differ, they contain common sorting signals, which function
properly even when the proteins are expressed in cells that do not normally make
them.

Figure 13-63 The three best-understood pathways of protein sorting in the
trans Golgi network. (1) Proteins with the mannose 6-phosphate (M6P) marker (see
Figure 13-45) are diverted to lysosomes (via endosomes) in clathrin-coated
transport vesicles. (2) Proteins with signals directing them to secretory vesicles are
concentrated in such vesicles as part of a regulated secretory pathway that is present
only in specialized secretory cells. (3) In unpolarized cells, a constitutive secretory
pathway delivers proteins with no special features to the cell surface. In polarized
cells, such as epithelial cells, however, secreted and plasma membrane proteins
are selectively directed to either the apical or the basolateral plasma membrane
domain, so a specific signal must mediate at least one of these two pathways, as we
discuss later.

It is unclear how the aggregates of secretory proteins are segregated into secretory
vesicles. Secretory vesicles have unique proteins in their membrane, some of
which might serve as receptors for aggregated protein in the TGN. The aggregates
are much too big, however, for each molecule of the secreted protein to be bound
by its own cargo receptor, as occurs for transport of the lysosomal enzymes. The
uptake of the aggregates into secretory vesicles may therefore more closely resemble
the uptake of particles by phagocytosis at the cell surface, where the plasma
membrane zippers up around large structures.

Initially, most of the membrane of the secretory vesicles that leave the TGN is
only loosely wrapped around the clusters of aggregated secretory proteins.
Morphologically, these immature secretory vesicles resemble dilated trans Golgi
cisternae that have pinched off from the Golgi stack. As the vesicles mature, they
fuse with one another and their contents become concentrated (Figure 13-64A),
probably as the result of both the continuous retrieval of membrane that is recycled
to the TGN, and the progressive acidification of the vesicle lumen that results
from the increasing concentration of V-type ATPases in the vesicle membrane
that acidify all endocytic and exocytic organelles (see Figure 13-37). The degree of
concentration of proteins during the formation and maturation of secretory vesicles
is only a small part of the total 200-400-fold concentration of these proteins
that occurs after they leave the ER. Secretory and membrane proteins become
concentrated as they move from the ER through the Golgi apparatus because of
an extensive retrograde retrieval process mediated by COPI-coated transport vesicles
that carry soluble ER resident proteins back to the ER, while excluding the
secretory and membrane proteins (see Figure 13-25).

Membrane recycling is important for returning Golgi components to the Golgi
apparatus, as well as for concentrating the contents of secretory vesicles. The vesicles
that mediate this retrieval originate as clathrin-coated buds on the surface of
immature secretory vesicles, often being seen even on budding secretory vesicles
that have not yet pinched off from the Golgi stack (Figure 13-64B).

Because the final mature secretory vesicles are so densely filled with contents, 
the secretory cell can disgorge large amounts of material promptly by exocytosis 
when triggered to do so (Figure 13-65). 

Figure 13-64 The formation of secretory vesicles. (A) Secretory
proteins become segregated and highly concentrated in secretory vesicles by two
mechanisms. First, they aggregate in the ionic environment of the TGN; often the
aggregates become more condensed as secretory vesicles mature and their lumen
becomes more acidic. Second, clathrin-coated vesicles retrieve excess membrane
and lumenal content present in immature secretory vesicles as the secretory vesicles
mature. (B) This electron micrograph shows secretory vesicles forming from the
TGN in an insulin-secreting β cell of the pancreas. Anti-clathrin antibodies
conjugated to gold spheres (black dots) have been used to locate clathrin molecules.
The immature secretory vesicles, which contain insulin precursor protein
(proinsulin), contain clathrin patches, which are no longer seen on the mature
secretory vesicle. (B, courtesy of Lelio Orci.)

Precursors of Secretory Proteins Are Proteolytically Processed
During the Formation of Secretory Vesicles

Concentration is not the only process to which secretory proteins are subjected as
the secretory vesicles mature. Many protein hormones and small neuropeptides,
as well as many secreted hydrolytic enzymes, are synthesized as inactive precursors.
Proteolysis is necessary to liberate the active molecules from these precursor
proteins. The cleavages occur in the secretory vesicles and sometimes in the
extracellular fluid after secretion. Additionally, many of the precursor proteins have an
N-terminal pro-peptide that is cleaved off to yield the mature protein. These proteins
are synthesized as pre-pro-proteins, the pre-peptide consisting of the ER signal
peptide that is cleaved off earlier in the rough ER (see Figure 12-36). In other
cases, peptide signaling molecules are made as polyproteins that contain multiple
copies of the same amino acid sequence. In still more complex cases, a variety of
peptide signaling molecules are synthesized as parts of a single polyprotein that
acts as a precursor for multiple end products, which are individually cleaved from
the initial polypeptide chain. The same polyprotein may be processed in various
ways to produce different peptides in different cell types (Figure 13-66).
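The caption of Figure 13-66 notes that the initial cleavages of a prohormone are made next to pairs of positively charged amino acids (Lys-Arg, Lys-Lys, Arg-Lys, or Arg-Arg). As a rough illustration of how one polyprotein yields several products, the sketch below splits a one-letter amino acid sequence at every such pair. The function name and the toy sequence are invented for the example, and dropping the pair itself is a crude stand-in for cleavage followed by trimming, not a model of any particular protease.

```python
import re

# Pairs of positively charged residues: Lys-Arg, Lys-Lys, Arg-Lys, Arg-Arg
DIBASIC_PAIR = re.compile(r"[KR]{2}")


def split_at_dibasic_pairs(polyprotein: str) -> list[str]:
    """Split a one-letter amino acid sequence at every dibasic pair.

    The pair itself is discarded, and empty fragments are dropped.
    """
    fragments = DIBASIC_PAIR.split(polyprotein.upper())
    return [fragment for fragment in fragments if fragment]


# Made-up sequence for illustration only; it is not a real prohormone.
toy_polyprotein = "AAAAKRGGGGRRCCCCKKDDDD"
print(split_at_dibasic_pairs(toy_polyprotein))
```

A cell type that lacked one of the processing enzymes would, in this picture, leave some of the fragments joined, which is one way to think about how the same precursor gives different peptides in different cells.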

Why is proteolytic processing so common in the secretory pathway? Some
of the peptides produced in this way, such as the enkephalins (five-amino-acid
neuropeptides with morphine-like activity), are undoubtedly too short in their
mature forms to be co-translationally transported into the ER lumen or to include
the necessary signal for packaging into secretory vesicles. In addition, for secreted
hydrolytic enzymes (or any other protein whose activity could be harmful inside
the cell that makes it), delaying activation of the protein until it reaches a secretory
vesicle, or until after it has been secreted, has a clear advantage: the delay
prevents the protein from acting prematurely inside the cell in which it is
synthesized.




Figure 13-65 Exocytosis of secretory vesicles. The process is illustrated
schematically (top) and in an electron micrograph that shows the release of
insulin from a secretory vesicle of a pancreatic β cell. (Courtesy of Lelio
Orci, from L. Orci, J.-D. Vassalli and A. Perrelet, Sci. Am. 259:85-94, 1988.)


Secretory Vesicles Wait Near the Plasma Membrane Until Signaled 
to Release Their Contents 


Once loaded, a secretory vesicle has to reach the site of secretion, which in some
cells is far away from the TGN. Nerve cells are the most extreme example. Secretory
proteins, such as peptide neurotransmitters (neuropeptides), which will be
released from nerve terminals at the end of the axon, are made and packaged into
secretory vesicles in the cell body. They then travel along the axon to the nerve
terminals, which can be a meter or more away. As discussed in Chapter 16, motor
proteins propel the vesicles along axonal microtubules, whose uniform orientation
guides the vesicles in the proper direction. Microtubules also guide transport
vesicles to the cell surface for constitutive exocytosis.

Whereas transport vesicles containing materials for constitutive release fuse
with the plasma membrane once they arrive there, secretory vesicles in the regulated
pathway wait at the membrane until the cell receives a signal to secrete, and
they then fuse. The signal can be an electrical nerve impulse (an action potential)
or an extracellular signal molecule, such as a hormone: in either case, it leads to a
transient increase in the concentration of free Ca²⁺ in the cytosol.


For Rapid Exocytosis, Synaptic Vesicles Are Primed at the 
Presynaptic Plasma Membrane 

Nerve cells (and some endocrine cells) contain two types of secretory vesicles. As
for all secretory cells, these cells package proteins and neuropeptides in dense-cored
secretory vesicles in the standard way for release by the regulated secretory
pathway. In addition, however, they use another specialized class of tiny (~50 nm
diameter) secretory vesicles called synaptic vesicles. These vesicles store small
neurotransmitter molecules, such as acetylcholine, glutamate, glycine, and
γ-aminobutyric acid (GABA), which mediate rapid signaling from a nerve cell to its
target cell at chemical synapses. When an action potential arrives at a nerve terminal, it
causes an influx of Ca²⁺ through voltage-gated Ca²⁺ channels, which triggers the
synaptic vesicles to fuse with the plasma membrane and release their contents
to the extracellular space (see Figure 11-36). Some neurons fire more than 1000
times per second, releasing neurotransmitters each time.

Figure 13-66 Alternative processing pathways for the prohormone
polyprotein proopiomelanocortin. The initial cleavages are made by proteases
that cut next to pairs of positively charged amino acids (Lys-Arg, Lys-Lys, Arg-Lys,
or Arg-Arg pairs). Trimming reactions then produce the final secreted products.
Different cell types produce different concentrations of individual processing
enzymes, so that the same prohormone precursor is cleaved to produce different
peptide hormones. In the anterior lobe of the pituitary gland, for example, only
corticotropin (ACTH) and β-lipotropin are produced from proopiomelanocortin,
whereas in the intermediate lobe of the pituitary gland mainly α-melanocyte
stimulating hormone (α-MSH), γ-lipotropin, β-MSH, and β-endorphin are produced:
α-MSH from ACTH and the other three from β-lipotropin, as shown.

Figure 13-67 Exocytosis of synaptic vesicles. For orientation at a synapse, see Figure 11-36. (A) The trans-SNARE complex
responsible for docking synaptic vesicles at the plasma membrane of nerve terminals consists of three proteins. The v-SNARE
synaptobrevin and the t-SNARE syntaxin are both transmembrane proteins, and each contributes one α helix to the complex.
By contrast to other SNAREs discussed earlier, the t-SNARE SNAP25 is a peripheral membrane protein that contributes two α
helices to the four-helix bundle; the two helices are connected by a loop (dashed line) that lies parallel to the membrane and has
fatty acyl chains (not shown) attached to anchor it there. The four α helices are shown as rods for simplicity. (B) At the synapse,
the basic SNARE machinery is modulated by the Ca²⁺ sensor synaptotagmin and an additional protein called complexin.
Synaptic vesicles first dock at the membrane (Step 1), and the SNARE bundle partially assembles (Step 2), resulting in a
"primed vesicle" that is already drawn close to the membrane. The SNARE bundle assembles further but the additional binding
of complexin prevents fusion (Step 3). Upon arrival of an action potential, Ca²⁺ enters the cell and binds to synaptotagmin,
which releases the block and opens a fusion pore (Step 4). Further rearrangements complete the fusion reaction (Step 5) and
release the fusion machinery, which now can be reused. This elaborate arrangement allows the fusion machinery to respond
on the millisecond time scale essential for rapid and repetitive synaptic signaling. (A, adapted from R.B. Sutton et al., Nature
395:347-353, 1998. With permission from Macmillan Publishers Ltd.; B, adapted from Z.P. Pang and T.C. Südhof, Curr. Opin.
Cell Biol. 22:496-505, 2010. With permission from Elsevier.)

The speed of transmitter release (taking only milliseconds) indicates that the
proteins mediating the fusion reaction do not undergo complex, multistep
rearrangements. Rather, after vesicles have been docked at the presynaptic plasma
membrane, they undergo a priming step, which prepares them for rapid fusion.
In the primed state, the SNAREs are partly paired, but their helices are not fully
wound into the final four-helix bundle required for fusion (Figure 13-67).
Proteins called complexins freeze the SNARE complexes in this metastable state. The
brake imposed by the complexins is released by another synaptic vesicle protein,
synaptotagmin, which contains Ca²⁺-binding domains. A rise in cytosolic Ca²⁺
triggers binding of synaptotagmin to phospholipids and to the SNAREs, displacing
the complexins. As the SNARE bundle zippers up completely, a fusion pore
opens and the neurotransmitters are released. At a typical synapse, only a small
number of the docked vesicles are primed and ready for exocytosis. The use of
only a small number of vesicles at a time allows each synapse to fire over and
over again in quick succession. With each firing, new synaptic vesicles dock and
become primed to replace those that have fused and released their contents.
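The sequence of docking, priming, clamping, and Ca²⁺-triggered fusion can be pictured as a small state machine in which only the last transition waits for a signal. The sketch below is a cartoon under that assumption: the state names and the `step` function are invented for illustration, and real vesicles do not advance in discrete, synchronous steps.

```python
from enum import Enum, auto


class VesicleState(Enum):
    DOCKED = auto()   # vesicle held at the presynaptic membrane
    PRIMED = auto()   # SNAREs partly paired
    CLAMPED = auto()  # complexin holds the SNAREs in a metastable state
    FUSED = auto()    # SNARE bundle fully zippered, fusion pore open


def step(state: VesicleState, calcium_high: bool) -> VesicleState:
    """Advance a vesicle by one transition; only fusion depends on Ca2+."""
    if state is VesicleState.DOCKED:
        return VesicleState.PRIMED
    if state is VesicleState.PRIMED:
        return VesicleState.CLAMPED
    if state is VesicleState.CLAMPED and calcium_high:
        # Synaptotagmin binds Ca2+ and displaces complexin.
        return VesicleState.FUSED
    return state  # clamped vesicles wait; fused vesicles stay fused
```

Because all of the slow preparation happens before the signal arrives, the Ca²⁺-dependent step is a single transition, which is the point the text makes about millisecond release.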

Synaptic Vesicles Can Form Directly from Endocytic Vesicles 

For the nerve terminal to respond rapidly and repeatedly, synaptic vesicles need
to be replenished very quickly after they discharge. Thus, most synaptic vesicles
are generated not from the Golgi membrane in the nerve cell body but by local
recycling from the presynaptic plasma membrane in the nerve terminals (Figure
13-68). Similarly, newly made membrane components of the synaptic vesicles are
initially delivered to the plasma membrane by the constitutive secretory pathway
and then retrieved by endocytosis. But instead of fusing with endosomes, most of
the endocytic vesicles immediately fill with neurotransmitter to become synaptic
vesicles.

The membrane components of a synaptic vesicle include transporters specialized
for the uptake of neurotransmitter from the cytosol, where the small-molecule
neurotransmitters that mediate fast synaptic signaling are synthesized. Once
filled with neurotransmitter, the synaptic vesicles can be used again (see Figure
13-68). Because synaptic vesicles are abundant and relatively uniform in size,
they can be purified in large numbers and, consequently, are the best-characterized
organelle of the cell, in that all of their membrane components have been
identified by quantitative proteomic analyses (Figure 13-69). Extending this analysis
to a complete presynaptic terminal allows us to model the crowded environment
in which these reactions occur.
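The per-vesicle counts quoted in the caption of Figure 13-69 (7000 phospholipids, 5700 cholesterol molecules, 1800 neurotransmitter molecules) lend themselves to a quick back-of-the-envelope check. The sketch below computes the cholesterol share of the membrane lipids and the transmitter concentration in the lumen. The inner radius of 20 nm is an assumption made here for illustration (the text gives only the ~50 nm overall diameter), as are the function names.

```python
import math

AVOGADRO = 6.02214076e23  # molecules per mole

# Counts quoted for one synaptic vesicle in the caption of Figure 13-69
PHOSPHOLIPIDS = 7000
CHOLESTEROL = 5700
NEUROTRANSMITTER = 1800


def cholesterol_mole_fraction() -> float:
    """Fraction of membrane lipid molecules that are cholesterol."""
    return CHOLESTEROL / (CHOLESTEROL + PHOSPHOLIPIDS)


def lumenal_concentration_molar(inner_radius_nm: float = 20.0) -> float:
    """Transmitter concentration (mol/L) in a spherical lumen.

    inner_radius_nm is an assumed value, not a figure from the text.
    """
    radius_dm = inner_radius_nm * 1e-8  # 1 nm = 1e-8 dm
    volume_litres = 4.0 / 3.0 * math.pi * radius_dm ** 3  # 1 dm^3 = 1 L
    return NEUROTRANSMITTER / AVOGADRO / volume_litres


print(f"cholesterol fraction of lipids: {cholesterol_mole_fraction():.2f}")
print(f"lumenal transmitter: {lumenal_concentration_molar():.3f} M")
```

With the assumed radius, the lumenal concentration comes out on the order of 0.1 M; because the volume scales with the cube of the radius, the estimate is quite sensitive to that assumption.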

Figure 13-68 The formation of synaptic vesicles in a nerve cell. These tiny uniform
vesicles are found only in nerve cells and in some endocrine cells, where they store and
secrete small-molecule neurotransmitters. The import of neurotransmitter directly
into the small endocytic vesicles that form from the plasma membrane is mediated
by membrane transporters that function as antiports and are driven by an H⁺ gradient
maintained by V-ATPase H⁺ pumps in the vesicle membrane (discussed in Chapter 11).

Figure 13-69 Scale models of a brain presynaptic terminal and a synaptic vesicle.
The illustrations show sections through a presynaptic terminal (A; enlarged in
B) and a synaptic vesicle (C) in which proteins and lipids are drawn to scale based
on their known stoichiometry and either known or approximated structures. The
relative localization of protein molecules in different regions of the presynaptic
terminal was inferred from super-resolution imaging and electron microscopy. The
model in (A) contains 300,000 proteins of 60 different kinds that vary in
abundance from 150 copies to 20,000 copies. In the model in (C), only 70% of the
membrane proteins present in the membrane are shown; a complete model would
therefore show a membrane that is even more crowded than this picture suggests
(Movie 13.7). Each synaptic vesicle membrane contains 7000 phospholipid molecules
and 5700 cholesterol molecules. Each also contains close to 50 different integral
membrane protein molecules, which vary widely in their relative abundance and
together contribute about 600 transmembrane α helices. The transmembrane v-SNARE
synaptobrevin is the most abundant protein in the vesicle (~70 copies per vesicle).
By contrast, the V-ATPase, which uses ATP hydrolysis to pump H⁺ into the vesicle
lumen, is present in 1-2 copies per vesicle. The H⁺ gradient provides the energy for
neurotransmitter import by an H⁺/neurotransmitter antiport, which loads each
vesicle with 1800 neurotransmitter molecules, such as glutamate, one of which is
shown to scale. (A and B, from B.G. Wilhelm et al., Science 344:1023-1028, 2014.
With permission from AAAS; C, adapted from S. Takamori et al., Cell 127:831-846,
2006. With permission from Elsevier.)

Secretory Vesicle Membrane Components Are Quickly Removed
from the Plasma Membrane

When a secretory vesicle fuses with the plasma membrane, its contents are
discharged from the cell by exocytosis, and its membrane becomes part of the
plasma membrane. Although this should greatly increase the surface area of the
plasma membrane, it does so only transiently, because membrane components
are removed from the surface by endocytosis almost as fast as they are added by
exocytosis, a process reminiscent of the endocytic-exocytic cycle discussed earlier.
After their removal from the plasma membrane, the proteins of the secretory
vesicle membrane are either recycled or shuttled to lysosomes for degradation.
The amount of secretory vesicle membrane that is temporarily added to the
plasma membrane can be enormous: in a pancreatic acinar cell discharging
digestive enzymes for delivery to the gut lumen, about 900 µm² of vesicle membrane
is inserted into the apical plasma membrane (whose area is only 30 µm²)
when the cell is stimulated to secrete.
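The scale of this example is easier to appreciate as a ratio. The short sketch below uses only the two areas quoted for the pancreatic acinar cell; the function names are invented, and the "no retrieval" case is a hypothetical limit, since the text says the added membrane is removed almost as fast as it arrives.

```python
VESICLE_MEMBRANE_UM2 = 900.0  # vesicle membrane inserted on stimulation
APICAL_MEMBRANE_UM2 = 30.0    # resting apical plasma membrane area


def fold_excess(added: float = VESICLE_MEMBRANE_UM2,
                resting: float = APICAL_MEMBRANE_UM2) -> float:
    """How many times the resting apical area is delivered by exocytosis."""
    return added / resting


def area_without_retrieval(added: float = VESICLE_MEMBRANE_UM2,
                           resting: float = APICAL_MEMBRANE_UM2) -> float:
    """Apical area if none of the added membrane were endocytosed."""
    return resting + added


print(fold_excess())             # membrane delivered, in units of resting area
print(area_without_retrieval())  # hypothetical area with no endocytosis, in µm²
```

The inserted membrane is 30 times the resting apical area, so without compensating endocytosis the apical surface would have to grow from 30 µm² to 930 µm².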

Control of membrane traffic thus has a major role in maintaining the composition
of the various membranes of the cell. To maintain each membrane-enclosed
compartment in the secretory and endocytic pathways at a constant size, the balance
between the outward and inward flows of membrane needs to be precisely
regulated. For cells to grow, however, the forward flow needs to be greater than
the retrograde flow, so that the membrane can increase in area. For cells to maintain
a constant size, the forward and retrograde flows must be equal. We still know
very little about the mechanisms that coordinate these flows.
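The balance condition amounts to simple bookkeeping: the change in membrane area per unit time is the forward flow minus the retrograde flow. The sketch below tracks the area of one compartment over a few time steps under the assumption of constant flows; the function names, units, and numbers are hypothetical and serve only to show the two cases named in the text.

```python
def net_membrane_flow(forward: float, retrograde: float) -> float:
    """Net membrane gained per time step (same area units as the inputs)."""
    return forward - retrograde


def area_over_time(initial_area: float, forward: float,
                   retrograde: float, steps: int) -> list[float]:
    """Membrane area at each time step, assuming constant flows."""
    areas = [initial_area]
    for _ in range(steps):
        areas.append(areas[-1] + net_membrane_flow(forward, retrograde))
    return areas


# Equal flows keep the area constant; a forward excess lets it grow.
print(area_over_time(100.0, forward=5.0, retrograde=5.0, steps=3))
print(area_over_time(100.0, forward=6.0, retrograde=5.0, steps=3))
```

How cells actually sense and adjust these flows is not captured here; as the text notes, the coordinating mechanisms are largely unknown.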

Some Regulated Exocytosis Events Serve to Enlarge the Plasma 
Membrane 

An important task of regulated exocytosis is to deliver more membrane to enlarge
the surface area of a cell's plasma membrane when such a need arises. A spectacular
example is the plasma membrane expansion that occurs during the cellularization
process in a fly embryo, which initially is a syncytium: a single cell
containing about 6000 nuclei surrounded by a single plasma membrane (see Figure
21-15). Within tens of minutes, the embryo is converted into the same number of
cells. This process of cellularization requires a vast amount of new plasma membrane,
which is added by a carefully orchestrated fusion of cytoplasmic vesicles,
eventually forming the plasma membranes that enclose the separate cells. Similar
vesicle fusion events are required to enlarge the plasma membrane when other
animal cells or plant cells divide during cytokinesis (discussed in Chapter 17).

Many animal cells, especially those subjected to mechanical stresses, frequently
experience small ruptures in their plasma membrane. In a remarkable
process thought to involve both homotypic vesicle-vesicle fusion and exocytosis,
a temporary cell-surface patch is quickly fashioned from locally available
internal-membrane sources, such as lysosomes. In addition to providing an emergency
barrier against leaks, the patch reduces membrane tension over the wounded
area, allowing the bilayer to flow back together to restore continuity and seal the
puncture. In this membrane repair process, the fusion and exocytosis of vesicles is
triggered by the sudden increase of Ca²⁺, which is abundant in the extracellular
space and rushes into the cell as soon as the plasma membrane is punctured.
Figure 13-70 shows four examples in which regulated exocytosis leads to plasma
membrane expansion.

Figure 13-70 Four examples of regulated exocytosis leading to plasma membrane
enlargement. The vesicles fusing with the plasma membrane during cytokinesis (A)
and phagocytosis (B) are thought to be derived from endosomes, whereas those
involved in wound repair (C) may be derived from lysosomes. The vast amount
of new plasma membrane inserted during cellularization in a fly embryo is added
by the fusion of cytoplasmic vesicles (D).

Polarized Cells Direct Proteins from the Trans Golgi Network to the
Appropriate Domain of the Plasma Membrane

Most cells in tissues are polarized, with two or more molecularly and functionally
distinct plasma membrane domains. This raises the general problem of how the
delivery of membrane from the Golgi apparatus is organized so as to maintain
the differences between one cell-surface domain and another. A typical epithelial
cell, for example, has an apical domain, which faces either an internal cavity
or the outside world and often has specialized features such as cilia or a brush
border of microvilli. It also has a basolateral domain, which covers the rest of the
cell. The two domains are separated by a ring of tight junctions (see Figure 19-21),
which prevent proteins and lipids (in the outer leaflet of the lipid bilayer) from
diffusing between the two domains, so that the differences between the two domains
are maintained.

In principle, differences between plasma membrane domains need not
depend on the targeted delivery of the appropriate membrane components.
Instead, membrane components could be delivered to all regions of the cell surface
indiscriminately but then be selectively stabilized in some locations and
selectively eliminated in others. Although this strategy of random delivery followed
by selective retention or removal seems to be used in certain cases, deliveries
are often specifically directed to the appropriate membrane domain. Epithelial
cells lining the gut, for example, secrete digestive enzymes and mucus at their apical
surface and components of the basal lamina at their basolateral surface. Such
cells must have ways of directing vesicles carrying different cargoes to different
plasma membrane domains. Proteins from the ER destined for different domains
travel together until they reach the TGN, where they are separated and dispatched
in secretory or transport vesicles to the appropriate plasma membrane domain
(Figure 13-71).

The apical plasma membrane of most epithelial cells is greatly enriched in
glycosphingolipids, which help protect this exposed surface from damage, for
example from the digestive enzymes and low pH in sites such as the gut or stomach,
respectively. Similarly, plasma membrane proteins that are linked to the lipid
bilayer by a GPI anchor (see Figure 12-52) are found predominantly in the apical
plasma membrane. If recombinant DNA techniques are used to attach a GPI
anchor to a protein that would normally be delivered to the basolateral surface,
the protein is usually delivered to the apical surface instead. GPI-anchored proteins
are thought to be directed to the apical membrane because they associate
with glycosphingolipids in lipid rafts that form in the membrane of the TGN. As
discussed in Chapter 10, lipid rafts form in the TGN and plasma membrane when
glycosphingolipids and cholesterol molecules self-associate (see Figure 10-13).
Having selected a unique set of cargo molecules, the rafts then bud from the TGN
into transport vesicles destined for the apical plasma membrane. This process is
similar to the selective partitioning of some membrane proteins into the specialized
lipid domains in caveolae at the plasma membrane discussed earlier.

Figure 13-71 Two ways of sorting plasma membrane proteins in a
polarized epithelial cell. (A) In the direct pathway, proteins destined for different
plasma membrane domains are sorted and packaged into different transport vesicles.
The lipid-raft-dependent delivery system to the apical domain described in the
text is an example of the direct pathway. (B) In the indirect pathway, a protein is
retrieved from the inappropriate plasma membrane domain by endocytosis and
then transported to the correct domain via early endosomes, that is, by transcytosis.
The indirect pathway, for example, is used in liver hepatocytes to deliver proteins to
the apical domain that lines bile ducts.

Membrane proteins destined for delivery to the basolateral membrane contain
sorting signals in their cytosolic tail. When present in an appropriate structural
context, these signals are recognized by coat proteins that package them into
appropriate transport vesicles in the TGN. The same basolateral signals that are
recognized in the TGN also function in early endosomes to redirect the proteins
back to the basolateral plasma membrane after they have been endocytosed.
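The two signals discussed for polarized epithelial cells, a GPI anchor and a basolateral sorting signal in the cytosolic tail, can be combined into a toy rule. The sketch below is an illustrative simplification: the function name and the "unspecified" outcome are assumptions, and giving the GPI anchor precedence reflects only the observation that a GPI-tagged basolateral protein is usually, not always, delivered apically.

```python
def plasma_membrane_domain(has_gpi_anchor: bool,
                           has_basolateral_signal: bool) -> str:
    """Predict the target domain of a membrane protein leaving the TGN."""
    if has_gpi_anchor:
        # GPI-anchored proteins associate with glycosphingolipid rafts in the TGN.
        return "apical"
    if has_basolateral_signal:
        # Cytosolic-tail signal recognized by coat proteins.
        return "basolateral"
    # The text does not say where a protein with neither feature goes.
    return "unspecified"
```

The same basolateral branch would apply a second time in early endosomes, where the text notes that the signal redirects endocytosed proteins back to the basolateral surface.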


Summary

Cells can secrete molecules by exocytosis in either a constitutive or a regulated
fashion. Whereas the regulated pathways operate only in specialized secretory cells,
a constitutive secretory pathway operates in all eukaryotic cells, characterized by
continual vesicle transport from the TGN to the plasma membrane. In the regulated
pathways, the molecules are stored either in secretory vesicles or in synaptic vesicles,
which do not fuse with the plasma membrane to release their contents until they
receive an appropriate signal. Secretory vesicles containing proteins for secretion
bud from the TGN. The secretory proteins become concentrated during the formation
and maturation of the secretory vesicles. Synaptic vesicles, which are confined
to nerve cells and some endocrine cells, form from both endocytic vesicles and from
endosomes, and they mediate the regulated secretion of small-molecule
neurotransmitters at the axon terminals of nerve cells.

Proteins are delivered from the TGN to the plasma membrane by the constitutive
pathway unless they are diverted into other pathways or retained in the Golgi
apparatus. In polarized cells, the transport pathways from the TGN to the plasma
membrane operate selectively to ensure that different sets of membrane proteins,
secreted proteins, and lipids are delivered to the different domains of the plasma
membrane.


WHAT WE DON'T KNOW

• How are targeting and fusion proteins such as SNAREs regulated,
so that they can be returned to their respective donor compartments in an
inactive state?

• How does a cell balance exocytic and endocytic events to keep its
plasma membrane a constant size?

• Can newly formed daughter cells generate a Golgi apparatus de novo,
or do they have to inherit it?

• How do lysosomes avoid digesting their own membranes?

• How does a cell maintain the right amount of every component
(organelles, molecules), and how does it change these amounts as needed
(for example, to greatly expand the endoplasmic reticulum when the cell
needs to produce large amounts of secreted proteins)?


PROBLEMS 

Which statements are true? Explain why or why not. 

13-1 In all events involving fusion of a vesicle to a target
membrane, the cytosolic leaflets of the vesicle and target
bilayers always fuse together, as do the leaflets that are not
in contact with the cytosol.

13-2 There is one strict requirement for the exit of a protein
from the ER: it must be correctly folded.

13-3 All the glycoproteins and glycolipids in intracellular
membranes have oligosaccharide chains facing the
lumenal side, and all those in the plasma membrane have
oligosaccharide chains facing the outside of the cell.

Discuss the following problems. 

13-4 In a nondividing cell such as a liver cell, why must
the flow of membrane between compartments be balanced,
so that the retrieval pathways match the outward
flow? Would you expect the same balanced flow in a gut
epithelial cell, which is actively dividing?

13-5 Enveloped viruses, which have a membrane coat,
gain access to the cytosol by fusing with a cell membrane.
Why do you suppose that these viruses encode their own
special fusion protein, rather than making use of a cell's
SNAREs?

13-6 For fusion of a vesicle with its target membrane to
occur, the membranes have to be brought to within 1.5 nm
so that the two bilayers can join (Figure Q13-1). Assuming
that the relevant portions of the two membranes at the
fusion site are circular regions 1.5 nm in diameter, calculate
the number of water molecules that would remain
between the membranes. (Water is 55.5 M and the volume
of a cylinder is πr²h.) Given that an average phospholipid
occupies a membrane surface area of 0.2 nm², how many
phospholipids would be present in each of the opposing
monolayers at the fusion site? Are there sufficient water
molecules to bind to the hydrophilic head groups of this
number of phospholipids? (It is estimated that 10-12 water
molecules are normally associated with each phospholipid
head group at the exposed surface of a membrane.)

Figure Q13-1 Close approach of a vesicle and its target membrane
in preparation for fusion (Problem 13-6).
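
The quantities in Problem 13-6 can be set up numerically along the following lines. This is only a sketch of the unit handling, not an answer key: the function names are illustrative, the conversion 1 nm³ = 10⁻²⁴ L and Avogadro's number (6.022 × 10²³ per mole) are supplied here rather than taken from the problem, and the gap is treated as a simple cylinder, as the problem suggests.

```python
import math

AVOGADRO = 6.022e23        # molecules per mole (supplied here, not in the problem)
WATER_MOLARITY = 55.5      # mol/L, as stated in the problem
LITERS_PER_NM3 = 1e-24     # 1 nm^3 = 1e-27 m^3 = 1e-24 L


def water_molecules_in_gap(diameter_nm, gap_nm):
    """Number of water molecules in a cylinder of the given diameter and height."""
    radius_nm = diameter_nm / 2
    volume_nm3 = math.pi * radius_nm ** 2 * gap_nm
    moles = WATER_MOLARITY * volume_nm3 * LITERS_PER_NM3
    return moles * AVOGADRO


def lipids_per_monolayer(diameter_nm, area_per_lipid_nm2):
    """Number of phospholipids covering a circular patch of one monolayer."""
    patch_area_nm2 = math.pi * (diameter_nm / 2) ** 2
    return patch_area_nm2 / area_per_lipid_nm2
```

Calling `water_molecules_in_gap(1.5, 1.5)` and `lipids_per_monolayer(1.5, 0.2)` gives the two numbers that the problem asks you to compare against the 10-12 water molecules normally associated with each head group.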

13-7 SNAREs exist as complementary partners that
carry out membrane fusions between appropriate vesicles
and their target membranes. In this way, a vesicle
with a particular variety of v-SNARE will fuse only with a
membrane that carries the complementary t-SNARE. In
some instances, however, fusions of identical membranes
(homotypic fusions) are known to occur. For example,
when a yeast cell forms a bud, vesicles derived from the
mother cell's vacuole move into the bud where they fuse
with one another to form a new vacuole. These vesicles
carry both v-SNAREs and t-SNAREs. Are both types of
SNAREs essential for this homotypic fusion event?

To test this point, you have developed an ingenious
assay for fusion of vacuolar vesicles. You prepare vesicles
from two different mutant strains of yeast: strain B has a
defective gene for vacuolar alkaline phosphatase (Pase);
strain A is defective for the protease that converts the
precursor of alkaline phosphatase (pro-Pase) into its active
form (Pase) (Figure Q13-2A). Neither strain has active
alkaline phosphatase, but when extracts of the strains are
mixed, vesicle fusion generates active alkaline phosphatase,
which can be easily measured (Figure Q13-2).

Now you delete the genes for the vacuolar
v-SNARE, t-SNARE, or both in each of the two yeast strains.
You prepare vacuolar vesicles from each and test them for
their ability to fuse, as measured by the alkaline phosphatase
assay (Figure Q13-2B).

What do these data say about the requirements for 
v-SNAREs and t-SNAREs in the fusion of vacuolar vesicles? 
Does it matter which kind of SNARE is on which vesicle? 

13-8 If you were to remove the ER retrieval signal from
protein disulfide isomerase (PDI), which is normally a soluble
resident of the ER lumen, where would you expect the
modified PDI to be located?

13-9 The KDEL receptor must shuttle back and forth
between the ER and the Golgi apparatus to accomplish
its task of ensuring that soluble ER proteins are retained
in the ER lumen. In which compartment does the KDEL
receptor bind its ligands more tightly? In which compartment
does it bind its ligands more weakly? What is thought
to be the basis for its different binding affinities in the two
compartments? If you were designing the system, in which
compartment would you have the highest concentration of
KDEL receptor? Would you predict that the KDEL receptor,
which is a transmembrane protein, would itself possess an
ER retrieval signal?

13-10 How does the low pH of lysosomes protect the rest 
of the cell from lysosomal enzymes in case the lysosome 
breaks? 
Figure Q13-2 SNARE requirements for vesicle fusion (Problem 13-7).
(A) Scheme for measuring the fusion of vacuolar vesicles. (B) Results
of fusions of vesicles with different combinations of v-SNAREs and
t-SNAREs. The SNAREs present on the vesicles of the two strains
are indicated as v (v-SNARE) and t (t-SNARE).

13-11 Melanosomes are specialized lysosomes that store
pigments for eventual release by exocytosis. Various cells
such as skin and hair cells then take up the pigment, which
accounts for their characteristic pigmentation. Mouse
mutants that have defective melanosomes often have pale
or unusual coat colors. One such light-colored mouse, the
Mocha mouse (Figure Q13-3), has a defect in the gene for
one of the subunits of the adaptor protein complex AP3,
which is associated with coated vesicles budding from the
trans Golgi network. How might the loss of AP3 cause a
defect in melanosomes?


Figure Q13-3 A normal mouse and the Mocha mouse (Problem 13-11).
In addition to its light coat color, the Mocha mouse has a poor sense
of balance. (Courtesy of Margit Burmeister.)
CHAPTER 14

Energy Conversion: Mitochondria and Chloroplasts


To maintain their high degree of organization in a universe that is constantly
drifting toward chaos, cells have a constant need for a plentiful supply of ATP, as we
have explained in Chapter 2. In eukaryotic cells, most of the ATP that powers life
processes is produced by specialized, membrane-enclosed, energy-converting
organelles. These are of two types. Mitochondria, which occur in virtually all cells
of animals, plants, and fungi, burn food molecules to produce ATP by oxidative
phosphorylation. Chloroplasts, which occur only in plants and green algae,
harness solar energy to produce ATP by photosynthesis. In electron micrographs, the
most striking features of both mitochondria and chloroplasts are their extensive
internal membrane systems. These internal membranes contain sets of membrane
protein complexes that work together to produce most of the cell's ATP. In
bacteria, simpler versions of essentially the same protein complexes produce ATP,
but they are located in the cell's plasma membrane (Figure 14-1).

Comparisons of DNA sequences indicate that the energy-converting organelles
in present-day eukaryotes originated from prokaryotic cells that were
endocytosed during the evolution of eukaryotes (discussed in Chapter 1). This explains
why mitochondria and chloroplasts contain their own DNA, which still encodes
a subset of their proteins. Over time, these organelles have lost most of their own
genomes and become heavily dependent on proteins that are encoded by genes
in the nucleus, synthesized in the cytosol, and then imported into the organelle.
And the eukaryotic cells now rely on these organelles not only for the ATP they
need for biosynthesis, solute transport, and movement, but also for many important
biosynthetic reactions that occur inside each organelle.

IN THIS CHAPTER

THE MITOCHONDRION
THE PROTON PUMPS OF THE ELECTRON-TRANSPORT CHAIN
ATP PRODUCTION IN MITOCHONDRIA
CHLOROPLASTS AND PHOTOSYNTHESIS
THE GENETIC SYSTEMS OF MITOCHONDRIA AND CHLOROPLASTS


Figure 14-1 The membrane systems of bacteria, mitochondria, and chloroplasts
are related. Mitochondria and chloroplasts are cell organelles that have originated
from bacteria and have retained the bacterial energy-conversion mechanisms. Like
their bacterial ancestors, mitochondria and chloroplasts have an outer and an
inner membrane. Each of the membranes colored in this diagram contains
energy-harvesting electron-transport chains. The deep invaginations of the
mitochondrial inner membrane and the internal membrane system of the chloroplast
harbor the machinery for cellular respiration and photosynthesis, respectively.

The common evolutionary origin of the energy-converting machinery in
mitochondria, chloroplasts, and prokaryotes (archaea and bacteria) is reflected in the
fundamental mechanism that they share for harnessing energy. This is known
as chemiosmotic coupling, signifying a link between the chemical bond-forming
reactions that generate ATP ("chemi") and membrane transport processes
("osmotic"). The chemiosmotic process occurs in two linked stages, both of which
are performed by protein complexes in a membrane.

Stage 1: High-energy electrons (derived from the oxidation of food molecules,
from pigments excited by sunlight, or from other sources described
later) are transferred along a series of electron-transport protein complexes
that form an electron-transport chain embedded in a membrane.
Each electron transfer releases a small amount of energy that is used to
pump protons (H⁺) and thereby generate a large electrochemical gradient
across the membrane (Figure 14-2). As discussed in Chapter 11, such an
electrochemical gradient provides a way of storing energy, and it can be
harnessed to do useful work when ions flow back across the membrane.

Stage 2: The protons flow back down their electrochemical gradient
through an elaborate membrane protein machine called ATP synthase,
which catalyzes the production of ATP from ADP and inorganic phosphate
(Pi). This ubiquitous enzyme works like a turbine in the membrane, driven
by protons, to synthesize ATP (Figure 14-3). In this way, the energy derived
from food or sunlight in stage 1 is converted into the chemical energy of a
phosphate bond in ATP.

Electrons move through protein complexes in biological systems via tightly
bound metal ions or other carriers that take up and release electrons easily, or by
special small molecules that pick electrons up at one location and deliver them to
another. For mitochondria, the first of these electron carriers is NAD⁺, a
water-soluble small molecule that takes up two electrons and one H⁺ derived from food
molecules (fats and carbohydrates) to become NADH. NADH transfers these
electrons from the sites where the food molecules are degraded to the inner
mitochondrial membrane. There, the electrons from the energy-rich NADH are passed
from one membrane protein complex to the next, passing to a lower-energy
compound at each step, until they reach a final complex in which they combine with
molecular oxygen (O₂) to produce water. The energy released at each step as the
electrons flow down this path from the energy-rich NADH to the low-energy water
molecule drives H⁺ pumps in the inner mitochondrial membrane, utilizing three
different membrane protein complexes. Together, these complexes generate the
proton-motive force harnessed by ATP synthase to produce the ATP that serves as
the universal energy currency throughout the cell (see Chapter 2).
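
To give a feel for how much energy is available as electrons pass from NADH to oxygen, the total free-energy drop can be estimated from the difference in redox potential between the two ends of the chain, using ΔG = −nFΔE. The sketch below is illustrative only: the passage above quotes no numbers, so the standard redox potentials (about −0.32 V for the NAD⁺/NADH couple and about +0.82 V for the O₂/H₂O couple at pH 7) and the Faraday constant are supplied here as commonly tabulated values, and the function name is hypothetical.

```python
FARADAY_KJ_PER_V_MOL = 96.485  # Faraday constant in kJ per volt per mole of electrons


def delta_g_kj_per_mol(n_electrons, e_donor_v, e_acceptor_v):
    """Standard free-energy change for moving n electrons from a donor couple
    to an acceptor couple, using delta_G = -n * F * delta_E."""
    delta_e = e_acceptor_v - e_donor_v
    return -n_electrons * FARADAY_KJ_PER_V_MOL * delta_e


# Assumed standard redox potentials at pH 7 (not given in the passage above)
E_NADH_V = -0.32  # NAD+/NADH couple
E_O2_V = 0.82     # O2/H2O couple

# Two electrons per NADH, as stated in the text
dg_total = delta_g_kj_per_mol(2, E_NADH_V, E_O2_V)
```

Under these assumptions the result is roughly −220 kJ per mole of NADH. The negative sign indicates that energy is released, and it is released in several smaller steps along the chain rather than all at once.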

Figure 14-4 compares the electron-transport processes in mitochondria,
which harness energy from food molecules, with those in chloroplasts, which
harness energy from sunlight. The energy-conversion systems of mitochondria and
chloroplasts can be described in similar terms, and we shall see later in the chapter
that two of their key components are closely related. One of these is the ATP
synthase, and the other is a proton pump (colored green in Figure 14-4).

Figure 14-2 Stage 1 of chemiosmotic coupling. Energy from sunlight or the
oxidation of food compounds is captured to generate an electrochemical proton
gradient across a membrane. The electrochemical gradient serves as a
versatile energy store that drives energy-requiring reactions in mitochondria,
chloroplasts, and bacteria.

Figure 14-3 Stage 2 of chemiosmotic coupling. An ATP synthase (yellow)
embedded in the lipid bilayer of a membrane harnesses the electrochemical
proton gradient across the membrane, using it as a local energy store to drive
ATP synthesis. The red arrows show the direction of proton movement through the
ATP synthase.

Figure 14-4 Electron-transport processes. (A) The mitochondrion converts energy from chemical fuels. (B) The chloroplast
converts energy from sunlight. In both cases, electron flow is indicated by blue arrows. Each of the protein complexes (green)
is embedded in a membrane. In the mitochondrion, fats and carbohydrates from food molecules are fed into the citric acid
cycle and provide electrons to generate the energy-rich compound NADH from NAD⁺. These electrons then flow down an
energy gradient as they pass from one complex to the next in the electron-transport chain, until they combine with molecular
O₂ in the last complex to produce water. The energy released at each stage is harnessed to pump H⁺ across the membrane.
In the chloroplast, by contrast, electrons are extracted from water through the action of light in the photosystem II complex and
molecular O₂ is released. The electrons pass on to the next complex in the chain, which uses some of their energy to pump
protons across the membrane, before passing to photosystem I, where sunlight generates high-energy electrons that combine
with NADP⁺ to produce NADPH. NADPH then enters the carbon-fixation cycle along with CO₂ to generate carbohydrates.

Among the crucial constituents that are unique to photosynthetic organisms
are the two photosystems. These use the green pigment chlorophyll to capture
light energy and power the transfer of electrons, not unlike a photocell in a solar
panel. The chloroplasts drive electron transfer in the direction opposite to that in
mitochondria: electrons are taken from water to produce O₂, and these electrons
are used (via NADPH, a molecule closely related to the NADH used in mitochondria)
to synthesize carbohydrates from CO₂ and water. These carbohydrates then
serve as the source for all other compounds a plant cell needs.

Thus, both mitochondria and chloroplasts make use of an electron-transfer
chain to produce an H⁺ gradient that powers reactions that are critical for the cell.
However, chloroplasts generate O₂ and take up CO₂, whereas mitochondria
consume O₂ and release CO₂ (see Figure 14-4).

THE MITOCHONDRION 

Mitochondria occupy up to 20% of the cytoplasmic volume of a eukaryotic cell.
Although they are often depicted as short, bacterium-like bodies with a diameter
of 0.5-1 μm, they are in fact remarkably dynamic and plastic, moving about
the cell, constantly changing shape, dividing, and fusing (Movie 14.1).
Mitochondria are often associated with the microtubular cytoskeleton (Figure 14-5),
which determines their orientation and distribution in different cell types. Thus,
in highly polarized cells such as neurons, mitochondria can move long distances
(up to a meter or more in the extended axons of neurons), being propelled along
the tracks of the microtubular cytoskeleton. In other cells, mitochondria remain
fixed at points of high energy demand; for example, in skeletal or cardiac muscle
cells, they pack between myofibrils, and in sperm cells they wrap tightly around
the flagellum (Figure 14-6).

Mitochondria also interact with other membrane systems in the cell, most
notably the endoplasmic reticulum (ER). Contacts between mitochondria and ER
define specialized domains thought to facilitate the exchange of lipids between
the two membrane systems. These contacts also appear to induce mitochondrial
fission, which, as we discuss later, is involved in the distribution and partitioning
of mitochondria within cells (Figure 14-7).

The acquisition of mitochondria was a prerequisite for the evolution of complex
animals. Without mitochondria, present-day animal cells would have had to
generate all of their ATP through anaerobic glycolysis. When glycolysis converts
glucose to pyruvate, it releases only a small fraction of the total free energy that
is potentially available from glucose oxidation (see Chapter 2). In mitochondria,
the metabolism of sugars is complete: pyruvate is imported into the mitochondrion
and ultimately oxidized by O₂ to CO₂ and H₂O, which allows 15 times more
ATP to be made from a sugar than by glycolysis alone. As explained later, this
became possible only when enough molecular oxygen accumulated in the Earth's
atmosphere to allow organisms to take full advantage, via respiration, of the large
amounts of energy potentially available from the oxidation of organic compounds.
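
The "15 times" figure can be checked with simple arithmetic. The sketch below assumes the commonly quoted round numbers of a net 2 ATP per glucose from glycolysis alone and about 30 ATP per glucose when oxidation is completed in the mitochondrion. Neither number is stated in the passage above, and published estimates of the second vary somewhat, so both should be read as assumptions.

```python
GLYCOLYSIS_NET_ATP = 2    # assumed net ATP per glucose from glycolysis alone
FULL_OXIDATION_ATP = 30   # assumed approximate ATP per glucose after complete oxidation


def fold_increase(full_oxidation_atp, glycolysis_atp):
    """How many times more ATP complete oxidation yields than glycolysis alone."""
    return full_oxidation_atp / glycolysis_atp


ratio = fold_increase(FULL_OXIDATION_ATP, GLYCOLYSIS_NET_ATP)
print(f"Complete oxidation yields about {ratio:.0f} times more ATP per glucose")
```

With these assumed values the ratio works out to 15, in line with the figure quoted in the text.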

Mitochondria are large enough to be seen in the light microscope, and they
were first identified in the nineteenth century. Real progress in understanding
their internal structure and function, however, depended on biochemical procedures
developed in 1948 for isolating intact mitochondria, and on electron
microscopy, which was first used to look at cells at about the same time.


Figure 14-5 The relationship between mitochondria and microtubules. (A) A
light micrograph of chains of elongated mitochondria in a living mammalian
cell in culture. The cell was stained with a fluorescent dye (rhodamine 123)
that specifically labels mitochondria in living cells. (B) An immunofluorescence
micrograph of the same cell stained (after fixation) with fluorescent antibodies
that bind to microtubules. Note that the mitochondria tend to be aligned along
microtubules. (Courtesy of Lan Bo Chen.)

Figure 14-6 Localization of mitochondria near sites of high ATP demand in cardiac muscle
and a sperm tail. (A) Cardiac muscle in the wall of the heart is the most heavily used muscle in the
body, and its continual contractions require a reliable energy supply. It has limited built-in energy
stores and has to depend on a steady supply of ATP from the copious mitochondria aligned close
to the contractile myofibrils (see Figure 16-32). (B) During sperm development, microtubules wind
helically around the flagellar axoneme, where they are thought to help localize the mitochondria in
the tail to produce the structure shown.
Figure 14-7 Interaction of mitochondria with the endoplasmic reticulum.
(A) Fluorescence light microscopy shows that tubules of the ER (green) wrap
around parts of the mitochondrial network (red) in mammalian cells. The
mitochondria then divide at the contact sites. After contact is established,
fission occurs within less than a minute, as indicated by time-lapse
microscopy. (B) Schematic drawing of an ER tubule wrapped around part of
the mitochondrial reticulum. It is thought that ER-mitochondrial contacts also
mediate the exchange of lipids between the two membrane systems.
(A, adapted from J.R. Friedman et al., Science 334:358-362, 2011.)


The Mitochondrion Has an Outer Membrane and an Inner Membrane

Like the bacteria from which they originated, mitochondria have an outer and
an inner membrane. The two membranes have distinct functions and properties,
and delineate separate compartments within the organelle. The inner membrane,
which surrounds the internal mitochondrial matrix compartment (Figure 14-8),
is highly folded to form invaginations known as cristae (the singular is crista),
which contain in their membranes the proteins of the electron-transport chain.
Where the inner membrane runs parallel to the outer membrane, between the
cristae, it is known as the inner boundary membrane. The narrow (20-30 nm) gap
between the inner boundary membrane and the outer membrane is known as
the intermembrane space. The cristae are about 20 nm-wide membrane discs
or tubules that protrude deeply into the matrix and enclose the crista space. The
crista membrane is continuous with the inner boundary membrane, and where
their membranes join, the membrane forms narrow membrane tubes or slits,
known as crista junctions.

Like the bacterial outer membrane, the outer mitochondrial membrane is
freely permeable to ions and to small molecules as large as 5000 daltons. This
is because it contains many porin molecules, a special class of β-barrel-type
membrane protein that creates aqueous pores across the membrane (see Figure
10-23). As a consequence, the intermembrane space between the outer and inner
membrane has the same pH and ionic composition as the cytoplasm, and there is
no electrochemical gradient across the outer membrane.

Figure 14-8 Structure of a mitochondrion. (A) Tomographic slice through a three-dimensional map of a mouse heart mitochondrion determined by
electron-microscope tomography. The outer membrane envelops the inner boundary membrane. The inner membrane is highly folded into tubular
or lamellar cristae, which crisscross the matrix. The dense matrix, which contains most of the mitochondrial protein, appears dark in the electron
microscope, whereas the intermembrane space and the crista space appear light due to their lower protein content. The inner boundary membrane
follows the outer membrane closely at a distance of ~20 nm. The inner membrane turns sharply at the cristae junctions, where the cristae join the
inner boundary membrane. (B) Tomographic surface-rendered portion of a yeast mitochondrion, showing how flattened cristae project into the matrix
from the inner membrane (Movie 14.2). (C) Schematic drawing of a mitochondrion showing the outer membrane (gray), and the inner membrane
(yellow). Note that the inner membrane is compartmentalized into the inner boundary membrane and the crista membrane. There are three distinct
spaces: the intermembrane space, the crista space, and the matrix. (A, courtesy of Tobias Brandt; B, from K. Davies et al., Proc. Natl Acad. Sci.
USA 109:13602-13607, 2012. With permission from the National Academy of Sciences.)

Figure 14-9 Biochemical fractionation of purified mitochondria into
separate components. Large numbers of mitochondria are isolated from
homogenized tissue by centrifugation and then suspended in a medium
of low osmotic strength. In such a medium, water flows into mitochondria
and greatly expands the matrix space (yellow). While the cristae of the inner
membrane unfold to accommodate the swelling, the outer membrane,
which has no folds, breaks, releasing structures composed of the inner
membrane surrounding the matrix. These techniques have made it possible
to study the protein composition of the inner membrane (comprising a
mixture of cristae, boundary membranes, and cristae junctions), the outer
membrane, and the matrix.

If purified mitochondria are gently disrupted and then fractionated (Figure
14-9), the biochemical composition of membranes and mitochondrial compartments
can be determined.

The Inner Membrane Cristae Contain the Machinery for Electron Transport and ATP Synthesis

Unlike the outer mitochondrial membrane, the inner mitochondrial membrane
is a diffusion barrier to ions and small molecules, just like the bacterial inner
membrane. However, selected ions, most notably protons and phosphate, as well
as essential metabolites such as ATP and ADP, can pass through it by means of
special transport proteins.

The inner mitochondrial membrane is highly differentiated into functionally
distinct regions with different protein compositions. As discussed in Chapter
10, the lateral segregation of membrane regions with different protein and lipid
compositions is a key feature of cells. In the inner mitochondrial membrane,
the boundary membrane region is thought to contain the machinery for protein
import, new membrane insertion, and assembly of the respiratory-chain complexes.
The membranes of the cristae, which are continuous with the boundary
membrane, contain the ATP synthase enzyme that produces most of the cell's
ATP; they also contain the large protein complexes of the respiratory chain, the
name given to the mitochondrion's electron-transport chain.

At the cristae junctions, where the membranes of the cristae join the boundary
membrane, special protein complexes provide a diffusion barrier that segregates
the membrane proteins in the two regions of the inner membrane; these complexes
are also thought to anchor the cristae to the outer membrane, thus maintaining
the highly folded topology of the inner membrane. Cristae membranes
have one of the highest protein densities of all biological membranes, with a lipid
content of 25% and a protein content of 75% by weight. The folding of the inner
membrane into cristae greatly increases the membrane area available for oxidative
phosphorylation. In highly active cardiac muscle cells, for example, the total
area of cristae membranes can be up to 20 times larger than the area of the cell's
plasma membrane. In total, the surface area of cristae membranes in each human
body adds up to roughly the size of a football field.

The Citric Acid Cycle in the Matrix Produces NADH 

Together with the cristae that project into it, the matrix is the major working part
of the mitochondrion. Mitochondria can use both pyruvate and fatty acids as fuel.
Pyruvate is derived from glucose and other sugars, whereas fatty acids are derived
from fats. Both of these fuel molecules are transported across the inner mitochondrial
membrane by specialized transport proteins, and they are then converted
to the crucial metabolic intermediate acetyl CoA by enzymes located in the
mitochondrial matrix (see Chapter 2).
Figure 14-10 A summary of the energy-converting metabolism in mitochondria.
Pyruvate and fatty acids enter the mitochondrion (top of the figure) and are
broken down to acetyl CoA. The acetyl CoA is metabolized by the citric acid cycle,
which reduces NAD⁺ to NADH, which then passes its high-energy electrons to the first
complex in the electron-transport chain. In the process of oxidative phosphorylation,
these electrons pass along the electron-transport chain in the inner membrane
cristae to oxygen (O₂). This electron transport generates a proton gradient,
which drives the production of ATP by the ATP synthase (see Figure 14-3). Electrons
from the oxidation of succinate, a reaction intermediate in the citric acid cycle (see
Panel 2-9, pp. 106-107), take a separate path to enter this electron-transport chain
(not shown, see p. 772).
The membranes that comprise the mitochondrial inner membrane (the
inner boundary membrane and the crista membrane) contain different mixtures of
proteins and they are therefore shaded differently in this diagram.


The acetyl groups in acetyl CoA are oxidized in the matrix via the citric acid
cycle, also called the Krebs cycle (see Figure 2-57 and Movie 2.6). The oxidation of
these carbon atoms in acetyl CoA produces CO₂, which diffuses out of the
mitochondrion to be released to the environment as a waste product. More importantly,
the citric acid cycle saves a great deal of the bond energy released by this
oxidation in the form of electrons carried by NADH. This NADH transfers its
electrons from the matrix to the electron-transport chain in the inner mitochondrial
membrane, where, through the chemiosmotic coupling process described previously
(see Figures 14-2 and 14-3), the energy that was carried by NADH electrons
is converted into phosphate-bond energy in ATP. Figure 14-10 outlines this
sequence of reactions schematically.

The matrix contains the genetic system of the mitochondrion, including the
mitochondrial DNA and the ribosomes. The mitochondrial DNA (see section on
genetic systems, p. 800) is organized into compact bodies, the nucleoids, by
special scaffolding proteins that also function as transcription regulatory proteins.
The large number of enzymes required for the maintenance of the mitochondrial
genetic system, as well as for many other essential reactions to be outlined next,
accounts for the very high protein concentration in the matrix; at more than 500
mg/mL, this concentration is close to that in a protein crystal.

Mitochondria Have Many Essential Roles in Cellular Metabolism

Mitochondria not only generate most of the cell's ATP; they also provide many
other essential resources for biosynthesis and cell growth. Before describing in
detail the remarkable machinery of the respiratory chain, we diverge briefly to
touch on some of these important roles.
Mitochondria are critical for buffering the redox potential in the cytosol. Cells
need a constant supply of the electron acceptor NAD⁺ for the central reaction in
glycolysis that converts glyceraldehyde 3-phosphate to 1,3-bisphosphoglycerate
(see Figure 2-48). This NAD⁺ is converted to NADH in the process, and the NAD⁺
needs to be regenerated by transferring the high-energy NADH electrons somewhere.
The NADH electrons will eventually be used to help drive oxidative
phosphorylation inside the mitochondrion. But the inner mitochondrial membrane
is impermeable to NADH. The electrons are therefore passed from the NADH to
smaller molecules in the cytosol that can move through the inner mitochondrial
membrane. Once in the matrix, these smaller molecules transfer their electrons to
NAD⁺ to form mitochondrial NADH, after which they are returned to the cytosol
for recharging, creating a so-called shuttle system for the NADH electrons.

In addition to ATP, biosynthesis in the cytosol requires both a constant supply
of reducing power in the form of NADPH and small carbon-rich molecules
to serve as building blocks (discussed in Chapter 2). Descriptions of biosynthesis
often state that the needed carbon skeletons come directly from the breakdown
of sugars, whereas the NADPH is produced in the cytosol by a side pathway
for the breakdown of sugars (the pentose phosphate pathway, an alternative to
glycolysis). But under conditions where nutrients abound and plenty of ATP is
available, mitochondria help to generate both the reducing power and the
carbon-rich building blocks (the "carbon skeletons" in Panel 2-1, pp. 90-91) needed
for cell growth. For this purpose, excess citrate produced in the mitochondrial
matrix by the citric acid cycle (see Panel 2-9, pp. 106-107) is transported down
its electrochemical gradient to the cytosol, where it is metabolized to produce
essential components of the cell. Thus, for example, as part of a cell's response
to growth signals, large amounts of acetyl CoA are produced in the cytosol from
citrate exported from mitochondria, accelerating the production of the fatty acids
and sterols that build new membranes (described in Chapter 10). Cancer cells are
frequently mutated in ways that enhance this pathway, as part of their program of
abnormal growth (see Figure 20-26).

The urea cycle is a central metabolic pathway in mammals that converts the
ammonia (NH4+) produced by the breakdown of nitrogen-containing compounds
(such as amino acids) to the urea excreted in urine. Two critical steps of the urea
cycle are carried out in the mitochondria of liver cells, while the remaining steps
occur in the cytosol. Mitochondria also play an essential part in the metabolic
adaptation of cells to different nutritional conditions. For example, under
conditions of starvation, proteins in our bodies are broken down to amino acids,
and the amino acids are imported into mitochondria and oxidized to produce NADH
for ATP production.

The biosynthesis of heme groups (which, as we shall see in the next section,
play a central part in electron transfer) is another critical process that is shared
between the mitochondrion and the cytoplasm. Iron-sulfur clusters, which are
essential not only for electron transfer in the respiratory chain (see p. 766), but
also for the maintenance and stability of the nuclear genome, are produced in
mitochondria (and chloroplasts). Nuclear genome instability, a hallmark of
cancer, can sometimes be linked to the decreased function of cellular proteins
that contain iron-sulfur clusters.

Mitochondria also have a central role in membrane biosynthesis. Cardiolipin
is a two-headed phospholipid (Figure 14-11) that is confined to the inner
mitochondrial membrane, where it is also produced. But mitochondria are also a
major source of phospholipids for the biogenesis of other cell membranes.
Phosphatidylethanolamine, phosphatidylglycerol, and phosphatidic acid are
synthesized in the mitochondrion, while phosphatidylinositol,
phosphatidylcholine, and phosphatidylserine are primarily synthesized in the
endoplasmic reticulum (ER). As described in Chapter 12, most of the cell's
membranes are assembled in the ER. The exchange of lipids between the ER and
mitochondria is thought to occur at special sites of close contact (see
Figure 14-7) by an as-yet unknown mechanism.

Figure 14-11 The structure of cardiolipin. Cardiolipin consists of two
covalently linked phospholipid units, with a total of four rather than the usual
two fatty acid chains (see Figure 10-3). Cardiolipin is only produced in the
mitochondrial inner membrane, where it interacts closely with membrane proteins
involved in oxidative phosphorylation and ATP transport. In cristae, its two
juxtaposed phosphate groups may act as a local proton trap on the membrane
surface.

Finally, mitochondria are important calcium buffers, taking up calcium from
the ER and sarcoplasmic reticulum at special membrane junctions. Cellular
calcium levels control muscle contraction (see Chapter 16), and alterations are
implicated in neurodegeneration and apoptosis. Clearly, cells and organisms
depend on mitochondria in many different ways.

We now return to the central function of the mitochondrion in respiratory ATP
generation.

A Chemiosmotic Process Couples Oxidation Energy to ATP 
Production 

Although the citric acid cycle that takes place in the mitochondrial matrix is
considered to be part of aerobic metabolism, it does not itself use oxygen. Only
the final step of oxidative metabolism consumes molecular oxygen (O2) directly
(see Figure 14-10). Nearly all the energy available from metabolizing
carbohydrates, fats, and other foodstuffs in earlier stages is saved in the form
of energy-rich compounds that feed electrons into the respiratory chain in the
inner mitochondrial membrane. These electrons, most of which are carried by
NADH, finally combine with O2 at the end of the respiratory chain to form water.
The energy released during the complex series of electron transfers from NADH to
O2 is harnessed in the inner membrane to generate an electrochemical gradient
that drives the conversion of ADP + Pi to ATP. For this reason, the term
oxidative phosphorylation is used to describe this final series of reactions
(Figure 14-12).

The total amount of energy released by biological oxidation in the respiratory
chain is equivalent to that released by the explosive combustion of hydrogen
when it combines with oxygen in a single step to form water. But the combustion
of hydrogen in a single-step chemical reaction, which has a strongly negative ΔG,
releases this large amount of energy unproductively as heat. In the respiratory
chain, the same energetically favorable reaction H2 + ½O2 → H2O is divided into
small steps (Figure 14-13). This stepwise process allows the cell to store nearly
half of the total energy that is released in a useful form. At each step, the
electrons, which can be thought of as having been removed from a hydrogen
molecule to produce two protons, pass through a series of electron carriers in
the inner mitochondrial membrane. At each of three distinct steps along the way
(marked by the three electron-transport complexes of the respiratory chain, see
below), much of the energy is utilized for pumping protons across the membrane.
At the end of the electron-transport chain, the electrons and protons recombine
with molecular oxygen into water.

Figure 14-12 The major net energy conversion catalyzed by the mitochondrion.
In the process of oxidative phosphorylation, the mitochondrial inner membrane
serves as a device that changes one form of chemical-bond energy to another,
converting a major part of the energy of NADH oxidation into phosphate-bond
energy in ATP.

Figure 14-13 A comparison of biological oxidation with combustion. (A) If
hydrogen were simply burned, nearly all of the energy would be released in the
form of heat. (B) In biological oxidation reactions, about half of the released
energy is stored in a form useful to the cell by means of the electron-transport
chain (the respiratory chain) in the crista membrane of the mitochondrion. Only
the rest of the energy is released as heat. In the cell, the protons and
electrons shown here as being derived from H2 are removed from hydrogen atoms
that are covalently linked to NADH molecules.

Water is a very low-energy molecule and is thus very stable; it can serve as an
electron donor only when a large amount of energy from an external source is
spent on splitting it into protons, electrons, and molecular oxygen. This is exactly
what happens in oxygenic photosynthesis, where the external energy source is the
sun, as we shall see later in the section on chloroplasts (p. 782).

The Energy Derived from Oxidation Is Stored as an 
Electrochemical Gradient 

In mitochondria, the process of electron transport begins when two electrons
and a proton are removed from NADH (to regenerate NAD+). These electrons are
passed to the first of about 20 different electron carriers in the respiratory chain.
The electrons start at a large negative redox potential (see Panel 14-1, p. 765),
that is, at a high energy level, which gradually drops as they pass along the chain.
The proteins involved are grouped into three large respiratory enzyme complexes,
each composed of protein subunits that sit in the inner mitochondrial membrane.
Each complex in the chain has a higher affinity for electrons than its predecessor,
and electrons pass sequentially from one complex to the next until they are finally
transferred to molecular oxygen, which has the highest electron affinity of all.

The net result is the pumping of H+ out of the matrix across the inner
membrane, driven by the energetically favorable flow of electrons. This
transmembrane movement of H+ has two major consequences:

1. It generates a pH gradient across the inner mitochondrial membrane, with
a high pH in the matrix (close to 8) and a lower pH in the intermembrane
space. Since ions and small molecules equilibrate freely across the outer
mitochondrial membrane, the pH in the intermembrane space is the same
as in the cytosol (generally around pH 7.4).

2. It generates a voltage gradient across the inner mitochondrial membrane,
creating a membrane potential with the matrix side negative and the crista
space side positive.

The pH gradient (ΔpH) reinforces the effect of the membrane potential (ΔV),
because the latter acts to attract any positive ion into the matrix and to push any
negative ion out. Together, ΔpH and ΔV make up the electrochemical gradient,
which is measured in units of millivolts (mV). This gradient exerts a
proton-motive force, which tends to drive H+ back into the matrix (Figure 14-14).

Figure 14-14 The electrochemical proton gradient across the inner mitochondrial
membrane. This gradient is composed of a large force due to the membrane
potential (ΔV) and a smaller force due to the H+ concentration gradient, that
is, the pH gradient (ΔpH). Both forces combine to generate the proton-motive
force, which pulls H+ back into the mitochondrial matrix. The exact relationship
between these forces is expressed by the Nernst equation (see Panel 11-1, p. 616).

The electrochemical gradient across the inner membrane of a respiring
mitochondrion is typically about 180 mV (inside negative), and it consists of a
membrane potential of about 150 mV and a pH gradient of about 0.5 to 0.6 pH units
(each ΔpH of 1 pH unit is equivalent to a membrane potential of about 60 mV).
The electrochemical gradient drives not only ATP synthesis but also the transport
of selected molecules across the inner mitochondrial membrane, including the
import of selected proteins from the cytoplasm (discussed in Chapter 12).
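
The arithmetic behind the figure of about 180 mV can be sketched in a few lines of Python. This is only an illustration of the numbers quoted above: the function and constant names are hypothetical, and the conversion factor of 60 mV per pH unit is the approximate value given in the text.

```python
# Approximate conversion quoted in the text: 1 pH unit ~ 60 mV.
MV_PER_PH_UNIT = 60.0


def electrochemical_gradient_mv(membrane_potential_mv, delta_ph):
    """Add the membrane potential to the pH gradient expressed in mV."""
    return membrane_potential_mv + MV_PER_PH_UNIT * delta_ph


# Values from the text: about 150 mV, plus 0.5 to 0.6 pH units.
low = electrochemical_gradient_mv(150.0, 0.5)
high = electrochemical_gradient_mv(150.0, 0.6)
print(f"electrochemical gradient: {low:.0f} to {high:.0f} mV")
```

With these inputs the pH gradient contributes 30 to 36 mV, so the membrane potential accounts for most of the total, as Figure 14-14 indicates.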

Summary 

The mitochondrion performs most cellular oxidations and produces the bulk of the
animal cell's ATP. A mitochondrion has two separate membranes: the outer
membrane and the inner membrane. The inner membrane surrounds the innermost
space (the matrix) of the mitochondrion and forms the cristae, which project into
the matrix. The matrix and the inner membrane cristae are the major working parts
of the mitochondrion. The membranes that form cristae account for a major part
of the membrane surface area in most cells, and they contain the mitochondrion's
electron-transport chain (the respiratory chain).

The mitochondrial matrix contains a large variety of enzymes, including those
that convert pyruvate and fatty acids to acetyl CoA and those that oxidize this
acetyl CoA to CO2 through the citric acid cycle. These oxidation reactions
produce large amounts of NADH, whose high-energy electrons are passed to the
respiratory chain. The respiratory chain then uses the energy derived from
transporting electrons from NADH to molecular oxygen to pump H+ out of the
matrix. This produces a large electrochemical proton gradient across the inner
mitochondrial membrane, composed of contributions from both a membrane potential
and a pH difference. This electrochemical gradient exerts a force to drive H+
back into the matrix. This proton-motive force is harnessed both to produce ATP
and for the selective transport of metabolites across the inner mitochondrial
membrane.


THE PROTON PUMPS OF THE ELECTRON- 
TRANSPORT CHAIN 

Having considered in general terms how a mitochondrion uses electron transport
to generate a proton-motive force, we now turn to the molecular mechanisms
that underlie this membrane-based energy-conversion process. In describing the
respiratory chain of mitochondria, we accomplish the larger purpose of
explaining how an electron-transport process can pump protons across a membrane.
As stated at the beginning of this chapter, mitochondria, chloroplasts, archaea,
and bacteria use very similar chemiosmotic mechanisms. In fact, these mechanisms
underlie the function of all living organisms, including anaerobes that derive
all their energy from electron transfers between two inorganic molecules, as we
shall see later.

We start with some of the basic principles on which all of these processes
depend.

The Redox Potential Is a Measure of Electron Affinities 

In chemical reactions, any electrons removed from one molecule are always
passed to another, so that whenever one molecule is oxidized, another is reduced.
As with any other chemical reaction, the tendency of such redox reactions to
proceed spontaneously depends on the free-energy change (ΔG) for the electron
transfer, which in turn depends on the relative affinities of the two molecules for
electrons.

Because electron transfers provide most of the energy for life, it is worth taking
the time to understand them. As discussed in Chapter 2, acids donate protons
and bases accept them (see Panel 2-2, p. 93). Acids and bases exist in conjugate
acid-base pairs, in which the acid is readily converted into the base by the loss of a
proton. For example, acetic acid (CH3COOH) is converted into its conjugate base,
the acetate ion (CH3COO-), in the reaction:

CH3COOH ⇌ CH3COO- + H+

In an exactly analogous way, pairs of compounds such as NADH and NAD+
are called redox pairs, since NADH is converted to NAD+ by the loss of electrons
in the reaction:

NADH ⇌ NAD+ + H+ + 2e-

NADH is a strong electron donor: because two of its electrons are engaged in
a covalent bond which releases energy when broken, the free-energy change for
passing these electrons to many other molecules is favorable. Energy is required
to form this bond from NAD+, two electrons, and a proton (the same amount of
energy that was released when the bond was broken). Therefore NAD+, the redox
partner of NADH, is of necessity a weak electron acceptor.

We can measure the tendency to transfer electrons from any redox pair
experimentally. All that is required is the formation of an electrical circuit
linking a 1:1 (equimolar) mixture of the redox pair to a second redox pair that
has been arbitrarily selected as a reference standard, so that we can measure
the voltage difference between them (Panel 14-1). This voltage difference is
defined as the redox potential; electrons move spontaneously from a redox pair
like NADH/NAD+ with a lower redox potential (a lower affinity for electrons) to
a redox pair like O2/H2O with a higher redox potential (a higher affinity for
electrons). Thus, NADH is a good molecule for donating electrons to the
respiratory chain, while O2 is well suited to act as the "sink" for electrons at
the end of the chain. As explained in Panel 14-1, the difference in redox
potential, ΔE'0, is a direct measure of the standard free-energy change (ΔG°)
for the transfer of an electron from one molecule to another.

Electron Transfers Release Large Amounts of Energy 

As just discussed, those pairs of compounds that have the most negative redox
potentials have the weakest affinity for electrons and therefore are useful as
carriers with a strong tendency to donate electrons. Conversely, those pairs that
have the most positive redox potentials have the greatest affinity for electrons
and therefore are useful as carriers with a strong tendency to accept electrons.
A 1:1 mixture of NADH and NAD+ has a redox potential of -320 mV, indicating
that NADH has a strong tendency to donate electrons; a 1:1 mixture of H2O and
½O2 has a redox potential of +820 mV, indicating that O2 has a strong tendency to
accept electrons. The difference in redox potential is 1140 mV, which means that
the transfer of each electron from NADH to O2 under these standard conditions is
enormously favorable, since ΔG° = -109 kJ/mole, and twice this amount of energy
is gained for the two electrons transferred per NADH molecule (see Panel 14-1).
If we compare this free-energy change with that for the formation of the
phosphoanhydride bonds in ATP, where ΔG° = 30.6 kJ/mole (see Figure 2-50), we
see that, under standard conditions, the oxidation of one NADH molecule releases
more than enough energy to synthesize seven molecules of ATP from ADP and Pi.
(In the cell, the number of ATP molecules generated will be lower because the
standard conditions are far from the physiological ones; in addition, small
amounts of energy are inevitably dissipated as heat along the way.)
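
The comparison with ATP synthesis can be checked with a short Python sketch. It uses only the values quoted above and the formula from Panel 14-1; the function and variable names are hypothetical and chosen for illustration.

```python
# Conversion factor from Panel 14-1: kJ/mole per electron per mV.
KJ_PER_MOLE_PER_MV = 0.096


def delta_g_standard(n_electrons, acceptor_mv, donor_mv):
    """Standard free-energy change (kJ/mole) for an electron transfer."""
    delta_e_mv = acceptor_mv - donor_mv
    return -n_electrons * KJ_PER_MOLE_PER_MV * delta_e_mv


# Redox potentials from the text: NADH/NAD+ at -320 mV, H2O/O2 at +820 mV.
per_electron = delta_g_standard(1, 820.0, -320.0)
per_nadh = delta_g_standard(2, 820.0, -320.0)

# Standard free energy needed to form one ATP, from the text (kJ/mole).
ATP_FORMATION_KJ = 30.6
atp_equivalents = -per_nadh / ATP_FORMATION_KJ

print(f"per electron: {per_electron:.1f} kJ/mole")
print(f"per NADH:     {per_nadh:.1f} kJ/mole")
print(f"ATP equivalents under standard conditions: {atp_equivalents:.2f}")
```

The ratio comes out slightly above seven, which is the basis for the statement that one NADH releases more than enough energy for seven ATP molecules under standard conditions.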

Transition Metal Ions and Quinones Accept and Release
Electrons Readily

The electron-transport properties of the membrane protein complexes in the
respiratory chain depend upon electron-carrying cofactors, most of which are
transition metals such as Fe, Cu, Ni, and Mn, bound to proteins in the complexes.
These metals have special properties that allow them to promote both enzyme
catalysis and electron-transfer reactions. Most relevant here is the fact that their
ions exist in several different oxidation states with closely spaced redox
potentials, which enables them to accept or give up electrons readily; this property is


HOW REDOX POTENTIALS ARE MEASURED

One beaker (left) contains substance A with an equimolar
mixture of the reduced (Areduced) and oxidized (Aoxidized)
members of its redox pair. The other beaker contains the
hydrogen reference standard (2H+ + 2e- ⇌ H2), whose redox
potential is arbitrarily assigned as zero by international
agreement. (A salt bridge formed from a concentrated KCl
solution allows K+ and Cl- to move between the beakers, as
required to neutralize the charges when electrons flow
between the beakers.) The metal wire (dark blue) provides a
resistance-free path for electrons, and a voltmeter then
measures the redox potential of substance A. If electrons flow
from Areduced to H+, as indicated here, the redox pair formed
by substance A is said to have a negative redox potential. If
they instead flow from H2 to Aoxidized, the redox pair is said to
have a positive redox potential.

THE STANDARD REDOX POTENTIAL, E'0

The standard redox potential for a redox pair,
defined as E0, is measured for a standard state
where all of the reactants are at a concentration of
1 M, including H+. Since biological reactions occur at
pH 7, biologists instead define the standard state as
Areduced = Aoxidized and H+ = 10^-7 M. This standard
redox potential is designated by the symbol E'0, in
place of E0.

CALCULATION OF ΔG° FROM
REDOX POTENTIALS

To determine the energy change for an electron
transfer, the ΔG° of the reaction (kJ/mole) is calculated
as follows:

ΔG° = -n(0.096)ΔE'0, where n is the number of
electrons transferred across a redox potential
change of ΔE'0 millivolts (mV), and

ΔE'0 = E'0(acceptor) - E'0(donor)

EXAMPLE:

For the transfer of one electron from NADH to
ubiquinone:

ΔE'0 = +30 - (-320) = +350 mV
ΔG° = -n(0.096)ΔE'0 = -1(0.096)(350) = -34 kJ/mole

The same calculation reveals that the transfer of one
electron from ubiquinone to oxygen has an even more
favorable ΔG° of -76 kJ/mole. The ΔG° value for the
transfer of one electron from NADH to oxygen is the
sum of these two values, -110 kJ/mole.
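
The factor 0.096 in the panel's formula is the Faraday constant expressed in kJ per mV per mole. The second step of the example, which the panel states without showing the arithmetic, can be worked out in the same way. This is a sketch that takes the +820 mV potential of the H2O/O2 pair from the main text and the +30 mV potential of ubiquinone from the example above:

```latex
\begin{align*}
\Delta G^{\circ} &= -nF\,\Delta E'_0,
  \qquad F \approx 96.5\ \mathrm{kJ\,V^{-1}\,mol^{-1}}
         \approx 0.096\ \mathrm{kJ\,mV^{-1}\,mol^{-1}} \\[4pt]
\text{NADH} \to \text{ubiquinone:}\quad
  \Delta G^{\circ} &= -(1)(0.096)(350) \approx -34\ \mathrm{kJ/mole} \\[4pt]
\text{ubiquinone} \to \mathrm{O_2}:\quad
  \Delta E'_0 &= +820 - (+30) = +790\ \mathrm{mV} \\
  \Delta G^{\circ} &= -(1)(0.096)(790) \approx -76\ \mathrm{kJ/mole} \\[4pt]
\text{NADH} \to \mathrm{O_2}:\quad
  \Delta G^{\circ} &\approx -34 + (-76) = -110\ \mathrm{kJ/mole}
\end{align*}
```

Computing the overall step directly from the 1140 mV difference gives -109 kJ/mole, as in the main text; the difference from -110 kJ/mole arises only from rounding the two intermediate values.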


EFFECT OF CONCENTRATION CHANGES

As explained in Chapter 2 (see p. 60), the actual free-energy
change for a reaction, ΔG, depends on the concentration of the
reactants and generally will be different from the standard
free-energy change, ΔG°. The standard redox potentials are for a 1:1
mixture of the redox pair. For example, the standard redox
potential of -320 mV is for a 1:1 mixture of NADH and NAD+.

But when there is an excess of NADH over NAD+, electron
transfer from NADH to an electron acceptor becomes more
favorable. This is reflected by a more negative redox potential
and a more negative ΔG for electron transfer.
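
The size of this concentration effect can be estimated with the Nernst equation. The panel does not give the equation or any numerical example, so the following Python sketch rests on assumptions: the standard form E = E'0 + (RT/nF) ln([oxidized]/[reduced]), a temperature of 25 °C, and an illustrative 10:1 excess of NADH over NAD+. The function name is hypothetical.

```python
import math

R = 8.314     # gas constant, J/(mol*K)
F = 96485.0   # Faraday constant, C/mol
T = 298.15    # assumed temperature in kelvin (25 degrees C)


def redox_potential_mv(e0_prime_mv, oxidized, reduced, n_electrons):
    """Nernst equation: E = E'0 + (RT/nF) * ln([oxidized]/[reduced]), in mV."""
    shift_volts = (R * T) / (n_electrons * F) * math.log(oxidized / reduced)
    return e0_prime_mv + 1000.0 * shift_volts


# NADH/NAD+ pair: two electrons, standard potential -320 mV (from the text).
equimolar = redox_potential_mv(-320.0, oxidized=1.0, reduced=1.0, n_electrons=2)
# Illustrative assumption: ten times more NADH (reduced) than NAD+ (oxidized).
nadh_excess = redox_potential_mv(-320.0, oxidized=1.0, reduced=10.0, n_electrons=2)

print(f"1:1 NADH:NAD+  -> {equimolar:.1f} mV")
print(f"10:1 NADH:NAD+ -> {nadh_excess:.1f} mV")
```

Under these assumptions a tenfold excess of NADH makes the redox potential roughly 30 mV more negative, in line with the qualitative statement above.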
Figure 14-15 The structure of the heme group attached covalently to
cytochrome c. The porphyrin ring of the heme is shown in red. There are six 
different cytochromes in the respiratory chain. Because the hemes in different 
cytochromes have slightly different structures and are kept in different local
environments by their respective proteins, each has a different affinity for an
electron, and a slightly different spectroscopic signature.


exploited by the membrane protein complexes in the respiratory chain to move
electrons both within and between complexes.

Unlike the colorless atoms H, C, N, and O that constitute the bulk of biological
molecules, transition metal ions are often brightly colored, which makes the
proteins that contain them easy to study by spectroscopic methods using visible
light. One family of such colored proteins, the cytochromes, contains a bound
heme group, in which an iron atom is tightly held by four nitrogen atoms at the
corners of a square in a porphyrin ring (Figure 14-15). Similar porphyrin rings
are responsible both for the red color of blood and for the green color of
leaves, binding an iron in hemoglobin or a magnesium in chlorophyll, respectively.

Iron-sulfur proteins contain a second major family of electron-transfer
cofactors. In this case, either two or four iron atoms are bound to an equal
number of sulfur atoms and to cysteine side chains, forming iron-sulfur clusters
in the protein (Figure 14-16). Like the cytochrome hemes, these clusters carry
one electron at a time.

The simplest of the electron-transfer cofactors in the respiratory chain, and
the only one that is not always bound to a protein, is a quinone (called
ubiquinone, or coenzyme Q). A quinone (Q) is a small hydrophobic molecule that is
freely mobile in the lipid bilayer. This electron carrier can accept or donate
either one or two electrons. Upon reduction (note that reduced quinones are
called quinols), it picks up a proton from water along with each electron
(Figure 14-17).

In the mitochondrial electron-transport chain, six different cytochrome
hemes, eight iron-sulfur clusters, three copper atoms, a flavin mononucleotide
(another electron-transfer cofactor), and ubiquinone work in a defined sequence
to carry electrons from NADH to O2. In total, this pathway involves more than
60 different polypeptides arranged in three large membrane protein complexes,
each of which binds several of the above electron-carrying cofactors.

As we would expect, the electron-transfer cofactors have increasing affinities
for electrons (higher redox potentials) as the electrons move along the respiratory
chain. The redox potentials have been fine-tuned during evolution by the protein
environment of each cofactor, which alters the cofactor's normal affinity for
electrons. Because iron-sulfur clusters have a relatively low affinity for
electrons, they predominate in the first half of the respiratory chain; in
contrast, the heme cytochromes predominate further down the chain, where a
higher electron affinity is required.

NADH Transfers Its Electrons to Oxygen Through Three Large
Enzyme Complexes Embedded in the Inner Membrane

Membrane proteins are difficult to purify because they are insoluble in aqueous
solutions, and they are easily disrupted by the detergents that are required to
solubilize them. But by using mild nonionic detergents, such as octylglucoside or
dodecyl maltoside (see Figure 10-28), they can be solubilized and purified in their
native form, and even crystallized for structure determination. Each of the three
different detergent-solubilized respiratory-chain complexes can be re-inserted
into artificial lipid bilayer vesicles and shown to pump protons across the
membrane as electrons pass through them.

Figure 14-16 The structure of an iron-sulfur cluster. These dark brown
clusters consist either of four iron and four sulfur atoms, as shown here, or
of two irons and two sulfurs linked to cysteines in the polypeptide chain via
covalent sulfur bridges, or to histidines. Although they contain several iron atoms,
each iron-sulfur cluster can carry only one electron at a time. Nine different
iron-sulfur clusters participate in electron transport in the respiratory chain.

In the mitochondrion, the three complexes are linked in series, serving as
electron-transport-driven H+ pumps that pump protons out of the matrix to acidify
the crista space (Figure 14-18):

1. The NADH dehydrogenase complex (often referred to as Complex I) is
the largest of these respiratory enzyme complexes. It accepts electrons
from NADH and passes them through a flavin mononucleotide and eight
iron-sulfur clusters to the lipid-soluble electron carrier ubiquinone. The
reduced ubiquinol then transfers its electrons to cytochrome c reductase.

2. The cytochrome c reductase (also called the cytochrome b-c1 complex) is a
large membrane protein assembly that functions as a dimer. Each monomer
contains three cytochrome hemes and an iron-sulfur cluster. The complex
accepts electrons from ubiquinol and passes them on to the small, soluble
protein cytochrome c, which is located in the crista space and carries
electrons one at a time to cytochrome c oxidase.

3. The cytochrome c oxidase complex contains two cytochrome hemes and
three copper atoms. The complex accepts electrons one at a time from
cytochrome c and passes them to molecular oxygen. In total, four electrons
and four protons are needed to convert one molecule of oxygen to water.


We have previously discussed how the redox potential reflects electron
affinities. Figure 14-19 presents an outline of the redox potentials measured
along the respiratory chain. These potentials change in three large steps, one
across each proton-translocating respiratory complex. The change in redox
potential between any two electron carriers is directly proportional to the free
energy released when an electron transfers between them. Each complex acts as an
energy-conversion device by harnessing some of this free-energy change to pump
H+ across the inner membrane, thereby creating an electrochemical proton
gradient as electrons pass along the chain.

Figure 14-17 Quinone electron carriers. Ubiquinone in the lipid bilayer
picks up one H+ (red) from the aqueous environment for each electron (blue) it
accepts, in two steps, from respiratory-chain complexes. The first step involves
the acquisition of a proton and an electron and converts the ubiquinone into an
unstable ubisemiquinone radical. In the second step, it becomes a fully reduced
ubiquinone (called ubiquinol), which is freely mobile as an electron carrier in
the lipid bilayer of the membrane. When the ubiquinol donates its electrons to
the next complex in the chain, the two protons are released. The long
hydrophobic tail (green) that confines ubiquinone to the membrane consists of
6-10 five-carbon isoprene units, depending on the organism. The corresponding
electron carrier in the photosynthetic membranes of chloroplasts is
plastoquinone, which has almost the same structure and works in the same way.
For simplicity, we refer to both ubiquinone and plastoquinone in this chapter as
quinone (abbreviated as Q).

Figure 14-18 The path of electrons through the three respiratory-chain
proton pumps. (Movie 14.3) The approximate size and shape of each complex is
shown. During the transfer of electrons from NADH to oxygen (blue arrows),
ubiquinone and cytochrome c serve as mobile carriers that ferry electrons from
one complex to the next. During the electron-transfer reactions, protons are
pumped across the membrane by each of the respiratory enzyme complexes, as
indicated (red arrows).

For historical reasons, the three proton pumps in the respiratory chain
are sometimes denoted as Complex I, Complex III, and Complex IV, according
to the order in which electrons pass through them from NADH. Electrons from
the oxidation of succinate by succinate dehydrogenase (designated as Complex II)
are fed into the electron-transport chain in the form of reduced ubiquinone.
Although embedded in the crista membrane, succinate dehydrogenase does not pump
protons and thus does not contribute to the proton-motive force; it is therefore
not considered to be an integral part of the respiratory chain.

X-ray crystallography has elucidated the structure of each of the three
respiratory-chain complexes in great detail, and we next examine each of them in
turn to see how they work.

The NADH Dehydrogenase Complex Contains Separate Modules 
for Electron Transport and Proton Pumping 

The NADH dehydrogenase complex is a massive assembly of membrane and
nonmembrane proteins that receives electrons from NADH and passes them to
ubiquinone. In animal mitochondria, it consists of more than 40 different
protein subunits, with a molecular mass of nearly a million daltons. The x-ray
structures of the NADH dehydrogenase complex from fungi and bacteria show that it
is L-shaped, with both a hydrophobic membrane arm and a hydrophilic arm that
projects into the mitochondrial matrix (Figure 14-20).

Electron transfer and proton pumping are physically separated in the NADH
dehydrogenase complex, with electron transfer occurring in the matrix arm and
proton pumping in the membrane arm. The NADH docks near the tip of the matrix
arm, where it transfers its electrons via a bound flavin mononucleotide to a string
of iron-sulfur clusters that runs down the arm, acting like a wire to carry electrons
to a protein-bound molecule of ubiquinone. Electron transfer to the quinone
is thought to trigger proton translocation in a set of proton pumps in the
membrane arm, and for this to happen the two processes must be energetically and
mechanically linked. A mechanical link is thought to be provided by a 6-nm long,
amphipathic α helix that runs parallel to the membrane surface on the matrix
side of the membrane arm. This helix may act like the connecting rod in a steam
engine to generate a mechanical, energy-transducing power stroke that links the
quinone-binding site to the proton-translocating modules in the membrane (see
Figure 14-20).

The reduction of each quinone by the transfer of two electrons can cause four
protons to be pumped out of the matrix into the crista space. In this way, NADH
dehydrogenase generates roughly half of the total proton-motive force in
mitochondria.
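
The share attributed to the NADH dehydrogenase can be estimated by tallying the protons pumped per pair of electrons. The sketch below is only a rough accounting under stated assumptions: it takes four protons per two electrons for the NADH dehydrogenase from the paragraph above, and two and one protons per electron for the other two complexes from the caption of Figure 14-19. The dictionary and variable names are hypothetical.

```python
# Protons pumped per pair of electrons (that is, per NADH oxidized).
# Assumed values: 4 for the NADH dehydrogenase (from the paragraph above),
# and 2 and 1 protons per electron for the other two complexes
# (from the caption of Figure 14-19), doubled for two electrons.
PROTONS_PER_ELECTRON_PAIR = {
    "NADH dehydrogenase": 4,
    "cytochrome c reductase": 2 * 2,
    "cytochrome c oxidase": 2 * 1,
}

total = sum(PROTONS_PER_ELECTRON_PAIR.values())
shares = {name: count / total for name, count in PROTONS_PER_ELECTRON_PAIR.items()}

for name, share in shares.items():
    count = PROTONS_PER_ELECTRON_PAIR[name]
    print(f"{name}: {count} H+ per electron pair ({share:.0%} of {total})")
```

On these numbers the NADH dehydrogenase accounts for four of the ten protons pumped per NADH, a somewhat smaller share than the "roughly half" quoted above; the exact stoichiometries are still a matter of measurement, so the tally should be read as an approximation.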


Figure 14-19 Redox potential changes along the mitochondrial electron-transport 
chain. The redox potential (designated E′₀) increases as electrons flow down the 
respiratory chain to oxygen. The standard free-energy change in kilojoules, ΔG°, 
for the transfer of each of the two electrons donated by an NADH molecule can be 
obtained from the left-hand ordinate [ΔG° = −n(0.096)ΔE′₀, where n is the number 
of electrons transferred across a redox potential change of ΔE′₀ mV]. Electrons 
flow through a respiratory enzyme complex by passing in sequence through the 
multiple electron carriers in each complex (blue arrows). As indicated, part of the 
favorable free-energy change is harnessed by each enzyme complex to pump H+ 
across the inner mitochondrial membrane (red arrows). The NADH dehydrogenase 
pumps up to four H+ per pair of electrons (two per electron), the cytochrome c 
reductase complex pumps two per electron, and the cytochrome c oxidase complex 
pumps one per electron. 

Note that NADH is not the only source of electrons for the respiratory chain. 
The flavin FADH2, which is generated by fatty acid oxidation (see Figure 2-56) 
and by succinate dehydrogenase in the citric acid cycle (see Figure 2-57), also 
contributes. Its two electrons are passed directly to ubiquinone, bypassing NADH 
dehydrogenase. 
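
The relation quoted in the caption of Figure 14-19 can be tried out numerically. The 
following is a minimal sketch, not part of the original figure: the two midpoint 
potentials are commonly quoted textbook values that this section does not state, and 
the function and constant names are ours. 

```python
# Conversion factor from the caption: 0.096 kJ per mole of electrons per millivolt
# (the Faraday constant, about 96.5 kJ/(V*mol), expressed per mV and rounded).
KJ_PER_MOL_PER_MV = 0.096


def standard_free_energy_change(n_electrons: int, delta_e_mv: float) -> float:
    """Return dG0 in kJ/mol for n electrons crossing a redox step of delta_e_mv."""
    return -n_electrons * KJ_PER_MOL_PER_MV * delta_e_mv


# Assumed midpoint potentials (illustrative, not given in this section):
E_NADH_MV = -320.0  # NAD+/NADH couple
E_O2_MV = 820.0     # O2/H2O couple

delta_e = E_O2_MV - E_NADH_MV
dg_pair = standard_free_energy_change(2, delta_e)
print(f"dE'0 = {delta_e:.0f} mV, dG0 for two electrons = {dg_pair:.1f} kJ/mol")
```

Under these assumed potentials, the two electrons of one NADH release a little over 
200 kJ/mole on their way to oxygen, which is the energy budget that the three proton 
pumps draw on. 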


Cytochrome c Reductase Takes Up and Releases Protons on the 
Opposite Side of the Crista Membrane, Thereby Pumping Protons 

As described previously, when a quinone molecule (Q) accepts its two electrons, 
it also takes up two protons to form a quinol (QH2; see Figure 14-17). In 
the respiratory chain, ubiquinol transfers electrons from NADH dehydrogenase to 
cytochrome c reductase. Because the protons in this QH2 molecule are taken up 
from the matrix and released on the opposite side of the crista membrane, two 
protons are transferred from the matrix into the crista space per pair of electrons 
transferred (Figure 14-21). This vectorial transfer of protons supplements the 
electrochemical proton gradient that is created by the NADH dehydrogenase proton 
pumping just discussed. 

Figure 14-20 The structure of NADH dehydrogenase. (A) The model of the 
mitochondrial complex shown here is based on the x-ray structure of the smaller 
bacterial complex, which works in the same way. The matrix arm of NADH 
dehydrogenase (also known as Complex I) contains eight iron-sulfur (FeS) clusters 
that appear to participate in electron transport. The membrane arm contains more 
than 70 transmembrane helices, forming three distinct proton-pumping modules, 
while the matrix arm contains the electron-transport cofactors. (B) NADH donates 
two electrons, via a bound flavin mononucleotide (FMN; yellow), to a chain of seven 
iron-sulfur clusters (red and yellow spheres). From the terminal iron-sulfur cluster, 
the electrons pass to ubiquinone (orange). Electron transfer results in 
conformational changes (black arrows) that are thought to be transmitted to a long 
amphipathic α helix (purple) on the matrix side of the membrane arm, which pulls on 
discontinuous transmembrane helices (red) in three membrane subunits, each of which 
resembles an antiporter (see Chapter 11). This movement is thought to change the 
conformation of charged residues in the three proton channels, resulting in the 
translocation of three protons out of the matrix. A fourth proton may be 
translocated at the interface of the two arms (dotted line). (C) This shows the 
symbol for NADH dehydrogenase used throughout this chapter. (Adapted from 
R.G. Efremov, R. Baradaran and L.A. Sazanov, Nature 465:441-445, 2010. 
PDB code: 3M9S.) 

Cytochrome c reductase is a large assembly of membrane protein subunits. 
Three subunits form a catalytic core that passes electrons from ubiquinol to 
cytochrome c, with a structure that has been highly conserved from bacterial 
ancestors (Figure 14-22). It pumps protons by a vectorial transfer of protons 
that involves a binding site for a second molecule of ubiquinone; the elaborate 
redox loop mechanism used is called the Q cycle because while one of the electrons 
received from each QH2 molecule is transferred through the complex to the carrier 
protein cytochrome c, the other electron is recycled back into the quinone pool. 
Through the mechanism illustrated in Figure 14-23, the Q cycle increases the total 
amount of redox energy that can be stored in the electrochemical proton gradient. 
As a result, two protons are pumped across the crista membrane for every electron 
that is transferred from NADH dehydrogenase to cytochrome c. 

Figure 14-21 How a directional release and uptake of protons by a quinone pumps 
protons across a membrane. Two protons are picked up on the matrix side of the 
inner mitochondrial membrane when the reaction Q + 2e− + 2H+ → QH2 is catalyzed 
by the NADH dehydrogenase complex. This molecule of ubiquinol (QH2) diffuses 
rapidly in the plane of the membrane, becoming bound to the crista side of 
cytochrome c reductase. When its oxidation by cytochrome c reductase generates two 
protons and two electrons (see Figure 14-17), the two protons are released into the 
crista space. The flow of electrons is not shown in this diagram. 

Figure 14-22 The structure of cytochrome c reductase. Cytochrome c reductase (also 
known as the cytochrome b-c1 complex) is a dimer of two identical 240,000-dalton 
halves, each composed of 11 different protein molecules in mammals. (A) A structure 
graphic of the entire dimer, showing in color the three proteins that form the 
functional core of the enzyme complex: cytochrome b (green) and cytochrome c1 (blue) 
are colored in one half, and the Rieske protein (purple) containing an Fe2S2 
iron-sulfur cluster (red and yellow) is colored in the other. These three protein 
subunits interact across the two halves. (B) Transfer of electrons through 
cytochrome c reductase to the small, soluble carrier protein cytochrome c. Electrons 
entering from ubiquinol near the crista side of the membrane are captured by the 
iron-sulfur cluster of the Rieske protein, which moves its iron-sulfur group back 
and forth to transfer these electrons to heme c1 (red). Heme c1 then transfers them 
to the carrier molecule cytochrome c. As detailed in Figure 14-23, only one of the 
two electrons from each ubiquinol is transferred through this path. To increase 
proton pumping, the second ubiquinol electron is passed to a molecule of ubiquinone 
bound to cytochrome c reductase on the opposite side of the membrane, near the 
matrix. (C) This shows the symbol for cytochrome c reductase used throughout this 
chapter. (PDB code: 1EZV.) 
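
The proton and electron bookkeeping of the Q cycle is easy to lose track of, so it 
may help to tally it explicitly. The sketch below follows the two steps described 
for Figure 14-23; the class and function names are hypothetical, and the model 
tracks counts only, not energetics or kinetics. 

```python
from dataclasses import dataclass


@dataclass
class QCycleTally:
    """Running totals for one cytochrome c reductase monomer."""
    protons_released_crista: int = 0
    protons_taken_from_matrix: int = 0
    cytochrome_c_reduced: int = 0
    ubiquinol_oxidized: int = 0
    ubiquinol_regenerated: int = 0
    semiquinone_bound: bool = False  # state of the matrix-side quinone site


def oxidize_one_ubiquinol(t: QCycleTally) -> None:
    """One step: a QH2 gives up 2 H+ (to the crista space) and 2 electrons."""
    t.ubiquinol_oxidized += 1
    t.protons_released_crista += 2
    t.cytochrome_c_reduced += 1       # first electron: FeS -> heme c1 -> cytochrome c
    t.protons_taken_from_matrix += 1  # second electron is joined by one matrix proton
    if t.semiquinone_bound:
        t.ubiquinol_regenerated += 1  # semiquinone + e- + H+ -> QH2, released to bilayer
        t.semiquinone_bound = False
    else:
        t.semiquinone_bound = True    # quinone + e- + H+ -> bound semiquinone


tally = QCycleTally()
oxidize_one_ubiquinol(tally)  # step 1
oxidize_one_ubiquinol(tally)  # step 2

net_qh2 = tally.ubiquinol_oxidized - tally.ubiquinol_regenerated
per_electron = tally.protons_released_crista / tally.cytochrome_c_reduced
print(f"net QH2 consumed: {net_qh2}")
print(f"cytochrome c reduced: {tally.cytochrome_c_reduced}")
print(f"H+ released to crista space: {tally.protons_released_crista}")
print(f"H+ taken up from matrix at the matrix-side site: {tally.protons_taken_from_matrix}")
print(f"H+ released per electron reaching cytochrome c: {per_electron}")
```

Two ubiquinols are oxidized but one is regenerated, so the net consumption is one 
QH2 per two cytochrome c molecules reduced, in line with the two protons per 
electron stated in the text. 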

The Cytochrome c Oxidase Complex Pumps Protons and 
Reduces O2 Using a Catalytic Iron-Copper Center 

The final link in the mitochondrial electron-transport chain is cytochrome c 
oxidase. The cytochrome c oxidase complex accepts electrons from the soluble 
electron carrier cytochrome c, and it uses yet a different, third mechanism to pump 
protons across the inner mitochondrial membrane. The structure of the mammalian 
complex is illustrated in Figure 14-24. The atomic-resolution structures, 
combined with studies of the effect of mutations introduced into the enzyme by 
genetic engineering of the yeast and bacterial proteins, have revealed the detailed 
mechanisms of this electron-driven proton pump. 




















Because oxygen has a high affinity for electrons, it can release a large amount 
of free energy when it is reduced to form water. Thus, the evolution of cellular 
respiration, in which O2 is converted to water, enabled organisms to harness much 
more energy than can be derived from anaerobic metabolism. As we discuss later, 
the availability of the large amount of energy released by the reduction of 
molecular oxygen to form water is thought to have been essential to the emergence of 
multicellular life: this would explain why all large organisms respire. The ability 
of biological systems to use O2 in this way, however, requires sophisticated 
chemistry. Once a molecule of O2 has picked up one electron, it forms a superoxide 
radical anion (O2•−) that is dangerously reactive and rapidly takes up an additional 
three electrons wherever it can get them, with destructive effects on its immediate 
environment. We can tolerate oxygen in the air we breathe only because 
the uptake of the first electron by the O2 molecule is slow, allowing cells to use 
enzymes to control electron uptake by oxygen. Thus, cytochrome c oxidase holds 
on to oxygen at a special bimetallic center, where it remains clamped between a 
heme-linked iron atom and a copper ion until it has picked up a total of four 
electrons. Only then are the two oxygen atoms of the oxygen molecule safely released 
as two molecules of water (Figure 14-25). 

Figure 14-23 The two-step mechanism of the cytochrome c reductase Q cycle. 
(A) In step 1, ubiquinol reduced by NADH dehydrogenase docks to the cytochrome c 
reductase complex. Oxidation of the quinol produces two protons and two electrons. 
The protons are released into the crista space. One electron passes via an 
iron-sulfur cluster to heme c1, and then to the soluble electron carrier protein 
cytochrome c on the membrane surface. The second electron passes via hemes bL and 
bH to a ubiquinone (red Q) bound at a separate site near the matrix side of the 
protein. Uptake of a proton from the matrix produces a ubisemiquinone radical (see 
Figure 14-17), which remains bound to this site (red QH• in B). (B) In step 2, a 
second ubiquinol (blue QH2) docks and releases two protons and two electrons, as 
described for step 1. One electron is passed to a second cytochrome c, whereas the 
other electron is accepted by the ubisemiquinone. The ubisemiquinone takes up a 
proton from the matrix and is released into the lipid bilayer as fully reduced 
ubiquinol (red QH2). On balance, the oxidation of one ubiquinol in the Q cycle pumps 
two protons through the membrane by a directional release and uptake of protons 
(see Figure 14-21), while releasing another two into the crista space. In addition, 
in each of the two steps (A) and (B), one electron is transferred to a cytochrome c 
carrier (Movie 14.4). 

Figure 14-24 The structure of cytochrome c oxidase. The final complex in the 
mitochondrial electron-transport chain consists of 13 different protein subunits, 
with a total mass of 204,000 daltons. (A) The entire dimeric complex is shown, 
positioned in the crista membrane. The highly conserved subunits I (green), 
II (purple), and III (blue) are encoded by the mitochondrial genome, and they form 
the functional core of the enzyme. (B) The functional core of the complex. Electrons 
pass through this structure from cytochrome c via bound copper ions (blue spheres) 
and hemes (red) to an O2 molecule bound between heme a3 and a copper ion. The four 
protons needed to reduce O2 to water are taken up from the matrix; see also 
Figure 14-25. (C) This shows the symbol for cytochrome c oxidase used throughout 
this chapter. (PDB code: 2OCC.) 

The cytochrome c oxidase reaction accounts for about 90% of the total oxygen 
uptake in most cells. This protein complex is therefore crucial for all aerobic life. 
Cyanide and azide are extremely toxic because they bind to the heme iron atoms 
in cytochrome c oxidase much more tightly than does oxygen, thereby greatly 
reducing ATP production. 

The Respiratory Chain Forms a Supercomplex in the Crista 
Membrane 

By using cryoelectron microscopy to examine proteins that have been very gently 
isolated, it can be shown that the three protein complexes that form the respiratory 
chain assemble into an even larger supercomplex in the crista membrane. 
As illustrated in Figure 14-26, this structure is thought to help the mobile 
electron carriers ubiquinone (in the crista membrane) and cytochrome c (in the crista 
space) transfer electrons with high efficiency. The formation of the supercomplex 
depends on the presence of the mitochondrial lipid cardiolipin (see Figure 
14-11), which presumably works like a hydrophobic glue that holds the components 
together. 

In addition to the three proton pumps in the supercomplex just discussed, 
one of the enzymes in the citric acid cycle, succinate dehydrogenase, is embedded 
in the mitochondrial crista membrane. In the course of oxidizing succinate 
to fumarate in the matrix, this enzyme complex captures electrons in the form of 
a tightly bound FADH2 molecule (see Panel 2-9, pp. 106-107) and passes them to 
a molecule of ubiquinone. The reduced ubiquinol then passes its two electrons 
to the respiratory chain via cytochrome c reductase (see Figure 14-18). Succinate 
dehydrogenase is not a proton pump, and it does not contribute directly to the 
electrochemical potential utilized for ATP production in mitochondria. Thus, it is 
not considered to be an integral part of the respiratory chain. 

Figure 14-25 The reaction of O2 with electrons in cytochrome c oxidase. Electrons 
from cytochrome c pass through the complex via bound copper ions (blue spheres) and 
hemes (red) to an O2 molecule bound between heme a3 and a copper ion. Iron ions are 
shown as red spheres. The iron atom in heme a serves as an electron queuing point 
where electrons are held so that they can be released to an O2 molecule (not shown) 
that is held at the bimetallic center active site, which is formed by the central 
iron of the other heme (heme a3) and a closely apposed copper atom. The four protons 
needed to reduce O2 to water are removed from the matrix. For each O2 molecule that 
undergoes the reaction 4e− + 4H+ + O2 → 2H2O, another four protons are pumped out of 
the matrix by mechanisms that are driven by allosteric changes in protein 
conformation (see Figure 14-28). 

Protons Can Move Rapidly Through Proteins Along Predefined 
Pathways 

The protons in water are highly mobile: by rapidly dissociating from one water 
molecule and associating with its neighbor, they can rapidly flit through a 
hydrogen-bonded network of water molecules (see Figure 2-5). But how can a proton 
move through the hydrophobic interior of a protein embedded in the lipid 
bilayer? Proton-translocating proteins contain so-called proton wires, which are 
rows of polar or ionic side chains, or water molecules spaced at short distances, 
so that the protons can jump from one to the next (Figure 14-27). Along such 
predefined pathways, protons move up to 40 times faster than through bulk water. 
The three-dimensional structure of cytochrome c oxidase indicates two different 
proton-uptake pathways. This confirmed earlier mutagenesis studies, which had 
shown that replacing the side chains of particular aspartate or arginine residues, 
whose side chains can bind and release protons, made the cytochrome c oxidase 
less efficient as a proton pump. 

But how can electron transport cause allosteric changes in protein conformations 
that pump protons? From the most basic point of view, if electron transport 
drives sequential allosteric changes in protein conformation that alter the redox 
state of the components, these conformational changes can be connected to proton 
wires that allow the protein to pump H+ across the crista membrane. This 
type of H+ pumping requires at least three distinct conformations for the pump 
protein, as schematically illustrated in Figure 14-28. 
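
The logic of a pump that cycles through three conformations in one direction only 
can be written down as a small state machine. This is a toy sketch under loud 
assumptions: the conformation labels, the "occluded" intermediate, and the mapping 
of one conformational step to one electron-transfer event are all hypothetical and 
chosen only to make the cycle explicit. 

```python
from itertools import cycle

# Hypothetical labels. The text requires only that there be at least three
# conformations; the names and the occluded intermediate are assumptions.
CONFORMATIONS = ("inward_open_high_affinity", "occluded", "outward_open_low_affinity")


def run_pump(electron_transfer_events: int) -> int:
    """Advance the pump one conformation per electron-transfer event.

    Returns the number of protons released on the outside of the membrane.
    The cycle is strictly one-directional, which stands in for the coupling to
    energetically favorable electron transport.
    """
    states = cycle(CONFORMATIONS)
    state = next(states)          # start in the inward-open conformation
    proton_bound = False
    protons_out = 0
    for _ in range(electron_transfer_events):
        if state == "inward_open_high_affinity":
            proton_bound = True   # high affinity: picks up H+ on the inside
        state = next(states)      # conformational change driven by electron transfer
        if state == "outward_open_low_affinity" and proton_bound:
            proton_bound = False  # low affinity: releases H+ on the outside
            protons_out += 1
    return protons_out


one_cycle = run_pump(3)
ten_cycles = run_pump(30)
print(one_cycle, ten_cycles)
```

Because the states are visited in a fixed order, a proton picked up on the inside 
can only leave on the outside; running the same cycle backward would require the 
driving reaction to run backward as well. 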


Figure 14-27 Proton movement through water and proteins. (A) Protons 
move rapidly through water, hopping from one H2O molecule to the next 
by the continuous formation and dissociation of hydronium ions, H3O+ (see 
Chapter 2). In this diagram, proton jumps are indicated by red arrows. 
(B) Protons can move even more rapidly through a protein along “proton 
wires.” These are predefined proton paths consisting of suitably spaced 
amino acid side chains that accept and release protons easily (Asp, Glu) 
or carry a waterlike hydroxyl group (Ser, Thr), along with water molecules 
trapped in the protein interior. 


Figure 14-26 The respiratory-chain supercomplex from bovine heart mitochondria. 
The three proton-pumping complexes of the mitochondrial respiratory chain of 
mammalian mitochondria assemble into large supercomplexes in the crista membrane. 
Supercomplexes can be isolated by mild detergent treatment of mitochondria, and 
their structure has been deciphered by single-particle cryoelectron microscopy. The 
bovine heart supercomplex has a total mass of 1.7 megadaltons. Shown is a schematic 
of such a complex that consists of NADH dehydrogenase, cytochrome c reductase, and 
cytochrome c oxidase, as indicated. The facing quinol-binding sites of NADH 
dehydrogenase and cytochrome c reductase, plus the short distance between the 
cytochrome c-binding sites in cytochrome c reductase and cytochrome c oxidase, 
facilitate fast, efficient electron transfer. Cofactors active in electron transport 
are marked as a yellow dot (flavin mononucleotide), red and yellow dots (iron-sulfur 
clusters), Q (quinone), red squares (hemes), and a blue dot (copper atom). Only 
cofactors participating in the linear flow of electrons from NADH to water are 
shown. Blue arrows indicate the path of the electrons through the supercomplex. 
(Adapted from T. Althoff et al., EMBO J. 30:4652-4664, 2011.) 

Summary 

The respiratory chain embedded in the inner mitochondrial membrane contains 
three respiratory enzyme complexes, through which electrons pass on their way 
from NADH to O2. In these complexes, electrons are transferred along a series of 
protein-bound electron carriers, including hemes and iron-sulfur clusters. The energy 
released as the electrons move to lower and lower energy levels is used to pump 
protons by different mechanisms in the three respiratory enzyme complexes, each 
coupling lateral electron transport to vectorial proton transport across the 
membrane. Electrons are shuttled between enzyme complexes by the mobile electron 
carriers ubiquinone and cytochrome c to complete the electron-transport chain. The 
path of electron flow is NADH → NADH dehydrogenase complex → ubiquinone 
→ cytochrome c reductase → cytochrome c → cytochrome c oxidase complex → 
molecular oxygen (O2). 
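
The proton stoichiometries quoted for the three complexes in this section can be 
combined into a total per NADH. The tally below is a rough sketch: it uses the upper 
figure of four protons per electron pair for NADH dehydrogenase, counts only protons 
moved out of the matrix by pumping or by the Q cycle, and ignores the matrix protons 
consumed in forming water. The dictionary keys and variable names are ours. 

```python
# Protons moved out of the matrix per electron, as given in this section.
PROTONS_PER_ELECTRON = {
    "NADH dehydrogenase": 2.0,      # four protons per two electrons (upper figure)
    "cytochrome c reductase": 2.0,  # two per electron via the Q cycle
    "cytochrome c oxidase": 1.0,    # four pumped per O2, which accepts four electrons
}

ELECTRONS_PER_NADH = 2

per_nadh = {name: rate * ELECTRONS_PER_NADH for name, rate in PROTONS_PER_ELECTRON.items()}
total = sum(per_nadh.values())

for name, protons in per_nadh.items():
    print(f"{name}: {protons:.0f} H+ per NADH ({protons / total:.0%} of total)")
print(f"total: {total:.0f} H+ per NADH")
```

On these numbers NADH dehydrogenase accounts for four of ten protons, which is 
consistent with the statement that it generates roughly half of the proton-motive 
force. 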

ATP PRODUCTION IN MITOCHONDRIA 

As we have just discussed, the three proton pumps of the respiratory chain each 
contribute to the formation of an electrochemical proton gradient across the inner 
mitochondrial membrane. This gradient drives ATP synthesis by ATP synthase, a 
large membrane-bound protein complex that performs the extraordinary feat of 
converting the energy contained in this electrochemical gradient into biologically 
useful, chemical-bond energy in the form of ATP (see Figure 14-10). Protons flow 
down their electrochemical gradient through the membrane part of this proton 
turbine, thereby driving the synthesis of ATP from ADP and Pi in the 
extramembranous part of the complex. As discussed in Chapter 2, the formation of ATP 
from ADP and inorganic phosphate is highly unfavorable energetically. As we shall see, 
ATP synthase can produce ATP only because of allosteric shape changes in this 
protein complex that directly couple ATP synthesis to the energetically favorable 
flow of protons across its membrane. 

The Large Negative Value of ΔG for ATP Hydrolysis Makes ATP 
Useful to the Cell 

An average person turns over roughly 50 kg of ATP per day. In athletes running a 
marathon, this figure can go up to several hundred kilograms. The ATP produced 
in mitochondria is derived from the energy available in the intermediates NADH, 
FADH2, and GTP. These three energy-rich compounds are produced both by the 
oxidation of glucose (Table 14-1A), and by the oxidation of fats (Table 14-1B; see 
also Figure 2-56). 

Figure 14-28 A general model for H+ pumping coupled to electron transport. 
This mechanism for H+ pumping by a transmembrane protein is thought to be used by 
NADH dehydrogenase and cytochrome c oxidase, and by many other proton pumps. The 
protein is driven through a cycle of three conformations. In one of these 
conformations, the protein has a high affinity for H+, causing it to pick up an H+ 
on the inside of the membrane. In another conformation, the protein has a low 
affinity for H+, causing it to release an H+ on the outside of the membrane. As 
indicated, the transitions from one conformation to another occur only in one 
direction, because they are driven by allosteric coupling to the energetically 
favorable process of electron transport (discussed in Chapter 11). 

Table 14-1 Product Yields from the Oxidation of Sugars and Fats 

A. Net products from oxidation of one molecule of glucose 

In cytosol (glycolysis) 

1 glucose → 2 pyruvate + 2 NADH + 2 ATP 

In mitochondrion (pyruvate dehydrogenase and citric acid cycle) 

2 pyruvate → 2 acetyl CoA + 2 NADH 
2 acetyl CoA → 6 NADH + 2 FADH2 + 2 GTP 

Net result in mitochondrion 

2 pyruvate → 8 NADH + 2 FADH2 + 2 GTP 

B. Net products from oxidation of one molecule of palmitoyl CoA (activated form 
of palmitate, a fatty acid) 

In mitochondrion (fatty acid oxidation and citric acid cycle) 

1 palmitoyl CoA → 8 acetyl CoA + 7 NADH + 7 FADH2 
8 acetyl CoA → 24 NADH + 8 FADH2 + 8 GTP 

Net result in mitochondrion 

1 palmitoyl CoA → 31 NADH + 15 FADH2 + 8 GTP 

Glycolysis alone can produce only two molecules of ATP for every molecule of 
glucose that is metabolized, and this is the total energy yield for the fermentation 
processes that occur in the absence of O2 (discussed in Chapter 2). In oxidative 
phosphorylation, each pair of electrons donated by the NADH produced in 
mitochondria can provide energy for the formation of about 2.5 molecules of ATP. 
Oxidative phosphorylation also produces 1.5 ATP molecules per electron pair from 
the FADH2 produced by succinate dehydrogenase in the mitochondrial matrix, 
and from the NADH molecules produced by glycolysis in the cytosol. From the 
product yields of glycolysis and the citric acid cycle, we can calculate that the 
complete oxidation of one molecule of glucose, starting with glycolysis and ending 
with oxidative phosphorylation, gives a net yield of about 30 molecules of 
ATP. Nearly all this ATP is produced by the mitochondrial ATP synthase. 
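
The figure of about 30 ATP per glucose can be checked by combining the product 
yields of Table 14-1 with the ATP values just given. The sketch below makes two 
assumptions that the text does not spell out: each GTP is counted as one ATP, and 
the value of 1.5 ATP per FADH2 is applied to the FADH2 from fatty acid oxidation as 
well as to that from succinate dehydrogenase. The function and constant names are 
ours. 

```python
# ATP per carrier, taken from the text above.
ATP_PER_MITO_NADH = 2.5
ATP_PER_FADH2 = 1.5
ATP_PER_CYTOSOLIC_NADH = 1.5
ATP_PER_GTP = 1.0  # assumption: one GTP is counted as one ATP


def atp_yield(atp=0, cytosolic_nadh=0, mito_nadh=0, fadh2=0, gtp=0):
    """Sum ATP equivalents for a set of products from Table 14-1."""
    return (atp
            + cytosolic_nadh * ATP_PER_CYTOSOLIC_NADH
            + mito_nadh * ATP_PER_MITO_NADH
            + fadh2 * ATP_PER_FADH2
            + gtp * ATP_PER_GTP)


# Table 14-1A: glycolysis (cytosol) plus the net result in the mitochondrion.
glucose = atp_yield(atp=2, cytosolic_nadh=2, mito_nadh=8, fadh2=2, gtp=2)

# Table 14-1B: net result in the mitochondrion for one palmitoyl CoA.
palmitoyl_coa = atp_yield(mito_nadh=31, fadh2=15, gtp=8)

print(f"glucose: about {glucose:.0f} ATP")
print(f"palmitoyl CoA: about {palmitoyl_coa:.0f} ATP")
```

The glucose total reproduces the value of about 30 quoted in the text. The 
palmitoyl CoA total is only as good as the two assumptions above and does not 
include the cost of activating palmitate in the first place. 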

In Chapter 2, we introduced the concept of free energy (G). The free-energy 
change for a reaction, ΔG, determines whether that reaction will occur in a cell. 
We showed on pp. 60-63 that the ΔG for a given reaction can be written as the sum 
of two parts: the first, called the standard free-energy change, ΔG°, depends only 
on the intrinsic characters of the reacting molecules; the second depends only on 
their concentrations. For the simple reaction A → B, 

$$\Delta G = \Delta G^{\circ} + RT \ln \frac{[\mathrm{B}]}{[\mathrm{A}]}$$ 

where [A] and [B] denote the concentrations of A and B, and ln is the natural 
logarithm. ΔG° is the standard reference value, which can be seen to be equal to the 
value of ΔG when the molar concentrations of A and B are equal (since ln 1 = 0). 

In Chapter 2, we discussed how the large, favorable free-energy change (large 
negative ΔG) for ATP hydrolysis is used, through coupled reactions, to drive 
many other chemical reactions in the cell that would otherwise not occur (see 
pp. 65-66). The ATP hydrolysis reaction produces two products, ADP and Pi; it is 
therefore of the type A → B + C, where, as demonstrated in Figure 14-29, 

$$\Delta G = \Delta G^{\circ} + RT \ln \frac{[\mathrm{B}][\mathrm{C}]}{[\mathrm{A}]}$$ 


When ATP is hydrolyzed to ADP and Pi under the conditions that normally 
exist in a cell, the free-energy change is roughly -46 to -54 kJ/mole (-11 to -13 
kcal/mole). This extremely favorable ΔG depends on maintaining a high 
concentration of ATP compared with the concentrations of ADP and Pi. When ATP, ADP, 
and Pi are all present at the same concentration of 1 mole/liter (so-called standard 
conditions), the ΔG for ATP hydrolysis drops to the standard free-energy change 
(ΔG°), which is only -30.5 kJ/mole (-7.3 kcal/mole). At much lower concentrations 
of ATP relative to ADP and Pi, ΔG becomes zero. At this point, the rate at 
which ADP and Pi will join to form ATP will be equal to the rate at which ATP 
hydrolyzes to form ADP and Pi. In other words, when ΔG = 0, the reaction is at 
equilibrium (see Figure 14-29). 

It is ΔG, not ΔG°, that indicates how far a reaction is from equilibrium and 
determines whether it can drive other reactions. Because the efficient conversion 
of ADP to ATP in mitochondria maintains such a high concentration of ATP 
relative to ADP and Pi, the ATP hydrolysis reaction in cells is kept very far from 
equilibrium and ΔG is correspondingly very negative. Without this large 
disequilibrium, ATP hydrolysis could not be used to drive the reactions of the cell. 
At low ATP concentrations, many biosynthetic reactions would run backward and the 
cell would die. 

Figure 14-29 The basic relationship between free-energy changes and equilibrium in 
the ATP hydrolysis reaction. The rate constants in boxes 1 and 2 are determined from 
experiments in which product accumulation is measured as a function of time (conc., 
concentration). The equilibrium constant shown here, K, is in units of moles per 
liter. (See Panel 2-7, pp. 102-103, for a discussion of free energy and see 
Figure 3-44 for a discussion of the equilibrium constant.) 

[Figure 14-29 panel text] 

... or abbreviated, 

$$\frac{[\mathrm{ADP}][\mathrm{P_i}]}{[\mathrm{ATP}]} = K$$ 

For the reaction 

ATP → ADP + Pi 

the following equation applies: 

$$\Delta G = \Delta G^{\circ} + RT \ln \frac{[\mathrm{ADP}][\mathrm{P_i}]}{[\mathrm{ATP}]}$$ 

where ΔG and ΔG° are in joules per mole, R is the gas constant (8.3 J/mole K), T is 
the absolute temperature (K), and all the concentrations are in moles per liter. 

When the concentrations of all reactants are at 1 M, ΔG = ΔG° (since RT ln 1 = 0). 
ΔG° is thus a constant defined as the standard free-energy change for the reaction. 

At equilibrium the reaction has no net effect on the disorder of the universe, so 
ΔG = 0. Therefore, at equilibrium, 

$$-RT \ln \frac{[\mathrm{ADP}][\mathrm{P_i}]}{[\mathrm{ATP}]} = \Delta G^{\circ}$$ 

But the concentrations of reactants at equilibrium must satisfy the equilibrium 
equation: 

$$\frac{[\mathrm{ADP}][\mathrm{P_i}]}{[\mathrm{ATP}]} = K$$ 

Therefore, at equilibrium, 

$$\Delta G^{\circ} = -RT \ln K$$ 

We thus see that whereas ΔG° indicates the equilibrium point for a reaction, ΔG 
reveals how far the reaction is from equilibrium. ΔG is a measure of the "driving 
force" for any chemical reaction, just as the proton-motive force is the driving 
force for the translocation of protons. 
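
The dependence of ΔG on concentrations is easiest to appreciate with numbers. The 
following sketch evaluates the relation between ΔG, ΔG°, and concentrations for ATP 
hydrolysis. The standard free-energy change of -30.5 kJ/mole comes from the text; 
the temperature of 310 K and the cell-like concentrations are illustrative 
assumptions, and the function names are ours. 

```python
import math

R = 8.314          # gas constant, J/(mol*K); Figure 14-29 rounds this to 8.3
DG0_ATP = -30.5e3  # standard free-energy change for ATP hydrolysis, J/mol (from the text)


def delta_g_atp_hydrolysis(atp: float, adp: float, pi: float, temp_k: float = 310.0) -> float:
    """Return dG in kJ/mol for ATP -> ADP + Pi at the given molar concentrations."""
    q = (adp * pi) / atp
    return (DG0_ATP + R * temp_k * math.log(q)) / 1000.0


def equilibrium_constant(dg0_j_per_mol: float, temp_k: float = 310.0) -> float:
    """Return K (in mol/L) from dG0 = -RT ln K."""
    return math.exp(-dg0_j_per_mol / (R * temp_k))


# Standard conditions: all species at 1 M, so dG equals dG0.
dg_standard = delta_g_atp_hydrolysis(1.0, 1.0, 1.0)
print(f"standard conditions: {dg_standard:.1f} kJ/mol")

# Hypothetical cell-like concentrations (illustrative values, not from the text).
dg_cell = delta_g_atp_hydrolysis(atp=5e-3, adp=0.5e-3, pi=5e-3)
print(f"cell-like conditions: {dg_cell:.1f} kJ/mol")

# At equilibrium [ADP][Pi]/[ATP] = K and dG = 0.
k_eq = equilibrium_constant(DG0_ATP)
print(f"K = {k_eq:.2e} M")
```

With a tenfold excess of ATP over ADP and millimolar phosphate, the computed ΔG falls 
inside the range of -46 to -54 kJ/mole given in the text, whereas K is so large that 
equilibrium would leave almost no ATP. 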


The ATP Synthase Is a Nanomachine that Produces ATP by 
Rotary Catalysis 

The ATP synthase is a finely tuned nanomachine composed of 23 or more separate 
protein subunits, with a total mass of about 600,000 daltons. The ATP synthase 
can work either in the forward direction, producing ATP from ADP and phosphate 
in response to an electrochemical gradient, or in reverse, generating an 
electrochemical gradient by ATP hydrolysis. To distinguish it from other enzymes that 
hydrolyze ATP, it is also called an F1Fo ATP synthase or F-type ATPase. 

Resembling a turbine, ATP synthase is composed of both a rotor and a stator 
(Figure 14-30). To prevent the catalytic head from rotating, a stalk at the periphery 
of the complex (the stator stalk) connects the head to stator subunits embedded 
in the membrane. A second stalk in the center of the assembly (the rotor stalk) is 
connected to the rotor ring in the membrane that turns as protons flow through it, 
driven by the electrochemical gradient across the membrane. As a result, proton 
flow makes the rotor stalk rotate inside the stationary head, where the catalytic 
sites that assemble ATP from ADP and Pi are located. Three α and three β subunits 
of similar structure alternate to form the head. Each of the three β subunits has a 
catalytic nucleotide-binding site at the α/β interface. These catalytic sites are all in 
different conformations, depending on their interaction with the rotor stalk. This 
stalk acts like a camshaft, the device that opens and closes the valves in a 
combustion engine. As it rotates within the head, the stalk changes the conformations 
of the β subunits sequentially. One of the possible conformations of the catalytic 
sites has high affinity for ADP and Pi, and as the rotor stalk pushes the binding site 
into a different conformation, these two substrates are driven to form ATP. In this 
way, the mechanical force exerted by the central rotor stalk is directly converted 
into the chemical energy of the ATP phosphate bond. 

Serving as a proton-driven turbine, the ATP synthase is driven by H+ flow into 
the matrix to spin at about 8000 revolutions per minute, generating three 
molecules of ATP per turn. In this way, each ATP synthase can produce roughly 400 
molecules of ATP per second. 
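
The production rate follows directly from the rotation speed and the number of ATP 
molecules made per turn. A short check, using only the two numbers given in the text 
(the variable names are ours): 

```python
REVOLUTIONS_PER_MINUTE = 8000  # from the text
ATP_PER_REVOLUTION = 3         # from the text

atp_per_second = REVOLUTIONS_PER_MINUTE / 60 * ATP_PER_REVOLUTION
ms_per_revolution = 60_000 / REVOLUTIONS_PER_MINUTE
ms_per_atp = ms_per_revolution / ATP_PER_REVOLUTION

print(f"{atp_per_second:.0f} ATP per second per synthase")
print(f"{ms_per_revolution:.1f} ms per revolution, {ms_per_atp:.1f} ms per ATP")
```

At this speed the rotor completes a turn in a few milliseconds, so each catalytic 
site has only a few milliseconds to bind its substrates and release ATP. 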

Proton-driven Turbines Are of Ancient Origin 

The membrane-embedded rotors of ATP synthases consist of a ring of identical c 
subunits (Figure 14-31). Each c subunit is a hairpin of two membrane-spanning 
α helices that contain a proton-binding site defined by a glutamate or aspartate in 
the middle of the lipid bilayer. The a subunit, which is part of the stator (see Figure 
14-30), makes two narrow channels at the interface between the rotor and stator, 
each spanning half of the membrane and converging on the proton-binding site 
at the middle of the rotor subunit. Protons flow through the two half-channels 
down their electrochemical gradient from the crista space back into the matrix. 
A negatively charged side chain in the binding site accepts a proton arriving from 
the crista space through the first half-channel, as it rotates past the a subunit. The 
bound proton then rides round in the ring for a full cycle, whereupon it is thought 
to be displaced by a positively charged arginine in the a subunit, and escapes 
through the second half-channel into the matrix. Thus proton flow causes the 
rotor ring to spin against the stator like a proton-driven turbine. 

Figure 14-30 ATP synthase. The three-dimensional structure of the F1Fo ATP 
synthase, determined by x-ray crystallography. Also known as an F-type ATPase, it 
consists of an Fo part (from “oligomycin-sensitive factor”) in the membrane and the 
large, catalytic F1 head in the matrix. Under mild dissociation conditions, this 
complex separates into its F1 and Fo components, which can be isolated and studied 
individually. (A) Diagram of the enzyme complex showing how its globular head 
portion (green) is kept stationary as proton flow across the membrane drives a rotor 
(blue) that turns inside it. (B) In bovine heart mitochondria, the Fo rotor ring in 
the membrane (light blue) has eight c subunits. It is attached to the γ subunit of 
the central stalk (dark blue) by the ε subunit (purple). The catalytic F1 head 
consists of a ring of three α and three β subunits (light and dark green), and it 
directly converts mechanical energy into chemical-bond energy in ATP, as described 
in the text. The elongated peripheral stalk of the stator (orange) is connected to 
the F1 head by the small δ subunit (red) at one end, and to the a subunit in the 
membrane (pink oval) at the other. Together with the c subunits of the ring rotating 
past it, the a subunit creates a path for protons through the membrane. (C) The 
symbol for ATP synthase used throughout this book. 

The closely related ATP synthases of mitochondria, chloroplasts, and bacteria 
synthesize ATP by harnessing the proton-motive force across a membrane. This powers 
the rotation of the rotor against the stator in a counterclockwise direction, as 
seen from the head. The same enzyme complex can also pump protons against their 
electrochemical gradient by hydrolyzing ATP, which then drives the clockwise 
rotation of the rotor. The direction of operation depends on the net free-energy 
change (ΔG) for the coupled processes of H+ translocation across the membrane and 
the synthesis of ATP from ADP and Pi (Movie 14.5 and Movie 14.6). 

Measurement of the torque that the ATP synthase can produce by ATP hydrolysis 
reveals that the ATP synthase is 60 times more powerful than a diesel engine of 
equal dimensions. (B, courtesy of K. Davies. PDB codes: 2WPD, 2CLY, 2WSS, 2BO5.) 

The mitochondrial ATP synthase is of ancient origin: essentially the same
enzyme occurs in plant chloroplasts and in the plasma membrane of bacteria or
archaea. The main difference between them is the number of c subunits in the
rotor ring. In mammalian mitochondria, the ring has 8 subunits. In yeast
mitochondria, the number is 10; in bacteria and archaea, it ranges from 11 to 13; in
plant chloroplasts, there are 14; and the rings of some cyanobacteria contain 15 c
subunits.

The c subunits in the rotor ring can be thought of as cogs in the gears of a
bicycle. A high gear, with a small number of cogs, is advantageous when the supply
of protons is limited, as in mitochondria, but a low gear, with a large number of
cogs in the wheel, is preferable when the proton gradient is high. This is the case
in chloroplasts and cyanobacteria, where protons produced through the action of
sunlight are plentiful. Because each rotation produces three molecules of ATP in
the head, the synthesis of one ATP requires around three protons in mitochondria
but up to five in photosynthetic organisms. It is the number of c subunits in the
ring that defines how many protons need to pass through this marvelous device
to make each molecule of ATP, and thereby how high a ratio of ATP to ADP can be
maintained by the ATP synthase.
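
The gearing argument can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes that exactly one proton passes per c subunit per full rotation and uses the ring sizes and the three-ATP-per-turn figure quoted in the text. The names `C_RING_SIZES` and `protons_per_atp` are invented for this example.

```python
# Ring sizes (c subunits per rotor ring) as quoted in the text.
C_RING_SIZES = {
    "mammalian mitochondria": 8,
    "yeast mitochondria": 10,
    "plant chloroplasts": 14,
    "some cyanobacteria": 15,
}

ATP_PER_ROTATION = 3  # three ATP molecules per full turn of the rotor


def protons_per_atp(c_subunits: int) -> float:
    """Protons needed per ATP, assuming one proton per c subunit per turn."""
    return c_subunits / ATP_PER_ROTATION


for source, n_subunits in C_RING_SIZES.items():
    ratio = protons_per_atp(n_subunits)
    print(f"{source}: {n_subunits} c subunits, {ratio:.2f} H+ per ATP")
```

With 8 subunits the ratio is about 2.7 ("around three" protons per ATP), and with 15 subunits it is exactly 5, matching the range given above.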

In principle, ATP synthase can also run in reverse as an ATP-powered proton
pump that converts the energy of ATP back into a proton gradient across the
membrane. In many bacteria, the rotor of the ATP synthase in the plasma membrane
changes direction routinely, from ATP synthesis mode in aerobic respiration, to
ATP hydrolysis mode in anaerobic metabolism. In this latter case, ATP hydrolysis
serves to maintain the proton gradient across the plasma membrane, which
is used to power many other essential cell functions including nutrient transport
and the rotation of bacterial flagella. The V-type ATPases that acidify certain
cellular organelles are architecturally similar to the F-type ATP synthases, but they
normally function in reverse (see Figure 13-37).


Mitochondrial Cristae Help to Make ATP Synthesis Efficient 

In the electron microscope, the mitochondrial ATP synthase complexes can be
seen to project like lollipops on the matrix side of cristae membranes. Recent
studies by cryoelectron microscopy and tomography have shown that this large
complex is not distributed randomly in the membrane, but forms long rows of
dimers along the cristae ridges (Figure 14-32). The dimer rows induce or stabilize
these regions of high membrane curvature, which are otherwise energetically
unfavorable. Indeed, the formation of ATP synthase dimers and their assembly
into rows are required for cristae formation and have far-reaching consequences
for cellular fitness. By contrast with bacterial or chloroplast ATP synthases, which
do not form dimers, the mitochondrial complex contains additional subunits,
located mostly near the membrane end of the stator stalk. Several of these
subunits are found to be dimer-specific. If these subunits are mutated in yeast, the
ATP synthase in the membrane remains monomeric, the mitochondria have no
cristae, cellular respiration drops by half, and the cells grow more slowly.


Figure 14-31 Fo ATP synthase rotor rings. (A) Atomic force microscopy
image of ATP synthase rotors from the cyanobacterium Synechococcus
elongatus in a lipid bilayer. Whereas 8 c subunits form the rotor in Figure
14-30, there are 13 c subunits in this ring. (B) The x-ray structure of the Fo
ring of the ATP synthase from Spirulina platensis, another cyanobacterium,
shows that this rotor has 15 c subunits. In all ATP synthases, the c subunits
are hairpins of two membrane-spanning α helices (one subunit is highlighted
in gray). The helices are highly hydrophobic, except for two glutamine
and glutamate side chains (yellow) that create proton-binding sites in the
membrane. (A, courtesy of Thomas Meier and Denys Pogoryelov; B, PDB
code: 2WIE.)

Electron tomography suggests that the proton pumps of the respiratory chain
are located in the membrane regions at either side of the dimer rows. Protons
pumped into the crista space by these respiratory-chain complexes are thought
to diffuse very rapidly along the membrane surface, with the ATP synthase rows
creating a proton "sink" at the cristae tips (Figure 14-33). In vitro studies suggest
that the ATP synthase needs a proton gradient of about 2 pH units to produce ATP
at the rate required by the cell, irrespective of the membrane potential. The H+
gradient across the inner mitochondrial membrane is only 0.5 to 0.6 pH units. The
cristae thus seem to work as proton traps that enable the ATP synthase to make
efficient use of the protons pumped out of the mitochondrial matrix. As we shall
see in the next section, this elaborate arrangement of membrane protein
complexes is absent in chloroplasts, where the H+ gradient is much higher.
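
To see how far apart these two pH figures are, it helps to convert them into H+ concentration ratios. The short sketch below relies only on the definition of pH (each pH unit is a tenfold change in H+ concentration). The function name `proton_concentration_ratio` is invented for this illustration.

```python
def proton_concentration_ratio(delta_ph: float) -> float:
    """Fold difference in H+ concentration across a membrane for a given pH difference."""
    return 10.0 ** delta_ph


# pH differences quoted in the text
required = proton_concentration_ratio(2.0)       # gradient needed for full-rate ATP synthesis
bulk_low = proton_concentration_ratio(0.5)       # lower bound measured across the inner membrane
bulk_high = proton_concentration_ratio(0.6)      # upper bound measured across the inner membrane

print(f"2 pH units   -> {required:.0f}-fold H+ difference")
print(f"0.5-0.6 units -> {bulk_low:.1f}- to {bulk_high:.1f}-fold H+ difference")
```

A 2-unit gradient corresponds to a 100-fold difference in H+ concentration, whereas 0.5 to 0.6 units is only about 3- to 4-fold, which is why a local proton sink at the cristae ridges would matter.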


Figure 14-32 Dimers of mitochondrial ATP synthase in cristae membranes.
(A) A three-dimensional map of a small mitochondrion obtained by electron
microscope tomography shows that ATP synthases form long paired rows along
cristae ridges. The outer membrane is gray; the inner membrane and cristae
membranes have been colored light blue. Each head of an ATP synthase is indicated
by a yellow sphere. (B) A three-dimensional map of a mitochondrial ATP synthase
dimer in the crista membrane obtained by subtomogram averaging, with fitted
x-ray structures (Movie 14.7). (A, from K. Davies et al., Proc. Natl Acad. Sci. USA
108:14121-14126, 2011. With permission from the National Academy of Sciences;
B, from K. Davies et al., Proc. Natl Acad. Sci. USA 109:13602-13607, 2012.
With permission from the National Academy of Sciences.)


Special Transport Proteins Exchange ATP and ADP Through the
Inner Membrane

Like all biological membranes, the inner mitochondrial membrane contains
numerous specific transport proteins that allow particular substances to pass
through. One of the most abundant of these is the ADP/ATP carrier protein
(Figure 14-34). This carrier shuttles the ATP produced in the matrix through the inner
membrane to the intermembrane space, from where it diffuses through the outer
mitochondrial membrane to the cytosol. In exchange, ADP passes from the
cytosol into the matrix for recycling into ATP. ATP4- has one more negative charge
than ADP3-, and the exchange of ATP and ADP is driven by the electrochemical
gradient across the inner membrane, so that the more negatively charged ATP is
pushed out of the matrix, and the less negatively charged ADP is pulled in. The
ADP/ATP carrier is but one member of a mitochondrial carrier family: the inner
mitochondrial membrane contains about 20 related carrier proteins exchanging
various other metabolites, including the phosphate that is required along with
ADP for ATP synthesis.

Figure 14-33 ATP synthase dimers at cristae ridges and ATP production. At the
crista ridges, the ATP synthases (yellow) form a sink for protons (red). The proton
pumps of the electron-transport chain (green) are located in the membrane regions
on either side of the crista. As illustrated, protons tend to diffuse along the
membrane from their source to the proton sink created by the ATP synthase. This
allows efficient ATP production despite the small H+ gradient between the cytosol
and matrix. Red arrows show the direction of the proton flow.

Figure 14-34 The ADP/ATP carrier protein. (A) The ADP/ATP carrier protein is a small
membrane protein that carries the ATP produced on the matrix side of the inner membrane to
the intermembrane space, and the ADP that is needed for ATP synthesis into the matrix. (B) In
the ADP/ATP carrier, six transmembrane α helices define a cavity that binds either ADP or ATP. In
this x-ray structure, the substrate is replaced by a tightly bound inhibitor (colored). When
ADP binds from outside the inner membrane, it triggers a conformational change and is released
into the matrix. In exchange, a molecule of ATP quickly binds to the matrix side of the carrier
and is transported to the intermembrane space. From there the ATP diffuses through the outer
mitochondrial membrane to the cytoplasm, where it powers the energy-requiring processes in the
cell. (B, PDB code: 1OKC.)

In some specialized fat cells, mitochondrial respiration is uncoupled from ATP
synthesis by the uncoupling protein, another member of the mitochondrial carrier
family. In these cells, known as brown fat cells, most of the energy of oxidation is
dissipated as heat rather than being converted into ATP. In the inner membranes
of the large mitochondria in these cells, the uncoupling protein allows protons
to move down their electrochemical gradient without passing through ATP
synthase. This process is switched on when heat generation is required, causing the
cells to oxidize their fat stores at a rapid rate and produce heat rather than ATP.
Tissues containing brown fat serve as "heating pads," helping to revive
hibernating animals and to protect newborn human babies from the cold.

Chemiosmotic Mechanisms First Arose in Bacteria 

Bacteria use enormously diverse energy sources. Some, like animal cells, are
aerobic; they synthesize ATP from sugars they oxidize to CO2 and H2O by glycolysis,
the citric acid cycle, and a respiratory chain in their plasma membrane that is
similar to the one in the inner mitochondrial membrane. Others are strict
anaerobes, deriving their energy either from glycolysis alone (by fermentation, see
Figure 2-47) or from an electron-transport chain that employs a molecule other than
oxygen as the final electron acceptor. The alternative electron acceptor can be a
nitrogen compound (nitrate or nitrite), a sulfur compound (sulfate or sulfite), or
a carbon compound (fumarate or carbonate), for example. A series of electron
carriers in the plasma membrane that are comparable to those in mitochondrial
respiratory chains transfers the electrons to these acceptors.

Despite this diversity, the plasma membrane of the vast majority of bacteria
contains an ATP synthase that is very similar to the one in mitochondria. In
bacteria that use an electron-transport chain to harvest energy, the
electron-transport chain pumps H+ out of the cell and thereby establishes a proton-motive
force across the plasma membrane that drives the ATP synthase to make ATP. In other
bacteria, the ATP synthase works in reverse, using the ATP produced by glycolysis
to pump H+ and establish a proton gradient across the plasma membrane.

Figure 14-35 The rotation of the bacterial flagellum driven by H+ flow.
The flagellum is attached to a series of protein rings (pink), which are embedded in
the outer and inner membranes and rotate with the flagellum. The rotation is driven
by a flow of protons through an outer ring of proteins (the stator) by mechanisms
that may resemble those used by the ATP synthase. However, the flow of protons
in the flagellar motor is always toward the cytosol, both during clockwise and
counterclockwise rotation, whereas in ATP synthase this flow reverses with the
direction of rotation (Movie 14.8).

Bacteria, including the strict anaerobes, maintain a proton gradient across
their plasma membrane that is harnessed to drive many other processes. It can
be used to drive a flagellar motor, for example (Figure 14-35). This gradient is
harnessed to pump Na+ out of the bacterium via a Na+-H+ antiporter that takes
the place of the Na+-K+ pump of eukaryotic cells. The gradient is also used for the
active inward transport of nutrients, such as most amino acids and many sugars:
each nutrient is dragged into the cell along with one or more protons through a
specific symporter (Figure 14-36; see also Chapter 11). In animal cells, by
contrast, most inward transport across the plasma membrane is driven by the Na+
gradient (high Na+ outside, low Na+ inside) that is established by the Na+-K+ pump
(see Figure 11-15).

Some unusual bacteria have adapted to live in a very alkaline environment
and yet must maintain their cytoplasm at a physiological pH. For these cells, any
attempt to generate an electrochemical H+ gradient would be opposed by a large
H+ concentration gradient in the wrong direction (H+ higher inside than outside).
Presumably for this reason, some of these bacteria substitute Na+ for H+ in all of
their chemiosmotic mechanisms. The respiratory chain pumps Na+ out of the
cell, the transport systems and flagellar motor are driven by an inward flux of Na+,
and a Na+-driven ATP synthase synthesizes ATP. The existence of such bacteria
demonstrates a critical point: the principle of chemiosmosis is more fundamental
than the proton-motive force on which it is normally based.
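
A rough calculation shows why an alkaline environment works against a proton-based mechanism. The sketch below uses the standard relation that the proton-motive force is the membrane potential minus about 59 mV per pH unit of difference (at 25 °C). The membrane potential and pH values are hypothetical, chosen only for illustration, and are not taken from the text. The function name `proton_motive_force_mV` and the sign convention (negative values favor inward H+ flow) are likewise assumptions of this example.

```python
import math

R = 8.314    # gas constant, J / (mol K)
F = 96485.0  # Faraday constant, C / mol


def proton_motive_force_mV(delta_psi_mV: float, ph_in: float, ph_out: float,
                           temp_K: float = 298.15) -> float:
    """Proton-motive force in mV (inside relative to outside).

    delta_psi_mV is the membrane potential (inside minus outside).
    Negative results mean that protons are driven into the cell.
    """
    mv_per_ph_unit = 1000.0 * math.log(10) * R * temp_K / F  # about 59 mV at 25 C
    return delta_psi_mV - mv_per_ph_unit * (ph_in - ph_out)


# Hypothetical values: same membrane potential, different surroundings.
neutral = proton_motive_force_mV(-150.0, ph_in=7.5, ph_out=7.0)
alkaline = proton_motive_force_mV(-150.0, ph_in=8.5, ph_out=10.5)

print(f"near-neutral surroundings: {neutral:.0f} mV")
print(f"alkaline surroundings:     {alkaline:.0f} mV")
```

Under these assumed numbers, the reversed pH gradient cancels most of the membrane potential, leaving only a small driving force for protons. An Na+ gradient is not subject to this penalty.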

As we discuss next, an ATP synthase coupled to chemiosmotic processes is
also a central feature of plants, where it plays critical roles in both mitochondria
and chloroplasts.


Figure 14-36 The importance of H+-driven transport in bacteria.
A proton-motive force generated across the plasma membrane pumps
nutrients into the cell and expels Na+. (A) In an aerobic bacterium,
a respiratory chain fed by the oxidation of substrates produces an
electrochemical proton gradient across the plasma membrane. This gradient
is then harnessed to make ATP, as well as to transport nutrients (proline,
succinate, lactose, and lysine) into the cell and to pump Na+ out of the cell.
(B) When the same bacterium grows under anaerobic conditions, it derives its
ATP from glycolysis. As indicated, the ATP synthase in the plasma membrane
then hydrolyzes some of this ATP to establish an electrochemical proton
gradient that drives the same transport processes that depend on respiratory
chain proton-pumping in (A).

Summary

The large amount of free energy released when H+ flows back into the matrix from
the cristae provides the basis for ATP production on the matrix side of
mitochondrial cristae membranes by a remarkable protein machine, the ATP synthase. The
ATP synthase functions like a miniature turbine, and it is a reversible device that can
couple proton flow to either ATP synthesis or ATP hydrolysis. The transmembrane
electrochemical gradient that drives ATP production in mitochondria also drives
the active transport of selected metabolites across the inner mitochondrial
membrane, including an efficient ADP/ATP exchange between the mitochondrion and
the cytosol that keeps the cell's ATP pool highly charged. The resulting high cellular
concentration of ATP makes the free-energy change for ATP hydrolysis extremely
favorable, allowing this hydrolysis reaction to drive a very large number of
energy-requiring processes throughout the cell. The universal presence of ATP synthase
in bacteria, mitochondria, and chloroplasts testifies to the central importance of
chemiosmotic mechanisms in cells.


CHLOROPLASTS AND PHOTOSYNTHESIS 

All animals and most microorganisms rely on the continual uptake of large
amounts of organic compounds from their environment. These compounds
provide both the carbon-rich building blocks for biosynthesis and the metabolic
energy for life. It is likely that the first organisms on the primitive Earth had access
to an abundance of organic compounds produced by geochemical processes, but
it is clear that these were used up billions of years ago. Since that time, virtually
all of the organic materials required by living cells have been produced by
photosynthetic organisms, including plants and photosynthetic bacteria. The core
machinery that drives all photosynthesis appears to have evolved more than 3
billion years ago in the ancestors of present-day bacteria; today it provides the only
major solar energy storage mechanism on Earth.

The most advanced photosynthetic bacteria are the cyanobacteria, which have
minimal nutrient requirements. They use electrons from water and the energy of
sunlight to convert atmospheric CO2 into organic compounds, a process called
carbon fixation. In the course of the overall reaction
$n\,\mathrm{H_2O} + n\,\mathrm{CO_2} \xrightarrow{\text{light}} (\mathrm{CH_2O})_n + n\,\mathrm{O_2}$,
they also liberate into the atmosphere the molecular oxygen that
then powers oxidative phosphorylation. In this way, it is thought that the
evolution of cyanobacteria from more primitive photosynthetic bacteria eventually
made possible the development of the many different aerobic life-forms that
populate the Earth today.

Chloroplasts Resemble Mitochondria But Have a Separate
Thylakoid Compartment

Plants (including algae) developed much later than cyanobacteria, and their
photosynthesis occurs in a specialized intracellular organelle, the chloroplast
(Figure 14-37). Chloroplasts use chemiosmotic mechanisms to carry out their energy
interconversions in much the same way that mitochondria do. Although much
larger than mitochondria, they are organized on the same principles. They have a
highly permeable outer membrane; a much less permeable inner membrane, in
which membrane transport proteins are embedded; and a narrow intermembrane
space in between. Together, these two membranes form the chloroplast envelope
(Figure 14-37D). The inner chloroplast membrane surrounds a large space called
the stroma, which is analogous to the mitochondrial matrix. The stroma contains
many metabolic enzymes and, as for the mitochondrial matrix, it is the place
where ATP is made by the head of an ATP synthase. Like the mitochondrion, the
chloroplast has its own genome and genetic system. The stroma therefore also
contains a special set of ribosomes, RNAs, and the chloroplast DNA.

An important difference between the organization of mitochondria and
chloroplasts is highlighted in Figure 14-38. The inner membrane of the chloroplast is
not folded into cristae and does not contain electron-transport chains. Instead,
the electron-transport chains, photosynthetic light-capturing systems, and ATP
synthase are all contained in the thylakoid membrane, a separate, distinct
membrane that forms a set of flattened, disc-like sacs, the thylakoids. The thylakoid
membrane is highly folded into numerous local stacks of flattened vesicles called
grana, interconnected by nonstacked thylakoids. The lumen of each thylakoid is
connected with the lumen of other thylakoids, thereby defining a third internal
compartment called the thylakoid space. This space represents a separate
compartment in each chloroplast that is not connected to either the intermembrane
space or the stroma.

Chloroplasts Capture Energy from Sunlight and Use It to Fix 
Carbon 

We can group the reactions that occur during photosynthesis in chloroplasts into 
two broad categories: 

1. The photosynthetic electron-transfer reactions (also called the "light
reactions") occur in two large protein complexes, called reaction centers,
embedded in the thylakoid membrane. A photon (a quantum of light)
knocks an electron out of the green pigment molecule chlorophyll in the
first reaction center, creating a positively charged chlorophyll ion. This
electron then moves along an electron-transport chain and through a
second reaction center in much the same way that an electron moves along
the respiratory chain in mitochondria. During this electron-transport
process, H+ is pumped across the thylakoid membrane, and the resulting
electrochemical proton gradient drives the synthesis of ATP in the stroma.
As the final step in this series of reactions, electrons are loaded (together
with H+) onto NADP+, converting it to the energy-rich NADPH molecule.
Because the positively charged chlorophyll in the first reaction center
quickly regains its electrons from water (H2O), O2 gas is produced as a
by-product. All of these reactions are confined to the chloroplast.

2. The carbon-fixation reactions do not require sunlight. Here the ATP and
NADPH generated by the light reactions serve as the source of energy and
reducing power, respectively, to drive the conversion of CO2 to
carbohydrate. These carbon-fixation reactions begin in the chloroplast stroma,
where they generate the three-carbon sugar glyceraldehyde 3-phosphate.
This simple sugar is exported to the cytosol, where it is used to produce
sucrose and many other organic metabolites in the leaves of the plant. The
sucrose is then exported to meet the metabolic needs of the
nonphotosynthetic plant tissues, serving as a source of both carbon skeletons and
energy for growth.

Figure 14-37 Chloroplasts in the cell. (A) Schematic cross section through the
leaf of a green plant. (B) Light microscopy of a plant leaf cell (here, a mesophyll cell
from Zinnia elegans) shows chloroplasts as bright green bodies, measuring several
micrometers across, in the transparent cell interior. (C) The electron micrograph
of a thin, stained section through a wheat leaf cell shows a thin rim of cytoplasm,
containing chloroplasts, the nucleus, and mitochondria, surrounding a large,
water-filled vacuole. (D) At higher magnification, electron microscopy reveals the
chloroplast envelope membrane and the thylakoid membrane within the chloroplast
that is highly folded into grana stacks (Movie 14.9). (B, courtesy of John Innes
Foundation; C and D, courtesy of K. Plaskitt.)

Figure 14-38 A mitochondrion and chloroplast compared. Chloroplasts
are generally larger than mitochondria. In addition to an outer and inner envelope
membrane, they contain the thylakoid membrane with its internal thylakoid space.
The chloroplast thylakoid membrane, which is the site of solar energy conversion
in plants and algae, corresponds to the mitochondrial cristae, which are the sites
of energy conversion by cellular respiration. Unlike the crista membrane, which is
continuous with the inner mitochondrial membrane at cristae junctions, the thylakoid
membrane is not connected to the inner chloroplast membrane at any point.

Thus, the formation of ATP, NADPH, and O2 (which requires light energy
directly) and the conversion of CO2 to carbohydrate (which requires light energy
only indirectly) are separate processes (Figure 14-39). However, they are linked
by elaborate feedback mechanisms that allow a plant to manufacture sugars only
when it is appropriate to do so. Several of the chloroplast enzymes required for
carbon fixation, for example, are inactive in the dark and reactivated by
light-stimulated electron-transport processes.


Carbon Fixation Uses ATP and NADPH to Convert CO2 into
Sugars

We have seen earlier in this chapter how animal cells produce ATP by using the
large amount of free energy released when carbohydrates are oxidized to CO2 and
H2O. The reverse reaction, in which plants make carbohydrate from CO2 and H2O,
takes place in the chloroplast stroma. The large amounts of ATP and NADPH
produced by the photosynthetic electron-transfer reactions are required to drive this
energetically unfavorable reaction.





Figure 14-39 A summary of the energy-converting metabolism in chloroplasts.
Chloroplasts require only water and carbon dioxide as inputs for their
light-driven photosynthesis reactions, and they produce the nutrients for most other
organisms on the planet. Each oxidation of two water molecules by a photochemical
reaction center in the thylakoid membrane produces one molecule of oxygen, which
is released into the atmosphere. At the same time, protons are concentrated
in the thylakoid space. These protons create a large electrochemical gradient
across the thylakoid membrane, which is utilized by the chloroplast ATP synthase to
produce ATP from ADP and phosphate. The electrons withdrawn from water
are transferred to a second type of photochemical reaction center to produce
NADPH from NADP+. As indicated, the NADPH and ATP are fed into the
carbon-fixation cycle to reduce carbon dioxide, thereby producing the precursors for
sugars, amino acids, and fatty acids. The CO2 that is taken up from the atmosphere
here is the source of the carbon atoms for most organic molecules on Earth.

In a plant cell, a variety of metabolites produced in the chloroplast are exported to
the cytoplasm for biosyntheses. Some of the sugar produced is stored in the form of
starch granules in the chloroplast, but the rest is transported throughout the plant as
sucrose or converted to starch in special storage tissues. These storage tissues
serve as a major food source for animals.

Figure 14-40 The initial reaction in carbon fixation. This carboxylation
reaction allows one molecule each of carbon dioxide and water to be
incorporated into organic carbon molecules. It is catalyzed in the chloroplast
stroma by the abundant enzyme ribulose bisphosphate carboxylase, or Rubisco. As
indicated, the product is two molecules of 3-phosphoglycerate.

Figure 14-40 illustrates the central reaction of carbon fixation, in which an
atom of inorganic carbon is converted to organic carbon: CO2 from the
atmosphere combines with the five-carbon compound ribulose 1,5-bisphosphate plus
water to yield two molecules of the three-carbon compound
3-phosphoglycerate. This carboxylation reaction is catalyzed in the chloroplast stroma by a large
enzyme called ribulose bisphosphate carboxylase, or Rubisco for short. Because
the reaction is so slow (each Rubisco molecule turns over only about 3 molecules
of substrate per second, compared to 1000 molecules per second for a typical
enzyme), an unusually large number of enzyme molecules are needed. Rubisco
often constitutes more than 50% of the chloroplast protein mass, and it is thought
to be the most abundant protein on Earth. In a global context, Rubisco also keeps
the amount of the greenhouse gas CO2 in the atmosphere at a low level.
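
The link between a slow turnover rate and a large enzyme requirement is simple proportionality, as the sketch below shows. The two turnover numbers come from the text. The target flux is a hypothetical figure chosen only to make the comparison concrete, and `enzymes_needed` is a name invented for this example.

```python
RUBISCO_TURNOVER = 3.0     # substrate molecules per second per enzyme (from the text)
TYPICAL_TURNOVER = 1000.0  # substrate molecules per second per enzyme (from the text)


def enzymes_needed(flux_per_s: float, turnover_per_s: float) -> float:
    """Enzyme molecules required to sustain a flux, assuming each works at full rate."""
    return flux_per_s / turnover_per_s


# Hypothetical flux, not taken from the text: one million CO2 molecules fixed per second.
target_flux = 1_000_000.0

rubisco_count = enzymes_needed(target_flux, RUBISCO_TURNOVER)
typical_count = enzymes_needed(target_flux, TYPICAL_TURNOVER)

print(f"Rubisco molecules needed:        {rubisco_count:,.0f}")
print(f"typical enzyme molecules needed: {typical_count:,.0f}")
print(f"fold excess of Rubisco:          {rubisco_count / typical_count:.0f}")
```

Whatever flux is assumed, the ratio is the same: sustaining a given rate of carbon fixation takes roughly 330 times more Rubisco molecules than it would of a typical enzyme.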

Although the production of carbohydrates from CO2 and H2O is energetically
unfavorable, the fixation of CO2 catalyzed by Rubisco is an energetically favorable
reaction. Carbon fixation is energetically favorable because a continuous supply of
the energy-rich ribulose 1,5-bisphosphate is fed into the process. This compound
is consumed by the addition of CO2, and it must be replenished. The energy and
reducing power needed to regenerate ribulose 1,5-bisphosphate come from the
ATP and NADPH produced by the photosynthetic light reactions.

The elaborate series of reactions in which CO2 combines with ribulose
1,5-bisphosphate to produce a simple sugar, a portion of which is used to regenerate
ribulose 1,5-bisphosphate, forms a cycle, called the carbon-fixation cycle, or the
Calvin cycle (Figure 14-41). This cycle was one of the first metabolic pathways to
be worked out by applying radioisotopes as tracers in biochemistry. As indicated,
each turn of the cycle converts six molecules of 3-phosphoglycerate to three
molecules of ribulose 1,5-bisphosphate plus one molecule of glyceraldehyde
3-phosphate. Glyceraldehyde 3-phosphate, the three-carbon sugar produced by the cycle,
then provides the starting material for the synthesis of many other sugars and all
of the other organic molecules that form the plant.
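
The carbon bookkeeping of one turn of the cycle can be verified directly. The sketch below counts carbon atoms only, ignoring phosphate, water, ATP, and NADPH. It takes the stoichiometry stated in the text (six 3-phosphoglycerate give three ribulose 1,5-bisphosphate plus one glyceraldehyde 3-phosphate) and infers, from the one-CO2-per-ribulose carboxylation reaction, that three CO2 enter per turn. The dictionary and function names are invented for this illustration.

```python
# Carbon atoms per molecule
CARBONS = {
    "CO2": 1,
    "ribulose 1,5-bisphosphate": 5,
    "3-phosphoglycerate": 3,
    "glyceraldehyde 3-phosphate": 3,
}


def total_carbons(molecules: dict) -> int:
    """Sum the carbon atoms in a {molecule name: count} mapping."""
    return sum(CARBONS[name] * count for name, count in molecules.items())


# Carboxylation: each CO2 plus one ribulose 1,5-bisphosphate yields two 3-phosphoglycerate.
carboxylation_in = {"CO2": 3, "ribulose 1,5-bisphosphate": 3}
carboxylation_out = {"3-phosphoglycerate": 6}

# Reduction and regeneration, as stated in the text.
cycle_out = {"ribulose 1,5-bisphosphate": 3, "glyceraldehyde 3-phosphate": 1}

print("carbons in:             ", total_carbons(carboxylation_in))
print("carbons as 3-PG:        ", total_carbons(carboxylation_out))
print("carbons after the cycle:", total_carbons(cycle_out))
```

All three totals are 18, so the net gain per turn is the three carbons of one glyceraldehyde 3-phosphate, exactly the three carbons supplied as CO2.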

Sugars Generated by Carbon Fixation Can Be Stored as Starch or 
Consumed to Produce ATP 

The glyceraldehyde 3-phosphate generated by carbon fixation in the chloroplast
stroma can be used in a number of ways, depending on the needs of the plant.
During periods of excess photosynthetic activity, much of it is retained in the
chloroplast stroma and converted to starch. Like glycogen in animal cells, starch
is a large polymer of glucose that serves as a carbohydrate reserve, and it is stored
as large granules in the chloroplast stroma. Starch forms an important part of the
diet of all animals that eat plants. Other glyceraldehyde 3-phosphate molecules
are converted to fat in the stroma. This material, which accumulates as fat
droplets, likewise serves as an energy reserve. At night, this stored starch and fat can be
broken down to sugars and fatty acids, which are exported to the cytosol to help
support the metabolic needs of the plant. Some of the exported sugar enters the
glycolytic pathway (see Figure 2-46), where it is converted to pyruvate. Both that
pyruvate and the fatty acids can enter the plant cell mitochondria and be fed into
the citric acid cycle, ultimately leading to the production of large amounts of ATP
by oxidative phosphorylation (Figure 14-42). Plants use this ATP in the same way
that animal cells and other nonphotosynthetic organisms do to power a variety of
metabolic reactions.

Figure 14-41 The carbon-fixation cycle. This central metabolic pathway
allows organic molecules to be produced from CO2 and H2O. In the first stage
of the cycle (carboxylation), CO2 is added to ribulose 1,5-bisphosphate, as
shown in Figure 14-40. In the second stage (reduction), ATP and NADPH are
consumed to produce glyceraldehyde 3-phosphate molecules. In the final stage
(regeneration), some of the glyceraldehyde 3-phosphate produced is used to
regenerate ribulose 1,5-bisphosphate. Other glyceraldehyde 3-phosphate
molecules are either converted to starch and fat in the chloroplast stroma, or
transported out of the chloroplast into the cytosol. The number of carbon atoms in
each type of molecule is indicated in yellow. There are many intermediates between
glyceraldehyde 3-phosphate and ribulose 5-phosphate, but they have been omitted
here for clarity. The entry of water into the cycle is also not shown (but see Figure
14-40).

The glyceraldehyde 3-phosphate exported from chloroplasts into the cytosol
can also be converted into many other metabolites, including the disaccharide
sucrose. Sucrose is the major form in which sugar is transported between the cells
of a plant: just as glucose is transported in the blood of animals, so sucrose is
exported from the leaves to provide carbohydrate to the rest of the plant.

The Thylakoid Membranes of Chloroplasts Contain the Protein 
Complexes Required for Photosynthesis and ATP Generation

We next need to explain how the large amounts of ATP and NADPH required for
carbon fixation are generated in the chloroplast. Chloroplasts are much larger and
less dynamic than mitochondria, but they make use of chemiosmotic energy
conversion in much the same way. As we saw in Figure 14-38, chloroplasts and
mitochondria are organized on the same principles, although the chloroplast contains
a separate thylakoid membrane system in which its chemiosmotic mechanisms
occur. The thylakoid membranes contain two large membrane protein
complexes, called photosystems, which endow plants and other photosynthetic
organisms with the ability to capture and convert solar energy for their own use. Two
other protein complexes in the thylakoid membrane that work together with the
photosystems in photophosphorylation (the generation of ATP with sunlight)
have mitochondrial equivalents. These are the heme-containing cytochrome

Figure 14-42 How chloroplasts and mitochondria collaborate to supply cells with both metabolites and ATP. (A) The
inner chloroplast membrane is impermeable to the ATP and NADPH that are produced in the stroma during the light reactions
of photosynthesis. These molecules are therefore funneled into the carbon-fixation cycle, where they are used to make sugars.
The resulting sugars and their metabolites are either stored within the chloroplast (in the form of starch or fat) or exported to
the rest of the plant cell. There, they can enter the energy-generating pathway that ends in ATP synthesis linked to oxidative
phosphorylation inside the mitochondrion. Unlike the chloroplast, mitochondrial membranes contain a specific transporter
that makes them permeable to ATP (see Figure 14-34). Note that the O2 released to the atmosphere by photosynthesis
in chloroplasts is used for oxidative phosphorylation in mitochondria; similarly, the CO2 released by the citric acid cycle in
mitochondria is used for carbon fixation in chloroplasts. (B) In a leaf, mitochondria (red) tend to cluster close to the chloroplasts
(green), as seen in this light micrograph. (B, courtesy of Olivier Grandjean.)


b6-f complex, which both functionally and structurally resembles cytochrome
c reductase in the respiratory chain; and the chloroplast ATP synthase, which
closely resembles the mitochondrial ATP synthase and works in the same way.

Chlorophyll-Protein Complexes Can Transfer Either Excitation 
Energy or Electrons 

The photosystems in the thylakoid membrane are multiprotein assemblies of a
complexity comparable to that of the protein complexes in the mitochondrial
electron-transport chain. They contain large numbers of specifically bound
chlorophyll molecules, in addition to cofactors that will be familiar from our
discussion of mitochondria (heme, iron-sulfur clusters, and quinones). Chlorophyll,
the green pigment of photosynthetic organisms, has a long hydrophobic tail that
makes it behave like a lipid, plus a porphyrin ring that has a central Mg atom and
an extensive system of delocalized electrons in conjugated double bonds (Figure
14-43). When a chlorophyll molecule absorbs a quantum of sunlight (a photon),
the energy of the photon causes one of these electrons to move from a low-energy
molecular orbital to another orbital of higher energy.

The excited electron in a chlorophyll molecule tends to return quickly to its
ground state, which can occur in one of three ways:

1. By converting the extra energy into heat (molecular motion) or to some
combination of heat and light of a longer wavelength (fluorescence); this
is what usually happens when light is absorbed by an isolated chlorophyll
molecule in solution.

2. By transferring the energy, but not the electron, directly to a neighboring
chlorophyll molecule by a process called resonance energy transfer.

3. By transferring the excited electron with its negative charge to another
nearby molecule, an electron acceptor, after which the positively charged
chlorophyll returns to its original state by taking up an electron from some
other molecule, an electron donor.

Figure 14-43 The structure of chlorophyll. A magnesium atom is held in a
porphyrin ring, which is related to the porphyrin ring that binds iron in heme
(see Figure 14-15). Electrons are delocalized over the bonds shaded in blue.

The latter two mechanisms occur when chlorophylls are attached to proteins
in a chlorophyll-protein complex. The protein coordinates the central Mg atom in
the chlorophyll porphyrin, most often through a histidine side chain located in the
hydrophobic interior of a membrane, causing each of the chlorophylls in a protein
complex to be held at exactly defined distances and orientations. The flow of
excitation energy or electrons then depends on both the precise spatial arrangement
and the local protein environment of the protein-bound chlorophylls.

When excited by a photon, most protein-bound chlorophylls simply transmit
the absorbed energy to another nearby chlorophyll by the process of resonance
energy transfer. However, in a few specially positioned chlorophylls, the energy
difference between the ground state and the excited state is just right for the
photon to trigger a light-induced chemical reaction. The special state of such
chlorophyll molecules derives from their close interaction with a second chlorophyll
molecule in the same chlorophyll-protein complex. Together, these two
chlorophylls form a special pair.

The photosynthetic electron transfer process starts when a photon of suitable
energy ionizes a chlorophyll molecule in such a special pair, dissociating it into an
electron and a positively charged chlorophyll ion. The energized electron is passed
rapidly to a quinone in the same protein complex, preventing its unproductive
reassociation with the chlorophyll ion. This light-induced transfer of an electron
from a chlorophyll to a mobile electron carrier is the central charge-separation
step in photosynthesis, in which a chlorophyll becomes positively charged and
an electron carrier becomes negatively charged (Figure 14-44). The chlorophyll
ion is a very strong oxidant that is able to withdraw an electron from a low-energy
substrate; in the first step of oxygenic photosynthesis, this low-energy substrate is
water.

Upon transfer to a mobile carrier in the electron-transport chain, the electron
is stabilized as part of a strong electron donor and made available for subsequent
reactions. These subsequent reactions require more time to complete, and they
result in light-generated energy-rich compounds.

A Photosystem Consists of an Antenna Complex and a Reaction 
Center 

There are two distinct types of chlorophyll-protein complexes in the
photosynthetic membrane. One type, called a photochemical reaction center, contains the
special pair of chlorophylls just described. The other type engages exclusively in
light absorption and resonance energy transfer and is called an antenna complex.
Together, the two types of complex make up a photosystem (Figure 14-45).

The role of the antenna complex in the photosystem is to collect the energy of
a sufficient number of photons for photosynthesis. Without it, the process would
be slow and inefficient, as each reaction-center chlorophyll would absorb only
about one light quantum per second, even in broad daylight, whereas hundreds
per second are needed for effective photosynthesis. When light excites a
chlorophyll molecule in the antenna complex, the energy passes rapidly from one
protein-bound chlorophyll to another by resonance energy transfer until it reaches
the special pair in the reaction center. The antenna complex is also known as a


Figure 14-44 A general scheme for the charge-separation step in a
photosynthetic reaction center. In a reaction center, light energy is harnessed
to generate electrons that are held at a high energy level by mobile electron
carriers in a membrane. Light energy is thereby converted to chemical energy. The
process starts when a photon absorbed by the special pair of chlorophylls in the
reaction center knocks an electron out of one of the chlorophylls. The electron
is taken up by a mobile electron carrier (orange) bound at the opposite membrane
surface. A set of intermediary carriers embedded in the reaction center provide
the path from the special pair to this carrier (not shown). The physical distance
between the positively charged chlorophyll ion and the negatively charged electron
carrier stabilizes the charge-separated state for a short time, during which the
chlorophyll ion, a strong oxidant, withdraws an electron from a suitable compound
(for example, from water, an event we will discuss in detail shortly). The electron
carrier then diffuses away from the reaction center as a strong electron donor that
will transfer its electron to an electron-transport chain.

light-harvesting complex, or LHC. In addition to many chlorophyll molecules,
an LHC contains orange carotenoid pigments. The carotenoids collect light of
a different wavelength from that absorbed by chlorophylls, helping to make the
antenna complex more efficient. They also have an important protective role in
preventing the formation of harmful oxygen radicals in the photosynthetic
membrane.
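
The rate argument for the antenna complex can be made concrete with a little arithmetic. The sketch below is only an illustration: it assumes that every chlorophyll absorbs about one photon per second (the figure quoted above for a reaction-center chlorophyll), that the reaction center needs some target number of excitations per second, and that a fixed fraction of absorbed excitations reaches the special pair. The function name and the transfer_efficiency parameter are hypothetical, introduced here for illustration.

```python
import math


def antenna_size_needed(target_rate_per_s,
                        rate_per_chlorophyll_per_s=1.0,
                        transfer_efficiency=1.0):
    """Estimate how many antenna chlorophylls must feed one reaction center.

    Assumption (not from the text): each chlorophyll absorbs photons
    independently at the same rate, and a fixed fraction of the absorbed
    excitations is delivered to the special pair.
    """
    if not 0 < transfer_efficiency <= 1:
        raise ValueError("transfer_efficiency must be in (0, 1]")
    delivered_per_chlorophyll = rate_per_chlorophyll_per_s * transfer_efficiency
    return math.ceil(target_rate_per_s / delivered_per_chlorophyll)


# With loss-free transfer, 200 excitations per second require
# about 200 chlorophylls absorbing one photon per second each.
print(antenna_size_needed(200))
# If only half of the excitations arrived, twice as many would be needed.
print(antenna_size_needed(300, transfer_efficiency=0.5))
```

Under these assumptions, an antenna of a few hundred chlorophylls is enough to raise the excitation rate at the special pair from about one per second to the hundreds per second mentioned above.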


Figure 14-45 A photosystem. Each photosystem consists of a reaction
center plus a number of light-harvesting antenna complexes. The solar energy for
photosynthesis is collected by the antenna complexes, which account for most of the
chlorophyll in a plant cell. The energy hops randomly by resonance energy transfer (red
arrows) from one chlorophyll molecule to another, until it reaches the reaction center
complex, where it ionizes a chlorophyll in the special pair. The chlorophyll special pair
holds its electrons at a lower energy than the chlorophyll in the antenna complexes,
causing the energy transferred to it from the antenna complex to become trapped
there. Note that it is only energy that moves from one chlorophyll molecule
to another in the antenna complex, not electrons (Movie 14.10).


The Thylakoid Membrane Contains Two Different Photosystems 
Working in Series

The excitation energy collected by the antenna complex is delivered to the special
pair in the photochemical reaction center. The reaction center is a
transmembrane chlorophyll-protein complex that lies at the heart of photosynthesis. It
harbors the special pair of chlorophyll molecules, which acts as an irreversible trap
for excitation energy (see Figure 14-45).

Chloroplasts contain two functionally different although structurally related
photosystems, each of which feeds electrons generated by the action of sunlight
into an electron-transfer chain. In the chloroplast thylakoid membrane,
photosystem I is confined to the unstacked stroma thylakoids, while the stacked grana
thylakoids contain photosystem II. The two photosystems were named in order
of their discovery, not of their actions in the photosynthetic pathway, and
electrons are first activated in photosystem II before being transferred to photosystem
I (Figure 14-46). The path of the electron through the two photosystems can be
described as a Z-like trajectory and is known as the Z scheme. In the Z scheme,
the reaction center of photosystem II first withdraws an electron from water. The
electron passes via an electron-transport chain (composed of the electron carrier
plastoquinone, the cytochrome b6-f complex, and the protein plastocyanin) to
photosystem I, which propels the electron across the membrane in a second
light-driven charge-separation reaction that leads to NADPH production.



Figure 14-46 The Z scheme for photosynthesis. The thylakoids of plants
and cyanobacteria contain two different photosystems, known as photosystem I
and photosystem II, which work in series. Each of the photosystem I and II reaction
centers receives excitation energy from its own set of tightly associated antenna
complexes, known as LHC-I and LHC-II, by resonance energy transfer. Note that, for
historical reasons, the two photosystems were named opposite to the order in which
they act, with photosystem II passing its electrons to photosystem I.

Figure 14-47 Changes in redox potential during photosynthesis. The redox
potential for each molecule is indicated by its position along the vertical axis.
Photosystem II passes electrons derived from water to photosystem I, which in turn
passes them to NADP+ through ferredoxin-NADP+ reductase. The net electron flow
through the two photosystems is from water to NADP+, and it produces NADPH
as well as an electrochemical proton gradient. This proton gradient is used
by the ATP synthase to produce ATP. Details in this figure will be explained in the
subsequent text.




The Z scheme is necessary to bridge the very large energy gap between water
and NADPH (Figure 14-47). A single quantum of visible light does not contain
enough energy both to withdraw electrons from water, which holds on to its
electrons very tightly (redox potential +820 mV) and therefore is a very poor electron
donor, and to force them on to NADP+, which is a very poor electron acceptor
(redox potential -320 mV). The Z scheme first evolved in cyanobacteria to enable
them to use water as a universally available electron source. Other, simpler
photosynthetic bacteria have only one photosystem. As we shall see, they cannot use
water as an electron source and must rely on other, more energy-rich substrates
instead, from which electrons are more readily withdrawn. The ability to extract
electrons from water (and thereby to produce molecular oxygen) was acquired by
plants when their ancestors took up the endosymbiotic cyanobacteria that later
evolved into chloroplasts (see Figure 1-31).
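
The size of the energy gap between water and NADPH can be estimated from the two redox potentials quoted above, using the standard relation ΔG = -nFΔE. The sketch below is a rough illustration rather than a full thermodynamic treatment: it uses the +820 mV and -320 mV values from the text, assumes standard conditions, and takes two electrons per NADPH formed. The function name is hypothetical.

```python
FARADAY = 96485.0  # Faraday constant, coulombs per mole of electrons


def delta_g_kj_per_mol(e_donor_mv, e_acceptor_mv, n_electrons=1):
    """Free-energy change for moving electrons from donor to acceptor.

    Uses delta_G = -n * F * delta_E, with delta_E = E(acceptor) - E(donor).
    A positive result means the transfer is energetically uphill.
    """
    delta_e_volts = (e_acceptor_mv - e_donor_mv) / 1000.0
    return -n_electrons * FARADAY * delta_e_volts / 1000.0


# Water (+820 mV) as donor, NADP+ (-320 mV) as acceptor, two electrons.
uphill = delta_g_kj_per_mol(820, -320, n_electrons=2)
print(f"water -> NADP+: {uphill:+.0f} kJ/mol")
```

The result is roughly +220 kJ per mole of NADPH, a strongly unfavorable transfer, which is why the flow of electrons from water to NADP+ has to be driven by light.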

Photosystem II Uses a Manganese Cluster to Withdraw Electrons
From Water 

In biology, only photosystem II is able to withdraw electrons from water and to
generate molecular oxygen as a waste product. This remarkable specialization of
photosystem II is conferred by the unique properties of one of the two chlorophyll
molecules of its special pair and by a manganese cluster linked to the protein.
These chlorophyll molecules and the manganese cluster form the catalytic core of
the photosystem II reaction center, whose mechanism is outlined in Figure 14-48.

Water is an inexhaustible source of electrons, but it is also extremely stable;
therefore a large amount of energy is required to make it part with its electrons.
The only compound in living organisms that is able to achieve this feat, after its
ionization by light, is the chlorophyll special pair called P680 (P680/P680+ redox
potential = +1270 mV). The reaction 2H2O + 4 photons → 4H+ + 4e- + O2 is catalyzed by
its adjacent manganese cluster. The intermediates remain firmly attached to the
manganese cluster until two water molecules have been fully oxidized to O2, thus
ensuring that no dangerous oxygen radicals are released as the reaction proceeds.
The protons released by the two water molecules are discharged to the thylakoid
space, contributing to the proton gradient across the thylakoid membrane (pH
lower in the thylakoid space than in the stroma). The unique protein environment
that endows life with this all-important ability to oxidize water has remained
essentially unchanged throughout billions of years of evolution (Figure 14-49).

All of the oxygen in the Earth's atmosphere has been generated in this way.
Although the exact details of the water-oxidation reaction in photosystem II are
still not fully understood, scientists are trying to construct an artificial system that
mimics the process. If successful, this might provide a virtually endless supply of
clean energy, helping to solve the world's energy crisis.

The Cytochrome b6-f Complex Connects Photosystem II
to Photosystem I 

Following the path shown previously in Figure 14-48, the electrons extracted from
water by photosystem II are transferred to plastoquinol, a strong electron donor
similar to ubiquinol in mitochondria. This quinol, which can diffuse rapidly in the
lipid bilayer of the thylakoid membrane, transfers its electrons to the cytochrome
b6-f complex, whose structure is homologous to the cytochrome c reductase in
mitochondria. The cytochrome b6-f complex pumps H+ into the thylakoid space
using the same Q cycle that is utilized in mitochondria (see Figure 14-21), thereby
adding to the proton gradient across the thylakoid membrane.

The cytochrome b6-f complex forms the connecting link between photosystems
II and I in the chloroplast electron-transport chain. It passes its electrons



Figure 14-48 The conversion of light energy to chemical energy in the
photosystem II complex. (A) Schematic diagram of the photosystem II reaction
center, whose special pair of chlorophyll molecules is designated as P680 based
on the wavelength of its absorbance maximum (680 nm). (B) Cofactors and
pigments at the core of the reaction center. Shown are the manganese (Mn)
cluster, the tyrosine side chain that links it to the P680 special pair, four chlorophylls
(green), two pheophytins (light blue), two plastoquinones (pink), and an iron atom
(red). The path of electrons is shown by blue arrows. In the manganese cluster, four
manganese atoms (light blue), one calcium atom (purple), and five oxygen atoms (red)
work together to catalyze the oxidation of water. The water-splitting reaction occurs
in four successive steps, each requiring the energy of one photon. Each photon turns
a P680 reaction-center chlorophyll into a positively charged chlorophyll ion. Through
an ionized tyrosine side chain (yellow), this chlorophyll ion pulls an electron away from
a water molecule bound at the manganese cluster. In this way, a total of four electrons
are withdrawn from two water molecules to generate molecular oxygen, which is
released into the atmosphere.

Each electron that is energized by light passes from the special pair along an
electron-transfer chain inside the complex, along the indicated path to the permanently
bound plastoquinone QA and then to plastoquinone QB as electron acceptors.
Once QB has picked up two electrons (plus two protons; see Figure 14-17),
it dissociates from its binding site in the complex and enters the lipid bilayer as a
mobile electron carrier, being immediately replaced by a new, nonreduced molecule
of plastoquinone. Note that the chlorophylls and pheophytins form two symmetrical
branches of a potential electron-transport chain. Only one branch is active, thus
ensuring that the plastoquinones become fully reduced in minimum time.


Figure 14-49 The structure of the complete photosystem II complex. This
photosystem contains at least 16 protein subunits, along with 36 chlorophylls, two
pheophytins, two hemes, and a number of protective carotenoids (colored). Most of
these pigments and cofactors are deeply buried, tightly complexed to protein (gray).
The path of electrons is indicated by the blue arrows, and is explained in Figure
14-48B. The photosystem II complex presented here is the cyanobacterial
complex, which is simpler and more stable than the plant complex, which works in the
same way. (PDB code: 3ARC.)

one at a time to the mobile electron carrier plastocyanin (a small copper-containing
protein that takes the place of the cytochrome c in mitochondria), which will
transfer them to photosystem I (Figure 14-50). As we discuss next, photosystem
I then harnesses a second photon of light to further energize the electrons that it
receives.


Figure 14-50 Electron flow through the cytochrome b6-f complex to NADPH.
The cytochrome b6-f complex is the functional equivalent of cytochrome c
reductase (the cytochrome b-c1 complex) in mitochondria (see Figure 14-22). Like its
mitochondrial homolog, the b6-f complex receives its electrons from a quinone and
engages in a complicated Q cycle that pumps two protons across the membrane
(details not shown). It hands its electrons, one at a time, to plastocyanin (pC).
Plastocyanin diffuses along the membrane surface to photosystem I and transfers
the electrons via ferredoxin (Fd) to the ferredoxin-NADP+ reductase (FNR), where
they are utilized to produce NADPH. P700 is a special pair of chlorophylls that absorbs
light of wavelength 700 nm.


Photosystem I Carries Out the Second Charge-Separation Step 
in the Z Scheme 

Photosystem I receives electrons from plastocyanin in the thylakoid space and
transfers them, via a second charge-separation reaction, to the small protein
ferredoxin on the opposite membrane surface (Figure 14-51). Then, in a final
step, ferredoxin feeds its electrons to a membrane-associated enzyme complex,
the ferredoxin-NADP+ reductase, which uses the electrons to produce NADPH
from NADP+ (see Figure 14-50).

The redox potential of the NADP+/NADPH pair (-320 mV) is already very low,
and reduction of NADP+ therefore requires a compound with an even lower redox
potential. This turns out to be a chlorophyll molecule near the stromal membrane
surface of photosystem I that has a redox potential of -1000 mV (chlorophyll A0),
making it the strongest known electron donor in biology. The reduced NADPH is
released into the chloroplast stroma, where it is used for biosynthesis of
glyceraldehyde 3-phosphate, amino acid precursors, and fatty acids, much of it to be
exported to the cytoplasm.
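
The need for two light-driven steps can be checked with a rough energy comparison. The sketch below is a simplified illustration: it compares the energy of a single red photon (680 nm or 700 nm, the absorption maxima quoted for P680 and P700) with the span between the strongest oxidant and the strongest reductant mentioned in the text (+1270 mV for P680/P680+ and -1000 mV for chlorophyll A0). It treats 1 mV of redox span as 1 meV per electron and ignores all energy losses; the function names are hypothetical.

```python
PLANCK = 6.62607015e-34             # Planck constant, J s
LIGHT_SPEED = 2.99792458e8          # speed of light, m/s
ELEMENTARY_CHARGE = 1.602176634e-19  # charge of one electron, C (also J per eV)


def photon_energy_ev(wavelength_nm):
    """Energy of one photon in electron volts, from E = h * c / wavelength."""
    wavelength_m = wavelength_nm * 1e-9
    return PLANCK * LIGHT_SPEED / wavelength_m / ELEMENTARY_CHARGE


def redox_span_ev(e_high_mv, e_low_mv):
    """Energy per electron (eV) needed to span two redox potentials."""
    return (e_high_mv - e_low_mv) / 1000.0


span = redox_span_ev(1270, -1000)   # P680/P680+ down to chlorophyll A0
one_photon = photon_energy_ev(680)
two_photons = photon_energy_ev(680) + photon_energy_ev(700)

print(f"redox span:       {span:.2f} eV per electron")
print(f"one 680 nm photon: {one_photon:.2f} eV")
print(f"680 + 700 nm:      {two_photons:.2f} eV")
```

On these numbers a single red photon (about 1.8 eV) falls short of the 2.27 eV span, whereas two photons acting in series, one in each photosystem, provide enough energy with some to spare.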







Figure 14-51 Structure and function of photosystem I. At the heart of the
photosystem I complex assembly is the electron-transfer chain shown. At one
end is a special pair of chlorophylls called P700 (because it absorbs light of 700 nm
wavelength), receiving electrons from plastocyanin (pC). At the other end are the
A0 chlorophylls, which hand the electrons on to ferredoxin via two plastoquinones
(PQ; purple) and three iron-sulfur clusters. Even though the roles of photosystems I
and II in photosynthesis are very different, their central electron-transfer chains are
structurally similar, indicating a common evolutionary origin (see Figure 14-53).
Note that in photosystem I both branches of the electron-transfer chain are active,
unlike in photosystem II (see Figure 14-48). (PDB code: 3LW5.)

The Chloroplast ATP Synthase Uses the Proton Gradient
Generated by the Photosynthetic Light Reactions to Produce ATP

The sequence of events that results in light-driven production of ATP and NADPH
in chloroplasts and cyanobacteria is summarized in Figure 14-52. Starting with
the withdrawal of electrons from water, the light-driven charge-separation steps
in photosystems II and I enable the energetically unfavorable (uphill) flow of
electrons from water to NADPH (see Figure 14-47). Three small mobile electron
carriers (plastoquinone, plastocyanin, and ferredoxin) participate in this process.
Together with the electron-driven proton pump of the cytochrome b6-f complex,
the photosystems generate a large proton gradient across the thylakoid
membrane. The ATP synthase molecules embedded in the thylakoid membranes then
harness this proton gradient to produce large amounts of ATP in the chloroplast
stroma, mimicking the synthesis of ATP in the mitochondrial matrix.
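
The bookkeeping of the linear pathway can be tallied per molecule of O2 released. The sketch below is an illustration built only from the stoichiometry given in the text (2H2O + 4 photons → 4H+ + 4e- + O2 at photosystem II, and one further photon per electron at photosystem I), plus the standard fact that two electrons reduce one NADP+ to NADPH. It deliberately leaves out the protons pumped by the cytochrome b6-f complex and the ATP yield, for which the text gives no per-electron numbers. The function name and dictionary keys are hypothetical.

```python
def linear_flow_per_o2(n_o2=1):
    """Tally reactants and products of linear electron flow per O2 released."""
    electrons = 4 * n_o2  # four electrons are withdrawn per O2 formed
    return {
        "water_oxidized": 2 * n_o2,
        "electrons": electrons,
        "photons_photosystem_II": electrons,  # one photon per electron
        "photons_photosystem_I": electrons,   # second photon per electron
        "photons_total": 2 * electrons,
        "nadph_formed": electrons // 2,       # two electrons per NADPH
        "protons_from_water_into_thylakoid_space": 4 * n_o2,
    }


for name, count in linear_flow_per_o2().items():
    print(f"{name}: {count}")
```

For one O2, this gives eight photons in total and two NADPH, which makes explicit that every electron is energized twice on its way from water to NADP+.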

The linear Z scheme for photosynthesis thus far discussed can switch to a
circular mode of electron flow through photosystem I and the b6-f complex. Here,
the reduced ferredoxin diffuses back to the b6-f complex to reduce plastoquinone,
instead of passing its electrons to the ferredoxin-NADP+ reductase enzyme
complex. This, in effect, turns photosystem I into a light-driven proton pump, thereby
increasing the proton gradient and thus the amount of ATP made by the ATP
synthase. An elaborate set of regulatory mechanisms controls this switch, which
enables the chloroplast to generate either more NADPH (linear mode) or more
ATP (circular mode), depending on the metabolic needs of the cell.

All Photosynthetic Reaction Centers Have Evolved From a
Common Ancestor 

Evidence for the prokaryotic origins of mitochondria and chloroplasts abounds
in their genetic systems, as we will see in the next section. But strong and direct
evidence for the evolutionary origins of chloroplasts can also be found in the
molecular structures of photosynthetic reaction centers revealed in recent years
by crystallography. The positions of the chlorophylls in the special pair and the
two branches of the electron-transfer chain are basically the same in photosystem
I, photosystem II, and the photochemical reaction centers of photosynthetic
bacteria (Movie 14.11). As a result, one can conclude that they all have evolved from
a common ancestor. Evidently, the molecular architecture of the photosynthetic
reaction center originated only once and has remained essentially unchanged
during evolution. By contrast, the less critical antenna systems have evolved in
several different ways and are correspondingly diverse in present-day
photosynthetic organisms (Figure 14-53).


Figure 14-52 Summary of electron and proton movements during
photosynthesis in the thylakoid membrane. Electrons are withdrawn,
through the action of light energy, from a water molecule that is held by the
manganese cluster in photosystem II. The electrons pass on to plastoquinone,
which delivers them to the cytochrome b6-f complex that resembles the cytochrome
c reductase of mitochondria and the b-c complex of bacteria. They are then
carried to photosystem I by the soluble electron carrier plastocyanin, the functional
equivalent of cytochrome c in mitochondria. From photosystem I they are transferred
to ferredoxin-NADP+ reductase (FNR) by the soluble carrier ferredoxin (Fd; a small
protein containing an iron-sulfur center). Protons are pumped into the thylakoid
space by the cytochrome b6-f complex, in the same way that protons are pumped
into mitochondrial cristae by cytochrome c reductase (see Figure 14-21). In addition,
the H+ released into the thylakoid space by water oxidation, and the H+ consumed
during NADPH formation in the stroma, contribute to the generation of the
electrochemical H+ gradient across the thylakoid membrane. As illustrated, this
gradient drives ATP synthesis by an ATP synthase that sits in the same membrane
(see Figure 14-47).

Figure 14-53 Evolution of photosynthetic reaction centers.
Pigments involved in light-harvesting are colored green; those involved in the central
photochemical events are colored red. (A) The primitive photochemical reaction
center of purple bacteria contains two related protein subunits, L and M, that
bind the pigments involved in the central process of photosynthesis,
including a special pair of chlorophyll molecules. Electrons are fed into the
excited chlorophylls by a cytochrome. LH1 is a bacterial antenna complex.
(B) Photosystem II contains the D1 and D2 proteins, which are homologous to
the L and M subunits in (A). The excited P680 chlorophyll in the special pair
withdraws electrons from water held by the manganese cluster. LHC-II is the
light-harvesting complex that feeds energy into the core antenna proteins.
(C) Photosystem I contains the PsaA and PsaB proteins,
each of which is equivalent to a fusion of the D1 or D2 protein to a core antenna
protein of photosystem II. The loosely bound plastocyanin (pC) feeds electrons
into the excited chlorophyll pair. As indicated, in photosystem I, electrons are
passed from a bound quinone (Q) through a series of three iron-sulfur centers (red
circles). (Modified from K. Rhee, E. Morris, J. Barber and W. Kühlbrandt, Nature
396:283-286, 1998; and W. Kühlbrandt, Nature 411:896-899, 2001. With
permission from Macmillan Publishers Ltd.)


The Proton-Motive Force for ATP Production in Mitochondria and 
Chloroplasts Is Essentially the Same 

The proton gradient across the thylakoid membrane depends both on the
proton-pumping activity of the cytochrome b6-f complex and on the photosynthetic
activity of the two photosystems, which in turn depends on light intensity. In
chloroplasts exposed to light, H+ is pumped out of the stroma (pH around 8, similar to
the mitochondrial matrix) into the thylakoid space (pH 5-6), creating a gradient of
2-3 pH units across the thylakoid membrane, representing a proton-motive force
of about 180 mV. This is very similar to the proton-motive force in respiring
mitochondria. However, a membrane potential across the inner mitochondrial
membrane makes the largest contribution to the proton-motive force that drives the
mitochondrial ATP synthase to make ATP, whereas a H+ gradient predominates
for chloroplasts.
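
The link between a pH difference and a proton-motive force in millivolts follows from the standard relation pmf = Δψ + (2.303 RT/F) × ΔpH, which gives roughly 59 mV per pH unit at 25°C. The sketch below applies it to the pH values quoted above. It is an illustration only: the temperature of 298.15 K is an assumption, the sign convention is simplified to magnitudes, and the function name is hypothetical.

```python
import math

GAS_CONSTANT = 8.314462618  # J per mole per kelvin
FARADAY = 96485.33212       # coulombs per mole of charge


def proton_motive_force_mv(delta_ph, membrane_potential_mv=0.0,
                           temperature_k=298.15):
    """Magnitude of the proton-motive force in millivolts.

    pmf = membrane potential + (ln(10) * R * T / F) * delta_pH
    """
    mv_per_ph_unit = math.log(10) * GAS_CONSTANT * temperature_k / FARADAY * 1000.0
    return membrane_potential_mv + mv_per_ph_unit * delta_ph


# Stroma pH 8 and thylakoid space pH 5 to 6 give a gradient of 2 to 3 pH units.
for delta_ph in (2.0, 2.5, 3.0):
    pmf = proton_motive_force_mv(delta_ph)
    print(f"delta pH {delta_ph:.1f}: {pmf:.0f} mV")
```

A gradient of 3 pH units corresponds to just under 180 mV without any membrane potential, consistent with the value given above for illuminated chloroplasts, whereas a smaller pH gradient would need a membrane potential to reach the same total.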

In contrast to mitochondrial ATP synthase, which forms long rows of dimers
along the cristae ridges, the chloroplast ATP synthase is monomeric and located
in flat membrane regions (Figure 14-54). Evidently, the H+ gradient across the
thylakoid membrane is high enough for ATP synthesis without the need for the
elaborate arrangement of ATP synthase seen in mitochondria.

Chemiosmotic Mechanisms Evolved in Stages 

The first living cells on Earth may have consumed geochemically produced
organic molecules and generated their ATP by fermentation. Because oxygen was
not yet present in the atmosphere, such anaerobic fermentation reactions would
have dumped organic acids (such as lactic or formic acids, for example) into the
environment (see Figure 2-47). Perhaps such acids lowered the pH of the
environment, favoring the survival of cells that evolved transmembrane proteins that
could pump H+ out of the cytosol, thereby preventing the cell from becoming too
acidic (stage 1 in Figure 14-55). One of these pumps may have used the energy
available from ATP hydrolysis to eject H+ from the cell; such a proton pump could
have been the ancestor of present-day ATP synthases.

As the Earth's supply of geochemically produced nutrients began to dwindle,
organisms that could find a way to pump H+ without consuming ATP would have
been at an advantage: they could save the small amounts of ATP they derived
from the fermentation of increasingly scarce foodstuffs to fuel other important
activities. This need to conserve resources might have led to the evolution of
electron-transport proteins that allowed cells to use the movement of electrons
between molecules of different redox potentials as a source of energy for
pumping H+ across the plasma membrane (stage 2 in Figure 14-55). Some of these cells
might have used the nonfermentable organic acids that neighboring cells had
excreted as waste to provide the electrons needed to feed this electron-transport
system. Some present-day bacteria grow on formic acid, for example, using the
small amount of redox energy derived from the transfer of electrons from formic
acid to fumarate to pump H+.


Figure 14-55 How ATP synthesis by chemiosmosis might have evolved
in stages. The first stage could have involved the evolution of an ATPase that
pumped protons out of the cell using the energy of ATP hydrolysis. Stage 2
could have involved the evolution of a different proton pump, driven by an
electron-transport chain. Stage 3 would then have linked these two systems
together to generate a primitive ATP synthase that used the protons pumped
by the electron-transport chain to synthesize ATP. An early bacterium with
this final system would have had a selective advantage over bacteria with
neither of the systems or only one.


Figure 14-54 A comparison of H⁺ concentrations and the arrangement of ATP
synthase in mitochondria and chloroplasts. In both organelles, the pH in the
intermembrane space is 7.4, as in the cytoplasm. The pH of the mitochondrial
matrix and the pH of the chloroplast stroma are both about 8 (light gray). The
pH in the thylakoid space is around 5.5, depending on photosynthetic activity.
This results in a high proton-motive force across the thylakoid membrane,
consisting largely of the H⁺ gradient (a high permeability of this membrane to
Mg²⁺ and Cl⁻ ions allows the flow of these ions to dissipate most of the
membrane potential).

In contrast to chloroplasts, the H⁺ gradient across the inner mitochondrial
membrane is insufficient for ATP production, and mitochondria need a membrane
potential to bring the proton-motive force to the same level as in chloroplasts.
The arrangement of the mitochondrial ATP synthase in rows of dimers along the
cristae ridges (see Figure 14-32) next to the respiratory-chain proton pumps may
help the flow of protons along the membrane surface toward the ATP synthase, as
the availability of protons is limiting for ATP production. In the chloroplast,
the ATP synthase is distributed randomly in thylakoid membranes.

Eventually, some bacteria would have developed H⁺-pumping electron-transport
systems that were so efficient that they could harvest more redox energy than
they needed to maintain their internal pH. Such cells would probably have
generated large electrochemical proton gradients, which they could then use to
produce ATP. Protons could leak back into the cell through the ATP-driven H⁺ pumps,
essentially running them in reverse so that they synthesized ATP (stage 3 in Figure
14-55). Because such cells would require much less of the dwindling supply of
fermentable nutrients, they would have proliferated at the expense of their
neighbors.
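
The pH values quoted in the legend to Figure 14-54 permit a rough estimate of how much of the proton-motive force each organelle can draw from its H⁺ gradient alone. The sketch below assumes the standard relation in which each pH unit contributes 2.303RT/F (about 59 mV at 25°C). The temperature and the function name are illustrative assumptions, not values given in the text.

```python
import math

R = 8.314462618      # gas constant, J/(mol*K)
F = 96485.33212      # Faraday constant, C/mol


def ph_component_mv(ph_alkaline_side, ph_acidic_side, temp_kelvin=298.15):
    """Contribution of a pH difference to the proton-motive force, in mV.

    Assumes each pH unit is worth 2.303*R*T/F volts.
    The default temperature (25 C) is an illustrative assumption.
    """
    delta_ph = ph_alkaline_side - ph_acidic_side
    return 1000.0 * math.log(10) * R * temp_kelvin / F * delta_ph


# pH values taken from the legend to Figure 14-54
chloroplast_mv = ph_component_mv(8.0, 5.5)    # stroma vs thylakoid space
mitochondrion_mv = ph_component_mv(8.0, 7.4)  # matrix vs intermembrane space

print(f"chloroplast:   {chloroplast_mv:.0f} mV from the H+ gradient")
print(f"mitochondrion: {mitochondrion_mv:.0f} mV from the H+ gradient")
```

Under these assumptions the thylakoid H⁺ gradient is worth roughly four times the mitochondrial one, which is consistent with the legend's point that mitochondria must make up the difference with a membrane potential.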

By Providing an Inexhaustible Source of Reducing Power, 
Photosynthetic Bacteria Overcame a Major Evolutionary Obstacle 

The gradual depletion of nutrients from the environment on the early Earth meant
that organisms had to find some alternative source of carbon to make the sugars
that serve as the precursors for so many other cell components. Although the CO₂
in the atmosphere provides an abundant potential carbon source, to convert it
into an organic molecule such as a carbohydrate requires reducing the fixed CO₂
with a strong electron donor, such as NADPH, which can generate (CH₂O) units
from CO₂ (see Figure 14-41). Early in cellular evolution, strong reducing agents
(electron donors) are thought to have been plentiful. But once an ancestor of ATP
synthase began to generate most of the ATP, it would have become imperative for
cells to evolve a new way of generating strong reducing agents.

A major evolutionary breakthrough in energy metabolism came with the
development of photochemical reaction centers that could use the energy of
sunlight to produce molecules such as NADPH. It is thought that this occurred early
in the process of cellular evolution in the ancestors of the green sulfur bacteria.
Present-day green sulfur bacteria use light energy to transfer hydrogen atoms
(as an electron plus a proton) from H₂S to NADPH, thereby producing the strong
reducing power required for carbon fixation. Because the redox potential of H₂S is
much lower than that of H₂O (−230 mV for H₂S compared with +820 mV for H₂O),
one quantum of light absorbed by the single photosystem in these bacteria is
sufficient to generate NADPH via a relatively simple photosynthetic
electron-transport chain.
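
The redox potentials quoted above show why a single photosystem suffices when H₂S is the electron donor. The sketch below converts a redox gap into a free-energy cost using ΔG = nFΔE. The −320 mV midpoint potential for the NADP⁺/NADPH couple is a commonly quoted value that does not appear in this passage and is treated here as an assumption, as are the function and variable names.

```python
F_KJ_PER_V_MOL = 96.485  # Faraday constant, kJ/(V*mol)

E_H2S_MV = -230    # from the text
E_H2O_MV = 820     # from the text
E_NADPH_MV = -320  # assumed standard value; not stated in this passage


def uphill_cost_kj_per_mol(donor_mv, acceptor_mv, n_electrons=2):
    """Free energy needed to move n electrons from donor to acceptor.

    Positive values mean the transfer is energetically uphill and must be
    driven by an external input such as absorbed light.
    """
    gap_volts = (donor_mv - acceptor_mv) / 1000.0
    return n_electrons * F_KJ_PER_V_MOL * gap_volts


cost_from_h2s = uphill_cost_kj_per_mol(E_H2S_MV, E_NADPH_MV)
cost_from_h2o = uphill_cost_kj_per_mol(E_H2O_MV, E_NADPH_MV)

print(f"H2S -> NADPH: {cost_from_h2s:.0f} kJ/mol per electron pair")
print(f"H2O -> NADPH: {cost_from_h2o:.0f} kJ/mol per electron pair")
```

With these inputs the gap from H₂O is more than ten times the gap from H₂S, which gives a sense of how much harder it is to use water as the electron source.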

The Photosynthetic Electron-Transport Chains of Cyanobacteria 
Produced Atmospheric Oxygen and Permitted New Life-Forms 

The next evolutionary step, which is thought to have occurred with the
development of the cyanobacteria perhaps 3 billion years ago, was the evolution of
organisms capable of using water as the electron source for CO₂ reduction. This entailed
the evolution of a water-splitting enzyme and also required the addition of a
second photosystem, acting in series with the first, to bridge the large gap in redox
potential between H₂O and NADPH. The biological consequences of this
evolutionary step were far-reaching. For the first time, there would have been organisms
that could survive on water, CO₂, and sunlight (plus a few trace elements). These
cells would have been able to spread and evolve in ways denied to the earlier
photosynthetic bacteria, which needed H₂S or organic acids as a source of electrons.
Consequently, large amounts of biologically synthesized, reduced organic
materials accumulated and oxygen entered the atmosphere for the first time.

Oxygen is highly toxic because the oxidation of biological molecules alters
their structure and properties indiscriminately and irreversibly. Most anaerobic
bacteria, for example, are rapidly killed when exposed to air. Thus, organisms
on the primitive Earth would have had to evolve protective mechanisms against
the rising O₂ levels in the environment. Late evolutionary arrivals, such as
ourselves, have numerous detoxifying mechanisms that protect our cells from the ill
effects of oxygen. Even so, an accumulation of oxidative damage to our
macromolecules has been postulated to contribute to human aging, as we discuss in the
next section.

Figure 14-56 Major events during the evolution of living organisms on Earth. With the evolution of the membrane-based
process of photosynthesis, organisms were able to make their own organic molecules from CO₂ gas. The delay of more than
10⁹ years between the appearance of bacteria that split water and released O₂ during photosynthesis and the accumulation
of high levels of O₂ in the atmosphere is thought to be due to the initial reaction of the oxygen with the abundant ferrous iron
(Fe²⁺) that was dissolved in the early oceans. Only when the ferrous iron was used up would oxygen have started to accumulate
in the atmosphere. In response to the rising oxygen levels, nonphotosynthetic oxygen-consuming organisms evolved, and the
concentration of oxygen in the atmosphere equilibrated at its present-day level.


The increase in atmospheric O₂ was very slow at first and would have allowed
a gradual evolution of protective devices. For example, the early seas contained
large amounts of iron in its reduced, ferrous state (Fe²⁺), and nearly all the O₂
produced by early photosynthetic bacteria would have been used up in oxidizing Fe²⁺
to ferric Fe³⁺. This conversion caused the precipitation of huge amounts of stable
oxides, and the extensive banded iron formations in sedimentary rocks,
beginning about 2.7 billion years ago, help to date the spread of the cyanobacteria. By
about 2 billion years ago, the supply of Fe²⁺ was exhausted, and the deposition
of further iron precipitates ceased. Geological evidence reveals how O₂ levels in
the atmosphere have changed over billions of years, approximating current levels
only about 0.5 billion years ago (Figure 14-56).

The availability of O₂ enabled the rise of bacteria that developed an aerobic
metabolism to make their ATP. These organisms could harness the large amount
of energy released by breaking down carbohydrates and other reduced organic
molecules all the way to CO₂ and H₂O, as explained when we discussed
mitochondria. Components of preexisting electron-transport complexes were
modified to produce a cytochrome oxidase, so that the electrons obtained from organic
or inorganic substrates could be transported to O₂ as the terminal electron
acceptor. Some present-day purple photosynthetic bacteria can switch between
photosynthesis and respiration depending on the availability of light and O₂, with only
relatively minor reorganizations of their electron-transport chains.

In Figure 14-57, we relate these postulated evolutionary pathways to different
types of bacteria. By necessity, evolution is always conservative, taking parts of
the old and building on them to create something new. Thus, parts of the
electron-transport chains that were derived to service anaerobic bacteria 3-4 billion
years ago survive, in altered form, in the mitochondria and chloroplasts of today's
higher eukaryotes. A good example is the overall similarity in structure and
function between the cytochrome c reductase that pumps H⁺ in the central segment of
the mitochondrial respiratory chain and the analogous cytochrome b₆-f complex
in the electron-transport chains of both bacteria and chloroplasts, revealing their
common evolutionary origin (Figure 14-58).

Figure 14-57 Evolutionary scheme showing the postulated origins of mitochondria and chloroplasts and their bacterial
ancestors. The consumption of oxygen by respiration is thought to have first developed about 2 billion years ago.
Nucleotide-sequence analyses suggest that an endosymbiotic oxygen-evolving cyanobacterium (cyan) gave rise to chloroplasts (dark
green), while mitochondria arose from an α-proteobacterium. The nearest relatives of mitochondria (pink) are members of
three closely related groups of α-proteobacteria (the rhizobacteria, agrobacteria, and rickettsias) known to form intimate
associations with present-day eukaryotic cells. Proteobacteria are pink, purple photosynthetic bacteria are purple, and other
photosynthetic bacteria are light green.


Summary 

Chloroplasts and photosynthetic bacteria have the unique ability to harness the
energy of sunlight to produce energy-rich compounds. This is achieved by the
photosystems, in which chlorophyll molecules attached to proteins are excited when
hit by a photon. Photosystems are composed of an antenna complex that collects
solar energy and a photochemical reaction center, in which the collected energy is
funneled to a chlorophyll molecule held in a special position, enabling it to
withdraw electrons from an electron donor. Chloroplasts and cyanobacteria contain
two distinct photosystems. The two photosystems are normally linked in series in
the Z scheme, and they transfer electrons from water to NADP⁺ to form NADPH,
generating a transmembrane electrochemical potential. One of the two
photosystems, photosystem II, can split water by removing electrons from this
ubiquitous, low-energy compound. All the molecular oxygen (O₂) in our atmosphere is
a by-product of the water-splitting reaction in this photosystem. The
three-dimensional structures of photosystems I and II are strikingly similar to the photosystems
of purple photosynthetic bacteria, demonstrating a remarkable degree of
conservation over billions of years of evolution.

The two photosystems and the cytochrome b₆-f complex reside in the thylakoid
membrane, a separate membrane system in the central stroma compartment of the
chloroplast that is differentiated into stacked grana and unstacked stroma
thylakoids. Electron-transport processes in the thylakoid membrane cause protons to be
released into the thylakoid space. The backflow of protons through the chloroplast
ATP synthase then generates ATP. This ATP is used in conjunction with the NADPH
produced by photosynthesis to drive a large number of biosynthetic reactions in
the chloroplast stroma, including the carbon-fixation cycle, which generates large
amounts of carbohydrates from CO₂.

In the early evolution of life, cyanobacteria overcame a major obstacle in
devising a way to use solar energy to split water and fix carbon dioxide. Cyanobacteria
produced both abundant organic nutrients and molecular oxygen, enabling the rise
of a multitude of aerobic life-forms. The chloroplasts in plants have evolved from
a cyanobacterium that was endocytosed long ago by an aerobic eukaryotic host
organism.

Figure 14-58 A comparison of three electron-transport chains discussed in this chapter.
Bacteria, chloroplasts, and mitochondria all contain a membrane-bound enzyme complex that
resembles the cytochrome c reductase of mitochondria. These complexes all accept electrons from
a quinone carrier (Q) and pump H⁺ across their respective membranes. Moreover, in reconstituted
in vitro systems, the different complexes can substitute for one another, and the structures of
their protein components reveal that they are evolutionarily related. Note that the purple nonsulfur
bacteria use a cyclic flow of electrons to produce a large electrochemical proton gradient that drives
a reverse electron flow through NADH dehydrogenase to produce NADH from NAD⁺ + H⁺ + e⁻.

THE GENETIC SYSTEMS OF MITOCHONDRIA AND
CHLOROPLASTS

As we discussed in Chapter 1, mitochondria and chloroplasts are thought to have
evolved from endosymbiotic bacteria (see Figures 1-29 and 1-31). Both types
of organelles still contain their own genomes (Figure 14-59). As we will discuss
shortly, they also retain their own biosynthetic machinery for making RNA and
organellar proteins.

Like bacteria, mitochondria and chloroplasts proliferate by growth and
division of an existing organelle. In actively dividing cells, each type of organelle must
double in mass in each cell generation and then be distributed into each daughter
cell. In addition, nondividing cells must replenish organelles that are degraded as
part of the continual process of organelle turnover, or produce additional
organelles as the need arises. Organelle growth and proliferation are therefore
carefully controlled. The process is complicated because mitochondrial and
chloroplast proteins are encoded in two places: the nuclear genome and the separate
genomes harbored in the organelles themselves. The biogenesis of mitochondria
and chloroplasts thus requires contributions from two separate genetic systems,
which must be closely coordinated.

Most organellar proteins are encoded by the nuclear DNA. The organelle
imports these proteins from the cytosol, after they have been synthesized on
cytosolic ribosomes, through the mitochondrial protein translocases of the outer and
inner mitochondrial membrane (TOM and TIM). In Chapter 12, we discussed
how this happens. Here, we describe the organelle genomes and genetic systems,
and consider the consequences of separate organelle genomes for the cell and the
organism as a whole.

The Genetic Systems of Mitochondria and Chloroplasts Resemble 
Those of Prokaryotes 

As discussed in Chapter 12, it is thought that eukaryotic cells originated through
a symbiotic relationship between an archaeon and an aerobic bacterium (a
proteobacterium). The two organisms are postulated to have merged to form the
ancestor of all nucleated cells, with the archaeon providing the nucleus and the
proteobacterium serving as a respiring, ATP-producing endosymbiont, one that
would eventually evolve into the mitochondrion (see Figure 12-3). This most likely
occurred roughly 1.6 billion years ago, when oxygen had entered the atmosphere
in substantial amounts (see Figure 14-56). The chloroplast was derived later, after
the plant and animal lineages diverged, through endocytosis of an
oxygen-producing cyanobacterium.

This endosymbiont hypothesis of organelle development receives strong
support from the observation that the genetic systems of mitochondria and
chloroplasts are similar to those of present-day bacteria. For example, chloroplast
ribosomes are very similar to bacterial ribosomes, both in their structure and in their
sensitivity to various antibiotics (such as chloramphenicol, streptomycin,
erythromycin, and tetracycline). In addition, protein synthesis in chloroplasts starts with
N-formylmethionine, as in bacteria, and not with methionine as in the cytosol of
eukaryotic cells. Although mitochondrial genetic systems are much less similar to
those of present-day bacteria than are the genetic systems of chloroplasts, their
ribosomes are also sensitive to antibacterial antibiotics, and protein synthesis in
mitochondria also starts with N-formylmethionine.

Figure 14-59 Staining of nuclear and mitochondrial DNA. In this confocal
micrograph of a human fibroblast, the nuclear DNA is stained with the dye DAPI
(blue) and mitochondrial DNA is visualized with fluorescent antibodies that bind
DNA (green). The mitochondria are stained with fluorescent antibodies that
recognize a specialized protein translocase specific to the outer mitochondrial
membrane (red). Numerous copies of the mitochondrial genome are distributed
in distinct nucleoids throughout the mitochondria that snake through the
cytoplasm. (From C. Kukat et al., Proc. Natl Acad. Sci. USA 108:13534-13539,
2011. With permission from the National Academy of Sciences.)

The processes of organelle DNA transcription, protein synthesis, and DNA
replication take place where the genome is located: in the matrix of mitochondria
or the stroma of chloroplasts. Although the enzymes that mediate these genetic
processes are unique to the organelle, and resemble those of bacteria (or even
of bacterial viruses) rather than their eukaryotic analogs, the nuclear genome
encodes the vast majority of these enzymes. Indeed, most present-day
mitochondrial and chloroplast proteins are encoded by genes that reside in the cell nucleus.

Over Time, Mitochondria and Chloroplasts Have Exported Most of
Their Genes to the Nucleus by Gene Transfer

The nature of the organelle genes located in the nucleus of the cell demonstrates
that an extensive transfer of genes from organelle to nuclear DNA has occurred
in the course of eukaryotic evolution. Such successful gene transfer is expected
to be rare, because any gene moved from the organelle needs to adapt to both
nuclear transcription and cytoplasmic translation requirements. In addition, the
protein needs to acquire a signal sequence that directs it to the correct
organelle after its synthesis in the cytosol. By comparing the genes in the
mitochondria from different organisms, we can infer that some of the gene transfers to the
nucleus occurred relatively recently. The smallest and presumably most highly
evolved mitochondrial genomes, for example, encode only a few hydrophobic
inner-membrane proteins of the electron-transport chain, plus ribosomal RNAs
(rRNAs) and transfer RNAs (tRNAs). Other mitochondrial genomes that have
remained more complex tend to contain this same subset of genes along with
others (Figure 14-60). The most complex mitochondrial genomes include genes
that encode components of the mitochondrial genetic system, such as RNA
polymerase subunits and ribosomal proteins; these same genes are found in the cell
nucleus in yeast and all animal cells.

The proteins that are encoded by genes in the organellar DNA are synthesized
on ribosomes within the organelle, using organelle-produced messenger RNA
(mRNA) to specify their amino acid sequence (Figure 14-61). The protein
traffic between the cytosol and these organelles seems to be unidirectional: proteins
are normally not exported from mitochondria or chloroplasts to the cytosol. An
important exception occurs when a cell is about to undergo apoptosis. As will be
discussed in detail in Chapter 18, during apoptosis the mitochondrion releases
proteins (most notably cytochrome c) from the crista space through its outer
mitochondrial membrane, as part of an elaborate signaling pathway that is
triggered to cause cells to undergo programmed cell death.

Figure 14-60 Comparison of mitochondrial genomes. Less complex
mitochondrial genomes encode subsets of the proteins and ribosomal RNAs that
are encoded by larger mitochondrial genomes. In this comparison, there are
only five genes that are shared by the six mitochondrial genomes; these encode
ribosomal RNAs (rns and rnl), cytochrome b (cob), and two cytochrome oxidase
subunits (cox1 and cox3). Blue indicates ribosomal RNAs; green, ribosomal proteins;
and brown, components of the respiratory chain and other proteins. (Adapted from
M.W. Gray, G. Burger and B.F. Lang, Science 283:1476-1481, 1999.)


The Fission and Fusion of Mitochondria Are Topologically Complex 
Processes 


Figure 14-61 Biogenesis of the respiratory-chain proteins in human
mitochondria. Most of the protein components of the mitochondrial
respiratory chain are encoded by nuclear DNA, with only a small number encoded by
mitochondrial DNA (mtDNA). Transcription of mtDNA produces 13 mRNAs, all of
which encode subunits of the oxidative phosphorylation system, and the 24 RNAs
(22 transfer RNAs and 2 ribosomal RNAs) needed for translation of these mRNAs on
the mitochondrial ribosomes (brown).

The mRNAs produced by transcription of nuclear genes are translated on
cytoplasmic ribosomes (green), which are distinct from the mitochondrial
ribosomes. The nuclear-encoded mitochondrial proteins (dark green) are
imported into mitochondria through two protein translocases called TOM and
TIM, and constitute the vast majority of the approximately 1000 different
protein species present in mammalian mitochondria. The nuclear-encoded
mitochondrial proteins in humans include the majority of the oxidative
phosphorylation system subunits, all proteins needed for expression and
maintenance of mtDNA, and all proteins of the mitochondrial ribosomes.

The mtDNA-encoded subunits (orange) assemble together with the nuclear
subunits to form a functional oxidative phosphorylation system. (Adapted from
N.G. Larsson, Annu. Rev. Biochem. 79:683-706, 2010.)


In mammalian cells, mitochondrial DNA makes up less than 1% of the total
cellular DNA. In other cells, however, such as the leaves of higher plants or the very
large egg cells of amphibians, a much larger fraction of the cellular DNA may be
present in mitochondria or chloroplasts (Table 14-2), and a large fraction of the
total RNA and protein synthesis takes place in the organelles.

TABLE 14-2 Relative Amounts of Organelle DNA in Some Cells and Tissues

Organism         Tissue or     DNA molecules    Organelles    Organelle DNA as percentage
                 cell type     per organelle    per cell      of total cellular DNA

Mitochondrial DNA
Rat              Liver         5-10             1000          1
Yeast*           Vegetative    2-50             1-50          15
Frog             Egg           5-10             10⁷           99

Chloroplast DNA
Chlamydomonas    Vegetative    80               1             7
Maize            Leaves        0-300**          20-40         0-15**

*The large variation in the number and size of mitochondria per cell in yeasts is due to mitochondrial fusion and fission.
**In maize, the amount of chloroplast DNA drops precipitously in mature leaves, after cell division ceases: the chloroplast
DNA is degraded and stable mRNAs persist to provide for protein synthesis.

Mitochondria and chloroplasts are large enough to be visible by light
microscopy in living cells. For example, mitochondria can be visualized by expressing in
cells a genetically engineered fusion of a mitochondrial protein linked to green
fluorescent protein (GFP), or cells can be incubated with a fluorescent dye that
is specifically taken up by mitochondria because of their membrane potential.
Such images demonstrate that the mitochondria in living cells are dynamic:
frequently dividing by fission, fusing, and changing shape (Figure 14-62 and
Movie 14.12). The fission of mitochondria may be necessary so that small parts of
the network can pinch off and reach remote regions of the cell, for example in the
thin, extended axon and dendrites of a neuron.

Figure 14-62 The mitochondrial reticulum is dynamic. (A) In yeast cells, mitochondria form a continuous reticulum on the
cytoplasmic side of the plasma membrane (stereo pair). (B) A balance between fission and fusion determines the arrangement
of the mitochondria in different cells. (C) Time-lapse fluorescent microscopy shows the dynamic behavior of the mitochondrial
network in a yeast cell. In addition to shape changes, fission and fusion constantly remodel the network (red arrows). These
pictures were taken at 3-minute intervals. (A and C, from J. Nunnari et al., Mol. Biol. Cell 8:1233-1242, 1997. With permission
from the American Society for Cell Biology.)

The fission and fusion of mitochondria are topologically complex processes that
must ensure the integrity of the separate mitochondrial compartments defined by
the inner and outer membranes. These processes control the number and shape
of mitochondria, which can vary dramatically in different cell types, ranging from
multiple spherical or wormlike organelles to a highly branched, net-shaped single
organelle called a reticulum. Each depends on its own special set of proteins. The
mitochondrial fission machine works by assembling dynamin-related GTPases
(discussed in Chapter 13) into helical oligomers that cause local constrictions in
tubular mitochondria. GTP hydrolysis then generates the mechanical force that
severs the inner and outer mitochondrial membranes in one step (Figure 14-63).
Mitochondrial fusion requires two separate machineries, one each for the outer
and the inner membrane (Figure 14-64). In addition to GTP hydrolysis for force
generation, both mechanisms also depend on the mitochondrial proton-motive
force for reasons that are still unknown.

Animal Mitochondria Contain the Simplest Genetic Systems 
Known 

Comparisons of DNA sequences in different organisms reveal that, in vertebrates
(including ourselves), the mutation rate during evolution has been roughly 100
times greater in the mitochondrial genome than in the nuclear genome. This
difference is likely to be due to lower fidelity of mitochondrial DNA replication,
inefficient DNA repair, or both, given that the mechanisms that perform these
processes in the organelle are relatively simple compared with those in the nucleus.
As discussed in Chapter 4, the relatively high rate of evolution of animal
mitochondrial genes makes a comparison of mitochondrial DNA sequences especially
useful for estimating the dates of relatively recent evolutionary events, such as the
steps in primate evolution.

Figure 14-63 A model for mitochondrial division. Dynamin-1 (yellow)
exists as dimers in the cytosol, which form larger oligomeric structures in a
process that requires GTP hydrolysis. Dynamin assemblies interact with the
outer mitochondrial membrane through special adaptor proteins, forming a
spiral of GTP-dynamin around the mitochondrion that causes a constriction.
A concerted GTP-hydrolysis event in the dynamin subunits is then thought to
produce the conformational changes that result in fission. (Adapted from S.
Hoppins, L. Lackner and J. Nunnari, Annu. Rev. Biochem. 76:751-780, 2007.)

There are 13 protein-encoding genes in human mitochondrial DNA (Figure
14-65). These code for hydrophobic components of the respiratory-chain
complexes and of ATP synthase. In contrast, roughly 1000 mitochondrial proteins are
encoded in the nucleus, produced on cytosolic ribosomes, and imported by the
protein import machinery in the outer and inner membrane (discussed in
Chapter 12). It has been suggested that the cytosolic production of hydrophobic
membrane proteins and their import into the organelle may present a problem to the
cell, and that this is the reason why their genes have remained in the
mitochondrion. However, some of the most hydrophobic mitochondrial proteins, such as
the c subunit of the ATP synthase rotor ring, are imported from the cytosol in some
species (though they are mitochondrially encoded in others). And the parasites
Plasmodium falciparum and Leishmania tarentolae, which spend most of their
life cycles inside cells of their host organisms, have retained only two or three
mitochondrially encoded proteins.

The size range of mitochondrial DNAs is similar to that of viral DNAs. The
mitochondrial DNA in Plasmodium falciparum (the human malaria parasite) has
less than 6000 nucleotide pairs, whereas the mitochondrial DNAs of some land
plants contain more than 300,000 nucleotide pairs (Figure 14-66). In animals, the
mitochondrial genome is a simple DNA circle of about 16,600 nucleotide pairs
(less than 0.001% of the nuclear genome), and it is nearly the same size in
organisms as different from us as Drosophila and sea urchins.
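
The size comparison in this paragraph is simple to check. The sketch below assumes a haploid human nuclear genome of about 3.2 × 10⁹ nucleotide pairs, a figure that is not given in this passage; the variable names are illustrative.

```python
MTDNA_PAIRS = 16_600      # animal mitochondrial genome, from the text
NUCLEAR_PAIRS = 3.2e9     # assumed haploid human nuclear genome size

percent_of_nuclear = 100.0 * MTDNA_PAIRS / NUCLEAR_PAIRS
print(f"mtDNA is {percent_of_nuclear:.5f}% of the nuclear genome")

# Size range quoted in the text: Plasmodium (< 6000) to land plants (> 300,000)
fold_range = 300_000 / 6_000
print(f"mitochondrial genomes span at least a {fold_range:.0f}-fold size range")
```

With this assumed genome size the result falls comfortably under the 0.001% bound stated in the text.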


Mitochondria Hâve a Relaxed Codon Usage and Can Hâve a 
Variant Genetic Code 


The human mitochondrial genome has several surprising features that distin- 
guish it from nuclear, chloroplast, and bacterial genomes: 

1. Dense gene paclcing. Unlilce other genomes, the human mitochondrial 
genome seems to contain almost no noncoding DNA: nearly every nuclé¬ 
otide seems to be part of a coding sequence, either for a protein or for one 
of the rRNAs or tRNAs. Since these coding sequences run directly into each 
other, there is very little room left for regulatory DNA sequences. 

2. Relaxed codon usage. Whereas 30 or more tRNAs specify amino acids in the 
cytosol and in chloroplasts, only 22 tRNAs are required for mitochondrial 
protein synthesis. The normal codon-anticodon pairing rules are relaxed 
in mitochondria, so that many tRNA molecules recognize any one of the 
four nucleotides in the third (wobble) position. Such “2 out of 3” pairing 
allows one tRNA to pair with any one of four codons and permits protein 
synthesis with fewer tRNA molecules. 

3. Variant genetic code. Perhaps most surprising, comparisons of mitochondrial 
gene sequences and the amino acid sequences of the corresponding 
proteins indicate that the genetic code is different: 4 of the 64 codons have 
different “meanings” from those of the same codons in other genomes 
(Table 14-3). 

Figure 14-64 A model for mitochondrial fusion. The fusions of the outer and 
inner mitochondrial membranes are coordinated sequential events, each of 
which requires a separate set of protein factors. Outer membrane fusion is 
brought about by an outer-membrane GTPase (purple), which forms an oligomeric 
complex that includes subunits anchored in the two membranes to be fused. 
Fusion of outer membranes requires GTP and an H⁺ gradient across the inner 
membrane. For fusion of the inner membrane, a dynamin-related protein forms 
an oligomeric tethering complex (blue) that includes subunits anchored in 
the two inner membranes to be fused. Fusion of the inner membranes requires 
GTP and the electrical component of the potential across the inner membrane. 
(Adapted from S. Hoppins, L. Lackner and J. Nunnari, Annu. Rev. Biochem. 
76:751-780, 2007.) 

Figure 14-65 The organization of the human mitochondrial genome. The human 
mitochondrial genome of ~16,600 nucleotide pairs contains 2 rRNA genes, 
22 tRNA genes, and 13 protein-coding sequences. There are two transcriptional 
promoters, one for each strand of the mitochondrial DNA (mtDNA). The DNAs of 
many other animal mitochondrial genomes have been completely sequenced. Most 
of these animal mitochondrial DNAs encode precisely the same genes as humans, 
with the gene order being identical for animals ranging from fish to mammals. 
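
The saving from relaxed codon usage (item 2 above) can be checked with a small count. The following Python sketch is only an illustration under stated assumptions: it starts from the standard codon table, applies the mammalian mitochondrial reassignments listed in Table 14-3, and assumes a simplified pairing model in which one tRNA reads all four codons of a block that encodes a single amino acid, while a split block needs one tRNA for each amino acid found in its pyrimidine-ending (U, C) and purine-ending (A, G) halves. The names STANDARD, MAMMALIAN_MITO, and min_trnas are hypothetical.

```python
from itertools import product

BASES = "UCAG"

# Standard ("universal") genetic code in one-letter amino acid symbols,
# codons ordered UUU, UUC, UUA, UUG, UCU, ... ('*' marks a stop codon).
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
STANDARD = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AAS)
}

# Mammalian mitochondrial reassignments from Table 14-3:
# UGA = Trp (W), AUA = Met (M), AGA and AGG = STOP.
MAMMALIAN_MITO = {**STANDARD, "UGA": "W", "AUA": "M", "AGA": "*", "AGG": "*"}


def min_trnas(code):
    """Count tRNAs needed under the simplified '2 out of 3' model (an assumption)."""
    count = 0
    for first, second in product(BASES, repeat=2):
        # The four codons that share the same first two bases (third base U, C, A, G).
        block = [code[first + second + third] for third in BASES]
        if len(set(block)) == 1 and block[0] != "*":
            count += 1  # one tRNA ignores the third base entirely
            continue
        # Split block: treat the U/C-ending and A/G-ending halves separately.
        for half in (block[:2], block[2:]):
            count += len(set(half) - {"*"})  # stop codons need no tRNA
    return count


print(min_trnas(MAMMALIAN_MITO))
```

Under these assumptions the count for the mammalian mitochondrial code comes to 22, the number of tRNA genes found in the human mitochondrial genome.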

The close similarity of the genetic code in all organisms provides strong 
evidence that they all have evolved from a common ancestor. How, then, do we 
explain the differences in the genetic code in many mitochondria? A hint comes 
from the finding that the mitochondrial genetic code in different organisms is not 
the same. In the mitochondrion with the largest number of genes in Figure 14-60, 
that of the protozoan Reclinomonas, the genetic code is unchanged from the 
standard genetic code of the cell nucleus. Yet UGA, which is a stop codon elsewhere, 
is read as tryptophan in the mitochondria of mammals, fungi, and invertebrates. 
Similarly, the codon AGG normally codes for arginine, but it codes for stop in 
the mitochondria of mammals and codes for serine in the mitochondria of 
Drosophila (see Table 14-3). Such variation suggests that a random drift can occur 
in the genetic code in mitochondria. Presumably, the unusually small number of 
proteins encoded by the mitochondrial genome makes an occasional change in 
the meaning of a rare codon tolerable, whereas such a change in a larger genome 
would alter the function of many proteins and thereby destroy the cell. 

Interestingly, in many species, one or two tRNAs for mitochondrial protein 
synthesis are encoded in the nucleus. Some parasites, for example 
trypanosomes, have not retained any tRNA genes in their mitochondrial DNA. Instead, 
the required tRNAs are all produced in the cytosol and are thought to be imported 
into the mitochondrion by special tRNA translocases that are distinct from the 
mitochondrial protein import system. 


Figure 14-66 Comparison of various sizes of mitochondrial genomes with 
the genome of bacterial ancestors. The complete DNA sequences for thousands 
of mitochondrial genomes have been determined. The lengths of a few of these 
mitochondrial DNAs are shown to scale—as circles for those genomes thought 
to be circular and lines for linear genomes. The largest circle represents the 
genome of Rickettsia prowazekii, a small pathogenic bacterium whose genome 
most closely resembles that of mitochondria. The size of mitochondrial genomes 
does not correlate well with the number of proteins encoded in them: while 
human mitochondrial DNA encodes 13 proteins, the 22-fold larger mitochondrial 
DNA of Arabidopsis thaliana encodes only 32 proteins—that is, about 2.5-fold 
as many as human mitochondrial DNA. The extra DNA that is found in 
Arabidopsis, Marchantia, and other plant mitochondria may be “junk DNA”—that 
is, noncoding DNA with no apparent function. The mitochondrial DNA of the 
protozoan Reclinomonas americana has 98 genes. (Adapted from M.W. Gray, 
G. Burger and B.F. Lang, Science 283:1476-1481, 1999.) 


Table 14-3 Some Differences Between the “Universal” Code and Mitochondrial 
Genetic Codes 

                                Mitochondrial codes 
Codon       “Universal” code    Mammals   Invertebrates   Yeasts   Plants 
UGA         STOP                Trp*      Trp*            Trp*     STOP 
AUA         Ile                 Met*      Met*            Met*     Ile 
CUA         Leu                 Leu       Leu             Thr*     Leu 
AGA, AGG    Arg                 STOP*     Ser*            Arg      Arg 

*Differs from the “Universal” code. 
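
The reassignments in Table 14-3 can be expressed as small overrides on the standard codon table. The Python sketch below is illustrative only: the names OVERRIDES and translate and the short RNA message are hypothetical, and amino acids are written in one-letter symbols (W = Trp, M = Met, T = Thr, S = Ser, with '*' for STOP).

```python
from itertools import product

BASES = "UCAG"

# Standard ("universal") genetic code, codons ordered UUU, UUC, UUA, UUG, UCU, ...
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
STANDARD = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AAS)
}

# Codon reassignments taken row by row from Table 14-3.
OVERRIDES = {
    "universal": {},
    "mammals": {"UGA": "W", "AUA": "M", "AGA": "*", "AGG": "*"},
    "invertebrates": {"UGA": "W", "AUA": "M", "AGA": "S", "AGG": "S"},
    "yeasts": {"UGA": "W", "AUA": "M", "CUA": "T"},
    "plants": {},
}


def translate(rna, system="universal"):
    """Translate every complete codon of rna; stop codons are shown as '*'."""
    code = {**STANDARD, **OVERRIDES[system]}
    usable = len(rna) - len(rna) % 3  # ignore a trailing partial codon
    return "".join(code[rna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical message containing each codon from the table: AUG AUA UGA CUA AGA
MESSAGE = "AUGAUAUGACUAAGA"
for system in OVERRIDES:
    print(f"{system:14s}{translate(MESSAGE, system)}")
```

With this message, the universal code gives MI*LR, whereas the mammalian mitochondrial code gives MMWL*: the same nucleotide sequence specifies a different protein depending on which genetic system reads it.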





















Chloroplasts and Bacteria Share Many Striking Similarities 

The chloroplast genomes of land plants range in size from 70,000 to 200,000 
nucleotide pairs. More than 300 chloroplast genomes have now been sequenced. Many 
are surprisingly similar, even in distantly related plants (such as tobacco and 
liverwort), and even those of green algae are closely related (Figure 14-67). 
Chloroplast genes are involved in three main processes: transcription, translation, and 
photosynthesis. Plant chloroplast genomes typically encode 80-90 proteins and 
around 45 RNAs, including 37 or more tRNAs. As in mitochondria, most of the 
organelle-encoded proteins are part of larger protein complexes that also contain 
one or more subunits encoded in the nucleus and imported from the cytosol. 

The genomes of chloroplasts and bacteria have striking similarities. Basic 
regulatory sequences, such as transcription promoters and terminators, are virtually 
identical. The amino acid sequences of the proteins encoded in chloroplasts are 
clearly recognizable as bacterial, and several clusters of genes with related 
functions (such as those encoding ribosomal proteins) are organized in the same way 
in the genomes of chloroplasts, the bacterium E. coli, and cyanobacteria. 

The mechanisms by which chloroplasts and bacteria divide are also similar. 
Both utilize FtsZ proteins, which are self-assembling GTPases related to tubulins 
(see Chapter 16). Bacterial FtsZ is a soluble protein that assembles into a dynamic 
ring of membrane-attached protofilaments beneath the plasma membrane in the 
middle of the dividing cell. The FtsZ ring acts as a scaffold for recruitment of other 
cell-division proteins and generates a contractile force that results in membrane 
constriction and eventually in cell division. Presumably, chloroplasts divide in 
very much the same way. Although both employ membrane-interacting GTPases, 
the mechanisms by which mitochondria and chloroplasts divide are 
fundamentally different. The machinery for chloroplast division acts from the inside, as in 
bacteria, while the dynamin-like GTPases divide mitochondria from the outside 
(see Figure 14-63). The chloroplasts have remained closer to their bacterial 
origins than have mitochondria, since the eukaryotic mechanisms of membrane 
constriction and vesicle formation have been adapted for mitochondrial fission. 

The RNA editing and RNA processing that is prevalent in chloroplasts owes 
everything to their eukaryotic hosts. This RNA processing includes the 
generation of transcript 5' and 3' termini and the cleavage of polycistronic transcripts. 
In addition, an RNA editing process converts specific C residues to U and can 
change the amino acid specified by the edited codon. These and other RNA-based 
processes are catalyzed by protein families that are not found in prokaryotes. One 
can ask why the expression of so few chloroplast genes needs to be so complex. 
One explanation is that the expression of chloroplast and nuclear genes must 
be closely coordinated. More generally, the bacterial concept of the operon as 
a co-regulated set of genes in a single transcription unit has been largely 
abandoned in chloroplasts. Polycistronic transcripts are cleaved into smaller 
fragments, which then require splicing or RNA editing to become functional. 

Figure 14-67 The organization of the liverwort chloroplast genome. The 
chloroplast genome organization is similar in all higher plants, although 
the size varies from species to species—depending on how much of the DNA 
surrounding the genes encoding the chloroplast’s 16S and 23S ribosomal RNAs 
is present in two copies. Figure labels: tRNA genes, ribosomal protein genes, 
photosystem I genes, photosystem II genes, ATP synthase genes, genes for 
b6-f complex, RNA polymerase genes, inverted repeats. 


Organelle Genes Are Maternally Inherited in Animals and Plants 

In Saccharomyces cerevisiae (baker's yeast), when two haploid cells mate, they 
are equal in size and contribute equal amounts of mitochondrial DNA to the 
diploid zygote. Mitochondrial inheritance in yeasts is therefore biparental: both 
parents contribute equally to the mitochondrial gene pool of the progeny. However, 
during the course of the subsequent asexual, vegetative growth, the mitochondria 
become distributed more or less randomly to daughter cells. After a few 
generations, the mitochondria of any given cell contain only the DNA from one or the 
other parent cell, because only a small sample of the mitochondrial DNA passes 
from the mother cell to the bud of the daughter cell. This process is known as 
mitotic segregation, and it gives rise to a distinct form of inheritance that is called 
non-Mendelian, or cytoplasmic inheritance, in contrast to the Mendelian 
inheritance of nuclear genes. 
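
Mitotic segregation can be illustrated with a toy simulation. The Python sketch below is not a model of real yeast: the copy number (50), the size of the sample passed to each bud (5), and the function names are all assumptions, chosen only to show how repeated random sampling of a small number of mitochondrial DNA copies drives a mixed population toward a single parental type.

```python
import random


def bud(mother_mtdna, sample_size, rng):
    """Pass a small random sample of mtDNA copies to the bud, then let the
    sampled copies replicate until the daughter has the full copy number."""
    founders = rng.sample(mother_mtdna, sample_size)
    return [rng.choice(founders) for _ in range(len(mother_mtdna))]


def generations_until_uniform(copies=50, sample_size=5, seed=1,
                              max_generations=1000):
    """Follow one line of daughter cells from a zygote that holds equal amounts
    of mtDNA from parents 'A' and 'B'. Returns (generation, surviving type),
    or (None, None) if the population is still mixed after max_generations."""
    rng = random.Random(seed)
    cell = ["A"] * (copies // 2) + ["B"] * (copies - copies // 2)
    for generation in range(1, max_generations + 1):
        cell = bud(cell, sample_size, rng)
        if len(set(cell)) == 1:  # only one parental type is left
            return generation, cell[0]
    return None, None


generation, survivor = generations_until_uniform()
print(f"Only mtDNA from parent {survivor} remains after {generation} generations")
```

Which parental type survives, and after how many generations, depends on the random seed; a smaller sample_size makes the population uniform sooner, which is the sense in which only a small sample of mitochondrial DNA reaching the bud produces segregation.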

The inheritance of mitochondria in animals and plants is quite different. In 
these organisms, the egg cell contributes much more cytoplasm to the zygote than 
does the male gamete (sperm in animals, pollen in plants). For example, a 
typical human oocyte contains about 100,000 copies of maternal mitochondrial DNA, 
whereas a sperm cell contains only a few. In addition, an active process ensures 
that the sperm mitochondria do not compete with those in the egg. As sperm 
mature, the DNA is degraded in their mitochondria. Sperm mitochondria are also 
specifically recognized and then eliminated from the fertilized egg cell by autophagy 
in very much the same way that damaged mitochondria are removed (by 
ubiquitylation followed by delivery to lysosomes, as discussed in Chapter 13). Because of 
these two processes, the mitochondrial inheritance in both animals and plants is 
uniparental. More precisely, the mitochondrial DNA passes from one generation 
to the next by maternal inheritance. 

In about two-thirds of higher plants, the chloroplast precursors from the male 
parent (contained in pollen grains) fail to enter the zygote, so that chloroplast 
as well as mitochondrial DNA is maternally inherited. In other plants, the 
chloroplast precursors from the pollen grains enter the zygote, making chloroplast 
inheritance biparental. In such plants, defective chloroplasts are a cause of 
variegation: a mixture of normal and defective chloroplasts in a zygote may sort out 
by mitotic segregation during plant growth and development, thereby producing 
alternating green and white patches in leaves. Leaf cells in the green patches 
contain normal chloroplasts, while those in the white patches contain defective 
chloroplasts (Figure 14-68). 

Mutations in Mitochondrial DNA Can Cause Severe Inherited 
Diseases 

In humans, as we have explained, all the mitochondrial DNA in a fertilized egg 
cell is inherited from the mother. Some mothers carry a mixed population of both 
mutant and normal mitochondrial genomes. Their daughters and sons will inherit 
this mixture of normal and mutant mitochondrial DNAs and be healthy unless the 
process of mitotic segregation results in a majority of defective mitochondria in a 
particular tissue. Muscle and the nervous system are most at risk. Because they 
need particularly large amounts of ATP, muscle and nerve cells are particularly 
dependent on fully functional mitochondria. 

Numerous diseases in humans are caused by mutations in mitochondrial 
DNA. These diseases are recognized by their passage from affected mothers to 
both their daughters and their sons, with the daughters but not the sons 
producing children with the disease. As expected from the random nature of mitotic 
segregation, the symptoms of these diseases vary greatly between different family 
members—including not only the severity and age of onset, but also which tissue 
is affected. There are also mitochondrial diseases that are caused by mutations 
in nuclear-encoded mitochondrial proteins; these diseases are inherited in the 
regular, Mendelian fashion. 

Figure 14-68 A variegated leaf. In the white patches, the plant cells have 
inherited a defective chloroplast. (Courtesy of John Innes Foundation.) 

The Accumulation of Mitochondrial DNA Mutations Is a 
Contributor to Aging 

Mitochondria are marvels of efficiency in energy conversion, and they supply the 
cells of our body with a readily available source of energy in the form of ATP. But in 
highly developed, long-lived animals such as ourselves, the cells in our body age 
and eventually die. A factor in this inevitable process is the accumulation of 
deletions and point mutations in mitochondrial DNA. Oxidative damage to the cell by 
reactive oxygen species (ROS) such as H₂O₂, superoxide, or hydroxyl radicals also 
increases with age. The mitochondrial respiratory chain is the main source of ROS 
in animal cells, and animals in which mitochondrial superoxide dismutase—the 
main ROS scavenger—has been knocked out die prematurely. 

The less complex DNA replication and repair systems in mitochondria mean 
that accidents are corrected less efficiently. This results in a 100-fold higher 
occurrence of deletions and point mutations than in nuclear DNA. Mathematical 
modeling suggests that most of these mutations and lesions are acquired in childhood 
or early adult life, and then proliferate by clonal expansion in later life. Due to 
mitotic segregation, some cells will accumulate higher levels of faulty 
mitochondrial DNA than others. Above some threshold, serious deficiencies in 
respiratory-chain function will develop, producing cells that are senescent. In many organs 
of the human body, senescent cells with high levels of mitochondrial DNA 
damage are intermingled with normal cells, resulting in a mosaic of cells with and 
without respiratory-chain deficiency. 

The main role of mitochondrial fusion in cellular physiology is most likely to 
ensure an even distribution of mitochondrial DNA throughout the mitochondrial 
reticulum, and to prevent the accumulation of damaged DNA in one part of the 
network. When the fusion machinery is defective, DNA is lost from a subset of the 
mitochondria in the cell. Loss of mitochondrial DNA leads to a loss of 
respiratory-chain function, and it can cause disease. 

All of the considerations just discussed have suggested to some scientists that 
changes in our mitochondria are major contributors to human aging. However, 
there are many other processes that tend to go wrong as cells and tissues age, as 
one might expect given the incredible complexity of human cell biology. Despite 
intensive research, the issue remains unresolved. 

Why Do Mitochondria and Chloroplasts Maintain a Costly 
Separate System for DNA Transcription and Translation? 

Why do mitochondria and chloroplasts require their own separate genetic 
systems, when other organelles that share the same cytoplasm, such as peroxisomes 
and lysosomes, do not? The question is not trivial, because maintaining a 
separate genetic system is costly: more than 90 proteins—including many ribosomal 
proteins, aminoacyl-tRNA synthetases, DNA polymerase, RNA polymerase, and 
RNA-processing and RNA-modifying enzymes—must be encoded by nuclear 
genes specifically for this purpose. Moreover, as we have seen, the mitochondrial 
genetic system entails the risk of aging and disease. 

A possible reason for maintaining this costly and potentially hazardous 
arrangement is the highly hydrophobic nature of the nonribosomal proteins 
encoded by organelle genes. This may make their production in and import from 
the cytoplasm simply too difficult and energy-consuming. It is also possible that 
the evolution (and eventual elimination) of the organellar genetic systems is still 
ongoing, but for now there is no alternative for the cell than to maintain separate 
genetic systems for its nuclear, mitochondrial, and chloroplast genes. 

Summary 

Mitochondria are organelles that allow eukaryotes to carry out oxidative 
phosphorylation, while chloroplasts are organelles that allow plants to carry out 
photosynthesis. Presumably as a result of their prokaryotic origins, each organelle maintains 
and reproduces itself in a highly coordinated process that requires the 
contribution of two separate genetic systems—one in the organelle and the other in the cell 
nucleus. The vast majority of the proteins in these organelles are encoded by nuclear 
DNA, synthesized in the cytosol, and then imported individually into the 
organelle. Other organelle proteins, as well as organelle ribosomal and transfer RNAs, are 
encoded by the organelle DNA; these are synthesized in the organelle itself. 

The ribosomes of chloroplasts closely resemble bacterial ribosomes, while the 
origin of mitochondrial ribosomes is more difficult to trace. Extensive protein 
similarities, however, suggest that both organelles originated when a primitive eukaryotic 
cell entered into a stable endosymbiotic relationship with a bacterium. Although 
some of the genes of these former bacteria still function to make organelle proteins 
and RNA, most of them have been transferred into the nuclear genome, where they 
encode bacteria-like enzymes that are synthesized on cytosolic ribosomes and then 
imported into the organelle. The mitochondrial DNA replication and DNA repair 
processes are substantially less effective than the corresponding processes in the cell 
nucleus. Damage therefore accumulates in the genome of mitochondria over time; 
this damage may be a substantial contributor to the aging of cells and organisms, 
and it can cause serious diseases. 


WHAT WE DON'T KNOW 


• What structures are needed to form the barriers that separate and maintain 
the differentiated membrane domains in a single continuous membrane—as 
for the cristae and inner boundary membrane in mitochondria? 

• How does a eukaryotic cell regulate the many functions of mitochondria, 
including ATP production? 

• What are the origins and evolutionary history of photosynthetic complexes? 
Are there undiscovered types of photosynthesis present on Earth to 
help answer this question? 

• Why is the mutation rate so much higher in mitochondria than in the 
nucleus (and chloroplasts)? Could this high rate have been useful to the cell? 

• What mechanisms and pathways have been used during evolution to 
transfer genes from the mitochondrion to the nucleus? 


PROBLEMS 

Which statements are true? Explain why or why not. 

14-1 The three respiratory enzyme complexes in the 
mitochondrial inner membrane tend to associate with 
each other in ways that facilitate the correct transfer of 
electrons between appropriate complexes. 

14-2 The number of c subunits in the rotor ring of ATP 
synthase defines how many protons need to pass through 
the turbine to make each molecule of ATP. 

14-3 Mutations that are inherited according to 
Mendelian rules affect nuclear genes; mutations whose 
inheritance violates Mendelian rules are likely to affect organelle 
genes. 

Discuss the following problems. 

14-4 In the 1860s, Louis Pasteur noticed that when he 
added O₂ to a culture of yeast growing anaerobically on 
glucose, the rate of glucose consumption declined 
dramatically. Explain the basis for this result, which is known 
as the Pasteur effect. 

14-5 Heart muscle gets most of the ATP needed to power 
its continual contractions through oxidative 
phosphorylation. When oxidizing glucose to CO₂, heart muscle 
consumes O₂ at a rate of 10 μmol/min per g of tissue, in order 
to replace the ATP used in contraction and give a 
steady-state ATP concentration of 5 μmol/g of tissue. At this rate, 
how many seconds would it take the heart to consume 
an amount of ATP equal to its steady-state levels? 
(Complete oxidation of one molecule of glucose to CO₂ yields 
30 ATP, 26 of which are derived by oxidative 
phosphorylation using the 12 pairs of electrons captured in the electron 
carriers NADH and FADH₂.) 

14-6 Both H⁺ and Ca²⁺ are ions that move through the 
cytosol. Why is the movement of H⁺ ions so much faster 
than that of Ca²⁺ ions? How do you suppose the speed of 
these two ions would be affected by freezing the solution? 
Would you expect them to move faster or slower? Explain 
your answer. 

14-7 If isolated mitochondria are incubated with a 
source of electrons such as succinate, but without oxygen, 
electrons enter the respiratory chain, reducing each of the 
electron carriers almost completely. When oxygen is then 
introduced, the carriers become oxidized at different rates 
(Figure Q14-1). How does this result allow you to order 
the electron carriers in the respiratory chain? What is their 
order? 

Figure Q14-1 Rapid spectrophotometric analysis of the rates of 
oxidation of electron carriers in the respiratory chain (Problem 14-7). 
Cytochromes a and a₃ cannot be distinguished and thus are listed as 
cytochrome (a + a₃). 

14-8 Normally, the flow of electrons to O₂ is tightly 
linked to the production of ATP via the electrochemical 
gradient. If ATP synthase is inhibited, for example, 
electrons do not flow down the electron-transport chain and 
respiration ceases. Since the 1940s, several substances—
such as 2,4-dinitrophenol—have been known to 
uncouple electron flow from ATP synthesis. Dinitrophenol was 
once prescribed as a diet drug to aid in weight loss. How 
would an uncoupler of oxidative phosphorylation 
promote weight loss? Why do you suppose dinitrophenol is 
no longer prescribed? 

14-9 In actively respiring liver mitochondria, the pH in 
the matrix is about half a pH unit higher than it is in the 
cytosol. Assuming that the cytosol is at pH 7 and the matrix 
is a sphere with a diameter of 1 μm [V = (4/3)πr³], 
calculate the total number of protons in the matrix of a respiring 
liver mitochondrion. If the matrix began at pH 7 (equal to 
that in the cytosol), how many protons would have to be 
pumped out to establish a matrix pH of 7.5 (a difference of 
0.5 pH units)? 

14-10 ATP synthase is the world's smallest rotary motor. 
Passage of H⁺ ions through the membrane-embedded 
portion of ATP synthase (the F₀ component) causes 
rotation of the single, central, axle-like γ subunit inside the 
head group. The tripartite head is composed of the three 
αβ dimers, the β subunit of which is responsible for 
synthesis of ATP. The rotation of the γ subunit induces 
conformational changes in the αβ dimers that allow ADP and 
Pi to be converted into ATP. A variety of indirect evidence 
had suggested rotary catalysis by ATP synthase, but seeing 
is believing. 

To demonstrate rotary motion, a modified form of 
the α₃β₃γ complex was used. The β subunits were modified 
so they could be firmly anchored to a solid support and the 
γ subunit was modified (on the end that normally inserts 
into the F₀ component in the inner membrane) so that a 
fluorescently tagged, readily visible filament of actin could 
be attached (Figure Q14-2A). This arrangement allows 
rotations of the γ subunit to be visualized as revolutions 
of the long actin filament. In these experiments, ATP 
synthase was studied in the reverse of its normal mechanism 
by allowing it to hydrolyze ATP. At low ATP concentrations, 
the actin filament was observed to revolve in steps of 120° 
and then pause for variable lengths of time, as shown in 
Figure Q14-2B. 

A. Why does the actin filament revolve in steps with 
pauses in between? What does this rotation correspond to 
in terms of the structure of the α₃β₃γ complex? 

B. In its normal mode of operation inside the cell, 
how many ATP molecules do you suppose would be 
synthesized for each complete 360° rotation of the γ subunit? 
Explain your answer. 

Figure Q14-2 Experimental set-up for observing rotation of the 
γ subunit of ATP synthase (Problem 14-10). (A) The immobilized 
α₃β₃γ complex. The β subunits are anchored to a solid support and 
a fluorescent actin filament is attached to the γ subunit. (B) Stepwise 
revolution of the actin filament. The indicated trace is a typical example 
from one experiment. The inset shows the positions in the revolution 
at which the actin filament pauses. (B, from R. Yasuda et al., Cell 
93:1117-1124, 1998. With permission from Elsevier.) 

14-11 How much energy is available in visible light? How 
much energy does sunlight deliver to Earth? How efficient 
are plants at converting light energy into chemical energy? 
The answers to these questions provide an important 
backdrop to the subject of photosynthesis. 

Each quantum or photon of light has energy hν, 
where h is Planck's constant (6.6 × 10⁻³⁷ kJ sec/photon) 
and ν is the frequency in sec⁻¹. The frequency of light is 
equal to c/λ, where c is the speed of light (3.0 × 10¹⁷ nm/sec) 
and λ is the wavelength in nm. Thus, the energy (E) of 
a photon is 

E = hν = hc/λ 
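
As a quick check on the units in this formula, the Python sketch below evaluates it with the constants quoted in the problem. It is only an illustration: the function names are hypothetical, and the 550 nm wavelength (green light) is an arbitrary example chosen so that it is not one of the wavelengths asked about in the problem.

```python
# Constants as quoted in the problem statement.
PLANCK_KJ_SEC = 6.6e-37           # h, in kJ sec/photon
LIGHT_SPEED_NM_PER_SEC = 3.0e17   # c, in nm/sec
PHOTONS_PER_MOLE = 6e23           # photons per mole


def photon_energy_kj(wavelength_nm):
    """Energy of a single photon, E = hc/λ, in kJ."""
    return PLANCK_KJ_SEC * LIGHT_SPEED_NM_PER_SEC / wavelength_nm


def mole_energy_kj(wavelength_nm):
    """Energy of one mole of photons of the given wavelength, in kJ/mole."""
    return photon_energy_kj(wavelength_nm) * PHOTONS_PER_MOLE


# Example wavelength only (an assumption, not part of the problem).
print(f"{mole_energy_kj(550):.0f} kJ/mole at 550 nm")
```

With these constants, a mole of 550 nm photons carries about 216 kJ. Because the energy is inversely proportional to the wavelength, shorter wavelengths carry more energy per photon.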

A. Calculate the energy of a mole of photons (6 × 10²³ 
photons/mole) at 400 nm (violet light), at 680 nm (red 
light), and at 800 nm (near-infrared light). 

B. Bright sunlight strikes Earth at the rate of about 1.3 
kJ/sec per square meter. Assuming for the sake of 
calculation that sunlight consists of monochromatic light of 
wavelength 680 nm, how many seconds would it take for a mole 
of photons to strike a square meter? 

C. Assuming that it takes eight photons to fix one 
molecule of CO₂ as carbohydrate under optimal 
conditions (8-10 photons is the currently accepted value), 
calculate how long it would take a tomato plant with a leaf 
area of 1 square meter to make a mole of glucose from CO₂. 
Assume that photons strike the leaf at the rate calculated 
above and, furthermore, that all the photons are absorbed 
and used to fix CO₂. 

D. If it takes 468 kJ/mole to fix a mole of CO₂ into 
carbohydrate, what is the efficiency of conversion of light 
energy into chemical energy after photon capture? Assume 
again that eight photons of red light (680 nm) are required 
to fix one molecule of CO₂. 

14-12 In chloroplasts, protons are pumped out of the 
stroma across the thylakoid membrane, whereas in 
mitochondria, they are pumped out of the matrix across the 
crista membrane. Explain how this arrangement allows 
chloroplasts to generate a larger proton gradient across 
the thylakoid membrane than mitochondria can generate 
across the inner membrane. 

14-13 Examine the variegated leaf shown in Figure Q14-3. 
Yellow patches surrounded by green are common, but 
there are no green patches surrounded by yellow. Propose 
an explanation for this phenomenon. 



Figure Q14-3 A variegated leaf of Aucuba japonica with green and 
yellow patches (Problem 14-13). 

When things change, cells respond. Every cell, from the humble bacterium to 
the most sophisticated eukaryotic cell, monitors its intracellular and 
extracellular environment, processes the information it gathers, and responds accordingly. 
Unicellular organisms, for example, modify their behavior in response to changes 
in environmental nutrients or toxins. The cells of multicellular organisms detect 
and respond to countless internal and extracellular signals that control their 
growth, division, and differentiation during development, as well as their 
behavior in adult tissues. At the heart of all these communication systems are regulatory 
proteins that produce chemical signals, which are sent from one place to another 
in the body or within a cell, usually being processed along the way and integrated 
with other signals to provide clear and effective communication. 

The study of cell signaling has traditionally focused on the mechanisms by 
which eukaryotic cells communicate with each other using extracellular signal 
molecules such as hormones and growth factors. In this chapter, we describe the 
features of some of these cell-cell communication systems, and we use them to 
illustrate the general principles by which any regulatory system, inside or outside 
the cell, is able to generate, process, and respond to signals. Our main focus is on 
animal cells, but we end by considering the special features of cell signaling in 
plants. 

PRINCIPLES OF CELL SIGNALING 

Long before multicellular creatures roamed the Earth, unicellular organisms 
had developed mechanisms for responding to physical and chemical changes in 
their environment. These almost certainly included mechanisms for responding 
to the presence of other cells. Evidence comes from studies of present-day 
unicellular organisms such as bacteria and yeasts. Although these cells lead mostly 
independent lives, they can communicate and influence one another's behavior. 
Many bacteria, for example, respond to chemical signals that are secreted by their 
neighbors and accumulate at higher population density. This process, called 
quorum sensing, allows bacteria to coordinate their behavior, including their motility, 
antibiotic production, spore formation, and sexual conjugation. Similarly, yeast 
cells communicate with one another in preparation for mating. The budding yeast 
Saccharomyces cerevisiae provides a well-studied example: when a haploid 
individual is ready to mate, it secretes a peptide mating factor that signals cells of the 
opposite mating type to stop proliferating and prepare to mate. The subsequent 
fusion of two haploid cells of opposite mating type produces a diploid zygote. 

Intercellular communication achieved an astonishing level of complexity 
during the evolution of multicellular organisms. These organisms are tight-knit 
societies of cells, in which the well-being of the individual cell is often set aside for the 
benefit of the organism as a whole. Complex systems of intercellular 
communication have evolved to allow the collaboration and coordination of different tissues 
and cell types. Bewildering arrays of signaling systems govern every conceivable 
feature of cell and tissue function during development and in the adult. 

Communication between cells in multicellular organisms is mediated mainly 
by extracellular signal molecules. Some of these operate over long distances, 
signaling to cells far away; others signal only to immediate neighbors. Most cells 
in multicellular organisms both emit and receive signals. Reception of the signals 
depends on receptor proteins, usually (but not always) at the cell surface, which 
bind the signal molecule. The binding activates the receptor, which in turn 
activates one or more intracellular signaling pathways or systems. These systems 
depend on intracellular signaling proteins, which process the signal inside the 
receiving cell and distribute it to the appropriate intracellular targets. The 
targets that lie at the end of signaling pathways are generally called effector proteins, 
which are altered in some way by the incoming signal and implement the 
appropriate change in cell behavior. Depending on the signal and the type and state of 
the receiving cell, these effectors can be transcription regulators, ion channels, 
components of a metabolic pathway, or parts of the cytoskeleton (Figure 15-1). 

Figure 15-1 A simple intracellular signaling pathway activated by an 
extracellular signal molecule. The signal molecule usually binds to a receptor 
protein that is embedded in the plasma membrane of the target cell. The receptor 
activates one or more intracellular signaling pathways, involving a series of 
signaling proteins. Finally, one or more of the intracellular signaling proteins 
alters the activity of effector proteins and thereby the behavior of the cell. 

The fundamental features of cell signaling have been conserved throughout the
evolution of the eukaryotes. In budding yeast, for example, the response to mating
factor depends on cell-surface receptor proteins, intracellular GTP-binding
proteins, and protein kinases that are clearly related to functionally similar proteins
in animal cells. Through gene duplication and divergence, however, the signaling
systems in animals have become much more elaborate than those in yeasts; the
human genome, for example, contains more than 1500 genes that encode receptor
proteins, and the number of different receptor proteins is further increased by
alternative RNA splicing and post-translational modifications.

Extracellular Signals Can Act Over Short or Long Distances

Many extracellular signal molecules remain bound to the surface of the signaling
cell and influence only cells that contact it (Figure 15-2A). Such contact-dependent
signaling is especially important during development and in immune
responses. Contact-dependent signaling during development can sometimes
operate over relatively large distances if the communicating cells extend long thin
processes to make contact with one another.

Figure 15-2 Four forms of intercellular signaling. (A) Contact-dependent signaling
requires cells to be in direct membrane-membrane contact. (B) Paracrine signaling
depends on local mediators that are released into the extracellular space and
act on neighboring cells. (C) Synaptic signaling is performed by neurons that
transmit signals electrically along their axons and release neurotransmitters at
synapses, which are often located far away from the neuronal cell body.
(D) Endocrine signaling depends on endocrine cells, which secrete hormones
into the bloodstream for distribution throughout the body. Many of the same
types of signaling molecules are used in paracrine, synaptic, and endocrine
signaling; the crucial differences lie in the speed and selectivity with which the
signals are delivered to their targets.


In most cases, however, signaling cells secrete signal molecules into the
extracellular fluid. Often, the secreted molecules are local mediators, which act only
on cells in the local environment of the signaling cell. This is called paracrine
signaling (Figure 15-2B). Usually, the signaling and target cells in paracrine
signaling are of different cell types, but cells may also produce signals that they
themselves respond to: this is referred to as autocrine signaling. Cancer cells, for
example, often produce extracellular signals that stimulate their own survival and
proliferation.

Large multicellular organisms like us need long-range signaling mechanisms
to coordinate the behavior of cells in remote parts of the body. Thus, they have
evolved cell types specialized for intercellular communication over large
distances. The most sophisticated of these are nerve cells, or neurons, which typically
extend long, branching processes (axons) that enable them to contact target cells
far away, where the processes terminate at the specialized sites of signal
transmission known as chemical synapses. When a neuron is activated by stimuli from
other nerve cells, it sends electrical impulses (action potentials) rapidly along its
axon; when the impulse reaches the synapse at the end of the axon, it triggers
secretion of a chemical signal that acts as a neurotransmitter. The tightly
organized structure of the synapse ensures that the neurotransmitter is delivered
specifically to receptors on the postsynaptic target cell (Figure 15-2C). The details of
this synaptic signaling process are discussed in Chapter 11.

A quite different strategy for signaling over long distances makes use of
endocrine cells, which secrete their signal molecules, called hormones, into the
bloodstream. The blood carries the molecules far and wide, allowing them to act
on target cells that may lie anywhere in the body (Figure 15-2D).

Extracellular Signal Molecules Bind to Specific Receptors

Cells in multicellular animals communicate by means of hundreds of kinds of
extracellular signal molecules. These include proteins, small peptides, amino
acids, nucleotides, steroids, retinoids, fatty acid derivatives, and even dissolved
gases such as nitric oxide and carbon monoxide. Most of these signal molecules
are released into the extracellular space by exocytosis from the signaling cell, as
discussed in Chapter 13. Some, however, are emitted by diffusion through the
signaling cell's plasma membrane, whereas others are displayed on the external
surface of the cell and remain attached to it, providing a signal to other cells only
when they make contact. Transmembrane signal proteins may operate in this
way, or their extracellular domains may be released from the signaling cell's
surface by proteolytic cleavage and then act at a distance.

Regardless of the nature of the signal, the target cell responds by means of a
receptor, which binds the signal molecule and then initiates a response in the
target cell. The binding site of the receptor has a complex structure that is shaped
to recognize the signal molecule with high specificity, helping to ensure that the
receptor responds only to the appropriate signal and not to the many other
signaling molecules surrounding the cell. Many signal molecules act at very low
concentrations (typically < 10⁻⁸ M), and their receptors usually bind them with high
affinity (dissociation constant Kd < 10⁻⁸ M; see Figure 3-44).
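
To make the relationship between ligand concentration and dissociation constant concrete, the following minimal sketch computes equilibrium receptor occupancy. It assumes simple one-to-one binding with no cooperativity, which the text does not specify; the function name `fractional_occupancy` and the value chosen for `KD` are illustrative only.

```python
def fractional_occupancy(ligand_molar: float, kd_molar: float) -> float:
    """Equilibrium fraction of receptors occupied, assuming simple 1:1 binding."""
    if ligand_molar < 0 or kd_molar <= 0:
        raise ValueError("ligand concentration must be >= 0 and Kd must be > 0")
    # Standard single-site binding isotherm: [L] / ([L] + Kd)
    return ligand_molar / (ligand_molar + kd_molar)


KD = 1e-8  # illustrative dissociation constant in mol/L (assumed, not from the text)

for ligand in (1e-10, 1e-9, 1e-8, 1e-7):
    occupancy = fractional_occupancy(ligand, KD)
    print(f"[L] = {ligand:.0e} M -> occupancy = {occupancy:.3f}")
```

Under this simplified model, half of the receptors are occupied when the ligand concentration equals Kd, so a receptor with a Kd near 10⁻⁸ M responds appreciably to signals in the nanomolar range.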

In most cases, receptors are transmembrane proteins on the target-cell
surface. When these proteins bind an extracellular signal molecule (a ligand), they
become activated and generate various intracellular signals that alter the
behavior of the cell. In other cases, the receptor proteins are inside the target cell, and
the signal molecule has to enter the cell to bind to them: this requires that the
signal molecule be sufficiently small and hydrophobic to diffuse across the target
cell's plasma membrane (Figure 15-3). This chapter focuses primarily on
signaling through cell-surface receptors, but we will briefly describe signaling through
intracellular receptors later in the chapter.

Each Cell Is Programmed to Respond to Specific Combinations of
Extracellular Signals

A typical cell in a multicellular organism is exposed to hundreds of different signal
molecules in its environment. The molecules can be soluble, bound to the
extracellular matrix, or bound to the surface of a neighboring cell; they can be
stimulatory or inhibitory; they can act in innumerable different combinations; and they
can influence almost any aspect of cell behavior. The cell responds to this blizzard
of signals selectively, in large part by expressing only those receptors and
intracellular signaling systems that respond to the signals that are required for the
regulation of that cell.

Most cells respond to many different signals in the environment, and some
of these signals may influence the response to other signals. One of the key
challenges in cell biology is to determine how a cell integrates all of this signaling
information in order to make decisions: to divide, to move, to differentiate, and
so on. Many cells, for example, require a specific combination of extracellular
survival factors to allow the cell to continue living; when deprived of these signals,
the cell activates a suicide program and kills itself, usually by apoptosis, a form
of programmed cell death, as discussed in Chapter 18. Cell proliferation often
depends on a combination of signals that promote both cell division and survival,
as well as signals that stimulate cell growth (Figure 15-4). On the other hand,
differentiation into a nondividing state (called terminal differentiation) frequently
requires a different combination of survival and differentiation signals that must
override any signal to divide.

In principle, the hundreds of signal molecules that an animal makes can
be used in an almost unlimited number of combinations to control the diverse
behaviors of its cells in highly specific ways. Relatively small numbers of types of
signal molecules and receptors are sufficient. The complexity lies in the ways in
which cells respond to the combinations of signals that they receive.
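
The combinatorial logic described in this section can be caricatured as a small decision function. This is a deliberately simplified sketch, not a model of any real cell: the signal names, the all-or-nothing treatment of each signal, and the function `cell_fate` are hypothetical, and real cells integrate graded and inhibitory signals as well.

```python
from typing import AbstractSet


def cell_fate(
    received: AbstractSet[str],
    survival: AbstractSet[str],
    division: AbstractSet[str],
    differentiation: AbstractSet[str],
) -> str:
    """Toy decision rule for one cell type given the set of signals it receives."""
    # Without the full set of survival signals, the cell undergoes apoptosis.
    if not survival <= received:
        return "apoptosis"
    # Differentiation signals override any signal to divide.
    if differentiation and differentiation <= received:
        return "differentiate"
    # Growth and division signals drive proliferation.
    if division and division <= received:
        return "grow and divide"
    return "survive"


# Hypothetical requirements for one cell type (names are placeholders).
SURVIVAL = frozenset({"A", "B", "C"})
DIVISION = frozenset({"D", "E"})
DIFFERENTIATION = frozenset({"F", "G"})

print(cell_fate({"A", "B", "C"}, SURVIVAL, DIVISION, DIFFERENTIATION))
print(cell_fate({"A", "B", "C", "D", "E"}, SURVIVAL, DIVISION, DIFFERENTIATION))
print(cell_fate({"A", "B", "C", "D", "E", "F", "G"}, SURVIVAL, DIVISION, DIFFERENTIATION))
print(cell_fate({"A", "D", "E"}, SURVIVAL, DIVISION, DIFFERENTIATION))
```

The point of the sketch is that a modest number of signals yields many distinct outcomes once each cell type is given its own requirement sets.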

A signal molecule often has different effects on different types of target cells.
The neurotransmitter acetylcholine (Figure 15-5A), for example, decreases the
rate of action potential firing in heart pacemaker cells (Figure 15-5B) and
stimulates the production of saliva by salivary gland cells (Figure 15-5C), even though
the receptors are the same on both cell types. In skeletal muscle, acetylcholine
causes the cells to contract by binding to a different receptor protein (Figure
15-5D). The different effects of acetylcholine in these cell types result from
differences in the intracellular signaling proteins, effector proteins, and genes that
are activated. Thus, an extracellular signal itself has little information content; it
simply induces the cell to respond according to its predetermined state, which
depends on the cell's developmental history and the specific genes it expresses.

Figure 15-3 The binding of extracellular signal molecules to either cell-surface
or intracellular receptors. (A) Most signal molecules are hydrophilic and are therefore
unable to cross the target cell's plasma membrane directly; instead, they bind to
cell-surface receptors, which in turn generate signals inside the target cell (see Figure
15-1). (B) Some small signal molecules, by contrast, diffuse across the plasma
membrane and bind to receptor proteins inside the target cell, either in the cytosol
or in the nucleus (as shown here). Many of these small signal molecules are hydrophobic
and poorly soluble in aqueous solutions; they are therefore transported in the bloodstream
and other extracellular fluids bound to carrier proteins, from which they dissociate before
entering the target cell.

Figure 15-4 An animal cell's dependence on multiple extracellular
signal molecules. Each cell type displays a set of receptors that enables it to respond
to a corresponding set of signal molecules produced by other cells. These signal
molecules work in various combinations to regulate the behavior of the cell. As
shown here, an individual cell often requires multiple signals to survive (blue arrows)
and additional signals to grow and divide (red arrows) or differentiate (green arrows).
If deprived of appropriate survival signals, a cell will undergo a form of cell suicide
known as apoptosis. The actual situation is even more complex. Although not shown,
some extracellular signal molecules act to inhibit these and other cell behaviors, or
even to induce apoptosis.


[Figure 15-5 panels: (A) acetylcholine, chemical structure (CH3)3N+-CH2-CH2-O-CO-CH3; (B) heart pacemaker cell; (C) salivary gland cell; (D) skeletal muscle cell]

Figure 15-5 Various responses induced by the neurotransmitter acetylcholine. (A) The chemical structure of acetylcholine. (B-D) Different
cell types are specialized to respond to acetylcholine in different ways. In some cases (B and C), acetylcholine binds to similar receptor proteins
(G-protein-coupled receptors; see Figure 15-6), but the intracellular signals produced are interpreted differently in cells specialized for different
functions. In other cases (D), the receptor protein is also different (here, an ion-channel-coupled receptor; see Figure 15-6).

There Are Three Major Classes of Cell-Surface Receptor Proteins

Most extracellular signal molecules bind to specific receptor proteins on the
surface of the target cells they influence and do not enter the cytosol or nucleus.
These cell-surface receptors act as signal transducers by converting an
extracellular ligand-binding event into intracellular signals that alter the behavior of the
target cell.

Most cell-surface receptor proteins belong to one of three classes, defined by
their transduction mechanism. Ion-channel-coupled receptors, also known as
transmitter-gated ion channels or ionotropic receptors, are involved in rapid
synaptic signaling between nerve cells and other electrically excitable target cells such
as nerve and muscle cells (Figure 15-6A). This type of signaling is mediated by a
small number of neurotransmitters that transiently open or close an ion channel
formed by the protein to which they bind, briefly changing the ion permeability
of the plasma membrane and thereby changing the excitability of the
postsynaptic target cell. Most ion-channel-coupled receptors belong to a large family of
homologous, multipass transmembrane proteins. Because they are discussed in
detail in Chapter 11, we will not consider them further here.

G-protein-coupled receptors act by indirectly regulating the activity of a
separate plasma-membrane-bound target protein, which is generally either an
enzyme or an ion channel. A trimeric GTP-binding protein (G protein) mediates
the interaction between the activated receptor and this target protein (Figure
15-6B). The activation of the target protein can change the concentration of one or
more small intracellular signaling molecules (if the target protein is an enzyme),
or it can change the ion permeability of the plasma membrane (if the target
protein is an ion channel). The small intracellular signaling molecules act in turn to
alter the behavior of yet other signaling proteins in the cell.

Figure 15-6 Three classes of cell-surface receptors. (A) Ion-channel-coupled
receptors (also called transmitter-gated ion channels),
(B) G-protein-coupled receptors, and (C) enzyme-coupled receptors.
Although many enzyme-coupled receptors have intrinsic enzymatic
activity, as shown on the left in (C), many others rely on associated
enzymes, as shown on the right in (C). Ligands activate most enzyme-coupled
receptors by promoting their dimerization, which results in the
interaction and activation of the cytoplasmic domains.

Enzyme-coupled receptors either function as enzymes or associate directly
with enzymes that they activate (Figure 15-6C). They are usually single-pass
transmembrane proteins that have their ligand-binding site outside the cell and
their catalytic or enzyme-binding site inside. Enzyme-coupled receptors are
heterogeneous in structure compared with the other two classes; the great majority,
however, are either protein kinases or associate with protein kinases, which
phosphorylate specific sets of proteins in the target cell when activated.

There are also some types of cell-surface receptors that do not fit easily into
any of these classes but have important functions in controlling the
specialization of different cell types during development and in tissue renewal and repair in
adults. We discuss these in a later section, after we explain how G-protein-coupled
receptors and enzyme-coupled receptors operate. First, we continue our general
discussion of the principles of signaling via cell-surface receptors.

Cell-Surface Receptors Relay Signals Via Intracellular Signaling
Molecules

Numerous intracellular signaling molecules relay signals received by cell-surface
receptors into the cell interior. The resulting chain of intracellular signaling events
ultimately alters effector proteins that are responsible for modifying the behavior
of the cell (see Figure 15-1).

Some intracellular signaling molecules are small chemicals, which are often
called second messengers (the "first messengers" being the extracellular signals).
They are generated in large amounts in response to receptor activation and diffuse
away from their source, spreading the signal to other parts of the cell. Some, such
as cyclic AMP and Ca²⁺, are water-soluble and diffuse in the cytosol, while
others, such as diacylglycerol, are lipid-soluble and diffuse in the plane of the plasma
membrane. In either case, they pass the signal on by binding to and altering the
behavior of selected signaling or effector proteins.

Most intracellular signaling molecules are proteins, which help relay the
signal into the cell by either generating second messengers or activating the next
signaling or effector protein in the pathway. Many of these proteins behave like
molecular switches. When they receive a signal, they switch from an inactive to an
active state, until another process switches them off, returning them to their
inactive state. The switching off is just as important as the switching on. If a signaling
pathway is to recover after transmitting a signal so that it can be ready to transmit
another, every activated molecule in the pathway must return to its original,
unactivated state.

The largest class of molecular switches consists of proteins that are activated
or inactivated by phosphorylation (discussed in Chapter 3). For these proteins,
the switch is thrown in one direction by a protein kinase, which covalently adds
one or more phosphate groups to specific amino acids on the signaling protein,
and in the other direction by a protein phosphatase, which removes the
phosphate groups (Figure 15-7A). The activity of any protein regulated by
phosphorylation depends on the balance between the activities of the kinases that
phosphorylate it and of the phosphatases that dephosphorylate it. About 30-50% of
human proteins contain covalently attached phosphate, and the human genome
encodes about 520 protein kinases and about 150 protein phosphatases. A typical
mammalian cell makes use of hundreds of distinct types of protein kinases at any
moment.
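
The balance between kinase and phosphatase activity can be illustrated with a minimal kinetic sketch. It assumes a single phosphorylation site and first-order (unsaturated) kinetics for both enzymes, neither of which is stated in the text; the rate constants and function names are hypothetical.

```python
def steady_state_fraction(k_kinase: float, k_phosphatase: float) -> float:
    """Steady-state phosphorylated fraction for dP/dt = k_kin*(1 - P) - k_phos*P."""
    if k_kinase < 0 or k_phosphatase < 0 or (k_kinase + k_phosphatase) == 0:
        raise ValueError("rates must be non-negative and not both zero")
    return k_kinase / (k_kinase + k_phosphatase)


def simulate_fraction(k_kinase: float, k_phosphatase: float,
                      dt: float = 0.001, steps: int = 10_000) -> float:
    """Forward-Euler integration of the same model, starting fully dephosphorylated."""
    p = 0.0
    for _ in range(steps):
        # Kinase acts on the unphosphorylated pool, phosphatase on the phosphorylated pool.
        p += dt * (k_kinase * (1.0 - p) - k_phosphatase * p)
    return p


# Illustrative rate constants in arbitrary units (assumed for the example).
print(steady_state_fraction(3.0, 1.0))
print(round(simulate_fraction(3.0, 1.0), 6))
```

In this toy model the phosphorylated fraction depends only on the ratio of the two activities, so raising phosphatase activity lowers the signal just as effectively as lowering kinase activity.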

Protein kinases attach phosphate to the hydroxyl group of specific amino
acids on the target protein. There are two main types of protein kinase. The great
majority are serine/threonine kinases, which phosphorylate the hydroxyl groups
of serines and threonines in their targets. Others are tyrosine kinases, which
phosphorylate proteins on tyrosines. The two types of protein kinase are closely
related members of a large family, differing primarily in the structure of their
protein substrate binding sites.

Figure 15-7 Two types of intracellular signaling proteins that act as
molecular switches. (A) A protein kinase covalently adds a phosphate from ATP
to the signaling protein, and a protein phosphatase removes the phosphate.
Although not shown, many signaling proteins are activated by dephosphorylation
rather than by phosphorylation. (B) A GTP-binding protein is induced to exchange its
bound GDP for GTP, which activates the protein; the protein then inactivates itself by
hydrolyzing its bound GTP to GDP.

Many intracellular signaling proteins controlled by phosphorylation are
themselves protein kinases, and these are often organized into kinase cascades. In such
a cascade, one protein kinase, activated by phosphorylation, phosphorylates the
next protein kinase in the sequence, and so on, relaying the signal onward and, in
some cases, amplifying it or spreading it to other signaling pathways.

The other important class of molecular switches consists of GTP-binding
proteins (discussed in Chapter 3). These proteins switch between an "on" (actively
signaling) state when GTP is bound and an "off" state when GDP is bound. In the
"on" state, they usually have intrinsic GTPase activity and shut themselves off by
hydrolyzing their bound GTP to GDP (Figure 15-7B). There are two major types
of GTP-binding proteins. Large, trimeric GTP-binding proteins (also called
G proteins) help relay signals from G-protein-coupled receptors that activate them (see
Figure 15-6B). Small monomeric GTPases (also called monomeric GTP-binding
proteins) help relay signals from many classes of cell-surface receptors.

Specific regulatory proteins control both types of GTP-binding proteins.
GTPase-activating proteins (GAPs) drive the proteins into an "off" state by
increasing the rate of hydrolysis of bound GTP. Conversely, guanine nucleotide
exchange factors (GEFs) activate GTP-binding proteins by promoting the release
of bound GDP, which allows a new GTP to bind. In the case of trimeric G proteins,
the activated receptor serves as the GEF. Figure 15-8 illustrates the regulation of
monomeric GTPases.

Not all molecular switches in signaling systems depend on phosphorylation
or GTP binding. We see later that some signaling proteins are switched on or off
by the binding of another signaling protein or a second messenger such as cyclic
AMP or Ca²⁺, or by covalent modifications other than phosphorylation or
dephosphorylation, such as ubiquitylation (discussed in Chapter 3).

For simplicity, we often portray a signaling pathway as a series of activation
steps (see Figure 15-1). It is important to note, however, that most signaling
pathways contain inhibitory steps, and a sequence of two inhibitory steps can have the
same effect as one activating step (Figure 15-9). This double-negative activation
is very common in signaling systems, as we will see when we describe specific
pathways later in this chapter.
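
The double-negative logic can be written out as two Boolean inversions. The sketch below follows the scheme of Figure 15-9 (a kinase inactivates an inhibitor, which otherwise holds a transcription regulator inactive); treating each component as simply on or off is an assumption made for illustration, and the function name is hypothetical.

```python
def gene_expression_on(kinase_active: bool) -> bool:
    """Two sequential inhibitory steps behave like a single activating step."""
    # Step 1 (inhibitory): the active kinase phosphorylates and removes the inhibitor.
    inhibitor_bound = not kinase_active
    # Step 2 (inhibitory): the bound inhibitor keeps the transcription regulator off.
    regulator_active = not inhibitor_bound
    return regulator_active


for kinase_state in (False, True):
    print(f"kinase active: {kinase_state} -> gene expression: {gene_expression_on(kinase_state)}")
```

The output simply tracks the input, which is the sense in which two inhibitory steps in series are equivalent to one activating step.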

Intracellular Signals Must Be Specific and Precise in a Noisy
Cytoplasm

Ideally, an activated intracellular signaling molecule should interact only with
the appropriate downstream targets, and, likewise, the targets should only be
activated by the appropriate upstream signal. In reality, however, intracellular
signaling molecules share the cytoplasm with a crowd of closely related signaling
molecules that control a diverse array of cellular processes. It is inevitable that an
occasional signaling molecule will bind or modify the wrong partner, potentially
creating unwanted cross-talk and interference between signaling systems. How
does a signal remain strong, precise, and specific under these noisy conditions?

Figure 15-8 The regulation of a monomeric GTPase. GTPase-activating
proteins (GAPs) inactivate the protein by stimulating it to hydrolyze its bound GTP
to GDP, which remains tightly bound to the inactivated GTPase. Guanine nucleotide
exchange factors (GEFs) activate the inactive protein by stimulating it to release
its GDP; because the concentration of GTP in the cytosol is 10 times greater than the
concentration of GDP, the protein rapidly binds GTP and is thereby activated.
[Diagram labels: inactive monomeric GTPase; active monomeric GTPase]

The first line of defense comes from the high affinity and specificity of the
interactions between signaling molecules and their correct partners compared to
the relatively low affinity of the interactions between inappropriate partners. The
binding of a signaling molecule to the correct target is determined by precise and
complex interactions between complementary surfaces on the two molecules.
Protein kinases, for example, contain active sites that recognize a specific amino
acid sequence around the phosphorylation site on the correct target protein, and
they often contain additional docking sites that promote a specific, high-affinity
interaction with the target. These and related mechanisms help provide a strong
and persistent interaction between the correct partners, reducing the likelihood
of inappropriate interactions with other proteins.

Another important way that cells avoid responses to unwanted background
signals depends on the ability of many downstream target proteins to
simply ignore such signals. These proteins respond only when the upstream signal
reaches a high concentration or activity level. Consider a signaling pathway in
which a protein kinase activates some downstream target protein by
phosphorylation. If a response is triggered only when more than half of the target proteins are
phosphorylated, then there will be little harm done if a small number of them are
occasionally phosphorylated by some inappropriate protein kinase. Furthermore,
constitutively active protein phosphatases will further reduce the impact of
background phosphorylation by rapidly removing much of it. In these and other ways,
intracellular signaling systems filter out noise, generating little or no response to
low levels of stimuli.

Figure 15-9 A sequence of two inhibitory signals produces a positive signal.
(A) In this simple signaling system, a transcription regulator is kept in an
inactive state by a bound inhibitor protein. In response to some upstream
signal, a protein kinase is activated and phosphorylates the inhibitor, causing its
dissociation from the transcription regulator and thereby activating gene expression.
(B) This signaling pathway can be diagrammed as a sequence of four steps,
including two sequential inhibitory steps that are equivalent to a single activating
step.

Cells in a population often exhibit random variation in the concentration or
activity of their intracellular signaling molecules. Similarly, individual molecules
in a large population of molecules vary in their activity or interactions with other
molecules. This signal variability introduces another form of noise that can
interfere with the precision and efficiency of signaling. Most signaling systems,
however, are built to generate remarkably robust and precise responses even when
upstream signals are variable or even when some components of the system
are disabled. In many cases, this robustness depends on the presence of backup
mechanisms: for example, a signal might employ two parallel pathways to
activate a single common downstream target protein, allowing the response to occur
even if one pathway is crippled.
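
Two of the noise-handling ideas in this section, a response threshold and parallel backup pathways, can be sketched as follows. The "more than half" threshold comes from the example in the text; the molecule counts, the strictly Boolean treatment of pathway function, and the function names are assumptions made for illustration.

```python
def response_triggered(n_phosphorylated: int, n_total: int,
                       threshold: float = 0.5) -> bool:
    """Respond only when more than `threshold` of the target proteins are phosphorylated."""
    if n_total <= 0 or not 0 <= n_phosphorylated <= n_total:
        raise ValueError("counts must satisfy 0 <= n_phosphorylated <= n_total, n_total > 0")
    return n_phosphorylated / n_total > threshold


def target_activated(pathway_a_works: bool, pathway_b_works: bool) -> bool:
    """Two parallel pathways converge on one target; either one is sufficient."""
    return pathway_a_works or pathway_b_works


TOTAL_TARGETS = 1000  # hypothetical number of target protein molecules

# Stray phosphorylation by an inappropriate kinase stays below the threshold.
print(response_triggered(30, TOTAL_TARGETS))
# A genuine upstream signal phosphorylates most of the targets.
print(response_triggered(800, TOTAL_TARGETS))
# The response survives the loss of one of the two parallel pathways.
print(target_activated(pathway_a_works=False, pathway_b_works=True))
```

The threshold makes the system insensitive to low-level background activity, while the redundancy makes it tolerant of a disabled component.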

Intracellular Signaling Complexes Form at Activated Receptors 

One simple and effective strategy for enhancing the specificity of interactions
between signaling molecules is to localize them in the same part of the cell or
even within large protein complexes, thereby ensuring that they interact only with
each other and not with inappropriate partners. Such mechanisms often involve
scaffold proteins, which bring together groups of interacting signaling proteins
into signaling complexes, often before a signal has been received (Figure 15-10A).
Because the scaffold holds the proteins in close proximity, they can interact at
high local concentrations and be sequentially activated rapidly, efficiently, and
selectively in response to an appropriate extracellular signal, avoiding unwanted
cross-talk with other signaling pathways.

In other cases, such signaling complexes form only transiently in response
to an extracellular signal and rapidly disassemble when the signal is gone. They
often assemble around a receptor after an extracellular signal molecule has
activated it. In many of these cases, the cytoplasmic tail of the activated receptor is
phosphorylated during the activation process, and the phosphorylated amino
acids then serve as docking sites for the assembly of other signaling proteins
(Figure 15-10B). In yet other cases, receptor activation leads to the production
of modified phospholipid molecules (called phosphoinositides) in the adjacent
plasma membrane, which then recruit specific intracellular signaling proteins to
this region of membrane, where they are activated (Figure 15-10C).

Modular Interaction Domains Mediate Interactions Between
Intracellular Signaling Proteins

Simply bringing intracellular signaling proteins together into close proximity is
sometimes sufficient to activate them. Thus, induced proximity, where a signal
triggers assembly of a signaling complex, is commonly used to relay signals from
protein to protein along a signaling pathway. The assembly of such signaling
complexes depends on various highly conserved, small interaction domains, which
are found in many intracellular signaling proteins. Each of these compact
protein modules binds to a particular structural motif in another protein or lipid. The
recognized motif in the interacting protein can be a short peptide sequence, a
covalent modification (such as a phosphorylated amino acid), or another protein
domain. The use of modular interaction domains presumably facilitated the
evolution of new signaling pathways; because such a domain can be inserted at many
locations in a protein without disturbing the protein's folding or function, a new
interaction domain added to an existing signaling protein could connect the protein
to additional signaling pathways.

There are many types of interaction domains in signaling proteins. Src
homology 2 (SH2) domains and phosphotyrosine-binding (PTB) domains, for example,
bind to phosphorylated tyrosines in a particular peptide sequence on activated
receptors or intracellular signaling proteins. Src homology 3 (SH3) domains bind
to short, proline-rich amino acid sequences. Some pleckstrin homology (PH)
domains bind to the charged head groups of specific phosphoinositides that are
produced in the plasma membrane in response to an extracellular signal; they
enable the protein they are part of to dock on the membrane and interact with
other similarly recruited signaling proteins (see Figure 15-10C). Some signaling
proteins consist solely of two or more interaction domains and function only as
adaptors to link two other proteins together in a signaling pathway.

Figure 15-10 Three types of intracellular signaling complexes. (A) A receptor and
some of the intracellular signaling proteins it activates in sequence are preassembled
into a signaling complex on the inactive receptor by a large scaffold protein.
(B) A signaling complex assembles transiently on a receptor only after the
binding of an extracellular signal molecule has activated the receptor; here, the
activated receptor phosphorylates itself at multiple sites, which then act as docking
sites for intracellular signaling proteins.
(C) Activation of a receptor leads to the increased phosphorylation of specific
phospholipids (phosphoinositides) in the adjacent plasma membrane; these
then serve as docking sites for specific intracellular signaling proteins, which can
now interact with each other.
[Panel titles: (B) assembly of signaling complex on an activated receptor; (C) assembly of signaling complex on phosphoinositide docking sites]

Interaction domains enable signaling proteins to bind to one another in
multiple specific combinations. Like Lego® bricks, the proteins can form linear or
branching chains or three-dimensional networks, which determine the route
followed by the signaling pathway. As an example, Figure 15-11 illustrates how
some interaction domains mediate the formation of a large signaling complex
around the receptor for the hormone insulin.

Another way of bringing receptors and intracellular signaling proteins together
is to concentrate them in a specific region of the cell. An important example is the
primary cilium that projects like an antenna from the surface of most vertebrate
cells (discussed in Chapter 16). It is usually short and nonmotile and has
microtubules in its core, and a number of surface receptors and signaling proteins are
concentrated there. We shall see later that light and smell receptors are also highly
concentrated in specialized cilia.


Figure 15-11 A specific signaling complex formed using modular interaction
domains. This example is based on the insulin receptor, which is an
enzyme-coupled receptor (a receptor tyrosine kinase, discussed later). First,
the activated receptor phosphorylates itself on tyrosines, and one of the
phosphotyrosines then recruits a docking protein called insulin receptor
substrate-1 (IRS1) via a PTB domain of IRS1; the PH domain of IRS1 also binds to
specific phosphoinositides on the inner surface of the plasma membrane. Then,
the activated receptor phosphorylates IRS1 on tyrosines, and one of these
phosphotyrosines binds the SH2 domain of the adaptor protein Grb2. Next, Grb2
uses one of its two SH3 domains to bind to a proline-rich region of a protein
called Sos, which relays the signal downstream by acting as a GEF (see Figure
15-8) to activate a monomeric GTPase called Ras (not shown). Sos also binds to
phosphoinositides in the plasma membrane via its PH domain. Grb2 uses its other
SH3 domain to bind to a proline-rich sequence in a scaffold protein. The
scaffold protein binds several other signaling proteins, and the other
phosphorylated tyrosines on IRS1 recruit additional signaling proteins that
have SH2 domains (not shown).


The Relationship Between Signal and Response Varies in Different 
Signaling Pathways 

The function of an intracellular signaling system is to detect and measure a
specific stimulus in one location of a cell and then generate an appropriately
timed and measured response at another location. The system accomplishes this
task by sending information in the form of molecular “signals” from the sensor
to the target, often through a series of intermediaries that do not simply pass
the signal along but process it in various ways. Not all signaling systems work
in precisely the same way: each has evolved specialized behaviors that produce a
response that is appropriate for the cell function that the system controls. In
the following paragraphs, we list some of these behaviors and describe how they
vary in different systems, as a foundation for more detailed discussions later.

1. Response timing varies dramatically in different signaling systems,
according to the speed required for the response. In some cases, such as synaptic
signaling (see Figure 15-2C), the response can occur within milliseconds.
In other cases, as in the control of cell fate by morphogens during
development, a full response can require hours or days.

2. Sensitivity to extracellular signals can vary greatly. Hormones tend to act
at very low concentrations on their distant target cells, which are therefore
highly sensitive to low concentrations of signal. Neurotransmitters, on the
other hand, operate at much higher concentrations at a synapse, reducing
the need for high sensitivity in postsynaptic receptors. Sensitivity is often
controlled by changes in the number or affinity of the receptors on the
target cell. A particularly important mechanism for increasing the sensitivity
of a signaling system is signal amplification, whereby a small number of
activated cell-surface receptors evoke a large intracellular response either
by producing large amounts of a second messenger or by activating many
copies of a downstream signaling protein.

3. Dynamic range of a signaling system is related to its sensitivity. Some
systems, like those involved in simple developmental decisions, are responsive
over a narrow range of extracellular signal concentrations. Other systems,
like those controlling vision or the metabolic response to some hormones,
are highly responsive over a much broader range of signal strengths. We
will see that broad dynamic range is often achieved by adaptation
mechanisms that adjust the responsiveness of the system according to the
prevailing amount of signal.

4. Persistence of a response can vary greatly. A transient response of less than 
a second is appropriate in some synaptic responses, for example, while a 
prolonged or even permanent response is required in cell fate decisions 
during development. Numerous mechanisms, including positive feedback, 
can be used to alter the duration and reversibility of a response. 

5. Signal processing can convert a simple signal into a complex response. In
many systems, for example, a gradual increase in an extracellular signal
is converted into an abrupt, switchlike response. In other cases, a simple
input signal is converted into an oscillatory response, produced by a
repeating series of transient intracellular signals. Feedback usually lies at
the heart of biochemical switches and oscillators, as we describe later.

6. Integration allows a response to be governed by multiple inputs. As
discussed earlier, for example, specific combinations of extracellular signals
are generally required to stimulate complex cell behaviors such as cell
survival and proliferation (see Figure 15-4). The cell therefore has to integrate
information coming from multiple signals, which often depends on
intracellular coincidence detectors; these proteins are equivalent to AND gates
in the microprocessor of a computer, in that they are only activated if they
receive multiple converging signals (Figure 15-12).

7. Coordination of multiple responses in one cell can be achieved by a single
extracellular signal. Some extracellular signal molecules, for example,
stimulate a cell to both grow and divide. This coordination generally depends
on mechanisms for distributing a signal to multiple effectors, by creating
branches in the signaling pathway. In some cases, the branching of
signaling pathways can allow one signal to modulate the strength of a response to
other signals.
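
The coincidence detector described in item 6 can be sketched as a two-input AND
gate. This is only a minimal illustration, not a model of any particular
protein: the class name ProteinY echoes the protein Y of Figure 15-12, and the
field and function names are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class ProteinY:
    """Hypothetical coincidence detector with two phosphorylation sites."""
    site_a_phosphorylated: bool = False
    site_b_phosphorylated: bool = False

    @property
    def active(self) -> bool:
        # Active only when both sites are phosphorylated (logical AND).
        return self.site_a_phosphorylated and self.site_b_phosphorylated


def respond(signal_a: bool, signal_b: bool) -> bool:
    """Each signal drives its own pathway, which phosphorylates one site of Y."""
    return ProteinY(signal_a, signal_b).active


if __name__ == "__main__":
    # Print the full truth table of the detector.
    for a in (False, True):
        for b in (False, True):
            print(f"signal A={a!s:5} signal B={b!s:5} -> Y active: {respond(a, b)}")
```

Only the combination in which both signals are present activates the protein,
which is the behavior the text compares to an AND gate.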

Given the complexity that arises from behaviors like signal integration,
distribution, and feedback, it is clear that signaling systems rarely depend on a
simple linear sequence of steps but are often more like a network, in which
information flows not just forward but in multiple directions, and sometimes even
backward. A major research challenge is to understand the nature of these networks
and the response behaviors they can achieve.

The Speed of a Response Depends on the Turnover of Signaling
Molecules

The speed of any signaling response depends on the nature of the intracellular
signaling molecules that carry out the target cell's response. When the response
requires only changes in proteins already present in the cell, it can occur very
rapidly: an allosteric change in a neurotransmitter-gated ion channel (discussed in
Chapter 11), for example, can alter the plasma membrane electrical potential in
milliseconds, and responses that depend solely on protein phosphorylation can
occur within seconds. When the response involves changes in gene expression
and the synthesis of new proteins, however, it usually requires many minutes or
hours, regardless of the mode of signal delivery (Figure 15-13).

It is natural to think of intracellular signaling systems in terms of the changes
produced when an extracellular signal is delivered. But it is just as important to
consider what happens when the signal is withdrawn. During development,
transient extracellular signals often produce lasting effects: they can trigger a
change in the cell's development that persists indefinitely through cell memory
mechanisms, as we discuss later (and in Chapters 7 and 22). In most cases in adult
tissues, however, the response fades when a signal ceases. Often the effect is
transient because the signal exerts its effects by altering the concentrations of
intracellular molecules that are short-lived (unstable), undergoing continual
turnover. Thus, once the extracellular signal is gone, the degradation of the old
molecules quickly wipes out all traces of the signal's action. It follows that the
speed with which a cell responds to signal removal depends on the rate of
destruction, or turnover, of the intracellular molecules that the signal affects.

Figure 15-12 Signal integration. Extracellular signals A and B activate
different intracellular signaling pathways, each of which leads to the
phosphorylation of protein Y but at different sites on the protein. Protein Y is
activated only when both of these sites are phosphorylated, and therefore it
becomes active only when signals A and B are simultaneously present. Such
proteins are often called coincidence detectors.

Figure 15-13 Slow and rapid responses to an extracellular signal. Certain types
of signal-induced cellular responses, such as increased cell growth and
division, involve changes in gene expression and the synthesis of new proteins;
they therefore occur slowly, often starting an hour or more after the signal is
received. Other responses, such as changes in cell movement, secretion, or
metabolism, need not involve changes in gene transcription and therefore occur
much more quickly, often starting in seconds or minutes; they may involve the
rapid phosphorylation of effector proteins in the cytoplasm, for example.
Synaptic responses mediated by changes in membrane potential are even quicker
and can occur in milliseconds (not shown). Some signaling systems generate both
rapid and slow responses as shown here, allowing the cell to respond quickly to
a signal while simultaneously initiating a more long-term, persistent response.

It is also true, although much less obvious, that this turnover rate can
determine the promptness of the response when an extracellular signal arrives.
Consider, for example, two intracellular signaling molecules, X and Y, both of which
are normally maintained at a steady-state concentration of 1000 molecules per
cell. The cell synthesizes and degrades molecule Y at a rate of 100 molecules per
second, with each molecule having an average lifetime of 10 seconds. Molecule X
has a turnover rate that is 10 times slower than that of Y: it is both synthesized and
degraded at a rate of 10 molecules per second, so that each molecule has an
average lifetime in the cell of 100 seconds. If a signal acting on the cell causes a
tenfold increase in the synthesis rates of both X and Y with no change in the
molecular lifetimes, at the end of 1 second the concentration of Y will have increased
by nearly 900 molecules per cell (10 × 100 − 100), while the concentration of X
will have increased by only 90 molecules per cell. In fact, after a molecule's
synthesis rate has been either increased or decreased abruptly, the time required for
the molecule to shift halfway from its old to its new equilibrium concentration is
equal to its half-life, that is, the time that would be required for its
concentration to fall by half if all synthesis were stopped (Figure 15-14).
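
The arithmetic in this example can be checked with a short calculation. The
sketch below assumes simple first-order degradation, dN/dt = synthesis rate −
N/lifetime, which is one common way to formalize "turnover" but is not stated
explicitly in the text; the function names are chosen for this illustration.

```python
import math


def concentration(t, synthesis_rate, lifetime, start):
    """Molecules per cell t seconds after synthesis_rate takes effect.

    Assumes first-order degradation: dN/dt = synthesis_rate - N / lifetime.
    """
    steady_state = synthesis_rate * lifetime
    return steady_state + (start - steady_state) * math.exp(-t / lifetime)


def half_life(lifetime):
    """Half-life corresponding to a given average molecular lifetime."""
    return lifetime * math.log(2)


if __name__ == "__main__":
    # Values from the text: (name, original synthesis rate per second, lifetime in s).
    for name, old_rate, lifetime in (("Y", 100.0, 10.0), ("X", 10.0, 100.0)):
        start = old_rate * lifetime          # 1000 molecules per cell for both
        new_rate = 10 * old_rate             # tenfold increase in synthesis
        gain = concentration(1.0, new_rate, lifetime, start) - start
        halfway = concentration(half_life(lifetime), new_rate, lifetime, start)
        print(f"{name}: +{gain:.0f} molecules after 1 s; "
              f"{halfway:.0f} molecules after one half-life "
              f"({half_life(lifetime):.1f} s)")
```

Under this assumption Y gains about 856 molecules in the first second ("nearly
900") and X about 90, and each molecule is exactly halfway between its old level
(1000) and its new level (10,000) after one half-life.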

The same principles apply to proteins and small molecules, whether the
molecules are in the extracellular space or inside cells. Many intracellular proteins
have short half-lives, some surviving for less than 10 minutes. In most cases, these
are key regulatory proteins whose concentrations are rapidly controlled in the cell
by changes in their rates of synthesis.

As we have seen, many cell responses to extracellular signals depend on the
conversion of intracellular signaling proteins from an inactive to an active form,
rather than on their synthesis or degradation. Phosphorylation or the binding of
GTP, for example, commonly activates signaling proteins. Even in these cases,
however, the activation must be rapidly and continuously reversed (by
dephosphorylation or GTP hydrolysis to GDP, respectively, in these examples) to make
rapid signaling possible. These inactivation processes play a crucial part in
determining the magnitude, rapidity, and duration of the response.

Figure 15-14 The importance of rapid turnover. The graphs show the predicted
relative rates of change in the intracellular concentrations of molecules with
differing turnover times when their synthesis rates are either (A) decreased or
(B) increased suddenly by a factor of 10. In both cases, the concentrations of
those molecules that are normally degraded rapidly in the cell (red lines)
change quickly, whereas the concentrations of those that are normally degraded
slowly (green lines) change proportionally more slowly. The numbers (in blue) on
the right are the half-lives assumed for each of the different molecules.


Cells Can Respond Abruptly to a Gradually Increasing Signal 

Some signaling systems are capable of generating a smoothly graded response
over a wide range of extracellular signal concentrations (Figure 15-15, blue line);
such systems are useful, for example, in the fine tuning of metabolic processes
by some hormones. Other signaling systems generate significant responses only
when the signal concentration rises beyond some threshold value. These abrupt
responses are of two types. One is a sigmoidal response, in which low
concentrations of stimulus do not have much effect, but then the response rises
steeply and continuously at intermediate stimulus levels (Figure 15-15, red line).
Such systems provide a filter to reduce inappropriate responses to low-level
background signals but respond with high sensitivity when the stimulus falls within
a small range of physiological signal concentrations. A second type of abrupt
response is the discontinuous or all-or-none response, in which the response
switches on completely (and often irreversibly) when the signal reaches some
threshold concentration (Figure 15-15, green line). Such responses are particularly
useful for controlling the choice between two alternative cell states, and they
generally involve positive feedback, as we describe in more detail shortly.

Cells use a variety of molecular mechanisms to produce a sigmoidal response
to increasing signal concentrations. In one mechanism, more than one
intracellular signaling molecule must bind to its downstream target protein to induce a
response. As we discuss later, for example, four molecules of the second
messenger cyclic AMP must bind simultaneously to each molecule of
cyclic-AMP-dependent protein kinase (PKA) to activate the kinase. A similar sharpening
of response is seen when the activation of an intracellular signaling protein
requires phosphorylation at more than one site. Such responses become sharper as
the number of required molecules or phosphate groups increases, and if the number
is large enough, responses become almost all-or-none (Figure 15-16).
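
One common way to illustrate this sharpening is the Hill equation, in which the
fraction of active target protein is c^n / (K^n + c^n) for effector
concentration c. The sketch below assumes this idealized form (all n effector
molecules binding at once); it is not necessarily the calculation behind Figure
15-16, and the names k_half and n are choices made for this example.

```python
def fraction_active(conc, n, k_half=1.0):
    """Hill equation: fraction of target activated at effector concentration conc.

    n is the number of effector molecules that must bind simultaneously;
    k_half is the concentration giving half-maximal activation.
    """
    return conc ** n / (k_half ** n + conc ** n)


def fold_change_10_to_90(n):
    """Fold increase in concentration needed to go from 10% to 90% activation.

    Solving the Hill equation for 0.1 and 0.9 gives a ratio of 81 ** (1 / n).
    """
    return 81.0 ** (1.0 / n)


if __name__ == "__main__":
    for n in (1, 2, 4, 8, 16):
        print(f"n = {n:2d}: 10% -> 90% activation needs a "
              f"{fold_change_10_to_90(n):.2f}-fold rise in effector")
```

With n = 1 an 81-fold rise in concentration is needed to go from 10% to 90%
activation; with n = 4 (as for cyclic AMP and PKA in this idealization) a 3-fold
rise suffices, and with n = 16 the curve is nearly a step.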

Responses are also sharpened when an intracellular signaling molecule
activates one enzyme and also inhibits another enzyme that catalyzes the
opposite reaction. A well-studied example of this common type of regulation is the
stimulation of glycogen breakdown in skeletal muscle cells induced by the
hormone adrenaline (epinephrine). Adrenaline's binding to a G-protein-coupled
cell-surface receptor increases the intracellular concentration of cyclic AMP,
which both activates an enzyme that promotes glycogen breakdown and inhibits
an enzyme that promotes glycogen synthesis.

Figure 15-15 Signal processing can produce smoothly graded or
switchlike responses. Some cell responses increase gradually as the
concentration of extracellular signal molecule increases, eventually reaching
a plateau as the signaling pathway is saturated, resulting in a hyperbolic
response curve (blue line). In other cases, the signaling system reduces
the response at low signal concentrations and then produces a steeper
response at some intermediate signal concentration, resulting in a sigmoidal
response curve (red line). In still other cases, the response is more abrupt and
switchlike; the cell switches completely between a low and high response,
without any stable intermediate response (green line).

Positive Feedback Can Generate an All-or-None Response

Like intracellular metabolic pathways (discussed in Chapter 2) and the systems
controlling gene activity (Chapter 7), most intracellular signaling systems
incorporate feedback loops, in which the output of a process acts back to regulate that
same process. We discussed the mathematical analysis of feedback loops in
Chapter 8. In positive feedback, the output stimulates its own production; in negative
feedback, the output inhibits its own production (Figure 15-17). Feedback loops
are of great general importance in biology, and they regulate many chemical and
physical processes in cells. Those that regulate cell signaling can either operate
exclusively within the target cell or involve the secretion of extracellular signals.
Here, we focus on those feedback loops that operate entirely within the target cell;
even the simplest of these loops can produce complex and interesting effects.

Figure 15-16 Activation curves for an allosteric protein as a function
of effector molecule concentration. The curves show how the sharpness of
the activation response increases with an increase in the number of allosteric
effector molecules that must be bound simultaneously to activate the target
protein. The curves shown are those expected, under certain conditions, if the
activation requires the simultaneous binding of 1, 2, 8, or 16 effector molecules.

Positive feedback in a signaling pathway can transform the behavior of the
responding cell. If the positive feedback is of only moderate strength, its effect will
be simply to steepen the response to the signal, generating a sigmoidal response
like those described earlier; but if the feedback is strong enough, it can produce
an all-or-none response (see Figure 15-15). This response goes hand in hand with
a further property: once the responding system has switched to the high level of
activation, this condition is often self-sustaining and can persist even after the
signal strength drops back below its critical value. In such a case, the system is
said to be bistable: it can exist in either a “switched-off” or a “switched-on” state,
and a transient stimulus can flip it from the one state to the other (Figure 15-18A
and B).
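
The switch-like memory described above can be reproduced with a very small
numerical model. The sketch below is an assumption-laden illustration, not the
calculation behind Figure 15-18: it tracks the active fraction e of a kinase
that is activated by a stimulus and by its own (cooperative) feedback and is
inactivated at a constant rate. The equation form and every parameter value are
hypothetical.

```python
def simulate_switch(feedback_strength, t_end=60.0, dt=0.01,
                    stim_start=5.0, stim_end=15.0, stim_level=0.5,
                    k_off=0.2, k_half=0.3):
    """Euler integration of de/dt = (s + feedback) * (1 - e) - k_off * e.

    e is the active fraction of the kinase (0 to 1); s is a transient stimulus;
    feedback = feedback_strength * e**2 / (k_half**2 + e**2) is the positive
    feedback of active kinase on its own activation. All values are hypothetical.
    """
    e = 0.0
    trace = []
    for i in range(int(round(t_end / dt))):
        t = i * dt
        s = stim_level if stim_start <= t < stim_end else 0.0
        feedback = feedback_strength * e ** 2 / (k_half ** 2 + e ** 2)
        e += dt * ((s + feedback) * (1.0 - e) - k_off * e)
        trace.append(e)
    return trace


if __name__ == "__main__":
    without = simulate_switch(feedback_strength=0.0)
    with_fb = simulate_switch(feedback_strength=1.0)
    print(f"no feedback:       activity long after the stimulus = {without[-1]:.3f}")
    print(f"positive feedback: activity long after the stimulus = {with_fb[-1]:.3f}")
```

Without feedback the activity returns to zero once the stimulus ends; with
strong feedback the same transient stimulus leaves the system in a persistent
"on" state, which is the bistable behavior described in the text.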

Through positive feedback, a transient extracellular signal can induce
long-term changes in cells and their progeny that can persist for the lifetime of the
organism. The signals that trigger muscle-cell specification, for example, turn on
the transcription of a series of genes that encode muscle-specific transcription
regulatory proteins, which stimulate the transcription of their own genes, as well
as genes encoding various other muscle-cell proteins; in this way, the decision
to become a muscle cell is made permanent. This type of cell memory, which
depends on positive feedback, is one of the basic ways in which a cell can undergo
a lasting change of character without any alteration in its DNA sequence.

Studies of signaling responses in large populations of cells can give the false
impression that a response is smoothly graded, even when strong positive
feedback is causing an abrupt, discontinuous switch in the response in individual
cells. Only by studying the response in single cells is it possible to see its
all-or-none character (Figure 15-19). The misleading smooth response in a cell
population is due to the random, intrinsic variability in signaling systems that we
described earlier: not all cells in a population respond identically to the same
concentration of extracellular signal, especially at intermediate signal
concentrations where the receptor is only partially occupied.
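
The way a population average can hide an all-or-none response is easy to
demonstrate numerically. In the sketch below every cell is strictly on or off,
but each cell has its own threshold; the log-normal distribution of thresholds,
its parameters, and the function names are assumptions made for illustration and
are not taken from the experiments in Figure 15-19.

```python
import math
import random


def make_thresholds(n_cells, median=1.0, spread=0.5, seed=0):
    """Draw one activation threshold per cell from a log-normal distribution."""
    rng = random.Random(seed)
    return [rng.lognormvariate(math.log(median), spread) for _ in range(n_cells)]


def cell_response(dose, threshold):
    """All-or-none: a single cell is fully on (1.0) or fully off (0.0)."""
    return 1.0 if dose >= threshold else 0.0


def population_response(dose, thresholds):
    """Average over all cells, as would be measured in a pooled extract."""
    return sum(cell_response(dose, th) for th in thresholds) / len(thresholds)


if __name__ == "__main__":
    thresholds = make_thresholds(1000)
    for dose in (0.25, 0.5, 1.0, 2.0, 4.0):
        print(f"dose {dose:4.2f}: fraction of cells switched on = "
              f"{population_response(dose, thresholds):.2f}")
```

The pooled measurement rises smoothly with dose even though no individual cell
ever shows an intermediate response.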

Negative Feedback Is a Common Motif in Signaling Systems

By contrast with positive feedback, negative feedback counteracts the effect of a
stimulus and thereby abbreviates and limits the level of the response, making the
system less sensitive to perturbations (see Chapter 8). As with positive feedback,
however, qualitatively different responses can be obtained when the feedback
operates more powerfully. A delayed negative feedback with a long enough delay
can produce responses that oscillate. The oscillations may persist for as long as the
stimulus is present (Figure 15-18C and D) or they may even be generated
spontaneously, without need of an external signal to drive them. Many such oscillators
also contain positive feedback loops that generate sharper oscillations. Later in
this chapter, we will encounter specific examples of oscillatory behavior in the
intracellular responses to extracellular signals; all of them depend on negative
feedback, generally accompanied by positive feedback.

Figure 15-17 Positive and negative feedback. In these simple examples, a
stimulus activates protein A, which, in turn, activates protein B. Protein B
then acts back to either increase or decrease the activity of A.

Figure 15-18 Some effects of simple feedback. The graphs show the computed
effects of simple positive and negative feedback loops (see Chapter 8). In each
case, the input signal is an activated protein kinase (S) that phosphorylates
and thereby activates another protein kinase (E); a protein phosphatase (I)
dephosphorylates and inactivates the activated E kinase. In the graphs, the red
line indicates the activity of the E kinase over time; the underlying blue bar
indicates the time for which the input signal (activated S kinase) is present.
(A) Diagram of the positive feedback loop, in which the activated E kinase acts
back to promote its own phosphorylation and activation; the basal activity of
the I phosphatase dephosphorylates activated E at a steady, low rate. (B) The
top graph shows that, without feedback, the activity of the E kinase is simply
proportional (with a short lag) to the level of stimulation by the S kinase. The
bottom graph shows that, with the positive feedback loop, the transient
stimulation by S kinase switches the system from an “off” state to an “on”
state, which then persists after the stimulus has been removed. (C) Diagram of
the negative feedback loop, in which the activated E kinase phosphorylates and
activates the I phosphatase, thereby increasing the rate at which the
phosphatase dephosphorylates and inactivates the phosphorylated E kinase.
(D) The top graph shows, again, the response in E kinase activity without
feedback. The other graphs show the effects on E kinase activity of negative
feedback operating after a short or long delay. With a short delay, the system
shows a strong, brief response when the signal is abruptly changed, and the
feedback then drives the response back down to a lower level. With a long delay,
the feedback produces sustained oscillations for as long as the stimulus is
present.
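
The dependence of oscillation on the length of the feedback delay can be
illustrated with a one-variable delay equation. The sketch below is a generic
textbook-style model, not the kinase and phosphatase scheme of Figure 15-18: a
product x is made at a rate that is inhibited by the level of x at an earlier
time, and is removed at a constant rate. The equation form, the Hill exponent,
and all parameter values are assumptions chosen for illustration.

```python
def simulate_delayed_feedback(delay, t_end=100.0, dt=0.01, beta=2.0, hill_n=4):
    """Euler integration of dx/dt = beta / (1 + x(t - delay)**hill_n) - x.

    Production of x is repressed by the level of x one delay earlier
    (delayed negative feedback); x is removed with rate constant 1.
    The history before t = 0 is taken to be x = 0. Values are hypothetical.
    """
    lag = int(round(delay / dt))
    x = [0.0]
    for i in range(int(round(t_end / dt))):
        delayed = x[i - lag] if i >= lag else 0.0
        production = beta / (1.0 + delayed ** hill_n)
        x.append(x[i] + dt * (production - x[i]))
    return x


def late_amplitude(trace, n_last=2000):
    """Peak-to-trough range over the last n_last time steps."""
    tail = trace[-n_last:]
    return max(tail) - min(tail)


if __name__ == "__main__":
    for delay in (0.2, 3.0):
        amplitude = late_amplitude(simulate_delayed_feedback(delay))
        print(f"delay {delay:3.1f}: late peak-to-trough amplitude = {amplitude:.3f}")
```

With the short delay the output settles to a steady level, whereas with the long
delay it keeps oscillating for as long as the simulation runs, in line with the
qualitative behavior described in the text.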

If negative feedback operates with a short delay, the system behaves like a
change detector. It gives a strong response to a stimulus, but the response
rapidly decays even while the stimulus persists; if the stimulus is suddenly
increased, however, the system responds strongly again, but, again, the response
rapidly decays. This is the phenomenon of adaptation, which we now discuss.

Cells Can Adjust Their Sensitivity to a Signal 

In responding to many types of stimuli, cells and organisms are able to detect the
same percentage of change in a signal over a wide range of stimulus strengths.
The target cells accomplish this through a reversible process of adaptation, or
desensitization, whereby a prolonged exposure to a stimulus decreases the cells'
response to that level of stimulus. In chemical signaling, adaptation enables cells
to respond to changes in the concentration of an extracellular signal molecule
(rather than to the absolute concentration of the signal) over a very wide range
of signal concentrations. The underlying mechanism is negative feedback that
operates with a short delay: a strong response modifies the signaling machinery
involved, such that the machinery resets itself to become less responsive to the
same level of signal (see Figure 15-18D, middle graph). Owing to the delay,
however, a sudden increase in the signal is able to stimulate the cell again for a
short period before the negative feedback has time to kick in.
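
A minimal numerical sketch can show how such feedback lets a cell report
relative rather than absolute changes. In the hypothetical model below the
response is the signal divided by an "adapted" level, and the adapted level is
slowly pushed up whenever the response exceeds its resting value of 1. The model
form, the variable names, and the parameter values are all assumptions made for
illustration, not a description of any specific pathway.

```python
def step_signal(t):
    """Signal concentration: two successive tenfold increases."""
    if t < 10.0:
        return 1.0
    if t < 60.0:
        return 10.0
    return 100.0


def simulate_adaptation(t_end=110.0, dt=0.01, tau_adapt=5.0):
    """Response = signal / adapted; feedback from the response resets 'adapted'.

    d(adapted)/dt = adapted * (response - 1) / tau_adapt, so the machinery
    becomes less sensitive whenever the response is above its resting value.
    """
    adapted = step_signal(0.0)   # start fully adapted to the initial signal
    times, responses = [], []
    for i in range(int(round(t_end / dt))):
        t = i * dt
        response = step_signal(t) / adapted
        times.append(t)
        responses.append(response)
        # Delayed negative feedback: the response slowly reduces sensitivity.
        adapted += dt * adapted * (response - 1.0) / tau_adapt
    return times, responses


if __name__ == "__main__":
    times, responses = simulate_adaptation()
    for t_probe in (9.0, 10.5, 59.0, 60.5, 109.0):
        index = int(round(t_probe / 0.01))
        print(f"t = {t_probe:5.1f}: response = {responses[index]:.2f}")
```

In this sketch each tenfold step in signal produces the same transient peak
(about 10 times the resting response), after which the response returns to its
resting level even though the signal stays high.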

Adaptation to a signal molecule can occur in various ways. It can result from
inactivation of the receptors themselves. The binding of signal molecules to
cell-surface receptors, for example, may induce the endocytosis and temporary
sequestration of the receptors in endosomes. In some cases, such signal-induced
receptor endocytosis leads to the destruction of the receptors in lysosomes, a
process referred to as receptor down-regulation (in other cases, however, activated
receptors continue to signal after they have been endocytosed). Receptors can
also become inactivated on the cell surface (for example, by becoming
phosphorylated) with a short delay following their activation. Adaptation can also
occur at sites downstream of the receptors, either by a change in intracellular
signaling proteins involved in transducing the extracellular signal or by the
production of an inhibitor protein that blocks the signal transduction process. These
various adaptation mechanisms are compared in Figure 15-20.

Though bewildering in their complexity, the multiple cross-regulatory
signaling pathways and feedback loops that we describe in this chapter are not just a
haphazard tangle, but a highly evolved system for processing and interpreting
the vast number of signals that impinge upon animal cells. The whole
molecular control network, leading from the receptors at the cell surface to the genes in
the nucleus, can be viewed as a computing device; and, like that other biological
computing device, the brain, it presents one of the hardest problems in biology.
We can identify the components and discover how they work individually. We can
understand how small subsets of components work together as regulatory
modules, noise filters, or adaptation mechanisms, as we have seen. However, it is a
much more difficult task to understand how the system works as a whole. This
is not only because the system is complex; it is also because the way in which it
behaves is strongly dependent on the quantitative details of the molecular
interactions, and, for most animal cells, we have only rough qualitative information. A
major challenge for the future of signaling research is to develop more
sophisticated quantitative and computational methods for the analysis of signaling
systems, as described in Chapter 8.

Figure 15-19 The importance of examining individual cells to detect
all-or-none responses to increasing concentrations of an extracellular
signal. In these experiments, immature frog eggs (oocytes) were stimulated with
increasing concentrations of the hormone progesterone. The response was assessed
by analyzing the activation of MAP kinase (discussed later), which is one of the
protein kinases activated by phosphorylation in the response. The amount of
phosphorylated (activated) MAP kinase in extracts of the oocytes was assessed
biochemically. In (A), extracts of populations of stimulated oocytes were
analyzed, and the activation of MAP kinase appeared to increase progressively
with increasing progesterone concentration. There are two possible ways of
explaining this result: (B) MAP kinase could have increased gradually in each
individual cell with increasing progesterone concentration; or (C) individual
cells could have responded in an all-or-none way, with the gradual increase in
total MAP kinase activation reflecting the increasing number of cells responding
with increasing progesterone concentration. When extracts of individual oocytes
were analyzed, it was found that cells had either very low amounts or very high
amounts, but not intermediate amounts, of the activated kinase, indicating that
the response was essentially all-or-none at the level of individual cells, as
diagrammed in (C). Subsequent studies revealed that this all-or-none response is
due in part to strong positive feedback in the progesterone signaling system.
(Adapted from J.E. Ferrell and E.M. Machleder, Science 280:895-898, 1998. With
permission from AAAS.)

Figure 15-20 Some ways in which target cells can become adapted (desensitized)
to an extracellular signal molecule. The mechanisms shown here that operate at
the level of the receptor often involve phosphorylation or ubiquitylation of the
receptor proteins.

Summary 

Each cell in a multicellular animal is programmed to respond to a specific set of
extracellular signal molecules produced by other cells. The signal molecules act by
binding to a complementary set of receptor proteins expressed by the target cells.
Most extracellular signal molecules activate cell-surface receptor proteins, which
act as signal transducers, converting the extracellular signal into intracellular ones
that alter the behavior of the target cell. Activated receptors relay the signal into
the cell interior by activating intracellular signaling proteins. Some of these
signaling proteins transduce, amplify, or spread the signal as they relay it, while
others integrate signals from different signaling pathways. Some function as
switches that are transiently activated by phosphorylation or GTP binding. Large
signaling complexes form by means of modular interaction domains in the signaling
proteins, which allow the proteins to form functional signaling networks.

Target cells use various mechanisms, including feedback loops, to adjust the
ways in which they respond to extracellular signals. Positive feedback loops can
help cells to respond in an all-or-none fashion to a gradually increasing
concentration of an extracellular signal and to convert a short-lasting signal into a
long-lasting, or even irreversible, response. Negative feedback allows cells to adapt
to a signal molecule, which enables them to respond to small changes in the
concentration of the signal molecule over a large concentration range.

SIGNALING THROUGH G-PROTEIN-COUPLED RECEPTORS

G-protein-coupled receptors (GPCRs) form the largest family of cell-surface
receptors, and they mediate most responses to signals from the external world, as
well as signals from other cells, including hormones, neurotransmitters, and local
mediators. Our senses of sight, smell, and taste depend on them. There are more
than 800 GPCRs in humans, and in mice there are about 1000 concerned with
the sense of smell alone. The signal molecules that act on GPCRs are as varied in
structure as they are in function and include proteins and small peptides, as well
as derivatives of amino acids and fatty acids, not to mention photons of light and
all the molecules that we can smell or taste. The same signal molecule can activate
many different GPCR family members; for example, adrenaline activates at least
9 distinct GPCRs, acetylcholine another 5, and the neurotransmitter serotonin at
least 14. The different receptors for the same signal are usually expressed in
different cell types and elicit different responses.

Despite the chemical and functional diversity of the signal molecules that activate
them, all GPCRs have a similar structure. They consist of a single polypeptide
chain that threads back and forth across the lipid bilayer seven times, forming a
cylindrical structure, often with a deep ligand-binding site at its center (Figure
15-21). In addition to their characteristic orientation in the plasma membrane,
they all use G proteins to relay the signal into the cell interior.

The GPCR superfamily includes rhodopsin, the light-activated protein in the 
vertebrate eye, as well as the large number of olfactory receptors in the vertebrate 
nose. Other family members are found in unicellular organisms: the receptors in 
yeasts that recognize secreted mating factors are an example. It is likely that the
GPCRs that mediate cell-cell signaling in multicellular organisms evolved from
the sensory receptors in their unicellular eukaryotic ancestors. 

It is remarkable that almost half of all known drugs work through GPCRs or the
signaling pathways GPCRs activate. Of the many hundreds of genes in the human
genome that encode GPCRs, about 150 encode orphan receptors, for which the
ligand is unknown. Many of them are likely targets for new drugs that remain to
be discovered. 

Trimeric G Proteins Relay Signals From GPCRs

When an extracellular signal molecule binds to a GPCR, the receptor undergoes
a conformational change that enables it to activate a trimeric GTP-binding protein
(G protein), which couples the receptor to enzymes or ion channels in the
membrane. In some cases, the G protein is physically associated with the receptor
before the receptor is activated, whereas in others it binds only after receptor
activation. There are various types of G proteins, each specific for a particular set
of GPCRs and for a particular set of target proteins in the plasma membrane. They
all have a similar structure, however, and operate similarly.

G proteins are composed of three protein subunits: α, β, and γ. In the unstimulated
state, the α subunit has GDP bound and the G protein is inactive (Figure
15-22). When a GPCR is activated, it acts like a guanine nucleotide exchange factor
(GEF) and induces the α subunit to release its bound GDP, allowing GTP to
bind in its place. GTP binding then causes an activating conformational change in
the Gα subunit, releasing the G protein from the receptor and triggering dissociation
of the GTP-bound Gα subunit from the Gβγ pair, both of which then interact
with various targets, such as enzymes and ion channels in the plasma membrane,
which relay the signal onward (Figure 15-23).

The α subunit is a GTPase and becomes inactive when it hydrolyzes its bound
GTP to GDP. The time required for GTP hydrolysis is usually short because the
GTPase activity is greatly enhanced by the binding of the α subunit to a second
protein, which can be either the target protein or a specific regulator of G protein
signaling (RGS). RGS proteins act as α-subunit-specific GTPase-activating
proteins (GAPs) (see Figure 15-8), and they help shut off G-protein-mediated
responses in all eukaryotes. There are about 25 RGS proteins encoded in the
human genome, each of which interacts with a particular set of G proteins.
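
The balance between receptor-driven nucleotide exchange and GTP hydrolysis can be illustrated with a minimal two-state sketch. It assumes, purely for illustration, that GDP-for-GTP exchange and GTP hydrolysis are each first-order processes; the function name and all rate constants are hypothetical and are not taken from the text.

```python
def active_fraction(k_exchange: float, k_hydrolysis: float) -> float:
    """Steady-state fraction of alpha subunits in the GTP-bound (active) state.

    Two-state model: inactive (GDP-bound) -> active (GTP-bound) at k_exchange,
    active -> inactive at k_hydrolysis. At steady state the two fluxes balance:
    k_exchange * (1 - f) = k_hydrolysis * f.
    """
    return k_exchange / (k_exchange + k_hydrolysis)


# Hypothetical rate constants (per second), chosen only to show the trend.
resting = active_fraction(k_exchange=0.01, k_hydrolysis=1.0)              # receptor inactive
stimulated = active_fraction(k_exchange=10.0, k_hydrolysis=1.0)           # receptor acts as GEF
stimulated_with_rgs = active_fraction(k_exchange=10.0, k_hydrolysis=100.0)  # RGS acts as GAP

print(f"resting:             {resting:.3f}")
print(f"stimulated:          {stimulated:.3f}")
print(f"stimulated plus RGS: {stimulated_with_rgs:.3f}")
```

In this sketch an activated receptor raises `k_exchange` and so shifts the population toward the GTP-bound form, whereas a GAP such as an RGS protein raises `k_hydrolysis`, which both lowers the active fraction and shortens the mean lifetime of the active state (1 / `k_hydrolysis`).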






Figure 15-21 A G-protein-coupled
receptor (GPCR). (A) GPCRs that bind
small ligands such as adrenaline have
small extracellular domains, and the ligand
usually binds deep within the plane of
the membrane to a site that is formed by
amino acids from several transmembrane
segments. GPCRs that bind protein
ligands have a large extracellular domain
(not shown here) that contributes to
ligand binding. (B) The structure of the
β2-adrenergic receptor, a receptor for the
neurotransmitter adrenaline, illustrates the
typical cylindrical arrangement of the seven
transmembrane helices in a GPCR. The
ligand (orange) binds in a pocket between
the helices, resulting in conformational
changes on the cytoplasmic surface of the
receptor that promote G-protein activation
(not shown). (PDB code: 3P0G.)
Some G Proteins Regulate the Production of Cyclic AMP

Cyclic AMP (cAMP) acts as a second messenger in some signaling pathways. 
An extracellular signal can increase cAMP concentration more than twentyfold 
in seconds (Figure 15-24). As explained earlier (see Figure 15-14), such a rapid 
response requires balancing a rapid synthesis of the molecule with its rapid
breakdown or removal. Cyclic AMP is synthesized from ATP by an enzyme called 


Figure 15-22 The structure of an inactive
G protein. (A) Note that both the α and the
γ subunits have covalently attached lipid
molecules (red tails) that help bind them to
the plasma membrane, and the α subunit
has GDP bound. (B) The three-dimensional
structure of the inactive, GDP-bound form
of a G protein called Gs, which interacts
with numerous GPCRs, including the
β2-adrenergic receptor shown in Figures
15-21 and 15-23. The α subunit contains
the GTPase domain and binds to one side
of the β subunit. The γ subunit binds to
the opposite side of the β subunit, and the
β and γ subunits together form a single
functional unit. The GTPase domain of the
α subunit contains two major subdomains:
the "Ras" domain, which is related to other
GTPases and provides one face of the
nucleotide-binding pocket; and the alpha-helical
or "AH" domain, which clamps the
nucleotide in place. (B, based on
D.G. Lambright et al., Nature 379:311-319,
1996. With permission from Macmillan
Publishers Ltd.)







Figure 15-23 Activation of a G protein
by an activated GPCR. Binding of an
extracellular signal molecule to a GPCR
changes the conformation of the receptor,
which allows the receptor to bind and
alter the conformation of a trimeric
G protein. The AH domain of the G protein
α subunit moves outward to open the
nucleotide-binding site, thereby promoting
dissociation of GDP. GTP binding then
promotes closure of the nucleotide-binding
site, triggering conformational changes that
cause dissociation of the α subunit from
the receptor and from the βγ complex. The
GTP-bound α subunit and the βγ complex
each regulate the activities of downstream
signaling molecules (not shown). The
receptor stays active while the extracellular
signal molecule is bound to it, and it can
therefore catalyze the activation of many
G-protein molecules (Movie 15.1).
adenylyl cyclase, and it is rapidly and continuously destroyed by cyclic AMP
phosphodiesterases (Figure 15-25). Adenylyl cyclase is a large, multipass transmembrane
protein with its catalytic domain on the cytosolic side of the plasma
membrane. There are at least eight isoforms in mammals, most of which are regulated
by both G proteins and Ca2+.
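
Why rapid breakdown is needed for a rapid rise can be seen in a small worked sketch. It assumes the simplest possible turnover model, constant synthesis and first-order breakdown (dC/dt = synthesis - k * C), and assumes that the signal acts only by raising the synthesis rate twentyfold. The resting level of 5 × 10^-8 M is the value quoted for Figure 15-24; the two breakdown rate constants are hypothetical.

```python
import math


def camp_after_step(c_start, synthesis_rate, k_degrade, t):
    """cAMP concentration t seconds after synthesis switches to a new constant rate.

    Analytic solution of dC/dt = synthesis_rate - k_degrade * C.
    """
    c_final = synthesis_rate / k_degrade
    return c_final + (c_start - c_final) * math.exp(-k_degrade * t)


C_REST = 5e-8  # M, resting cAMP level quoted for Figure 15-24
FOLD = 20      # assumed twentyfold increase in the synthesis rate

for k in (0.01, 1.0):  # hypothetical breakdown rate constants, per second
    s_stimulated = FOLD * C_REST * k  # synthesis rate giving a 20x higher steady state
    c20 = camp_after_step(C_REST, s_stimulated, k, t=20.0)
    half_time = math.log(2) / k
    print(f"k = {k}/s  half-time = {half_time:.1f} s  cAMP after 20 s = {c20:.2e} M")
```

Both cases head for the same twentyfold higher steady state, but in this model only the cell with fast breakdown (and correspondingly fast synthesis) gets close to it within 20 seconds, because the approach to the new level is set by the breakdown rate constant alone.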

Many extracellular signals work by increasing cAMP concentrations inside
the cell. These signals activate GPCRs that are coupled to a stimulatory G protein
(Gs). The activated α subunit of Gs binds and thereby activates adenylyl cyclase.
Other extracellular signals, acting through different GPCRs, reduce cAMP levels
by activating an inhibitory G protein (Gi), which then inhibits adenylyl cyclase.

Both Gs and Gi are targets for medically important bacterial toxins. Cholera
toxin, which is produced by the bacterium that causes cholera, is an enzyme that
catalyzes the transfer of ADP ribose from intracellular NAD+ to the α subunit of
Gs. This ADP ribosylation alters the α subunit so that it can no longer hydrolyze its
bound GTP, causing it to remain in an active state that stimulates adenylyl cyclase
indefinitely. The resulting prolonged elevation in cAMP concentration within
intestinal epithelial cells causes a large efflux of Cl- and water into the gut, thereby
causing the severe diarrhea that characterizes cholera. Pertussis toxin, which is
made by the bacterium that causes pertussis (whooping cough), catalyzes the
ADP ribosylation of the α subunit of Gi, preventing the protein from interacting
with receptors; as a result, the G protein remains in the inactive GDP-bound state
and is unable to regulate its target proteins. These two toxins are widely used in
experiments to determine whether a cell's GPCR-dependent response to a signal
is mediated by Gs or by Gi.

Some of the responses mediated by a Gs-stimulated increase in cAMP concentration
are listed in Table 15-1. As the table shows, different cell types respond
differently to an increase in cAMP concentration. Some cell types, such as fat cells,
activate adenylyl cyclase in response to multiple hormones, all of which thereby
stimulate the breakdown of triglyceride (the storage form of fat) to fatty acids.
Individuals with genetic defects in the Gs α subunit show decreased responses to
certain hormones, resulting in metabolic abnormalities, abnormal bone development,
and mental retardation.

Cyclic-AMP-Dependent Protein Kinase (PKA) Mediates Most of
the Effects of Cyclic AMP

In most animal cells, cAMP exerts its effects mainly by activating cyclic-AMP-dependent
protein kinase (PKA). This kinase phosphorylates specific serines or


Figure 15-25 The synthesis and degradation of cyclic AMP. In a reaction
catalyzed by the enzyme adenylyl cyclase, cyclic AMP (cAMP) is synthesized
from ATP through a cyclization reaction that removes two phosphate
groups as pyrophosphate (PPi); a pyrophosphatase drives this synthesis by
hydrolyzing the released pyrophosphate to phosphate (not shown). Cyclic
AMP is short-lived (unstable) in the cell because it is hydrolyzed by specific
phosphodiesterases to form 5'-AMP, as indicated.


Figure 15-24 An increase in cyclic AMP
in response to an extracellular signal.
This nerve cell in culture is responding to
the neurotransmitter serotonin, which acts
through a GPCR to cause a rapid rise in
the intracellular concentration of cyclic
AMP. To monitor the cyclic AMP level, the
cell has been loaded with a fluorescent
protein that changes its fluorescence when
it binds cyclic AMP. Blue indicates a low
level of cyclic AMP, yellow an intermediate
level, and red a high level. (A) In the resting
cell, the cyclic AMP level is about 5 × 10^-8
M. (B) Twenty seconds after the addition
of serotonin to the culture medium,
the intracellular level of cyclic AMP has
increased to more than 10^-6 M in the
relevant parts of the cell, an increase of
more than twentyfold. (From B.J. Bacskai
et al., Science 260:222-226, 1993. With
permission from AAAS.)


























Table 15-1 Some Hormone-induced Cell Responses Mediated by Cyclic AMP

Target tissue     Hormone                                Major response
Thyroid gland     Thyroid-stimulating hormone (TSH)      Thyroid hormone synthesis and secretion
Adrenal cortex    Adrenocorticotrophic hormone (ACTH)    Cortisol secretion
Ovary             Luteinizing hormone (LH)               Progesterone secretion
Muscle            Adrenaline                             Glycogen breakdown
Bone              Parathormone                           Bone resorption
Heart             Adrenaline                             Increase in heart rate and force of contraction
Liver             Glucagon                               Glycogen breakdown
Kidney            Vasopressin                            Water resorption
Fat               Adrenaline, ACTH, glucagon, TSH        Triglyceride breakdown


threonines on selected target proteins, including intracellular signaling proteins
and effector proteins, thereby regulating their activity. The target proteins differ
from one cell type to another, which explains why the effects of cAMP vary so
markedly depending on the cell type (see Table 15-1).

In the inactive state, PKA consists of a complex of two catalytic subunits and
two regulatory subunits. The binding of cAMP to the regulatory subunits alters
their conformation, causing them to dissociate from the complex. The released
catalytic subunits are thereby activated to phosphorylate specific target proteins
(Figure 15-26). The regulatory subunits of PKA (also called A-kinase) are important
for localizing the kinase inside the cell: special A-kinase anchoring proteins
(AKAPs) bind both to the regulatory subunits and to a component of the cytoskeleton
or a membrane of an organelle, thereby tethering the enzyme complex
to a particular subcellular compartment. Some AKAPs also bind other signaling
proteins, forming a signaling complex. An AKAP located around the nucleus of
heart muscle cells, for example, binds both PKA and a phosphodiesterase that
hydrolyzes cAMP. In unstimulated cells, the phosphodiesterase keeps the local
cAMP concentration low, so that the bound PKA is inactive; in stimulated cells,
cAMP concentration rapidly rises, overwhelming the phosphodiesterase and activating
the PKA. Among the target proteins that PKA phosphorylates and activates
in these cells is the adjacent phosphodiesterase, which rapidly lowers the cAMP
concentration again. This negative feedback arrangement converts what might
otherwise be a prolonged PKA response into a brief, local pulse of PKA activity.
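
The pulse-shaping effect of this feedback loop can be sketched with a toy simulation. Everything in it is an assumption made for illustration: the two-variable model, the sigmoidal dependence of PKA activity on cAMP, the delay `tau` with which phosphodiesterase activity follows PKA activity, and all parameter values (arbitrary units). It is not a quantitative model of the AKAP complex.

```python
def simulate(feedback_gain, t_end=80.0, dt=0.01):
    """Euler integration of a toy cAMP / PKA / phosphodiesterase (PDE) loop."""
    k_basal = 1.0    # basal PDE activity (hypothetical)
    tau = 5.0        # delay with which PDE activation follows PKA activity
    c, p = 0.0, 0.0  # cAMP level; fraction of PDE activated by PKA
    trace = []
    for i in range(int(t_end / dt)):
        t = i * dt
        synthesis = 5.0 if t >= 20.0 else 0.1  # signal switched on at t = 20
        pka = c * c / (1.0 + c * c)            # PKA activity, sigmoidal in cAMP
        dc = synthesis - (k_basal + feedback_gain * p) * c
        dp = (pka - p) / tau
        c += dc * dt
        p += dp * dt
        trace.append(c)
    return trace


with_feedback = simulate(feedback_gain=9.0)
without_feedback = simulate(feedback_gain=0.0)
peak, final = max(with_feedback), with_feedback[-1]
print(f"with feedback:    peak cAMP = {peak:.2f}, later level = {final:.2f}")
print(f"without feedback: later level = {without_feedback[-1]:.2f}")
```

With the feedback term switched on, cAMP overshoots and then falls back to a much lower level once the phosphodiesterase has been activated; with the gain set to zero, it simply rises to a sustained plateau.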

Whereas some responses mediated by cAMP occur within seconds (see Figure 
15-24), others depend on changes in the transcription of specific genes and take
hours to develop fully. In cells that secrete the peptide hormone somatostatin, 


[Figure 15-26 diagram labels: cyclic AMP; inactive PKA (regulatory subunit, inactive catalytic subunit); complex of cyclic AMP and regulatory subunits; active catalytic subunits]


Figure 15-26 The activation of cyclic-AMP-dependent
protein kinase (PKA).
The binding of cAMP to the regulatory
subunits of the PKA tetramer induces a
conformational change, causing these
subunits to dissociate from the catalytic
subunits, thereby activating the kinase
activity of the catalytic subunits. The
release of the catalytic subunits requires the
binding of more than two cAMP molecules
to the regulatory subunits in the tetramer.
This requirement greatly sharpens the
response of the kinase to changes in cAMP
concentration, as discussed earlier (see
Figure 15-16). Mammalian cells have at
least two types of PKAs: type I is mainly in
the cytosol, whereas type II is bound via its
regulatory subunits and special anchoring
proteins to the plasma membrane, nuclear
membrane, mitochondrial outer membrane,
and microtubules. In both types, once the
catalytic subunits are freed and active, they
can migrate into the nucleus (where they
can phosphorylate transcription regulatory
proteins), while the regulatory subunits
remain in the cytoplasm.
Figure 15-27 How a rise in intracellular
cyclic AMP concentration can alter
gene transcription. The binding of an
extracellular signal molecule to its GPCR
activates adenylyl cyclase via Gs and
thereby increases cAMP concentration in
the cytosol. This rise activates PKA, and
the released catalytic subunits of PKA
can then enter the nucleus, where they
phosphorylate the transcription regulatory
protein CREB. Once phosphorylated,
CREB recruits the coactivator CBP, which
stimulates gene transcription. In some
cases, at least, the inactive CREB protein is
bound to the cyclic AMP response element
(CRE) in DNA before it is phosphorylated
(not shown). See Movie 15.2.


for example, cAMP activates the gene that encodes this hormone. The regulatory
region of the somatostatin gene contains a short cis-regulatory sequence, called
the cyclic AMP response element (CRE), which is also found in the regulatory
region of many other genes activated by cAMP. A specific transcription regulator
called CRE-binding (CREB) protein recognizes this sequence. When PKA is activated
by cAMP, it phosphorylates CREB on a single serine; phosphorylated CREB
then recruits a transcriptional coactivator called CREB-binding protein (CBP),
which stimulates the transcription of the target genes (Figure 15-27). Thus, CREB
can transform a short cAMP signal into a long-term change in a cell, a process
that, in the brain, is thought to play an important part in some forms of learning
and memory.

Some G Proteins Signal Via Phospholipids 

Many GPCRs exert their effects through G proteins that activate the plasma-membrane-bound
enzyme phospholipase C-β (PLCβ). Table 15-2 lists some examples
of responses activated in this way. The phospholipase acts on a phosphorylated
inositol phospholipid (a phosphoinositide) called phosphatidylinositol
4,5-bisphosphate [PI(4,5)P2], which is present in small amounts in the inner half
of the plasma membrane lipid bilayer (Figure 15-28). Receptors that activate this
inositol phospholipid signaling pathway mainly do so via a G protein called Gq,
which activates phospholipase C-β in much the same way that Gs activates adenylyl
cyclase. The activated phospholipase then cleaves the PI(4,5)P2 to generate two
Table 15-2 Some Cell Responses in Which GPCRs Activate PLCβ

Target tissue     Signal molecule    Major response
Liver             Vasopressin        Glycogen breakdown
Pancreas          Acetylcholine      Amylase secretion
Smooth muscle     Acetylcholine      Muscle contraction
Blood platelets   Thrombin           Platelet aggregation


products: inositol 1,4,5-trisphosphate (IP3) and diacylglycerol. At this step, the
signaling pathway splits into two branches.

IP3 is a water-soluble molecule that leaves the plasma membrane and diffuses
rapidly through the cytosol. When it reaches the endoplasmic reticulum (ER), it
binds to and opens IP3-gated Ca2+-release channels (also called IP3 receptors)
in the ER membrane. Ca2+ stored in the ER is released through the open channels,
quickly raising the concentration of Ca2+ in the cytosol (Figure 15-29). The
increase in cytosolic Ca2+ propagates the signal by influencing the activity of
Ca2+-sensitive intracellular proteins, as we describe shortly.

At the same time that the IP3 produced by the hydrolysis of PI(4,5)P2 is increasing
the concentration of Ca2+ in the cytosol, the other cleavage product of the
PI(4,5)P2, diacylglycerol, is exerting different effects. It also acts as a second messenger,
but it remains embedded in the plasma membrane, where it has several
potential signaling roles. One of its major functions is to activate a protein kinase
called protein kinase C (PKC), so named because it is Ca2+-dependent. The initial
rise in cytosolic Ca2+ induced by IP3 alters the PKC so that it translocates from the
cytosol to the cytoplasmic face of the plasma membrane. There it is activated by
the combination of Ca2+, diacylglycerol, and the negatively charged membrane
phospholipid phosphatidylserine (see Figure 15-29). Once activated, PKC phosphorylates
target proteins that vary depending on the cell type. The principles are
the same as discussed earlier for PKA, although most of the target proteins are
different.

Diacylglycerol can be further cleaved to release arachidonic acid, which can
either act as a signal in its own right or be used in the synthesis of other small lipid
signal molecules called eicosanoids. Most vertebrate cell types make eicosanoids,
including prostaglandins, which have many biological activities. They participate
Figure 15-28 The hydrolysis of PI(4,5)P2
by phospholipase C-β. Two second
messengers are produced directly
from the hydrolysis of PI(4,5)P2: inositol
1,4,5-trisphosphate (IP3), which diffuses
through the cytosol and releases Ca2+
from the endoplasmic reticulum, and
diacylglycerol, which remains in the
membrane and helps to activate protein
kinase C (PKC; see Figure 15-29). There
are several classes of phospholipase
C: these include the β class, which is
activated by GPCRs; as we see later, the
γ class is activated by a class of enzyme-coupled
receptors called receptor tyrosine
kinases (RTKs).
in pain and inflammatory responses, for example, and many anti-inflammatory
drugs (such as aspirin, ibuprofen, and cortisone) act in part by inhibiting their 
synthesis. 


Figure 15-29 How GPCRs increase
cytosolic Ca2+ and activate protein
kinase C. The activated GPCR
stimulates the plasma-membrane-bound
phospholipase C-β (PLCβ) via a G protein
called Gq. The α subunit and βγ complex of
Gq are both involved in this activation. Two
second messengers are produced when
PI(4,5)P2 is hydrolyzed by activated PLCβ.
Inositol 1,4,5-trisphosphate (IP3) diffuses
through the cytosol and releases Ca2+
from the ER by binding to and opening
IP3-gated Ca2+-release channels (IP3
receptors) in the ER membrane. The large
electrochemical gradient for Ca2+ across
this membrane causes Ca2+ to escape
into the cytosol when the release channels
are opened. Diacylglycerol remains in the
plasma membrane and, together with
phosphatidylserine (not shown) and Ca2+,
helps to activate protein kinase C (PKC),
which is recruited from the cytosol to the
cytosolic face of the plasma membrane.
Of the 10 or more distinct isoforms of
PKC in humans, at least 4 are activated by
diacylglycerol (Movie 15.3).


Ca2+ Functions as a Ubiquitous Intracellular Mediator

Many extracellular signals, and not just those that work via G proteins, trigger an
increase in cytosolic Ca2+ concentration. In muscle cells, Ca2+ triggers contraction,
and in many secretory cells, including nerve cells, it triggers secretion. Ca2+
has numerous other functions in a variety of cell types. Ca2+ is such an effective
signaling mediator because its concentration in the cytosol is normally very low
(~10^-7 M), whereas its concentration in the extracellular fluid (~10^-3 M) and in
the lumen of the ER [and sarcoplasmic reticulum (SR) in muscle] is high. Thus,
there is a large gradient tending to drive Ca2+ into the cytosol across both the
plasma membrane and the ER or SR membrane. When a signal transiently opens
Ca2+ channels in these membranes, Ca2+ rushes into the cytosol, and the resulting
10-20-fold increase in the local Ca2+ concentration activates Ca2+-responsive
proteins in the cell.
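
To give a sense of how large this gradient is, the short calculation below uses the two concentrations quoted above. The Nernst equation it applies is a standard result that this passage does not itself introduce, and the temperature of 37 °C is an assumption; the function name is illustrative only.

```python
import math

R = 8.314    # gas constant, J/(mol*K)
F = 96485.0  # Faraday constant, C/mol


def nernst_potential_mV(conc_out, conc_in, charge, temp_kelvin=310.15):
    """Equilibrium (Nernst) potential, inside relative to outside, in millivolts."""
    return 1000.0 * R * temp_kelvin / (charge * F) * math.log(conc_out / conc_in)


ca_cytosol = 1e-7        # M, resting cytosolic Ca2+ (value from the text)
ca_extracellular = 1e-3  # M, extracellular Ca2+ (value from the text)

ratio = ca_extracellular / ca_cytosol
e_ca = nernst_potential_mV(ca_extracellular, ca_cytosol, charge=2)
print(f"concentration ratio (outside/inside): {ratio:.0f}")
print(f"Ca2+ equilibrium potential at 37 C:   {e_ca:.0f} mV")
```

The concentration difference alone is about ten-thousand-fold. Because the equilibrium potential for Ca2+ is strongly positive while the inside of a resting cell is electrically negative, both the concentration gradient and the membrane potential favor Ca2+ entry across the plasma membrane.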

Some stimuli, including membrane depolarization, membrane stretch,
and certain extracellular signals, activate Ca2+ channels in the plasma membrane,
resulting in Ca2+ influx from outside the cell. Other signals, including the
GPCR-mediated signals described earlier, act primarily through IP3 receptors to
stimulate Ca2+ release from intracellular stores in the ER (see Figure 15-29). The
ER membrane also contains a second type of regulated Ca2+ channel called the
ryanodine receptor (so called because it is sensitive to the plant alkaloid ryanodine),
which opens in response to rising Ca2+ levels and thereby amplifies the
Ca2+ signal, as we describe shortly.

Several mechanisms rapidly terminate the Ca2+ signal and are also responsible
for keeping the concentration of Ca2+ in the cytosol low in resting cells. Most
importantly, there are Ca2+-pumps in the plasma membrane and the ER membrane
that use the energy of ATP hydrolysis to pump Ca2+ out of the cytosol. Cells
such as muscle and nerve cells, which make extensive use of Ca2+ signaling, have
an additional Ca2+ transporter (a Na+-driven Ca2+ exchanger) in their plasma
membrane that couples the efflux of Ca2+ to the influx of Na+.


Feedback Generates Ca2+ Waves and Oscillations

The IP3 receptors and ryanodine receptors of the ER membrane have an important
feature: they are both stimulated by low to moderate cytoplasmic Ca2+ concentrations.
This Ca2+-induced calcium release (CICR) results in positive feedback,
which has a major impact on the properties of the Ca2+ signal. The importance of
this feedback is seen clearly in studies with Ca2+-sensitive fluorescent indicators,
such as aequorin or fura-2 (discussed in Chapter 9), which allow researchers to
monitor cytosolic Ca2+ in individual cells under a microscope (Figure 15-30 and
Movie 15.4).

When cells carrying a Ca2+ indicator are treated with a small amount of an
extracellular signal molecule that stimulates IP3 production, tiny bursts of Ca2+
are seen in one or more discrete regions of the cell. These Ca2+ puffs or sparks
reflect the local opening of small groups of IP3-gated Ca2+-release channels in the
ER. Because various Ca2+-binding proteins act as Ca2+ buffers and restrict the diffusion
of Ca2+, the signal often remains localized to the site where the Ca2+ enters
the cytosol. If the extracellular signal is sufficiently strong and persistent, however,
the local Ca2+ concentration can reach a sufficient level to activate nearby
IP3 receptors and ryanodine receptors, resulting in a regenerative wave of Ca2+
release that moves through the cytosol (Figure 15-31), much like an action potential
in an axon.
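
The idea of a regenerative wave can be sketched as a simple one-dimensional automaton. The rule set below is a deliberately crude assumption, not a biophysical model: release sites sit in a row, a resting site opens one time step after a neighboring site was open, and a site that has opened is refractory for a few steps. The function name and parameters are hypothetical.

```python
def regenerative_wave(n_sites=20, refractory_steps=3):
    """Return the time step at which each release site opens, for a wave started at site 0."""
    fired_at = [None] * n_sites
    refractory = [0] * n_sites
    open_now = [False] * n_sites
    open_now[0] = True  # initial local Ca2+ release at one end of the row
    fired_at[0] = 0
    refractory[0] = refractory_steps
    for step in range(1, n_sites + 1):
        open_next = [False] * n_sites
        for i in range(n_sites):
            neighbour_open = (i > 0 and open_now[i - 1]) or (
                i < n_sites - 1 and open_now[i + 1]
            )
            if neighbour_open and refractory[i] == 0:
                open_next[i] = True  # Ca2+ from a neighbor triggers release here
                fired_at[i] = step
        for i in range(n_sites):
            if open_next[i]:
                refractory[i] = refractory_steps  # just opened: becomes refractory
            elif refractory[i] > 0:
                refractory[i] -= 1                # recovering
        open_now = open_next
    return fired_at


times = regenerative_wave()
print("opening time of each site:", times)
```

In this toy rule set every site along the row opens in turn, so the signal arrives at the far end undiminished and at a constant speed, and the refractory period keeps the wave from running backward over sites that have just opened.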
Figure 15-30 The fertilization of an egg
by a sperm triggers a wave of cytosolic
Ca2+. This starfish egg was injected with
a Ca2+-sensitive fluorescent dye before
it was fertilized. A wave of cytosolic Ca2+
(red), released from the ER, sweeps
across the egg from the site of sperm
entry (arrow). This Ca2+ wave changes
the egg cell surface, preventing the
entry of other sperm, and it also initiates
embryonic development (Movie 15.5).
The initial increase in Ca2+ is thought to
be caused by a sperm-specific form of
PLC (PLCζ) that the sperm brings into the
egg cytoplasm when it fuses with the egg;
the PLCζ cleaves PI(4,5)P2 to produce
IP3, which releases Ca2+ from the egg
ER. The released Ca2+ stimulates further
Ca2+ release from the ER, producing the
spreading wave, as we explain in Figure
15-31. (Courtesy of Stephen A. Stricker.)


Figure 15-31 Positive and negative
feedback produce Ca2+ waves and
oscillations. This diagram shows IP3
receptors and ryanodine receptors on
a portion of the ER membrane: active
receptors are in green; inactive receptors
are in red. When a small amount of cytosolic
IP3 activates a cluster of IP3 receptors at
one site on the ER membrane (top), the
local release of Ca2+ promotes the opening
of nearby IP3 and ryanodine receptors,
resulting in more Ca2+ release. This
positive feedback (indicated by positive
signs) produces a regenerative wave of
Ca2+ release that spreads across the cell
(see Figure 15-30). These waves of Ca2+
release move more quickly across the cell
than would be possible by simple diffusion.
Also, unlike a diffusing burst of Ca2+ ions,
which will become more dilute as it spreads,
the regenerative wave produces a high
Ca2+ concentration across the entire cell.
Eventually, the local Ca2+ concentration
inactivates IP3 receptors and ryanodine
receptors (middle; indicated by red negative
signs), shutting down the Ca2+ release.
Ca2+-pumps reduce the local cytosolic Ca2+
concentration to its normal low levels. The
result is a Ca2+ spike: positive feedback
drives a rapid rise in cytosolic Ca2+, and
negative feedback sends it back down
again. The Ca2+ channels remain refractory
to further stimulation for some period of
time, delaying the generation of another
Ca2+ spike (bottom). Eventually, however,
the negative feedback wears off, allowing
IP3 to trigger another Ca2+ wave. The end
result is repeated Ca2+ oscillations (see
Figure 15-32). Under some conditions, these
oscillations can be seen as repeating narrow
waves of Ca2+ moving across the cell.
Another important property of IP3 receptors and ryanodine receptors is that
they are inhibited, after some delay, by high Ca2+ concentrations (a form of negative
feedback). Thus, the rise in Ca2+ in a stimulated cell leads to inhibition of Ca2+
release; because Ca2+ pumps remove the cytosolic Ca2+, the Ca2+ concentration
falls (see Figure 15-31). The decline in Ca2+ eventually relieves the negative feedback,
allowing cytosolic Ca2+ to rise again. As in other cases of delayed negative
feedback (see Figure 15-18), the result is an oscillation in the Ca2+ concentration.
These oscillations persist for as long as receptors are activated at the cell surface,
and their frequency reflects the strength of the extracellular stimulus (Figure
15-32). The frequency, amplitude, and breadth of oscillations can also be modulated
by other signaling mechanisms, such as phosphorylation, which influence
the Ca2+ sensitivity of Ca2+ channels or affect other components in the signaling
system.

The frequency of Ca2+ oscillations can be translated into a frequency-dependent
cell response. In some cases, the frequency-dependent response itself is also
oscillatory: in hormone-secreting pituitary cells, for example, stimulation by an
extracellular signal induces repeated Ca2+ spikes, each of which is associated with
a burst of hormone secretion. In other cases, the frequency-dependent response
is non-oscillatory: in some types of cells, for instance, one frequency of Ca2+ spikes
activates the transcription of one set of genes, while a higher frequency activates
the transcription of a different set. How do cells sense the frequency of Ca2+ spikes
and change their response accordingly? The mechanism presumably depends on
Ca2+-sensitive proteins that change their activity as a function of Ca2+-spike frequency.
A protein kinase that acts as a molecular memory device seems to have
this remarkable property, as we discuss next.
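
One way a protein could, in principle, report spike frequency is by retaining some activity between spikes. The sketch below is a generic "leaky integrator" toy and makes no claim about the actual mechanism of any particular kinase: it assumes that each spike activates a fixed fraction of the still-inactive enzyme and that activity decays exponentially between spikes. All names and parameter values are hypothetical.

```python
import math


def activity_after_spikes(interval_s, n_spikes=50, gain=0.2, decay_time_s=10.0):
    """Activity of a hypothetical Ca2+-sensitive enzyme just after the last of n_spikes."""
    activity = 0.0
    keep = math.exp(-interval_s / decay_time_s)  # fraction surviving one inter-spike interval
    for _ in range(n_spikes):
        activity *= keep                     # decay since the previous spike
        activity += gain * (1.0 - activity)  # activation by the current spike
    return activity


for interval in (60.0, 5.0, 1.0):  # seconds between Ca2+ spikes
    level = activity_after_spikes(interval)
    print(f"one spike every {interval:>4.0f} s -> activity {level:.2f}")
```

When spikes are far apart, the activity decays almost completely between them and never exceeds what a single spike produces; when they are closely spaced, activity accumulates from spike to spike, so the final level rises with spike frequency.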

Ca2+/Calmodulin-Dependent Protein Kinases Mediate Many
Responses to Ca2+ Signals

Various Ca2+-binding proteins help to relay the cytosolic Ca2+ signal. The most
important is calmodulin, which is found in all eukaryotic cells and can constitute
as much as 1% of a cell's total protein mass. Calmodulin functions as a multipurpose
intracellular Ca2+ receptor, governing many Ca2+-regulated processes.
It consists of a highly conserved, single polypeptide chain with four high-affinity
Ca2+-binding sites (Figure 15-33A). When activated by Ca2+ binding, it undergoes
a conformational change. Because two or more Ca2+ ions must bind before


[Figure 15-32 panel labels: vasopressin concentration 0.4 nM, 0.6 nM, 0.9 nM]



Figure 15-32 Vasopressin-induced Ca2+
oscillations in a liver cell. The cell was
loaded with the Ca2+-sensitive protein
aequorin and then exposed to increasing
concentrations of the peptide signal
molecule vasopressin, which activates a
GPCR and thereby PLCβ (see Table 15-2).
Note that the frequency of the Ca2+ spikes
increases with an increasing concentration
of vasopressin but that the amplitude of
the spikes is not affected. Each spike lasts
about 7 seconds. (Adapted from
N.M. Woods, K.S.R. Cuthbertson and
P.H. Cobbold, Nature 319:600-602,
1986. With permission from Macmillan
Publishers Ltd.)
calmodulin adopts its active conformation, the protein displays a sigmoidal
response to increasing concentrations of Ca2+ (see Figure 15-16).
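
How strongly a multi-site binding requirement sharpens a response can be illustrated with the Hill equation, used here only as a common phenomenological approximation. The Hill coefficients below are hypothetical illustrations; the text does not assign a value to calmodulin.

```python
def hill_response(conc, k_half, n):
    """Fraction of maximal activation for a Hill-type response with coefficient n."""
    return conc**n / (k_half**n + conc**n)


def fold_change_10_to_90(n):
    """Fold increase in concentration needed to go from 10% to 90% activation.

    From the Hill equation, conc = k_half * (f / (1 - f)) ** (1 / n), so the
    ratio of the 90% and 10% concentrations is (9 / (1 / 9)) ** (1 / n) = 81 ** (1 / n).
    """
    return 81.0 ** (1.0 / n)


for n in (1, 2, 4):  # hypothetical Hill coefficients
    print(f"n = {n}: {fold_change_10_to_90(n):.0f}-fold change in concentration "
          f"spans 10% to 90% activation")
```

A response with n = 1 needs an 81-fold change in ligand concentration to go from 10% to 90% of maximum, whereas higher coefficients compress the same transition into a much narrower concentration range, which is what a sigmoidal, switch-like response means in practice.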

The allosteric activation of calmodulin by Ca2+ is analogous to the activation
of PKA by cyclic AMP, except that Ca2+/calmodulin has no enzymatic activity
itself but instead acts by binding to and activating other proteins. In some cases,
calmodulin serves as a permanent regulatory subunit of an enzyme complex, but
usually the binding of Ca2+ instead enables calmodulin to bind to various target
proteins in the cell to alter their activity.

When an activated molecule of Ca2+/calmodulin binds to its target protein,
the calmodulin further changes its conformation, the nature of which depends on
the specific target protein (Figure 15-33B). Among the many targets calmodulin
regulates are enzymes and membrane transport proteins. As one example,
Ca2+/calmodulin binds to and activates the plasma membrane Ca2+-pump that uses
ATP hydrolysis to pump Ca2+ out of cells. Thus, whenever the concentration of
Ca2+ in the cytosol rises, the pump is activated, which helps to return the cytosolic
Ca2+ concentration to its resting level.

Many effects of Ca2+, however, are more indirect and are mediated by protein
phosphorylations catalyzed by a family of protein kinases called
Ca2+/calmodulin-dependent kinases (CaM-kinases). Some CaM-kinases phosphorylate
transcription regulators, such as the CREB protein (see Figure 15-27), and in this way
activate or inhibit the transcription of specific genes.

One of the best-studied CaM-kinases is CaM-kinase II, which is found in most
animal cells but is especially enriched in the nervous system. It constitutes up
to 2% of the total protein mass in some regions of the brain, and it is highly
concentrated in synapses. CaM-kinase II has several remarkable properties. To begin
with, it has a spectacular quaternary structure: twelve copies of the enzyme are
assembled into a stacked pair of rings, with kinase domains on the outside linked
to a central hub (Figure 15-34). This structure helps the enzyme function as a
molecular memory device, switching to an active state when exposed to
Ca2+/calmodulin and then remaining active even after the Ca2+ signal has decayed.
This is because adjacent kinase subunits can phosphorylate each other (a process
called autophosphorylation) when Ca2+/calmodulin activates them (Figure
15-34). Once a kinase subunit is autophosphorylated, it remains active even in
the absence of Ca2+, thereby prolonging the duration of the kinase activity beyond
that of the initial activating Ca2+ signal. The enzyme maintains this activity until a
protein phosphatase removes the autophosphorylation and shuts the kinase off.
CaM-kinase II activation can thereby serve as a memory trace of a prior Ca2+ pulse,
and it seems to have a role in some types of memory and learning in the vertebrate
nervous system. Mutant mice that lack a brain-specific form of the enzyme have
specific defects in their ability to remember where things are.


Figure 15-33 The structure of Ca2+/calmodulin. (A) The molecule has a
dumbbell shape, with two globular ends, which can bind to many target proteins.
The globular ends are connected by a long, exposed α helix, which allows the
protein to adopt a number of different conformations, depending on the target
protein it interacts with. Each globular head has two Ca2+-binding sites
(Movie 15.6). (B) Shown is the major structural change that occurs in
Ca2+/calmodulin when it binds to a target protein (in this example, a peptide that
consists of the Ca2+/calmodulin-binding domain of a Ca2+/calmodulin-dependent
protein kinase). Note that the Ca2+/calmodulin has "jack-knifed" to surround the
peptide. When it binds to other targets, it can adopt different conformations.
(A, based on x-ray crystallographic data from Y.S. Babu et al., Nature 315:37-40,
1985. With permission from Macmillan Publishers Ltd; B, based on x-ray
crystallographic data from W.E. Meador, A.R. Means, and F.A. Quiocho, Science
257:1251-1255, 1992, and on nuclear magnetic resonance (NMR) spectroscopy data
from M. Ikura et al., Science 256:632-638, 1992.)

Figure 15-34 The stepwise activation of CaM-kinase II. (A) Each CaM-kinase II protein has two major domains: an
amino-terminal kinase domain (green) and a carboxyl-terminal hub domain (blue), linked by a regulatory segment. Six CaM-kinase
II proteins are assembled into a giant ring in which the hub domains interact tightly to produce a central structure that is
surrounded by kinase domains. The complete enzyme contains two stacked rings, for a total of 12 kinase proteins, but only one
ring is shown here for clarity. When the enzyme is inactive, the ring exists in a dynamic equilibrium between two states. The first
(upper left) is a compact state, in which the kinase domains interact with the hub, so that the regulatory segment is buried in the
kinase active site and thereby blocks catalytic activity. In the second inactive state (upper middle), a kinase domain has popped
out and is linked to the central hub by its regulatory segment, which continues to inhibit the kinase but is now accessible to
Ca2+/calmodulin. If present, Ca2+/calmodulin will bind the regulatory segment and prevent it from inhibiting the kinase, thereby
locking the kinase in an active state (upper right). If the adjacent kinase subunit also pops out from the hub, it will also be
activated by Ca2+/calmodulin, and the two kinases will then phosphorylate each other on their regulatory segments (lower
right). This autophosphorylation further activates the enzyme. It also prolongs the activity of the enzyme in two ways. First,
it traps the bound Ca2+/calmodulin so that it does not dissociate from the enzyme until cytosolic Ca2+ levels return to basal
values for at least 10 seconds (not shown). Second, it converts the enzyme to a Ca2+-independent form, so that the kinase
remains active even after the Ca2+/calmodulin dissociates from it (lower left). This activity continues until the action of a protein
phosphatase overrides the autophosphorylation activity of CaM-kinase II. (B) This structural model of the enzyme is based on
x-ray crystallographic analysis.

The remarkable dodecameric structure of the enzyme allows it to achieve a broad range of intermediate activity states in
response to different Ca2+ oscillation frequencies: higher frequencies tend to cause more subunits in the enzyme to reach the
phosphorylated active state (see Figure 15-35). The behavior of CaM-kinase II is also controlled by the length of the linker
segment between the kinase and hub domains. The linker is longer in some isoforms of the enzyme; in these isoforms, the
kinase domains tend to pop out of the ring more frequently, making the enzyme more sensitive to Ca2+. These and other mechanisms
allow the cell to tailor the responsiveness of the enzyme to the needs of different types of neurons. (Adapted from L.H. Chao et
al., Cell 146:732-745, 2011. PDB code: 3SOA.)

Another remarkable property of CaM-kinase II is that the enzyme can use its
intrinsic memory mechanism to decode the frequency of Ca2+ oscillations. This
property is thought to be especially important at a nerve cell synapse, where
changes in intracellular Ca2+ levels in a postsynaptic cell as a result of neural
activity can lead to long-term changes in the subsequent effectiveness of that synapse
(discussed in Chapter 11). When CaM-kinase II is exposed to both a protein
phosphatase and repetitive pulses of Ca2+/calmodulin at different frequencies that
mimic those observed in stimulated cells, the enzyme's activity increases steeply
as a function of pulse frequency (Figure 15-35).
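
The ratchet-like behavior behind this frequency decoding can be sketched with a deliberately simple toy model. Everything in it is an assumption made for illustration: the function name, the pulse length, and the two rate constants are hypothetical, and the model treats subunits as independent, so it does not attempt to reproduce the steepness of the real response.

```python
def residual_activity(freq_hz, n_pulses=50, pulse_s=0.2,
                      k_phos=1.0, k_dephos=0.2, dt=0.001):
    """Fraction of autophosphorylated subunits left just before the next pulse.

    Hypothetical toy model: during a Ca2+/calmodulin pulse, unphosphorylated
    subunits are phosphorylated at rate k_phos; a phosphatase removes the
    phosphate at rate k_dephos at all times. Rates (1/s) are assumed values.
    """
    period_s = 1.0 / freq_hz
    if pulse_s >= period_s:
        raise ValueError("pulse must be shorter than the interval between pulses")
    steps_per_period = round(period_s / dt)
    steps_per_pulse = round(pulse_s / dt)
    p = 0.0  # start with a fully dephosphorylated enzyme
    for _ in range(n_pulses):
        for step in range(steps_per_period):
            pulse_on = step < steps_per_pulse
            gain = k_phos * (1.0 - p) if pulse_on else 0.0
            p += dt * (gain - k_dephos * p)  # forward Euler step
    return p


if __name__ == "__main__":
    for freq in (0.1, 0.5, 1.0, 2.5):
        print(f"{freq:4.1f} Hz -> residual activity {residual_activity(freq):.2f}")
```

At low pulse frequencies the phosphatase has time to erase most of the phosphorylation between pulses, whereas at higher frequencies the residual phosphorylation accumulates from pulse to pulse.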

Some G Proteins Directly Regulate Ion Channels 

G proteins do not act exclusively by regulating the activity of membrane-bound
enzymes that alter the concentration of cyclic AMP or Ca2+ in the cytosol. The
α subunit of one type of G protein (called G12), for example, activates a guanine
nucleotide exchange factor (GEF) that activates a monomeric GTPase of the Rho
family (discussed later and in Chapter 16), which regulates the actin cytoskeleton.

In some other cases, G proteins directly activate or inactivate ion channels in
the plasma membrane of the target cell, thereby altering the ion permeability,
and hence the electrical excitability, of the membrane. As an example,
acetylcholine released by the vagus nerve reduces the heart rate (see Figure 15-5B). This
effect is mediated by a special class of acetylcholine receptors that activate the
Gi protein discussed earlier. Once activated, the α subunit of Gi inhibits adenylyl
cyclase (as described previously), while the βγ subunits bind to K+ channels in
the heart muscle cell plasma membrane and open them. The opening of these
K+ channels makes it harder to depolarize the cell and thereby contributes to the
inhibitory effect of acetylcholine on the heart. (These acetylcholine receptors,
which can be activated by the fungal alkaloid muscarine, are called muscarinic
acetylcholine receptors to distinguish them from the very different nicotinic
acetylcholine receptors, which are ion-channel-coupled receptors on skeletal muscle
and nerve cells that can be activated by the binding of nicotine, as well as by
acetylcholine.)

Other G proteins regulate the activity of ion channels less directly, either by
stimulating channel phosphorylation (by PKA, PKC, or CaM-kinase, for example)
or by causing the production or destruction of cyclic nucleotides that directly
activate or inactivate ion channels. These cyclic-nucleotide-gated ion channels have a
crucial role in both smell (olfaction) and vision, as we now discuss.


Figure 15-35 CaM-kinase II as a frequency decoder of Ca2+ oscillations.
(A) At low frequencies of Ca2+ spikes, the enzyme becomes inactive after each
spike, as the autophosphorylation induced by Ca2+/calmodulin binding does not
maintain the enzyme's activity long enough for the enzyme to remain active until
the next Ca2+ spike arrives. (B) At higher spike frequencies, however, the enzyme
fails to inactivate completely between Ca2+ spikes, so its activity ratchets up
with each spike. If the spike frequency is high enough, this progressive increase
in enzyme activity will continue until the enzyme is autophosphorylated on all
subunits and is therefore maximally activated. Although not shown, once enough of
its subunits are autophosphorylated, the enzyme can be maintained in a highly
active state even with a relatively low frequency of Ca2+ spikes (a form of cell
memory). The binding of Ca2+/calmodulin to the enzyme is enhanced by the
CaM-kinase II autophosphorylation (an additional form of positive feedback),
helping to generate a more switchlike response to repeated Ca2+ spikes. (From
P.I. Hanson, T. Meyer, L. Stryer, and H. Schulman, Neuron 12:943-956, 1994.
With permission from Elsevier.)


Smell and Vision Depend on GPCRs That Regulate Ion Channels

Humans can distinguish more than 10,000 distinct smells, which they detect using
specialized olfactory receptor neurons in the lining of the nose. These cells use
specific GPCRs called olfactory receptors to recognize odors; the receptors are
displayed on the surface of the modified cilia that extend from each cell (Figure
15-36). The receptors act through cAMP. When stimulated by odorant binding,
they activate an olfactory-specific G protein (known as Golf), which in turn
activates adenylyl cyclase. The resulting increase in cAMP opens cyclic-AMP-gated
cation channels, thereby allowing an influx of Na+, which depolarizes the
olfactory receptor neuron and initiates a nerve impulse that travels along its axon to
the brain.

There are about 1000 different olfactory receptors in a mouse and about 350 in
a human, each encoded by a different gene and each recognizing a different set
of odorants. Each olfactory receptor neuron produces only one of these receptors;
the neuron responds to a specific set of odorants by means of the specific
receptor it displays, and each odorant activates its own characteristic set of olfactory
receptor neurons. The same receptor also helps direct the elongating axon of each
developing olfactory neuron to the specific target neurons that it will connect to
in the brain. A different set of GPCRs acts in a similar way in some vertebrates to
mediate responses to pheromones, chemical signals detected in a different part of
the nose that are used in communication between members of the same species.
Humans, however, are thought to lack functional pheromone receptors.
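
A short calculation suggests how a few hundred receptor types could, in principle, encode many more odors when each odorant activates a characteristic set of receptors. The sketch below is purely illustrative arithmetic: it counts possible receptor sets and makes the simplifying assumption (not made in the text) that every set of a given size is a usable code. The function name is hypothetical.

```python
from math import comb

HUMAN_RECEPTOR_TYPES = 350  # approximate number quoted in the text


def receptor_combinations(n_receptor_types, n_active):
    """Number of distinct sets containing exactly n_active receptor types."""
    return comb(n_receptor_types, n_active)


if __name__ == "__main__":
    for n_active in (1, 2, 3):
        count = receptor_combinations(HUMAN_RECEPTOR_TYPES, n_active)
        print(f"sets of {n_active} receptor type(s): {count}")
```

Even sets of just two receptor types already outnumber the 10,000 smells mentioned earlier, so a combinatorial code has ample capacity under this assumption.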

Vertebrate vision employs a similarly elaborate, highly sensitive,
signal-detection process. Cyclic-nucleotide-gated ion channels are also involved, but the
crucial cyclic nucleotide is cyclic GMP (Figure 15-37) rather than cAMP. As with
cAMP, a continuous rapid synthesis (by guanylyl cyclase) and rapid degradation
(by cyclic GMP phosphodiesterase) controls the concentration of cyclic GMP in
the cytosol.

In visual transduction responses, which are the fastest G-protein-mediated
responses known in vertebrates, the receptor activation stimulated by light causes
a fall rather than a rise in the level of the cyclic nucleotide. The pathway has been
especially well studied in rod photoreceptors (rods) in the vertebrate retina. Rods
are responsible for noncolor vision in dim light, whereas cone photoreceptors
(cones) are responsible for color vision in bright light. A rod photoreceptor is a
highly specialized cell with outer and inner segments, a cell body, and a
synaptic region where the rod passes a chemical signal to a retinal nerve cell (Figure
15-38). This nerve cell relays the signal to another nerve cell in the retina, which
in turn relays it to the brain.

The phototransduction apparatus is in the outer segment of the rod, which
contains a stack of discs, each formed by a closed sac of membrane that is densely
packed with photosensitive rhodopsin molecules. The plasma membrane
surrounding the outer segment contains cyclic-GMP-gated cation channels. Cyclic
GMP bound to these channels keeps them open in the dark. Paradoxically, light
causes a hyperpolarization (which inhibits synaptic signaling) rather than a
depolarization of the plasma membrane (which would stimulate synaptic
signaling). Hyperpolarization (that is, the membrane potential moves to a more
negative value, as discussed in Chapter 11) results because the light-induced activation
of rhodopsin molecules in the disc membrane decreases the cyclic GMP
concentration and closes the cation channels in the surrounding plasma membrane
(Figure 15-39).


Figure 15-36 Olfactory receptor neurons. (A) A section of olfactory
epithelium in the nose. Olfactory receptor neurons possess modified cilia, which
project from the surface of the epithelium and contain the olfactory receptors,
as well as the signal transduction machinery. The axon, which extends from the
opposite end of the receptor neuron, conveys electrical signals to the brain when
an odorant activates the cell to produce an action potential. In rodents, at least,
the basal cells act as stem cells, producing new receptor neurons throughout life,
to replace the neurons that die. (B) A scanning electron micrograph of the cilia
on the surface of an olfactory neuron. (B, from E.E. Morrison and R.M. Costanzo,
J. Comp. Neurol. 297:1-13, 1990. With permission from Wiley-Liss.)

Figure 15-38 A rod photoreceptor cell. There are about 1000 discs in
the outer segment. The disc membranes are not connected to the plasma
membrane. The inner and outer segments are specialized parts of a primary
cilium (discussed in Chapter 16). A primary cilium extends from the surface of
most vertebrate cells, where it serves as a signaling organelle.


Rhodopsin is a member of the GPCR family, but the activating extracellular
signal is not a molecule but a photon of light. Each rhodopsin molecule
contains a covalently attached chromophore, 11-cis retinal, which isomerizes almost
instantaneously to all-trans retinal when it absorbs a single photon. The
isomerization alters the shape of the retinal, forcing a conformational change in the
protein (opsin). The activated rhodopsin molecule then alters the conformation of
the G protein transducin (Gt), causing the transducin α subunit to activate cyclic
GMP phosphodiesterase. The phosphodiesterase then hydrolyzes cyclic GMP,
so that cyclic GMP levels in the cytosol fall. This drop in cyclic GMP
concentration decreases the amount of cyclic GMP bound to the plasma membrane cation
channels, allowing more of these cyclic-GMP-sensitive channels to close. In this
way, the signal quickly passes from the disc membrane to the plasma membrane,
and a light signal is converted into an electrical one, through a hyperpolarization
of the rod cell plasma membrane.

Rods use several negative feedback loops to allow the cells to revert quickly to a
resting, dark state in the aftermath of a flash of light, a requirement for perceiving
the shortness of the flash. A rhodopsin-specific protein kinase called rhodopsin
kinase (RK) phosphorylates the cytosolic tail of activated rhodopsin on multiple
serines, partially inhibiting the ability of the rhodopsin to activate transducin. An
inhibitory protein called arrestin (discussed later) then binds to the
phosphorylated rhodopsin, further inhibiting rhodopsin's activity. Mice or humans with a
mutation that inactivates the gene encoding RK have a prolonged light response.

Figure 15-39 The response of a rod photoreceptor cell to light. Rhodopsin
molecules in the outer-segment discs absorb photons. Photon absorption closes
cation channels in the plasma membrane, which hyperpolarizes the membrane
and reduces the rate of neurotransmitter release from the synaptic region. Because
the neurotransmitter inhibits many of the postsynaptic retinal neurons, illumination
serves to free the neurons from inhibition and thus, in effect, excites them. The
neural connections of the retina lie between the light source and the outer segment,
and so the light must pass through the synapses and rod cell nucleus to reach the
light sensors.

At the same time as arrestin shuts off rhodopsin, an RGS protein (discussed
earlier) binds to activated transducin, stimulating the transducin to hydrolyze its
bound GTP to GDP, which returns transducin to its inactive state. In addition, the
cation channels that close in response to light are permeable to Ca2+, as well as
to Na+, so that when they close, the normal influx of Ca2+ is inhibited, causing
the Ca2+ concentration in the cytosol to fall. The decrease in Ca2+ concentration
stimulates guanylyl cyclase to replenish the cyclic GMP, rapidly returning its level
to where it was before the light was switched on. A specific Ca2+-sensitive protein
mediates the activation of guanylyl cyclase in response to the fall in Ca2+ levels. In
contrast to calmodulin, this protein is inactive when Ca2+ is bound to it and active
when it is Ca2+-free. It therefore stimulates the cyclase when Ca2+ levels fall
following a light response.
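
The logic of this Ca2+-dependent feedback loop can be explored with a minimal two-variable model. The sketch below is an assumption-laden toy, not a description of the real kinetics: all quantities are dimensionless, the dark state is normalized to 1.0, and the rate constants, the form of the Ca2+ inhibition term, and the function name are hypothetical choices made for illustration.

```python
def steady_light_cgmp(feedback, light_gain=3.0, k_ca=2.0, t_end_s=20.0, dt=0.001):
    """Return cGMP (relative to its dark level of 1.0) after prolonged light.

    Toy model with assumed, dimensionless parameters:
      d[cGMP]/dt = cyclase - light_gain * [cGMP]
      d[Ca]/dt   = k_ca * ([cGMP] - [Ca])
    """
    cgmp, ca = 1.0, 1.0  # dark steady state
    for _ in range(round(t_end_s / dt)):
        # Guanylyl cyclase: inhibited by Ca2+ when feedback is on, constant otherwise.
        cyclase = 2.0 / (1.0 + ca**2) if feedback else 1.0
        # Light-activated phosphodiesterase hydrolyzes cGMP light_gain times faster.
        d_cgmp = cyclase - light_gain * cgmp
        # Ca2+ enters through cGMP-gated channels (influx ~ cGMP) and is pumped out.
        d_ca = k_ca * (cgmp - ca)
        cgmp += dt * d_cgmp  # forward Euler step
        ca += dt * d_ca
    return cgmp


if __name__ == "__main__":
    print(f"without feedback: {steady_light_cgmp(False):.3f}")
    print(f"with feedback:    {steady_light_cgmp(True):.3f}")
```

In this toy model the fall in Ca2+ relieves the inhibition of the cyclase, so cGMP in steady light settles at a higher level than it would without the feedback, which is the qualitative effect described above.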

Negative feedback mechanisms do more than just return the rod to its resting
state after a transient light flash; they also help the rod to adapt, stepping down
the response when the rod is exposed to light continuously. Adaptation, as we
discussed earlier, allows the receptor cell to function as a sensitive detector of
changes in stimulus intensity over an enormously wide range of baseline levels
of stimulation. It is why we can see faint stars in a dark sky, or a camera flash in
bright sunlight.

The various trimeric G proteins we have discussed in this chapter fall into four
major families, as summarized in Table 15-3.

TABLE 15-3 Four Major Families of Trimeric G Proteins*

Family  Some family      Subunits that    Some functions
        members          mediate action
I       Gs               α                Activates adenylyl cyclase; activates Ca2+ channels
        Golf             α                Activates adenylyl cyclase in olfactory sensory neurons
II      Gi               α                Inhibits adenylyl cyclase
                         βγ               Activates K+ channels
        Go               βγ               Activates K+ channels; inactivates Ca2+ channels
                         α and βγ         Activates phospholipase C-β
        Gt (transducin)  α                Activates cyclic GMP phosphodiesterase in vertebrate
                                          rod photoreceptors
III     Gq               α                Activates phospholipase C-β
IV      G12/13           α                Activates Rho family monomeric GTPases (via Rho-GEF)
                                          to regulate the actin cytoskeleton

*Families are determined by amino acid sequence relatedness of the α subunits. Only selected
examples are included. About 20 α subunits and at least 6 β subunits and 11 γ subunits have
been described in humans.

Nitric Oxide Is a Gaseous Signaling Mediator That Passes
Between Cells

Signaling molecules like cyclic nucleotides and calcium are hydrophilic small
molecules that generally act within the cell where they are produced. Some
signaling molecules, however, are hydrophobic enough, small enough, or both, to
pass readily across the plasma membrane and carry signals to nearby cells. An
important and remarkable example is the gas nitric oxide (NO), which acts as a
signal molecule in many tissues of both animals and plants.

In mammals, one of NO's many functions is to relax smooth muscle in the walls
of blood vessels. The neurotransmitter acetylcholine stimulates NO synthesis by
activating a GPCR on the membranes of the endothelial cells that line the interior
of the vessel. The activated receptor triggers IP3 synthesis and Ca2+ release (see
Figure 15-29), leading to stimulation of an enzyme that synthesizes NO. Because
dissolved NO passes readily across membranes, it diffuses out of the cell where it
is produced and into neighboring smooth muscle cells, where it causes muscle
relaxation and thereby vessel dilation (Figure 15-40). It acts only locally because
it has a short half-life (about 5-10 seconds) in the extracellular space before
oxygen and water convert it to nitrates and nitrites.
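
To get a feel for why such a short half-life restricts NO to local action, one can estimate how much of the gas survives over time. The sketch below assumes simple first-order (exponential) decay, which the text does not state explicitly; the function name is hypothetical, and only the 5-10 second half-life range comes from the text.

```python
def fraction_remaining(elapsed_s, half_life_s):
    """Fraction of NO left after elapsed_s seconds, assuming first-order decay."""
    return 0.5 ** (elapsed_s / half_life_s)


if __name__ == "__main__":
    for half_life_s in (5.0, 10.0):  # range of half-lives quoted in the text
        for elapsed_s in (1.0, 10.0, 30.0):
            left = fraction_remaining(elapsed_s, half_life_s)
            print(f"half-life {half_life_s:4.1f} s, after {elapsed_s:4.1f} s: "
                  f"{100 * left:5.1f}% remains")
```

Under this assumption, only a small fraction of the NO released at a given moment is still present half a minute later, so the signal cannot travel far from its source.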

The effect of NO on blood vessels provides an explanation for the mechanism of
action of nitroglycerine, which has been used for about 100 years to treat patients
with angina (pain resulting from inadequate blood flow to the heart muscle). The
nitroglycerine is converted to NO, which relaxes blood vessels. This reduces the
workload on the heart and, as a consequence, reduces the oxygen requirement of
the heart muscle.

NO is made by the deamination of the amino acid arginine, catalyzed by
enzymes called NO synthases (NOS) (see Figure 15-40). The NOS in endothelial
cells is called eNOS, while that in nerve and muscle cells is called nNOS. Both
eNOS and nNOS are stimulated by Ca2+. Macrophages, by contrast, make yet
another NOS, called inducible NOS (iNOS), that is constitutively active but
synthesized only when the cells are activated, usually in response to an infection.

In some target cells, including smooth muscle cells, NO binds reversibly to
iron in the active site of guanylyl cyclase, stimulating synthesis of cyclic GMP. NO
can increase cyclic GMP in the cytosol within seconds, because the normal rate of
turnover of cyclic GMP is high: rapid degradation to GMP by a phosphodiesterase
constantly balances the production of cyclic GMP by guanylyl cyclase. The drug
Viagra® and its relatives inhibit the cyclic GMP phosphodiesterase in the penis,
thereby increasing the amount of time that cyclic GMP levels remain elevated in
the smooth muscle cells of penile blood vessels after NO production is induced
by local nerve terminals. The cyclic GMP, in turn, keeps the blood vessels relaxed
and thereby the penis erect. NO can also signal cells independently of cyclic
GMP. It can, for example, alter the activity of an intracellular protein by covalently
nitrosylating thiol (-SH) groups on specific cysteines in the protein.
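
The link between high turnover and a fast response can be made concrete with a one-line kinetic model, d[cGMP]/dt = synthesis - k_deg x [cGMP]. The sketch below is an illustration under stated assumptions: the function name, the tenfold stimulation of the cyclase, the arbitrary concentration units, and the two degradation rate constants are hypothetical. It compares two cells with the same resting cyclic GMP level but different turnover rates.

```python
import math


def cgmp_after_no(t_s, k_deg, fold_increase=10.0, resting_level=1.0):
    """cGMP level t_s seconds after NO raises cyclase activity fold_increase-fold.

    Analytic solution of d[cGMP]/dt = synthesis - k_deg * [cGMP]. The resting
    synthesis rate is set to k_deg * resting_level, so every value of k_deg
    gives the same resting level and only the turnover rate differs.
    """
    resting_synthesis = k_deg * resting_level
    new_steady_state = fold_increase * resting_synthesis / k_deg
    return new_steady_state + (resting_level - new_steady_state) * math.exp(-k_deg * t_s)


if __name__ == "__main__":
    for label, k_deg in (("fast turnover", 1.0), ("slow turnover", 0.1)):
        level = cgmp_after_no(2.0, k_deg)
        print(f"{label} (k_deg = {k_deg}/s): cGMP after 2 s = {level:.2f}")
```

Both cells head toward the same new steady state, but the cell with rapid turnover gets most of the way there within a couple of seconds. In the same model, a smaller degradation rate constant also means a slower return to the resting level once synthesis drops again.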


Figure 15-40 The role of nitric oxide (NO) in smooth muscle relaxation in a blood vessel
wall. (A) Simplified cross section of a blood vessel, showing the endothelial cells lining the
lumen and the smooth muscle cells around them (panel labels: smooth muscle cell, basal
lamina). (B) The neurotransmitter acetylcholine stimulates blood vessel dilation by
activating a GPCR (the muscarinic acetylcholine receptor) on the surface of endothelial
cells. This receptor activates a G protein, Gq, thereby stimulating IP3 synthesis and Ca2+
release by the mechanisms illustrated in Figure 15-29. Ca2+ activates nitric oxide synthase,
causing the endothelial cells to produce NO from arginine. The NO diffuses out of the
endothelial cells and into the neighboring smooth muscle cells, where it activates guanylyl
cyclase to produce cyclic GMP. The cyclic GMP triggers a response that causes the smooth
muscle cells to relax, increasing blood flow through the vessel.

Second Messengers and Enzymatic Cascades Amplify Signals

Despite the differences in molecular details, the different intracellular signaling
pathways that GPCRs trigger share certain features and obey similar general
principles. They depend on relay chains of intracellular signaling proteins and
second messengers. These relay chains provide numerous opportunities for
amplifying the responses to extracellular signals. In the visual transduction cascade, for
example, a single activated rhodopsin molecule catalyzes the activation of
hundreds of molecules of transducin at a rate of about 1000 transducin molecules per
second. Each activated transducin molecule activates a molecule of cyclic GMP
phosphodiesterase, each of which hydrolyzes about 4000 molecules of cyclic
GMP per second. This catalytic cascade lasts for about 1 second and results in
the hydrolysis of more than 10^5 cyclic GMP molecules for a single quantum of
light absorbed, and the resulting drop in the concentration of cyclic GMP in turn
transiently closes hundreds of cation channels in the plasma membrane (Figure
15-41). As a result, a rod cell can respond to even a single photon of light in a way
that is highly reproducible in its timing and magnitude.
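
The numbers in this cascade can be checked with a little arithmetic. The sketch below uses only figures quoted in the text and in Figure 15-41 (500 activated transducins, one phosphodiesterase per transducin, about 4000 cyclic GMP molecules hydrolyzed per phosphodiesterase per second, a duration of about 1 second, 250 channels closed). It assumes that every phosphodiesterase is active for the whole second, so it gives an upper-bound estimate rather than a measured yield; the function names are hypothetical.

```python
TRANSDUCINS_PER_RHODOPSIN = 500   # Figure 15-41
PDE_PER_TRANSDUCIN = 1            # each transducin activates one phosphodiesterase
CGMP_PER_PDE_PER_SECOND = 4000    # hydrolysis rate quoted in the text
CASCADE_DURATION_S = 1.0          # approximate duration quoted in the text
CHANNELS_CLOSED = 250             # Figure 15-41


def cgmp_hydrolyzed_upper_bound(transducins=TRANSDUCINS_PER_RHODOPSIN,
                                duration_s=CASCADE_DURATION_S):
    """Upper bound on cGMP molecules hydrolyzed per absorbed photon."""
    active_pde = transducins * PDE_PER_TRANSDUCIN
    return active_pde * CGMP_PER_PDE_PER_SECOND * duration_s


def ions_blocked_per_channel(total_ions_per_s, channels=CHANNELS_CLOSED):
    """Na+ ions per second kept out by each closed channel."""
    return total_ions_per_s / channels


if __name__ == "__main__":
    print(f"cGMP hydrolyzed (upper bound): {cgmp_hydrolyzed_upper_bound():.0f}")
    for total in (1e6, 1e7):  # range given in Figure 15-41
        print(f"{total:.0e} ions/s over {CHANNELS_CLOSED} channels: "
              f"{ions_blocked_per_channel(total):.0f} ions/s per channel")
```

The upper bound is consistent with the statement that more than 10^5 cyclic GMP molecules are hydrolyzed for each photon absorbed.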

Likewise, when an extracellular signal molecule binds to a receptor that
indirectly activates adenylyl cyclase via Gs, each receptor protein may activate many
molecules of Gs protein, each of which can activate a cyclase molecule. Each
cyclase molecule, in turn, can catalyze the conversion of a large number of ATP
molecules to cAMP molecules. A similar amplification operates in the IP3
signaling pathway. In these ways, a nanomolar (10^-9 M) change in the concentration of
an extracellular signal can induce micromolar (10^-6 M) changes in the
concentration of a second messenger such as cAMP or Ca2+. Because these messengers
function as allosteric effectors to activate specific enzymes or ion channels, a
single extracellular signal molecule can alter many thousands of protein molecules
within the target cell.

Any such amplifying cascade of stimulatory signals requires counterbalancing
mechanisms at every step of the cascade to restore the system to its resting state
when stimulation ceases. As emphasized earlier, the response to stimulation can
be rapid only if the inactivating mechanisms are also rapid. Cells therefore have
efficient mechanisms for rapidly degrading (and resynthesizing) cyclic
nucleotides and for buffering and removing cytosolic Ca2+, as well as for inactivating the
responding enzymes and ion channels once they have been activated. This is not
only essential for turning a response off, but is also important for defining the
resting state from which a response begins.

Each protein in the signaling relay chain can be a separate target for
regulation, including the receptor itself, as we discuss next.


one rhodopsin molecule absorbs one photon
↓
500 G-protein (transducin) molecules are activated
↓
500 cyclic GMP phosphodiesterase molecules are activated
↓
10^5 cyclic GMP molecules are hydrolyzed
↓
250 cation channels close
↓
10^6 to 10^7 Na+ ions per second are prevented from entering the cell
for a period of ~1 second
↓
membrane potential is altered by 1 mV
↓
SIGNAL RELAYED TO BRAIN

Figure 15-41 Amplification in the light-induced catalytic cascade in vertebrate
rods. The red arrows indicate the steps where amplification occurs, with the
thickness of the arrow roughly indicating the magnitude of the amplification.

GPCR Desensitization Depends on Receptor Phosphorylation

As discussed earlier, when target cells are exposed to a high concentration of a
stimulating ligand for a prolonged period, they can become desensitized, or
adapted, in several different ways. An important class of adaptation mechanisms
depends on alteration of the quantity or condition of the receptor molecules
themselves.

For GPCRs, there are three general modes of adaptation (see Figure 15-20):
(1) In receptor sequestration, they are temporarily moved to the interior of the
cell (internalized) so that they no longer have access to their ligand. (2) In
receptor down-regulation, they are destroyed in lysosomes after internalization. (3) In
receptor inactivation, they become altered so that they can no longer interact with
G proteins.

In each case, the desensitization of the GPCRs depends on their
phosphorylation by PKA, PKC, or a member of the family of GPCR kinases (GRKs), which
includes the rhodopsin-specific kinase RK involved in rod photoreceptor
desensitization discussed earlier. The GRKs phosphorylate multiple serines and
threonines on a GPCR, but they do so only after ligand binding has activated the
receptor, because it is the activated receptor that allosterically activates the GRK.

Figure 15-42 The roles of GPCR kinases (GRKs) and arrestins in GPCR desensitization.
A GRK phosphorylates only activated receptors because it is the activated GPCR that activates the
GRK. The binding of an arrestin to the phosphorylated receptor prevents the receptor from binding
to its G protein and also directs its endocytosis (not shown). Mice that are deficient in one form
of arrestin fail to desensitize in response to morphine, for example, attesting to the importance of
arrestins for desensitization.


As with rhodopsin, once a receptor has been phosphorylated by a GRK, it binds
with high affinity to a member of the arrestin family of proteins (Figure 15-42).

The bound arrestin can contribute to the desensitization process in at least
two ways. First, it prevents the activated receptor from interacting with G
proteins. Second, it serves as an adaptor protein to help couple the receptor to the
clathrin-dependent endocytosis machinery (discussed in Chapter 13), inducing
receptor-mediated endocytosis. The fate of the internalized GPCR-arrestin
complex depends on other proteins in the complex. In some cases, the receptor is
dephosphorylated and recycled back to the plasma membrane for reuse. In
others, it is ubiquitylated, endocytosed, and degraded in lysosomes (discussed later).

Receptor endocytosis does not necessarily stop the receptor from signaling. In 
some cases, the bound arrestin recruits other signaling proteins to relay the signal 
onward from the internalized GPCRs along new pathways. 

Summary 

GPCRs can indirectly activate or inactivate either plasma-membrane-bound
enzymes or ion channels via G proteins. When an activated receptor stimulates a G
protein, the G protein undergoes a conformational change that activates its α
subunit, thereby triggering release of a βγ complex. Either component can then directly
regulate the activity of target proteins in the plasma membrane. Some GPCRs either
activate or inactivate adenylyl cyclase, thereby altering the intracellular
concentration of the second messenger cyclic AMP. Others activate a
phosphoinositide-specific phospholipase C (PLCβ), which generates two second messengers. One is
inositol 1,4,5-trisphosphate (IP3), which releases Ca²⁺ from the ER and thereby increases
the concentration of Ca²⁺ in the cytosol. The other is diacylglycerol, which remains
in the plasma membrane and helps activate protein kinase C (PKC). An increase in
cytosolic cyclic AMP or Ca²⁺ levels affects cells mainly by stimulating
cAMP-dependent protein kinase (PKA) and Ca²⁺/calmodulin-dependent protein kinases
(CaM-kinases), respectively.

PKC, PKA, and CaM-kinases phosphorylate specific target proteins and thereby
alter the activity of the proteins. Each type of cell has its own characteristic set of
target proteins that is regulated in these ways, enabling the cell to make its own
distinctive response to the second messengers. The intracellular signaling cascades
activated by GPCRs greatly amplify the responses, so that many thousands of target
protein molecules are changed for each molecule of extracellular signaling ligand
bound to its receptor. The responses mediated by GPCRs are rapidly turned off when
the extracellular signal is removed, and activated GPCRs are inactivated by
phosphorylation and association with arrestins.
SIGNALING THROUGH ENZYME-COUPLED RECEPTORS

Like GPCRs, enzyme-coupled receptors are transmembrane proteins with their
ligand-binding domain on the outer surface of the plasma membrane. Instead of
having a cytosolic domain that associates with a trimeric G protein, however, their
cytosolic domain either has intrinsic enzyme activity or associates directly with an
enzyme. Whereas a GPCR has seven transmembrane segments, each subunit of
an enzyme-coupled receptor typically has only one. GPCRs and enzyme-coupled
receptors often activate some of the same signaling pathways. In this section, we
describe some of the important features of signaling by enzyme-coupled
receptors, with an emphasis on the most common class of these proteins, the receptor
tyrosine kinases.

Activated Receptor Tyrosine Kinases (RTKs) Phosphorylate 
Themselves 

Many extracellular signal proteins act through receptor tyrosine kinases (RTKs). 
These include many secreted and cell-surface-bound proteins that control cell 
behavior in developing and adult animals. Some of these signal proteins and their
RTKs are listed in Table 15-4. 

There are about 60 human RTKs, which can be classified into about 20
structural subfamilies, each dedicated to its complementary family of protein ligands.
Figure 15-43 shows the basic structural features of a number of the families
that operate in mammals. In all cases, the binding of the signal protein to the
ligand-binding domain on the extracellular side of the receptor activates the
tyrosine kinase domain on the cytosolic side. This leads to phosphorylation of
tyrosine side chains on the cytosolic part of the receptor, creating phosphotyrosine
docking sites for various intracellular signaling proteins that relay the signal.

How does the binding of an extracellular ligand activate the kinase domain on 
the other side of the plasma membrane? For a GPCR, ligand binding is thought to 
change the relative orientation of several of the transmembrane α helices, thereby
shifting the position of the cytoplasmic loops relative to one another. It is unlikely, 
however, that a conformational change could propagate across the lipid bilayer 


TABLE 15-4 Some Signal Proteins That Act Via RTKs

Signal protein family | Receptor family | Some representative responses
Epidermal growth factor (EGF) | EGF receptors | Stimulates cell survival, growth, proliferation, or differentiation of various cell types; acts as inductive signal in development
Insulin | Insulin receptor | Stimulates carbohydrate utilization and protein synthesis
Insulin-like growth factor (IGF1) | IGF receptor-1 | Stimulates cell growth and survival in many cell types
Nerve growth factor (NGF) | Trk receptors | Stimulates survival and growth of some neurons
Platelet-derived growth factor (PDGF) | PDGF receptors | Stimulates survival, growth, proliferation, and migration of various cell types
Macrophage-colony-stimulating factor (MCSF) | MCSF receptor | Stimulates monocyte/macrophage proliferation and differentiation
Fibroblast growth factor (FGF) | FGF receptors | Stimulates proliferation of various cell types; inhibits differentiation of some precursor cells; acts as inductive signal in development
Vascular endothelial growth factor (VEGF) | VEGF receptors | Stimulates angiogenesis
Ephrin | Eph receptors | Stimulates angiogenesis; guides cell and axon migration
Figure 15-43 Some subfamilies of RTKs. Only one or two members of each
subfamily are indicated. Note that in some cases, the tyrosine kinase domain
is interrupted by a “kinase insert region” that is an extra segment emerging from
the folded kinase domain. The functions of most of the cysteine-rich,
immunoglobulin-like, and fibronectin-type-III-like domains are not known. Some of
the ligands for the receptors shown are listed in Table 15-4, along with some
representative responses that they mediate.


through a single transmembrane α helix. Instead, for most RTKs, ligand binding
causes the receptors to dimerize, bringing the two cytoplasmic kinase domains 
together and thereby promoting their activation (Figure 15-44). 

Dimerization stimulates kinase activity by a variety of mechanisms. In many
cases, such as the insulin receptor, dimerization simply brings the kinase domains
close to each other in an orientation that allows them to phosphorylate each other
on specific tyrosines in the kinase active sites, thereby promoting conformational
changes that fully activate both kinase domains. In other cases, such as the
receptor for epidermal growth factor (EGF), the kinase is not activated by
phosphorylation but by conformational changes brought about by interactions between the
two kinase domains outside their active sites (Figure 15-45).
Figure 15-44 Activation of RTKs by dimerization. In the absence of extracellular signals, most RTKs exist as monomers
in which the internal kinase domain is inactive. Binding of ligand brings two monomers together to form a dimer. In most
cases, the close proximity in the dimer leads the two kinase domains to phosphorylate each other, which has two effects.
First, phosphorylation at some tyrosines in the kinase domains promotes the complete activation of the domains. Second,
phosphorylation at tyrosines in other parts of the receptors generates docking sites for intracellular signaling proteins, resulting
in the formation of large signaling complexes that can then broadcast signals along multiple signaling pathways.

Mechanisms of dimerization vary widely among different RTK family members. In some cases, as shown here, the ligand
itself is a dimer and brings two receptors together by binding them simultaneously. In other cases, a monomeric ligand can
interact with two receptors simultaneously to bring them together, or two ligands can bind independently on two receptors to
promote dimerization. In some RTKs, notably those in the insulin receptor family, the receptor is always a dimer (see Figure
15-43), and ligand binding causes a conformational change that brings the two internal kinase domains closer together.
Although many RTKs are activated by transautophosphorylation as shown here, there are some important exceptions, including
the EGF receptor illustrated in Figure 15-45.
Phosphorylated Tyrosines on RTKs Serve as Docking Sites for
Intracellular Signaling Proteins

Once the kinase domains of an RTK dimer are activated, they phosphorylate
multiple additional sites in the cytosolic parts of the receptors, typically in disordered
regions outside the kinase domain (see Figure 15-44). This phosphorylation
creates high-affinity docking sites for intracellular signaling proteins. Each
signaling protein binds to a particular phosphorylated site on the activated receptors
because it contains a specific phosphotyrosine-binding domain that recognizes
surrounding features of the polypeptide chain in addition to the phosphotyrosine.
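
One way to picture this context-dependent recognition is as pattern matching on the residues that follow the phosphotyrosine. The sketch below is illustrative only. The consensus motifs are simplified approximations drawn from the general literature rather than from this chapter, and the `pY` notation, the name `docking_partners`, and the example peptides are all hypothetical.

```python
import re

# Simplified consensus motifs (assumptions for illustration; real SH2
# specificity is broader and depends on more than a few residues).
# "pY" stands for phosphotyrosine; "." matches any single residue.
SH2_MOTIFS = {
    "Src SH2": r"pYEEI",
    "Grb2 SH2": r"pY.N",
    "PI 3-kinase SH2": r"pY..M",
}


def docking_partners(peptide: str) -> list[str]:
    """Return the SH2 domains whose simplified motif occurs in the peptide."""
    return [name for name, motif in SH2_MOTIFS.items() if re.search(motif, peptide)]


if __name__ == "__main__":
    # The last peptide has an unphosphorylated tyrosine and binds nothing.
    for site in ("TEpYEEIP", "SpYVNVQ", "DpYMDMK", "SYVNVQ"):
        print(site, "->", docking_partners(site))
```

In this toy model, a peptide with an unphosphorylated tyrosine matches nothing, which mirrors the point that receptor phosphorylation is what creates the docking site.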

Once bound to the activated RTK, a signaling protein may become
phosphorylated on tyrosines and thereby activated. In many cases, however, the binding
alone may be sufficient to activate the docked signaling protein, by either
inducing a conformational change in the protein or simply bringing it near the protein
that is next in the signaling pathway. Thus, receptor phosphorylation serves as a 
switch to trigger the assembly of an intracellular signaling complex, which can 
then relay the signal onward, often along multiple routes, to various destinations 
in the cell. Because different RTKs bind different combinations of these signaling 
proteins, they activate different responses. 

Some RTKs use additional docking proteins to enlarge the signaling complex 
at activated receptors. Insulin and IGF1 receptor signaling, for example, depend
on a specialized docking protein called insulin receptor substrate 1 (IRS1). IRS1
associates with phosphorylated tyrosines on the activated receptor and is then
phosphorylated at multiple sites, thereby creating many more docking sites than 
could be accommodated on the receptor alone (see Figure 15-11). 


Figure 15-45 Activation of the EGF receptor kinase. In the absence of
ligand, the EGF receptor exists primarily as an inactive monomer. EGF binding
results in a conformational change that promotes dimerization of the external
domains. The receptor kinase domain, unlike that of many RTKs, is not activated
by transautophosphorylation. Instead, dimerization orients the internal kinase
domains into an asymmetric dimer, in which one kinase domain (the “activator”)
pushes against the other kinase domain (the “receiver”), thereby causing an
activating conformational change in the receiver. The active receiver domain then
phosphorylates multiple tyrosines in the C-terminal tails of both receptors,
generating docking sites for intracellular signaling proteins (see Figure 15-44).


Proteins with SH2 Domains Bind to Phosphorylated Tyrosines 

A whole menagerie of intracellular signaling proteins can bind to the
phosphotyrosines on activated RTKs (or on docking proteins such as IRS1). They help to
relay the signal onward, mainly through chains of protein-protein interactions
mediated by modular interaction domains, as discussed earlier. Some of the
docked proteins are enzymes, such as phospholipase C-γ (PLCγ), which
functions in the same way as phospholipase C-β, activating the inositol phospholipid
signaling pathway discussed earlier in connection with GPCRs (see Figures 15-28
and 15-29). Through this pathway, RTKs can increase cytosolic Ca²⁺ levels and
activate PKC. Another enzyme that docks on these receptors is the cytoplasmic
tyrosine kinase Src, which phosphorylates other signaling proteins on tyrosines.
Yet another is phosphoinositide 3-kinase (PI 3-kinase), which phosphorylates
lipids rather than proteins; as we discuss later, the phosphorylated lipids then serve
as docking sites to attract various signaling proteins to the plasma membrane.

The intracellular signaling proteins that bind to phosphotyrosines have varied
structures and functions. However, they usually share highly conserved
phosphotyrosine-binding domains. These can be either SH2 domains (for Src homology
region) or, less commonly, PTB domains (for phosphotyrosine-binding). By
recognizing specific phosphorylated tyrosines, these small interaction domains enable
the proteins that contain them to bind to activated RTKs, as well as to many other
intracellular signaling proteins that have been transiently phosphorylated on
tyrosines (Figure 15-46). As discussed previously, many signaling proteins also
contain other interaction domains that allow them to interact specifically with
other proteins as part of the signaling process. These domains include the SH3
domain, which binds to proline-rich motifs in intracellular proteins (see Figure
15-11).

Not all proteins that bind to activated RTKs via SH2 domains help to relay
the signal onward. Some act to decrease the signaling process, providing
negative feedback. One example is the c-Cbl protein, which can dock on some
activated receptors and catalyze their ubiquitylation, covalently adding one or more
ubiquitin molecules to specific sites on the receptor. This promotes the
endocytosis and degradation of the receptors in lysosomes, an example of receptor
down-regulation (see Figure 15-20). Endocytic proteins that contain
ubiquitin-interaction motifs (UIMs) recognize the ubiquitylated RTKs and direct them into
clathrin-coated vesicles and, ultimately, into lysosomes (discussed in Chapter
13). Mutations that inactivate c-Cbl-dependent RTK down-regulation cause
prolonged RTK signaling and thereby promote the development of cancer.

As is the case for GPCRs, ligand-induced endocytosis of RTKs does not always
decrease signaling. In some cases, RTKs are endocytosed with their bound
signaling proteins and continue to signal from endosomes or other intracellular
compartments. This mechanism, for example, allows nerve growth factor (NGF) to
bind to its specific RTK (called TrkA) at the end of a long nerve cell axon and signal
to the cell body of the same cell a long distance away. Here, signaling endocytic
Figure 15-46 The binding of SH2-containing intracellular signaling
proteins to an activated RTK.
(A) This drawing of a receptor for platelet-derived growth factor (PDGF) shows five
phosphotyrosine docking sites, three in the kinase insert region and two on the
C-terminal tail, to which the three signaling proteins shown bind as indicated. The
numbers on the right indicate the positions of the tyrosines in the polypeptide chain.
These binding sites have been identified by using recombinant DNA technology to
mutate specific tyrosines in the receptor. Mutation of tyrosines 1009 and 1021,
for example, prevents the binding and activation of PLCγ, so that receptor
activation no longer stimulates the inositol phospholipid signaling pathway. The
locations of the SH2 (red) and SH3 (blue) domains in the three signaling proteins
are indicated. (Additional phosphotyrosine docking sites on this receptor are not
shown, including those that bind the cytoplasmic tyrosine kinase Src and two
adaptor proteins.) It is unclear how many signaling proteins can bind simultaneously
to a single RTK. (B) The three-dimensional structure of an SH2 domain, as determined
by x-ray crystallography. The binding pocket for phosphotyrosine is shaded
in orange on the right, and a pocket for binding a specific amino acid side chain
(isoleucine, in this case) is shaded in yellow on the left. The RTK polypeptide segment
that binds the SH2 domain is shown in yellow (see also Figure 3-40). (C) The SH2
domain is a compact, “plug-in” module, which can be inserted almost anywhere in
a protein without disturbing the protein’s folding or function (discussed in Chapter
3). Because each domain has distinct sites for recognizing phosphotyrosine and for
recognizing a particular amino acid side chain, different SH2 domains recognize
phosphotyrosine in the context of different flanking amino acid sequences. (B, based
on data from G. Waksman et al., Cell 72:779-790, 1993. With permission from
Elsevier. PDB code: 2SRC.)












vesicles containing TrkA, with NGF bound on the lumenal side and signaling
proteins docked on the cytosolic side, are transported along the axon to the cell body,
where they signal the cell to survive.

Some signaling proteins are composed almost entirely of SH2 and SH3
domains and function as adaptors to couple tyrosine-phosphorylated proteins to
other proteins that do not have their own SH2 domains (see Figure 15-11).
Adaptor proteins of this type help to couple activated RTKs to the important signaling
protein Ras, a monomeric GTPase that, in turn, can activate various downstream
signaling pathways, as we now discuss.

The GTPase Ras Mediates Signaling by Most RTKs

The Ras superfamily consists of various families of monomeric GTPases, but only
the Ras and Rho families relay signals from cell-surface receptors (Table 15-5). By
interacting with different intracellular signaling proteins, a single Ras or Rho
family member can coordinately spread the signal along several distinct downstream
signaling pathways, thereby acting as a signaling hub.

There are three major, closely related Ras proteins in humans: H-, K-, and
N-Ras (see Table 15-5). Although they have subtly different functions, they are
thought to work in the same way, and we will refer to them simply as Ras. Like
many monomeric GTPases, Ras contains one or more covalently attached lipid
groups that help anchor the protein to the cytoplasmic face of the membrane,
from where it relays signals to other parts of the cell. Ras is often required, for
example, when RTKs signal to the nucleus to stimulate cell proliferation or
differentiation, both of which require changes in gene expression. If Ras function
is inhibited by various experimental approaches, the cell proliferation or
differentiation responses normally induced by the activated RTKs do not occur.
Conversely, 30% of human tumors express hyperactive mutant forms of Ras, which
contribute to the uncontrolled proliferation of the cancer cells.

Like other GTP-binding proteins, Ras functions as a molecular switch, cycling
between two distinct conformational states: active when GTP is bound and
inactive when GDP is bound (Movie 15.7). As discussed earlier for monomeric
GTPases in general, two classes of signaling proteins regulate Ras activity by
influencing its transition between active and inactive states (see Figure 15-8).


TABLE 15-5 The Ras Superfamily of Monomeric GTPases

Family | Some family members | Some functions
Ras | H-Ras, K-Ras, N-Ras | Relay signals from RTKs
Ras | Rheb | Activates mTOR to stimulate cell growth
Ras | Rap1 | Activated by a cyclic-AMP-dependent GEF; influences cell adhesion by activating integrins
Rho* | Rho, Rac, Cdc42 | Relay signals from surface receptors to the cytoskeleton and elsewhere
ARF* | ARF1-ARF6 | Regulate assembly of protein coats on intracellular vesicles
Rab* | Rab1-60 | Regulate intracellular vesicle traffic
Ran* | Ran | Regulates mitotic spindle assembly and nuclear transport of RNAs and proteins

*The Rho family is discussed in Chapter 16, the ARF and Rab proteins in Chapter 13, and Ran
in Chapters 12 and 17. The three-dimensional structure of Ras is shown in Figure 3-67.
Figure 15-47 How an RTK activates Ras. Grb2 recognizes a specific
phosphorylated tyrosine on the activated receptor by means of an SH2 domain
and recruits Sos by means of two SH3 domains. Sos stimulates the inactive Ras
protein to replace its bound GDP by GTP, which activates Ras to relay the signal
downstream.


Ras guanine nucleotide exchange factors (Ras-GEFs) stimulate the dissociation
of GDP and the subsequent uptake of GTP from the cytosol, thereby activating
Ras. Ras GTPase-activating proteins (Ras-GAPs) increase the rate of hydrolysis of
bound GTP by Ras, thereby inactivating Ras. Hyperactive mutant forms of Ras are
resistant to GAP-mediated GTPase stimulation and are locked permanently in the
GTP-bound active state, which is why they promote the development of cancer.
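
The opposing effects of GEFs and GAPs can be sketched as a two-state cycle in which Ras moves from the GDP-bound to the GTP-bound state at one rate and back at another. The following is a minimal sketch under that assumption. The function name `active_ras_fraction` and the rate values are hypothetical and are not measurements from the text.

```python
def active_ras_fraction(k_gef: float, k_gap: float) -> float:
    """Steady-state fraction of Ras that is GTP-bound in a two-state cycle.

    k_gef: effective rate of GDP -> GTP exchange (GEF-stimulated), arbitrary units
    k_gap: effective rate of GTP hydrolysis (GAP-stimulated), same units
    At steady state, k_gef * inactive == k_gap * active, so
    active / total == k_gef / (k_gef + k_gap).
    """
    if k_gef < 0 or k_gap < 0:
        raise ValueError("rates must be non-negative")
    total = k_gef + k_gap
    if total == 0:
        raise ValueError("at least one rate must be positive")
    return k_gef / total


if __name__ == "__main__":
    # Hypothetical numbers: strong GAP activity keeps most Ras inactive.
    print("normal Ras:        ", active_ras_fraction(k_gef=1.0, k_gap=9.0))
    # A GAP-resistant mutant is modeled as a much lower hydrolysis rate.
    print("GAP-resistant Ras: ", active_ras_fraction(k_gef=1.0, k_gap=0.001))
```

With the same exchange rate, lowering only the hydrolysis rate shifts nearly all of the protein into the GTP-bound state, which is the behavior described for hyperactive Ras mutants.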

But how do RTKs normally activate Ras? In principle, they could either activate
a Ras-GEF or inhibit a Ras-GAP. Even though some GAPs bind directly (via their
SH2 domains) to activated RTKs (see Figure 15-46A), it is the indirect coupling of
the receptor to a Ras-GEF that drives Ras into its active state. The loss of function
of a Ras-GEF has a similar effect to the loss of function of Ras itself. Activation of
the other Ras superfamily proteins, including those of the Rho family, also occurs
through the activation of GEFs. The particular GEF determines in which
membrane the GTPase is activated and, by acting as a scaffold, it can also determine
which downstream proteins the GTPase activates.

The GEF that mediates Ras activation by RTKs was discovered by genetic
studies of eye development in Drosophila, where an RTK called Sevenless (Sev)
is required for the formation of a photoreceptor cell called R7. Genetic screens
for components of this signaling pathway led to the discovery of a Ras-GEF called
Son-of-sevenless (Sos). Further genetic screens uncovered another protein, now
called Grb2, which is an adaptor protein that links the Sev receptor to the Sos
protein; the SH2 domain of the Grb2 adaptor binds to the activated receptor, while its
two SH3 domains bind to Sos. Sos then promotes Ras activation. Biochemical and
cell biological studies have shown that Grb2 and Sos also link activated RTKs to
Ras in mammalian cells, revealing that this is a highly conserved mechanism in
RTK signaling (Figure 15-47). Once activated, Ras activates various other
signaling proteins to relay the signal downstream, as we discuss next.

Ras Activates a MAP Kinase Signaling Module

Both the tyrosine phosphorylations and the activation of Ras triggered by
activated RTKs are usually short-lived (Figure 15-48). Tyrosine-specific protein
phosphatases quickly reverse the phosphorylations, and Ras-GAPs induce activated



Figure 15-48 Transient activation of Ras revealed by single-molecule
fluorescence resonance energy transfer (FRET). (A) Schematic drawing of the
experimental strategy. Cells of a human cancer cell line are genetically engineered
to express a Ras protein that is covalently linked to yellow fluorescent protein (YFP).
GTP that is labeled with a red fluorescent dye is microinjected into some of the cells.
The cells are then stimulated with the extracellular signal protein EGF, and single
fluorescent molecules of Ras-YFP at the inner surface of the plasma membrane are
followed by video fluorescence microscopy in individual cells. When a fluorescent
Ras-YFP molecule becomes activated, it exchanges unlabeled GDP for fluorescently
labeled GTP; the energy emitted by the YFP now activates the fluorescent GTP
to emit red light (called fluorescence resonance energy transfer, or FRET; see
Figure 9-26). Thus, the activation of single Ras molecules can be followed by
the emission of red fluorescence from a previously yellow-green fluorescent spot
at the plasma membrane. As shown in (B), activated Ras molecules can be
detected after about 30 seconds of EGF stimulation. The red signal peaks at about
3 minutes and then decreases to baseline by 6 minutes. As Ras-GAP is found to be
recruited to the same spots at the plasma membrane as Ras, it presumably plays a
major part in rapidly shutting off the Ras signal. (Modified from H. Murakoshi et al.,
Proc. Natl Acad. Sci. USA 101:7317-7322, 2004. With permission from National
Academy of Sciences.)
Figure 15-49 The MAP kinase module activated by Ras. The three-component
module begins with a MAP kinase kinase kinase called Raf. Ras recruits Raf to the
plasma membrane and helps activate it. Raf then activates the MAP kinase kinase
Mek, which then activates the MAP kinase Erk. Erk in turn phosphorylates a variety
of downstream proteins, including other protein kinases, as well as transcription
regulators in the nucleus. The resulting changes in protein activities and gene
expression cause complex changes in cell behavior.


Ras to inactivate itself by hydrolyzing its bound GTP to GDP. To stimulate cells to
proliferate or differentiate, these short-lived signaling events must be converted
into longer-lasting ones that can sustain the signal and relay it downstream to the
nucleus to alter the pattern of gene expression. One of the key mechanisms used
for this purpose is a system of proteins called the mitogen-activated protein kinase
module (MAP kinase module) (Figure 15-49). The three components of this
system form a functional signaling module that has been remarkably well conserved
during evolution and is used, with variations, in many different signaling contexts.

The three components are all protein kinases. The final kinase in the series is
called simply MAP kinase (MAPK). The next one upstream from this is MAP kinase
kinase (MAPKK): it phosphorylates and thereby activates MAP kinase. Next above
that, receiving an activating signal directly from Ras, is MAP kinase kinase kinase
(MAPKKK): it phosphorylates and thereby activates MAPKK. In the mammalian
Ras-MAP-kinase signaling pathway, these three kinases are known by shorter
names: Raf (= MAPKKK), Mek (= MAPKK), and Erk (= MAPK).
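
The strictly ordered relay described above can be written down as a small data structure. This is a deliberately simplified, all-or-none sketch: the `propagate` function and its `inhibited` parameter are hypothetical names introduced for illustration, and real kinase activation is graded rather than Boolean.

```python
# Ordered from the kinase nearest Ras to the final MAP kinase.
CASCADE = [("Raf", "MAPKKK"), ("Mek", "MAPKK"), ("Erk", "MAPK")]


def propagate(ras_active: bool, inhibited: frozenset = frozenset()) -> dict:
    """Return which kinases become active, given Ras state and blocked kinases.

    Each kinase is active only if the component directly upstream is active
    and the kinase itself is not experimentally inhibited.
    """
    state = {}
    upstream_active = ras_active
    for name, _role in CASCADE:
        upstream_active = upstream_active and name not in inhibited
        state[name] = upstream_active
    return state


if __name__ == "__main__":
    print(propagate(True))                      # signal reaches Erk
    print(propagate(True, frozenset({"Mek"})))  # Raf active, Mek and Erk not
    print(propagate(False))                     # no Ras signal, nothing active
```

Blocking any one kinase in the sketch silences everything downstream of it while leaving upstream components unaffected, which is the logic behind ordering components of a pathway by inhibition experiments.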

Once activated, the MAP kinase relays the signal downstream by
phosphorylating various proteins in the cell, including transcription regulators and other
protein kinases (see Figure 15-49). Erk, for example, enters the nucleus and
phosphorylates one or more components of a transcription regulatory complex. This
activates the transcription of a set of immediate early genes, so named because
they turn on within minutes after an RTK receives an extracellular signal, even
if protein synthesis is experimentally blocked with drugs. Some of these genes
encode other transcription regulators that turn on other genes, a process that
requires both protein synthesis and more time. In this way, the Ras-MAP-kinase
signaling pathway conveys signals from the cell surface to the nucleus and alters
the pattern of gene expression. Among the genes activated by this pathway are
some that stimulate cell proliferation, such as the genes encoding G1 cyclins
(discussed in Chapter 17).

Extracellular signals usually activate MAP kinases only transiently, and the
period during which the kinase remains active influences the response. When
EGF activates its receptors in a neural precursor cell line, for example, Erk MAP
kinase activity peaks at 5 minutes and rapidly declines, and the cells later go on to
divide. By contrast, when NGF activates its receptors on the same cells, Erk
activity remains high for many hours, and the cells stop proliferating and differentiate
into neurons.

Many factors influence the duration and other features of the signaling
response, including positive and negative feedback loops, which can combine to
give responses that are either graded or switchlike and either brief or long lasting.
In an example illustrated earlier, in Figure 15-19, MAP kinase activates a complex
positive feedback loop to produce an all-or-none, irreversible response when frog
oocytes are stimulated to mature by a brief exposure to the extracellular signal
molecule progesterone. In many cells, MAP kinases activate a negative feedback
loop by increasing the concentration of a protein phosphatase that removes the
phosphate from MAP kinase. The increase in the phosphatase results from both
an increase in the transcription of the phosphatase gene and the stabilization of
the enzyme against degradation. In the Ras-MAP-kinase pathway shown in
Figure 15-49, Erk also phosphorylates and inactivates Raf, providing another
negative feedback loop that helps shut off the MAP kinase module.
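
How a phosphatase-based negative feedback loop can turn a sustained stimulus into a transient response can be sketched with a two-variable toy model. Everything here is an assumption made for illustration: the equations, the parameter values, and the name `simulate_erk` are hypothetical and are not fitted to any of the experiments described in the text.

```python
def simulate_erk(feedback_gain: float, t_end: float = 100.0, dt: float = 0.01) -> list:
    """Euler integration of the active Erk fraction under a constant stimulus.

    Toy model (arbitrary units):
        dE/dt = stimulus * (1 - E) - (basal_off + P) * E
        dP/dt = feedback_gain * E - p_decay * P
    E is the active Erk fraction and P is the level of an Erk-induced
    phosphatase. feedback_gain = 0 removes the negative feedback loop.
    """
    stimulus, basal_off, p_decay = 1.0, 0.1, 0.05
    erk, phosphatase = 0.0, 0.0
    trace = []
    for _ in range(round(t_end / dt)):
        d_erk = stimulus * (1.0 - erk) - (basal_off + phosphatase) * erk
        d_p = feedback_gain * erk - p_decay * phosphatase
        erk += d_erk * dt
        phosphatase += d_p * dt
        trace.append(erk)
    return trace


if __name__ == "__main__":
    for gain in (0.0, 0.5):
        trace = simulate_erk(gain)
        print(f"feedback gain {gain}: peak {max(trace):.2f}, final {trace[-1]:.2f}")
```

In this sketch, the run without feedback climbs to a plateau and stays there, whereas the run with feedback peaks early and then settles at a much lower level as the phosphatase accumulates.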

Scaffold Proteins Help Prevent Cross-talk Between 
Parallel MAP Kinase Modules 

Three-component MAP kinase signaling modules operate in all eukaryotic cells,
with different modules mediating different responses in the same cell. In
budding yeast, for example, one such module mediates the response to mating
pheromone, another the response to starvation, and yet another the response to
osmotic shock. Some of these MAP kinase modules use one or more of the same
kinases and yet manage to activate different effector proteins and hence different
responses. As discussed earlier, one way in which cells avoid cross-talk between
the different parallel signaling pathways and ensure that each response is specific
is to use scaffold proteins (see Figure 15-10A). In budding yeast cells, such
scaffolds bind all or some of the kinases in each MAP kinase module to form a
complex and thereby help to ensure response specificity (Figure 15-50).

Mammalian cells also use this scaffold strategy to prevent cross-talk between
different MAP kinase modules. At least five parallel MAP kinase modules can
operate in a mammalian cell. These modules make use of at least 12 MAP kinases,
7 MAPKKs, and 7 MAPKKKs. Two of these modules (terminating in MAP kinases
called JNK and p38) are activated by different kinds of cell stresses, such as
ultraviolet (UV) irradiation, heat shock, and osmotic stress, as well as by inflammatory
cytokines; others mainly mediate responses to signals from other cells.

Although the scaffold strategy provides precision and avoids cross-talk, it
reduces the opportunities for amplification and spreading of the signal to different 
parts of the cell, which require at least some of the components to be diffusible. It 
is unclear to what extent the individual components of MAP kinase modules can 
dissociate from the scaffold during the activation process to permit amplification. 


Figure 15-50 The organization of two MAP kinase modules by scaffold
proteins in budding yeast. Budding yeast have at least six three-component
MAP kinase modules involved in a variety of biological processes, including the
two responses illustrated here: a mating response and the response to high
osmolarity. (A) The mating response is triggered when a mating factor secreted
by a yeast of opposite mating type binds to a GPCR. This activates a G protein, the
βγ complex of which indirectly activates the MAPKKK (kinase A), which then relays
the response onward. Once activated, the MAP kinase (kinase C) phosphorylates
and thereby activates several proteins that mediate the mating response, in which the
yeast cell stops dividing and prepares for fusion. The three kinases in this module
are bound to scaffold protein 1. (B) In a second response, a yeast cell exposed to a
high-osmolarity environment is induced to synthesize glycerol to increase its internal
osmolarity. This response is mediated by an osmolarity-sensing receptor protein and
a different MAP kinase module bound to a second scaffold protein. (Note that the
kinase domain of scaffold 2 provides the MAPKK activity of this module.) Although
both pathways use the same MAPKKK (kinase A, green), there is no cross-talk
between them because the kinases in each module are bound to different scaffold
proteins, and the osmosensor is bound to the same scaffold protein as the particular
kinase it activates.

Rho Family GTPases Functionally Couple Cell-Surface Receptors
to the Cytoskeleton

Besides the Ras proteins, the other class of Ras superfamily GTPases that relays
signals from cell-surface receptors is the large Rho family (see Table 15-5). Rho
family monomeric GTPases regulate both the actin and microtubule cytoskeletons,
controlling cell shape, polarity, motility, and adhesion (discussed in Chapter
16); they also regulate cell-cycle progression, gene transcription, and membrane
transport. They play a key part in the guidance of cell migration and nerve axon
outgrowth, mediating cytoskeletal responses to the activation of a special class of
guidance receptors. We focus on this aspect of Rho family function here.

The three best-characterized family members are Rho itself, Rac, and Cdc42,
each of which affects multiple downstream target proteins. In the same way as for
Ras, GEFs activate and GAPs inactivate the Rho family GTPases; there are more
than 80 Rho-GEFs and more than 70 Rho-GAPs in humans. Some of the GEFs and
GAPs are specific for one particular family member, whereas others are less
specific. Unlike Ras, which is membrane-associated even when inactive (with GDP
bound), inactive Rho family GTPases are often bound to guanine nucleotide
dissociation inhibitors (GDIs) in the cytosol, which prevent the GTPases from
interacting with their Rho-GEFs at the plasma membrane.

Signaling by extracellular signaling proteins of the ephrin family provides an
example of how RTKs can activate a Rho GTPase. Ephrins bind and thereby activate
members of the Eph family of RTKs (see Figure 15-43). One member of the
Eph family is found on the surface of motor neurons and helps guide the migrating
tip of the axon (called a growth cone) to its muscle target. The binding of a
cell-surface ephrin protein activates the Eph receptor, causing the growth cones to
collapse, thereby repelling them from inappropriate regions and keeping them on
track. The response depends on a Rho-GEF called ephexin, which is stably associated
with the cytosolic tail of the Eph receptor. When ephrin binding activates
the Eph receptor, the receptor activates a cytoplasmic tyrosine kinase that
phosphorylates ephexin on a tyrosine, enhancing the ability of ephexin to activate the
Rho protein RhoA. The activated RhoA (RhoA-GTP) then regulates various downstream
target proteins, including some effector proteins that control the actin
cytoskeleton, causing the growth cone to collapse (Figure 15-51).

Having considered how RTKs use GEFs and monomeric GTPases to relay signals
into the cell, we now consider a second major strategy that RTKs use that
depends on a quite different intracellular relay mechanism.

Figure 15-51 Growth cone collapse mediated by Rho family GTPases. The binding of
ephrin A1 proteins on an adjacent cell activates EphA4 RTKs on the growth cone.
Phosphotyrosines on the activated Eph receptors recruit and activate a cytoplasmic
tyrosine kinase to phosphorylate the receptor-associated Rho-GEF ephexin on a
tyrosine. This enhances the ability of the ephexin to activate RhoA. RhoA then
induces the growth cone to collapse by stimulating the myosin-dependent contraction
of the actin cytoskeleton.

PI 3-Kinase Produces Lipid Docking Sites in the Plasma
Membrane

As mentioned earlier, one of the proteins that binds to the intracellular tail of RTK
molecules is the plasma-membrane-bound enzyme phosphoinositide 3-kinase
(PI 3-kinase). This kinase principally phosphorylates inositol phospholipids
rather than proteins, and both RTKs and GPCRs can activate it. It plays a central
part in promoting cell survival and growth.

Phosphatidylinositol (PI) is unique among membrane lipids because it can
undergo reversible phosphorylation at multiple sites on its inositol head group
to generate a variety of phosphorylated PI lipids called phosphoinositides. When
activated, PI 3-kinase catalyzes phosphorylation at the 3 position of the inositol
ring to generate several phosphoinositides (Figure 15-52). The production of
PI(3,4,5)P3 matters most because it can serve as a docking site for various
intracellular signaling proteins, which assemble into signaling complexes that relay the
signal into the cell from the cytosolic face of the plasma membrane (see Figure
15-10C).

Notice the difference between this use of phosphoinositides and their use
described earlier, in which PI(4,5)P2 is cleaved by PLCβ (in the case of GPCRs) or
PLCγ (in the case of RTKs) to generate soluble IP3 and membrane-bound
diacylglycerol (see Figures 15-28 and 15-29). By contrast, PI(3,4,5)P3 is not cleaved by
either PLC. It is made from PI(4,5)P2 and then remains in the plasma membrane
until specific phosphoinositide phosphatases dephosphorylate it. Prominent
among these is the PTEN phosphatase, which dephosphorylates the 3 position
of the inositol ring. Mutations in PTEN are found in many cancers: by prolonging
signaling by PI 3-kinase, they promote uncontrolled cell growth.
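
The opposing actions of PI 3-kinase and PTEN can be put in rough quantitative terms. The following Python sketch is an illustration only: it assumes, purely for simplicity, that PI 3-kinase converts PI(4,5)P2 to PI(3,4,5)P3 and that PTEN reverses the reaction, both with first-order kinetics. The function name and the rate constants k_kinase and k_pten are hypothetical and are not measured values.

```python
def pip3_steady_state_fraction(k_kinase: float, k_pten: float) -> float:
    """Fraction of the lipid pool present as PI(3,4,5)P3 at steady state.

    Assumed toy model (not from the text):
        d[PIP3]/dt = k_kinase * [PIP2] - k_pten * [PIP3]
    with [PIP2] + [PIP3] held constant, so at steady state
        [PIP3] / total = k_kinase / (k_kinase + k_pten).
    """
    if k_kinase < 0 or k_pten < 0:
        raise ValueError("rate constants must be non-negative")
    total_rate = k_kinase + k_pten
    if total_rate == 0:
        raise ValueError("at least one rate constant must be positive")
    return k_kinase / total_rate


if __name__ == "__main__":
    # Hypothetical rate constants in arbitrary units.
    normal = pip3_steady_state_fraction(k_kinase=1.0, k_pten=3.0)
    # A PTEN mutation is modeled here as a tenfold drop in phosphatase activity.
    pten_mutant = pip3_steady_state_fraction(k_kinase=1.0, k_pten=0.3)
    print(f"normal PTEN:  {normal:.2f}")
    print(f"mutant PTEN:  {pten_mutant:.2f}")
```

In this toy model, lowering PTEN activity raises the steady-state level of PI(3,4,5)P3 for the same PI 3-kinase activity, which is one way to picture how PTEN mutations prolong signaling.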

There are various types of PI 3-kinases. Those activated by RTKs and GPCRs
belong to class I. These are heterodimers composed of a common catalytic subunit
and different regulatory subunits. RTKs activate class Ia PI 3-kinases, in which
the regulatory subunit is an adaptor protein that binds to two phosphotyrosines
on activated RTKs through its two SH2 domains (see Figure 15-46A). GPCRs activate
class Ib PI 3-kinases, which have a regulatory subunit that binds to the βγ
complex of an activated trimeric G protein (Gq) when GPCRs are activated by
their extracellular ligand. The direct binding of activated Ras can also activate the
common class I catalytic subunit.

Figure 15-52 The generation of phosphoinositide docking sites by PI 3-kinase.
PI 3-kinase phosphorylates the inositol ring on carbon atom 3 to generate the
phosphoinositides shown at the bottom of the figure (diverting them away from the
pathway leading to IP3 and diacylglycerol; see Figure 15-28). The most important
phosphorylation (indicated in red) is of PI(4,5)P2 to PI(3,4,5)P3, which can serve
as a docking site for signaling proteins with PI(3,4,5)P3-binding PH domains. Other
inositol phospholipid kinases (not shown) catalyze the phosphorylations indicated by
the green arrows.

Intracellular signaling proteins bind to PI(3,4,5)P3 produced by activated PI
3-kinase via a specific interaction domain, such as a pleckstrin homology (PH)
domain, first identified in the platelet protein pleckstrin. PH domains function
mainly as protein-protein interaction domains, and it is only a small subset of
them that bind to PI(3,4,5)P3; at least some of these also recognize a specific
membrane-bound protein as well as the PI(3,4,5)P3, which greatly increases the
specificity of the binding and helps to explain why the signaling proteins with
PI(3,4,5)P3-binding PH domains do not all dock at all PI(3,4,5)P3 sites. PH domains occur
in about 200 human proteins, including the Ras-GEF Sos discussed earlier (see
Figure 15-11).

One especially important PH-domain-containing protein is the serine/threonine
protein kinase Akt. The PI-3-kinase-Akt signaling pathway is the major pathway
activated by the hormone insulin. It also plays a key part in promoting the
survival and growth of many cell types in both invertebrates and vertebrates, as
we now discuss.

The PI-3-Kinase-Akt Signaling Pathway Stimulates Animal Cells to
Survive and Grow

As discussed earlier, extracellular signals are usually required for animal cells to
grow and divide, as well as to survive (see Figure 15-4). Members of the insulin-like
growth factor (IGF) family of signal proteins, for example, stimulate many
types of animal cells to survive and grow. They bind to specific RTKs (see Figure
15-43), which activate PI 3-kinase to produce PI(3,4,5)P3. The PI(3,4,5)P3 recruits
two protein kinases to the plasma membrane via their PH domains, Akt (also
called protein kinase B, or PKB) and phosphoinositide-dependent protein kinase
1 (PDK1), and this leads to the activation of Akt (Figure 15-53). Once activated,
Akt phosphorylates various target proteins at the plasma membrane, as well as in
the cytosol and nucleus. The effect on most of the known targets is to inactivate
them; but the targets are such that these actions of Akt all conspire to enhance cell
survival and growth, as illustrated for one cell survival pathway in Figure 15-53.

Figure 15-53 One way in which signaling through PI 3-kinase promotes cell survival.
An extracellular survival signal activates an RTK, which recruits and activates
PI 3-kinase. The PI 3-kinase produces PI(3,4,5)P3, which serves as a docking site for
two serine/threonine kinases with PH domains, Akt and the phosphoinositide-dependent
kinase PDK1, and brings them into proximity at the plasma membrane. The Akt is
phosphorylated on a serine by a third kinase (usually mTOR in complex 2), which
alters the conformation of the Akt so that it can be phosphorylated on a threonine
by PDK1, which activates the Akt. The activated Akt now dissociates from the plasma
membrane and phosphorylates various target proteins, including the Bad protein. When
unphosphorylated, Bad holds one or more apoptosis-inhibitory proteins (of the Bcl2
family, discussed in Chapter 18) in an inactive state. Once phosphorylated, Bad
releases the inhibitory proteins, which now can block apoptosis and thereby promote
cell survival. As shown, the phosphorylated Bad binds to a ubiquitous cytosolic
protein called 14-3-3, which keeps Bad out of action.

Figure 15-54 Activation of mTOR by the PI-3-kinase-Akt signaling pathway. (A) In
the absence of extracellular growth factors, Tsc2 (a Rheb-GAP) keeps Rheb inactive;
mTOR in complex 1 is inactive, and there is no cell growth. (B) In the presence of
growth factors, activated Akt phosphorylates and inhibits Tsc2, thereby promoting
the activation of Rheb. Activated Rheb (Rheb-GTP) helps activate mTOR in complex 1,
which in turn stimulates cell growth. Figure 15-53 shows how growth factors (or
survival signals) activate Akt. The Erk MAP kinase (see Figure 15-49) can also
phosphorylate and inhibit Tsc2 and thereby activate mTOR. Thus, both the
PI-3-kinase-Akt and Ras-MAP-kinase signaling pathways converge on mTOR in complex 1
to stimulate cell growth. Tsc2 is short for tuberous sclerosis protein 2, and it is
one component of a heterodimer composed of Tsc1 and Tsc2 (not shown); these proteins
are so called because mutations in either gene encoding them cause the genetic
disease tuberous sclerosis, which is associated with benign tumors that contain
abnormally large cells.

The control of cell growth by the PI-3-kinase-Akt pathway depends in part on
a large protein kinase called TOR (named as the target of rapamycin, a bacterial
toxin that inactivates the kinase and is used clinically as both an immunosuppressant
and anticancer drug). TOR was originally identified in yeasts in genetic
screens for rapamycin resistance; in mammalian cells, it is called mTOR, which
exists in cells in two functionally distinct multiprotein complexes. mTOR complex
1 contains the protein raptor; this complex is sensitive to rapamycin, and it
stimulates cell growth, both by promoting ribosome production and protein synthesis
and by inhibiting protein degradation. Complex 1 also promotes both cell growth
and cell survival by stimulating nutrient uptake and metabolism. mTOR complex
2 contains the protein rictor and is insensitive to rapamycin; it helps to activate Akt
(see Figure 15-53), and it regulates the actin cytoskeleton via Rho family GTPases.

The mTOR in complex 1 integrates inputs from various sources, including
extracellular signal proteins referred to as growth factors and nutrients such as
amino acids, both of which help activate mTOR and promote cell growth. The
growth factors activate mTOR mainly via the PI-3-kinase-Akt pathway. Akt activates
mTOR in complex 1 indirectly by phosphorylating, and thereby inhibiting, a
GAP called Tsc2. Tsc2 acts on a monomeric Ras-related GTPase called Rheb (see
Table 15-5). Rheb in its active form (Rheb-GTP) activates mTOR in complex 1. The
net result is that Akt activates mTOR and thereby promotes cell growth (Figure
15-54). We discuss how mTOR stimulates ribosome production and protein
synthesis in Chapter 17 (see Figure 17-64).
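
Because Akt activates mTOR by inhibiting an inhibitor, the logic of this relay is easy to misread. The short Python sketch below restates it as Boolean steps. It is a deliberate simplification offered for illustration: the function name is hypothetical, each protein is treated as simply on or off, the nutrient (amino acid) input is ignored, and the optional Erk input is taken from the legend of Figure 15-54.

```python
def mtor_complex1_active(akt_active: bool, erk_active: bool = False) -> bool:
    """Boolean sketch of the Akt -| Tsc2 -| Rheb -> mTOR complex 1 relay."""
    # Tsc2 stays active unless Akt (or Erk) phosphorylates and inhibits it.
    tsc2_active = not (akt_active or erk_active)
    # Active Tsc2 is a Rheb-GAP, so it keeps Rheb in the inactive GDP-bound form.
    rheb_gtp = not tsc2_active
    # Rheb-GTP activates mTOR in complex 1.
    return rheb_gtp


if __name__ == "__main__":
    for akt in (False, True):
        state = mtor_complex1_active(akt)
        print(f"Akt active = {akt}: mTOR complex 1 active = {state}")
```

Two successive inhibitory steps cancel out, which is why active Akt ends up switching mTOR complex 1 on rather than off.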

RTKs and GPCRs Activate Overlapping Signaling Pathways 

As mentioned earlier, RTKs and GPCRs activate some of the same intracellular 
signaling pathways. Both, for example, can activate the inositol phospholipid 
pathway triggered by phospholipase C. Moreover, even when they activate different
pathways, the different pathways can converge on the same target proteins.

Figure 15-55 illustrates both of these types of signaling overlaps: it summarizes
five parallel intracellular signaling pathways that we have discussed so far: one
triggered by GPCRs, two triggered by RTKs, and two triggered by both kinds of
receptors. Interactions among these pathways allow different extracellular signal
molecules to modulate and coordinate each other's effects.

Figure 15-55 Five parallel intracellular signaling pathways activated by GPCRs,
RTKs, or both. In this simplified example, the five kinases (shaded yellow) at the
end of each signaling pathway phosphorylate target proteins (shaded red), many of
which are phosphorylated by more than one of the kinases. The phospholipase C
activated by the two types of receptors is different: GPCRs activate PLCβ, whereas
RTKs activate PLCγ (not shown). Although not shown, some GPCRs can also activate
Ras, but they do so independently of Grb2, via a Ras-GEF that is activated by Ca2+
and diacylglycerol.


Some Enzyme-Coupled Receptors Associate with Cytoplasmic 
Tyrosine Kinases 

Many cell-surface receptors depend on tyrosine phosphorylation for their activity
and yet lack a tyrosine kinase domain. These receptors act through cytoplasmic
tyrosine kinases, which are associated with the receptors and phosphorylate
various target proteins, often including the receptors themselves, when the receptors
bind their ligand. These tyrosine-kinase-associated receptors thus function
in much the same way as RTKs, except that their kinase domain is encoded by
a separate gene and is noncovalently associated with the receptor polypeptide
chain. A variety of receptor classes belong in this category, including the receptors
for antigen and interleukins on lymphocytes (discussed in Chapter 24), integrins
(discussed in Chapter 19), and receptors for various cytokines and some hormones.
As with RTKs, many of these receptors are either preformed dimers or are
cross-linked into dimers by ligand binding.

Some of these receptors depend on members of the largest family of mammalian
cytoplasmic tyrosine kinases, the Src family (see Figures 3-10 and 3-64),
which includes Src, Yes, Fgr, Fyn, Lck, Lyn, Hck, and Blk. These protein kinases
all contain SH2 and SH3 domains and are located on the cytoplasmic side of the
plasma membrane, held there partly by their interaction with transmembrane
receptor proteins and partly by covalently attached lipid chains. Different family
members are associated with different receptors and phosphorylate overlapping
but distinct sets of target proteins. Lyn, Fyn, and Lck, for example, are each
associated with different sets of receptors on lymphocytes. In each case, the kinase is
activated when an extracellular ligand binds to the appropriate receptor protein.
Src itself, as well as several other family members, can also bind to activated RTKs;
in these cases, the receptor and cytoplasmic kinases mutually stimulate each
other's catalytic activity, thereby strengthening and prolonging the signal (see Figure
15-51). There are even some G proteins (Gs and Gi) that can activate Src, which
is one way that the activation of GPCRs can lead to tyrosine phosphorylation of
intracellular signaling proteins and effector proteins.

Another type of cytoplasmic tyrosine kinase associates with integrins, the
main receptors that cells use to bind to the extracellular matrix (discussed in
Chapter 19). The binding of matrix components to integrins activates intracellular
signaling pathways that influence the behavior of the cell. When integrins cluster
at sites of matrix contact, they help trigger the assembly of cell-matrix junctions
called focal adhesions. Among the many proteins recruited into these junctions is
the cytoplasmic tyrosine kinase called focal adhesion kinase (FAK), which binds
to the cytosolic tail of one of the integrin subunits with the assistance of other
proteins. The clustered FAK molecules phosphorylate each other, creating
phosphotyrosine docking sites where the Src kinase can bind. Src and FAK then
phosphorylate each other and other proteins that assemble in the junction, including
many of the signaling proteins used by RTKs. In this way, the two tyrosine kinases
signal to the cell that it has adhered to a suitable substratum, where the cell can
now survive, grow, divide, migrate, and so on.

The largest and most diverse class of receptors that rely on cytoplasmic tyrosine
kinases to relay signals into the cell is the class of cytokine receptors, which we
consider next.

Cytokine Receptors Activate the JAK-STAT Signaling Pathway 

The large family of cytokine receptors includes receptors for many kinds of local
mediators (collectively called cytokines), as well as receptors for some hormones,
such as growth hormone and prolactin (Movie 15.8). These receptors are stably
associated with cytoplasmic tyrosine kinases called Janus kinases (JAKs) (after
the two-faced Roman god), which phosphorylate and activate transcription regulators
called STATs (signal transducers and activators of transcription). STAT proteins
are located in the cytosol and are referred to as latent transcription regulators
because they migrate into the nucleus and regulate gene transcription only after
they are activated.

Although many intracellular signaling pathways lead from cell-surface receptors
to the nucleus, where they alter gene transcription (see Figure 15-55), the
JAK-STAT signaling pathway provides one of the more direct routes. Cytokine
receptors are dimers or trimers and are stably associated with one or two of
the four known JAKs (JAK1, JAK2, JAK3, and Tyk2). Cytokine binding alters the
arrangement so as to bring two JAKs into close proximity so that they phosphorylate
each other, thereby increasing the activity of their tyrosine kinase domains.
The JAKs then phosphorylate tyrosines on the cytoplasmic tails of cytokine receptors,
creating phosphotyrosine docking sites for STATs (Figure 15-56). Some
adaptor proteins can also bind to some of these sites and couple cytokine receptors
to the Ras-MAP-kinase signaling pathway discussed earlier, but these will not
be discussed here.

There are at least six STATs in mammals. Each has an SH2 domain that performs
two functions. First, it mediates the binding of the STAT protein to a
phosphotyrosine docking site on an activated cytokine receptor. Once bound, the JAKs
phosphorylate the STAT on tyrosines, causing the STAT to dissociate from the
receptor. Second, the SH2 domain on the released STAT now mediates its binding
to a phosphotyrosine on another STAT molecule, forming either a STAT homodimer
or a heterodimer. The STAT dimer then translocates to the nucleus, where,
in combination with other transcription regulatory proteins, it binds to a specific
cis-regulatory sequence in various genes and stimulates their transcription (see
Figure 15-56). In response to the hormone prolactin, for example, which stimulates
breast cells to produce milk, activated STAT5 stimulates the transcription of
genes that encode milk proteins. Table 15-6 lists some of the more than 30 cytokines
and hormones that activate the JAK-STAT pathway by binding to cytokine
receptors.

Negative feedback regulates the responses mediated by the JAK-STAT pathway.
In addition to activating genes that encode proteins mediating the
cytokine-induced response, the STAT dimers can also activate genes that encode
inhibitory proteins that help shut off the response. Some of these proteins bind to
and inactivate phosphorylated JAKs and their associated phosphorylated receptors;
others bind to phosphorylated STAT dimers and prevent them from binding
to their DNA targets. Such negative feedback mechanisms, however, are not
enough on their own to turn off the response. Inactivation of the activated JAKs
and STATs requires dephosphorylation of their phosphotyrosines.

Figure 15-56 The JAK-STAT signaling pathway activated by cytokines. The binding of
the cytokine either causes two separate receptor polypeptide chains to dimerize (as
shown) or re-orients the receptor chains in a preformed dimer. In either case, the
associated JAKs are brought together so that they can phosphorylate each other on
tyrosines to become fully activated, after which they phosphorylate the receptors to
generate binding sites for the SH2 domains of STAT proteins. The JAKs also
phosphorylate the STAT proteins, which dissociate from the receptor to form dimers
and enter the nucleus to control gene expression.

Protein Tyrosine Phosphatases Reverse Tyrosine Phosphorylations 

In all signaling pathways that use tyrosine phosphorylation, the tyrosine
phosphorylations are reversed by protein tyrosine phosphatases. These phosphatases
are as important in the signaling process as the protein tyrosine kinases that add
the phosphates. Whereas only a few types of serine/threonine protein phosphatase
catalytic subunits are responsible for removing phosphate groups from
phosphorylated serines and threonines on proteins, there are about 100 protein tyrosine
phosphatases encoded in the human genome, including some dual-specificity
phosphatases that also dephosphorylate serines and threonines.

Like tyrosine kinases, the tyrosine phosphatases occur in both cytoplasmic
and transmembrane forms. Unlike serine/threonine protein phosphatases, which
generally have broad specificity, most tyrosine phosphatases display exquisite
specificity for their substrates, removing phosphate groups from only selected
phosphotyrosines on a subset of proteins. Together, these phosphatases ensure
that tyrosine phosphorylations are short-lived and that the level of tyrosine
phosphorylation in resting cells is very low. They do not, however, simply continuously
reverse the effects of protein tyrosine kinases; they are often regulated to act only
at the appropriate time and place.

Having discussed the crucial role of tyrosine phosphorylation and
dephosphorylation in the intracellular signaling pathways activated by many
enzyme-coupled receptors, we now turn to a class of enzyme-coupled receptors that rely
on serine and threonine phosphorylation. These receptor serine/threonine kinases
activate an even more direct signaling pathway to the nucleus than does the
JAK-STAT pathway. They directly phosphorylate latent transcription regulators called
Smads, which then translocate into the nucleus to control gene transcription.

Signal Proteins of the TGFβ Superfamily Act Through Receptor
Serine/Threonine Kinases and Smads

The transforming growth factor-β (TGFβ) superfamily consists of a large number
(33 in humans) of structurally related, secreted, dimeric proteins. They act
either as hormones or, more commonly, as local mediators to regulate a wide
range of biological functions in all animals. During development, they regulate
pattern formation and influence various cell behaviors, including proliferation,
specification and differentiation, extracellular matrix production, and cell death.
In adults, they are involved in tissue repair and in immune regulation, as well as
in many other processes. The superfamily consists of the TGFβ/activin family and
the larger bone morphogenetic protein (BMP) family.

All of these proteins act through enzyme-coupled receptors that are single-pass
transmembrane proteins with a serine/threonine kinase domain on the
cytosolic side of the plasma membrane. There are two classes of these receptor
serine/threonine kinases, type I and type II, which are structurally similar
homodimers. Each member of the TGFβ superfamily binds to a characteristic
combination of type-I and type-II receptor dimers, bringing the kinase domains
together so that the type-II receptor can phosphorylate and activate the type-I
receptor, forming an active tetrameric receptor complex.

Once activated, the receptor complex uses a strategy for rapidly relaying the
signal to the nucleus that is very similar to the JAK-STAT strategy used by cytokine
receptors. The activated type-I receptor directly binds and phosphorylates
a latent transcription regulator of the Smad family (named after the first two
proteins identified, Sma in C. elegans and Mad in Drosophila). Activated
TGFβ/activin receptors phosphorylate Smad2 or Smad3, while activated BMP receptors
phosphorylate Smad1, Smad5, or Smad8. Once one of these receptor-activated
Smads (R-Smads) has been phosphorylated, it dissociates from the receptor and
binds to Smad4 (called a co-Smad), which can form a complex with any of the five
R-Smads. The Smad complex then translocates into the nucleus, where it associates
with other transcription regulators and controls the transcription of specific
target genes (Figure 15-57). Because the partner proteins in the nucleus vary
depending on the cell type and state of the cell, the genes affected vary.

Activated TGFβ receptors and their bound ligand are endocytosed by two distinct
routes, one leading to further activation and the other leading to inactivation.
The activation route depends on clathrin-coated vesicles and leads to early
endosomes (discussed in Chapter 13), where most of the Smad activation occurs.
An anchoring protein called SARA (for Smad anchor for receptor activation) has
an important role in this pathway; it is concentrated in early endosomes and binds
to both activated TGFβ receptors and Smads, increasing the efficiency of
receptor-mediated Smad phosphorylation. The inactivation route depends on caveolae
(discussed in Chapter 13) and leads to receptor ubiquitylation and degradation in
proteasomes.

Figure 15-57 The Smad-dependent signaling pathway activated by TGFβ. The TGFβ dimer
promotes the assembly of a tetrameric receptor complex containing two copies each of
the type-I and type-II receptors. The type-II receptors phosphorylate specific sites
on the type-I receptors, thereby activating their kinase domains and leading to
phosphorylation of R-Smads such as Smad2 and Smad3. Smads open up to expose a
dimerization surface when they are phosphorylated, leading to the formation of a
trimeric Smad complex containing two R-Smads and the co-Smad, Smad4. The
phosphorylated Smad complex enters the nucleus and collaborates with other
transcription regulators to control the transcription of specific target genes.

During the signaling response, the Smads shuttle continuously between
the cytoplasm and the nucleus: they are dephosphorylated in the nucleus and
exported to the cytoplasm, where they can be rephosphorylated by activated
receptors. In this way, the effect exerted on the target genes reflects both the
concentration of the extracellular signal and the time the signal continues to act on
the cell-surface receptors (often several hours). Cells exposed to a morphogen at
high concentration, or for a long time, or both, will switch on one set of genes,
whereas cells receiving a lower or more transient exposure will switch on another
set.
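
The way continuous shuttling lets a cell register both the strength and the duration of a signal can be illustrated with a toy simulation. The Python sketch below is an assumption-laden illustration, not a model taken from the text: it treats the Smad pool as either cytoplasmic or nuclear (phosphorylated), uses hypothetical first-order rate constants k_phos and k_dephos in arbitrary units, and takes the accumulated nuclear Smad level over time as a crude stand-in for the effect on target genes.

```python
def nuclear_smad_exposure(signal: float, duration: float,
                          k_phos: float = 1.0, k_dephos: float = 0.5,
                          dt: float = 0.01, t_end: float = 20.0) -> float:
    """Accumulated nuclear Smad level (arbitrary units) in a toy shuttling model.

    Assumed model: the total Smad pool is normalized to 1. Cytoplasmic Smad is
    phosphorylated and imported at rate k_phos * signal while the signal is
    present; nuclear Smad is dephosphorylated and exported at rate k_dephos.
    """
    nuclear = 0.0   # fraction of the Smad pool currently in the nucleus
    exposure = 0.0  # time integral of the nuclear fraction
    steps = int(round(t_end / dt))
    for i in range(steps):
        t = i * dt
        current_signal = signal if t < duration else 0.0
        cytoplasmic = 1.0 - nuclear
        # Forward Euler step for the shuttling balance.
        nuclear += dt * (k_phos * current_signal * cytoplasmic - k_dephos * nuclear)
        exposure += nuclear * dt
    return exposure


if __name__ == "__main__":
    # Hypothetical signal strengths and durations, chosen only for comparison.
    print("low, brief :", round(nuclear_smad_exposure(0.2, 5.0), 2))
    print("high, brief:", round(nuclear_smad_exposure(2.0, 5.0), 2))
    print("low, long  :", round(nuclear_smad_exposure(0.2, 15.0), 2))
```

In this sketch, either a stronger signal or a longer-lasting one increases the accumulated nuclear Smad level, because dephosphorylated Smads must keep returning to active receptors to be rephosphorylated.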

As in other signaling systems, negative feedback regulates the Smad pathway.
Among the target genes activated by Smad complexes are those that encode
inhibitory Smads, either Smad6 or Smad7. Smad7 (and possibly Smad6) binds
to the cytosolic tail of the activated receptor and inhibits its signaling ability in
at least three ways: (1) it competes with R-Smads for binding sites on the receptor,
decreasing R-Smad phosphorylation; (2) it recruits a ubiquitin ligase called
Smurf, which ubiquitylates the receptor, leading to receptor internalization and
degradation (it is because Smurfs also ubiquitylate and promote the degradation
of Smads that they are called Smad ubiquitylation regulatory factors, or Smurfs);
and (3) it recruits a protein phosphatase that dephosphorylates and inactivates
the receptor. In addition, the inhibitory Smads bind to the co-Smad, Smad4, and
inhibit it, either by preventing its binding to R-Smads or by promoting its
ubiquitylation and degradation.

Although receptor serine/threonine kinases operate mainly through the Smad
pathway just described, they can also stimulate other intracellular signaling proteins
such as MAP kinases and PI 3-kinase. Conversely, signaling proteins in other
pathways can phosphorylate Smads and thereby influence signaling along the
Smad pathway.

Summary 

There are various classes of enzyme-coupled receptors, the most common of which
are receptor tyrosine kinases (RTKs), tyrosine-kinase-associated receptors, and
receptor serine/threonine kinases.
Ligand binding to RTKs causes their dimerization, which leads to activation
of their kinase domains. These activated kinase domains phosphorylate multiple
tyrosines on the receptors, producing a set of phosphotyrosines that serve as docking
sites for a set of intracellular signaling proteins, which bind via their SH2 (or
PTB) domains. One such signaling protein serves as an adaptor to couple some
activated receptors to a Ras-GEF (Sos), which activates the monomeric GTPase Ras;
Ras, in turn, activates a three-component MAP kinase signaling module, which
relays the signal to the nucleus by phosphorylating transcription regulatory proteins.
Another important signaling protein that can dock on activated RTKs is PI
3-kinase, which phosphorylates specific phosphoinositides to produce lipid docking
sites in the plasma membrane for signaling proteins with phosphoinositide-binding
PH domains, including the serine/threonine protein kinase Akt (PKB), which
plays a key part in the control of cell survival and cell growth. Many receptor classes,
including some RTKs, activate Rho family monomeric GTPases, which functionally
couple the receptors to the cytoskeleton.

Tyrosine-kinase-associated receptors depend on various cytoplasmic tyrosine
kinases for their action. These kinases include members of the Src family, which
associate with many kinds of receptors, and the focal adhesion kinase (FAK), which
associates with integrins at focal adhesions. The cytoplasmic tyrosine kinases then
phosphorylate a variety of signaling proteins to relay the signal onward. The largest
family of receptors in this class is the cytokine receptor family. When stimulated
by ligand binding, these receptors activate JAK cytoplasmic tyrosine kinases, which
phosphorylate STATs. The STATs then dimerize, translocate to the nucleus, and activate
the transcription of specific genes. Receptor serine/threonine kinases, which
are activated by signal proteins of the TGFβ superfamily, act similarly: they directly
phosphorylate and activate Smads, which then oligomerize with another Smad,
translocate to the nucleus, and regulate gene transcription.

ALTERNATIVE SIGNALING ROUTES IN GENE 
REGULATION 

Major changes in the behavior of a cell tend to depend on changes in the expression
of numerous genes. Thus, many extracellular signaling molecules carry out
their effects, in whole or in part, by initiating signaling pathways that change
the activities of transcription regulators. There are numerous examples of gene
regulation in both GPCR and enzyme-coupled receptor pathways (see Figures
15-27 and 15-49). In this section, we describe some of the less common signaling
mechanisms by which gene expression can be controlled. We begin with several
pathways that depend on regulated proteolysis to control the activity and location
of latent transcription regulators. We then turn to a class of extracellular signal
molecules that do not employ cell-surface receptors but enter the cell and interact
directly with transcription regulators to perform their functions. Finally, we
briefly discuss some of the mechanisms by which gene expression is controlled by
the circadian rhythm: the daily cycle of light and dark.

The Receptor Notch Is a Latent Transcription Regulatory Protein 

Signaling through the Notch receptor protein is used widely in animal development.
As discussed in Chapter 22, it has a general role in controlling cell fate
choices and regulating pattern formation during the development of most tissues,
as well as in the continual renewal of tissues such as the lining of the gut. It is best
known, however, for its role in the production of Drosophila neural cells, which
usually arise as isolated single cells within an epithelial sheet of precursor cells.
During this process, when a precursor cell commits to becoming a neural cell, it
signals to its immediate neighbors not to do the same; the inhibited cells develop
into epidermal cells instead. This process, called lateral inhibition, depends on a
contact-dependent signaling mechanism that is activated by a single-pass transmembrane
signal protein called Delta, displayed on the surface of the future
neural cell. By binding to the Notch receptor protein on a neighboring cell, Delta
signals to the neighbor not to become neural (Figure 15-58). When this signaling
process is defective, a huge excess of neural cells is produced at the expense of
epidermal cells, which is lethal.
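
The competition that underlies lateral inhibition can be illustrated with a toy simulation. The Python sketch below is not taken from this chapter. It assumes two adjacent cells, a Notch activity in each cell that rises with the Delta level of its neighbor, and a Delta level that is repressed by the cell's own Notch activity. The function names, the Hill-type response curves, and all parameter values are illustrative assumptions rather than measured quantities.

```python
def hill_activation(x, a=0.01, k=2):
    """Notch activation as a rising function of the neighbor's Delta (assumed form)."""
    return x**k / (a + x**k)


def hill_repression(x, b=100.0, h=2):
    """Delta production as a falling function of the cell's own Notch activity (assumed form)."""
    return 1.0 / (1.0 + b * x**h)


def simulate_pair(delta, notch, t_end=50.0, dt=0.01):
    """Integrate the two-cell model with forward Euler steps.

    delta, notch: two-element sequences giving the starting levels in cell 0 and cell 1.
    Returns the final (delta, notch) lists.
    """
    delta, notch = list(delta), list(notch)
    for _ in range(int(t_end / dt)):
        # Each cell's Notch is driven by the Delta displayed on the OTHER cell (index 1 - i).
        new_notch = [notch[i] + dt * (hill_activation(delta[1 - i]) - notch[i])
                     for i in range(2)]
        # Each cell's Delta is repressed by its OWN Notch activity.
        new_delta = [delta[i] + dt * (hill_repression(notch[i]) - delta[i])
                     for i in range(2)]
        delta, notch = new_delta, new_notch
    return delta, notch


# Cell 0 starts with slightly more Delta than cell 1; Notch activity starts equal.
delta, notch = simulate_pair(delta=[0.08, 0.07], notch=[0.35, 0.35])
print("Delta:", [round(d, 2) for d in delta])
print("Notch:", [round(n, 2) for n in notch])
```

In this sketch the small initial difference is amplified: the cell that starts with slightly more Delta ends up Delta-high and Notch-low (the future neural cell in the analogy), while its neighbor ends up Notch-high and Delta-low (the future epidermal cell).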

Notch is a single-pass transmembrane protein that requires proteolytic processing
to function. It acts as a latent transcription regulator and provides the simplest
and most direct signaling pathway known from a cell-surface receptor to the
nucleus. When activated by the binding of Delta on another cell, a plasma-membrane-bound
protease cleaves off the cytoplasmic tail of Notch, and the released
tail translocates into the nucleus to activate the transcription of a set of Notch-response
genes. The Notch tail fragment acts by binding to a DNA-binding protein,
converting it from a transcriptional repressor into a transcriptional activator.

The Notch receptor undergoes three successive proteolytic cleavage steps, but
only the last two depend on Delta binding. As part of its normal biosynthesis, it is
cleaved in the Golgi apparatus to form a heterodimer, which is then transported
to the cell surface as the mature receptor. The binding of Delta to Notch induces
a second cleavage in the extracellular domain, mediated by an extracellular protease.
A final cleavage quickly follows, cutting free the cytoplasmic tail of the activated
receptor (Figure 15-59). Note that, unlike most receptors, the activation of
Notch is irreversible; once activated by ligand binding, the protein cannot be used
again.

This final cleavage of the Notch tail occurs just within the transmembrane segment,
and it is mediated by a protease complex called γ-secretase, which is also
responsible for the intramembrane cleavage of various other proteins. One of its
essential subunits is Presenilin, so called because mutations in the gene encoding
it are a frequent cause of early-onset, familial Alzheimer's disease, a form of
presenile dementia. The protease complex is thought to contribute to this and
other forms of Alzheimer's disease by generating extracellular peptide fragments
from a transmembrane neuronal protein; the fragments accumulate in excessive
amounts and form aggregates of misfolded protein called amyloid plaques, which
may injure nerve cells and contribute to their degeneration and loss.

Both Notch and Delta are glycoproteins, and their interaction is regulated by
the glycosylation of Notch. The Fringe family of glycosyl transferases, in particular,
adds extra sugars to the O-linked oligosaccharide (discussed in Chapter 13) on
Notch, which alters the specificity of Notch for its ligands. This has provided the
first example of the modulation of ligand-receptor signaling by differential receptor
glycosylation.


Figure 15-58 Lateral inhibition mediated by Notch and Delta during neural cell
development in Drosophila. When individual cells in the epithelium begin
to develop as neural cells, they signal to their neighbors not to do the same. This
inhibitory, contact-dependent signaling is mediated by the ligand Delta, which
appears on the surface of the future neural cell and binds to Notch receptor proteins
on the neighboring cells. In many tissues, all the cells in a cluster initially express
both Delta and Notch, and a competition occurs, with one cell emerging as winner,
expressing Delta strongly and inhibiting its neighbors from doing likewise. In other
cases, additional factors interact with Delta or Notch to make some cells susceptible
to the lateral inhibition signal and others unresponsive to it.


Figure 15-59 The processing and activation of Notch by proteolytic cleavage. The numbered red arrowheads indicate the
sites of proteolytic cleavage. The first proteolytic processing step occurs within the trans Golgi network to generate the mature
heterodimeric Notch receptor that is then displayed on the cell surface. The binding to Delta on a neighboring cell triggers the
next two proteolytic steps: the complex of Delta and the Notch fragment to which it is bound is endocytosed by the
Delta-expressing cell, exposing the extracellular cleavage site in the transmembrane Notch subunit. Note that Notch and Delta interact
through their repeated EGF-like domains. The released Notch tail migrates into the nucleus, where it binds to the Rbpsuh
protein, which it converts from a transcriptional repressor to a transcriptional activator.


Wnt Proteins Bind to Frizzled Receptors and Inhibit the
Degradation of β-Catenin

Wnt proteins are secreted signal molecules that act as local mediators and morphogens
to control many aspects of development in all animals that have been
studied. They were discovered independently in flies and in mice: in Drosophila,
the Wingless (Wg) gene originally came to light because of its role as a morphogen
in wing development, while in mice, the Int1 gene was found because it promoted
the development of breast tumors when activated by the integration of a virus next
to it. Both of these genes encode Wnt proteins. Wnts are unusual as secreted proteins
in that they have a fatty acid chain covalently attached to their N-terminus,
which increases their binding to cell surfaces. There are 19 Wnts in humans, each
having distinct, but often overlapping, functions.

Wnts can activate at least two types of intracellular signaling pathways. Our
primary focus here is the Wnt/β-catenin pathway (also known as the canonical
Wnt pathway), which is centered on the latent transcription regulator β-catenin.
A second pathway, called the planar polarity pathway, coordinates the polarization
of cells in the plane of a developing epithelium and depends on Rho family
GTPases. Both of these pathways begin with the binding of Wnts to Frizzled
family cell-surface receptors, which are seven-pass transmembrane proteins that
resemble GPCRs in structure but do not generally work through the activation
of G proteins. Instead, when activated by Wnt binding, Frizzled proteins recruit
the scaffold protein Dishevelled, which helps relay the signal to other signaling
molecules.

The Wnt/β-catenin pathway acts by regulating the proteolysis of the multifunctional
protein β-catenin (or Armadillo in flies). A portion of the cell's β-catenin
is located at cell-cell junctions and thereby contributes to the control of cell-cell
adhesion (discussed in Chapter 19), while the remaining β-catenin is rapidly
degraded in the cytoplasm. Degradation depends on a large protein degradation
complex, which binds β-catenin and keeps it out of the nucleus while promoting
its degradation. The complex contains at least four other proteins: a protein
kinase called casein kinase 1 (CK1) phosphorylates the β-catenin on a serine,
priming it for further phosphorylation by another protein kinase called glycogen
synthase kinase 3 (GSK3); this final phosphorylation marks the protein for
ubiquitylation and rapid degradation in proteasomes. Two scaffold proteins called
axin and Adenomatous polyposis coli (APC) hold the protein complex together
(Figure 15-60A). APC gets its name from the finding that the gene encoding it
is often mutated in a type of benign tumor (adenoma) of the colon; the tumor
projects into the lumen as a polyp and can eventually become malignant. (This
APC should not be confused with the anaphase-promoting complex, or APC/C,
that plays a central part in selective protein degradation during the cell cycle; see
Figure 17-15A.)

Wnt proteins regulate β-catenin proteolysis by binding to both a Frizzled protein
and a co-receptor that is related to the low-density lipoprotein (LDL) receptor
(discussed in Chapter 13) and is therefore called an LDL-receptor-related protein
(LRP). In a poorly understood process, the activated receptor complex recruits
the Dishevelled scaffold and promotes the phosphorylation of the LRP receptor
by the two protein kinases, GSK3 and CK1. Axin is brought to the receptor complex
and inactivated, thereby disrupting the β-catenin degradation complex in
the cytoplasm. In this way, the phosphorylation and degradation of β-catenin are
prevented, enabling unphosphorylated β-catenin to accumulate and translocate
to the nucleus, where it alters the pattern of gene transcription (Figure 15-60B).
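
Why does disrupting the degradation complex cause β-catenin to accumulate? A minimal steady-state calculation gives one way to see it. The Python sketch below is a simplification introduced here, not a model from this chapter. It assumes that free β-catenin is made at a constant rate and removed by a slow basal route plus a faster route that depends on the activity of the degradation complex. The function name, the rate constants, and the single "complex_activity" parameter are all hypothetical.

```python
def beta_catenin_steady_state(complex_activity, synthesis_rate=1.0,
                              k_complex=1.0, k_basal=0.05):
    """Steady-state level of free beta-catenin, in arbitrary units.

    Solves 0 = synthesis_rate - (k_basal + k_complex * complex_activity) * B for B.
    complex_activity is 1.0 for a fully active degradation complex and 0.0 when
    the complex has been disassembled (assumed to be the effect of a Wnt signal).
    """
    if not 0.0 <= complex_activity <= 1.0:
        raise ValueError("complex_activity must lie between 0 and 1")
    return synthesis_rate / (k_basal + k_complex * complex_activity)


without_wnt = beta_catenin_steady_state(complex_activity=1.0)
with_wnt = beta_catenin_steady_state(complex_activity=0.0)
print(f"without Wnt: {without_wnt:.2f}   with Wnt: {with_wnt:.2f}")
```

With these assumed rates, switching off the complex raises the steady-state level roughly twentyfold. In the same sketch, any change that lowers complex_activity has the same effect, whether or not a Wnt signal is present.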


[Figure 15-60 diagram. Panel (A), WITHOUT Wnt SIGNAL: LRP, inactive Dishevelled, β-catenin; Wnt target genes off. Panel (B), WITH Wnt SIGNAL: Wnt, activated LRP; transcription of Wnt target genes.]


Figure 15-60 The Wnt/β-catenin signaling pathway. (A) In the absence of
a Wnt signal, β-catenin that is not bound to cell-cell adherens junctions (not shown)
interacts with a degradation complex containing APC, axin, GSK3, and CK1. In
this complex, β-catenin is phosphorylated by CK1 and then by GSK3, triggering
its ubiquitylation and degradation in proteasomes. Wnt-responsive genes are
kept inactive by the Groucho co-repressor protein bound to the transcription regulator
LEF1/TCF. (B) Wnt binding to Frizzled and LRP clusters the two co-receptors
together, and the cytosolic tail of LRP is phosphorylated by GSK3 and then by CK1.
Axin binds to the phosphorylated LRP and is inactivated and/or degraded, resulting in
disassembly of the degradation complex. The phosphorylation of β-catenin is
thereby prevented, and unphosphorylated β-catenin accumulates and translocates
to the nucleus, where it binds to LEF1/TCF, displaces the co-repressor Groucho,
and acts as a coactivator to stimulate the transcription of Wnt target genes. The
scaffold protein Dishevelled is required for the signaling pathway to operate; it binds
to Frizzled and becomes phosphorylated (not shown), but its precise role is
unknown.
















In the absence of Wnt signaling, Wnt-responsive genes are kept silent by an
inhibitory complex of transcription regulatory proteins. The complex includes
proteins of the LEF1/TCF family bound to a co-repressor protein of the Groucho
family (see Figure 15-60A). In response to a Wnt signal, β-catenin enters the
nucleus and binds to the LEF1/TCF proteins, displacing Groucho. The β-catenin
now functions as a coactivator, inducing the transcription of the Wnt target genes
(see Figure 15-60B). Thus, as in the case of Notch signaling, Wnt/β-catenin signaling
triggers a switch from transcriptional repression to transcriptional activation.

Among the genes activated by β-catenin is Myc, which encodes a protein
(Myc) that is an important regulator of cell growth and proliferation (discussed
in Chapter 17). Mutations of the Apc gene occur in 80% of human colon cancers
(discussed in Chapter 20). These mutations inhibit the protein's ability to bind
β-catenin, so that β-catenin accumulates in the nucleus and stimulates the transcription
of c-Myc and other Wnt target genes, even in the absence of Wnt signaling.
The resulting uncontrolled cell growth and proliferation promote the development
of cancer.

Various secreted inhibitory proteins regulate Wnt signaling in development. 
Some bind to the LRP receptors and promote their down-regulation, whereas 
others compete with Frizzled receptors for secreted Wnts. In Drosophila at least, 
Wnts activate negative feedback loops, in which Wnt target genes encode proteins
that help shut the response off; some of these proteins inhibit Dishevelled, and 
others are secreted inhibitors. 

Hedgehog Proteins Bind to Patched, Relieving Its Inhibition of 
Smoothened 

Hedgehog proteins and Wnt proteins act in similar ways. Both are secreted signal
molecules, which act as local mediators and morphogens in many developing
invertebrate and vertebrate tissues. Both proteins are modified by covalently
attached lipids, depend on secreted or cell-surface-bound heparan sulfate proteoglycans
(discussed in Chapter 19) for their action, and activate latent transcription
regulators by inhibiting their degradation. They both trigger a switch from transcriptional
repression to transcriptional activation, and excessive signaling along
either pathway in adult cells can lead to cancer. They even use some of the same
intracellular signaling proteins and sometimes collaborate to mediate a response.

The Hedgehog proteins were discovered in Drosophila, where this protein
family has only one member. Mutation of the Hedgehog gene produces a larva
covered with spiky processes (denticles), like a hedgehog. At least three genes
encode Hedgehog proteins in vertebrates: Sonic, Desert, and Indian hedgehog.
The active forms of all Hedgehog proteins are covalently coupled to cholesterol, as
well as to a fatty acid chain. The cholesterol is added during an unusual processing
step, in which a precursor protein cleaves itself to produce a smaller,
cholesterol-containing signal protein. Most of what we know about the Hedgehog signaling
pathway came initially from genetic studies in flies, and it is the fly pathway
that we summarize here.

The effects of Hedgehog are mediated by a latent transcription regulator called
Cubitus interruptus (Ci), the regulation of which is reminiscent of the regulation
of β-catenin by Wnts. In the absence of a Hedgehog signal, Ci is ubiquitylated and
proteolytically cleaved in proteasomes. Instead of being completely degraded,
however, Ci is processed to form a smaller fragment, which accumulates in the
nucleus, where it acts as a transcriptional repressor, helping to keep
Hedgehog-responsive genes silent. The proteolytic processing of the Ci protein depends
on its phosphorylation by three protein kinases: PKA and two kinases also used
in the Wnt pathway, namely GSK3 and CK1. As in the Wnt pathway, the proteolytic
processing occurs in a multiprotein complex. The complex includes the protein
kinase Fused and a scaffold protein Costal2, which stably associates with Ci,
recruits the three other kinases, and binds the complex to microtubules, thereby
keeping unprocessed Ci out of the nucleus (Figure 15-61A).

Hedgehog functions by blocking the proteolytic processing of Ci, thereby
changing it into a transcriptional activator. It does this by a convoluted signaling
process that depends on three transmembrane proteins: Patched, iHog, and
Smoothened. Patched is predicted to cross the plasma membrane 12 times, and,
although much of it is in intracellular vesicles, some is on the cell surface where
it can bind the Hedgehog protein. iHog is also on the cell surface and is thought
to serve as a co-receptor for Hedgehog. Smoothened is a seven-pass transmembrane
protein with a structure very similar to a GPCR, but it does not seem to act
as a Hedgehog receptor or even as an activator of G proteins; it is controlled by
Patched and iHog.

Figure 15-61 Hedgehog signaling in Drosophila. (A) In the absence of
Hedgehog, most Patched is in intracellular vesicles (not shown), where it keeps
Smoothened inactive and sequestered. The Ci protein is bound in a cytosolic protein
degradation complex, which includes the protein kinase Fused and the scaffold
protein Costal2. Costal2 recruits three other protein kinases (PKA, GSK3, and
CK1; not shown), which phosphorylate Ci. Phosphorylated Ci is ubiquitylated and
then cleaved in proteasomes (not shown) to form a transcriptional repressor, which
accumulates in the nucleus to help keep Hedgehog target genes inactive.
(B) Hedgehog binding to iHog and Patched removes the inhibition of Smoothened by
Patched. Smoothened is phosphorylated by PKA and CK1 and translocates to the
plasma membrane, where it recruits the complex containing Fused, Costal2, and
Ci. Costal2 releases unprocessed Ci, which accumulates in the nucleus and activates
the transcription of Hedgehog target genes. Many details in the pathway are poorly
understood, including the role of Fused.

In the absence of a Hedgehog signal, Patched employs an unknown mechanism
to keep Smoothened sequestered and inactive in intracellular vesicles
(see Figure 15-61A). The binding of Hedgehog to iHog and Patched inhibits the
activity of Patched and induces its endocytosis and degradation. The result is that
Smoothened is liberated from inhibition and translocates to the plasma membrane,
where it recruits the protein complex containing Ci, Fused, and Costal2.
Costal2 is no longer able to bind the other three kinases, and so Ci is no longer
cleaved and can now enter the nucleus and activate the transcription of Hedgehog
target genes (Figure 15-61B). Among the genes activated by Ci is Patched
itself; the resulting increase in Patched protein on the cell surface inhibits further
Hedgehog signaling, providing another example of negative feedback.
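
The fly Hedgehog pathway is a chain of inhibitions, which makes its net effect easy to misread. The Python sketch below restates the chain as Boolean logic only. It is a deliberately crude simplification: the function name and dictionary keys are hypothetical, every component is treated as fully on or fully off, and iHog, Fused, Costal2, and the Patched feedback loop are left out.

```python
def hedgehog_pathway(hedgehog_present: bool) -> dict:
    """Return the on/off state of each step for a given Hedgehog input."""
    # Hedgehog binding inhibits Patched.
    patched_active = not hedgehog_present
    # Active Patched keeps Smoothened sequestered and inactive.
    smoothened_active = not patched_active
    # Only when Smoothened is inactive is Ci phosphorylated and cleaved
    # into the repressor fragment.
    ci_processed = not smoothened_active
    return {
        "patched_active": patched_active,
        "smoothened_active": smoothened_active,
        "ci_form": "repressor fragment" if ci_processed else "full-length activator",
        "target_genes_on": not ci_processed,
    }


for signal in (False, True):
    print(signal, hedgehog_pathway(signal))
```

Written this way, the three successive inhibitions leave one net effect: Hedgehog turns its target genes on by preventing the production of their repressor.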

Many gaps remain in our understanding of the Hedgehog signaling pathway. It
is not known, for example, how Patched keeps Smoothened inactive and intracellular.
As the structure of Patched resembles a transmembrane transporter protein,
it has been proposed that it may transport a small molecule into the cell that keeps
Smoothened sequestered in vesicles.

Even less is known about the more complex Hedgehog pathway in vertebrate
cells. In addition to there being at least three types of vertebrate Hedgehog proteins,
there are three Ci-like transcription regulator proteins (Gli1, Gli2, and Gli3)
downstream of Smoothened. Gli2 and Gli3 are most similar to Ci in structure and
function, and Gli3 has been shown to undergo proteolytic processing like Ci and
to act as either a transcriptional repressor or a transcriptional activator. Moreover,
in vertebrates, Smoothened, upon activation, becomes localized to the surface of
the primary cilium (discussed in Chapter 16), where the Gli proteins are also
concentrated, thereby increasing the speed and efficiency of signaling.

Hedgehog signaling can promote cell proliferation, and excessive Hedgehog
signaling can lead to cancer. Inactivating mutations in one of the two human
Patched genes, for example, which lead to excessive Hedgehog signaling, occur
frequently in basal cell carcinoma of the skin, the most common form of cancer in
Caucasians. A small molecule called cyclopamine, made by a meadow lily, is being
used to treat cancers associated with excessive Hedgehog signaling. It blocks
Hedgehog signaling by binding tightly to Smoothened and inhibiting its activity.
It was originally identified because it causes severe developmental defects in the
progeny of sheep grazing on such lilies; these include the presence of a single central
eye (a condition called cyclopia), which is also seen in mice that are deficient
in Hedgehog signaling.

Many Stressful and Inflammatory Stimuli Act Through
an NFκB-Dependent Signaling Pathway

The NFκB proteins are latent transcription regulators that are present in most
animal cells and are central to many stressful, inflammatory, and innate immune
responses. These responses occur as a reaction to infection or injury and help protect
stressed multicellular organisms and their cells (discussed in Chapter 24). An
excessive or inappropriate inflammatory response in animals can also damage
tissue and cause severe pain, and chronic inflammation can lead to cancer; as
in the case of Wnt and Hedgehog signaling, excessive NFκB signaling is found
in a number of human cancers. NFκB proteins also have important roles during
normal animal development: the Drosophila NFκB family member Dorsal, for
example, has a crucial role in specifying the dorsal-ventral axis of the developing
fly embryo (discussed in Chapter 22).

Various cell-surface receptors activate the NFκB signaling pathway in animal
cells. Toll receptors in Drosophila and Toll-like receptors in vertebrates, for example,
recognize pathogens and activate this pathway in triggering innate immune
responses (discussed in Chapter 24). The receptors for tumor necrosis factor α
(TNFα) and interleukin-1 (IL1), which are vertebrate cytokines especially important
in inducing inflammatory responses, also activate this signaling pathway. The
Toll, Toll-like, and IL1 receptors belong to the same family of proteins, whereas
TNF receptors belong to a different family; all of them, however, act in similar ways
to activate NFκB. When activated, they trigger a multiprotein ubiquitylation and
phosphorylation cascade that releases NFκB from an inhibitory protein complex,
so that it can translocate to the nucleus and turn on the transcription of hundreds
of genes that participate in inflammatory and innate immune responses.

There are five NFκB proteins in mammals (RelA, RelB, c-Rel, NFκB1, and
NFκB2), and they form a variety of homodimers and heterodimers, each of which
activates its own characteristic set of genes. Inhibitory proteins called IκB bind
tightly to the dimers and hold them in an inactive state within the cytoplasm of
unstimulated cells. There are three major IκB proteins in mammals (IκB α, β, and
ε), and the signals that release NFκB dimers do so by triggering a signaling pathway
that leads to the phosphorylation, ubiquitylation, and consequent degradation
of the IκB proteins (Figure 15-62).

Figure 15-62 The activation of the NFκB pathway by TNFα. Both TNFα
and its receptors are trimers. The binding of TNFα causes a rearrangement of the
clustered cytosolic tails of the receptors, which now recruit various signaling
proteins, resulting in the activation of a protein kinase that phosphorylates
and activates IκB kinase kinase (IKK). IKK is a heterotrimer composed of two
kinase subunits (IKKα and IKKβ) and a regulatory subunit called NEMO. IKKβ then
phosphorylates IκB on two serines, which marks the protein for ubiquitylation and
degradation in proteasomes. The released NFκB translocates into the nucleus, where,
in collaboration with coactivator proteins, it stimulates the transcription of its target
genes.

Among the genes activated by the released NFκB is the gene that encodes
IκBα. This activation leads to increased synthesis of IκBα protein, which binds
to NFκB and inactivates it, creating a negative feedback loop (Figure 15-63A).
Experiments on TNFα-induced responses, as well as computer modeling studies
of the responses, indicate that the negative feedback produces two types of
NFκB responses, depending on the duration of the TNFα stimulus; importantly,
the two types of responses induce different patterns of gene expression (Figure
15-63B, C, and D). The negative feedback through IκBα is required for both types
of responses: in cells deficient in IκBα, even a short exposure to TNFα induces a
sustained activation of NFκB, without oscillations, and all of the NFκB-responsive
genes are activated.
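
A very small Boolean model is enough to show how one delayed negative feedback loop can produce a single pulse, repeated oscillations, or sustained activation. The Python sketch below is a toy introduced here for illustration. It is not the computer model used in the studies cited for Figure 15-63. It assumes discrete time steps, on/off states, free NFκB that stays active until IκBα captures it, and IκBα that appears one step after NFκB becomes active and lasts a single step. The function and parameter names are hypothetical.

```python
def simulate_nfkb(stimulus, ikb_feedback=True):
    """Return the NFkB activity (0 or 1) at each time step, starting from rest.

    stimulus: sequence of 0/1 values, one per time step (1 = TNF-alpha present).
    ikb_feedback: set to False to mimic cells that cannot make IkB-alpha.
    """
    nfkb, ikb = 0, 0
    trace = [nfkb]
    for stim in stimulus:
        # NFkB is free if it is released by the stimulus or was already free,
        # provided that no newly made IkB-alpha is present to capture it.
        next_nfkb = int((stim or nfkb) and not ikb)
        # Active NFkB switches on IkB-alpha synthesis; the protein appears one step later.
        next_ikb = int(nfkb) if ikb_feedback else 0
        nfkb, ikb = next_nfkb, next_ikb
        trace.append(nfkb)
    return trace


short_pulse = [1] + [0] * 7   # stimulus present for the first step only
sustained = [1] * 8           # stimulus present throughout

print("short stimulus:    ", simulate_nfkb(short_pulse))
print("sustained stimulus:", simulate_nfkb(sustained))
print("no IkB feedback:   ", simulate_nfkb(short_pulse, ikb_feedback=False))
```

In this toy, a short stimulus gives one pulse of NFκB activity, a sustained stimulus gives repeated pulses, and removing the feedback turns even a short stimulus into sustained activation. The Boolean oscillations do not damp down as the measured ones do, because graded protein levels are not represented.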

Thus far, we have focused on the mechanisms by which extracellular signal
molecules use cell-surface receptors to initiate changes in gene expression. We
now turn to a class of extracellular signals that bypasses the plasma membrane
entirely and controls, in the most direct way possible, transcription regulatory
proteins inside the cell.

Nuclear Receptors Are Ligand-Modulated Transcription Regulators 

Various small, hydrophobic signal molecules diffuse directly across the plasma
membrane of target cells and bind to intracellular receptors that are transcription
regulators. These signal molecules include steroid hormones, thyroid hormones,
retinoids, and vitamin D. Although they differ greatly from one another in both
chemical structure (Figure 15-64) and function, they all act by a similar mechanism.
They bind to their respective intracellular receptor proteins and alter the
ability of these proteins to control the transcription of specific genes. Thus, these
proteins serve both as intracellular receptors and as intracellular effectors for the
signal.

Figure 15-63 Negative feedback in the NFκB signaling pathway induces oscillations in NFκB activation. (A) Drawing
showing how activated NFκB stimulates the transcription of the IκBα gene, the protein product of which acts back in
the cytoplasm to sequester and inhibit NFκB there; if the stimulus is persistent, the newly made IκBα protein will then be
ubiquitylated and degraded, liberating active NFκB again so that it can return to the nucleus and activate transcription (see
Figure 15-62). (B) A short exposure to TNFα produces a single, short pulse of NFκB activation, beginning within minutes and
ending by 1 hour. This response turns on the transcription of gene A but not gene B. (C) A sustained exposure to TNFα for
the entire 6 hours of the experiment produces oscillations in NFκB activation that damp down over time. This response turns
on the transcription of both genes; gene B turns on only after several hours, indicating that gene B transcription requires
prolonged activation of NFκB, for reasons that are not understood. (D) These time-lapse confocal fluorescence micrographs
from a different study of TNFα stimulation show the oscillations of NFκB in a cultured cell, as indicated by its periodic movement
into the nucleus (N) of a fusion protein composed of NFκB fused to a red fluorescent protein. In the cell at the center of the
micrographs, NFκB is active and in the nucleus at 6, 60, 210, 380, and 480 minutes, but it is exclusively in the cytoplasm at
0, 120, 300, 410, and 510 minutes. (A-C, based on data from A. Hoffmann et al., Science 298:1241-1245, 2002, and adapted
from A.Y. Ting and D. Endy, Science 298:1189-1190, 2002; D, from D.E. Nelson et al., Science 306:704-708, 2004. All with
permission from AAAS.)

The receptors are all structurally related, being part of the very large nuclear
receptor superfamily. Many family members have been identified by DNA
sequencing only, and their ligand is not yet known; they are therefore referred
to as orphan nuclear receptors, and they make up large fractions of the nuclear
receptors encoded in the genomes of humans, Drosophila, and the nematode
C. elegans. Some mammalian nuclear receptors are regulated by intracellular
metabolites rather than by secreted signal molecules; the peroxisome
proliferation-activated receptors (PPARs), for example, bind intracellular lipid metabolites
and regulate the transcription of genes involved in lipid metabolism and fat-cell
differentiation. It seems likely that the nuclear receptors for hormones evolved
from such receptors for intracellular metabolites, which would help explain their
intracellular location.

Steroid hormones, which include cortisol, the steroid sex hormones, vitamin D
(in vertebrates), and the molting hormone ecdysone (in insects), are all
made from cholesterol. Cortisol is produced in the cortex of the adrenal glands
and influences the metabolism of many types of cells. The steroid sex hormones
are made in the testes and ovaries and are responsible for the secondary sex
characteristics that distinguish males from females. Vitamin D is synthesized in
the skin in response to sunlight; after it has been converted to its active form in
the liver or kidneys, it regulates Ca2+ metabolism, promoting Ca2+ uptake in the
gut and reducing its excretion in the kidneys. The thyroid hormones, which are
made from the amino acid tyrosine, act to increase the metabolic rate of many
cell types, while the retinoids, such as retinoic acid, are made from vitamin A and
have important roles as local mediators in vertebrate development. Although all
of these signal molecules are relatively insoluble in water, they are made soluble
for transport in the bloodstream and other extracellular fluids by binding to specific
carrier proteins, from which they dissociate before entering a target cell (see
Figure 15-3B).

The nuclear receptors bind to specific DNA sequences adjacent to the genes
that the ligand regulates. Some of the receptors, such as those for cortisol, are
located primarily in the cytosol and enter the nucleus only after ligand binding;
others, such as the thyroid and retinoid receptors, are bound to DNA in the
nucleus even in the absence of ligand. In either case, the inactive receptors are
usually bound to inhibitory protein complexes. Ligand binding alters the conformation
of the receptor protein, causing the inhibitory complex to dissociate,
while also causing the receptor to bind coactivator proteins that stimulate gene
transcription (Figure 15-65). In other cases, however, ligand binding to a nuclear
receptor inhibits transcription: some thyroid hormone receptors, for example, act
as transcriptional activators in the absence of their hormone and become
transcriptional repressors when hormone binds.

Thus far, we have focused on the control of gene expression by extracellular
signal molecules produced by other cells. We now turn to gene regulation by a
more global environmental signal: the cycle of light and darkness that results from
the Earth's rotation.


Figure 15-64 Some signal molecules that bind to intracellular receptors. Note
that all of them are small and hydrophobic. The active, hydroxylated form of
vitamin D3 is shown. Estradiol and testosterone are steroid sex hormones.


Circadian Clocks Contain Negative Feedback Loops That Control
Gene Expression

Life on Earth evolved in the presence of a daily cycle of day and night, and many
present-day organisms (ranging from archaea to plants and humans) possess an
internal rhythm that dictates different behaviors at different times of day. These
behaviors range from the cyclical change in metabolic enzyme activities of a
bacterium to the elaborate sleep-wake cycles of humans. The internal oscillators that
control such diurnal rhythms are called circadian clocks.

Having a circadian clock enables an organism to anticipate the regular daily
changes in its environment and take appropriate action in advance. Of course,
the internal clock cannot be perfectly accurate, and so it must be capable of being
reset by external cues such as the light of day. Thus, circadian clocks keep running
even when the environmental cues (changes in light and dark) are removed, but
the period of this free-running rhythm is generally a little less or more than 24
hours. External signals indicating the time of day cause small adjustments in
the running of the clock, so as to keep the organism in synchrony with its
environment. Following more drastic shifts, circadian cycles become gradually reset
(entrained) by the new cycle of light and dark, as anyone who has experienced jet
lag can attest.

Figure 15-65 The activation of nuclear receptors. All nuclear receptors
bind to DNA as either homodimers or heterodimers, but for simplicity we show
them as monomers. (A) The receptors all have a related structure, which includes
three major domains, as shown. An inactive receptor is bound to inhibitory proteins.
(B) Typically, the binding of ligand to the receptor causes the ligand-binding domain
of the receptor to clamp shut around the ligand, the inhibitory proteins to dissociate,
and coactivator proteins to bind to the receptor's transcription-activating domain,
thereby increasing gene transcription. In other cases, ligand binding has the
opposite effect, causing co-repressor proteins to bind to the receptor, thereby
decreasing transcription (not shown). (C) The structure of the ligand-binding
domain of the retinoic acid receptor is shown in the absence (left) and presence
(middle) of ligand (shown in red). When ligand binds, the blue α helix acts as a
lid that snaps shut, trapping the ligand in place. The shift in the conformation of the
receptor upon ligand binding also creates a binding site for a small α helix (orange) on
the surface of coactivator proteins. (PDB codes: 1LBD, 2ZYO, and 2ZXZ.)

We might expect that the circadian clock would be a complex multicellular
device, with different groups of cells responsible for different parts of the
oscillation mechanism. Remarkably, however, in almost all multicellular organisms,
including humans, the timekeepers are individual cells. Thus, a clock that
operates in each member of a specialized group of brain cells (the SCN cells in the
suprachiasmatic nucleus of the hypothalamus) controls our diurnal cycles of
sleeping and waking, body temperature, and hormone release. Even if these cells
are removed from the brain and dispersed in a culture dish, they will continue to
oscillate individually, showing a cyclic pattern of gene expression with a period of
approximately 24 hours. In the intact body, the SCN cells receive neural cues from
the retina, entraining the SCN cells to the daily cycle of light and dark; they also
send information about the time of day to another brain area, the pineal gland,
which relays the time signal to the rest of the body by releasing the hormone
melatonin in time with the clock.

Although the SCN cells have a central role as timekeepers in mammals, almost
all the other cells in the mammalian body have an internal circadian rhythm,
which has the ability to reset in response to light. Similarly, in Drosophila, many
different types of cells have a similar circadian clock, which continues to cycle
when they have been dissected away from the rest of the fly and can be reset by
externally imposed light and dark cycles.

The working of circadian clocks, therefore, is a fundamental problem in cell
biology. Although we do not yet understand all the details, studies in a wide
variety of organisms have revealed the basic principles and molecular components.
The key principle is that circadian clocks generally depend on negative feedback
loops. As discussed earlier, oscillations in the activity of an intracellular signaling
protein can occur if that protein inhibits its own activity with a long delay (see
Figure 15-18C and D). In Drosophila and many other animals, including humans,
the heart of the circadian clock is a delayed negative feedback loop based on
transcription regulators: accumulation of certain gene products switches off the
transcription of their own genes, but with a delay, so that the cell oscillates between a
state in which the products are present and transcription is switched off, and one
in which the products are absent and transcription is switched on (Figure 15-66).
The negative feedback underlying circadian rhythms does not have to be based on
transcription regulators. In some cell types, the circadian clock is constructed of
proteins that govern their own activities through post-translational mechanisms,
as we discuss next.
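
The principle of delayed negative feedback can be illustrated with a toy simulation. The sketch below is not a model of any real clock: it assumes a single hypothetical gene product x whose synthesis is repressed by the level that x had a fixed time tau earlier, and all parameter values (beta, gamma, K, n, tau) are arbitrary choices made for illustration, not measured quantities.

```python
def simulate_delayed_feedback(beta=2.0, gamma=0.2, K=1.0, n=4, tau=8.0,
                              dt=0.01, t_end=240.0, x0=0.1):
    """Euler integration of dx/dt = beta / (1 + (x(t - tau)/K)**n) - gamma * x(t).

    x     : level of a hypothetical gene product (arbitrary units)
    beta  : maximal synthesis rate (assumed value)
    gamma : first-order degradation rate, per hour (assumed value)
    K, n  : repression threshold and steepness (assumed values)
    tau   : delay, in hours, between production and repression (assumed value)
    """
    lag = int(round(tau / dt))
    steps = int(round(t_end / dt))
    xs = [x0] * (lag + 1)          # constant history on the interval [-tau, 0]
    for _ in range(steps):
        x_now = xs[-1]
        x_past = xs[-1 - lag]      # level one delay ago sets today's synthesis
        synthesis = beta / (1.0 + (x_past / K) ** n)
        xs.append(x_now + dt * (synthesis - gamma * x_now))
    return xs[lag:]                # values at t = 0, dt, 2*dt, ...


def count_peaks(xs):
    """Count local maxima in a sampled trace."""
    return sum(1 for i in range(1, len(xs) - 1) if xs[i - 1] < xs[i] >= xs[i + 1])


if __name__ == "__main__":
    trace = simulate_delayed_feedback()
    late = trace[len(trace) // 2:]          # discard the initial transient
    print("peaks in second half:", count_peaks(late))
    print("swing in second half:", max(late) - min(late))
```

With these assumed values the level of x keeps rising and falling rather than settling at a steady value. Rerunning with a much shorter delay (for example tau=1.0) lets the oscillation die away, which is why the delay is an essential ingredient of the loop.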


Figure 15-66 Simplified outline of the mechanism of the circadian clock in
Drosophila cells. A central feature of the clock is the periodic accumulation
and decay of two transcription regulatory proteins, Tim (short for timeless, based
on the phenotype of a gene mutation) and Per (short for period). The mRNAs
encoding these proteins rise gradually during the day and are translated in the
cytosol, where the two proteins associate to form a heterodimer. After a time delay,
the heterodimer dissociates and Tim and Per are transported into the nucleus, where
Per represses the Tim and Per genes, resulting in negative feedback that causes
the levels of Tim and Per to fall. In addition to this transcriptional feedback, the clock
depends on numerous other proteins. For example, the controlled degradation of Per
indicated in the diagram imposes delays in the accumulation of Tim and Per, which
are crucial to the functioning of the clock. Steps at which specific delays are imposed
are shown in red.

Entrainment (or resetting) of the clock occurs in response to new light-dark
cycles. Although most Drosophila cells do not have true photoreceptors, light is
sensed by intracellular flavoproteins, also called cryptochromes. In the presence of
light, these proteins associate with the Tim protein and cause its degradation,
thereby resetting the clock. (Adapted from J.C. Dunlap, Science 311:184-186, 2006.)


Three Proteins in a Test Tube Can Reconstitute a Cyanobacterial 
Circadian Clock 

The best understood circadian clock is found in the photosynthetic
cyanobacterium Synechococcus elongatus. The core oscillator in this organism is remarkably
simple, being composed of just three proteins: KaiA, KaiB, and KaiC. The central
player is KaiC, a multifunctional enzyme that catalyzes its own phosphorylation
and dephosphorylation in a 24-hour cycle: it gradually phosphorylates itself
sequentially at two sites during the day and dephosphorylates itself during the
night. This timing depends on interactions with the two other Kai proteins: KaiA
binds to unphosphorylated KaiC and stimulates KaiC autophosphorylation,
first at one site and then, with a delay, at the other. The second phosphorylation
promotes the binding of the third protein, KaiB, which blocks the stimulatory
effect of KaiA and thereby allows KaiC to dephosphorylate itself, bringing KaiC
back to its dephosphorylated state. This clock depends on a negative feedback
loop: KaiC drives its own phosphorylation until, after a delay, it recruits an
inhibitor, KaiB, that stimulates KaiC to dephosphorylate itself. Amazingly, when the
three Kai proteins are purified and incubated in a test tube with ATP, KaiC
phosphorylation and dephosphorylation occur with roughly 24-hour timing over a
period of several days (Figure 15-67).
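
The ordered sequence of KaiC states can be written down as a small state machine. The sketch below encodes only the order of events given in the text and in the legend to Figure 15-67; it says nothing about rates or timing, and the names used (KaiC, NEXT_STATE, kaib_bound, run_cycle) are invented here for illustration.

```python
from enum import Enum


class KaiC(Enum):
    UNPHOSPHORYLATED = "unphosphorylated"
    SITE1 = "phosphorylated at site 1"
    BOTH = "phosphorylated at sites 1 and 2"
    SITE2 = "phosphorylated at site 2 only"


# Order of events as described for the cyanobacterial oscillator.
NEXT_STATE = {
    KaiC.UNPHOSPHORYLATED: KaiC.SITE1,   # KaiA promotes the kinase activity
    KaiC.SITE1: KaiC.BOTH,               # second site follows, with a delay
    KaiC.BOTH: KaiC.SITE2,               # slow dephosphorylation at site 1
    KaiC.SITE2: KaiC.UNPHOSPHORYLATED,   # KaiB blocks KaiA; dephosphorylation wins
}


def kaib_bound(state):
    """KaiB interacts with the form of KaiC phosphorylated only at site 2."""
    return state is KaiC.SITE2


def run_cycle(start, steps):
    """Return the list of states visited, beginning with `start`."""
    states = [start]
    for _ in range(steps):
        states.append(NEXT_STATE[states[-1]])
    return states


if __name__ == "__main__":
    for state in run_cycle(KaiC.UNPHOSPHORYLATED, 4):
        print(f"{state.value:35s} KaiB bound: {kaib_bound(state)}")
```

Four transitions return KaiC to its starting state, and KaiB is engaged at only one point in the loop, which is the delayed inhibitory step that closes the negative feedback.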

Circadian oscillations in KaiC phosphorylation lead to parallel rhythms in the
expression of large numbers of genes involved in controlling metabolic activities
and cell division (see Figure 15-67). As a result, many aspects of cell behavior are
synchronized with the circadian cycle.

Even in continuous darkness, cyanobacterial cells generate free-running
oscillations of KaiC phosphorylation with roughly 24-hour periods. As in other
circadian clocks, the cyanobacterial clock is entrained by the environmental light/dark
cycle. Light is thought to affect the circadian clock indirectly: the activities of Kai
proteins are influenced by changes in intracellular redox potential, which occur
as a result of increased photosynthetic activity during the day.

Summary 

Some signaling pathways that are especially important in animal development
depend on proteolysis to control the activity and location of latent transcription
regulatory proteins. Notch receptors are themselves such proteins, which are activated
by cleavage when Delta on another cell binds to them; the cleaved cytosolic tail
of Notch migrates into the nucleus, where it stimulates the transcription of
Notch-responsive genes. In the Wnt/β-catenin signaling pathway, by contrast, the
proteolysis of the latent transcription regulatory protein β-catenin is inhibited when a
secreted Wnt protein binds to both a Frizzled and an LRP receptor protein; as a result,
β-catenin accumulates in the nucleus and activates the transcription of Wnt target
genes.

Hedgehog signaling in flies works much like Wnt signaling. In the absence of
a signal, a bifunctional, cytoplasmic transcription regulator, Ci, is proteolytically
cleaved to form a transcriptional repressor that keeps Hedgehog target genes
silenced. The binding of Hedgehog to its receptors (Patched and iHog) inhibits the
proteolytic processing of Ci; as a result, the intact Ci protein accumulates in the
nucleus and activates the transcription of Hedgehog-responsive genes. In Notch,
Wnt, and Hedgehog signaling, the extracellular signal triggers a switch from
transcriptional repression to transcriptional activation.

Signaling through the latent transcription regulator NFκB also depends on
proteolysis. NFκB proteins are normally held in an inactive state by inhibitory IκB
proteins in the cytoplasm. A variety of extracellular stimuli, including
proinflammatory cytokines, trigger the phosphorylation and ubiquitylation of IκB, marking
it for degradation; this enables the NFκB to translocate to the nucleus and activate
the transcription of its target genes. NFκB also activates the transcription of the gene
that encodes IκBα, creating a negative feedback loop, which can produce prolonged
oscillations in NFκB activity with sustained extracellular signaling.

Some small, hydrophobic signal molecules, including steroid and thyroid
hormones, diffuse across the plasma membrane of the target cell and activate
intracellular receptor proteins that directly regulate the transcription of specific genes.

In many cell types, gene expression is governed by circadian clocks, in which
delayed negative feedback produces 24-hour oscillations in the activities of
transcription regulators, anticipating the cell's changing needs during the day and night.

Figure 15-67 The core circadian oscillator of cyanobacteria. (A) KaiC is
a combined kinase and phosphatase that phosphorylates and dephosphorylates itself
on two adjacent sites. In the absence of other proteins, the phosphatase activity
is dominant, and the protein is mostly unphosphorylated. The binding of KaiA
to KaiC suppresses the phosphatase activity and promotes the kinase activity,
leading to KaiC phosphorylation, first at site 1 and then at site 2, resulting
in diphosphorylated KaiC. KaiC then dephosphorylates itself slowly at site 1,
even in the presence of KaiA, so that KaiC is phosphorylated only at site 2. This form
of KaiC interacts with KaiB, which blocks the stimulatory effects of KaiA, thereby
reducing the rate of KaiC phosphorylation and allowing dephosphorylation to occur.
Diphosphorylated KaiC increases in abundance during the day and peaks
around dusk. It activates other proteins that phosphorylate a transcription
regulator (RpaA), which then stimulates expression of some genes (the dusk genes
that peak in early evening) and inhibits expression of other genes (the dawn genes
that peak in the morning). When KaiC dephosphorylation gradually occurs during
the night, these effects are reversed: dusk genes are turned off and dawn genes are
turned on. (B) In this experiment, the three Kai proteins were purified and mixed in a
test tube with ATP (which is required for KaiC kinase activity). Every two hours over the
next 3 days, the KaiC protein was analyzed by polyacrylamide gel electrophoresis,
in which the phosphorylated form of KaiC migrates more slowly (upper band,
P-KaiC) than the nonphosphorylated form (lower band, NP-KaiC). The three
different phosphorylated forms of KaiC are not distinguished by this method. The
phosphorylation of KaiC oscillates with a roughly 24-hour period. (C) The amount
of phosphorylated and unphosphorylated KaiC in the experiment in B is plotted on
this graph, along with the amount of total protein. (B and C, from M. Nakajima et
al., Science 308:414-415, 2005. With permission from AAAS.)

SIGNALING IN PLANTS 

In plants, as in animals, cells are in constant communication with one another.
Plant cells communicate to coordinate their activities in response to the changing
conditions of light, dark, and temperature, which guide the plant's cycle of growth,
flowering, and fruiting. Plant cells also communicate to coordinate activities in
their roots, stems, and leaves. In this final section, we consider how plant cells
signal to one another and how they respond to light. Less is known about the
receptors and intracellular signaling mechanisms involved in cell communication in
plants than is known in animals, and we will concentrate mainly on how the
receptors and intracellular signaling mechanisms differ from those used by animals.

Multicellularity and Cell Communication Evolved Independently in
Plants and Animals

Although plants and animals are both eukaryotes, they have evolved separately
for more than a billion years. Their last common ancestor is thought to have been
a unicellular eukaryote that had mitochondria but no chloroplasts; the plant
lineage acquired chloroplasts after plants and animals diverged. The earliest fossils
of multicellular animals and plants date from almost 600 million years ago. Thus,
it seems that plants and animals evolved multicellularity independently, each
starting from a different unicellular eukaryote, some time between 1.6 and 0.6
billion years ago (Figure 15-68).

Figure 15-68 The proposed divergence of plant and animal lineages from a
common unicellular eukaryotic ancestor. The plant lineage acquired chloroplasts
after the two lineages diverged. Both lineages independently gave rise to
multicellular organisms: plants and animals. (Paintings courtesy of John Innes
Foundation.)

If multicellularity evolved independently in plants and animals, the molecules
and mechanisms used for cell communication will have evolved separately and
would be expected to be different. There should be some degree of resemblance,
however, because the genes in both plants and animals diverged from those
contained by their last common unicellular ancestor. Thus, whereas both plants and
animals use nitric oxide, cyclic GMP, Ca²⁺, and Rho family GTPases for signaling,
there are no homologs of the nuclear receptor family, Ras, JAK, STAT, TGFβ,
Notch, Wnt, or Hedgehog encoded by the completely sequenced genome of
Arabidopsis thaliana, the small flowering plant. Similarly, plants do not seem to use
cyclic AMP for intracellular signaling. Nevertheless, the general strategies
underlying signaling are frequently very similar in plants and animals. Both, for
example, use enzyme-coupled cell-surface receptors, as we now discuss.

Receptor Serine/Threonine Kinases Are the Largest Class 
of Cell-Surface Receptors in Plants 

Most cell-surface receptors in plants are enzyme-coupled. However, whereas
the largest class of enzyme-coupled receptors in animals is the receptor tyrosine
kinase (RTK) class, this type of receptor is extremely rare in plants. Instead,
plants rely largely on a great diversity of transmembrane receptor serine/threonine
kinases, which have a typical serine/threonine kinase cytoplasmic domain and an
extracellular ligand-binding domain. The most abundant types of these receptors
have a tandem array of extracellular leucine-rich repeat structures and are
therefore called leucine-rich repeat (LRR) receptor kinases.

There are about 175 LRR receptor kinases encoded by the Arabidopsis genome.
These include a protein called Bri1, which forms part of a cell-surface steroid
hormone receptor. Plants synthesize a class of steroids that are called
brassinosteroids because they were originally identified in the mustard family
Brassicaceae, which includes Arabidopsis. These signal molecules regulate the growth and
differentiation of plants throughout their life cycle. Binding of a brassinosteroid
to a Bri1 cell-surface receptor kinase initiates an intracellular signaling cascade
that uses a GSK3 protein kinase and a protein phosphatase to regulate the
phosphorylation and degradation of specific transcription regulatory proteins in the
nucleus, and thereby specific gene transcription. Mutant plants that are deficient
in the Bri1 receptor kinase are insensitive to brassinosteroids and are therefore
dwarfs.

The LRR receptor kinases are only one of many classes of transmembrane
receptor serine/threonine kinases in plants. There are at least six additional
families, each with its own characteristic set of extracellular domains. The lectin
receptor kinases, for example, have extracellular domains that bind carbohydrate signal
molecules. The Arabidopsis genome encodes over 300 receptor serine/threonine
kinases, which makes them the largest family of receptors known in plants. Many
are involved in defense responses against pathogens.

Ethylene Blocks the Degradation of Specific Transcription
Regulatory Proteins in the Nucleus

Various plant growth regulators (also called plant hormones) help to coordinate
plant development. They include ethylene, auxin, cytokinins, gibberellins, and
abscisic acid, as well as brassinosteroids. Growth regulators are all small
molecules made by most plant cells. They diffuse readily through cell walls and can
either act locally or be transported to influence cells further away. Each growth
regulator can have multiple effects. The specific effect depends on environmental
conditions, the nutritional state of the plant, the responsiveness of the target cells,
and which other growth regulators are acting.

Ethylene is an important example. This small gas molecule (Figure 15-69A)
can influence plant development in various ways; it can, for example, promote
fruit ripening, leaf abscission, and plant senescence. It also functions as a stress
signal in response to wounding, infection, flooding, and so on. When the shoot of
a germinating seedling, for instance, encounters an obstacle, ethylene promotes
a complex response that allows the seedling to safely bypass the obstacle (Figure
15-69B and C).

Figure 15-69 The ethylene-mediated triple response that occurs when the
growing shoot of a germinating seedling encounters an obstacle underground.
(A) The structure of ethylene. (B) In the absence of obstacles, the shoot grows
upward and is long and thin. (C) If the shoot encounters an obstacle, such as
a piece of gravel in the soil, the seedling responds to the encounter in three ways.
First, it thickens its stem, which can then exert more force on the obstacle. Second,
it shields the tip of the shoot (at top) by increasing the curvature of a specialized
hook structure. Third, it reduces the shoot's tendency to grow away from the direction
of gravity, so as to avoid the obstacle. (Courtesy of Melanie Webb.)

Plants have various ethylene receptors, which are located in the endoplasmic
reticulum and are all structurally related. They are dimeric, multipass
transmembrane proteins, with a copper-containing ethylene-binding domain and a domain
that interacts with a cytoplasmic protein called CTR1, which is closely related
in sequence to the Raf MAP kinase kinase kinase discussed earlier (see Figure
15-49). Surprisingly, it is the empty receptors that are active and keep CTR1 active.
By an unknown signaling mechanism, active CTR1 stimulates the ubiquitylation
and degradation in proteasomes of a nuclear transcription regulator called EIN3,
which is required for the transcription of ethylene-responsive genes. In this way,
the empty but active receptors keep ethylene-response genes off. Ethylene
binding inactivates the receptors, altering their conformation so that they no longer
activate CTR1. The EIN3 protein is no longer ubiquitylated and degraded and can
now activate the transcription of the large number of ethylene-responsive genes
(Figure 15-70).
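
Because the pathway is a chain of inhibitory steps, its logic is easy to misread. The sketch below restates the paragraph above as Boolean logic, treating each component as simply on or off; this all-or-none simplification and the function name ethylene_pathway are assumptions made for illustration, and real signaling is graded.

```python
def ethylene_pathway(ethylene_present):
    """Trace the on/off state of each step in the ethylene pathway.

    Boolean simplification of the pathway described in the text:
    empty receptors are active, active receptors keep CTR1 active,
    active CTR1 causes EIN3 to be degraded, and EIN3 is needed to
    transcribe ethylene-responsive genes.
    """
    receptor_active = not ethylene_present   # ethylene binding inactivates
    ctr1_active = receptor_active            # empty receptors keep CTR1 active
    ein3_degraded = ctr1_active              # active CTR1 drives EIN3 destruction
    genes_on = not ein3_degraded             # stable EIN3 activates transcription
    return {
        "receptor_active": receptor_active,
        "ctr1_active": ctr1_active,
        "ein3_degraded": ein3_degraded,
        "genes_on": genes_on,
    }


if __name__ == "__main__":
    for ethylene in (False, True):
        print("ethylene present:", ethylene, "->", ethylene_pathway(ethylene))
```

Two inversions in series (ethylene switches off the receptor, and the receptor's output destroys EIN3) mean that the presence of the hormone ends up turning the target genes on.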

Regulated Positioning of Auxin Transporters Patterns Plant Growth 

The plant hormone auxin, which is generally indole-3-acetic acid (Figure
15-71A), binds to receptor proteins in the nucleus. It helps plants grow toward
light, grow upward rather than branch out, and grow their roots downward. It also
regulates organ initiation and positioning and helps plants flower and bear fruit.
Like ethylene (and like some of the animal signal molecules we have described
in this chapter), auxin influences gene expression by controlling the degradation
of transcription regulators. It works by stimulating the ubiquitylation and
degradation of repressor proteins that block the transcription of auxin target genes in
unstimulated cells (Figure 15-71B and C).

Figure 15-70 The ethylene signaling pathway. (A) In the absence of ethylene, the receptors and
CTR1 are active, causing the ubiquitylation and destruction of EIN3, the transcription regulatory
protein in the nucleus that is responsible for the transcription of ethylene-responsive genes. (B) The
binding of ethylene inactivates the receptors and disrupts the activation of CTR1. The EIN3 protein
is not degraded and can therefore activate the transcription of ethylene-responsive genes.

Figure 15-71 The auxin signaling pathway. (A) The structure of the auxin
indole-3-acetic acid. (B) In the absence of auxin, a transcriptional repressor protein
(called Aux/IAA) binds and suppresses a transcription regulatory protein (called
auxin-response factor, ARF), which is required for the transcription of
auxin-responsive genes. (C) The auxin receptor proteins are mainly located in the
nucleus and form part of ubiquitin ligase complexes (not shown). When activated
by auxin binding, the receptor-auxin complexes recruit the ubiquitin ligase
complexes, which ubiquitylate the Aux/IAA proteins, marking them for degradation in
proteasomes. ARF is now free to activate the transcription of auxin-responsive genes.
There are many ARFs, Aux/IAA proteins, and auxin receptors that work as illustrated.

Auxin is unique in the way that it is transported. Unlike animal hormones,
which are usually secreted by a specific endocrine organ and transported to target
cells via the circulatory system, auxin has its own transport system. Specific
plasma-membrane-bound influx transporter proteins and efflux transporter proteins
move auxin into and out of plant cells, respectively. The efflux transporters can be
distributed asymmetrically in the plasma membrane to make the efflux of auxin
directional. A row of cells with their auxin efflux transporters confined to the basal
plasma membrane, for example, will transport auxin from the top of the plant to
the bottom.
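
The effect of confining efflux transporters to one face of each cell can be sketched with a toy model of a single file of cells. The code below is an illustration under stated assumptions, not a description of real transport kinetics: each cell is assumed to export a fixed fraction of its auxin through its basal face at every step, the bottom cell is assumed to export nothing, and the names transport_auxin and efflux_fraction are invented here.

```python
def transport_auxin(levels, efflux_fraction=0.5, steps=1):
    """Move auxin down a file of cells, listed from top to bottom.

    At each step every cell except the last passes `efflux_fraction`
    of its current auxin to the cell beneath it (basal efflux only).
    The fraction and the closed bottom end are illustrative assumptions.
    """
    cells = list(levels)
    for _ in range(steps):
        # Compute all exports from the levels at the start of the step,
        # so that auxin moves at most one cell per step.
        exported = [amount * efflux_fraction for amount in cells[:-1]]
        for i, amount in enumerate(exported):
            cells[i] -= amount
            cells[i + 1] += amount
    return cells


if __name__ == "__main__":
    start = [8.0, 0.0, 0.0, 0.0]      # all auxin begins in the top cell
    for n in (0, 1, 5, 20):
        print(n, [round(x, 3) for x in transport_auxin(start, steps=n)])
```

The total amount of auxin is unchanged, but it ends up concentrated in the bottom cell. Reversing the face on which the transporters sit would reverse the direction of flow, which is the sense in which transporter placement sets the direction of transport.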

In some regions of the plant, the localization of the auxin transporters, and
therefore the direction of auxin flow, is highly dynamic and regulated. A cell can
rapidly redistribute transporters by controlling the traffic of vesicles containing
them. The auxin efflux transporters, for example, normally recycle between
intracellular vesicles and the plasma membrane. A cell can redistribute these
transporters on its surface by inhibiting their endocytosis in one domain of the plasma
membrane, causing the transporters to accumulate there. One example occurs in
the root, where gravity influences the direction of growth. The auxin efflux
transporters are normally distributed symmetrically in the cap cells of the root. Within
minutes of a change in the direction of the gravity vector, however, the efflux
transporters redistribute to one side of the cells, so that auxin is pumped out toward the
side of the root pointing downward. Because auxin inhibits root-cell elongation,
this redirection of auxin transport causes the root tip to reorient, so that it grows
downward again (Figure 15-72).

Figure 15-72 Auxin transport and root gravitropism. (A-C) Roots respond to a 90° change
in the gravity vector and adjust their direction of growth so that they grow downward again. The
cells that respond to gravity are in the center of the root cap, while it is the epidermal cells further
back (on the lower side) that decrease their rate of elongation to restore downward growth. (D) The
gravity-responsive cells in the root cap redistribute their auxin efflux transporters in response to
the displacement of the root. This redirects the auxin flux mainly to the lower part of the displaced
root, where it inhibits the elongation of the epidermal cells. The resulting asymmetrical distribution
of auxin in the Arabidopsis root tip shown here is assessed indirectly, using an auxin-responsive
reporter gene that encodes a protein fused to green fluorescent protein (GFP); the epidermal cells
on the downward side of the root are green, whereas those on the upper side are not, reflecting
the asymmetrical distribution of auxin. The distribution of auxin efflux transporters in the plasma
membrane of cells in different regions of the root (shown as gray rectangles) is indicated in red, and
the direction of auxin efflux is indicated by a green arrow. (The fluorescence photograph in D is from
T. Paciorek et al., Nature 435:1251-1256, 2005. With permission from Macmillan Publishers Ltd.)

Phytochromes Detect Red Light, and Cryptochromes Detect Blue
Light

Plant development is greatly influenced by environmental conditions. Unlike
animals, plants cannot move when conditions become unfavorable; they have
to adapt or they die. The most important environmental influence on plants is
light, which is their energy source and has a major role throughout their entire life
cycle, from germination, through seedling development, to flowering and
senescence. Plants have thus evolved a large set of light-sensitive proteins to monitor
the quantity, quality, direction, and duration of light. These are usually referred
to as photoreceptors. However, because the term photoreceptor is also used for
light-sensitive cells in the animal retina (see Figure 15-38), we shall use the term
photoprotein instead.

All photoproteins sense light by means of a covalently attached light-absorbing
chromophore, which changes its shape in response to light and then induces a
change in the protein's conformation. The best-known plant photoproteins are the
phytochromes, which are present in all plants and in some algae but are absent in
animals. These are dimeric, cytoplasmic serine/threonine kinases, which respond
differentially and reversibly to red and far-red light: whereas red light usually
activates the kinase activity of the phytochrome, far-red light inactivates it. When
activated by red light, the phytochrome is thought to phosphorylate itself and then
to phosphorylate one or more other proteins in the cell. In some light responses,
the activated phytochrome translocates into the nucleus, where it activates
transcription regulators to alter gene transcription (Figure 15-73). In other cases, the
activated phytochrome activates a latent transcription regulator in the cytoplasm,
which then translocates into the nucleus to regulate gene transcription. In still
other cases, the photoprotein triggers signaling pathways in the cytosol that alter
the cell's behavior without involving the nucleus.

Figure 15-73 One way in which phytochromes mediate a light response
in plant cells. When activated by red light, the phytochrome, which is a dimeric
protein kinase, phosphorylates itself and then moves into the nucleus, where it
activates transcription regulatory proteins to stimulate the transcription of
red-light-responsive genes.


Plants sense blue light using photoproteins of two other sorts, phototropin
and cryptochromes. Phototropin is associated with the plasma membrane and is
partly responsible for phototropism, the tendency of plants to grow toward light.
Phototropism occurs by directional cell elongation, which is stimulated by auxin,
but the links between phototropin and auxin are unknown.

Cryptochromes are flavoproteins that are sensitive to blue light. They are
structurally related to blue-light-sensitive enzymes called photolyases, which
are involved in the repair of ultraviolet-induced DNA damage in all organisms,
except most mammals. Unlike phytochromes, cryptochromes are also found in
animals, where they have an important role in circadian clocks (see Figure 15-66).
Although cryptochromes are thought to have evolved from the photolyases, they
do not have a role in DNA repair.

Summary 

Plants and animals are thought to have evolved multicellularity and cell
communication mechanisms independently, each starting from a different unicellular
eukaryote, which in turn evolved from a common unicellular eukaryotic ancestor.
Not surprisingly, therefore, the mechanisms used to signal between cells in animals
and in plants have both similarities and differences. Whereas animals rely heavily
on GPCRs and RTKs, plants rely mainly on enzyme-coupled receptors of the
receptor serine/threonine kinase type, especially ones with extracellular
leucine-rich repeats. Various plant hormones, or growth regulators, including ethylene
and auxin, help coordinate plant development. Ethylene acts through intracellular
receptors to stop the degradation of specific nuclear transcription regulators, which
can then activate the transcription of ethylene-responsive genes. The receptors for
some other plant hormones, including auxin, also regulate the degradation of specific
transcription regulators, although the details vary. Auxin signaling is unusual
in that it has its own highly regulated transport system, in which the dynamic
positioning of plasma-membrane-bound auxin transporters controls the direction of
auxin flow and thereby the direction of plant growth. Light has an important role
in regulating plant development. These light responses are mediated by a variety of
light-sensitive photoproteins, including phytochromes, which are responsive to red
light, and cryptochromes and phototropin, which are sensitive to blue light.


WHAT WE DON'T KNOW


• How does a cell integrate the 
information received from its many 
different cell-surface receptors to 
make all-or-none decisions? 

• Much of what we know about cell
signaling comes from biochemical
studies of isolated proteins in test
tubes. What is the precise quantitative
behavior of intracellular signaling
networks in an intact cell, or in an
intact animal, where countless other
signals and cell components might
influence signaling specificity and
strength?

• How do intracellular signaling
circuits generate specific and dynamic
signaling patterns such as oscillations
and waves, and how are these
patterns sensed and interpreted by
the cell?

• Scaffold proteins and activated 
receptor tyrosine kinases nucleate 
the assembly of large intracellular 
signaling complexes. What is the 
dynamic behavior of these complexes, 
and how does this behavior influence 
downstream signaling? 

PROBLEMS

Which statements are true? Explain why or why not. 

15-1 All second messengers are water-soluble and diffuse
freely through the cytosol.

15-2 In the regulation of molecular switches, protein
kinases and guanine nucleotide exchange factors (GEFs)
always turn proteins on, whereas protein phosphatases
and GTPase-activating proteins (GAPs) always turn
proteins off.

15-3 Most intracellular signaling pathways provide
numerous opportunities for amplifying the responses to
extracellular signals.

15-4 Binding of extracellular ligands to receptor tyrosine
kinases (RTKs) activates the intracellular catalytic
domain by propagating a conformational change across
the lipid bilayer through a single transmembrane α helix.

15-5 Protein tyrosine phosphatases display exquisite
specificity for their substrates, unlike most serine/threonine
protein phosphatases, which have rather broad
specificity.

15-6 Even though plants and animals independently
evolved multicellularity, they use virtually all the same
signaling proteins and second messengers for cell-cell
communication.

Discuss the following problems. 

15-7 Suppose that the circulating concentration of hormone
is $10^{-10}$ M and the $K_d$ for binding to its receptor is $10^{-8}$
M. What fraction of the receptors will have hormone bound?
If a meaningful physiological response occurs when 50% of
the receptors have bound a hormone molecule, how much
will the concentration of hormone have to rise to elicit a
response? The fraction of receptors (R) bound to hormone
(H) to form a receptor-hormone complex (R-H) is

$$\frac{[\mathrm{R\text{-}H}]}{[\mathrm{R}] + [\mathrm{R\text{-}H}]} = \frac{[\mathrm{R\text{-}H}]}{[\mathrm{R}]_{\mathrm{TOT}}} = \frac{[\mathrm{H}]}{[\mathrm{H}] + K_d}.$$
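
The binding expression given in Problem 15-7 is easy to evaluate numerically. The sketch below is only an illustration of how the formula behaves; the function names and the example concentrations are assumptions chosen for this sketch, and they are deliberately not the values posed in the problem.

```python
def fraction_bound(hormone_conc, k_d):
    """Fraction of receptors with hormone bound: [H] / ([H] + K_d).

    Both arguments must be in the same concentration units (for example, M).
    """
    if hormone_conc < 0 or k_d <= 0:
        raise ValueError("hormone_conc must be >= 0 and k_d must be > 0")
    return hormone_conc / (hormone_conc + k_d)


def hormone_for_fraction(fraction, k_d):
    """Invert the expression: [H] = K_d * f / (1 - f) for 0 <= f < 1."""
    if not 0 <= fraction < 1:
        raise ValueError("fraction must satisfy 0 <= fraction < 1")
    return k_d * fraction / (1 - fraction)


# Hypothetical example values (not those of Problem 15-7).
example_k_d = 2e-9      # M, assumed dissociation constant
example_h = 6e-9        # M, assumed hormone concentration

occupancy = fraction_bound(example_h, example_k_d)
needed = hormone_for_fraction(occupancy, example_k_d)
print(f"occupancy = {occupancy:.2f}; [H] recovered = {needed:.1e} M")
```

Because occupancy depends only on the ratio of [H] to K_d, the same two functions apply to any receptor-ligand pair that follows this simple one-site binding model.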

15-8 Cells communicate in ways that resemble human
communication. Decide which of the following forms of
human communication are analogous to autocrine, paracrine,
endocrine, and synaptic signaling by cells.

A. A telephone conversation

B. Talking to people at a cocktail party

C. A radio announcement

D. Talking to yourself

15-9 Why do signaling responses that involve changes
in proteins already present in the cell occur in milliseconds
to seconds, whereas responses that require changes
in gene expression require minutes to hours?

15-10 How is it that different cells can respond in different
ways to exactly the same signaling molecule even when
they have identical receptors?

15-11 Why do you suppose that phosphorylation/
dephosphorylation, as opposed to allosteric binding of
small molecules, for example, has evolved to play such a
prominent role in switching proteins on and off in signaling
pathways?


15-12 Consider a signaling pathway that proceeds
through three protein kinases that are sequentially
activated by phosphorylation. In one case, the kinases are
held in a signaling complex by a scaffolding protein; in
the other, the kinases are freely diffusible (Figure Q15-1).
Discuss the properties of these two types of organization
in terms of signal amplification, speed, and potential for
cross-talk between signaling pathways.



Figure Q15-1 A kinase 
cascade organized by 
a scaffolding protein 
or composed of freely 
diffusing components 
(Problem 15-12). 


15-13 Describe three ways in which a gradual increase in
an extracellular signal can be sharpened by the target cell
to produce an abrupt or nearly all-or-none response.

15-14 Activation ("maturation") of frog oocytes is signaled
through a MAP kinase signaling module. An increase
in the hormone progesterone triggers the module by
stimulating the translation of Mos mRNA, which is the frog's
MAP kinase kinase kinase (Figure Q15-2). Maturation is
easy to score visually by the presence of a white spot in
the middle of the brown surface of the oocyte (see Figure
Q15-2). To determine the dose-response curve for
progesterone-induced activation of MAP kinase, you place 16
oocytes in each of six plastic dishes and add various
concentrations of progesterone. After an overnight incubation,
you crush the oocytes, prepare an extract, and determine
the state of MAP kinase phosphorylation (hence, activation)
by SDS polyacrylamide-gel electrophoresis (Figure
Q15-3A). This analysis shows a graded response of MAP
kinase to increasing concentrations of progesterone.

Figure Q15-2 Progesterone-induced MAP kinase activation, leading to oocyte
maturation (Problem 15-14). (Courtesy of Helfrid Hochegger.)

Figure Q15-3 Activation of frog oocytes (Problem 15-14).
(A) Phosphorylation of MAP kinase in pooled oocytes, plotted against
progesterone concentration (μM). (B) Phosphorylation of MAP kinase in individual
oocytes treated with 0.03, 0.1, or 0.3 μM progesterone. MAP kinase
was detected by immunoblotting using a MAP-kinase-specific
antibody. The first two lanes in each gel contain nonphosphorylated,
inactive MAP kinase (-) and phosphorylated, active MAP kinase (+).
(From J.E. Ferrell, Jr., and E.M. Machleder, Science 280:895-898, 1998.
With permission from AAAS.)

Before you crushed the oocytes, you noticed that not all
oocytes in individual dishes had white spots. Had some
oocytes undergone partial activation and not yet reached
the white-spot stage? To answer this question, you repeat
the experiment, but this time you analyze MAP kinase
activation in individual oocytes. You are surprised to find that
each oocyte has either a fully activated or a completely
inactive MAP kinase (Figure Q15-3B). How can an all-or-none
response in individual oocytes give rise to a graded
response in the population?

15-15 Propose specific types of mutations in the gene for
the regulatory subunit of cyclic-AMP-dependent protein
kinase (PKA) that could lead to either a permanently active
PKA or a permanently inactive PKA.

15-16 Phosphorylase kinase integrates signals from the
cyclic-AMP-dependent and Ca²⁺-dependent signaling
pathways that control glycogen breakdown in liver and
muscle cells (Figure Q15-4). Phosphorylase kinase is
composed of four subunits. One is the protein kinase that
catalyzes the addition of phosphate to glycogen phosphorylase
to activate it for glycogen breakdown. The other three
subunits are regulatory proteins that control the activity of the
catalytic subunit. Two contain sites for phosphorylation by
PKA, which is activated by cyclic AMP. The remaining
subunit is calmodulin, which binds Ca²⁺ when the cytosolic
Ca²⁺ concentration rises. The regulatory subunits control
the equilibrium between the active and inactive
conformations of the catalytic subunit, with each phosphate and
Ca²⁺ nudging the equilibrium toward the active
conformation. How does this arrangement allow phosphorylase
kinase to serve its role as an integrator protein for the
multiple pathways that stimulate glycogen breakdown?

Figure Q15-4 Integration of cyclic-AMP-dependent and Ca²⁺-dependent
signaling pathways by phosphorylase kinase in liver and
muscle cells (Problem 15-16).


15-17 The Wnt planar polarity signaling pathway normally
ensures that each wing cell in Drosophila has a single
hair. Overexpression of the Frizzled gene from a heat-shock
promoter (hs-Fz) causes multiple hairs to grow from
many cells (Figure Q15-5A). This phenotype is suppressed
if hs-Fz is combined with a heterozygous deletion (DshΔ)
of the Dishevelled gene (Figure Q15-5B). Do these results
allow you to order the action of Frizzled and Dishevelled
in the signaling pathway? If so, what is the order? Explain
your reasoning.

Figure Q15-5 Pattern of hair growth on wing cells in genetically
different Drosophila (Problem 15-17). (From C.G. Winter et al., Cell
105:81-91, 2001. With permission from Elsevier.)

CHAPTER 16

The Cytoskeleton


For cells to function properly, they must organize themselves in space and
interact mechanically with each other and with their environment. They have to be
correctly shaped, physically robust, and properly structured internally. Many
have to change their shape and move from place to place. All cells have to be able
to rearrange their internal components as they grow, divide, and adapt to
changing circumstances. These spatial and mechanical functions depend on a
remarkable system of filaments called the cytoskeleton (Figure 16-1).

The cytoskeleton's varied functions depend on the behavior of three families
of protein filaments: actin filaments, microtubules, and intermediate filaments.
Each type of filament has distinct mechanical properties, dynamics, and
biological roles, but all share certain fundamental features. Just as we require our
ligaments, bones, and muscles to work together, so all three cytoskeletal filament
systems must normally function collectively to give a cell its strength, its shape,
and its ability to move.

In this chapter, we describe the function and conservation of the three main
filament systems. We explain the basic principles underlying filament assembly
and disassembly, and how other proteins interact with the filaments to alter their
dynamics, enabling the cell to establish and maintain internal order, to shape and
remodel its surface, and to move organelles in a directed manner from one place
to another. Finally, we discuss how the integration and regulation of the
cytoskeleton allows a cell to move to new locations.


FUNCTION AND ORIGIN OF THE CYTOSKELETON 

The three major cytoskeletal filaments are responsible for different aspects of the
cell's spatial organization and mechanical properties. Actin filaments determine
the shape of the cell's surface and are necessary for whole-cell locomotion; they
also drive the pinching of one cell into two. Microtubules determine the positions
of membrane-enclosed organelles, direct intracellular transport, and form the
mitotic spindle that segregates chromosomes during cell division. Intermediate
filaments provide mechanical strength. All of these cytoskeletal filaments interact
with hundreds of accessory proteins that regulate and link the filaments to other
cell components, as well as to each other. The accessory proteins are essential for
the controlled assembly of the cytoskeletal filaments in particular locations, and
they include the motor proteins, remarkable molecular machines that convert the
energy of ATP hydrolysis into mechanical force that can either move organelles
along the filaments or move the filaments themselves.

In this section, we discuss the general features of the proteins that make up
the filaments of the cytoskeleton. We focus on their ability to form intrinsically
polarized and self-organized structures that are highly dynamic, allowing the cell
to rapidly modify cytoskeletal structure and function under different conditions.

Figure 16-1 The cytoskeleton. (A) A cell in culture has been fixed and
labeled to show its cytoplasmic arrays of microtubules (green) and actin
filaments (red). (B) This dividing cell has been labeled to show its spindle
microtubules (green) and surrounding cage of intermediate filaments (red).
The DNA in both cells is labeled in blue. (A, courtesy of Albert Tousson;
B, courtesy of Conly Rieder.)

Cytoskeletal Filaments Adapt to Form Dynamic or Stable 
Structures 

Cytoskeletal systems are dynamic and adaptable, organized more like ant trails
than interstate highways. A single trail of ants may persist for many hours,
extending from the ant nest to a delectable picnic site, but the individual ants within the
trail are anything but static. If the ant scouts find a new and better source of food,
or if the picnickers clean up and leave, the dynamic structure adapts with
astonishing rapidity. In a similar way, large-scale cytoskeletal structures can change or
persist, according to need, lasting for lengths of time ranging from less than a
minute up to the cell's lifetime. But the individual macromolecular components that
make up these structures are in a constant state of flux. Thus, like the alteration of
an ant trail, a structural rearrangement in a cell requires little extra energy when
conditions change.

Regulation of the dynamic behavior and assembly of cytoskeletal filaments
allows eukaryotic cells to build an enormous range of structures from the three
basic filament systems. The micrographs in Panel 16-1 illustrate some of these
structures. Actin filaments underlie the plasma membrane of animal cells,
providing strength and shape to its thin lipid bilayer. They also form many types of
cell-surface projections. Some of these are dynamic structures, such as the
lamellipodia and filopodia that cells use to explore territory and move around. More
stable arrays allow cells to brace themselves against an underlying substratum
and enable muscle to contract. The regular bundles of stereocilia on the surface
of hair cells in the inner ear contain stable bundles of actin filaments that tilt as
rigid rods in response to sound, and similarly organized microvilli on the surface
of intestinal epithelial cells vastly increase the apical cell-surface area to enhance
nutrient absorption. In plants, actin filaments drive the rapid streaming of
cytoplasm inside cells.

Microtubules, which are frequently found in a cytoplasmic array that extends
to the cell periphery, can quickly rearrange themselves to form a bipolar mitotic
spindle during cell division. They can also form cilia, which function as motile
whips or sensory devices on the surface of the cell, or tightly aligned bundles that
serve as tracks for the transport of materials down long neuronal axons. In plant
cells, organized arrays of microtubules help to direct the pattern of cell wall
synthesis, and in many protozoans they form the framework upon which the entire
cell is built.

Intermediate filaments line the inner face of the nuclear envelope, forming
a protective cage for the cell's DNA; in the cytosol, they are twisted into strong
cables that can hold epithelial cell sheets together or help nerve cells to extend
long and robust axons, and they allow us to form tough appendages such as hair
and fingernails.

An important and dramatic example of rapid reorganization of the
cytoskeleton occurs during cell division, as shown in Figure 16-2 for a fibroblast growing
in a tissue-culture dish. After the chromosomes have replicated, the interphase
microtubule array that spreads throughout the cytoplasm is reconfigured into the
bipolar mitotic spindle, which segregates the two copies of each chromosome into
daughter nuclei. At the same time, the specialized actin structures that enable the
fibroblast to crawl across the surface of the dish rearrange so that the cell stops
moving and assumes a more spherical shape. Actin and its associated motor
protein myosin then form a belt around the middle of the cell, the contractile ring,
which constricts like a tiny muscle to pinch the cell in two. When division is
complete, the cytoskeletons of the two daughter fibroblasts reassemble into their
interphase structures to convert the two rounded-up daughter cells into smaller
versions of the flattened, crawling mother cell.

ACTIN FILAMENTS

Actin filaments (also known as microfilaments) are helical polymers of
the protein actin. They are flexible structures with a diameter of 8 nm
that organize into a variety of linear bundles, two-dimensional
networks, and three-dimensional gels. Although actin filaments are
dispersed throughout the cell, they are most highly concentrated in the
cortex, just beneath the plasma membrane. (i) Single actin filament;
(ii) microvilli; (iii) stress fibers (red) terminating in focal adhesions
(green); (iv) striated muscle.

Micrographs courtesy of R. Craig (i and iv); P.T. Matsudaira and D.R. Burgess (ii); K. Burridge (iii).

MICROTUBULES

Microtubules are long, hollow cylinders made of the protein tubulin.
With an outer diameter of 25 nm, they are much more rigid than actin
filaments. Microtubules are long and straight and frequently have one
end attached to a microtubule-organizing center (MTOC) called a
centrosome. (i) Single microtubule; (ii) cross section at the base of three
cilia showing triplet microtubules; (iii) interphase microtubule array
(green) and organelles (red); (iv) ciliated protozoan.

Micrographs courtesy of R. Wade (i); D.T. Woodrow and R.W. Linck (ii); D. Shima (iii); D. Burnette (iv).

INTERMEDIATE FILAMENTS

Intermediate filaments are ropelike fibers with a diameter of about
10 nm; they are made of intermediate filament proteins, which constitute
a large and heterogeneous family. One type of intermediate filament
forms a meshwork called the nuclear lamina just beneath the inner
nuclear membrane. Other types extend across the cytoplasm, giving cells
mechanical strength. In an epithelial tissue, they span the cytoplasm from
one cell-cell junction to another, thereby strengthening the entire
epithelium. (i) Individual intermediate filaments; (ii) intermediate
filaments (blue) in neurons and (iii) epithelial cell; (iv) nuclear lamina.

Micrographs courtesy of R. Quinlai

Many cells require rapid cytoskeletal rearrangements for their normal
functioning during interphase as well. For example, the neutrophil, a type of white
blood cell, chases and engulfs bacterial and fungal cells that accidentally gain
access to the normally sterile parts of the body, as through a cut in the skin. Like
most crawling cells, neutrophils advance by extending a protrusive structure filled
with newly polymerized actin filaments. When the elusive bacterial prey moves in
a different direction, the neutrophil is poised to reorganize its polarized
protrusive structures within seconds (Figure 16-3).

The Cytoskeleton Determines Cellular Organization and Polarity

In cells that have achieved a stable, differentiated morphology (such as mature
neurons or epithelial cells), the dynamic elements of the cytoskeleton must also
provide stable, large-scale structures for cellular organization. On specialized
epithelial cells that line organs such as the intestine and the lung, cytoskeletal-based
cell-surface protrusions including microvilli and cilia are able to maintain a
constant location, length, and diameter over the entire lifetime of the cell. For the
actin bundles at the cores of microvilli on intestinal epithelial cells, this is only a
few days. But the actin bundles at the cores of stereocilia on the hair cells of the
inner ear must maintain their stable organization for the entire lifetime of the
animal, since these cells do not turn over. Nonetheless, the individual actin filaments
remain strikingly dynamic and are continuously remodeled and replaced every 48
hours, even within stable cell-surface structures that persist for decades.

Figure 16-2 Diagram of changes in cytoskeletal organization associated
with cell division. The crawling fibroblast drawn here has a polarized, dynamic
actin cytoskeleton (shown in red) that assembles lamellipodia and filopodia to
push its leading edge toward the right. The polarization of the actin cytoskeleton is
assisted by the microtubule cytoskeleton (green), consisting of long microtubules
that emanate from a single microtubule-organizing center located in front of
the nucleus. When the cell divides, the polarized microtubule array rearranges
to form a bipolar mitotic spindle, which is responsible for aligning and then
segregating the duplicated chromosomes (brown). The actin filaments form a
contractile ring at the center of the cell that pinches the cell in two after
chromosome segregation. After cell division is complete, the two daughter
cells reorganize both the microtubule and actin cytoskeletons into smaller versions of
those that were present in the mother cell, enabling them to crawl their separate ways.

Figure 16-3 A neutrophil in pursuit of bacteria. In this preparation of human
blood, a clump of bacteria (white arrow) is about to be captured by a neutrophil. As
the bacteria move, the neutrophil quickly reassembles the dense actin network at
its leading edge (highlighted in red) to push toward the location of the bacteria
(Movie 16.1). Rapid disassembly and reassembly of the actin cytoskeleton in
this cell enables it to change its orientation and direction of movement within a few
minutes. (From a video recorded by David Rogers.)

Besides forming stable, specialized cell-surface protrusions, the
cytoskeleton is also responsible for large-scale cellular polarity, enabling cells to tell the
difference between top and bottom, or front and back. The large-scale polarity
information conveyed by cytoskeletal organization is often maintained over the
lifetime of the cell. Polarized epithelial cells use organized arrays of microtubules,
actin filaments, and intermediate filaments to maintain the critical differences
between the apical surface and the basolateral surface. They also must maintain
strong adhesive contacts with one another to enable this single layer of cells to
serve as an effective physical barrier (Figure 16-4).

Filaments Assemble from Protein Subunits That Impart Specific
Physical and Dynamic Properties

Cytoskeletal filaments can reach from one end of the cell to the other, spanning
tens or even hundreds of micrometers. Yet the individual protein molecules that
form the filaments are only a few nanometers in size. The cell builds the filaments
by assembling large numbers of the small subunits, like building a skyscraper out
of bricks. Because these subunits are small, they can diffuse rapidly in the
cytosol, whereas the assembled filaments cannot. In this way, cells can undergo rapid
structural reorganizations, disassembling filaments at one site and reassembling
them at another site far away.
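
The size dependence of diffusion described above can be made concrete with a rough estimate. The sketch below applies the Stokes-Einstein relation, D = kT/(6πηr), and the three-dimensional estimate t ≈ x²/(6D). Everything numerical in it is an assumption made for illustration: the particles are treated as spheres, the cytosol is given the viscosity of water (the real cytosol is more viscous and crowded), and the radii are hypothetical stand-ins for a free subunit and for a much larger assembly.

```python
import math

BOLTZMANN = 1.380649e-23  # Boltzmann constant, J/K


def stokes_einstein_diffusion(radius_m, temperature_k=310.0, viscosity_pa_s=1.0e-3):
    """Diffusion coefficient (m^2/s) of a sphere: D = kT / (6 * pi * eta * r)."""
    if radius_m <= 0:
        raise ValueError("radius_m must be positive")
    return BOLTZMANN * temperature_k / (6.0 * math.pi * viscosity_pa_s * radius_m)


def mean_diffusion_time(distance_m, diffusion_coeff):
    """Approximate time (s) to diffuse a distance in 3D: t = x^2 / (6 D)."""
    return distance_m ** 2 / (6.0 * diffusion_coeff)


# Assumed, illustrative values (not taken from the text).
subunit_radius = 2.5e-9   # m, a globular subunit a few nanometers across
large_radius = 500e-9     # m, crude stand-in for a large assembled structure
cell_length = 10e-6       # m, a distance of 10 micrometers

d_subunit = stokes_einstein_diffusion(subunit_radius)
d_large = stokes_einstein_diffusion(large_radius)

print(f"subunit crossing time:        {mean_diffusion_time(cell_length, d_subunit):.2g} s")
print(f"large assembly crossing time: {mean_diffusion_time(cell_length, d_large):.2g} s")
```

In this model the crossing time scales linearly with particle radius, so the qualitative conclusion (small subunits relocate far faster than large assemblies) does not depend on the particular values assumed.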

Actin filaments and microtubules are built from subunits that are compact
and globular (actin subunits for actin filaments and tubulin subunits for
microtubules), whereas intermediate filaments are made up of smaller subunits that are
themselves elongated and fibrous. All three major types of cytoskeletal filaments
form as helical assemblies of subunits (see Figure 3-22) that self-associate, using
a combination of end-to-end and side-to-side protein contacts. Differences in
the structures of the subunits and the strengths of the attractive forces between
them produce important differences in the stability and mechanical properties
of each type of filament. Whereas covalent linkages between their subunits hold
together the backbones of many biological polymers, including DNA, RNA, and
proteins, it is weak noncovalent interactions that hold together the three types of
cytoskeletal polymers. Consequently, their assembly and disassembly can occur
rapidly, without covalent bonds being formed or broken.

Figure 16-4 Organization of the cytoskeleton in polarized epithelial cells.
All the components of the cytoskeleton cooperate to produce the characteristic
shapes of specialized cells, including the epithelial cells that line the small intestine,
diagrammed here. At the apical (upper) surface, facing the intestinal lumen,
bundled actin filaments (red) form microvilli that increase the cell surface area available
for absorbing nutrients from food. Below the microvilli, a circumferential band of
actin filaments is connected to cell-cell adherens junctions that anchor the cells to
each other. Intermediate filaments (blue) are anchored to other kinds of adhesive
structures, including desmosomes and hemidesmosomes, that connect the
epithelial cells into a sturdy sheet and attach them to the underlying extracellular
matrix; these structures are discussed in Chapter 19. Microtubules (green) run
vertically from the top of the cell to the bottom and provide a global coordinate
system that enables the cell to direct newly synthesized components to their proper
locations.

The subunits of actin filaments and microtubules are asymmetrical and bind
to one another head-to-tail so that they all point in one direction. This subunit
polarity gives the filaments structural polarity along their length, and makes the
two ends of each polymer behave differently. In addition, actin and tubulin
subunits are both enzymes that catalyze the hydrolysis of a nucleoside triphosphate
(ATP and GTP, respectively). As we discuss later, the energy derived from
nucleotide hydrolysis enables the filaments to remodel rapidly. By controlling when and
where actin and microtubules assemble, the cell harnesses the polar and dynamic
properties of these filaments to generate force in a specific direction, to move the
leading edge of a migrating cell forward, for example, or to pull chromosomes
apart during cell division. In contrast, the subunits of intermediate filaments are
symmetrical, and thus do not form polarized filaments with two different ends.
Intermediate filament subunits also do not catalyze the hydrolysis of nucleotides.
Nevertheless, intermediate filaments can be disassembled rapidly when required.
In mitosis, for example, kinases phosphorylate the subunits, leading to their
dissociation.

Cytoskeletal filaments in living cells are not built by simply stringing
subunits together in single file. A thousand tubulin subunits lined up end-to-end,
for example, would span the diameter of a small eukaryotic cell, but a filament
formed in this way would lack the strength to avoid breakage by ambient thermal
energy, unless each subunit in the filament was bound extremely tightly to its two
neighbors. Such tight binding would limit the rate at which the filaments could
disassemble, making the cytoskeleton a static and less useful structure. To provide
both strength and adaptability, microtubules are built of 13 protofilaments (linear
strings of subunits joined end-to-end) that associate with one another
laterally to form a hollow cylinder. The addition or loss of a subunit at the end of one
protofilament makes or breaks a small number of bonds. In contrast, loss of a
subunit from the middle of the filament requires breaking many more bonds, while
breaking it in two requires breaking bonds in multiple protofilaments all at the
same time (Figure 16-5). The greater energy required to break multiple
noncovalent bonds simultaneously allows microtubules to resist thermal breakage, while
allowing rapid subunit addition and loss at the filament ends. Helical actin
filaments are much thinner and therefore require much less energy to break.
However, multiple actin filaments are often bundled together inside cells, providing
mechanical strength, while allowing dynamic behavior of filament ends.
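
The argument about thermal breakage can be illustrated with a simple Boltzmann-factor estimate. The sketch below is a deliberately crude model, not a description of real microtubule energetics: it assumes that severing a filament requires breaking n identical, independent longitudinal bonds at once, so that the relative likelihood of a thermal fluctuation doing so scales as exp(-n × E/kT). The per-bond energy of 5 kT is a hypothetical value chosen only for illustration, and lateral bonds and kinetic prefactors are ignored.

```python
import math


def relative_break_probability(n_bonds, bond_energy_kt):
    """Boltzmann weight exp(-n * E / kT) for breaking n bonds simultaneously.

    bond_energy_kt is the energy of one bond expressed in units of kT.
    """
    if n_bonds < 1 or bond_energy_kt < 0:
        raise ValueError("n_bonds must be >= 1 and bond_energy_kt must be >= 0")
    return math.exp(-n_bonds * bond_energy_kt)


# Hypothetical per-bond energy, in units of kT (an assumption for this sketch).
ASSUMED_BOND_ENERGY_KT = 5.0

# A single-file string of subunits breaks when one bond breaks.
single_strand = relative_break_probability(1, ASSUMED_BOND_ENERGY_KT)

# Severing a 13-protofilament tube requires breaking 13 bonds at once.
thirteen_strands = relative_break_probability(13, ASSUMED_BOND_ENERGY_KT)

stability_gain = single_strand / thirteen_strands
print(f"single strand weight:    {single_strand:.3g}")
print(f"13-protofilament weight: {thirteen_strands:.3g}")
print(f"relative stability gain: {stability_gain:.3g}")
```

Because the exponent grows with the number of bonds that must break together, even modest individual bonds give a multi-stranded filament a very large stability advantage in the middle, while a subunit at the end, held by only a few bonds, can still come off readily.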

As with other specific protein-protein interactions, many hydrophobic
interactions and noncovalent bonds hold the subunits in a cytoskeletal filament together
(see Figure 3-4). The locations and types of subunit-subunit contacts differ for
the different filaments. Intermediate filaments, for example, assemble by forming
strong lateral contacts between α-helical coiled-coils, which extend over most of
the length of each elongated fibrous subunit. Because the individual subunits are
staggered in the filament, intermediate filaments form strong, ropelike structures
that tolerate stretching and bending to a greater extent than do either actin
filaments or microtubules (Figure 16-6).

Accessory Proteins and Motors Regulate Cytoskeletal Filaments 

The cell regulates the length and stability of its cytoskeletal filaments, as well as
their number and geometry. It does so largely by regulating their attachments
to one another and to other components of the cell, so that the filaments can
form a wide variety of higher-order structures. Direct covalent modification of
the filament subunits regulates some filament properties, but most of the
regulation is performed by hundreds of accessory proteins that determine the spatial
distribution and the dynamic behavior of the filaments, converting information
received through signaling pathways into cytoskeletal action. These accessory
proteins bind to the filaments or their subunits to determine the sites of assembly
of new filaments, to regulate the partitioning of polymer proteins between
filament and subunit forms, to change the kinetics of filament assembly and
disassembly, to harness energy to generate force, and to link filaments to one another
or to other cell structures such as organelles and the plasma membrane. In these
processes, the accessory proteins bring cytoskeletal structure under the control
of extracellular and intracellular signals, including those that trigger the dramatic
transformations of the cytoskeleton that occur during each cell cycle. Acting
together, the accessory proteins enable a eukaryotic cell to maintain a highly
organized but flexible internal structure and, in many cases, to move.
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Figure 16-5 The thermal stability of 
cytoskeletal filaments with dynamic 
ends. A protofilament consisting of a single 
strand of subunits is thermally unstable, 
since breakage of a single bond between 
subunits is sufficient to break the filament. 

In contrast, formation of a cytoskeletal 
filament from more than one protofilament 
allows the ends to be dynamic, while 
enabling the filaments themselves to 
be résistant to thermal breakage. In a 
microtubule, for example, removing a single 
subunit dimer from the end of the filament 
requires breaking noncovalent bonds with a 
maximum of three other subunits, whereas 
fracturing the filament in the middle requires 
breaking noncovalent bonds in ail thirteen 
protofilaments. 


Figure 16-6 Flexibility and stretch in an intermediate filament. Intermediate
filaments are formed from elongated fibrous subunits with strong lateral contacts,
resulting in resistance to stretching forces. When a tiny mechanical probe is
dragged across an intermediate filament, the filament is stretched over three times
its length before it breaks, as illustrated by the fluorescently labeled filaments in
the photomicrographs. This technique is termed atomic force microscopy (see
Figure 9-33). (Adapted from L. Kreplak et al., J. Mol. Biol. 354:569-577, 2005. With
permission from Elsevier.)

Among the most fascinating proteins that associate with the cytoskeleton are
the motor proteins. These proteins bind to a polarized cytoskeletal filament and
use the energy derived from repeated cycles of ATP hydrolysis to move along it.
Dozens of different motor proteins coexist in every eukaryotic cell. They differ in
the type of filament they bind to (either actin or microtubules), the direction in
which they move along the filament, and the "cargo" they carry. Many motor
proteins carry membrane-enclosed organelles (such as mitochondria, Golgi stacks,
or secretory vesicles) to their appropriate locations in the cell. Other motor
proteins cause cytoskeletal filaments to exert tension or to slide against each other,
generating the force that drives such phenomena as muscle contraction, ciliary
beating, and cell division.

Cytoskeletal motor proteins that move unidirectionally along an oriented
polymer track are reminiscent of some other proteins and protein complexes
discussed elsewhere in this book, such as DNA and RNA polymerases, helicases, and
ribosomes. All of these proteins have the ability to use chemical energy to propel
themselves along a linear track, with the direction of sliding dependent on the
structural polarity of the track. All of them generate motion by coupling nucleoside
triphosphate hydrolysis to a large-scale conformational change (see Figure
3-75).


Bacterial Cell Organization and Division Depend on Homologs of
Eukaryotic Cytoskeletal Proteins

While eukaryotic cells are typically large and morphologically complex, bacterial
cells are usually only a few micrometers long and assume simple shapes such
as spheres or rods. Bacteria also lack elaborate networks of intracellular
membrane-enclosed organelles. Historically, biologists assumed that a cytoskeleton
was not necessary in such simple cells. We now know, however, that bacteria
contain homologs of all three of the eukaryotic cytoskeletal filaments. Furthermore,
bacterial actins and tubulins are more diverse than their eukaryotic versions, both
in the types of assemblies they form and in the functions they carry out.

Nearly all bacteria and many archaea contain a homolog of tubulin called FtsZ,
which can polymerize into filaments and assemble into a ring (called the Z-ring)
at the site where the septum forms during cell division (Figure 16-7). Although
the Z-ring persists for many minutes, the individual filaments within it are highly
dynamic, with an average filament half-life of about thirty seconds. As the
bacterium divides, the Z-ring becomes smaller until it has completely disassembled.
FtsZ filaments in the Z-ring are thought to generate a bending force that drives
the membrane invagination necessary to complete cell division. The Z-ring may
also serve as a site for localization of enzymes required for building the septum
between the two daughter cells.

Many bacteria also contain homologs of actin. Two of these, MreB and Mbl, 
are found primarily in rod-shaped or spiral-shaped cells where they assemble 
to form dynamic patches that move circumferentially along the length of the 
cell (Figure 16-8A). These proteins contribute to cell shape by serving as a scaffold
to direct the synthesis of the peptidoglycan cell wall, in much the same way
that microtubules help organize the synthesis of the cellulose cell wall in higher 


Figure 16-7 The bacterial FtsZ protein, a tubulin homolog in
prokaryotes. (A) A band of FtsZ protein forms a ring in a dividing bacterial
cell. This ring has been labeled by fusing the FtsZ protein to green fluorescent
protein (GFP), which allows it to be observed in living E. coli cells with a
fluorescence microscope. (B) FtsZ filaments and circles, formed in vitro, as
visualized using electron microscopy. (C) Dividing chloroplasts (red) from a red
alga also cleave using a protein ring made from FtsZ (yellow). (A, from X. Ma,
D.W. Ehrhardt and W. Margolin, Proc. Natl Acad. Sci. USA 93:12998-13003,
1996; B, from H.P. Erickson et al., Proc. Natl Acad. Sci. USA 93:519-523,
1996. Both with permission from National Academy of Sciences; C, from
S. Miyagishima et al., Plant Cell 13:2257-2268, 2001, with permission from
American Society of Plant Biologists.)

Figure 16-8 Actin homologs in bacteria determine cell shape. (A) The MreB protein forms abundant patches made up
of many short, interwoven linear or helical filaments that are seen to move circumferentially along the length of the bacterium
and are associated with sites of cell wall synthesis. (B) The common soil bacterium Bacillus subtilis normally forms cells with a
regular rodlike shape when viewed by scanning electron microscopy (left). In contrast, B. subtilis cells lacking the actin homolog
MreB or Mbl grow in distorted or twisted shapes and eventually die (center and right). (A, from P. Vats and L. Rothfield, Proc.
Natl Acad. Sci. USA 104:17795-17800, 2007. With permission from National Academy of Sciences; B, from A. Chastanet and
R. Carballido-Lopez, Front. Biosci. 4S:1582-1606, 2012. With permission from Frontiers in Bioscience.)


plant cells (see Figure 19-65). As with FtsZ, MreB and Mbl filaments are highly
dynamic, with half-lives of a few minutes, and nucleotide hydrolysis accompanies
the polymerization process. Mutations disrupting MreB or Mbl expression cause
extreme abnormalities in cell shape and defects in chromosome segregation
(Figure 16-8B).

Relatives of MreB and Mbl have more specialized roles. A particularly intriguing
bacterial actin homolog is ParM, which is encoded by a gene on certain bacterial
plasmids that also carry genes responsible for antibiotic resistance and cause the
spread of multidrug resistance in epidemics. Bacterial plasmids typically encode
all the gene products that are necessary for their own segregation, presumably
as a strategy to ensure their inheritance and propagation in bacterial hosts
following plasmid replication. ParM assembles into filaments that associate at each
end with a copy of the plasmid, and growth of the ParM filament pushes the
replicated plasmid copies apart (Figure 16-9). This spindle-like structure apparently
arises from the selective stabilization of filaments that bind to specialized proteins
recruited to the origins of replication on the plasmids. A distant relative of both
tubulin and FtsZ, called TubZ, has a similar function in other bacterial species.

Thus, self-association of nucleotide-binding proteins into dynamic filaments 
is used in all cells, and the actin and tubulin families are very ancient, predating
the split between the eukaryotic and bacterial kingdoms. 

At least one bacterial species, Caulobacter crescentus, appears to harbor a
protein with significant structural similarity to the third major class of cytoskeletal
filaments found in animal cells, the intermediate filaments. A protein called
crescentin forms a filamentous structure that influences the unusual crescent shape
of this species; when the gene encoding crescentin is deleted, the Caulobacter
cells grow as straight rods (Figure 16-10).



Figure 16-9 Role of the actin homolog ParM in plasmid segregation in bacteria.
(A) Some bacterial drug-resistance plasmids (orange) encode an actin
homolog, ParM, that will spontaneously nucleate to form small, dynamic filaments
(green) throughout the bacterial cytoplasm. A second plasmid-encoded protein
called ParR (blue) binds to specific DNA sequences in the plasmid and also
stabilizes the dynamic ends of the ParM filaments. When the plasmid duplicates,
both ends of the ParM filaments become stabilized, and the growing ParM filaments
push the duplicated plasmids to opposite ends of the cell. (B) In these bacterial
cells harboring a drug-resistance plasmid, the plasmids are labeled in red and the
ParM protein in green. Left, a short ParM filament bundle connects the two daughter
plasmids shortly after their duplication. Right, the fully assembled ParM filament
has pushed the duplicated plasmids to the cell poles. (A, adapted from E.C. Garner,
C.S. Campbell and R.D. Mullins, Science 306:1021-1025, 2004; B, from
J. Moller-Jensen et al., Mol. Cell 12:1477-1487, 2003. With permission from Elsevier.)

Summary

The cytoplasm of eukaryotic cells is spatially organized by a network of protein
filaments known as the cytoskeleton. This network contains three principal types of
filaments: actin filaments, microtubules, and intermediate filaments. All three types
of filaments form as helical assemblies of subunits that self-associate using a
combination of end-to-end and side-to-side protein contacts. Differences in the structure
of the subunits and the manner of their self-assembly give the filaments different
mechanical properties. Subunit assembly and disassembly constantly remodel all
three types of cytoskeletal filaments. Actin and tubulin (the subunits of actin
filaments and microtubules, respectively) bind and hydrolyze nucleoside triphosphates
(ATP and GTP, respectively), and assemble head-to-tail to generate polarized
filaments capable of generating force. In living cells, accessory proteins modulate the
dynamics and organization of cytoskeletal filaments, resulting in complex events
such as cell division or migration, and generating elaborate cellular architecture
to form polarized tissues such as epithelia. Bacterial cells also contain homologs of
actin, tubulin, and intermediate filaments that form dynamic structures that help
control cell shape and division.

ACTIN AND ACTIN-BINDING PROTEINS 

The actin cytoskeleton performs a wide range of functions in diverse cell types.
Each actin subunit, sometimes called globular or G-actin, is a 375-amino-acid
polypeptide carrying a tightly associated molecule of ATP or ADP (Figure
16-11A). Actin is extraordinarily well conserved among eukaryotes. The amino
acid sequences of actins from different eukaryotic species are usually about 90%
identical. Small variations in actin amino acid sequence can cause significant
functional differences: in vertebrates, for example, there are three isoforms of
actin, termed α, β, and γ, that differ slightly in their amino acid sequences and
have distinct functions. α-Actin is expressed only in muscle cells, while β- and
γ-actins are found together in almost all non-muscle cells.

Actin Subunits Assemble Head-to-Tail to Create Flexible,
Polar Filaments

Actin subunits assemble head-to-tail to form a tight, right-handed helix, forming
a structure about 8 nm wide called filamentous or F-actin (Figure 16-11B and C).
Because the asymmetrical actin subunits of a filament all point in the same
direction, filaments are polar and have structurally different ends: a slower-growing
minus end and a faster-growing plus end. The minus end is also referred to as the
"pointed end" and the plus end as the "barbed end," because of the "arrowhead"
appearance of the complex formed between actin filaments and the motor
protein myosin (Figure 16-12). Within the filament, the subunits are positioned with
their nucleotide-binding cleft directed toward the minus end.

Individual actin filaments are quite flexible. The stiffness of a filament can be
characterized by its persistence length, the minimum filament length at which
random thermal fluctuations are likely to cause it to bend. The persistence length
of an actin filament is only a few tens of micrometers. In a living cell, however,


Figure 16-10 Caulobacter and crescentin. The sickle-shaped bacterium
Caulobacter crescentus expresses a protein, crescentin, with a series of
coiled-coil domains similar in size and organization to the domains of eukaryotic
intermediate filaments. (A) The crescentin protein forms a fiber (labeled in red)
that runs down the inner side of the curving bacterial cell wall. (B) When the
gene is disrupted, the bacteria grow as straight rods (bottom).
(From N. Ausmees, J.R. Kuhn and C. Jacobs-Wagner, Cell 115:705-713,
2003. With permission from Elsevier.)

accessory proteins cross-link and bundle the filaments together, making
large-scale actin structures that are much more rigid than an individual actin filament.


Nucleation Is the Rate-Limiting Step in the Formation of Actin
Filaments

The regulation of actin filament formation is an important mechanism by which
cells control their shape and movement. Small oligomers of actin subunits can
assemble spontaneously, but they are unstable and disassemble readily because
each monomer is bound to only one or two other monomers. For a new actin
filament to form, subunits must assemble into an initial aggregate, or nucleus, that is
stabilized by multiple subunit-subunit contacts and can then elongate rapidly by
addition of more subunits. This process is called filament nucleation.

Many features of actin nucleation and polymerization have been studied with
purified actin in a test tube (Figure 16-13). The instability of smaller actin
aggregates creates a kinetic barrier to nucleation. When polymerization is initiated, this
barrier results in a lag phase during which no filaments are observed. During this
lag phase, however, a few of the small, unstable aggregates succeed in making the
transition to a more stable form that resembles an actin filament. This leads to a





Figure 16-11 The structures of an actin monomer and actin filament. (A) The
actin monomer has a nucleotide (either ATP or ADP) bound in a deep cleft in the
center of the molecule. (B) Arrangement of monomers in a filament consisting
of two protofilaments, held together by lateral contacts, which wind around each
other as two parallel strands of a helix, with a twist repeating every 37 nm. All the
subunits within the filament have the same orientation. (C) Electron micrograph of
negatively stained actin filament. (C, courtesy of Roger Craig.)


Figure 16-12 Structural polarity of the actin filament. (A) This electron micrograph
shows an actin filament polymerized from a short actin filament seed that was decorated
with myosin motor domains, resulting in an arrowhead pattern. The filament has grown
much faster at the barbed (plus) end than at the pointed (minus) end. (B) Enlarged image
and model showing the arrowhead pattern. (A, courtesy of Tom Pollard; B, adapted
from M. Whittaker, B.O. Carragher and K.A. Milligan, Ultramicro. 54:245-260, 1995.)

Figure 16-13 The time course of actin polymerization in a test tube. (A) Polymerization of pure actin subunits into filaments
occurs after a lag phase. (B) Polymerization occurs more rapidly in the presence of preformed fragments of actin filaments,
which act as nuclei for filament growth. As indicated, the % free subunits after polymerization reflects the critical concentration
(C_c), at which there is no net change in polymer. Actin polymerization is often studied by observing the change in the light
emission from a fluorescent probe, called pyrene, that has been covalently attached to the actin. Pyrene-actin fluoresces more
brightly when it is incorporated into actin filaments.


phase of rapid filament elongation during which subunits are added quickly to
the ends of the nucleated filaments (Figure 16-13A). Finally, as the concentration
of actin monomers declines, the system approaches a steady state at which the
rate of addition of new subunits to the filament ends exactly balances the rate
of subunit dissociation. The concentration of free subunits left in solution at this
point is called the critical concentration, C_c. As explained in Panel 16-2, the value
of the critical concentration is equal to the rate constant for subunit loss divided
by the rate constant for subunit addition, that is, C_c = k_off/k_on, which is equal
to the dissociation constant, K_d, and the inverse of the equilibrium constant, K
(see Figure 3-44). In a test tube, the C_c for actin polymerization (that is, the free
actin monomer concentration at which the fraction of actin in the polymer stops
increasing) is about 0.2 µM. Inside the cell, the concentration of unpolymerized
actin is much higher than this, and the cell has evolved mechanisms to prevent
most of its monomeric actin from assembling into filaments, as we discuss later.
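
The relation between the rate constants and the critical concentration can be checked numerically. The following is a minimal sketch, not a model of real actin: the rate constants, the fixed concentration of filament ends, and the function names are all assumptions chosen for illustration, with the constants picked so that k_off/k_on comes out at the 0.2 µM quoted above.

```python
# Illustrative rate constants (assumed values, chosen so that C_c = 0.2 uM).
K_ON = 10.0   # subunit addition, per micromolar per second
K_OFF = 2.0   # subunit loss, per second


def critical_concentration(k_on, k_off):
    """Return C_c = k_off / k_on (micromolar for the units used here)."""
    return k_off / k_on


def simulate_free_subunits(c0, n_ends, k_on, k_off, dt=0.01, steps=200_000):
    """Euler integration of dC/dt = -n_ends * (k_on * C - k_off).

    c0      initial free subunit concentration (micromolar)
    n_ends  concentration of filament ends (micromolar), held fixed (assumption)
    Returns the free subunit concentration after `steps` time steps.
    """
    c = c0
    for _ in range(steps):
        c -= n_ends * (k_on * c - k_off) * dt
    return c


c_c = critical_concentration(K_ON, K_OFF)
c_final = simulate_free_subunits(c0=5.0, n_ends=0.001, k_on=K_ON, k_off=K_OFF)
print(f"C_c = {c_c:.3f} uM; free subunits left at steady state = {c_final:.3f} uM")
```

Whatever the starting concentration above C_c, the free subunit pool in this sketch drains until addition and loss balance, which is the plateau described in the text.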

The lag phase in filament growth is eliminated if preexisting seeds (such as
fragments of actin filaments that have been chemically cross-linked) are added to
the solution at the beginning of the polymerization reaction (Figure 16-13B). The
cell takes great advantage of this nucleation requirement: it uses special proteins
to catalyze filament nucleation at specific sites, thereby determining the location
at which new actin filaments are assembled.
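
The effect of seeds on the lag phase can be illustrated with a toy kinetic model. This is a sketch under stated assumptions rather than a fitted description of actin: new filaments are assumed to nucleate at a rate proportional to the cube of the free subunit concentration (reflecting the trimer nucleus described in Panel 16-2), each filament is assumed to elongate at k_on*C - k_off, and every constant and function name below is hypothetical.

```python
def half_time(total, seeds, k_nuc=1e-8, k_on=10.0, k_off=2.0,
              dt=0.01, t_max=5000.0):
    """Time (s) for polymer to reach half of its final steady-state amount.

    total  total subunit concentration (micromolar)
    seeds  concentration of preformed filament seeds (micromolar)
    k_nuc  assumed nucleation constant for a trimer nucleus (rate = k_nuc * C**3)
    Returns None if the half-way point is not reached before t_max.
    """
    c_c = k_off / k_on                  # critical concentration
    target = 0.5 * (total - c_c)        # half of the polymer formed at steady state
    filaments = seeds                   # concentration of growing filaments
    polymer = 0.0                       # subunits incorporated into polymer
    t = 0.0
    while t < t_max:
        free = total - polymer
        filaments += k_nuc * free ** 3 * dt                         # slow nucleation
        polymer += filaments * max(k_on * free - k_off, 0.0) * dt   # elongation
        t += dt
        if polymer >= target:
            return t
    return None


t_unseeded = half_time(total=5.0, seeds=0.0)
t_seeded = half_time(total=5.0, seeds=0.01)
print(f"half-time without seeds: {t_unseeded:.0f} s; with seeds: {t_seeded:.0f} s")
```

In this sketch the unseeded reaction spends most of its time waiting for filaments to appear, whereas the seeded reaction begins elongating immediately, which is the qualitative difference between panels A and B of Figure 16-13.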

Actin Filaments Have Two Distinct Ends That Grow at Different
Rates

Due to the uniform orientation of asymmetric actin subunits in the filament, the
structures at its two ends are different. This orientation makes the two ends of
each polymer different in ways that have a profound effect on filament growth
rates. The kinetic rate constants for actin subunit association and dissociation
(k_on and k_off, respectively) are much greater at the plus end than the minus end.
This can be seen when an excess of purified actin monomers is allowed to assemble
onto polarity-marked filaments: the plus end of the filament elongates up to
ten times faster (see Figure 16-12). If filaments are rapidly diluted so that the free
subunit concentration drops below the critical concentration, the plus end also
depolymerizes faster.

It is important to note, however, that the two ends of an actin filament have the
same net affinity for actin subunits, if all of the subunits are in the same nucleotide
state. Addition of a subunit to either end of a filament of n subunits results in a
filament of n + 1 subunits. Thus, the free-energy difference, and therefore the
equilibrium constant (and the critical concentration), must be the same for addition
of subunits at either end of the polymer. In this case, the ratio of the rate
constants, k_off/k_on, must be identical at the two ends, even though the absolute values
of these rate constants are very different at each end (see Panel 16-2).
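
A short numerical sketch may make this constraint concrete. The rate constants below are assumptions chosen only so that the plus end is ten times faster than the minus end while the ratio k_off/k_on is the same at both; the dictionary layout and function name are likewise hypothetical.

```python
# Assumed rate constants: the plus end is 10x faster in both directions,
# so the ratio k_off / k_on (the critical concentration) is unchanged.
PLUS_END = {"k_on": 10.0, "k_off": 2.0}    # per uM per s, per s
MINUS_END = {"k_on": 1.0, "k_off": 0.2}


def net_growth_rate(end, c):
    """Net subunits gained per second at one end, at free concentration c (uM)."""
    return end["k_on"] * c - end["k_off"]


cc_plus = PLUS_END["k_off"] / PLUS_END["k_on"]
cc_minus = MINUS_END["k_off"] / MINUS_END["k_on"]
print(f"C_c at plus end = {cc_plus:.2f} uM; C_c at minus end = {cc_minus:.2f} uM")

for c in (0.1, 0.2, 0.5):
    plus = net_growth_rate(PLUS_END, c)
    minus = net_growth_rate(MINUS_END, c)
    print(f"C = {c} uM: plus end {plus:+.2f}/s, minus end {minus:+.2f}/s")
```

Below the shared critical concentration both ends shrink, above it both grow, and in each case the plus end changes length ten times faster; without nucleotide hydrolysis there is no concentration at which one end grows while the other shrinks.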

The cell takes advantage of actin filament dynamics and polarity to do mechanical
work. Filament elongation proceeds spontaneously when the free-energy
change (ΔG) for addition of the soluble subunit is less than zero. This is the case
when the concentration of subunits in solution exceeds the critical concentration.
A cell can couple an energetically unfavorable process to this spontaneous
process; thus, the cell can use free energy released during spontaneous filament
polymerization to move an attached load. For example, by orienting the fast-growing
plus ends of actin filaments toward its leading edge, a motile cell can push its
plasma membrane forward, as we discuss later.

ATP Hydrolysis Within Actin Filaments Leads to Treadmilling at 
Steady State 

Thus far in our discussion of actin filament dynamics, we have ignored the critical
fact that actin can catalyze the hydrolysis of the nucleoside triphosphate ATP. For
free actin subunits, this hydrolysis proceeds very slowly; however, it is accelerated
when the subunits are incorporated into filaments. Shortly after ATP hydrolysis
occurs, the free phosphate group is released from each subunit, but the ADP
remains trapped in the filament structure. Thus, two different types of filament
structures can exist, one with the "T form" of the nucleotide bound (ATP), and one
with the "D form" bound (ADP).

When the nucleotide is hydrolyzed, much of the free energy released by cleavage
of the phosphate-phosphate bond is stored in the polymer. This makes the
free-energy change for dissociation of a subunit from the D-form polymer more
negative than the free-energy change for dissociation of a subunit from the T-form
polymer. Consequently, the ratio of k_off/k_on for the D-form polymer, which is
numerically equal to its critical concentration [C_c(D)], is larger than the
corresponding ratio for the T-form polymer. Thus, C_c(D) is greater than C_c(T). At
certain concentrations of free subunits, D-form polymers will therefore shrink while
T-form polymers grow.

In living cells, most soluble actin subunits are in the T form, as the free
concentration of ATP is about tenfold higher than that of ADP. However, the longer the
time that subunits have been in the actin filament, the more likely they are to have
hydrolyzed their ATP. Whether the subunit at each end of a filament is in the T or
the D form depends on the rate of this hydrolysis compared with the rate of
subunit addition. If the concentration of actin monomers is greater than the critical
concentration for both the T-form and D-form polymer, then subunits will add to
the polymer at both ends before the nucleotides in the previously added subunits
are hydrolyzed; as a result, the tips of the actin filament will remain in the T form.
On the other hand, if the subunit concentration is less than the critical
concentrations for both the T-form and D-form polymer, then hydrolysis may occur before
the next subunit is added and both ends of the filament will be in the D form and
will shrink. At intermediate concentrations of actin subunits, it is possible for the
rate of subunit addition to be faster than nucleotide hydrolysis at the plus end,
but slower than nucleotide hydrolysis at the minus end. In this case, the plus end
of the filament remains in the T conformation, while the minus end adopts the D
conformation. The filament then undergoes a net addition of subunits at the plus
end, while simultaneously losing subunits from the minus end. This leads to the
remarkable property of filament treadmilling (Figure 16-14; see Panel 16-2).
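
The concentration at which treadmilling holds the filament length constant can be estimated with a simple calculation. The sketch below assumes that the plus end stays in the T form and the minus end in the D form, and that each end gains subunits at a net rate of k_on * (C - C_c) with its own k_on and C_c; these linear rate laws, the numerical values, and the function names are illustrative assumptions, not measurements from the text.

```python
def end_rate(k_on, c_crit, c):
    """Net subunits added per second at one end: k_on * (C - C_c)."""
    return k_on * (c - c_crit)


def treadmilling_concentration(k_on_plus, cc_plus, k_on_minus, cc_minus):
    """Free subunit concentration at which plus-end growth equals minus-end loss.

    Found by solving k_on_plus*(C - cc_plus) + k_on_minus*(C - cc_minus) = 0 for C.
    """
    return (k_on_plus * cc_plus + k_on_minus * cc_minus) / (k_on_plus + k_on_minus)


# Assumed values: T-form plus end (low C_c, fast) and D-form minus end (high C_c, slow).
K_ON_PLUS, CC_PLUS = 10.0, 0.1     # per uM per s, uM
K_ON_MINUS, CC_MINUS = 1.0, 0.6    # per uM per s, uM

c_ss = treadmilling_concentration(K_ON_PLUS, CC_PLUS, K_ON_MINUS, CC_MINUS)
plus_rate = end_rate(K_ON_PLUS, CC_PLUS, c_ss)
minus_rate = end_rate(K_ON_MINUS, CC_MINUS, c_ss)
print(f"steady-state free subunit concentration: {c_ss:.3f} uM")
print(f"plus end: {plus_rate:+.3f} subunits/s; minus end: {minus_rate:+.3f} subunits/s")
```

Because the plus end is the faster end in this sketch, the steady-state concentration sits much closer to the plus-end critical concentration than to the minus-end value, and the flux of subunits through the filament equals the plus-end growth rate.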

At a particular intermediate subunit concentration, the filament growth at the
plus end exactly balances the filament shrinkage at the minus end. Under these
conditions, the subunits cycle rapidly between the free and filamentous states,

PANEL 16-2: The Polymerization of Actin and Tubulin



ON RATES AND OFF RATES 

A linear polymer of protein molecules, such as an actin filament or a microtubule,
assembles (polymerizes) and disassembles (depolymerizes) by the addition and removal
of subunits at the ends of the polymer. The rate of addition of these subunits (called
monomers) is given by the rate constant k_on, which has units of M^-1 sec^-1. The rate
of loss is given by k_off (units of sec^-1).

[Diagram: polymer (with n subunits) + subunit ⇌ polymer (with n + 1 subunits); the forward reaction is labeled k_on and the reverse reaction k_off.]


NUCLEATION

A helical polymer is stabilized by multiple contacts between
adjacent subunits. In the case of actin, two actin molecules bind relatively weakly
to each other, but addition of a third actin monomer to form a trimer makes the
entire group more stable.

Further monomer addition can take place onto this trimer, which therefore acts
as a nucleus for polymerization. For tubulin, the nucleus is larger and has a more
complicated structure (possibly a ring of 13 or more tubulin molecules), but the
principle is the same.

The assembly of a nucleus is relatively slow, which explains the lag phase
seen during polymerization. The lag phase can be reduced or abolished 
entirely by adding premade nuclei, such as fragments of already polymerized 
microtubules or actin filaments. 


THE CRITICAL CONCENTRATION 

The number of monomers that add to the polymer (actin filament or microtubule) per
second will be proportional to the concentration of the free subunit (k_on C), but
the subunits will leave the polymer end at a constant rate (k_off) that does not
depend on C. As the polymer grows, subunits are used up, and C is observed to drop
until it reaches a constant value, called the critical concentration (C_c). At this
concentration, the rate of subunit addition equals the rate of subunit loss.

At this equilibrium,

    k_on C = k_off

so that

    C_c = k_off / k_on = K_d

(where K_d is the dissociation constant; see Figure 3-44).


TIME COURSE OF POLYMERIZATION 

The assembly of a protein into a long helical polymer such as a cytoskeletal 
filament or a bacterial flagellum typically shows the following time course: 

[Graph: polymer formation plotted against time, with the final plateau labeled EQUILIBRIUM.]

The lag phase corresponds to time taken for nucleation.

The growth phase occurs as monomers add to the exposed ends of the
growing filament, causing filament elongation.

The equilibrium phase, or steady state, is reached when the growth of the
polymer due to monomer addition precisely balances the shrinkage of the
polymer due to disassembly back to monomers.


PLUS AND MINUS ENDS 

The two ends of an actin filament or microtubule polymerize
at different rates. The fast-growing end is called the plus end,
whereas the slow-growing end is called the minus end. The
difference in the rates of growth at the two ends is made
possible by changes in the conformation of each subunit as
it enters the polymer.

[Diagram: a free subunit changes conformation as it becomes a subunit in the polymer; the minus end is labeled SLOW and the plus end FAST.]

This conformational change affects the rates at which subunits add to
the two ends.

Even though k_on and k_off will have different values for the plus and
minus ends of the polymer, their ratio k_off/k_on, and hence C_c, must be
the same at both ends for a simple polymerization reaction (no ATP or
GTP hydrolysis). This is because exactly the same subunit interactions
are broken when a subunit is lost at either end, and the final state of
the subunit after dissociation is identical. Therefore, the ΔG for subunit
loss, which determines the equilibrium constant for its association
with the end, is identical at both ends: if the plus end grows four
times faster than the minus end, it must also shrink four times
faster. Thus, for C > C_c, both ends grow; for C < C_c, both ends
shrink.

The nucleoside triphosphate hydrolysis that accompanies
actin and tubulin polymerization removes this constraint.

NUCLEOTIDE HYDROLYSIS

Each actin molecule carries a tightly bound ATP molecule that is hydrolyzed to a
tightly bound ADP molecule soon after its assembly into the polymer. Similarly, each
tubulin molecule carries a tightly bound GTP that is converted to a tightly bound
GDP molecule soon after the molecule assembles into the polymer.

    T (free monomer) → T (subunit in polymer) → D (subunit in polymer)

    (T = monomer carrying ATP or GTP; D = monomer carrying ADP or GDP)

Hydrolysis of the bound nucleotide reduces the binding affinity of the subunit for
neighboring subunits and makes it more likely to dissociate from each end of the
filament (see Figure 16-44 for a possible mechanism). It is usually the T form
that adds to the filament and the D form that leaves.

Considering events at the plus end only:

[Diagram: a polymer of D-form subunits with a T-form subunit at the plus end.]

As before, the polymer will grow until C = C_c. For illustrative purposes, we can ignore
k^D_on and k^T_off since they are usually very small, so that polymer growth ceases when

    k^T_on C = k^D_off    or    C_c = k^D_off / k^T_on

This is a steady state and not a true equilibrium, because the ATP or GTP that is
hydrolyzed must be replenished by a nucleotide exchange reaction of the
free subunit.



TREADMILLING 

One consequence of the nucleotide hydrolysis that accompanies polymer
formation is to change the critical concentration at the two ends of the
polymer. Since k^D_off and k^T_on refer to different reactions, their ratio
k^D_off/k^T_on need not be the same at both ends of the polymer, so that:

    C_c (minus end) > C_c (plus end)

Thus, if both ends of a polymer are exposed, polymerization proceeds
until the concentration of free monomer reaches a value that
is above C_c for the plus end but below C_c for the minus end. At this
steady state, subunits undergo a net assembly at the plus end and a
net disassembly at the minus end at an identical rate. The polymer
maintains a constant length, even though there is a net flux of subunits
through the polymer, known as treadmilling.


DYNAMIC INSTABILITY

Microtubules depolymerize about 100 times faster from an end containing 
GDP-tubulin than from one containing GTP-tubulin. A GTP cap favors growth, 
but if it is lost, then depolymerization ensues. 

[Diagram: a GROWING microtubule with a GTP cap and a SHRINKING microtubule that has lost its cap.]


Individual microtubules can therefore alternate between a period of slow growth and 
a period of rapid disassembly, a phenomenon called dynamic instability. 
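
The alternation between slow growth and rapid disassembly can be mimicked with a very small stochastic sketch. Everything in it is an assumption made for illustration: the filament is reduced to a single length and a capped or uncapped state, the cap is lost and regained with fixed per-step probabilities, and the step sizes and names are hypothetical rather than measured properties of microtubules.

```python
import random


def simulate_dynamic_instability(steps, rng, growth_per_step=1, shrink_per_step=10,
                                 p_lose_cap=0.02, p_regain_cap=0.1):
    """Return a list of (length, capped) pairs, one per time step.

    While capped, the filament grows slowly and may lose its cap.
    While uncapped, it shrinks quickly until the cap is regained or
    the filament has disappeared, at which point growth restarts.
    """
    length = 0
    capped = True
    history = []
    for _ in range(steps):
        if capped:
            length += growth_per_step
            if rng.random() < p_lose_cap:
                capped = False              # cap lost: switch to shrinking
        else:
            length = max(0, length - shrink_per_step)
            if length == 0 or rng.random() < p_regain_cap:
                capped = True               # cap regained: switch to growing
        history.append((length, capped))
    return history


history = simulate_dynamic_instability(steps=5000, rng=random.Random(0))
longest = max(length for length, _ in history)
shrinking_steps = sum(1 for _, capped in history if not capped)
print(f"longest length reached: {longest} subunits; steps spent shrinking: {shrinking_steps}")
```

Plotting the lengths in `history` against the step index gives the sawtooth pattern associated with this behavior: long, slow climbs interrupted by abrupt collapses.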















[Figure 16-14 labels: soluble subunits are in the T form; polymers are a mixture of the T form and the D form. (A) Polymerization followed by nucleotide hydrolysis: minus-end addition is slow, so hydrolysis catches up; plus-end addition is fast, so hydrolysis lags behind. C_c(T) < C_c(D); for C_c(T) < C < C_c(D), treadmilling occurs.]

while the total length of the filament remains unchanged. This "steady-state
treadmilling" requires a constant consumption of energy in the form of ATP hydrolysis.


The Functions of Actin Filaments Are Inhibited by Both
Polymer-stabilizing and Polymer-destabilizing Chemicals

Chemical compounds that stabilize or destabilize actin filaments are important
tools in studies of the filaments' dynamic behavior and function in cells. The
cytochalasins are fungal products that prevent actin polymerization by binding to the
plus end of actin filaments. Latrunculin prevents actin polymerization by binding
to actin subunits. The phalloidins are toxins isolated from the Amanita mushroom
that bind tightly all along the side of actin filaments and stabilize them against
depolymerization. All of these compounds cause dramatic changes in the actin
cytoskeleton and are toxic to cells, indicating that the function of actin filaments
depends on a dynamic equilibrium between filaments and actin monomers
(Table 16-1).


Figure 16-14 Treadmilling of an actin filament, made possible by the ATP
hydrolysis that follows subunit addition. (A) Explanation for the different critical
concentrations ($C_c$) at the plus and minus ends. Subunits with bound ATP (T-form
subunits) polymerize at both ends of a growing filament, and then undergo
nucleotide hydrolysis within the filament. As the filament grows, elongation is faster than
hydrolysis at the plus end in this example, and the terminal subunits at this end are
therefore always in the T form. However, hydrolysis is faster than elongation at the
minus end, and so terminal subunits at this end are in the D form. (B) Treadmilling
occurs at intermediate concentrations of free subunits. The critical concentration for
polymerization on a filament end in the T form is lower than for a filament end
in the D form. If the actual subunit concentration is somewhere between these
two values, the plus end grows while the minus end shrinks, resulting in treadmilling.


TABLE 16-1 Chemical Inhibitors of Actin and Microtubules

Chemical              Effect on filaments   Mechanism                 Original source

Actin
Latrunculin           Depolymerizes         Binds actin subunits      Sponges
Cytochalasin B        Depolymerizes         Caps filament plus ends   Fungi
Phalloidin            Stabilizes            Binds along filaments     Amanita mushroom

Microtubules
Taxol® (paclitaxel)   Stabilizes            Binds along filaments     Yew tree
Nocodazole            Depolymerizes         Binds tubulin subunits    Synthetic
Colchicine            Depolymerizes         Caps filament ends        Autumn crocus


Arp 2/3 complex: nucleates assembly to form a web and remains associated with the minus end
profilin: binds subunits, speeds elongation
tropomodulin: prevents assembly and disassembly at minus end
cofilin: binds ADP-actin filaments, accelerates disassembly
tropomyosin: stabilizes filament

Some of the major accessory proteins of the actin cytoskeleton. Except for the myosin motor proteins, an example of each
major type is shown. Each of these is discussed in the text. However, most cells contain more than a hundred different
actin-binding proteins, and it is likely that there are important types of actin-associated proteins that are not yet recognized.


Actin-Binding Proteins Influence Filament Dynamics and
Organization

In a test tube, polymerization of actin is controlled simply by its concentration, as
described above, and by pH and the concentrations of salts and ATP. Within a cell,
however, actin behavior is also regulated by numerous accessory proteins that
bind actin monomers or filaments (summarized in Panel 16-3). At steady state
in vitro, when the monomer concentration is 0.2 µM, filament half-life, a measure
of how long an individual actin monomer spends in a filament as it treadmills, is
approximately 30 minutes. In a non-muscle vertebrate cell, actin half-life in
filaments is only 30 seconds, demonstrating that cellular factors modify the dynamic
behavior of actin filaments. Actin-binding proteins dramatically alter actin
filament dynamics and organization through spatial and temporal control of
monomer availability, filament nucleation, elongation, and depolymerization. In the
following sections, we describe the ways in which these accessory proteins modify
actin function in the cell.
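
The size of the difference between the two half-lives quoted above is easy to make explicit. The sketch below treats subunit turnover as a simple first-order process, which is an assumption made here for illustration; the function name `turnover_rate` is hypothetical.

```python
import math


def turnover_rate(half_life_s):
    """First-order rate constant (per second) corresponding to a half-life."""
    return math.log(2) / half_life_s


in_vitro = turnover_rate(30 * 60)   # half-life of about 30 minutes in a test tube
in_cell = turnover_rate(30)         # half-life of about 30 seconds in a cell
speedup = in_cell / in_vitro        # fold increase in turnover inside the cell
print(f"turnover is about {speedup:.0f}-fold faster in the cell")
```

The ratio depends only on the two half-lives, so the roughly 60-fold acceleration holds whatever the detailed kinetics turn out to be.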

Monomer Availability Controls Actin Filament Assembly 

In most non-muscle vertebrate cells, approximately 50% of the actin is in
filaments and 50% is soluble, and yet the soluble monomer concentration is
50-200 µM, well above the critical concentration. Why does so little of the actin
polymerize into filaments? The reason is that the cell contains proteins that bind
to the actin monomers and make polymerization much less favorable (an action
similar to that of the drug latrunculin). A small protein called thymosin is the most
abundant of these proteins. Actin monomers bound to thymosin are in a locked
state, where they cannot associate with either the plus or minus ends of actin
filaments and can neither hydrolyze nor exchange their bound nucleotide.
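
How strongly a sequestering protein can buffer the monomer pool can be sketched with a simple 1:1 binding equilibrium. Everything numerical here is an assumption for illustration: 100 µM of unpolymerized actin lies within the range quoted above, but the dissociation constant and the thymosin concentrations are invented, and the function name `free_monomer` is hypothetical.

```python
import math


def free_monomer(actin_total, thymosin_total, kd):
    """Free actin monomer (same units as the inputs) for 1:1 binding at equilibrium.

    Solves A**2 + (T_total - A_total + Kd) * A - Kd * A_total = 0 for the free
    concentration A, which follows from Kd = A * T / AT and mass conservation.
    """
    b = thymosin_total - actin_total + kd
    return (-b + math.sqrt(b * b + 4.0 * kd * actin_total)) / 2.0


# Assumed, illustrative values in micromolar.
ACTIN_UNPOLYMERIZED = 100.0
KD = 1.0

for thymosin in (0.0, 100.0, 200.0, 400.0):
    free = free_monomer(ACTIN_UNPOLYMERIZED, thymosin, KD)
    print(f"thymosin {thymosin:6.1f} uM -> free actin {free:7.3f} uM")
```

Under these assumptions, an excess of the sequestering protein leaves only a small fraction of the unpolymerized actin free to add to filament ends, even though the total soluble pool is large.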

How do cells recruit actin monomers from this buffered storage pool and use
them for polymerization? The answer depends on another monomer-binding
protein called profilin. Profilin binds to the face of the actin monomer opposite the
ATP-binding cleft, blocking the side of the monomer that would normally
associate with the filament minus end, while leaving exposed the site on the monomer
that binds to the plus end (Figure 16-15). When the profilin-actin complex binds
a free plus end, a conformational change in actin reduces its affinity for profilin
and the profilin falls off, leaving the actin filament one subunit longer. Profilin
competes with thymosin for binding to individual actin monomers. Thus, by
regulating the local activity of profilin, cells can control the movement of actin
subunits from the sequestered thymosin-bound pool onto filament plus ends.

Several mechanisms regulate profilin activity, including profilin phosphorylation
and profilin binding to inositol phospholipids. These mechanisms can define
the sites where profilin acts. For example, profilin is required for filament
assembly at the plasma membrane, where it is recruited by an interaction with acidic
membrane phospholipids. At this location, extracellular signals can activate
profilin to produce local actin polymerization and the extension of actin-rich motile
structures such as filopodia and lamellipodia.

Actin-Nucleating Factors Accelerate Polymerization and Generate 
Branched or Straight Filaments 

In addition to the availability of active actin subunits, a second prerequisite for
cellular actin polymerization is filament nucleation. Proteins that contain actin
monomer binding motifs linked in tandem mediate the simplest mechanism of
filament nucleation. These actin-nucleating proteins bring several actin subunits
together to form a seed. In most cases, actin nucleation is catalyzed by one of two
different types of factors: the Arp 2/3 complex or the formins. The first of these is
a complex of proteins that includes two actin-related proteins, or ARPs, each of
which is about 45% identical to actin. The Arp 2/3 complex nucleates actin
filament growth from the minus end, allowing rapid elongation at the plus end
(Figure 16-16A and B). The complex can attach to the side of another actin filament
while remaining bound to the minus end of the filament that it has nucleated,
thereby building individual filaments into a treelike web (Figure 16-16C and D).

Formins are dimeric proteins that nucleate the growth of straight, unbranched 
filaments that can be cross-linked by other proteins to form parallel bundles. Each 
formin subunit has a binding site for monomeric actin, and the formin dimer 
appears to nucleate actin filament polymerization by capturing two monomers.

PROFILIN COMPETES WITH THYMOSIN FOR BINDING TO ACTIN MONOMERS
AND PROMOTES ASSEMBLY


Figure 16-15 Effects of thymosin and profilin on actin polymerization. An actin monomer bound to thymosin is sterically
prevented from binding to and elongating the plus end of an actin filament (left). An actin monomer bound to profilin, on the
other hand, is capable of elongating a filament (right). Thymosin and profilin cannot both bind to a single actin monomer at the
same time. In a cell in which most of the actin monomer is bound to thymosin, the activation of a small amount of profilin can
produce rapid filament assembly. As indicated (bottom), profilin binds to actin monomers that are transiently released from
the thymosin-bound monomer pool, shuttles them onto the plus ends of actin filaments, and is then released and recycled for
further rounds of filament elongation.


As the newly nucleated filament grows, the formin dimer remains associated with
the rapidly growing plus end while still allowing the addition of new subunits at
that end (Figure 16-17). This mechanism of filament assembly is clearly different
from that used by the Arp 2/3 complex, which remains stably bound to the
filament minus end, preventing subunit addition or loss at that end.
Formin-dependent actin filament growth is strongly enhanced by the association of actin
monomers with profilin (Figure 16-18).

Like profilin activation, actin filament nucleation by Arp 2/3 complexes and
formins occurs primarily at the plasma membrane, and the highest density of
actin filaments in most cells is at the cell periphery. The layer just beneath the
plasma membrane is called the cell cortex, and the actin filaments in this region
determine the shape and movement of the cell surface, allowing the cell to change
its shape and stiffness rapidly in response to changes in its external environment.

Actin-Filament-Binding Proteins Alter Filament Dynamics 

Actin filament behavior is regulated by two major classes of binding proteins:
those that bind along the side of a filament and those that bind to the ends (see
Panel 16-3). Side-binding proteins include tropomyosin, an elongated protein
that binds simultaneously to six or seven adjacent actin subunits along each of the
two grooves of the helical actin filament. In addition to stabilizing and stiffening
the filament, the binding of tropomyosin can prevent the actin filament from
interacting with other proteins; this aspect of tropomyosin function is important
in the control of muscle contraction, as we discuss later.

An actin filament that stops growing and is not specifically stabilized in the cell
will depolymerize rapidly, particularly at its plus end, once the actin molecules
have hydrolyzed their ATP. The binding of plus-end capping protein (also called
CapZ for its location in the muscle Z band) stabilizes an actin filament at its plus
end by rendering it inactive, greatly reducing the rates of filament growth and
depolymerization (Figure 16-19). At the minus end, an actin filament may be
capped by the Arp 2/3 complex that was responsible for its nucleation, although
many minus ends in a typical cell are released from the Arp 2/3 complex and are
uncapped.

Figure 16-16 Nucleation and actin web formation by the Arp 2/3 complex. (A) The structures of Arp2 and Arp3 compared to the structure
of actin. Although the face of the molecule equivalent to the plus end (top) in both Arp2 and Arp3 is very similar to the plus end of actin itself,
differences on the sides and minus end prevent these actin-related proteins from forming filaments on their own or coassembling into filaments
with actin. (B) A model for actin filament nucleation by the Arp 2/3 complex. In the absence of an activating factor, Arp2 and Arp3 are held by
their accessory proteins in an orientation that prevents them from nucleating a new actin filament. When an activating factor (indicated by the blue
triangle) binds the complex, Arp2 and Arp3 are brought together into a new configuration that resembles the plus end of an actin filament. Actin
subunits can then assemble onto this structure, bypassing the rate-limiting step of filament nucleation. (C) The Arp 2/3 complex nucleates filaments
most efficiently when it is bound to the side of a preexisting actin filament. The result is a filament branch that grows at a 70° angle relative to the
original filament. Repeated rounds of branching nucleation result in a treelike web of actin filaments. (D) Top, electron micrographs of branched actin
filaments formed by mixing purified actin subunits with purified Arp 2/3 complexes. Bottom, reconstructed image of a branch where the crystal
structures of actin (pink) and the Arp 2/3 complex have been fitted to the electron density. The mother filament runs from top to bottom, and the
daughter filament branches off to the right where the Arp 2/3 complex binds to three actin subunits in the mother filament. (D, top, from R.D. Mullins
et al., Proc. Natl Acad. Sci. USA 95:6181-6186, 1998, with permission from National Academy of Sciences; bottom, from N. Volkmann et al.,
Science 293:2456-2459, 2001, with permission from AAAS.)

Tropomodulin, best known for its function in the capping of exceptionally
long-lived actin filaments in muscle, binds tightly to the minus ends of actin
filaments that have been coated and thereby stabilized by tropomyosin. It can also
transiently cap pure actin filaments and significantly reduce their elongation and
depolymerization rates. A large family of tropomodulin proteins regulates actin
filament length and stability in many cell types.

For maximum effect, proteins that bind the side of actin filaments coat the
filament completely, and must therefore be present in high amounts. In contrast,
end-binding proteins can affect filament dynamics even when they are present at
very low levels. Since subunit addition and loss occur primarily at filament ends,
one molecule of an end-binding protein per actin filament (roughly one molecule
per 200-500 actin subunits) can be enough to transform the architecture of an
actin filament network.
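
The contrast in stoichiometry between side-binding and end-binding proteins can be put into rough numbers. The sketch below makes a deliberately crude assumption: a side-binding protein such as tropomyosin covers seven subunits, and every subunit in the filament is covered exactly once. The function names are hypothetical.

```python
def side_binders_needed(n_subunits, subunits_per_binder=7):
    """Molecules needed to coat a filament if each one spans ~7 subunits (assumed)."""
    return -(-n_subunits // subunits_per_binder)   # ceiling division


def end_binders_needed():
    """A single molecule is enough to cap one end of one filament."""
    return 1


for n in (200, 500):   # filament sizes taken from the range quoted in the text
    print(f"{n} subunits: {side_binders_needed(n)} side binders "
          f"vs {end_binders_needed()} end binder")
```

Even with this simplification, coating a filament takes tens of molecules, whereas capping an end takes one, which is why end-binding proteins are effective at much lower concentrations.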


Severing Proteins Regulate Actin Filament Depolymerization 

Another important mechanism of actin filament regulation depends on proteins
that break an actin filament into many smaller filaments, thereby generating a
large number of new filament ends. The fate of these new ends depends on the
presence of other accessory proteins. Under some conditions, newly formed ends
nucleate filament elongation, thereby accelerating the assembly of new filament
structures. Under other conditions, severing promotes the depolymerization of
old filaments, speeding up the depolymerization rate by tenfold or more. In
addition, severing changes the physical and mechanical properties of the cytoplasm:
stiff, large bundles and gels become more fluid.

One class of actin-severing proteins is the gelsolin superfamily. These proteins
are activated by high levels of cytosolic Ca²⁺. Gelsolin interacts with the side of the
actin filament and contains subdomains that bind to two different sites: one that
is exposed on the surface of the filament and one that is hidden between adjacent
subunits. According to one model, gelsolin binds the side of an actin filament until
a thermal fluctuation creates a small gap between neighboring subunits, at which
point gelsolin inserts itself into the gap to break the filament. After the severing
event, gelsolin remains attached to the actin filament and caps the new plus end.


Figure 16-17 Actin elongation mediated by formins. Formin proteins (green) form
a dimeric complex that can nucleate the formation of a new actin filament (red) and
remain associated with the rapidly growing plus end as it elongates. The formin protein
maintains its binding to one of the two actin subunits exposed at the plus end as it
allows each new subunit to assemble. Only part of the large dimeric formin molecule
is shown here. Other regions regulate its activity and link it to particular structures
in the cell. Many formins are indirectly connected to the cell plasma membrane
and aid the insertional polymerization of the actin filament directly beneath the
membrane surface.


Figure 16-18 Profilin and formins. Some members of the formin protein
family have unstructured domains or “whiskers” that contain several binding
sites for profilin or the profilin-actin complex. These flexible domains serve
as a staging area for addition of actin to the growing plus end of the actin
filament when formin is bound. Under some conditions, this can enhance
the rate of actin filament elongation so that filament growth is faster than that
expected for a diffusion-controlled reaction, and faster in the presence of
formin and profilin than the rate for pure actin alone (see also Figure 3-78).


[Graph labels: uncapped population of filaments, growth at plus and minus ends;
capped population of filaments, growth at minus end only]

Figure 16-19 Filament capping and its effects on filament dynamics. A
population of uncapped filaments adds and loses subunits at both the plus and
minus ends, resulting in rapid growth or shrinkage, depending on the concentration
of available free monomers (green line). In the presence of a protein that caps the
plus end (red line), only the minus end is able to add or lose subunits; consequently,
filament growth will be slower at all monomer concentrations above the critical
concentration, and filament shrinkage will be slower at all monomer concentrations
below the critical concentration. In addition, the critical concentration for the population
shifts to that of the filament minus end.

Another important actin-filament destabilizing protein, found in all eukaryotic
cells, is cofilin. Also called actin depolymerizing factor, cofilin binds along the
length of the actin filament, forcing the filament to twist a little more tightly
(Figure 16-20). This mechanical stress weakens the contacts between actin subunits
in the filament, making the filament brittle and more easily severed by thermal
motions, generating filament ends that undergo rapid disassembly. As a result,
most of the actin filaments inside cells are shorter lived than are filaments formed
from pure actin in a test tube.

Cofilin binds preferentially to ADP-containing actin filaments rather than to
ATP-containing filaments. Since ATP hydrolysis is usually slower than filament
assembly, the newest actin filaments in the cell still contain mostly ATP and are
resistant to depolymerization by cofilin. Cofilin therefore tends to dismantle the
older filaments in the cell. As we will discuss later, the cofilin-mediated disassembly
of old but not new actin filaments is critical for the polarized, directed growth
of the actin network that is responsible for unidirectional cell crawling and the
intracellular motility of pathogens. Actin filaments can be protected from cofilin
by tropomyosin binding. Thus, the dynamics of actin in different subcellular
locations depends on the balance of stabilizing and destabilizing accessory proteins.


[Panel labels: (A) actin filament; bracket, 57 nm]


Figure 16-20 Twisting of an actin filament induced by cofilin. (A) Three-dimensional
reconstruction from cryoelectron micrographs of filaments made of pure actin. The bracket shows
the span of two twists of the actin helix. (B) Reconstruction of an actin filament coated with cofilin,
which binds in a 1:1 stoichiometry to actin subunits all along the filament. Cofilin is a small protein
(14 kD) compared to actin (43 kD), and so the filament appears only slightly thicker. The energy of
cofilin binding serves to deform the actin filament, twisting it more tightly and reducing the distance
spanned by each twist of the helix. (From A. McGough et al., J. Cell Biol. 138:771-781, 1997. With
permission from the authors.)

Figure 16-21 Actin arrays in a cell. A fibroblast crawling in a tissue-culture
dish is shown with four areas enlarged to show the arrangement of actin filaments.
The actin filaments are shown in red, with arrowheads pointing toward the minus
end. Stress fibers are contractile and exert tension. The actin cortex underlies the
plasma membrane and consists of gel-like networks or dendritic actin networks
that enable membrane protrusion at lamellipodia. Filopodia are spike-like
projections of the plasma membrane that allow a cell to explore its environment.


Higher-Order Actin Filament Arrays Influence Cellular Mechanical 
Properties and Signaling 

Actin filaments in animal cells are organized into several types of arrays: dendritic
networks, bundles, and weblike (gel-like) networks (Figure 16-21). Different
structures are initiated by the action of distinct nucleating proteins: the actin
filaments of dendritic networks are nucleated by the Arp 2/3 complex, while bundles
are made of the long, straight filaments produced by formins. The proteins
nucleating the filaments in the gel-like networks are not yet well defined.

The structural organization of different actin networks depends on specialized
accessory proteins. As explained earlier, Arp 2/3 organizes filaments into
dendritic networks by attaching filament minus ends to the side of other filaments.
Other actin filament structures are assembled and maintained by two classes of
proteins: bundling proteins, which cross-link actin filaments into a parallel array,
and gel-forming proteins, which hold two actin filaments together at a large angle
to each other, thereby creating a looser meshwork. Both bundling and gel-forming
proteins generally have two similar actin-filament-binding sites, which can
either be part of a single polypeptide chain or contributed by each of two
polypeptide chains held together in a dimer (Figure 16-22). The spacing and arrangement
of these two filament-binding domains determine the type of actin structure that
a given cross-linking protein forms.

Each type of bundling protein also determines which other molecules can
interact with the cross-linked actin filaments. Myosin II is the motor protein that
enables stress fibers and other contractile arrays to contract. The very close
packing of actin filaments caused by the small monomeric bundling protein fimbrin
apparently excludes myosin, and thus the parallel actin filaments held together
by fimbrin are not contractile. On the other hand, α-actinin cross-links oppositely
polarized actin filaments into loose bundles, allowing the binding of myosin and
formation of contractile actin bundles (Figure 16-23). Because of the very different
spacing and orientation of the actin filaments, bundling by fimbrin
automatically discourages bundling by α-actinin, and vice versa, so that the two types of
bundling protein are mutually exclusive.

Figure 16-22 The modular structures of four actin-cross-linking proteins. Each of
the proteins shown has two actin-binding sites (red) that are related in sequence.
Fimbrin has two directly adjacent actin-binding sites, so that it holds its two actin
filaments very close together (14 nm apart), aligned with the same polarity (see Figure
16-23A). The two actin-binding sites in α-actinin are separated by a spacer around
30 nm long, so that it forms more loosely packed actin bundles (see Figure 16-23A).
Filamin has two actin-binding sites with a V-shaped linkage between them, so that it
cross-links actin filaments into a network with the filaments oriented almost at right
angles to one another (see Figure 16-24). Spectrin is a tetramer of two α and two
β subunits, and the tetramer has two actin-binding sites spaced about 200 nm apart
(see Figure 10-38).

The bundling proteins that we have discussed so far have straight, stiff
connections between their two actin-filament-binding domains. Other actin cross-linking
proteins have either a flexible or a stiff, bent connection between their two
binding domains, allowing them to form actin filament webs or gels, rather than
actin bundles. Filamin (see Figure 16-22) promotes the formation of a loose
and highly viscous gel by clamping together two actin filaments roughly at right
angles (Figure 16-24A). Cells require the actin gels formed by filamin to extend
the thin, sheetlike membrane projections called lamellipodia that help them to
crawl across solid surfaces. In humans, mutations in the filamin A gene cause
defects in nerve-cell migration during early embryonic development. Cells in the
periventricular region of the brain fail to migrate to the cortex and instead form
nodules, causing a syndrome called periventricular heterotopia (Figure 16-24B).
Interestingly, in addition to binding actin, filamins have been reported to interact
with a large number of cellular proteins of great functional diversity, including
membrane receptors for signaling molecules, and filamin mutations can also lead
to defects in development of bone, the cardiovascular system, and other organs.
Thus, filamins may also function as signaling scaffolds by connecting and
coordinating a wide variety of cellular processes with the actin cytoskeleton.

A very different, well-studied web-forming protein is spectrin, which was first
identified in red blood cells. Spectrin is a long, flexible protein made out of four
elongated polypeptide chains (two α subunits and two β subunits), arranged so
that the two actin-filament-binding sites are about 200 nm apart (compared with
14 nm for fimbrin and about 30 nm for α-actinin; see Figure 16-23). In the red
blood cell, spectrin is concentrated just beneath the plasma membrane, where it
forms a two-dimensional weblike network held together by short actin filaments
whose precise lengths are tightly regulated by capping proteins at each end;
spectrin links this web to the plasma membrane because it has separate binding sites
for peripheral membrane proteins, which are themselves positioned near the lipid
bilayer by integral membrane proteins (see Figure 10-38). The resulting network
creates a strong, yet flexible cell cortex that provides mechanical support for the
overlying plasma membrane, allowing the red blood cell to spring back to its
original shape after squeezing through a capillary. Close relatives of spectrin are found
in the cortex of most other vertebrate cell types, where they also help to shape and
stiffen the surface membrane. A particularly striking example of spectrin's role
in promoting mechanical stability is the long, thin axon of neurons in the
nematode worm Caenorhabditis elegans, where spectrin is required to keep them from
breaking during the twisting motions the worms make during crawling.

Figure 16-23 The formation of two types of actin filament bundles.
(A) Fimbrin cross-links actin filaments into tight bundles, which exclude the motor
protein myosin II from participating in the assembly. In contrast, α-actinin, which is
a homodimer, cross-links actin filaments into loose bundles, which allow myosin
(not shown) to incorporate into the bundle. Fimbrin and α-actinin tend to exclude
one another because of the very different spacing of the actin filament bundles that
they form. (B) Electron micrograph of purified α-actinin molecules. (B, courtesy
of John Heuser.)

Figure 16-24 Filamin cross-links actin filaments into a three-dimensional
network and is required for normal neuronal migration. (A) Each filamin
homodimer is about 160 nm long when fully extended and forms a flexible,
high-angle link between two adjacent actin filaments. A set of actin filaments
cross-linked by filamin forms a mechanically strong web or gel. (B) Magnetic resonance
imaging of a normal human brain (left) and of a patient with periventricular
heterotopia (right) caused by mutation in the filamin A gene. In contrast to the
smooth ventricular surface in the normal brain, a rough zone of cortical neurons
(arrowheads) is seen along the lateral walls of the ventricles, representing neurons
that have failed to migrate to the cortex during brain development. Remarkably,
although many neurons are not in the right place, the intelligence of affected
individuals is frequently normal or only mildly compromised, and the major clinical
syndrome is epilepsy that often starts in the second decade of life. (B, adapted from
Y. Feng and C.A. Walsh, Nat. Cell Biol. 6:1034-1038, 2004. With permission
from Macmillan Publishers Ltd.)

The connections of the cortical actin cytoskeleton to the plasma membrane
are only partially understood. Members of the ERM family (named for its first
three members, ezrin, radixin, and moesin) help organize membrane domains
through their ability to interact with transmembrane proteins and the underlying
cytoskeleton. In so doing, they not only provide structural links to strengthen the
cell cortex, but also regulate the activities of signal transduction pathways. Moesin
also increases cortical stiffness to promote cell rounding during mitosis.
Measurements by atomic force microscopy indicate that the cell cortex remains soft during
mitosis when moesin is depleted. ERM proteins are thought to bind to and
organize the cortical actin cytoskeleton in a variety of contexts, thereby affecting the
shape and stiffness of the membrane as well as the localization and activity of
signaling molecules.

Bacteria Can Hijack the Host Actin Cytoskeleton 

Figure 16-25 The actin-based movement of Listeria monocytogenes. (A) Fluorescence micrograph of an infected cell
that has been stained to reveal bacteria in red and actin filaments in green. Note the cometlike tail of actin filaments behind
each moving bacterium. Regions of overlap between red and green fluorescence appear yellow. (B) Listeria motility can be
reconstituted in a test tube with ATP and just four purified proteins: actin, Arp 2/3 complex, capping protein, and cofilin. This
micrograph shows the dense actin tails behind bacteria (black). (C) The ActA protein on the bacterial surface activates the
Arp 2/3 complex to nucleate new filament assembly along the sides of existing filaments. Filaments grow at their plus end
until capped by capping protein. Actin is recycled through the action of cofilin, which enhances depolymerization at the minus
ends of the filaments. By this mechanism, polymerization is focused at the rear surface of the bacterium, propelling it forward
(see Movie 23.7). (A, courtesy of Julie Theriot and Tim Mitchison; B, from T.P. Loisel et al., Nature 401:613-616, 1999. With
permission from Macmillan Publishers Ltd.)

The importance of accessory proteins in actin-based motility and force production
is illustrated beautifully by studies of certain bacteria and viruses that use
components of the host-cell actin cytoskeleton to move through the cytoplasm.
The cytoplasm of mammalian cells is extremely viscous, containing organelles
and cytoskeletal elements that inhibit diffusion of large particles like bacteria or
viruses. To move around in a cell and invade neighboring cells, several pathogens,
including Listeria monocytogenes (which causes a rare but serious form of
food poisoning), overcome this problem by recruiting and activating the Arp 2/3
complex at their surface. The Arp 2/3 complex nucleates the assembly of actin
filaments that generate a substantial force and push the bacterium through the
cytoplasm at rates of up to 1 µm/sec, leaving behind a long actin "comet tail"
(Figure 16-25; see also Figures 23-28 and 23-29). This motility can be reconstituted
in a test tube by adding the bacteria to a mixture of pure actin, Arp 2/3 complex,
cofilin, and capping protein, illustrating how actin polymerization dynamics
generate movement through spatial regulation of filament assembly and
disassembly. As we shall see, actin-based movement of this sort also underlies
membrane protrusion at the leading edge of motile cells.

Summary 

Actin is a highly conserved cytoskeletal protein that is present in high concentrations
in nearly all eukaryotic cells. Nucleation presents a kinetic barrier to actin
polymerization, but once formed, actin filaments undergo dynamic behavior due
to hydrolysis of the bound nucleotide ATP. Actin filaments are polarized and can
undergo treadmilling when a filament assembles at the plus end while simultaneously
depolymerizing at the minus end. In cells, actin filament dynamics are regulated
at every step, and the varied forms and functions of actin depend on a versatile
repertoire of accessory proteins. Approximately half of the actin is kept in a
monomeric form through association with sequestering proteins such as thymosin.
Nucleation factors such as the Arp 2/3 complex and formins promote formation
of branched and parallel filaments, respectively. Interplay between proteins that
bind or cap actin filaments and those that promote filament severing or depolymerization
can slow or accelerate the kinetics of filament assembly and disassembly.
Another class of accessory proteins assembles the filaments into larger ordered structures
by cross-linking them to one another in geometrically defined ways. Connections
between these actin arrays and the plasma membrane of cells give an animal
cell mechanical strength and permit the elaboration of cortical cellular structures
such as lamellipodia, filopodia, and microvilli. By inducing actin filament polymerization
at their surface, intracellular pathogens can hijack the host-cell cytoskeleton
and move around inside the cell.












Figure 16-26 Myosin II. (A) The two globular heads and long tail of a myosin II molecule shadowed with platinum can be seen
in this electron micrograph. (B) A myosin II molecule is composed of two heavy chains (each about 2000 amino acids long;
green) and four light chains (blue). The light chains are of two distinct types, and one copy of each type is present on each
myosin head. Dimerization occurs when the two α helices of the heavy chains wrap around each other to form a coiled-coil,
driven by the association of regularly spaced hydrophobic amino acids (see Figure 3-9). The coiled-coil arrangement makes an
extended rod in solution, and this part of the molecule forms the tail. (A, courtesy of David Shotton.)


MYOSIN AND ACTIN 

A crucial feature of the actin cytoskeleton is that it can form contractile structures
that cross-link and slide actin filaments relative to one another through the action
of myosin motor proteins. In addition to driving muscle contraction, actin-myosin
assemblies perform important functions in non-muscle cells.

Actin-Based Motor Proteins Are Members of the Myosin 
Superfamily 

The first motor protein to be identified was skeletal muscle myosin, which generates
the force for muscle contraction. This protein, now called myosin II, is an
elongated protein formed from two heavy chains and two copies of each of two
light chains. Each heavy chain has a globular head domain at its N-terminus that
contains the force-generating machinery, followed by a very long amino acid
sequence that forms an extended coiled-coil that mediates heavy-chain dimerization
(Figure 16-26). The two light chains bind close to the N-terminal head
domain, while the long coiled-coil tail bundles itself with the tails of other myosin
molecules. These tail-tail interactions form large, bipolar "thick filaments" that
have several hundred myosin heads, oriented in opposite directions at the two
ends of the thick filament (Figure 16-27).

Figure 16-27 The myosin II bipolar thick filament in muscle. (A) Electron micrograph of a myosin II thick filament isolated from frog muscle. Note
the central bare zone, which is free of head domains. (B) Schematic diagram, not drawn to scale. The myosin II molecules aggregate by means of
their tail regions, with their heads projecting to the outside of the filament. The bare zone in the center of the filament consists entirely of myosin II
tails. (C) A small section of a myosin II filament as reconstructed from electron micrographs. An individual myosin molecule is highlighted in green.
The cytoplasmic myosin II filaments in non-muscle cells are much smaller, although similarly organized (see Figure 16-39). (A, courtesy of Murray
Stewart; C, based on R.A. Crowther, R. Padrón and R. Craig, J. Mol. Biol. 184:429-439, 1985. With permission from Academic Press.)

Figure 16-28 Direct evidence for the motor activity of the myosin head. In this experiment, purified myosin heads were
attached to a glass slide, and then actin filaments labeled with fluorescent phalloidin were added and allowed to bind to the
myosin heads. (A) When ATP was added, the actin filaments began to glide along the surface, owing to the many individual
steps taken by each of the dozens of myosin heads bound to each filament. The video frames shown in this sequence were
recorded about 0.6 second apart; the two actin filaments shown (one red and one green) were moving in opposite directions
at a rate of about 4 µm/sec. (B) Diagram of the experiment. The large red arrows indicate the direction of actin filament
movement (Movie 16.2). (A, courtesy of James Spudich.)


Each myosin head binds and hydrolyzes ATP, using the energy of ATP hydrolysis
to walk toward the plus end of an actin filament (Figure 16-28). The opposing
orientation of the heads in the thick filament makes the filament efficient at sliding
pairs of oppositely oriented actin filaments toward each other, shortening the
muscle. In skeletal muscle, in which carefully arranged actin filaments are aligned
in "thin filament" arrays surrounding the myosin thick filaments, the ATP-driven
sliding of actin filaments results in a powerful contraction. Cardiac and smooth
muscle contain myosin II molecules that are similarly arranged, although different
genes encode them.

Myosin Generates Force by Coupling ATP Hydrolysis to 
Conformational Changes 

Motor proteins use structural changes in their ATP-binding sites to produce cyclic
interactions with a cytoskeletal filament. Each cycle of ATP binding, hydrolysis,
and release propels them forward in a single direction to a new binding site along
the filament. For myosin II, each step of the movement along actin is generated by
the swinging of an 8.5-nm-long α helix, or lever arm, which is structurally stabilized
by the binding of light chains. At the base of this lever arm next to the head,
there is a pistonlike helix that connects movements at the ATP-binding cleft in the
head to small rotations of the so-called converter domain. A small change at this
point can swing the helix like a long lever, causing the far end of the helix to move
by about 5.0 nm.
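
As a rough geometric check on these numbers, the lever arm can be idealized as a
rigid rod pivoting about its base. This idealization, and the function name used
below, are assumptions made here for illustration and are not taken from the
structural data. Under that assumption the straight-line movement of the tip is
the chord 2L sin(θ/2) of the arc it sweeps, so an 8.5-nm arm whose tip moves
about 5.0 nm would swing through roughly 34 degrees.

```python
import math


def lever_swing_angle_deg(arm_length_nm: float, tip_displacement_nm: float) -> float:
    """Swing angle (degrees) of an idealized rigid lever.

    The tip displacement is treated as the chord of the arc swept by the
    tip, so chord = 2 * L * sin(theta / 2). This is a geometric sketch,
    not a model of the real converter domain.
    """
    half_chord_ratio = tip_displacement_nm / (2.0 * arm_length_nm)
    if not 0.0 <= half_chord_ratio <= 1.0:
        raise ValueError("tip displacement cannot exceed twice the arm length")
    return math.degrees(2.0 * math.asin(half_chord_ratio))


# Values quoted in the text: 8.5-nm lever arm, tip moves about 5.0 nm.
angle = lever_swing_angle_deg(arm_length_nm=8.5, tip_displacement_nm=5.0)
print(f"Approximate lever-arm swing: {angle:.0f} degrees")
```

The same function shows why a longer lever arm would give a larger step for the
same rotation: the tip displacement scales linearly with arm length at fixed angle.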

These changes in the conformation of the myosin are coupled to changes in
its binding affinity for actin, allowing the myosin head to release its grip on the
actin filament at one point and snatch hold of it again at another. The full mechanochemical
cycle of nucleotide binding, nucleotide hydrolysis, and phosphate
release (which causes the "power stroke") produces a single step of movement
(Figure 16-29). At low ATP concentrations, the interval between the force-producing
step and the binding of the next ATP is long enough that single steps can
be observed (Figure 16-30).

Sliding of Myosin II Along Actin Filaments Causes Muscles to 
Contract 

Muscle contraction is the most familiar and best-understood form of movement in
animals. In vertebrates, running, walking, swimming, and flying all depend on the
rapid contraction of skeletal muscle on its scaffolding of bone, while involuntary
movements such as heart pumping and gut peristalsis depend on the contraction
of cardiac muscle and smooth muscle, respectively. All these forms of muscle
contraction depend on the ATP-driven sliding of highly organized arrays of actin
filaments against arrays of myosin II filaments.

ATTACHED At the start of the cycle shown in this figure,
a myosin head lacking a bound nucleotide is locked
tightly onto an actin filament in a rigor configuration (so
named because it is responsible for rigor mortis, the
rigidity of death). In an actively contracting muscle, this
state is very short-lived, being rapidly terminated by the
binding of a molecule of ATP.

RELEASED A molecule of ATP binds to the large cleft on
the "back" of the head (that is, on the side furthest from
the actin filament) and immediately causes a slight
change in the conformation of the actin-binding site,
reducing the affinity of the head for actin and allowing
it to move along the filament. (The space drawn here
between the head and actin emphasizes this change,
although in reality the head probably remains very close
to the actin.)

COCKED The cleft closes like a clam shell around the
ATP molecule, triggering a movement in the lever arm
that causes the head to be displaced along the filament
by a distance of about 5 nm. Hydrolysis of ATP occurs, but
the ADP and inorganic phosphate (Pi) remain tightly
bound to the protein.

FORCE-GENERATING Weak binding of the myosin head
to a new site on the actin filament causes release of the
inorganic phosphate produced by ATP hydrolysis,
concomitantly with the tight binding of the head to actin.
This release triggers the power stroke, the
force-generating change in shape during which the head
regains its original conformation. In the course of the
power stroke, the head loses its bound ADP, thereby
returning to the start of a new cycle.

ATTACHED At the end of the cycle, the myosin head is
again locked tightly to the actin filament in a rigor
configuration. Note that the head has moved to a new
position on the actin filament.

Figure 16-29 The cycle of structural changes used by myosin II to walk along an actin filament. In the myosin II cycle, the
head remains bound to the actin filament for only about 5% of the entire cycle time, allowing many myosins to work together to
move a single actin filament (Movie 16.3). (Based on I. Rayment et al., Science 261:50-58, 1993.)

Skeletal muscle was a relatively late evolutionary development, and muscle
cells are highly specialized for rapid and efficient contraction. The long, thin
muscle fibers of skeletal muscle are actually huge single cells that form during
development by the fusion of many separate cells. The large muscle cell retains
the many nuclei of the contributing cells. These nuclei lie just beneath the plasma
membrane (Figure 16-31). The bulk of the cytoplasm inside is made up of myofibrils,
which is the name given to the basic contractile elements of the muscle cell.
A myofibril is a cylindrical structure 1-2 µm in diameter that is often as long as
the muscle cell itself. It consists of a long, repeated chain of tiny contractile units
called sarcomeres, each about 2.2 µm long, which give the vertebrate myofibril its
striated appearance (Figure 16-32).
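
To get a feel for the scale of this repeated chain, the short sketch below counts
how many 2.2-µm sarcomeres would lie end to end in a myofibril. The 3-cm length
is an illustrative assumption (the text says only that a muscle cell can be up to
several centimeters long), and the function name is hypothetical.

```python
SARCOMERE_LENGTH_UM = 2.2  # approximate sarcomere length quoted in the text


def sarcomeres_in_series(myofibril_length_cm: float) -> int:
    """Approximate number of sarcomeres lying end to end in a myofibril."""
    length_um = myofibril_length_cm * 1e4  # 1 cm = 10,000 micrometers
    return round(length_um / SARCOMERE_LENGTH_UM)


# Assumed example length of 3 cm, chosen only for illustration.
count = sarcomeres_in_series(myofibril_length_cm=3.0)
print(f"About {count:,} sarcomeres in series")
```

For the assumed 3-cm myofibril this gives on the order of ten thousand
sarcomeres in series.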

Each sarcomere is formed from a miniature, precisely ordered array of parallel
and partly overlapping thin and thick filaments. The thin filaments are composed
of actin and associated proteins, and they are attached at their plus ends to a Z
disc at each end of the sarcomere. The capped minus ends of the actin filaments
extend in toward the middle of the sarcomere, where they overlap with thick filaments,
the bipolar assemblies formed from specific muscle isoforms of myosin
II (see Figure 16-27). When this region of overlap is examined in cross section by
electron microscopy, the myosin filaments are arranged in a regular hexagonal
lattice, with the actin filaments evenly spaced between them (Figure 16-33). Cardiac
muscle and smooth muscle also contain sarcomeres, although the organization
is not as regular as that in skeletal muscle.


Figure 16-30 The force of a single myosin molecule moving along an actin filament measured using an
optical trap. (A) Schematic of the experiment, showing an actin filament with beads attached at both ends and held
in place by focused beams of light called optical tweezers (Movie 16.4). The tweezers trap and move the bead,
and can also be used to measure the force exerted on the bead through the filament. In this experiment, the filament
was positioned over another bead to which myosin II motors were attached, and the optical tweezers were used to
determine the effects of myosin binding on movement of the actin filament. (B) These traces show filament movement
in two separate experiments. Initially, when the actin filament is unattached to myosin, thermal motion of the filament
produces noisy fluctuations in filament position. When a single myosin binds to the actin filament, thermal motion
decreases abruptly and a roughly 10-nm displacement results from movement of the filament by the motor. The
motor then releases the filament. Because the ATP concentration is very low in this experiment, the myosin remains
attached to the actin filament for much longer than it would in a muscle cell. (Adapted from C. Rüegg et al.,
Physiology 17:213-218, 2002. With permission from the American Physiological Society.)


Figure 16-31 Skeletal muscle cells (also called muscle fibers). (A) These
huge multinucleated cells form by the fusion of many muscle cell precursors,
called myoblasts. Here, a single muscle cell is depicted. In an adult human,
a muscle cell is typically 50 µm in diameter and can be up to several
centimeters long. (B) Fluorescence micrograph of rat muscle, showing the
peripherally located nuclei (blue) in these giant cells. Myofibrils are stained
red. (B, courtesy of Nancy L. Kedersha.)

Figure 16-32 Skeletal muscle myofibrils. (A) Low-magnification electron
micrograph of a longitudinal section through a skeletal muscle cell of a
rabbit, showing the regular pattern of cross-striations. The cell contains
many myofibrils aligned in parallel (see Figure 16-31). (B) Detail of the
skeletal muscle shown in (A), showing portions of two adjacent myofibrils
and the definition of a sarcomere (black arrow). (C) Schematic diagram of
a single sarcomere, showing the origin of the dark and light bands seen in
the electron micrographs. The Z discs, at each end of the sarcomere, are
attachment sites for the plus ends of actin filaments (thin filaments); the
M line, or midline, is the location of proteins that link adjacent myosin II
filaments (thick filaments) to one another. (D) When the sarcomere contracts,
the actin and myosin filaments slide past one another without shortening.
(A and B, courtesy of Roger Craig.)


Sarcomere shortening is caused by the myosin filaments sliding past the actin
thin filaments, with no change in the length of either type of filament (see Figure
16-32C and D). Bipolar thick filaments walk toward the plus ends of two sets of
thin filaments of opposite orientations, driven by dozens of independent myosin
heads that are positioned to interact with each thin filament. Because there is no
coordination among the movements of the myosin heads, it is critical that they
remain tightly bound to the actin filament for only a small fraction of each ATPase
cycle so that they do not hold one another back. Each myosin thick filament has
about 300 heads (294 in frog muscle), and each head cycles about five times per
second in the course of a rapid contraction, sliding the myosin and actin filaments
past one another at rates of up to 15 µm/sec and enabling the sarcomere
to shorten by 10% of its length in less than one-fiftieth of a second. The rapid
synchronized shortening of the thousands of sarcomeres lying end-to-end in each
myofibril enables skeletal muscle to contract rapidly enough for running and flying,
or for playing the piano.
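
The figures in this paragraph can be checked with a little arithmetic. The sketch
below is an illustration under stated assumptions, not a model from the text: it
takes the 2.2-µm sarcomere length and the 5% bound fraction quoted elsewhere in
this chapter (Figure 16-29), treats the heads as fully independent, and considers
both the case in which the whole sarcomere shortens at 15 µm/sec and the case in
which each of its two halves contributes that rate. All constant and function
names are hypothetical.

```python
SARCOMERE_LENGTH_UM = 2.2        # approximate sarcomere length from the text
FRACTION_SHORTENED = 0.10        # sarcomere shortens by 10% of its length
MAX_SLIDING_RATE_UM_S = 15.0     # maximum filament sliding rate from the text
HEADS_PER_THICK_FILAMENT = 300   # approximate head count from the text
BOUND_FRACTION = 0.05            # fraction of the cycle spent bound (Figure 16-29)


def shortening_time_ms(halves_contributing: int) -> float:
    """Time in ms for a 10% shortening at the maximum sliding rate.

    halves_contributing = 1 treats 15 um/sec as the rate for the whole
    sarcomere; 2 assumes each half-sarcomere slides at that rate.
    """
    distance_um = SARCOMERE_LENGTH_UM * FRACTION_SHORTENED
    rate_um_s = MAX_SLIDING_RATE_UM_S * halves_contributing
    return 1000.0 * distance_um / rate_um_s


def mean_heads_bound() -> float:
    """Average number of heads bound at any instant, assuming independence."""
    return HEADS_PER_THICK_FILAMENT * BOUND_FRACTION


def prob_no_head_bound() -> float:
    """Probability that no head is bound at a given instant (independent heads)."""
    return (1.0 - BOUND_FRACTION) ** HEADS_PER_THICK_FILAMENT


print(f"10% shortening, one rate for whole sarcomere: {shortening_time_ms(1):.1f} ms")
print(f"10% shortening, both halves sliding:          {shortening_time_ms(2):.1f} ms")
print(f"Mean heads bound per thick filament:          {mean_heads_bound():.0f}")
print(f"Chance that no head is bound:                 {prob_no_head_bound():.1e}")
```

Under either reading the shortening time comes out below the 20 msec
(one-fiftieth of a second) quoted above, and only about 15 of the roughly 300
heads are bound at any moment, which is why uncoordinated heads rarely hold one
another back while the filament is almost never left unattached.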

Accessory proteins produce the remarkable uniformity in filament organization,
length, and spacing in the sarcomere (Figure 16-34). The actin filament plus
ends are anchored in the Z disc, which is built from CapZ and α-actinin; the Z disc
caps the filaments (preventing depolymerization), while holding them together in
a regularly spaced bundle. The precise length of each thin filament is influenced
by a protein of enormous size, called nebulin, which consists almost entirely of a
repeating 35-amino-acid actin-binding motif. Nebulin stretches from the Z disc
toward the minus end of each thin filament, which is capped and stabilized by
tropomodulin. Although there is some slow exchange of actin subunits at both ends
of the muscle thin filament, such that the components of the thin filament turn
over with a half-life of several days, the actin filaments in sarcomeres are remarkably
stable compared with those found in most other cell types, whose dynamic
actin filaments turn over with half-lives of a few minutes or less.

Figure 16-33 Electron micrographs of an insect flight muscle viewed in cross section. The myosin and actin
filaments are packed together with almost crystalline regularity. Unlike their vertebrate counterparts, these myosin
filaments have a hollow center, as seen in the enlargement on the right. The geometry of the hexagonal lattice is
slightly different in vertebrate muscle. (From J. Auber, J. de Microsc. 8:197-232, 1969. With permission from
Société Française de Microscopie Électronique.)

Opposing pairs of an even longer template protein, called titin, position the
thick filaments midway between the Z discs. Titin acts as a molecular spring, with
a long series of immunoglobulin-like domains that can unfold one by one as stress
is applied to the protein. A springlike unfolding and refolding of these domains
keeps the thick filaments poised in the middle of the sarcomere and allows the
muscle fiber to recover after being overstretched. In C. elegans, whose sarcomeres
are longer than those in vertebrates, titin is longer as well, suggesting that it serves
also as a molecular ruler, determining in this case the overall length of each
sarcomere.

A Sudden Rise in Cytosolic Ca2+ Concentration Initiates Muscle
Contraction

The force-generating molecular interaction between myosin thick filaments and
actin thin filaments takes place only when a signal passes to the skeletal muscle
from the nerve that stimulates it. Immediately upon arrival of the signal, the muscle
cell needs to be able to contract very rapidly, with all the sarcomeres shortening
simultaneously. Two major features of the muscle cell make extremely rapid
contraction possible. First, as previously discussed, the individual myosin motor
heads in each thick filament spend only a small fraction of the ATP cycle time
bound to the filament and actively generating force, so many myosin heads can
act in rapid succession on the same thin filament without interfering with one
another. Second, a specialized membrane system relays the incoming signal rapidly
throughout the entire cell. The signal from the nerve triggers an action potential
in the muscle cell plasma membrane (discussed in Chapter 11), and this electrical
excitation spreads swiftly into a series of membranous folds, the transverse
tubules or T tubules, that extend inward from the plasma membrane around
each myofibril. The signal is then relayed across a small gap to the sarcoplasmic
reticulum, an adjacent weblike sheath of modified endoplasmic reticulum that
surrounds each myofibril like a net stocking (Figure 16-35A and B).

When the incoming action potential activates a Ca2+ channel in the T-tubule
membrane, Ca2+ influx triggers the opening of Ca2+-release channels in the
sarcoplasmic reticulum (Figure 16-35C). Ca2+ flooding into the cytosol then initiates
the contraction of each myofibril. Because the signal from the muscle cell plasma
membrane is passed within milliseconds (via the T tubules and sarcoplasmic
reticulum) to every sarcomere in the cell, all of the myofibrils in the cell contract
at once. The increase in Ca2+ concentration is transient because the Ca2+ is rapidly
pumped back into the sarcoplasmic reticulum by an abundant, ATP-dependent
Ca2+-pump (also called a Ca2+-ATPase) in its membrane (see Figure 11-13).
Typically, the cytoplasmic Ca2+ concentration is restored to resting levels within 30
msec, allowing the myofibrils to relax. Thus, muscle contraction depends on two
processes that consume enormous amounts of ATP: filament sliding, driven by
the ATPase of the myosin motor domain, and Ca2+ pumping, driven by the
Ca2+-pump.


Figure 16-34 Organization of accessory proteins in a sarcomere. Each giant titin
molecule extends from the Z disc to the M line, a distance of over 1 µm. Part of
each titin molecule is closely associated with a myosin thick filament (which
switches polarity at the M line); the rest of the titin molecule is elastic and changes
length as the sarcomere contracts and relaxes. Each nebulin molecule is exactly
the length of a thin filament. The actin filaments are also coated with tropomyosin
and troponin (not shown; see Figure 16-36) and are capped at both ends.
Tropomodulin caps the minus end of the actin filaments, and CapZ anchors the
plus end at the Z disc, which also contains α-actinin (not shown).

Figure 16-35 T tubules and the sarcoplasmic reticulum. (A) Drawing of
the two membrane systems that relay the signal to contract from the muscle cell
plasma membrane to all of the myofibrils in the cell. (B) Electron micrograph showing
a cross section of a T tubule. Note the position of the large Ca2+-release channels
in the sarcoplasmic reticulum membrane that connect to the adjacent T-tubule
membrane. (C) Schematic diagram showing how a Ca2+-release channel in
the sarcoplasmic reticulum membrane is thought to be opened by the activation of a
voltage-gated Ca2+ channel (Movie 16.5). (B, courtesy of Clara Franzini-Armstrong.)


The Ca2+-dependence of vertebrate skeletal muscle contraction, and hence
its dependence on commands transmitted via nerves, is due entirely to a set of
specialized accessory proteins that are closely associated with the actin thin filaments.
One of these accessory proteins is a muscle form of tropomyosin, the elongated
protein that binds along the groove of the actin filament helix. The other is
troponin, a complex of three polypeptides, troponins T, I, and C (named for their
tropomyosin-binding, inhibitory, and Ca2+-binding activities, respectively).
Troponin I binds to actin as well as to troponin T. In a resting muscle, the troponin I-T
complex pulls the tropomyosin out of its normal binding groove into a position
along the actin filament that interferes with the binding of myosin heads, thereby
preventing any force-generating interaction. When the level of Ca2+ is raised,
troponin C (which binds up to four molecules of Ca2+) causes troponin I to release
its hold on actin. This allows the tropomyosin molecules to slip back into their
normal position so that the myosin heads can walk along the actin filaments
(Figure 16-36). Troponin C is closely related to the ubiquitous Ca2+-binding protein
calmodulin (see Figure 15-33); it can be thought of as a specialized form of
calmodulin that has acquired binding sites for troponin I and troponin T, thereby
ensuring that the myofibril responds extremely rapidly to an increase in Ca2+
concentration.

In smooth muscle cells, so called because they lack the regular striations of
skeletal muscle, contraction is also triggered by an influx of calcium ions, but the
regulatory mechanism is different. Smooth muscle forms the contractile portion
of the stomach, intestine, and uterus, as well as the walls of arteries and many
other structures requiring slow and sustained contractions. Smooth muscle is
composed of sheets of highly elongated spindle-shaped cells, each with a single
nucleus. Smooth muscle cells do not express the troponins. Instead, elevated
intracellular Ca2+ levels regulate contraction by a mechanism that depends on
calmodulin (Figure 16-37). Ca2+-bound calmodulin activates myosin light-chain
kinase (MLCK), thereby inducing the phosphorylation of smooth muscle myosin
on one of its two light chains. When the light chain is phosphorylated, the myosin
head can interact with actin filaments and cause contraction; when it is
dephosphorylated, the myosin head tends to dissociate from actin and becomes inactive.

Figure 16-36 The control of skeletal muscle contraction by troponin. (A) A skeletal-muscle-cell thin filament, showing the
positions of tropomyosin and troponin along the actin filament. Each tropomyosin molecule has seven evenly spaced regions
with similar amino acid sequences, each of which is thought to bind to an actin subunit in the filament. (B) Reconstructed
cryoelectron microscopy image of an actin filament showing the relative position of a superimposed tropomyosin strand in the
presence (dark purple) or absence (light purple) of calcium. (A, adapted from G.N. Phillips, J.P. Fillers and C. Cohen, J. Mol. Biol.
192:111-131, 1986. With permission from Academic Press; B, adapted from C. Xu et al., Biophys. J. 77:985-992, 1999. With
permission from Elsevier.)

The phosphorylation events that regulate contraction in smooth muscle cells
occur relatively slowly, so that maximum contraction often requires nearly a
second (compared with the few milliseconds required for contraction of a skeletal
muscle cell). But rapid activation of contraction is not important in smooth
muscle: its myosin II hydrolyzes ATP about 10 times more slowly than skeletal
muscle myosin, producing a slow cycle of myosin conformational changes that
results in slow contraction.

Figure 16-37 Smooth muscle contraction. (A) Upon muscle stimulation by activation of cell-surface receptors,
Ca2+ released into the cytoplasm from the sarcoplasmic reticulum (SR) binds to calmodulin (see Figure 15-29).
Ca2+-bound calmodulin then binds myosin light-chain kinase (MLCK), which phosphorylates myosin light chain,
stimulating myosin activity. Non-muscle myosin is regulated by the same mechanism (see Figure 16-39).
(B) Smooth muscle cells in a cross section of cat intestinal wall. The outer layer of smooth muscle is oriented with
the long axis of its cells extending parallel along the length of the intestine, and upon contraction will shorten the
intestine. The inner layer is oriented circularly around the intestine and when contracted will cause the intestine to
become narrower. Contraction of both layers squeezes material through the intestine, much like squeezing
toothpaste out of a tube. (C) A model for the contractile apparatus in a smooth muscle cell, with bundles of
contractile filaments containing actin and myosin (red) oriented obliquely to the long axis of the cell. Their
contraction greatly shortens the cell. Only a few of the many bundles are shown. (B, courtesy of Gwen V. Childs.)

Heart Muscle Is a Precisely Engineered Machine 

The heart is the most heavily worked muscle in the body, contracting about 3 billion
(3 × 10^9) times during the course of a human lifetime (Movie 16.6). Heart cells
express several specific isoforms of cardiac muscle myosin and cardiac muscle
actin. Even subtle changes in these cardiac-specific contractile proteins (changes
that would not cause any noticeable consequences in other tissues) can cause
serious heart disease (Figure 16-38).
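
The figure of about 3 billion contractions can be reproduced with a
back-of-the-envelope estimate. The resting rate of 70 beats per minute and the
80-year lifespan used below are illustrative assumptions, not values from the
text, and the function name is hypothetical.

```python
def lifetime_heartbeats(beats_per_minute: int, lifespan_years: int) -> int:
    """Rough count of heartbeats over a lifetime at a constant average rate."""
    minutes_per_year = 60 * 24 * 365  # ignores leap days
    return beats_per_minute * minutes_per_year * lifespan_years


# Assumed values: 70 beats/min average, 80-year lifespan.
beats = lifetime_heartbeats(beats_per_minute=70, lifespan_years=80)
print(f"Estimated lifetime heartbeats: {beats:.1e}")
```

With these assumed inputs the estimate lands just under 3 × 10^9, in line with
the number quoted above.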

The normal cardiac contractile apparatus is such a highly tuned machine
that a tiny abnormality anywhere in the works can be enough to gradually wear
it down over years of repetitive motion. Familial hypertrophic cardiomyopathy is
a common cause of sudden death in young athletes. It is a genetically dominant
inherited condition that affects about two out of every thousand people, and it is
associated with heart enlargement, abnormally small coronary vessels, and
disturbances in heart rhythm (cardiac arrhythmias). The cause of this condition is
either any one of over 40 subtle point mutations in the genes encoding cardiac β
myosin heavy chain (almost all causing changes in or near the motor domain) or
one of about a dozen mutations in other genes encoding contractile proteins,
including myosin light chains, cardiac troponin, and tropomyosin. Minor missense
mutations in the cardiac actin gene cause another type of heart condition,
called dilated cardiomyopathy, which can also result in early heart failure.

Actin and Myosin Perform a Variety of Functions in Non-Muscle 
Cells 

Most non-muscle cells contain small amounts of contractile actin-myosin II bun- 
dles that form transiently under spécifie conditions and are much less well orga- 
nized than muscle fibers. Non-muscle contractile bundles are regulated by myo¬ 
sin phosphorylation rather than the troponin mechanism (Figure 16-39). These 
contractile bundles function to provide mechanical support to cells, for example, 
by assembling into cortical stress fibers that connect the cell to the extracellular 



Figure 16-38 Effect on the heart of a 
subtle mutation in cardiac myosin. Left, 
normal heart from a 6-day-old mouse 
pup. Right, heart from a pup with a point 
mutation in both copies of its cardiac 
myosin gene, changing Arg403 to Gin. 
The arrows indicate the atria. In the heart 
from the pup with the cardiac myosin 
mutation, both atria are greatly enlarged 
(hypertrophie), and the mice die within a 
few weeks of birth. (From D. Fatkin et al., 
J. Clin. Invest. 103:147-153, 1999. With 
permission from The American Society for 
Clinical Investigation.) 
Figure 16-39 Light-chain phosphorylation and the regulation of the assembly of myosin II into thick filaments. (A) The controlled
phosphorylation by the enzyme myosin light-chain kinase (MLCK) of one of the two light chains (the so-called regulatory light chain, shown in light
blue) on non-muscle myosin II in a test tube has at least two effects: it causes a change in the conformation of the myosin head, exposing its
actin-binding site, and it releases the myosin tail from a “sticky patch” on the myosin head, thereby allowing the myosin molecules to assemble into short,
bipolar, thick filaments. Smooth muscle is regulated by the same mechanism (see Figure 16-37). (B) Electron micrograph of negatively stained short
filaments of myosin II that have been induced to assemble in a test tube by phosphorylation of their light chains. These myosin II filaments are much
smaller than those found in skeletal muscle cells (see Figure 16-27). (B, courtesy of John Kendrick-Jones.)
matrix through focal adhesions or by forming a circumferential belt in an epithelial
cell, connecting it to adjacent cells through adherens junctions (discussed in
Chapter 19). As described in Chapter 17, actin and myosin II in the contractile 
ring generate the force for cytokinesis, the final stage in cell division. Finally, as 
discussed later, contractile bundles also contribute to the adhesion and forward 
motion of migrating cells. 

Non-muscle cells also express a large family of other myosin proteins, which
have diverse structures and functions in the cell. Following the discovery of
conventional muscle myosin, a second member of the family was found in the
freshwater amoeba Acanthamoeba castellanii. This protein had a different tail structure
and seemed to function as a monomer, and so it was named myosin I (for
one-headed). Conventional muscle myosin was renamed myosin II (for two-headed).
Subsequently, many other myosin types were discovered. The heavy chains
generally start with a recognizable myosin motor domain at the N-terminus and
then diverge widely with a variety of C-terminal tail domains (Figure 16-40). The
myosin family includes a number of one-headed and two-headed varieties that
are about equally related to myosin I and myosin II, and the nomenclature now
reflects their approximate order of discovery (myosin III through at least myosin
XVIII). Sequence comparisons among diverse eukaryotes indicate that there are
at least 37 distinct myosin families in the superfamily. All of the myosins except
one move toward the plus end of an actin filament, although they do so at
different speeds. The exception is myosin VI, which moves toward the minus end. The
myosin tails (and the tails of motor proteins generally) have apparently diversified
during evolution to permit the proteins to bind other subunits and to interact with
different cargoes.

Some myosins are found only in plants, and some are found only in vertebrates.
Most, however, are found in all eukaryotes, suggesting that myosins arose early in
eukaryotic evolution. The human genome includes about 40 myosin genes. Nine
of the human myosins are expressed primarily or exclusively in the hair cells of
the inner ear, and mutations in five of them are known to cause hereditary
deafness. These extremely specialized myosins are important for the construction and
function of the complex and beautiful bundles of actin found in stereocilia that
project from the apical surface of these cells (see Figure 9-51); these cellular
protrusions tilt in response to sound and convert sound waves into electrical signals.

The functions of most of the myosins remain to be determined, but several are
well characterized. The myosin I proteins often contain either a second
actin-binding site or a membrane-binding site in their tails, and they are generally involved
in intracellular organization, including endocytosis and the protrusion of
actin-rich structures at the cell surface, such as microvilli (see Panel 16-1 and
Figure 16-4). Myosin V is a two-headed myosin with a large step size (Figure 16-41A)
and is involved in organelle transport along actin filaments. In contrast to
myosin II motors, which work in ensembles and are attached only transiently to actin
filaments so as not to interfere with one another, myosin V moves continuously,


Figure 16-40 Myosin superfamily 
members. Comparison of the domain 
structure of the heavy chains of some 
myosin types. All myosins share similar
motor domains (shown in dark green),
but their C-terminal tails (light green) and 
N-terminal extensions (light blue) are very 
diverse. On the right are depictions of 
the molecular structure for these family 
members. Many myosins form dimers, with 
two motor domains per molecule, but a
few (such as I, III, and XIV) seem to function 
as monomers, with just one motor domain. 
Myosin VI, despite its overall structural 
similarity to other family members, is unique 
in moving toward the minus end (instead of 
the plus end) of an actin filament. The small 
insertion within its motor head domain, 
not found in other myosins, is probably 
responsible for this change in direction. 
or processively, along actin filaments without letting go. Myosin V functions are
well studied in the yeast Saccharomyces cerevisiae, which undergoes a
stereotypical pattern of growth and division called budding. Actin cables in the mother
cell point toward the bud, where actin is found in patches that concentrate where
cell wall growth is taking place. Myosin V motors carry a wide range of cargoes,
including mRNA, endoplasmic reticulum, and secretory vesicles, along the actin
cables and into the bud. In addition, myosin V mediates the correct partitioning of
organelles such as peroxisomes and mitochondria between mother and daughter
cells (see Figure 16-41B).

Summary 

Using their neck domain as a lever arm, myosins convert ATP hydrolysis into
mechanical work to move along actin filaments in a stepwise fashion. Skeletal
muscle is made up of myofibrils containing thousands of sarcomeres assembled
from highly ordered arrays of actin and myosin II filaments, together with many
accessory proteins. Muscle contraction is stimulated by calcium, which causes the
actin-filament-associated protein tropomyosin to move, uncovering myosin
binding sites and allowing the filaments to slide past one another. Smooth muscle and
non-muscle cells have less well-ordered contractile bundles of actin and myosin,
which are regulated by myosin light-chain phosphorylation. Myosin V transports
cargo by walking along actin filaments.

MICROTUBULES 

Microtubules are structurally more complex than actin filaments, but they are
also highly dynamic and play comparably diverse and important roles in the cell.
Microtubules are polymers of the protein tubulin. The tubulin subunit is itself a
heterodimer formed from two closely related globular proteins called α-tubulin
and β-tubulin, each comprising 445-450 amino acids, which are tightly bound
together by noncovalent bonds (Figure 16-42A). These two tubulin proteins are
found only in this heterodimer, and each α or β monomer has a binding site for
one molecule of GTP. The GTP that is bound to α-tubulin is physically trapped at
the dimer interface and is never hydrolyzed or exchanged; it can therefore be
considered to be an integral part of the tubulin heterodimer structure. The nucleotide
on the β-tubulin, in contrast, may be in either the GTP or the GDP form and is
exchangeable within the soluble (unpolymerized) tubulin dimer.

Tubulin is found in all eukaryotic cells, and it exists in multiple isoforms. Yeast
and human tubulins are 75% identical in amino acid sequence. In mammals,
there are at least six forms of α-tubulin and a similar number of β-tubulins, each
encoded by a different gene. The different forms of tubulin are very similar, and
they generally copolymerize into mixed microtubules in the test tube. However,
they can have distinct locations in cells and tissues and perform subtly different
functions. As a striking example, mutations in a particular human β-tubulin gene
give rise to a paralytic eye-movement disorder due to loss of ocular nerve
function. Numerous human neurological diseases have been linked to specific
mutations in different tubulin genes.


Figure 16-41 Myosin V carries cargo 
along actin filaments. (A) The lever arm of 
myosin V is long, allowing it to take a bigger 
step along an actin filament than myosin II 
(see Figure 16-29). (B) Myosin V transports 
cargo and organelles along actin cables, in 
this example moving a mitochondrion into 
the growing bud of a yeast cell. 
Microtubules Are Hollow Tubes Made of Protofilaments

A microtubule is a hollow cylindrical structure built from 13 parallel
protofilaments, each composed of αβ-tubulin heterodimers stacked head to tail and then
folded into a tube (Figure 16-42B-D). Microtubule assembly generates two new
types of protein-protein contacts. Along the longitudinal axis of the microtubule,
the “top” of one β-tubulin molecule forms an interface with the “bottom” of the
α-tubulin molecule in the adjacent heterodimer. This interface is very similar to
the interface holding the α and β monomers together in the dimer subunit, and
the binding energy is high. Perpendicular to these interactions, neighboring
protofilaments form lateral contacts. In this dimension, the main lateral contacts are
between monomers of the same type (α-α and β-β). As longitudinal and lateral
contacts are repeated during assembly, a slight stagger in lateral contacts gives
rise to the helical microtubule lattice. Because multiple contacts within the lattice
hold most of the subunits in a microtubule in place, the addition and loss of
subunits occurs almost exclusively at the microtubule ends (see Figure 16-5). These
multiple contacts among subunits make microtubules stiff and difficult to bend.
The persistence length of a microtubule is several millimeters, making
microtubules the stiffest and straightest structural elements found in most animal cells.

The subunits in each protofilament in a microtubule all point in the same
direction, and the protofilaments themselves are aligned in parallel (see Figure 




Figure 16-42 The structure of a microtubule and its subunit. (A) The subunit of each protofilament is a tubulin heterodimer, formed from a tightly
linked pair of α- and β-tubulin monomers. The GTP molecule in the α-tubulin monomer is so tightly bound that it can be considered an integral part
of the protein. The GTP molecule in the β-tubulin monomer, however, is less tightly bound and has an important role in filament dynamics. Both
nucleotides are shown in red. (B) One tubulin subunit (αβ-heterodimer) and one protofilament are shown schematically. Each protofilament consists
of many adjacent subunits with the same orientation. (C) The microtubule is a stiff hollow tube formed from 13 protofilaments aligned in parallel.
(D) A short segment of a microtubule viewed in an electron microscope. (E) Electron micrograph of a cross section of a microtubule showing a ring
of 13 distinct protofilaments. (D, courtesy of Richard Wade; E, courtesy of Richard Linck.)
Figure 16-43 The preferential growth of microtubules at the plus end.

Microtubules grow faster at one end than at the other. A stable bundle 
of microtubules obtained from the core of a cilium (called an axoneme) 
was incubated for a short time with tubulin subunits under polymerizing 
conditions. Microtubules grew fastest from the plus end of the microtubule 
bundle, the end at the top in this micrograph. (Courtesy of Gary Borisy.) 


16-42). Therefore, the microtubule lattice itself has a distinct structural polarity, 
with α-tubulins exposed at the minus end and β-tubulins exposed at the plus
end. As for actin filaments, the regular, parallel orientation of their subunits gives 
microtubules structural and dynamic polarity (Figure 16-43), with plus ends 
growing and shrinking more rapidly. 

Microtubules Undergo Dynamic Instability 

Microtubule dynamics, like those of actin filaments, are profoundly influenced
by the binding and hydrolysis of nucleotide (GTP in this case). GTP hydrolysis
occurs only within the β-tubulin subunit of the tubulin dimer. It proceeds very
slowly in free tubulin subunits but is accelerated when they are incorporated into
microtubules. Following GTP hydrolysis, the free phosphate group is released and
the GDP remains bound to β-tubulin within the microtubule lattice. Thus, as in
the case of actin filaments, two different types of microtubule structures can exist,
one with the “T form” of the nucleotide bound (GTP) and one with the “D form”
bound (GDP). The energy of nucleotide hydrolysis is stored as elastic strain in
the polymer lattice, making the free-energy change for dissociation of a subunit
from the D-form polymer more negative than the free-energy change for
dissociation of a subunit from the T-form polymer. In consequence, the ratio of k_off/k_on
for GDP-tubulin (its critical concentration [C_c(D)]) is much higher than that of
GTP-tubulin. Thus, under physiological conditions, GTP-tubulin tends to
polymerize and GDP-tubulin to depolymerize.
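
The relationship between the rate constants and the critical concentration can be written out explicitly. The following is a minimal sketch, assuming simple one-step subunit addition and loss at a single filament end; the symbols n (number of subunits in the polymer) and C (free subunit concentration) are introduced here for illustration only.

```latex
\begin{align*}
\frac{dn}{dt} &= k_{\text{on}}\,C - k_{\text{off}}
  && \text{net rate of subunit gain at one end} \\
\frac{dn}{dt} = 0 \;&\Longrightarrow\; C_c = \frac{k_{\text{off}}}{k_{\text{on}}}
  && \text{critical concentration: growth balances loss} \\
C_c(\mathrm{D}) &\gg C_c(\mathrm{T})
  && \text{GDP-tubulin dissociates far more readily than GTP-tubulin}
\end{align*}
```

In this picture an end grows when C exceeds its critical concentration and shrinks when C falls below it, which is why the T form tends to polymerize and the D form to depolymerize at the same free tubulin concentration.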

Whether the tubulin subunits at the very end of a microtubule are in the T or
the D form depends on the relative rates of GTP hydrolysis and tubulin addition.
If the rate of subunit addition is high, and thus the filament is growing rapidly,
then it is likely that a new subunit will be added to the polymer before the
nucleotide in the previously added subunit has been hydrolyzed. In this case, the tip
of the polymer remains in the T form, forming a GTP cap. However, if the rate of
subunit addition is low, hydrolysis may occur before the next subunit is added,
and the tip of the filament will then be in the D form. If GTP-tubulin subunits
assemble at the end of the microtubule at a rate similar to the rate of GTP
hydrolysis, then hydrolysis will sometimes “catch up” with the rate of subunit addition
and transform the end to a D form. This transformation is sudden and random,
with a certain probability per unit time that depends on the concentration of free
GTP-tubulin subunits.
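
The race between subunit addition and GTP hydrolysis can be illustrated with a toy stochastic model. The sketch below is not a published model: it assumes the cap is a simple counter that gains one unit per subunit addition and loses one per hydrolysis event, and the function names and rate values (k_add, k_hyd, cap0) are hypothetical, in arbitrary units.

```python
import random

def time_to_cap_loss(k_add, k_hyd, cap0=5, max_events=2000, rng=None):
    """Return the time at which a toy GTP cap first reaches zero, or None.

    Assumption: each event is either a subunit addition (cap + 1) or a
    hydrolysis event (cap - 1), chosen in proportion to the two rates.
    """
    rng = rng or random.Random()
    cap, t = cap0, 0.0
    p_add = k_add / (k_add + k_hyd)
    for _ in range(max_events):
        t += rng.expovariate(k_add + k_hyd)   # waiting time to the next event
        cap += 1 if rng.random() < p_add else -1
        if cap == 0:
            return t                          # cap lost: end is now D form
    return None                               # cap survived the whole run

def fraction_cap_lost(k_add, k_hyd, trials=500, seed=1):
    """Fraction of simulated ends that lose their cap within the run."""
    rng = random.Random(seed)
    lost = sum(time_to_cap_loss(k_add, k_hyd, rng=rng) is not None
               for _ in range(trials))
    return lost / trials

slow_addition = fraction_cap_lost(k_add=0.5, k_hyd=1.0)  # hydrolysis outpaces addition
fast_addition = fraction_cap_lost(k_add=5.0, k_hyd=1.0)  # addition outpaces hydrolysis
print(slow_addition, fast_addition)
```

Because the addition rate stands in for the free GTP-tubulin concentration, raising k_add makes cap loss rarer, which is the qualitative dependence described above.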

Suppose that the concentration of free tubulin is intermediate between the
critical concentration for a T-form end and the critical concentration for a D-form
end (that is, above the concentration necessary for T-form assembly, but below
that for the D form). Now, any end that happens to be in the T form will grow,
whereas any end that happens to be in the D form will shrink. On a single
microtubule, an end might grow for a certain length of time in a T form, but then suddenly
change to the D form and begin to shrink rapidly, even while the free subunit
concentration is held constant. At some later time, it might then regain a T-form
end and begin to grow again. This rapid interconversion between a growing and
shrinking state, at a uniform free subunit concentration, is called dynamic
instability (Figure 16-44A and Figure 16-45; see Panel 16-2). The change from growth
to shrinkage is called a catastrophe, while the change to growth is called a rescue.
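
The alternation between growth and shrinkage can be sketched as a two-state random process. The code below is a minimal illustration, not a fit to data: the step sizes and the per-step catastrophe and rescue probabilities are arbitrary assumed values, held constant to mimic a fixed free subunit concentration.

```python
import random

def simulate_dynamic_instability(steps=2000, growth=0.2, shrink=0.5,
                                 p_catastrophe=0.01, p_rescue=0.05, seed=0):
    """Two-state sketch of one microtubule end (all values are assumptions).

    Returns the length after each step plus the catastrophe and rescue counts.
    """
    rng = random.Random(seed)
    length, growing = 0.0, True
    lengths, catastrophes, rescues = [], 0, 0
    for _ in range(steps):
        if growing:
            length += growth                  # T-form end: slow, steady growth
            if rng.random() < p_catastrophe:
                growing = False               # cap lost: catastrophe
                catastrophes += 1
        else:
            length -= shrink                  # D-form end: rapid shrinkage
            if length <= 0.0:
                length, growing = 0.0, True   # fully depolymerized: regrow
            elif rng.random() < p_rescue:
                growing = True                # cap regained: rescue
                rescues += 1
        lengths.append(length)
    return lengths, catastrophes, rescues

lengths, n_catastrophes, n_rescues = simulate_dynamic_instability()
print(n_catastrophes, n_rescues, max(lengths))
```

Plotting the returned lengths against step number gives the characteristic sawtooth trace: long stretches of slow growth interrupted by abrupt collapses, with no change in the underlying parameters.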

In a population of microtubules, at any instant some of the ends are in the T 
form and some are in the D form, with the ratio depending on the hydrolysis rate 
and the free subunit concentration. In vitro, the structural difference between a
T-form end and a D-form end is dramatic. Tubulin subunits with GTP bound to 
Figure 16-44 Dynamic instability due to the structural differences between a growing and a shrinking microtubule
end. (A) If the free tubulin concentration in solution is between the critical concentrations of the GTP- and GDP-bound forms,
a single microtubule end may undergo transitions between a growing state and a shrinking state. A growing microtubule has
GTP-containing subunits at its end, forming a GTP cap. If nucleotide hydrolysis proceeds more rapidly than subunit addition, the
cap is lost and the microtubule begins to shrink, an event called a “catastrophe.” But GTP-containing subunits may still add to
the shrinking end, and if enough add to form a new cap, then microtubule growth resumes, an event called “rescue.” (B) Model
for the structural consequences of GTP hydrolysis in the microtubule lattice. The addition of GTP-containing tubulin subunits to
the end of a protofilament causes the end to grow in a linear conformation that can readily pack into the cylindrical wall of the
microtubule. Hydrolysis of GTP after assembly changes the conformation of the subunits and tends to force the protofilament
into a curved shape that is less able to pack into the microtubule wall. (C) In an intact microtubule, protofilaments made from
GDP-containing subunits are forced into a linear conformation by the many lateral bonds within the microtubule wall, given a
stable cap of GTP-containing subunits. Loss of the GTP cap, however, allows the GDP-containing protofilaments to relax into
their more curved conformation. This leads to a progressive disruption of the microtubule. Above the drawings of a growing
and a shrinking microtubule, electron micrographs show actual microtubules in each of these two states. Note particularly the
curling, disintegrating GDP-containing protofilaments at the end of the shrinking microtubule. (C, from E.M. Mandelkow,
E. Mandelkow and R.A. Milligan, J. Cell Biol. 114:977-991, 1991. With permission from The Rockefeller University Press.)
Figure 16-45 Direct observation of the
dynamic instability of microtubules
in a living cell. Microtubules in a newt
lung epithelial cell were observed after
the cell was injected with a small amount 
of rhodamine-labeled tubulin. Notice 
the dynamic instability of microtubules 
at the edge of the cell. Four individual 
microtubules are highlighted for clarity; 
each of these shows alternating shrinkage 
and growth (Movie 16.7). (Courtesy 
of Wendy C. Salmon and Clare 
Waterman-Storer.) 


the β-monomer produce straight protofilaments that make strong and regular
lateral contacts with one another. But the hydrolysis of GTP to GDP is associated
with a subtle conformational change in the protein, which makes the
protofilaments curved (Figure 16-44B). On a rapidly growing microtubule, the GTP cap
is thought to constrain the curvature of the protofilaments, and the ends appear
straight. But when the terminal subunits have hydrolyzed their nucleotides, this
constraint is removed, and the curved protofilaments spring apart. This
cooperative release of the energy of hydrolysis stored in the microtubule lattice causes
the curled protofilaments to peel off rapidly, and curved oligomers of
GDP-containing tubulin are seen near the ends of depolymerizing microtubules (Figure
16-44C).

Microtubule Functions Are Inhibited by Both Polymer-stabilizing 
and Polymer-destabilizing Drugs 

Chemical compounds that impair polymerization or depolymerization of
microtubules are powerful tools for investigating the roles of these polymers in cells.
Whereas colchicine and nocodazole interact with tubulin subunits and lead to
microtubule depolymerization, Taxol binds to and stabilizes microtubules,
causing a net increase in tubulin polymerization (see Table 16-1). Drugs like these
have a rapid and profound effect on the organization of the microtubules in living
cells. Both microtubule-depolymerizing drugs (such as nocodazole) and
microtubule-polymerizing drugs (such as Taxol) preferentially kill dividing cells, since
microtubule dynamics are crucial for correct function of the mitotic spindle
(discussed in Chapter 17). Some of these drugs efficiently kill certain types of tumor
cells in a human patient, although not without toxicity to rapidly dividing normal
cells, including those in the bone marrow, intestine, and hair follicles. Taxol in
particular has been widely used to treat cancers of the breast and lung, and it is
frequently successful in treatment of tumors that are resistant to other
chemotherapeutic agents.

A Protein Complex Containing γ-Tubulin Nucleates Microtubules

Because formation of a microtubule requires the interaction of many tubulin
heterodimers, the concentration of tubulin subunits required for spontaneous
nucleation of microtubules is very high. Microtubule nucleation therefore requires
help from other factors. While α- and β-tubulins are the regular building blocks of
microtubules, another type of tubulin, called γ-tubulin, is present in much smaller
amounts than α- and β-tubulin and is involved in the nucleation of microtubule
growth in organisms ranging from yeasts to humans. Microtubules are generally
nucleated from a specific intracellular location known as a
microtubule-organizing center (MTOC) where γ-tubulin is most enriched. Nucleation in many cases
depends on the γ-tubulin ring complex (γ-TuRC). Within this complex, two
accessory proteins bind directly to the γ-tubulin, along with several other proteins
that help create a spiral ring of γ-tubulin molecules, which serves as a template
that creates a microtubule with 13 protofilaments (Figure 16-46).
Figure 16-46 Microtubule nucleation by the γ-tubulin ring complex. (A) Two copies of γ-tubulin associate with a pair
of accessory proteins to form the γ-tubulin small complex (γ-TuSC). This image was generated by high-resolution electron
microscopy of individual purified complexes. (B) Seven copies of the γ-TuSC associate to form a spiral structure in which the last
γ-tubulin lies beneath the first, resulting in 13 exposed γ-tubulin subunits in a circular orientation that matches the orientation of
the 13 protofilaments in a microtubule. (C) In many cell types, the γ-TuSC spiral associates with additional accessory proteins
to form the γ-tubulin ring complex (γ-TuRC), which is likely to nucleate the minus end of a microtubule as shown here. Note
the longitudinal discontinuity between two protofilaments, which results from the spiral orientation of the γ-tubulin subunits.
Microtubules often have one such “seam” breaking the otherwise uniform helical packing of the protofilaments. (A and B, from
J.M. Kollman et al., Nature 466:879-883, 2010. With permission from Macmillan Publishers Ltd.)


Microtubules Emanate from the Centrosome in Animal Cells 

Many animal cells have a single, well-defined MTOC called the centrosome,
which is located near the nucleus and from which microtubules are nucleated
at their minus ends, so the plus ends point outward and continuously grow and
shrink, probing the entire three-dimensional volume of the cell. A centrosome
typically recruits more than fifty copies of γ-TuRC. In addition, γ-TuRC molecules
are found in the cytoplasm, and centrosomes are not absolutely required for
microtubule nucleation, since destroying them with a laser pulse does not
prevent microtubule nucleation elsewhere in the cell. A variety of proteins have been
identified that anchor γ-TuRC to the centrosome, but mechanisms that activate
microtubule nucleation at MTOCs and at other sites in the cell are poorly
understood.

Embedded in the centrosome are the centrioles, a pair of cylindrical
structures arranged at right angles to each other in an L-shaped configuration (Figure
16-47). A centriole consists of a cylindrical array of short, modified microtubules
arranged into a barrel shape with striking ninefold symmetry (Figure 16-48).
Together with a large number of accessory proteins, the centrioles organize the
pericentriolar material, where microtubule nucleation takes place. As described
in Chapter 17, the centrosome duplicates and splits into two parts before mitosis,
each containing a duplicated centriole pair. The two centrosomes move to
opposite sides of the nucleus when mitosis begins, and they form the two poles of the
mitotic spindle (see Panel 17-1).

Microtubule organization varies widely among different species and cell types. 
In budding yeast, microtubules are nucleated at an MTOC that is embedded in 
the nuclear envelope as a small, multilayered structure called the spindle pole
body, also found in other fungi and diatoms. Higher-plant cells appear to nucleate
microtubules at sites distributed all around the nuclear envelope and at the cell
cortex. Neither fungi nor most plant cells contain centrioles. Despite these
differences, all these cells seem to use γ-tubulin to nucleate their microtubules.

In cultured animal cells, the aster-like configuration of microtubules is robust,
with dynamic plus ends pointing outward toward the cell periphery and stable
minus ends collected near the nucleus. The system of microtubules radiating from
Figure 16-47 The centrosome. (A) The centrosome is the major MTOC of animal cells. Located in the cytoplasm next to the
nucleus, it consists of an amorphous matrix of fibrous proteins to which the γ-tubulin ring complexes that nucleate microtubule
growth are attached. This matrix is organized by a pair of centrioles, as described in the text. (B) A centrosome with attached
microtubules. The minus end of each microtubule is embedded in the centrosome, having grown from a γ-tubulin ring complex,
whereas the plus end of each microtubule is free in the cytoplasm. (C) In a reconstructed image of the MTOC from a C. elegans
cell, a dense thicket of microtubules can be seen emanating from the centrosome. (C, from E.T. O’Toole et al., J. Cell Biol.
163:451-456, 2003. With permission from the authors.)


the centrosome acts as a device to survey the outlying regions of the cell and to
position the centrosome at its center. Even in an isolated cell fragment lacking the
centrosome, dynamic microtubules arrange themselves into a star-shaped array
with the microtubule minus ends clustered at the center by minus-end-binding
proteins (Figure 16-49). This ability of the microtubule cytoskeleton to find the
center of the cell establishes a general coordinate system, which is then used to
position many organelles within the cell. Highly differentiated cells with complex
morphologies such as neurons, muscles, and epithelial cells must use additional
measuring mechanisms to establish their more elaborate internal coordinate
systems. Thus, for example, when an epithelial cell forms cell-cell junctions and
becomes highly polarized, the microtubule minus ends move to a region near the



Figure 16-48 A pair of centrioles in the 
centrosome. (A) An electron micrograph
of a thin section of an isolated centrosome 
showing the mother centriole with its 
distal appendages and the adjacent 
daughter centriole, which formed through 
a duplication event during S phase (see 
Figure 17-26). In the centrosome, the 
centriole pair is surrounded by a dense 
matrix of pericentriolar material from which 
microtubules nucleate. Centrioles also 
function as basal bodies to nucleate the 
formation of ciliary axonemes (see Figure 
16-68). (B) Electron micrograph of a cross 
section through a centriole in the cortex of 
a protozoan. Each centriole is composed 
of nine sets of triplet microtubules 
arranged to form a cylinder. (C) Each triplet 
contains one complete microtubule (the
A microtubule) fused to two incomplete
microtubules (the B and C microtubules).
(D) The centriolar protein SAS-6 forms a
coiled-coil dimer. Nine SAS-6 dimers can 
self-associate to form a ring. Located at 
the hub of the centriole cartwheel structure, 
the SAS-6 ring is thought to generate the 
ninefold symmetry of the centriole. (A, from
M. Paintrand et al., J. Struct. Biol.
108:107, 1992. With permission from
Elsevier; B, courtesy of Richard Linck;
D, courtesy of Michel Steinmetz.)
apical plasma membrane. From this asymmetrical location, a microtubule array
extends along the long axis of the cell, with plus ends directed toward the basal 
surface (see Figure 16-4). 


Microtubule-Binding Proteins Modulate Filament Dynamics and 
Organization 

Microtubule polymerization dynamics are very different in cells than in solutions
of pure tubulin. Microtubules in cells exhibit a much higher polymerization rate
(typically 10-15 µm/min, relative to about 1.5 µm/min with purified tubulin at
similar concentrations), a greater catastrophe frequency, and extended pauses in
microtubule growth, a dynamic behavior rarely observed in pure tubulin
solutions. These and other differences arise because microtubule dynamics inside the
cell are governed by a variety of proteins that bind tubulin dimers or
microtubules, as summarized in Panel 16-4.

Proteins that bind to microtubules are collectively called
microtubule-associated proteins, or MAPs. Some MAPs can stabilize microtubules against
disassembly. A subset of MAPs can also mediate the interaction of microtubules with other
cell components. This subset is prominent in neurons, where stabilized
microtubule bundles form the core of the axons and dendrites that extend from the
cell body (Figure 16-50). These MAPs have at least one domain that binds to the
microtubule surface and another that projects outward. The length of the
projecting domain can determine how closely MAP-coated microtubules pack together,
as demonstrated in cells engineered to overproduce different MAPs. Cells
overexpressing MAP2, which has a long projecting domain, form bundles of stable
microtubules that are kept widely spaced, while cells overexpressing tau, a MAP
with a much shorter projecting domain, form bundles of more closely packed
microtubules (Figure 16-51). MAPs are the targets of several protein kinases, and
phosphorylation of a MAP can control both its activity and localization inside
cells.

Microtubule Plus-End-Binding Proteins Modulate Microtubule 
Dynamics and Attachments 

Cells contain numerous proteins that bind the ends of microtubules and thereby 
influence microtubule stability and dynamics. These proteins can influence the 

Figure 16-50 Localization of MAPs in the axon and dendrites of a 
neuron. This immunofluorescence micrograph shows the distribution of 
the proteins tau (green) and MAP2 (orange) in a hippocampal neuron in 
culture. Whereas tau staining is confined to the axon (long and branched in 
this neuron), MAP2 staining is confined to the cell body and its dendrites. 

The antibody used here to detect tau binds only to unphosphorylated tau; 
phosphorylated tau is also present in dendrites. (Courtesy of James
W. Mandell and Gary A. Banker.) 


Figure 16-49 A microtubule array can 
find the center of a cell. After the arm of a
fish pigment cell is cut off with a needle, the
microtubules in the detached cell fragment 
reorganize so that their minus ends end up 
near the center of the fragment, buried in a 
new microtubule-organizing center. 
Some of the major accessory proteins of the microtubule cytoskeleton. Except for two classes of motor proteins,
an example of each major type is shown. Each of these is discussed in the text. However, most cells contain more 
than a hundred different microtubule-binding proteins, and — as for the actin-associated proteins — it is likely 
that there are important types of microtubule-associated proteins that are not yet recognized. 
Figure 16-51 Organization of
microtubule bundles by MAPs. (A) MAP2 
binds along the microtubule lattice at one 
of its ends and extends a long projecting
arm with a second microtubule-binding 
domain at the other end. (B) Tau possesses 
a shorter microtubule cross-linking domain. 

(C) Electron micrograph showing a cross 
section through a microtubule bundle in 
a cell overexpressing MAP2. The regular 
spacing of the microtubules (MTs) in this 
bundle results from the constant length of 
the projecting arms of the MAP2. 

(D) Similar cross section through a 
microtubule bundle in a cell overexpressing 
tau. Here the microtubules are spaced 
more closely together than they are in (C) 
because of tau’s relatively short projecting 
arm. (C and D, courtesy of J. Chen et
al., Nature 360:674-677, 1992. With
permission from Macmillan Publishers Ltd.) 


rate at which a microtubule switches from a growing to a shrinking state (the 
frequency of catastrophes) or from a shrinking to a growing state (the frequency of 
rescues). For example, members of a family of kinesin-related proteins known as 
catastrophe factors (or kinesin-13) bind to microtubule ends and appear to pry 
protofilaments apart, lowering the normal activation-energy barrier that prevents 
a microtubule from springing apart into the curved protofilaments that are 
characteristic of the shrinking state (Figure 16-52). Another protein, called Nezha or 
Patronin, protects microtubule minus ends from the effects of catastrophe factors. 
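
The interplay of catastrophe and rescue frequencies can be made concrete with a simple two-state model of dynamic instability. The sketch below is illustrative only: the function name and all numerical values are hypothetical assumptions, not measurements from this chapter. It assumes that a microtubule end switches between growing and shrinking states at constant frequencies and computes its long-run average velocity.

```python
def mean_growth_velocity(v_grow, v_shrink, f_cat, f_res):
    """Long-run average velocity of a microtubule end in a two-state model.

    v_grow   -- growth speed while in the growing state (um/min)
    v_shrink -- shrinkage speed while in the shrinking state (um/min, positive)
    f_cat    -- catastrophe frequency: growing -> shrinking (events/min)
    f_res    -- rescue frequency: shrinking -> growing (events/min)
    """
    # With constant switching frequencies, the mean time spent growing is
    # 1/f_cat and the mean time spent shrinking is 1/f_res, so the fraction
    # of time spent growing is f_res / (f_cat + f_res).
    fraction_growing = f_res / (f_cat + f_res)
    fraction_shrinking = f_cat / (f_cat + f_res)
    return v_grow * fraction_growing - v_shrink * fraction_shrinking


# Hypothetical values: a fivefold rise in catastrophe frequency, as a
# catastrophe factor might produce, turns net growth into net shrinkage.
baseline = mean_growth_velocity(2.0, 10.0, f_cat=0.2, f_res=2.0)
with_factor = mean_growth_velocity(2.0, 10.0, f_cat=1.0, f_res=2.0)
print(baseline > 0, with_factor < 0)
```

In this simplified picture, a protein that raises the catastrophe frequency or lowers the rescue frequency shifts the average toward shrinkage without changing the growth or shrinkage speeds themselves.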

While very few microtubule minus-end-binding proteins have been 
characterized, a large subset of MAPs has been identified that are enriched at 
microtubule plus ends. A particularly ubiquitous example is XMAP215, which has close 
homologs in organisms that range from yeast to humans. XMAP215 binds free 
tubulin subunits and delivers them to the plus end, thereby promoting 
microtubule polymerization and simultaneously counteracting catastrophe factor 
activity (see Figure 16-52). The phosphorylation of XMAP215 during mitosis inhibits 
Figure 16-52 The effects of proteins 
that bind to microtubule ends. The 
transition between microtubule growth 
and shrinkage is controlled in cells by a 
variety of proteins. Catastrophe factors 
such as kinesin-13, a member of the 
kinesin motor protein superfamily, bind 
to microtubule ends and pry them apart, 
thereby promoting depolymerization. On 
the other hand, a MAP such as XMAP215 
stabilizes the end of a growing microtubule 
(XMAP stands for Xenopus microtubule-associated 
protein, and the number refers 
to its molecular mass in kilodaltons). 
XMAP215 binds tubulin dimers and delivers 
them to the microtubule plus end, thereby 
increasing the microtubule growth rate and 
suppressing catastrophes. 
its activity and shifts the balance of its competition with catastrophe factors. This 
shift results in a tenfold increase in the dynamic instability of microtubules during 
mitosis, a transition that is critical for the efficient construction of the mitotic 
spindle (discussed in Chapter 17). 

In many cells, the minus ends of microtubules are stabilized by association 
with a capping protein or the centrosome, or else they serve as microtubule 
depolymerization sites. The plus ends, in contrast, efficiently explore and probe the 
entire cell space. Microtubule-associated proteins called plus-end tracking 
proteins (+TIPs) accumulate at these active ends and appear to rocket around the cell 
as passengers at the ends of rapidly growing microtubules, dissociating from the 
ends when the microtubules begin to shrink (Figure 16-53). 

The kinesin-related catastrophe factors and XMAP215 mentioned above 
behave as +TIPs and act to modulate the growth and shrinkage of the microtubule 
end to which they are attached. Other +TIPs control microtubule positioning by 
helping to capture and stabilize the growing microtubule end at specific 
cellular targets, such as the cell cortex or the kinetochore of a mitotic chromosome. 
EB1 and its relatives, small dimeric proteins that are highly conserved in animals, 
plants, and fungi, are key players in this process. EB1 proteins do not actively 
move toward plus ends, but rather recognize a structural feature of the growing 
plus end (see Figure 16-53). Several of the +TIPs depend on EB1 proteins for their 
plus-end accumulation and also interact with each other and with the 
microtubule lattice. By attaching to the plus end, these factors allow the cell to harness the 
energy of microtubule polymerization to generate pushing forces that can be used 
for positioning the spindle, chromosomes, or organelles. 

Tubulin-Sequestering and Microtubule-Severing Proteins 
Destabilize Microtubules 

As it does with actin monomers, the cell sequesters unpolymerized tubulin 
subunits to maintain a pool of active subunits at a level near the critical 
concentration. One molecule of the small protein stathmin (also called Op18) binds to two 
tubulin heterodimers and prevents their addition to the ends of microtubules 
(Figure 16-54). Stathmin thus decreases the effective concentration of tubulin 
subunits that are available for polymerization (an action analogous to that of the 
drug colchicine), and enhances the likelihood that a growing microtubule will 
switch to the shrinking state. Phosphorylation of stathmin inhibits its binding to 
tubulin, and signals that cause stathmin phosphorylation can increase the rate 
of microtubule elongation and suppress dynamic instability. Stathmin has been 
implicated in the regulation of both cell proliferation and cell death. Interestingly, 
mice lacking stathmin develop normally but are less fearful than wild-type mice, 
reflecting a role for stathmin in neurons of the amygdala, where it is normally 
expressed at high levels. 
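
The effect of sequestration on the pool of polymerizable tubulin can be illustrated with a small bookkeeping calculation. The sketch below uses only the 1:2 stoichiometry stated above (one stathmin binds two tubulin heterodimers). It makes the simplifying, hypothetical assumption that every active (unphosphorylated) stathmin binds tubulin tightly, whereas real binding is an equilibrium, and the concentrations are invented for illustration.

```python
def free_tubulin(total_tubulin_uM, active_stathmin_uM):
    """Tubulin heterodimer concentration left free for polymerization (uM).

    Assumes each active (unphosphorylated) stathmin molecule sequesters
    two tubulin heterodimers, for as long as tubulin is available.
    """
    sequestered = min(total_tubulin_uM, 2.0 * active_stathmin_uM)
    return total_tubulin_uM - sequestered


# Hypothetical concentrations: 20 uM tubulin and 4 uM active stathmin.
print(free_tubulin(20.0, 4.0))   # 12.0
# Phosphorylating all of the stathmin releases the sequestered tubulin.
print(free_tubulin(20.0, 0.0))   # 20.0
```

Under these assumptions, phosphorylation of stathmin raises the free tubulin concentration, which is consistent with the faster elongation and reduced catastrophe described in the text.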

Severing is another mechanism employed by the cell to destabilize 
microtubules. To sever a microtubule, thirteen longitudinal bonds must be broken, one 
for each protofilament. The protein katanin, named after the Japanese word for 


Figure 16-53 +TIP proteins found at the 
growing plus ends of microtubules. 
(A) Frames from a fluorescence time-lapse 
movie of the edge of a cell expressing 
fluorescently labeled tubulin that incorporates 
into microtubules (red) as well as the +TIP 
protein EB1 tagged with a different color 
(green). The same microtubule is marked 
(asterisk) in successive movie frames. When 
the microtubule is growing (frames 1, 2), 
EB1 is associated with the tip. When the 
microtubule undergoes a catastrophe and 
begins shrinking, EB1 is lost (frames 3, 4). 
The labeled EB1 is regained when growth 
of the microtubule is rescued (frame 5). 
See Movie 16.8. (B) In the fission yeast 
Schizosaccharomyces pombe, the plus ends 
of the microtubules (green) are associated 
with the homolog of EB1 (red) at the two 
poles of the rod-shaped cells. (A, courtesy 
of Anna Akhmanova and Ilya Grigoriev; 
B, courtesy of Takeshi Toda.) 
Figure 16-54 Sequestration of tubulin by 
stathmin. Structural studies with electron 
microscopy and crystallography suggest 
that the elongated stathmin protein binds 
along the side of two tubulin heterodimers. 
(Adapted from M.O. Steinmetz et al., 
EMBO J. 19:572-580, 2000. With 
permission from John Wiley and Sons.) 
“sword,” accomplishes this demanding task (Figure 16-55). Katanin is made up 
of two subunits: a smaller subunit that hydrolyzes ATP and performs the actual 
severing, and a larger one that directs katanin to the centrosome. Katanin releases 
microtubules from their attachment to a microtubule-organizing center and is 
thought to contribute to the rapid microtubule depolymerization observed at the 
poles of spindles during mitosis. It may also be involved in microtubule release 
and depolymerization in proliferating cells in interphase and in postmitotic cells 
such as neurons. 

Two Types of Motor Proteins Move Along Microtubules 

Like actin filaments, microtubules also use motor proteins to transport cargo and 
perform a variety of other functions within the cell. There are two major classes 
of microtubule-based motors, kinesins and dyneins. Kinesin-1, also called 
“conventional kinesin,” was first purified from squid neurons, where it carries 
membrane-enclosed organelles away from the cell body toward the axon terminal by 
walking toward the plus end of microtubules. Kinesin-1 is similar to myosin II in 
having two heavy chains per active motor; these form two globular head motor 
domains that are held together by an elongated coiled-coil tail that is 
responsible for heavy-chain dimerization. One kinesin-1 light chain associates with each 
heavy chain through its tail domain and mediates cargo binding. Like myosin, 
kinesin is a member of a large protein superfamily, for which the motor domain is 
the common element (Figure 16-56). The yeast Saccharomyces cerevisiae has six 
distinct kinesins. The nematode C. elegans has 20 kinesins, and humans have 45. 

There are at least fourteen distinct families in the kinesin superfamily. Most of 
them have the motor domain at the N-terminus of the heavy chain and walk toward 
the plus end of the microtubule. One family has the motor domain at the 
C-terminus and walks in the opposite direction, toward the minus end of the microtubule, 
while kinesin-13 has a central motor domain and does not walk at all, but uses the 
energy of ATP hydrolysis to depolymerize microtubule ends, as described above 
(see Figure 16-52). Some kinesin heavy chains are homodimers, and others are 
heterodimers. Most kinesins have a binding site in the tail for another 
microtubule; alternatively, they may link the motor to a membrane-enclosed organelle 
via a light chain or an adaptor protein. Many of the kinesin superfamily members 
have specific roles in mitotic spindle formation and in chromosome segregation 
during cell division. 

In kinesin-1, instead of the rocking of a lever arm, small movements at the 
nucleotide-binding site regulate the docking and undocking of the motor head 
domain to a long linker region. This acts to throw the second head forward along 
Figure 16-55 Microtubule severing by 
katanin. Taxol-stabilized, rhodamine-labeled 
microtubules were adsorbed on the 
surface of a glass slide, and purified katanin 
was added along with ATP. (A) There are a 
few breaks in the microtubules 30 seconds 
after the addition of katanin. (B) The same 
field 3 minutes after the addition of katanin. 
The filaments have been severed in many 
places, leaving a series of small fragments 
at the previous locations of the long 
microtubules. (From J.J. Hartman et al., 
Cell 93:277-287, 1998. With permission 
from Elsevier.) 


Figure 16-56 Kinesin and kinesin- 
related proteins. Structures of four kinesin 
superfamily members. As in the myosin 
superfamily, only the motor domains are 
conserved. Kinesin-1 has the motor domain 
at the N-terminus of the heavy chain. The 
middle domain forms a long coiled-coil, 
mediating dimerization. The C-terminal 
domain forms a tail that attaches to cargo, 
such as a membrane-enclosed organelle. 
Kinesin-5 forms tetramers where two 
dimers associate by their tails. The bipolar 
kinesin-5 tetramer is able to slide two 
microtubules past each other, analogous 
to the activity of the bipolar thick filaments 
formed by myosin II. Kinesin-13 has its 
motor domain located in the middle of the 
heavy chain. It is a member of a family of 
kinesins that have lost typical motor activity 
and instead bind to microtubule ends to 
promote depolymerization (see Figure 
16-52). Kinesin-14 is a C-terminal kinesin 
that includes the Drosophila protein Ncd 
and the yeast protein Kar3. These kinesins 
generally travel in the opposite direction 
from the majority of kinesins, toward the 
minus end instead of the plus end of a 
microtubule. 
Figure 16-57 The mechanochemical cycle of kinesin. Kinesin-1 is a 
dimer of two nucleotide-binding motor domains (heads) that are connected 
through a long coiled-coil tail (see Figure 16-56). The two kinesin motor 
domains work in a coordinated manner; during a kinesin “step,” the rear head 
detaches from its tubulin binding site, passes the partner motor domain, 
and then rebinds to the next available tubulin binding site. Using this “hand- 
over-hand” motion, the kinesin dimer can move for long distances on the 
microtubule without completely letting go of its track. 

At the start of each step, one of the two kinesin motor domain heads, 
the rear or lagging head (dark green), is tightly bound to the microtubule and 
to ATP, while the front or leading head is loosely bound to the microtubule 
with ADP in its binding site. The forward displacement of the rear motor 
domain is driven by the dissociation of ADP and binding of ATP in the leading 
head (between panels 2 and 3 in this drawing). The binding of ATP to this 
motor domain causes a small peptide called the “neck linker” to shift from 
a rearward-pointing to a forward-pointing conformation (the neck linker is 
drawn here as a purple connecting line between the leading motor domain 
and the intertwined coiled-coil). This shift pulls the rear head forward, once 
it has detached from the microtubule with ADP bound [detachment requires 
ATP hydrolysis and phosphate (Pi) release]. The kinesin molecule is now 
poised for the next step, which proceeds by an exact repeat of the process 
shown (Movie 16.9). 


the protofilament to a binding site 8 nm closer to the microtubule plus end, which 
is the distance between tubulin dimers of a protofilament. The nucleotide-hydrolysis 
cycles in the two heads are closely coordinated, so that this cycle of linker 
docking and undocking allows the two-headed motor to move in a hand-over-hand 
(or head-over-head) stepwise manner (Figure 16-57). 
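
The geometry of hand-over-hand stepping can be traced with a few lines of bookkeeping. The sketch below is a toy model, not a description of the motor's kinetics: it assumes the two heads occupy adjacent binding sites 8 nm apart, and the function names are hypothetical. It shows that when the rear head swings past its partner to the next free site, the head itself travels 16 nm while the midpoint of the motor advances 8 nm per step.

```python
STEP_NM = 8  # spacing between tubulin dimers along a protofilament (from the text)


def hand_over_hand(n_steps, rear=0, front=STEP_NM):
    """Return successive (rear, front) head positions in nm along a protofilament."""
    positions = [(rear, front)]
    for _ in range(n_steps):
        # The rear head detaches, passes the bound partner head, and rebinds
        # one site beyond it: it travels 2 * STEP_NM and becomes the new front.
        rear, front = front, front + STEP_NM
        positions.append((rear, front))
    return positions


def centroid(heads):
    """Midpoint of the two heads, a stand-in for the position of the cargo."""
    return sum(heads) / 2


track = hand_over_hand(3)
print(track)                                    # [(0, 8), (8, 16), (16, 24), (24, 32)]
print(centroid(track[1]) - centroid(track[0]))  # 8.0
```

Because one head is always bound in this scheme, the dimer never leaves its track, which is the feature that lets a single kinesin move for long distances.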

The dyneins are a family of minus-end directed microtubule motors unrelated 
to the kinesins. They are composed of one, two, or three heavy chains (that include 
the motor domain) and a large and variable number of associated intermediate, 
light-intermediate, and light chains. The dynein family has two major branches 
(Figure 16-58). The first branch contains the cytoplasmic dyneins, which are 
homodimers of two heavy chains. Cytoplasmic dynein 1 is encoded by a single 
gene in almost all eukaryotic cells, but is missing from flowering plants and some 
algae. It is used for organelle and mRNA trafficking, for positioning the 
centrosome and nucleus during cell migration, and for construction of the microtubule 
spindle in mitosis and meiosis. Cytoplasmic dynein 2 is found only in eukaryotic 
organisms that have cilia and is used to transport material from the tip to the base 
of the cilia, a process called intraflagellar transport. Axonemal dyneins (also called 
ciliary dyneins) comprise the second branch and include monomers, 
heterodimers, and heterotrimers, with one, two, or three motor-containing heavy chains, 
respectively. They are highly specialized for the rapid and efficient sliding 
movements of microtubules that drive the beating of cilia and flagella (discussed later). 





Figure 16-58 Dyneins. (A) Freeze-etch 
electron micrographs of a molecule of 
cytoplasmic dynein and a molecule of 
ciliary (axonemal) dynein. Like myosin II 
and kinesin-1, cytoplasmic dynein is a 
two-headed molecule. The ciliary dynein 
shown is from a protozoan and has three 
heads; ciliary dynein from animals has two 
heads. Note that the dynein head is very 
large compared with the head of either 
myosin or kinesin. (B) Schematic depiction 
of cytoplasmic dynein showing the two 
heavy chains (blue and gray) that contain 
domains for microtubule (MT) binding and 
ATP hydrolysis, connected by a long stalk. 
Bound to the heavy chain are multiple 
intermediate chains (dark green) and light 
chains (light green) that help to mediate 
many of dynein’s functions. (A, courtesy of 
John Heuser; B, adapted from R. Vale, Cell 
112:467-480, 2003. With permission from 
Cell Press.) 
Figure 16-59 The power stroke of 
dynein. (A) The organization of the 
domains in each dynein heavy chain. This 
is a huge polypeptide, containing nearly 
4000 amino acids. The number of heavy 
chains in a dynein is equal to its number 
of motor heads. (B) Illustration of dynein 
c, a monomeric axonemal dynein found in 
the unicellular green alga Chlamydomonas 
reinhardtii. The large dynein motor head 
is a planar ring containing a C-terminal 
domain (gray) and six AAA domains, four 
of which retain ATP-binding sequences, 
but only one of which (dark red) has the 
major ATPase activity. Extending from 
the head are a long, coiled-coil stalk with 
the microtubule-binding site at the tip, 
and a tail that attaches to an adjacent 
microtubule in the axoneme. In the ATP-bound 
state, the stalk is detached from 
the microtubule, but ATP hydrolysis 
causes stalk-microtubule attachment (left). 
Subsequent release of ADP and phosphate 
(Pi) then leads to a large conformational 
“power stroke” involving rotation of the 
head and stalk relative to the tail (right). 
Each cycle generates a step of about 8 nm, 
thereby contributing to flagellar beating (see 
Figure 16-65). In the case of cytoplasmic 
dynein, the tail is attached to a cargo such 
as a vesicle, and a single power stroke 
transports the cargo about 8 nm along 
the microtubule toward its minus end (see 
Figure 16-60). (C) Electron micrographs of 
purified monomeric dyneins in two different 
conformations representing different steps 
in the mechanochemical cycle. (C, from 
S.A. Burgess et al., Nature 421:715-718, 
2003. With permission from Macmillan 
Publishers Ltd.) 


Dyneins are the largest of the known molecular motors, and they are also 
among the fastest: axonemal dyneins attached to a glass slide can move 
microtubules at the rate of 14 μm/sec. The dynein motor is structurally unrelated to 
myosins and kinesins, but still follows the general rule of coupling nucleotide 
hydrolysis to microtubule binding and unbinding as well as to a force-generating 
conformational change (Figure 16-59). 

Microtubules and Motors Move Organelles and Vesicles 

A major function of cytoskeletal motors in interphase cells is the transport and 
positioning of membrane-enclosed organelles (Movie 16.10). Kinesin was 
originally identified as the protein responsible for fast anterograde axonal transport, 
the rapid movement of mitochondria, secretory vesicle precursors, and various 
synapse components down the microtubule highways of the axon to the distant 
nerve terminals. Cytoplasmic dynein was identified as the motor responsible for 
transport in the opposite direction, retrograde axonal transport. Although 
organelles in most cells need not cover such long distances, their polarized transport 
is equally necessary. A typical microtubule array in an interphase cell is oriented 
with the minus ends near the center of the cell at the centrosome and the plus 
ends extending to the cell periphery. Thus, centripetal movements of organelles 
or vesicles toward the cell center require the action of minus-end directed 
cytoplasmic dynein motors, whereas centrifugal movements toward the periphery 
require plus-end directed kinesin motors. Interestingly, in animal cells, nearly all 
minus-end directed transport is driven by the single cytoplasmic dynein 1 motor, 
whereas 15 different kinesins are used for plus-end directed transport. 
A clear example of the effect of microtubules and microtubule motors on 
the behavior of intracellular membranes is their role in organizing the 
endoplasmic reticulum (ER) and the Golgi apparatus. The network of ER membrane 
tubules aligns with microtubules and extends almost to the edge of the cell 
(Movie 16.11), whereas the Golgi apparatus is located near the centrosome. When 
cells are treated with a drug that depolymerizes microtubules, such as colchicine 
or nocodazole, the ER collapses to the center of the cell, while the Golgi 
apparatus fragments and disperses throughout the cytoplasm. In vitro, kinesins can 
tether ER-derived membranes to preformed microtubule tracks and walk toward 
the microtubule plus ends, dragging the ER membranes out into tubular 
protrusions and forming a membranous web that looks very much like the ER in cells. 
Conversely, dyneins are required for positioning the Golgi apparatus near the cell 
center of animal cells; they do this by moving Golgi vesicles along microtubule 
tracks toward the microtubules' minus ends at the centrosome. 

The different tails and their associated light chains on specific motor proteins 
allow the motors to attach to their appropriate organelle cargo. 
Membrane-associated motor receptors that are sorted to specific membrane-enclosed 
compartments interact directly or indirectly with the tails of the appropriate kinesin 
family members. Many viruses take advantage of microtubule motor-based 
transport during infection and use kinesin to move from their site of replication and 
assembly to the plasma membrane, from which they are poised to infect 
neighboring cells. An outer-membrane protein of Vaccinia virus, for example, contains 
an amino acid motif that mediates binding to kinesin-1 light chain and transport 
along microtubules to the plasma membrane. Interestingly, this motif is present 
in over 450 human proteins, one-third of which are associated with human 
diseases. Thus, kinesin transports a diverse set of cargoes involved in a wide range of 
important cellular functions. 

For dynein, a large macromolecular assembly often mediates attachment to 
membranes. Cytoplasmic dynein, itself a huge protein complex, requires 
association with a second large protein complex called dynactin to translocate organelles 
effectively. The dynactin complex includes a short, actin-like filament that forms 
from the actin-related protein Arp1 (distinct from Arp2 and Arp3, the components 
of the Arp 2/3 complex involved in the nucleation of conventional actin filaments) 
(Figure 16-60). A number of other proteins also contribute to dynein cargo 
binding and motor regulation, and their function is especially important in neurons, 
where defects in microtubule-based transport have been linked to neurological 
diseases. A striking example is smooth brain, or lissencephaly, a human 
disorder in which cells fail to migrate to the cerebral cortex of the developing brain. 
One type of lissencephaly is caused by defects in Lis1, a dynein-binding protein 
required for nuclear migration in several species. In the normal brain, migration 
of the nucleus directs the developing neural cell body toward its correct position 
in the cortex. In the absence of Lis1, however, the nuclei of migrating neurons 
fail to attach to dynein, resulting in nuclear-migration defects. Dynein is required 
continuously for neuronal function, as mutations in a dynactin subunit or in the 
tail region of cytoplasmic dynein lead to neuronal degeneration in humans and 
mice. These effects are associated with decreased retrograde axonal transport and 
provide strong evidence for the importance of robust axonal transport in neuronal 
viability. 

The cell can regulate the activity of motor proteins and thereby cause either 
a change in the positioning of its membrane-enclosed organelles or whole-cell 
movements. Fish melanocytes provide one of the most dramatic examples. These 
giant cells, which are responsible for rapid changes in skin coloration in several 
species of fish, contain large pigment granules that can alter their location in 
response to neuronal or hormonal stimulation (Figure 16-61). The pigment 
granules aggregate or disperse by moving along an extensive network of microtubules 
that are anchored at the centrosome by their minus ends. The tracking of 
individual pigment granules reveals that the inward movement is rapid and smooth, 
while the outward movement is jerky, with frequent backward steps. Both dynein 
and kinesin microtubule motors are associated with the pigment granules. The 
Figure 16-60 Dynactin mediates the 
attachment of dynein to a membrane-enclosed 
organelle. Dynein requires the 
presence of a large number of accessory 
proteins to associate with membrane-enclosed 
organelles. Dynactin is a large 
complex that includes components that 
bind weakly to microtubules, components 
that bind to dynein itself, and components 
that form a small, actin-like filament made 
of the actin-related protein Arp1. 
jerky outward movements may result from a tug-of-war between the two opposing 
microtubule motor proteins, with the stronger kinesin winning out overall. When 
intracellular cyclic AMP levels decrease, kinesin is inactivated, leaving dynein free 
to drag the pigment granules rapidly toward the cell center, changing the fish's 
color. In a similar way, the movement of other membrane organelles coated with 
particular motor proteins is controlled by a complex balance of competing signals 
that regulate both motor protein attachment and activity. 
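
The contrast between jerky outward movement and smooth inward movement can be mimicked with a toy random walk. The sketch below is a cartoon rather than a fitted model: the function name, the unit step size, and the probability that kinesin wins any given step are all hypothetical assumptions. It treats each step as a small tug-of-war when kinesin is active and as unopposed dynein movement when kinesin is inactivated, as happens when cyclic AMP levels fall.

```python
import random


def track_granule(n_steps, kinesin_active, p_kinesin_wins=0.7, seed=0):
    """Return the radial position of one pigment granule after each step.

    Positive steps are outward (toward microtubule plus ends, kinesin);
    negative steps are inward (toward minus ends at the centrosome, dynein).
    """
    rng = random.Random(seed)
    position = 0
    trajectory = [position]
    for _ in range(n_steps):
        if kinesin_active:
            # Tug-of-war: kinesin usually wins, but dynein sometimes pulls back.
            step = 1 if rng.random() < p_kinesin_wins else -1
        else:
            # Kinesin inactivated: dynein moves the granule inward unopposed.
            step = -1
        position += step
        trajectory.append(position)
    return trajectory


dispersing = track_granule(1000, kinesin_active=True)    # high cAMP
aggregating = track_granule(1000, kinesin_active=False)  # low cAMP
backward_steps = sum(b < a for a, b in zip(dispersing, dispersing[1:]))
print(dispersing[-1] > 0, backward_steps > 0, aggregating[-1])
```

In this cartoon the dispersing granule makes net outward progress despite many backward steps, whereas the aggregating granule moves inward on every step.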

Construction of Complex Microtubule Assemblies Requires 
Microtubule Dynamics and Motor Proteins 

The construction of the mitotic spindle and the neuronal cytoskeleton are 
important and fascinating examples of the power of organization by teams of motor 
proteins interacting with dynamic cytoskeletal filaments. As described in Chapter 
17, mitotic spindle assembly depends on reorganization of the interphase array 
of microtubules to form a bipolar array of microtubules, with their minus ends 
focused at the poles and their plus ends overlapping in the center or connecting to 
chromosomes. Spindle assembly depends on the coordinated actions of several 
motor proteins and other factors that modulate polymerization dynamics (see 
Figures 17-23 and 17-25). 

Neurons also contain complex cytoskeletal structures. As they differentiate, 
neurons send out specialized processes that will either receive electrical signals 
(dendrites) or transmit electrical signals (axons) (see Figure 16-50). The beautiful 
and elaborate branching morphology of axons and dendrites enables neurons to 
form tremendously complex signaling networks, interacting with many other cells 
simultaneously and making possible the complicated behavior of the higher 
animals. Both axons and dendrites (collectively called neurites) are filled with 
bundles of microtubules that are critical to both their structure and their function. 

In axons, all the microtubules are oriented in the same direction, with their 
minus end pointing back toward the cell body and their plus end pointing toward 
the axon terminals (Figure 16-62). The microtubules do not reach from the cell 


Figure 16-61 Regulated melanosome 
movements in fish pigment cells. 

These giant cells, which are responsible 
for changes in skin coloration in several 
species of fish, contain large pigment 
granules, or melanosomes (brown). The 
melanosomes can change their location 
in the cell in response to a hormonal or 
neuronal stimulus. (A) Schematic view of 
a pigment cell, showing the dispersal and 
aggregation of melanosomes in response 
to an increase or decrease in intracellular 
cyclic AMP (cAMP), respectively. Both 
redistributions of melanosomes occur along 
microtubules. (B) Bright-field images of a 
single cell in a scale of an African cichlid 
fish, showing its melanosomes either 
dispersed throughout the cytoplasm (left) or 
aggregated in the center of the cell (right). 
(B, courtesy of Leah Haimo.) 




Figure 16-62 Microtubule organization 
in a neuron. In a neuron, microtubule 
organization is complex. In the axon, all 
microtubules share the same polarity, with 
the plus ends pointing outward toward 
the axon terminus. No single microtubule 
stretches the entire length of the axon; 
instead, short overlapping segments of 
parallel microtubules make the tracks 
for fast axonal transport. In dendrites, 
the microtubules are of mixed polarity, 
with some plus ends pointing outward 
and some pointing inward. Vesicles can 
associate with both kinesin and dynein 
and move in either direction along the 
microtubules in axons and dendrites, 
depending on which motor is active. 
body all the way to the axon terminals; each is typically only a few micrometers 
in length, but large numbers are staggered in an overlapping array. These aligned 
microtubule tracks act as a highway to transport specific proteins, 
protein-containing vesicles, and mRNAs to the axon terminals, where synapses are 
constructed and maintained. The longest axon in the human body reaches from the 
base of the spinal cord to the foot and is up to a meter in length. By comparison, 
dendrites are generally much shorter than axons. The microtubules in dendrites 
lie parallel to one another but their polarities are mixed, with some pointing their 
plus ends toward the dendrite tip, while others point back toward the cell body, 
reminiscent of the antiparallel microtubule array of the mitotic spindle. 
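
A short calculation gives a feel for the distances involved in axonal transport. The sketch below takes the one-meter axon length from the text; the transport speed of 1 μm/sec is a hypothetical round number chosen for illustration, and the calculation assumes the cargo never pauses or reverses.

```python
def transit_time_days(axon_length_m, speed_um_per_s):
    """Time for a cargo to travel the full axon without pausing, in days."""
    length_um = axon_length_m * 1e6          # meters -> micrometers
    seconds = length_um / speed_um_per_s
    return seconds / 86_400                  # seconds per day


# One-meter axon at an assumed, uninterrupted 1 um/sec.
print(round(transit_time_days(1.0, 1.0), 1))  # 11.6
```

Even under these generous assumptions the journey takes more than a week, which helps explain why continuous, motor-driven transport along the overlapping microtubule tracks is essential for long axons.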


Motile Cilia and Flagella Are Built from Microtubules and Dyneins 

Just as myofibrils are highly specialized and efficient motility machines built from 
actin and myosin filaments, cilia and flagella are highly specialized and efficient 
motility structures built from microtubules and dynein. Both cilia and flagella are 
hairlike cell appendages that have a bundle of microtubules at their core. Flagella 
are found on sperm and many protozoa. By their undulating motion, they enable 
the cells to which they are attached to swim through liquid media. Cilia are 
organized in a similar fashion, but they beat with a whiplike motion that resembles the 
breaststroke in swimming. Ciliary beating can either propel single cells through a 
fluid (as in the swimming of the protozoan Paramecium) or can move fluid over 
the surface of a group of cells in a tissue. In the human body, huge numbers of cilia 
(10⁹/cm² or more) line our respiratory tract, sweeping layers of mucus, trapped 
particles of dust, and bacteria up to the mouth where they are swallowed and 
ultimately eliminated. Likewise, cilia along the oviduct help to sweep eggs toward the 
uterus. 

The movement of a cilium or a flagellum is produced by the bending of its 
core, which is called the axoneme. The axoneme is composed of microtubules 
and their associated proteins, arranged in a distinctive and regular pattern. Nine 
special doublet microtubules (comprising one complete and one partial 
microtubule fused together so that they share a common tubule wall) are arranged in 
a ring around a pair of single microtubules (Figure 16-63). Almost all forms of 
motile eukaryotic flagella and cilia (from protozoans to humans) have this 
characteristic arrangement. The microtubules extend continuously for the length of 
the axoneme, which can be 10-200 μm. At regular positions along the length of the 
microtubules, accessory proteins cross-link the microtubules together. 
Figure 16-63 The arrangement of microtubules in a flagellum or cilium. (A) Electron micrograph of the flagellum of a green-alga cell 
(Chlamydomonas) shown in cross section, illustrating the distinctive “9 + 2” arrangement of microtubules. (B) Diagram of the parts of a flagellum or 
cilium. The various projections from the microtubules link the microtubules together and occur at regular intervals along the length of the axoneme. 
(C) High-resolution electron tomography image of an outer doublet microtubule showing structural details and features inside the microtubules called 
microtubule inner proteins (MIPs). (A, courtesy of Lewis Tilney; C, courtesy of Daniela Nicastro.) 
Molecules of axonemal dynein form bridges between the neighboring doublet 
microtubules around the circumference of the axoneme (Figure 16-64). When 
the motor domain of this dynein is activated, the dynein molecules attached to 
one microtubule doublet (see Figure 16-59) attempt to walk along the adjacent 
microtubule doublet, tending to force the adjacent doublets to slide relative to 
one another, much as actin thin filaments slide during muscle contraction. 
However, the presence of other links between the microtubule doublets prevents this 
sliding, and the dynein force is instead converted into a bending motion (Figure 
16-65). 

In humans, hereditary defects in axonemal dynein cause a condition called
primary ciliary dyskinesia or Kartagener's syndrome. This syndrome is characterized
by inversion of the normal asymmetry of internal organs (situs inversus) due
to disruption of fluid flow in the developing embryo, male sterility due to
immotile sperm, and a high susceptibility to lung infections due to paralyzed cilia being
unable to clear the respiratory tract of debris and bacteria.

Bacteria also swim using cell-surface structures called flagella, but these do 
not contain microtubules or dynein and do not wave or beat. Instead, bacterial 
flagella are long, rigid helical filaments, made up of repeating subunits of the
protein flagellin. The flagella rotate like propellers, driven by a special rotary motor
embedded in the bacterial cell wall. The use of the same name to denote these two
very different types of swimming apparatus is an unfortunate historical accident. 

Primary Cilia Perform Important Signaling Functions in Animal Cells 

Many cells possess a shorter, nonmotile counterpart of cilia and flagella called 
the primary cilium. Primary cilia can be viewed as specialized cellular compart- 
ments or organelles that perform a wide range of cellular functions, but share 


Figure 16-64 Ciliary dynein. Ciliary 
(axonemal) dynein is a large protein 
assembly (nearly 2 million daltons) 
composed of 9-12 polypeptide chains, the 
largest of which is the heavy chain of more 
than 500,000 daltons. (A) The heavy chains 
form the major portion of the globular 
head and stem domains, and many of the 
smaller chains are clustered around the 
base of the stem. There are two heads 
in the outer dynein in metazoans (shown 
here), but three heads in protozoa, each 
formed from their own heavy chain. The 
tail of the molecule binds tightly to an
A microtubule, while the large globular
heads have an ATP-dependent binding
site for a B microtubule (see Figure 16-63). 
When the heads hydrolyze their bound ATP, 
they move toward the minus end of the 
B microtubule, thereby producing a sliding 
force between the adjacent microtubule 
doublets in a cilium or flagellum (see 
Figure 16-59). (B) Freeze-etch electron
micrograph of a cilium showing the dynein 
arms projecting at regular intervals from 
the doublet microtubules. (B, courtesy of 
John Heuser.) 



Figure 16-65 The bending of an 
axoneme. (A) When axonemes are 
exposed to the proteolytic enzyme trypsin, 
the linkages holding neighboring doublet 
microtubules together are broken. In this 
case, the addition of ATP allows the motor 
action of the dynein heads to slide one 
pair of doublet microtubules against the 
other pair. (B) In an intact axoneme (such 
as in a sperm), flexible protein links prevent 
the sliding of the doublet. The motor 
action therefore causes a bending motion, 
creating waves or beating motions. 

many structural features with motile cilia. Both motile and nonmotile cilia are
generated during interphase at plasma-membrane-associated structures called
basal bodies that firmly root them at the cell surface. At the core of each basal body
is a centriole, the same structure found embedded at the center of animal
centrosomes, with nine groups of fused triplet microtubules arranged in a cartwheel
(see Figure 16-48). Centrioles are multifunctional, contributing to assembly of the
mitotic spindle in dividing cells but migrating to the plasma membrane of
interphase cells to template the nucleation of the axoneme (Figure 16-66). Because
no protein translation occurs in cilia, construction of the axoneme requires
intraflagellar transport (IFT), a transport system discovered in the green alga
Chlamydomonas. As in the axon, motors move cargoes in both anterograde and
retrograde directions, in this case driven by kinesin-2 and cytoplasmic dynein 2,
respectively.

Primary cilia are found on the surface of almost all cell types, where they sense
and respond to the exterior environment, functions best understood in the
context of smell and sight. In the nasal epithelium, cilia protruding from dendrites of
olfactory neurons are the site of both odorant reception and signal amplification.
Similarly, the rod and cone cells of the vertebrate retina possess a primary cilium
equipped with an expanded tip called the outer segment, which is specialized for
converting light into a neural signal (see Figure 15-38). Maintenance of the outer
segment requires continuous IFT-mediated transport of large quantities of lipids
and proteins into the cilium, at rates of up to 2000 molecules per minute. The links
between cilia function and the senses of sight and smell are underscored by
Bardet-Biedl syndrome, a set of disorders associated with defects in IFT, the cilium, or
the basal body. Patients with Bardet-Biedl syndrome cannot smell and suffer from
retinal degeneration. Other characteristics of this multifaceted disorder include
hearing loss, polycystic kidney disease, diabetes, obesity, and polydactyly,
suggesting that primary cilia have functions in many aspects of human physiology.

Summary 

Microtubules are stiff polymers of tubulin molecules. They assemble by addition of
GTP-containing tubulin subunits to the free end of a microtubule, with one end (the
plus end) growing faster than the other. Hydrolysis of the bound GTP takes place

Figure 16-66 Primary cilia. (A) Electron micrograph and diagram of the basal body of a mouse neuron primary cilium. The axoneme of the primary
cilium (black arrow) is nucleated by the mother centriole at the basal body, which localizes at the plasma membrane near the cell surface.
(B) Centrioles function alternately as basal bodies and as the core of centrosomes. Before a cell enters the cell division cycle, the primary cilium is
shed or resorbed. The centrioles recruit pericentriolar material and duplicate during S phase, generating two centrosomes, each of which contains
a pair of centrioles. The centrosomes nucleate microtubules and localize to the poles of the mitotic spindle. Upon exit from mitosis, a primary cilium
again grows from the mother centriole. (A, courtesy of Josef Spacek.)

after assembly and weakens the bonds that hold the microtubule together.
Microtubules are dynamically unstable and liable to catastrophic disassembly, but they can
be stabilized in cells by association with other structures. Microtubule-organizing
centers such as centrosomes protect the minus ends of microtubules and
continually nucleate the formation of new microtubules. Microtubule-associated proteins
(MAPs) stabilize microtubules, and those that localize to the plus end (+TIPs) can
alter the dynamic properties of the microtubule or mediate their interaction with
other structures. Counteracting the stabilizing activity of MAPs are catastrophe
factors, such as kinesin-13 proteins, that act to peel apart microtubule ends. Other
kinesin family members as well as dynein use the energy of ATP hydrolysis to move
unidirectionally along a microtubule. The motor dynein moves toward the minus
end of microtubules, and its sliding of axonemal microtubules underlies the beating
of cilia and flagella. Primary cilia are nonmotile sensory organs found on many
cell types.


INTERMEDIATE FILAMENTS AND SEPTINS 

All eukaryotic cells contain actin and tubulin. But the third major type of
cytoskeletal protein, the intermediate filament, forms a cytoplasmic filament only in
some metazoans—including vertebrates, nematodes, and mollusks. Intermediate
filaments are particularly prominent in the cytoplasm of cells that are subject to
mechanical stress and are generally not found in animals that have rigid
exoskeletons, such as arthropods and echinoderms. It seems that intermediate filaments
impart mechanical strength to tissues for the squishier animals.

Cytoplasmic intermediate filaments are closely related to their ancestors, the
much more prevalent nuclear lamins, which are found in many eukaryotes but
missing from unicellular organisms. The nuclear lamins form a meshwork lining
the inner membrane of the nuclear envelope, where they provide anchorage sites
for chromosomes and nuclear pores. Several times during metazoan evolution,
lamin genes have apparently duplicated, and the duplicates have evolved to
produce ropelike, cytoplasmic intermediate filaments. In contrast to the highly
conserved actins and tubulin isoforms that are encoded by a handful of genes,
different families of intermediate filaments are much more diverse and are encoded by
70 different human genes with distinct, cell type-specific functions (Table 16-2).


TABLE 16-2 Major Types of Intermediate Filament Proteins in Vertebrate Cells

Types of intermediate filament   Component polypeptides             Location
Nuclear                          Lamins A, B, and C                 Nuclear lamina (inner lining of nuclear envelope)
Vimentin-like                    Vimentin                           Many cells of mesenchymal origin
                                 Desmin                             Muscle
                                 Glial fibrillary acidic protein    Glial cells (astrocytes and some Schwann cells)
                                 Peripherin                         Some neurons
Epithelial                       Type I keratins (acidic)           Epithelial cells and their derivatives (e.g., hair and nails)
                                 Type II keratins (neutral/basic)   Epithelial cells and their derivatives (e.g., hair and nails)
Axonal                           Neurofilament proteins             Neurons
                                 (NF-L, NF-M, and NF-H)

Figure 16-67 A model of intermediate filament construction. The monomer shown in (A) pairs with another monomer to
form a dimer (B), in which the conserved central rod domains are aligned in parallel and wound together into a coiled-coil.
(C) Two dimers then line up side by side to form an antiparallel tetramer of four polypeptide chains. Dimers and tetramers are
the soluble subunits of intermediate filaments. (D) Within each tetramer, the two dimers are offset with respect to one another,
thereby allowing it to associate with another tetramer. (E) In the final 10-nm ropelike filament, tetramers are packed together in a
helical array, which has 16 dimers (32 coiled-coils) in cross section. Half of these dimers are pointing in each direction. An electron
micrograph of intermediate filaments is shown on the upper left (Movie 16.12). (Electron micrograph courtesy of Roy Quinlan.)


Intermediate Filament Structure Depends on the Lateral Bundling
and Twisting of Coiled-Coils

Although their amino- and carboxy-terminal domains differ, all intermediate
filament family members are elongated proteins with a conserved central α-helical
domain containing 40 or so heptad repeat motifs that form an extended
coiled-coil structure with another monomer (see Figure 3-9). A pair of parallel dimers
then associates in an antiparallel fashion to form a staggered tetramer (Figure
16-67). Unlike actin or tubulin subunits, intermediate filament subunits do not
contain a binding site for a nucleotide. Furthermore, since the tetrameric
subunit is made up of two dimers pointing in opposite directions, its two ends are the
same. The assembled intermediate filament therefore lacks the overall structural
polarity that is critical for actin filaments and microtubules. The tetramers pack
together laterally to form the filament, which includes eight parallel
protofilaments made up of tetramers. Each individual intermediate filament therefore has
a cross section of 32 individual α-helical coils. This large number of polypeptides
all lined up together, with the strong lateral hydrophobic interactions typical of
coiled-coil proteins, gives intermediate filaments a ropelike character. They can
be easily bent, with a persistence length of less than one micrometer (compared
to several millimeters for microtubules and about ten micrometers for actin), but
they are extremely difficult to break and can be stretched to over three times their
length (see Figure 16-6).
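
The two numerical statements in the paragraph above can be made concrete with a short sketch. It first reproduces the chain count (eight protofilaments of four-chain tetramers), then compares how quickly the three filament types lose their direction under thermal bending. The comparison uses the standard worm-like-chain relation, in which the average cosine of the angle between tangents a contour distance s apart falls off as exp(-s/Lp), where Lp is the persistence length. The specific values are assumptions chosen to match the text: 1 μm is used as an upper bound for intermediate filaments ("less than one micrometer"), and 5 mm stands in for "several millimeters". The function name and the 10 μm test distance are illustrative only.

```python
import math

# Chain count in one filament cross section, as described in the text:
# eight protofilaments, each built from tetramers of four polypeptide chains.
PROTOFILAMENTS = 8
CHAINS_PER_TETRAMER = 4
chains_in_cross_section = PROTOFILAMENTS * CHAINS_PER_TETRAMER  # 8 * 4 = 32

# Persistence lengths in micrometers (assumed values, see lead-in).
PERSISTENCE_LENGTH_UM = {
    "intermediate filament": 1.0,   # upper bound for "less than one micrometer"
    "actin filament": 10.0,         # "about ten micrometers"
    "microtubule": 5000.0,          # 5 mm, standing in for "several millimeters"
}


def tangent_correlation(contour_um: float, persistence_um: float) -> float:
    """Worm-like-chain estimate of <cos(theta)> between two tangent
    directions separated by contour_um along the filament."""
    return math.exp(-contour_um / persistence_um)


if __name__ == "__main__":
    print(f"alpha-helical chains per cross section: {chains_in_cross_section}")
    distance_um = 10.0  # roughly the scale of a cell; illustrative choice
    for name, lp in PERSISTENCE_LENGTH_UM.items():
        corr = tangent_correlation(distance_um, lp)
        print(f"{name:22s} Lp = {lp:7.1f} um  <cos theta> over 10 um = {corr:.4f}")
```

A correlation near 1 means the filament is effectively straight over that distance (the microtubule), whereas a value near 0 means its direction has been randomized (the intermediate filament), which is the sense in which intermediate filaments are "easily bent".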

Less is understood about the mechanism of assembly and disassembly of 
intermediate filaments than of actin filaments and microtubules. In pure protein 
solutions, intermediate filaments are extremely stable due to tight association 
of subunits, but some types of intermediate filaments, including vimentin, form 
highly dynamic structures in cells such as fibroblasts. Protein phosphorylation 
probably regulates their disassembly, in much the same way that phosphorylation
regulates the disassembly of nuclear lamins in mitosis (see Figure 12-18). As
evidence for rapid turnover, labeled subunits microinjected into tissue-culture cells
incorporate into intermediate filaments within a few minutes. Remodeling of the
intermediate filament network accompanies events requiring dynamic cellular
reorganization, such as division, migration, and differentiation.

Intermediate Filaments Impart Mechanical Stability 
to Animal Cells 

Keratins are the most diverse intermediate filament family: there are about 20
found in different types of human epithelial cells and about 10 more that are
specific to hair and nails; analysis of the human genome sequence has revealed that
there are 54 distinct keratins. Every keratin filament is made up of an equal
mixture of type I (acidic) and type II (neutral/basic) keratin proteins; these form a
heterodimer filament subunit (see Figure 16-67). Cross-linked keratin networks
held together by disulfide bonds can survive even the death of their cells, forming
tough coverings for animals, as in the outer layer of skin and in hair, nails, claws,
and scales. The diversity in keratins is clinically useful in the diagnosis of
epithelial cancers (carcinomas), as the particular set of keratins expressed gives an
indication of the epithelial tissue in which the cancer originated and thus can help to
guide the choice of treatment.

A single epithelial cell may produce multiple types of keratins, and these
copolymerize into a single network (Figure 16-68). Keratin filaments impart
mechanical strength to epithelial tissues in part by anchoring the intermediate
filaments at sites of cell-cell contact, called desmosomes, or cell-matrix contact,
called hemidesmosomes (see Figure 16-4). We discuss these important adhesive
structures in Chapter 19. Accessory proteins, such as filaggrin, bundle keratin
filaments in differentiating cells of the epidermis to give the outermost layers of the

Figure 16-68 Keratin filaments in
epithelial cells. Immunofluorescence
micrograph of the network of keratin
filaments (blue) in a sheet of epithelial
cells in culture. The filaments in each cell 
are indirectly connected to those of its 
neighbors by desmosomes (discussed in 
Chapter 19). A second protein (red) has 
been stained to reveal the location of the 
cell boundaries. (Courtesy of Kathleen 
Green and Evangeline Amargo.) 

skin their special toughness. Individuals with mutations in the gene encoding
filaggrin are strongly predisposed to dry skin diseases such as eczema.

Mutations in keratin genes cause several human genetic diseases. For example,
when defective keratins are expressed in the basal cell layer of the epidermis,
they produce a disorder called epidermolysis bullosa simplex, in which the skin
blisters in response to even very slight mechanical stress, which ruptures the basal
cells (Figure 16-69). Other types of blistering diseases, including disorders of the
mouth, esophageal lining, and the cornea of the eye, are caused by mutations in
the different keratins whose expression is specific to those tissues. All of these
maladies are typified by cell rupture as a consequence of mechanical trauma
and a disorganization or clumping of the keratin filament cytoskeleton. Many of
the specific mutations that cause these diseases alter the ends of the central rod
domain, demonstrating the importance of this particular part of the protein for
correct filament assembly.

Members of another family of intermediate filaments, called neurofilaments, 
are found in high concentrations along the axons of vertebrate neurons (Figure 
16-70). Three types of neurofilament proteins (NF-L, NF-M, and NF-H) coassemble
in vivo, forming heteropolymers. The NF-H and NF-M proteins have lengthy
C-terminal tail domains that bind to neighboring filaments, generating aligned
arrays with a uniform interfilament spacing. During axonal growth, new
neurofilament subunits are incorporated all along the axon in a dynamic process that
involves the addition of subunits along the filament length as well as the ends.
After an axon has grown and connected with its target cell, the diameter of the
axon may increase as much as fivefold. The level of neurofilament gene expression
seems to directly control axonal diameter, which in turn influences how
fast electrical signals travel down the axon. In addition, neurofilaments provide
strength and stability to the long cell processes of neurons. 

The neurodegenerative disease amyotrophic lateral sclerosis (ALS, or Lou
Gehrig's disease) is associated with an accumulation and abnormal assembly
of neurofilaments in motor neuron cell bodies and in the axon, aberrations that
may interfere with normal axonal transport. The degeneration of the axons leads
to muscle weakness and atrophy, which is usually fatal. The overexpression of
human NF-L or NF-H in mice results in mice that have an ALS-like disease.
However, a causative link between neurofilament pathology and ALS has not been
firmly established.

Figure 16-69 Blistering of the skin caused by a mutant keratin gene. A mutant gene encoding a truncated keratin protein (lacking both the
N- and C-terminal domains) was expressed in a transgenic mouse. The defective protein assembles with the normal keratins and thereby disrupts
the keratin filament network in the basal cells of the skin. Light micrographs of cross sections of (A) normal and (B) mutant skin show that the
blistering results from the rupturing of cells in the basal layer of the mutant epidermis (short red arrows). (C) A sketch of three cells in the basal
layer of the mutant epidermis, as observed by electron microscopy. As indicated by the red arrow, the cells rupture between the nucleus and the
hemidesmosomes (discussed in Chapter 19), which connect the keratin filaments to the underlying basal lamina. (From P.A. Coulombe et al., J. Cell
Biol. 115:1661-1674, 1991. With permission from The Rockefeller University Press.)

The vimentin-like filaments are a third family of intermediate filaments.
Desmin, a member of this family, is expressed in skeletal, cardiac, and smooth
muscle, where it forms a scaffold around the Z disc of the sarcomere (see Figure
16-34). Mice lacking desmin show normal initial muscle development, but adults
have various muscle-cell abnormalities, including misaligned muscle fibers. In
humans, mutations in desmin are associated with various forms of muscular
dystrophy and cardiac myopathy, illustrating the important role of desmin in
stabilizing muscle fibers.

Besides their well-established role in maintaining the mechanical stability
of the nucleus, it is becoming increasingly evident that one class of lamins, the
A-type, together with many proteins of the nuclear envelope, are scaffolds for
proteins that control myriad cellular processes including transcription, chromatin
organization, and signal transduction. The majority of laminopathies are
associated with mutant versions of lamin A and include tissue-specific diseases. Skeletal
and cardiac abnormalities might be explained by a weakened nuclear envelope
leading to cell damage and death, but laminopathies are also thought to arise
from pathogenic and tissue-specific alterations in gene expression.

Linker Proteins Connect Cytoskeletal Filaments and Bridge the 
Nuclear Envelope 

The intermediate filament network is linked to the rest of the cytoskeleton by
members of a family of proteins called plakins. Plakins are large and modular,
containing multiple domains that connect cytoskeletal filaments to each other
and to junctional complexes. Plectin is a particularly interesting example. In
addition to bundling intermediate filaments, it links the intermediate filaments to
microtubules, actin filament bundles, and filaments of the motor protein myosin
II; it also helps attach intermediate filament bundles to adhesive structures at the
plasma membrane (Figure 16-71).

Plectin and other plakins can interact with protein complexes that connect 
the cytoskeleton to the nuclear interior. These complexes consist of SUN proteins 



Figure 16-70 Two types of intermediate
filaments in cells of the nervous system.
(A) Freeze-etch electron microscopic
image of neurofilaments in a nerve cell
axon, showing the extensive cross-linking
through protein cross-bridges—an
arrangement believed to give this long cell
process great tensile strength. The
cross-bridges are formed by the long, nonhelical
extensions at the C-terminus of the largest
neurofilament protein (NF-H). (B) Freeze-etch
image of glial filaments in glial cells,
showing that these intermediate filaments
are smooth and have few cross-bridges.
(C) Conventional transmission electron
micrograph of a cross section of an axon
showing the regular side-to-side spacing
of the neurofilaments, which greatly
outnumber the microtubules.
(A and B, courtesy of Nobutaka Hirokawa;
C, courtesy of John Hopkins.)


Figure 16-71 Plectin cross-linking of 
diverse cytoskeletal elements. Plectin
(green) is seen here making cross-links
from intermediate filaments (blue)
to microtubules (red). In this electron
micrograph, the dots (yellow) are gold 
particles linked to anti-plectin antibodies. 
The entire actin filament network was 
removed to reveal these proteins. (From 
T.M. Svitkina et al., J. Cell Biol. 135:991- 
1007, 1996. With permission from The 
Rockefeller University Press.) 

Figure 16-72 SUN-KASH protein
complexes connect the nucleus
and cytoplasm through the nuclear 
envelope. The cytoplasmic cytoskeleton 
is linked across the nuclear envelope 
to the nuclear lamina or chromosomes 
through SUN and KASH proteins (orange 
and purple, respectively). The SUN and 
KASH domains of these proteins bind 
within the lumen of the nuclear envelope. 
From the inner nuclear envelope, SUN 
proteins connect to the nuclear lamina 
or chromosomes. KASH proteins in the 
outer nuclear envelope connect to the 
cytoplasmic cytoskeleton by binding 
microtubule motor proteins, actin 
filaments, or plectin. 


of the inner nuclear membrane and KASH proteins (also called nesprins) of the
outer nuclear membrane (Figure 16-72). SUN and KASH proteins bind to each
other within the lumen of the nuclear envelope, forming a bridge that connects the
nuclear and cytoplasmic cytoskeletons. Inside the nucleus, the SUN proteins bind
to the nuclear lamina or chromosomes, whereas in the cytoplasm, KASH proteins
can bind directly to actin filaments and indirectly to microtubules and
intermediate filaments through association with motor proteins and plakins, respectively.
This linkage serves to mechanically couple the nucleus to the cytoskeleton and is
involved in many cellular functions, including chromosome movements inside
the nucleus during meiosis, nuclear and centrosome positioning, nuclear
migration, and global cytoskeletal organization.

Mutations in the gene for plectin cause a devastating human disease that
combines epidermolysis bullosa (caused by disruption of skin keratin filaments),
muscular dystrophy (caused by disruption of desmin filaments), and
neurodegeneration (caused by disruption of neurofilaments). Mice lacking a functional
plectin gene die within a few days of birth, with blistered skin and abnormal
skeletal and heart muscles. Thus, although plectin may not be necessary for the
initial formation and assembly of intermediate filaments, its cross-linking action is
required to provide cells with the strength they need to withstand the mechanical
stresses inherent to vertebrate life.

Septins Form Filaments That Regulate Cell Polarity 

GTP-binding proteins called septins serve as an additional filament system in all
eukaryotes except terrestrial plants. Septins assemble into nonpolar filaments
that form rings and cagelike structures, which act as scaffolds to
compartmentalize membranes into distinct domains, or recruit and organize the actin and
microtubule cytoskeletons. First identified in budding yeast, septin filaments
localize to the neck between a dividing yeast mother cell and its growing bud
(Figure 16-73A). At this location, septins block the movement of proteins from
one side of the bud neck to the other, thereby concentrating cell growth
preferentially within the bud. Septins also recruit the actin-myosin machinery that forms
the contractile ring required for cytokinesis. In animal cells, septins function in
cell division, migration, and vesicle trafficking. In primary cilia, for example, a
ring of septin filaments assembles at the base of the cilium and serves as a
diffusion barrier at the plasma membrane, restricting the movement of membrane
proteins and establishing a specific composition in the ciliary membrane (Figure
16-73B and C). Reduction of septin levels impairs primary cilium formation and
signaling.

There are 7 septin genes in yeast and 13 in humans, and septin proteins fall
into four groups on the basis of sequence relationships. In a test tube, purified 
septins assemble into symmetrical hetero-hexamers or hetero-octamers that 

Figure 16-73 Cell compartmentalization
by septins. (A) Septins form filaments in
the neck region between a mother yeast
cell and bud. (B) In this photomicrograph 
of human cultured cells, the DNA is 
stained blue and septins are labeled in 
green. The microtubules of primary cilia are 
labeled with an antibody that recognizes a 
modified (acetylated) form of tubulin (red) 
that is enriched in the axoneme. (C) A 
magnified image reveals a collar of septin 
at the base of the cilium. (A, from B. Byers 
and L. Goetsch, J. Cell Biol. 69:717-721, 
1976. With permission from Rockefeller 
University Press. B and C, from Q. Hu et 
al., Science 329:436-439, 2010. With 
permission from AAAS.) 


form nonpolar paired filaments (Figure 16-74). GTP binding is required for the 
folding of septin polypeptides, but the role of GTP hydrolysis in septin function is
not understood. Septin structures assemble and disassemble inside cells, but they 
are not as dynamic as actin filaments and microtubules. 
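
One way to see why such a rod is nonpolar is to treat it as a sequence of subunit names and check that it reads the same from either end. The sketch below uses the yeast subunit order shown in Figure 16-74 (Cdc11, Cdc12, Cdc3, Cdc10, Cdc10, Cdc3, Cdc12, Cdc11). Modeling polarity as a simple palindrome test is a deliberate simplification that ignores the orientation of each individual subunit, and the function names are hypothetical.

```python
# Subunit order along the yeast septin octamer, as labeled in Figure 16-74.
YEAST_SEPTIN_ROD = [
    "Cdc11", "Cdc12", "Cdc3", "Cdc10",
    "Cdc10", "Cdc3", "Cdc12", "Cdc11",
]


def is_nonpolar(rod):
    """Return True if the rod reads identically from both ends,
    i.e. its two ends are indistinguishable in this simple model."""
    return rod == rod[::-1]


def join_end_to_end(rod, copies):
    """Model a filament as several rods joined end to end."""
    return rod * copies


if __name__ == "__main__":
    print("octamer nonpolar:", is_nonpolar(YEAST_SEPTIN_ROD))
    filament = join_end_to_end(YEAST_SEPTIN_ROD, 3)
    print("three-rod filament nonpolar:", is_nonpolar(filament))
    # A hypothetical rod without the central symmetry, for contrast.
    half_rod = YEAST_SEPTIN_ROD[:4]
    print("half rod nonpolar:", is_nonpolar(half_rod))
```

Because each rod is symmetric about its central Cdc10 pair, any filament built by joining rods end to end is symmetric too, whereas a half rod on its own has two distinguishable ends.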

Summary 

Whereas tubulin and actin have been highly conserved in evolution, intermediate
filament proteins are very diverse. There are many tissue-specific forms of
intermediate filaments in the cytoplasm of animal cells, including keratin filaments in
epithelial cells, neurofilaments in nerve cells, and desmin filaments in muscle cells. The
primary function of these filaments is to provide mechanical strength. Septins
comprise an additional system of filaments that organize compartments inside cells.



[Figure 16-74 schematic labels, in order along the eight-subunit rod: Cdc11, Cdc12, Cdc3, Cdc10, Cdc10, Cdc3, Cdc12, Cdc11]

Figure 16-74 Septins polymerize to form paired filaments and sheets.
(A) Electron micrograph of a septin rod assembled by combining two copies
each of the four yeast septins illustrated at the right. The eight-subunit rod is
nonpolar because the central pair of subunits (Cdc10) creates a symmetrical
dimer. (B) Electron micrograph of paired septin filaments and sheets, assembled
from purified septins in the presence of high salt concentrations. (C) Paired septin
filaments may assemble by lateral association between filaments, mediated by
coiled-coils formed between the paired C-terminal extensions of Cdc3 and Cdc12
that project from each filament. (Images and schematics adapted from A. Bertin
et al., Proc. Natl Acad. Sci. USA 105:8274-8279, 2008. With permission from the
National Academy of Sciences.)

CELL POLARIZATION AND MIGRATION

A central challenge in cell biology is to understand how multiple individual 
molecular components collaborate to produce complex cell behaviors. The process
of cell migration, which we describe in this final section, relies on the coordinated
deployment of the components and processes that we have explored in
this chapter: the dynamic assembly and disassembly of cytoskeletal polymers, the
regulation and modification of their structure by polymer-associated proteins,
and the actions of motor proteins moving along the polymers or exerting tension
against them. How does the cell coordinate all these activities to define its polarity
and enable it to crawl? 

Many Cells Can Crawl Across a Solid Substratum 

Many cells move by crawling over surfaces rather than by using cilia or flagella
to swim. Predatory amoebae crawl continuously in search of food, and they can
easily be observed to attack and devour smaller ciliates and flagellates in a drop
of pond water (see Movie 1.4). In animals, almost all cell locomotion occurs by
crawling, with the notable exception of swimming sperm. During embryogenesis,
the structure of an animal is created by the migrations of individual cells to
specific target locations and by the coordinated movements of whole epithelial
sheets (discussed in Chapter 21). In vertebrates, neural crest cells are remarkable
for their long-distance migrations from their site of origin in the neural tube to a
variety of sites throughout the embryo (see Movie 21.5). Long-distance crawling
is fundamental to the construction of the entire nervous system: it is in this way
that the actin-rich growth cones at the advancing tips of developing axons travel
to their eventual synaptic targets, guided by combinations of soluble signals and
signals bound to cell surfaces and extracellular matrix along the way.

The adult animal also seethes with crawling cells. Macrophages and neutro- 
phils crawl to sites of infection and engulf foreign invaders as a critical part of 
the innate immune response. Osteoclasts tunnel into bone, forming channels that 
are filled in by the osteoblasts that follow after them, in a continuous process of 
bone remodeling and renewal. Similarly, fibroblasts migrate through connective
tissues, remodeling them where necessary and helping to rebuild damaged
structures at sites of injury. In an ordered procession, the cells in the epithelial lining
of the intestine travel up the sides of the intestinal villi, replacing absorptive cells
lost at the tip of the villus. Unfortunately, cell crawling also has a role in many
cancers, when cells in a primary tumor invade neighboring tissues and crawl into
blood vessels or lymph vessels and then emerge at other sites in the body to form
metastases.

Cell migration is a complex process that depends on the actin-rich cortex
beneath the plasma membrane. Three distinct activities are involved: protrusion,
in which the plasma membrane is pushed out at the front of the cell; attachment,
in which the actin cytoskeleton connects across the plasma membrane to the
substratum; and traction, in which the bulk of the trailing cytoplasm is drawn forward
(Figure 16-75). In some crawling cells, such as keratocytes from the fish
epidermis, these activities occur simultaneously, and the cells seem to glide forward
smoothly without changing shape. In other cells, such as fibroblasts, these
activities are more independent, and the locomotion is jerky and irregular.

Actin Polymerization Drives Plasma Membrane Protrusion 

The first step in locomotion, protrusion of a leading edge, frequently relies on
forces generated by actin polymerization pushing the plasma membrane outward.
Different cell types generate different types of protrusive structures, including
filopodia (also known as microspikes) and lamellipodia. These are filled with
dense cores of filamentous actin, which excludes membrane-enclosed organelles.
The structures differ primarily in the way in which the actin is organized by
actin-cross-linking proteins (see Figure 16-22).

Figure 16-75 A model of how forces generated in the actin-rich cortex move
a cell forward. The actin-polymerization-dependent protrusion and firm attachment
of a lamellipodium at the leading edge of the cell move the edge forward (green
arrows at front) and stretch the actin cortex. Contraction at the rear of the
cell propels the body of the cell forward (green arrow at back) to relax some of
the tension (traction). New focal contacts are made at the front, and old ones are
disassembled at the back as the cell crawls forward. The same cycle can be repeated,
moving the cell forward in a stepwise fashion. Alternatively, all steps can be
tightly coordinated, moving the cell forward smoothly. The newly polymerized cortical
actin is shown in red.


Filopodia, formed by migrating growth cones of neurons and some types of
fibroblasts, are essentially one-dimensional. They contain a core of long,
bundled actin filaments, which are reminiscent of those in microvilli but longer and
thinner, as well as more dynamic. Lamellipodia, formed by epithelial cells and
fibroblasts, as well as by some neurons, are two-dimensional, sheetlike structures.
They contain a cross-linked mesh of actin filaments, most of which lie in a plane
parallel to the solid substratum. Invadopodia and related structures known as
podosomes represent a third type of actin-rich protrusion. These extend in three
dimensions and are important for cells to cross tissue barriers, as when a
metastatic cancer cell invades the surrounding tissue. Invadopodia contain many of
the same actin-regulatory components as filopodia and lamellipodia, and they
also degrade the extracellular matrix, which requires the delivery of vesicles
containing matrix-degrading proteases.

A distinct form of membrane protrusion called blebbing is often observed in
vivo or when cells are cultured on a pliable extracellular matrix substratum. Blebs
form when the plasma membrane detaches locally from the underlying actin
cortex, thereby allowing cytoplasmic flow to push the membrane outward
(Figure 16-76). Bleb formation also depends on hydrostatic pressure within the cell,
which is generated by the contraction of actin and myosin assemblies. Once blebs
have extended, actin filaments reassemble on the bleb membrane to form a new
actin cortex. Recruitment of myosin II and contraction of actin and myosin can
then power retraction of membrane blebs. Alternatively, extension of new blebs
from old ones can drive cell migration.

Figure 16-76 Membrane bleb induced by disruption of the actin cortex. On
the left is a light micrograph showing a spherical membrane protrusion or bleb
induced by laser ablation of a small region of the actin cortex. The cortex is labeled
green in the middle image by expression of GFP-actin. (Courtesy of Ewa Paluch.)

Lamellipodia Contain All of the Machinery Required for Cell Motility

Lamellipodia have been particularly well studied in the epithelial cells of the
epidermis of fish and frogs; these epithelial cells are known as keratocytes because
of their abundant keratin filaments. These cells normally cover the animal by
forming an epithelial sheet, and they are specialized to close wounds very rapidly,
moving at rates of up to 30 μm/min. When cultured as individual cells, keratocytes
assume a distinctive shape with a very large lamellipodium and a small, trailing
cell body that is not attached to the substratum (Figure 16-77). Fragments of this
lamellipodium can be sliced off with a micropipette. Although the fragments
generally lack microtubules and membrane-enclosed organelles, they continue to
crawl normally, looking like tiny keratocytes.

The dynamic behavior of actin filaments in keratocyte lamellipodia can be
studied by labeling a small patch of actin and examining its fate. This reveals that,
while the lamellipodia crawl forward, the actin filaments remain stationary with
respect to the substratum. The actin filaments in the meshwork are mostly
oriented with their plus ends facing forward. The minus ends are frequently attached
to the sides of other actin filaments by Arp 2/3 complexes (see Figure 16-16),
helping to form the two-dimensional web (Figure 16-78). The web as a whole is
undergoing treadmilling, assembling at the front and disassembling at the back,
reminiscent of the treadmilling that occurs in individual actin filaments discussed
previously (see Figure 16-14).
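
As a rough worked example of what network treadmilling demands of each
filament, the sketch below estimates how many actin subunits per second must
add to a single plus end for the leading edge to keep pace with a keratocyte
moving at 30 μm/min. Because the filaments stay fixed relative to the
substratum, the edge advances only as fast as the filament tips grow. The 2.7 nm
of filament length gained per subunit and the 35° tilt of the filaments relative to
the direction of movement are assumed values supplied for illustration, and the
function name is hypothetical.

```python
import math

# Assumed value (not from the text): filament length gained per added subunit.
ACTIN_SUBUNIT_RISE_NM = 2.7


def subunits_per_second(edge_speed_um_per_min, filament_tilt_deg=35.0):
    """Subunit additions per second that one plus end needs to match the edge.

    filament_tilt_deg is the assumed angle between the filament and the
    direction of protrusion; a tilted filament must grow faster to advance
    the membrane by the same distance.
    """
    edge_speed_nm_per_s = edge_speed_um_per_min * 1000.0 / 60.0
    advance_per_subunit_nm = ACTIN_SUBUNIT_RISE_NM * math.cos(
        math.radians(filament_tilt_deg)
    )
    return edge_speed_nm_per_s / advance_per_subunit_nm


if __name__ == "__main__":
    # 30 um/min is the upper keratocyte speed quoted in the text.
    print(round(subunits_per_second(30.0), 1))
```

Under these assumptions the requirement is on the order of 200 subunits per
second per growing end; the exact figure depends on the assumed geometry.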


Figure 16-78 Actin filament nucleation and web formation by the
Arp 2/3 complex in lamellipodia. (A) A keratocyte with actin filaments
labeled in red by fluorescent phalloidin and the Arp 2/3 complex labeled in
green with an antibody against one of its subunits. The Arp 2/3 complex
is highly concentrated near the front of the lamellipodium, where actin
nucleation is most active. (B) Electron micrograph of a platinum-shadowed
replica of the leading edge of a keratocyte, showing the dense actin filament
meshwork. The labels denote areas enlarged in (C). (C) Close-up views of the
marked regions of the actin web at the leading edge shown in (B). Numerous
branched filaments can be seen, with the characteristic 70° angle formed
when the Arp 2/3 complex nucleates a new actin filament off the side of a
preexisting filament (see Figure 16-16). (From T. Svitkina and G. Borisy,
J. Cell Biol. 145:1009-1026, 1999. With permission from the authors.)


Figure 16-77 Migratory keratocytes from a fish epidermis. (A) Light micrographs
of a keratocyte in culture, taken about 15 seconds apart. This cell is moving
at about 15 μm/min (Movie 16.13 and see Movie 1.1). (B) Keratocyte seen by
scanning electron microscopy, showing its broad, flat lamellipodium and small
cell body, including the nucleus, carried up above the substratum at the rear.
(C) Distribution of cytoskeletal filaments in this cell. Actin filaments (red) fill
the large lamellipodium and are responsible for the cell's rapid movement.
Microtubules (green) and intermediate filaments (blue) are restricted to the
regions close to the nucleus. (A and B, courtesy of Juliet Lee.)

Maintenance of unidirectional motion by lamellipodia is thought to require
the cooperation and mechanical integration of several factors. Filament
nucleation is localized at the leading edge, with new actin filament growth occurring
primarily in that location to push the plasma membrane forward. Most filament
depolymerization occurs at sites located well behind the leading edge. Because
cofilin (see Figure 16-20) binds cooperatively and preferentially to actin filaments
containing ADP-actin (the D form), the new T-form filaments generated at the
leading edge should be resistant to depolymerization by cofilin (Figure 16-79).
As the filaments age and ATP hydrolysis proceeds, cofilin can efficiently
disassemble the older filaments. Thus, the delayed ATP hydrolysis by filamentous
actin is thought to provide the basis for a mechanism that maintains an efficient,
unidirectional treadmilling process in the lamellipodium (Figure 16-80); it also
explains the intracellular movement of bacterial pathogens such as Listeria (see
Figure 16-25).


Figure 16-79 Cofilin in lamellipodia. (A) A keratocyte with actin filaments
labeled in red by fluorescent phalloidin, and cofilin labeled in green with a
fluorescent antibody. Although the dense actin meshwork reaches all the way
through the lamellipodium, cofilin is not found at the very leading edge.
(B) Close-up view of the region marked with the white rectangle in (A). The actin
filaments closest to the leading edge, which are also the ones that have formed
most recently and that are most likely to contain ATP-actin (rather than
ADP-actin), are generally not associated with cofilin. (From T. Svitkina and
G. Borisy, J. Cell Biol. 145:1009-1026, 1999. With permission from the authors.)


Myosin Contraction and Cell Adhesion Allow Cells to Pull 
Themselves Forward 

Forces generated by actin filament polymerization at the front of a migrating cell
are transmitted to the underlying substratum to drive cell motion. For the leading
edge of a migrating cell to advance, protrusion of the membrane must be followed
by adhesion to the substratum at the front. Conversely, in order for the cell body
to follow, contraction must be coupled with de-adhesion at the rear of the cell.
The processes contributing to migration are therefore tightly regulated in space
and time, with actin polymerization, dynamic adhesions, and myosin contraction
being employed to coordinate movement. Myosin II operates in at least two ways
to assist cell migration. The first is by helping to connect the actin cytoskeleton to
the substratum through integrin-mediated adhesions. Forces generated by both
actin polymerization and myosin activity create tension at attachment sites,
promoting their maturation into focal adhesions, which are dynamic assemblies of
structural and signaling proteins that link the migrating cell to the extracellular
matrix (see Figure 19-59). A second mechanism involves bipolar myosin II
filaments, which associate with the actin filaments at the rear of the lamellipodium
and pull them into a new orientation, from nearly perpendicular to the leading
edge to almost parallel to the leading edge. This sarcomere-like contraction
prevents protrusion, and it pinches in the sides of the locomoting lamellipodium,
helping to gather in the sides of the cell as it moves forward (Figure 16-81).

Figure 16-80 A model for protrusion of the actin meshwork at the leading
edge. Two time points during advance of the lamellipodium are illustrated, with
newly assembled structures at the later time point shown in a lighter color.
Nucleation is mediated by the Arp 2/3 complex at the front. Newly nucleated actin
filaments are attached to the sides of preexisting filaments, primarily at a 70°
angle. Filaments elongate, pushing the plasma membrane forward because of some
sort of anchorage of the array behind. At a steady rate, actin filament plus ends
become capped. After newly polymerized actin subunits hydrolyze their bound ATP in
the filament lattice, the filaments become susceptible to depolymerization by cofilin.
This cycle causes a spatial separation between net filament assembly at the
front and net filament disassembly at the rear, so that the actin filament network as
a whole can move forward, even though the individual filaments within it remain
stationary with respect to the substratum. Not all of the actin disassembles, however,
and actin at the rear of the lamellipodium contributes to subsequent steps of
migration together with myosin.

Figure 16-81 Contribution of myosin II to polarized cell motility.
(A) Myosin II bipolar filaments bind to actin filaments in the lamellipodial
meshwork and cause network contraction. The myosin-driven reorientation
of the actin filaments forms an actin bundle that recruits more myosin II
and helps generate the contractile forces required for retraction of the
trailing edge of the moving cell. (B) A fragment of the large lamellipodium
of a keratocyte can be separated from the main cell body either by surgery
with a micropipette or by treating the cell with certain drugs. Many of these
fragments continue to move rapidly, with the same overall cytoskeletal
organization as the intact keratocytes. Actin (blue) forms a protrusive
meshwork at the front of the fragment. Myosin II (pink) is gathered into
a band at the rear. (From A. Verkhovsky et al., Curr. Biol. 9:11-20, 1999.
With permission from Elsevier.)

Actin-mediated protrusions can only push the leading edge of the cell forward
if there are strong interactions between the actin network and the focal adhesions
that link the cell to the substrate. When these interactions are disengaged,
polymerization pressure at the leading edge and myosin-dependent contraction cause
the actin network to slip back, resulting in a phenomenon known as retrograde
flow (Figure 16-82).

The traction forces generated by locomoting cells exert a significant pull on the
substratum. By growing cells on a surface coated with tiny flexible posts, the force
exerted on the substratum can be calculated by measuring the deflection of each
post from its vertical position (Figure 16-83). In a living animal, most crawling
cells move across a semiflexible substratum made of extracellular matrix, which
can be deformed and rearranged by these cell forces. Conversely, mechanical
tension or stretching applied externally to a cell will cause it to assemble stress fibers
and focal adhesions, and become more contractile. Although poorly understood,
this two-way mechanical interaction between cells and their physical
environment is thought to help vertebrate tissues organize themselves.
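
The conversion from post deflection to force can be sketched as follows, treating
each post as a cylindrical elastic beam that behaves like a linear spring for small
deflections. This beam model is a standard engineering approximation rather than
something stated in the text, and every numerical value other than the 6.1 μm
post height (the post diameter, the stiffness of the post material, and the
deflection) is an illustrative assumption, not a value reported for the experiment
in Figure 16-83. The function names are hypothetical.

```python
import math


def post_spring_constant(youngs_modulus_pa, diameter_m, height_m):
    """Bending stiffness (N/m) of a cylindrical post fixed at its base.

    Small-deflection beam theory: k = 3*pi*E*D**4 / (64*L**3).
    """
    return (3.0 * math.pi * youngs_modulus_pa * diameter_m**4
            / (64.0 * height_m**3))


def traction_force(deflection_m, spring_constant_n_per_m):
    """Force (N) pulling on the post tip, assuming a linear spring."""
    return spring_constant_n_per_m * deflection_m


if __name__ == "__main__":
    # Assumed values: a soft elastomer (2.5 MPa) and a 2 um wide post.
    k = post_spring_constant(youngs_modulus_pa=2.5e6,
                             diameter_m=2.0e-6,
                             height_m=6.1e-6)
    force_n = traction_force(deflection_m=0.5e-6, spring_constant_n_per_m=k)
    print(f"{force_n * 1e9:.1f} nN")
```

Because stiffness falls with the cube of the post height, taller posts of the same
material present the cell with a much softer substratum, which is one reason the
post geometry has to be known before deflections can be read as forces.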







Cell Polarization Is Controlled by Members of the Rho Protein 
Family 

Cell migration requires long-distance communication and coordination between 
one end of a cell and the other. During directed migration, it is important that the 
front end of the cell remain structurally and functionally distinct from the back
end. In addition to driving local mechanical processes such as protrusion at the 
front and retraction at the rear, the cytoskeleton is responsible for coordinating 
cell shape, organization, and mechanical properties from one end of the cell to 
the other, a distance that is typically tens of micrometers for animal cells. 

In many cases, including but not limited to cell migration, large-scale
cytoskeletal coordination takes the form of the establishment of cell polarity, where
a cell builds different structures with distinct molecular components at the front
versus the back, or at the top versus the bottom. Cell locomotion requires an
initial polarization of the cell to set it off in a particular direction. Carefully controlled
cell-polarization processes are also required for oriented cell divisions in tissues
and for formation of a coherent, organized multicellular structure. Genetic
studies in yeast, flies, and worms have provided most of our current understanding
of the molecular basis of cell polarity. The mechanisms that generate cell
polarity in vertebrates are only beginning to be explored. In all known cases, however,
the cytoskeleton has a central role, and many of the molecular components have
been evolutionarily conserved.

Figure 16-82 Control of cell-substratum adhesion at the leading
edge of a migrating cell. (A) Actin monomers assemble on the barbed end
of actin filaments at the leading edge. Transmembrane integrin proteins (blue)
help form focal adhesions that link the cell membrane to the substrate. (B) If
there is no interaction between the actin filaments and focal adhesions, the actin
filament is driven rearward by newly assembled actin. Myosin motors (green)
also contribute to filament movement. (C) Interactions between actin-binding
adaptor proteins (brown) and integrins link the actin cytoskeleton to the substratum.
Myosin-mediated contractile forces are then transmitted through the focal adhesion
to generate traction on the extracellular matrix, and new actin polymerization drives
the leading edge forward in a protrusion.

The establishment of many kinds of cell polarity depends on the local
regulation of the actin cytoskeleton by external signals. Many of these signals seem to
converge inside the cell on a group of closely related monomeric GTPases that
are members of the Rho protein family: Cdc42, Rac, and Rho. Like other
monomeric GTPases, the Rho proteins act as molecular switches that cycle between
an active GTP-bound state and an inactive GDP-bound state (see Figure 3-66).
Activation of Cdc42 on the inner surface of the plasma membrane triggers actin
polymerization and bundling to form filopodia. Activation of Rac promotes actin
polymerization at the cell periphery, leading to the formation of sheetlike
lamellipodial extensions. Activation of Rho promotes both the bundling of actin
filaments with myosin II filaments into stress fibers and the clustering of integrins
and associated proteins to form focal adhesions (Figure 16-84). These dramatic
and complex structural changes occur because each of these three molecular
switches has numerous downstream target proteins that affect actin organization
and dynamics.

Figure 16-83 Traction forces exerted by a motile cell. (A) Tiny flexible pillars
attached to the substratum bend in response to traction forces. (B) Scanning
electron micrograph of a cell on a substratum coated with pillars that are
6.1 μm in height. Pillar deflections are used to calculate force vectors
corresponding to inward pulling forces on the underlying substratum. (Adapted
from J. Fu et al., Nat. Methods 7:733-736, 2010. With permission from
Macmillan Publishers.)

Figure 16-84 The dramatic effects of Cdc42, Rac, and Rho on actin
organization in fibroblasts. In each case, the actin filaments have been labeled
with fluorescent phalloidin. (A) Serum-starved fibroblasts have actin filaments
primarily in the cortex, and relatively few stress fibers. (B) Microinjection of a
constitutively activated form of Cdc42 causes the protrusion of many long
filopodia at the cell periphery. (C) Microinjection of a constitutively activated
form of Rac, a closely related monomeric GTPase, causes the formation of an
enormous lamellipodium that extends from the entire circumference of the cell.
(D) Microinjection of a constitutively activated form of Rho causes the rapid
assembly of many prominent stress fibers. (From A. Hall, Science 279:509-514,
1998. With permission from AAAS.)

Some key targets of activated Cdc42 are members of the WASp protein family.
Human patients deficient in WASp suffer from Wiskott-Aldrich Syndrome, a
severe form of immunodeficiency in which immune system cells have abnormal
actin-based motility and platelets do not form normally. Although WASp itself
is expressed only in blood cells and immune system cells, other more ubiquitous
versions enable activated Cdc42 to enhance actin polymerization in many
cell types. WASp proteins can exist in an inactive folded conformation and an
activated open conformation. Association with Cdc42-GTP stabilizes the open
form of WASp, enabling it to bind to the Arp 2/3 complex and strongly enhance
its actin-nucleating activity (see Figure 16-16). In this way, activation of Cdc42
increases actin nucleation.

Rac-GTP also activates WASp family members. Additionally, it activates the
cross-linking activity of the gel-forming protein filamin and inhibits the
contractile activity of the motor protein myosin II. It thereby stabilizes lamellipodia and
inhibits the formation of contractile stress fibers (Figure 16-85A).

Rho-GTP has a very different set of targets. Instead of activating the Arp 2/3
complex to build actin networks, Rho-GTP turns on formin proteins to construct
parallel actin bundles. At the same time, Rho-GTP activates a protein kinase that
indirectly inhibits the activity of cofilin, leading to actin filament stabilization. The
same protein kinase inhibits a phosphatase acting on myosin light chains (see
Figure 16-39). The consequent increase in the net amount of myosin light chain
phosphorylation increases the amount of contractile myosin motor protein
activity in the cell, enhancing the formation of tension-dependent structures such as
stress fibers (Figure 16-85B).

In some cell types, Rac-GTP activates Rho, usually at a rate that is slow
compared to Rac's activation of the Arp 2/3 complex. This enables cells to use the Rac
pathway to build a new actin structure while subsequently activating the Rho
pathway to generate a contractility that builds up tension in this structure. This
occurs, for example, during the formation and maturation of cell-cell contacts.
As we will explore in more detail below, the communication between the Rac and
Rho pathways also facilitates maintenance of the large-scale differences between
the cell front and the cell rear during migration.


Extracellular Signals Can Activate the Three Rho Protein Family
Members

The activation of the monomeric GTPases Rho, Rac, and Cdc42 occurs through an
exchange of GTP for a tightly bound GDP molecule, catalyzed by guanine
nucleotide exchange factors (GEFs). Of the many GEFs that have been identified in the
human genome, some are specific for an individual Rho family GTPase, whereas
others seem to act on multiple family members. Different GEFs are restricted to
specific tissues and even specific subcellular locations, and they are sensitive to
distinct kinds of regulatory inputs. GEFs can be activated by extracellular cues
through cell-surface receptors, or in response to intracellular signals. GEFs may
also act as scaffolds that direct GTPases to downstream effectors. Interestingly,
several of the Rho family GEFs associate with the growing ends of microtubules
by binding to one of the +TIPs. This provides a connection between the dynamics
of the microtubule cytoskeleton and the large-scale organization of the actin
cytoskeleton; such a connection is important for the overall integration of cell shape
and movement.


Figure 16-85 The contrasting effects of Rac and Rho activation on actin
organization. (A) Activation of the small GTPase Rac leads to alterations in actin
accessory proteins that tend to favor the formation of actin networks, as in
lamellipodia. Several different pathways contribute independently. Rac-GTP
activates members of the WASp protein family, which in turn activate actin
nucleation and branched web formation by the Arp 2/3 complex. In a parallel
pathway, Rac-GTP activates a protein kinase, PAK, which has several targets
including the web-forming cross-linker filamin, which is activated by
phosphorylation, and the myosin light chain kinase (MLCK), which is
inhibited by phosphorylation. Inhibition of MLCK results in decreased
phosphorylation of the myosin regulatory light chain and leads to myosin II
filament disassembly and a decrease in contractile activity. In some
cells, PAK also directly inhibits myosin II activity by phosphorylation of the
myosin heavy chain (MHC). (B) Activation of the related GTPase Rho leads to
nucleation of actin filaments by formins and increases contraction by myosin II,
promoting the formation of contractile actin bundles such as stress fibers.
Activation of myosin II by Rho requires a Rho-dependent protein kinase called
Rock. This kinase inhibits the phosphatase that removes the activating phosphate
groups from myosin II light chains (MLC); it may also directly phosphorylate the
myosin light chains in some cell types. Rock also activates other protein kinases,
such as LIM kinase, which in turn contributes to the formation of stable
contractile actin filament bundles by inhibiting the actin depolymerizing factor
cofilin. A similar signaling pathway is important for forming the contractile
ring necessary for cytokinesis (see Figure 17-44).


External Signals Can Dictate the Direction of Cell Migration

Chemotaxis is the movement of a cell toward or away from a source of some
diffusible chemical. These external signals act through Rho family proteins to set up
large-scale cell polarity by influencing the organization of the cell motility
apparatus. One well-studied example is the chemotactic movement of a class of white
blood cells, called neutrophils, toward a source of bacterial infection. Receptor
proteins on the surface of neutrophils enable them to detect very low
concentrations of N-formylated peptides that are derived from bacterial proteins (only
prokaryotes begin protein synthesis with N-formylmethionine). Using these
receptors, neutrophils are guided to bacterial targets by their ability to detect a
difference of only 1% in the concentration of these diffusible peptides on one side
of the cell versus the other (Figure 16-86A).
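
To give a feel for how small a 1% difference is at the level of receptors, the
sketch below uses a simple equilibrium binding model, in which the fraction of
receptors occupied is C / (C + Kd), to compare the two sides of a cell. The
binding model, the dissociation constant, the peptide concentration, and the
number of receptors per side are all assumptions chosen for illustration; none
of them comes from the text, and the function names are hypothetical.

```python
def fraction_bound(concentration, kd):
    """Equilibrium fraction of receptors occupied by ligand (assumed model)."""
    return concentration / (concentration + kd)


def extra_bound_receptors_at_front(mean_concentration, kd,
                                   relative_difference=0.01,
                                   receptors_per_side=10_000):
    """Difference in occupied receptors between the up-gradient and
    down-gradient sides of a cell for a given fractional concentration
    difference across it."""
    front = mean_concentration * (1.0 + relative_difference / 2.0)
    rear = mean_concentration * (1.0 - relative_difference / 2.0)
    return receptors_per_side * (fraction_bound(front, kd)
                                 - fraction_bound(rear, kd))


if __name__ == "__main__":
    # Assumed: peptide concentration equal to an assumed Kd of 10 nM.
    print(extra_bound_receptors_at_front(mean_concentration=10e-9, kd=10e-9))
```

With these assumed numbers the front of the cell has only about 25 more occupied
receptors than the rear out of 10,000 on each side, which illustrates why the
amplification steps that follow receptor binding matter so much.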

In this case, and in the chemotaxis of Dictyostelium amoebae toward a source
of cyclic AMP, binding of the chemoattractant to its G-protein-coupled receptor
activates phosphoinositide 3-kinases (PI3Ks) (see Figure 15-52), which
generate a signaling molecule [PI(3,4,5)P3] that in turn activates the Rac GTPase. Rac
then activates the Arp 2/3 complex leading to lamellipodial protrusion. Through
an unknown mechanism, accumulation of the polarized actin web at the
leading edge causes further local enhancement of PI3K activity in a positive feedback
loop, strengthening the induction of protrusion. The PI(3,4,5)P3 that activates Rac
cannot diffuse far from its site of synthesis, since it is rapidly converted back into
PI(4,5)P2 by a constitutively active lipid phosphatase. At the same time, binding
of the chemoattractant ligand to its receptor activates another signaling pathway
that turns on Rho and enhances myosin-based contractility. The two processes
directly inhibit each other, such that Rac activation dominates in the front of the
cell and Rho activation dominates in the rear (Figure 16-86B). This enables the
cell to maintain its functional polarity with protrusion at the leading edge and
contraction at the back.
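
How mutual inhibition can turn a modest difference in input into an
all-or-none difference in output can be illustrated with a toy model. In the sketch
below, two activities labeled rac and rho each decay at a fixed rate and are each
produced at a rate that is suppressed by the other. The equations, the
Hill-type form of the inhibition, and every parameter value are hypothetical
choices made for illustration; they are not a description of the actual
biochemistry, and the function name is invented.

```python
def settle(rac_input, rho_input, hill=4, dt=0.01, steps=5000):
    """Integrate two mutually inhibiting activities with forward Euler steps.

    Toy equations (assumed, not measured):
        d(rac)/dt = rac_input / (1 + rho**hill) - rac
        d(rho)/dt = rho_input / (1 + rac**hill) - rho
    Returns the (rac, rho) pair reached after steps * dt time units.
    """
    rac = rho = 0.5  # both activities start out equal
    for _ in range(steps):
        d_rac = rac_input / (1.0 + rho**hill) - rac
        d_rho = rho_input / (1.0 + rac**hill) - rho
        rac += dt * d_rac
        rho += dt * d_rho
    return rac, rho


if __name__ == "__main__":
    # Assumed inputs: the Rac pathway is stimulated 10% more strongly than
    # the Rho pathway at the front and 10% less strongly at the rear.
    front = settle(rac_input=3.3, rho_input=3.0)
    rear = settle(rac_input=2.7, rho_input=3.0)
    print("front (rac, rho):", front)
    print("rear  (rac, rho):", rear)
```

In this toy model a 10% bias in input is enough to leave one activity high and
the other almost completely suppressed, the qualitative behavior expected of
two pathways that antagonize each other.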

Nondiffusible chemical cues attached to the extracellular matrix or to the
surface of cells can also influence the direction of cell migration. When these
signals activate receptors, they can cause increased cell adhesion and directed actin
polymerization. Most long-distance cell migrations in animals, including
neural-crest-cell migration and the travels of neuronal growth cones, depend on a
combination of diffusible and nondiffusible signals to steer the locomoting cells or
growth cones to their proper destinations.

Communication Among Cytoskeletal Elements Coordinates
Whole-Cell Polarization and Locomotion

The interconnected cytoskeleton is crucial for cell migration. Although movement
is driven primarily by actin polymerization and myosin contractility, septins and
intermediate filaments also participate. For example, vimentin intermediate
filament networks associate with integrins at focal adhesions, and vimentin-deficient
fibroblasts display impaired mechanical stability, migration, and contractile
capacity. Furthermore, disruption of linker proteins that connect different
cytoskeletal elements, including several plakins and KASH proteins, leads to defects
in cell polarization and migration. Thus, interactions among cytoplasmic filament
systems, as well as mechanical linkage to the nucleus, are required for complex,
whole-cell behaviors such as migration.

Cells also use microtubules to help organize persistent movement in a
specific direction. In many locomoting cells, the position of the centrosome is
influenced by the location of protrusive actin polymerization. Activation of receptors
on the protruding front edge of a cell might locally activate dynein motor proteins
that move the centrosome by pulling on its microtubules. Several effector
proteins downstream of Rac and Rho modulate microtubule dynamics directly: for
example, a protein kinase activated by Rac can phosphorylate (and thereby
inhibit) the tubulin-binding protein stathmin (see Panel 16-4), thereby stabilizing
microtubules.


Figure 16-86 Neutrophil polarization and chemotaxis. (A) The pipette tip at
the right is leaking a small amount of the bacterial peptide formyl-Met-Leu-Phe,
which is recognized by the human neutrophil as the product of a foreign
invader. The neutrophil quickly extends a new lamellipodium toward the source of
the chemoattractant peptide (top). It then extends this lamellipodium and polarizes
its cytoskeleton so that contractile myosin II is located primarily at the rear,
opposite the position of the lamellipodium (middle). Finally, the cell crawls toward
the source of the peptide (bottom). If a real bacterium were the source of the
peptide, rather than an investigator's pipette, the neutrophil would engulf the
bacterium and destroy it (see also Figure 16-3 and Movie 16.14). (B) Binding of
bacterial molecules to G-protein-coupled receptors on the neutrophil stimulates
directed motility. These receptors are found all over the surface of the cell, but
are more likely to be bound to the bacterial ligand at the front. Two distinct
signaling pathways contribute to the cell's polarization. At the front of the
cell, stimulation of the Rac pathway leads, via the trimeric G protein Gi, to growth
of protrusive actin networks. Second messengers within this pathway are
short-lived, so protrusion is limited to the region of the cell closest to the
stimulant. The same receptor also stimulates a second signaling pathway, via the
trimeric G proteins G12 and G13, that triggers the activation of Rho. The two
pathways are mutually antagonistic. Since Rac-based protrusion is active at the
front of the cell, Rho is activated only at the rear of the cell, stimulating
contraction of the cell rear and assisting directed movement. (A, from
O.D. Weiner et al., Nat. Cell Biol. 1:75-81, 1999. With permission from Macmillan
Publishers Ltd.)

In turn, microtubules influence actin rearrangements and cell adhesion. The
centrosome nucleates a large number of dynamic microtubules, and its
repositioning means that the plus ends of many of these microtubules extend into the
protrusive region of the cell. Direct interactions with microtubules help guide focal
adhesion dynamics in migrating cells. Microtubules might also influence actin
filament formation by delivering Rac-GEFs that bind to the +TIPs traveling on
growing microtubule ends. Microtubules also transport cargoes to and from the focal
adhesions, thereby affecting their signaling and disassembly. Thus, microtubules
reinforce the polarity information that the actin cytoskeleton receives from the
outside world, allowing a sensitive response to weak signals and enabling motility
to persist in the same direction for a prolonged period.


WHAT WE DON'T KNOW


• How is the cell cortex regulated
locally and globally to coordinate its
activities at different places on the
cell surface? What determines, for
example, where filopodia form?

• How are actin-regulatory proteins 
controlled spatially in the cytoplasm to 
generate multiple distinct types of actin 
arrays in the same cell? 


Summary 

Whole-cell movements and the large-scale shaping and structuring of cells require
the coordinated activities of all three basic filament systems along with a large
variety of cytoskeletal accessory proteins, including motor proteins. Cell crawling, a
widespread behavior important in embryonic development and also in wound
healing, tissue maintenance, and immune system function in the adult animal,
is a prime example of such complex, coordinated cytoskeletal action. For a cell to
crawl, it must generate and maintain an overall structural polarity, which is
influenced by external cues. In addition, the cell must coordinate protrusion at the
leading edge (by assembly of new actin filaments), adhesion of the newly protruded part
of the cell to the substratum, and forces generated by molecular motors to bring the
cell body forward.


• Are there biologically important 
processes occurring inside a 
microtubule? 

• How can we account for the fact that 
there are many different kinesins and 
myosins in the cytoplasm but only one 
dynein? 

• Mutations in the nuclear lamin 
proteins cause a large number of 
diseases called laminopathies. What do 
we not understand about the nuclear 
lamina that could account for this fact? 


PROBLEMS 

Which statements are true? Explain why or why not. 

16-1 The role of ATP hydrolysis in actin polymerization
is similar to the role of GTP hydrolysis in tubulin
polymerization: both serve to weaken the bonds in the polymer
and thereby promote depolymerization.

16-2 Motor neurons trigger action potentials in muscle
cell membranes that open voltage-sensitive Ca2+ channels
in T tubules, allowing extracellular Ca2+ to enter the
cytosol, bind to troponin C, and initiate rapid muscle
contraction.

16-3 In most animal cells, minus-end directed microtubule
motors deliver their cargo to the periphery of the cell,
whereas plus-end directed microtubule motors deliver
their cargo to the interior of the cell.

Discuss the following problems. 

16-4 The concentration of actin in cells is 50-100 times 
greater than the critical concentration observed for pure 
actin in a test tube. How is this possible? What prevents the 
actin subunits in cells from polymerizing into filaments? 
Why is it advantageous to the cell to maintain such a large 
pool of actin subunits? 

16-5 Detailed measurements of sarcomere length and
tension during isometric contraction in striated muscle
provided crucial early support for the sliding-filament
model of muscle contraction. Based on your understanding
of the sliding-filament model and the structure of a
sarcomere, propose a molecular explanation for the
relationship of tension to sarcomere length in the portions of
Figure Q16-1 marked I, II, III, and IV. (In this muscle, the
length of the myosin filament is 1.6 μm, and the lengths of
the actin thin filaments that project from the Z discs are 1.0
μm.)

Figure Q16-1 Tension as a function of sarcomere length during
isometric contraction (Problem 16-5).

16-6 At 1.4 mg/mL pure tubulin, microtubules grow at
a rate of about 2 μm/min. At this growth rate, how many
αβ-tubulin dimers (8 nm in length) are added to the ends
of a microtubule each second?

Figure Q16-2 Model for microtubule nucleation by pure αβ-tubulin
dimers (Problem 16-7). Panel labels: linear growth; lateral association.


16-7 A solution of pure αβ-tubulin dimers is thought to
nucleate microtubules by forming a linear protofilament
about seven dimers in length. At that point, the probabilities
that the next αβ-dimer will bind laterally or to the end
of the protofilament are about equal. The critical event
for microtubule formation is thought to be the first lateral
association (Figure Q16-2). How does lateral association
promote the subsequent rapid formation of a microtubule?

16-8 How does a centrosome “know” when it has found
the center of the cell?

16-9 The movements of single motor-protein molecules
can be analyzed directly. Using polarized laser light, it is
possible to create interference patterns that exert a
centrally directed force, ranging from zero at the center to a
few piconewtons at the periphery (about 200 nm from the
center). Individual molecules that enter the interference
pattern are rapidly pushed to the center, allowing them to
be captured and moved at the experimenter's discretion.

Using such “optical tweezers,” single kinesin molecules
can be positioned on a microtubule that is fixed to
a coverslip. Although a single kinesin molecule cannot
be seen optically, it can be tagged with a silica bead and
tracked indirectly by following the bead (Figure Q16-3A).
In the absence of ATP, the kinesin molecule remains at the
center of the interference pattern, but with ATP it moves
toward the plus end of the microtubule. As kinesin moves
along the microtubule, it encounters the force of the
interference pattern, which simulates the load kinesin carries
during its actual function in the cell. Moreover, the pressure
against the silica bead counters the effects of Brownian
(thermal) motion, so that the position of the bead more
accurately reflects the position of the kinesin molecule on
the microtubule.

A trace of the movements of a kinesin molecule
along a microtubule is shown in Figure Q16-3B.

A. As shown in Figure Q16-3B, all movement of kinesin
is in one direction (toward the plus end of the microtubule).
What supplies the free energy needed to ensure a
unidirectional movement along the microtubule?

B. What is the average rate of movement of kinesin
along the microtubule?

C. What is the length of each step that a kinesin takes
as it moves along a microtubule?

D. From other studies it is known that kinesin has two
globular domains that can each bind to β-tubulin, and that
kinesin moves along a single protofilament in a microtubule.
In each protofilament, the β-tubulin subunit repeats
at 8-nm intervals. Given the step length and the interval
between β-tubulin subunits, how do you suppose a kinesin
molecule moves along a microtubule?

E. Is there anything in the data in Figure Q16-3B that
tells you how many ATP molecules are hydrolyzed per
step?

16-10 A mitochondrion 1 μm long can travel the 1 meter
length of the axon from the spinal cord to the big toe in a 
day. The Olympic men's freestyle swimming record for 200 
meters is 1.75 minutes. In terms of body lengths per day, 
who is moving faster: the mitochondrion or the Olympic 
record holder? (Assume that the swimmer is 2 meters tall.) 

16-11 Cofilin preferentially binds to older actin filaments
and promotes their disassembly. How does cofilin
distinguish old filaments from new ones?

16-12 Why is it that intermediate filaments have identical
ends and lack polarity, whereas actin filaments and
microtubules have two distinct ends with a defined polarity?


16-13 How is the unidirectional motion of a lamellipodium
maintained?


Figure Q16-3 Movement of kinesin along a microtubule (Problem
16-9). (A) Experimental set-up, with kinesin linked to a silica bead,
moving along a microtubule. (B) Position of kinesin (as visualized by
the position of the silica bead) relative to the center of the interference
pattern, as a function of time (seconds) of movement along the microtubule.
The jagged nature of the trace results from Brownian motion of
the bead.

The only way to make a new cell is to duplicate a cell that already exists. This
simple fact, first established in the middle of the nineteenth century, carries with it a
profound message for the continuity of life. All living organisms, from the
unicellular bacterium to the multicellular mammal, are products of repeated rounds of
cell growth and division extending back in time to the beginnings of life on Earth
over three billion years ago.

A cell reproduces by performing an orderly sequence of events in which it
duplicates its contents and then divides in two. This cycle of duplication and
division, known as the cell cycle, is the essential mechanism by which all living things
reproduce. In unicellular species, such as bacteria and yeasts, each cell division
produces a complete new organism. In multicellular species, long and complex
sequences of cell divisions are required to produce a functioning organism. Even
in the adult body, cell division is usually needed to replace cells that die. In fact,
each of us must manufacture many millions of cells every second simply to
survive: if all cell division were stopped (by exposure to a very large dose of x-rays,
for example), we would die within a few days.

The details of the cell cycle vary from organism to organism and at different
times in an organism's life. Certain characteristics, however, are universal. At a
minimum, the cell must accomplish its most fundamental task: the passing on
of its genetic information to the next generation of cells. To produce two
genetically identical daughter cells, the DNA in each chromosome must first be
faithfully replicated to produce two complete copies. The replicated chromosomes
must then be accurately distributed (segregated) to the two daughter cells, so that
each receives a copy of the entire genome (Figure 17-1). In addition to
duplicating their genome, most cells also duplicate their other organelles and
macromolecules; otherwise, daughter cells would get smaller with each division. To maintain
their size, dividing cells must coordinate their growth (that is, their increase in cell
mass) with their division.

This chapter describes the events of the cell cycle and how they are controlled
and coordinated. We begin with a brief overview of the cell cycle. We then describe
the cell-cycle control system, a complex network of regulatory proteins that triggers
the different events of the cycle. We next consider in detail the major stages of the
cell cycle, in which the chromosomes are duplicated and then segregated into the
two daughter cells. Finally, we consider how extracellular signals govern the rates
of cell growth and division and how these two processes are coordinated.


IN THIS CHAPTER 

OVERVIEW OF THE CELL CYCLE 

THE CELL-CYCLE CONTROL 
SYSTEM 

S PHASE 

MITOSIS 

CYTOKINESIS 

MEIOSIS 

CONTROL OF CELL DIVISION 
AND CELL GROWTH 


OVERVIEW OF THE CELL CYCLE 

The most basic function of the cell cycle is to duplicate the vast amount of DNA
in the chromosomes and then segregate the copies into two genetically identical
daughter cells. These processes define the two major phases of the cell cycle.
Chromosome duplication occurs during S phase (S for DNA synthesis), which requires
10-12 hours and occupies about half of the cell-cycle time in a typical mammalian
cell. After S phase, chromosome segregation and cell division occur in M phase (M
for mitosis), which requires much less time (less than an hour in a mammalian
cell). M phase comprises two major events: nuclear division, or mitosis, during


964 Chapter17: The Cell Cycle 



Figure 17-1 The cell cycle. The division 
of a hypothetical eukaryotic cell with two 
chromosomes (one red, and one black) 
is shown to illustrate how two genetically 
identical daughter cells are produced in 
each cycle. Each of the daughter cells will 
often continue to divide by going through 
additional cell cycles. 


which the copied chromosomes are distributed into a pair of daughter nuclei;
and cytoplasmic division, or cytokinesis, when the cell itself divides in two (Figure
17-2).

At the end of S phase, the DNA molecules in each pair of duplicated
chromosomes are intertwined and held tightly together by specialized protein linkages.
Early in mitosis at a stage called prophase, the two DNA molecules are
gradually disentangled and condensed into pairs of rigid, compact rods called sister
chromatids, which remain linked by sister-chromatid cohesion. When the nuclear
envelope disassembles later in mitosis, the sister-chromatid pairs become
attached to the mitotic spindle, a giant bipolar array of microtubules (discussed
in Chapter 16). Sister chromatids are attached to opposite poles of the spindle
and, eventually, align at the spindle equator in a stage called metaphase. The
destruction of sister-chromatid cohesion at the start of anaphase separates the
sister chromatids, which are pulled to opposite poles of the spindle. The spindle is
then disassembled, and the segregated chromosomes are packaged into separate
nuclei at telophase. Cytokinesis then cleaves the cell in two, so that each daughter
cell inherits one of the two nuclei (Figure 17-3).

The Eukaryotic Cell Cycle Usually Consists of Four Phases 

Most cells require much more time to grow and double their mass of proteins and
organelles than they require to duplicate their chromosomes and divide. Partly to
allow time for growth, most cell cycles have gap phases: a G1 phase between M
phase and S phase and a G2 phase between S phase and mitosis. Thus, the
eukaryotic cell cycle is traditionally divided into four sequential phases: G1, S, G2, and M.
G1, S, and G2 together are called interphase (Figure 17-4, and see Figure 17-3). In
a typical human cell proliferating in culture, interphase might occupy 23 hours of
a 24-hour cycle, with 1 hour for M phase. Cell growth occurs throughout the cell
cycle, except during mitosis.

The two gap phases are more than simple time delays to allow cell growth. They
also provide time for the cell to monitor the internal and external environment


Figure 17-2 The major events of the cell cycle. The major chromosomal 
events of the cell cycle occur in S phase, when the chromosomes are 
duplicated, and M phase, when the duplicated chromosomes are segregated 
into a pair of daughter nuclei (in mitosis), after which the cell itself divides into 
two (cytokinesis). 

Figure 17-3 The events of eukaryotic cell division as seen under a microscope. The easily visible processes of nuclear
division (mitosis) and cell division (cytokinesis), collectively called M phase, typically occupy only a small fraction of the cell cycle.
The other, much longer, part of the cycle is known as interphase, which includes S phase and the gap phases (discussed in
text). The five stages of mitosis are shown: an abrupt change in the biochemical state of the cell occurs at the transition from
metaphase to anaphase. A cell can pause in metaphase before this transition point, but once it passes this point, the cell carries
on to the end of mitosis and through cytokinesis into interphase.


to ensure that conditions are suitable and preparations are complete before the
cell commits itself to the major upheavals of S phase and mitosis. The G1 phase
is especially important in this respect. Its length can vary greatly depending on
external conditions and extracellular signals from other cells. If extracellular
conditions are unfavorable, for example, cells delay progress through G1 and may even
enter a specialized resting state known as G0 (G zero), in which they can remain
for days, weeks, or even years before resuming proliferation. Indeed, many cells
remain permanently in G0 until they or the organism dies. If extracellular
conditions are favorable and signals to grow and divide are present, cells in early G1 or
G0 progress through a commitment point near the end of G1 known as Start (in
yeasts) or the restriction point (in mammalian cells). We will use the term Start
for both yeast and animal cells. After passing this point, cells are committed to
DNA replication, even if the extracellular signals that stimulate cell growth and
division are removed.

Cell-Cycle Control Is Similar in All Eukaryotes

Some features of the cell cycle, including the time required to complete certain
events, vary greatly from one cell type to another, even in the same organism. The
basic organization of the cycle, however, is essentially the same in all eukaryotic



Figure 17-4 The four phases of the cell
cycle. In most cells, gap phases separate
the major events of S phase and M phase.
G1 is the gap between M phase and
S phase, while G2 is the gap between
S phase and M phase.

Figure 17-5 Mammalian cells proliferating in culture. The cells in this
scanning electron micrograph are rat fibroblasts. Cells at the lower left have
rounded up and are in mitosis. (Courtesy of Guenter Albrecht-Buehler.)


cells, and all eukaryotes appear to use similar machinery and control
mechanisms to drive and regulate cell-cycle events. The proteins of the cell-cycle control
system, for example, first appeared over a billion years ago. Remarkably, they have
been so well conserved over the course of evolution that many of them function
perfectly when transferred from a human cell to a yeast cell. We can therefore
study the cell cycle and its regulation in a variety of organisms and use the
findings from all of them to assemble a unified picture of how eukaryotic cells divide.

Several model organisms are used in the analysis of the eukaryotic cell cycle.
The budding yeast Saccharomyces cerevisiae and the fission yeast
Schizosaccharomyces pombe are simple eukaryotes in which powerful molecular and genetic
approaches can be used to identify and characterize the genes and proteins that
govern the fundamental features of cell division. The early embryos of certain
animals, particularly those of the frog Xenopus laevis, are excellent tools for
biochemical dissection of cell-cycle control mechanisms, while the fruit fly
Drosophila melanogaster is useful for the genetic analysis of mechanisms underlying the
control and coordination of cell growth and division in multicellular organisms.
Cultured human cells provide an excellent system for the molecular and
microscopic exploration of the complex processes by which our own cells divide.

Cell-Cycle Progression Can Be Studied in Various Ways

How can we tell what stage a cell has reached in the cell cycle? One way is simply
to look at living cells with a microscope. A glance at a population of mammalian
cells proliferating in culture reveals that a fraction of the cells have rounded up
and are in mitosis (Figure 17-5). Others can be observed in the process of
cytokinesis. Similarly, looking at budding yeast cells under a microscope is very
useful, because the size of the bud provides an indication of cell-cycle stage (Figure
17-6). We can gain additional clues about cell-cycle position by staining cells with
DNA-binding fluorescent dyes (which reveal the condensation of chromosomes
in mitosis) or with antibodies that recognize specific cell components such as the
microtubules (revealing the mitotic spindle). S-phase cells can be identified in
the microscope by supplying them with visualizable molecules that are
incorporated into newly synthesized DNA, such as the artificial thymidine analog
bromodeoxyuridine (BrdU); cell nuclei that have incorporated BrdU are then revealed
by staining with anti-BrdU antibodies (Figure 17-7).

Typically, in a population of cultured mammalian cells that are all
proliferating rapidly but asynchronously, about 30-40% will be in S phase at any instant and
become labeled by a brief pulse of BrdU. From the proportion of cells in such a
population that are labeled, we can estimate the duration of S phase as a fraction
of the whole cell-cycle duration. Similarly, from the proportion of cells in mitosis
(the mitotic index), we can estimate the duration of M phase.
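
The estimate described above can be sketched in a few lines of Python. This is a minimal illustration, assuming an asynchronous population in which the fraction of cells found in a phase is taken to equal the fraction of the cycle that the phase occupies. The function name estimate_phase_duration, the 35% labeling fraction, and the 24-hour cycle time are illustrative assumptions, not measurements.

```python
def estimate_phase_duration(fraction_in_phase: float, cycle_hours: float) -> float:
    """Estimate how long a phase lasts from the fraction of cells found in it.

    Assumes an asynchronous population in which the fraction of cells in a
    phase equals the fraction of the cell cycle that the phase occupies.
    """
    if not 0.0 <= fraction_in_phase <= 1.0:
        raise ValueError("fraction_in_phase must be between 0 and 1")
    if cycle_hours <= 0:
        raise ValueError("cycle_hours must be positive")
    return fraction_in_phase * cycle_hours


# Illustrative values only: 35% of cells labeled by a brief BrdU pulse,
# and an assumed 24-hour cell cycle.
s_phase_hours = estimate_phase_duration(0.35, 24.0)
print(f"Estimated S phase: {s_phase_hours:.1f} hours")
```

The same function could be applied to the mitotic index to estimate the duration of M phase.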

Another way to assess the stage that a cell has reached in the cell cycle is by
measuring its DNA content, which doubles during S phase. This approach is
greatly facilitated by the use of fluorescent DNA-binding dyes and a flow
cytometer, which allows the rapid and automatic analysis of large numbers of cells
(Figure 17-8). We can use flow cytometry to determine the lengths of G1, S, and G2
+ M phases, by measuring DNA content in a synchronized cell population as it
progresses through the cell cycle.
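
The logic of sorting cells by DNA content can be sketched as follows. This is a simplified illustration, assuming fluorescence has been normalized so that an unreplicated genome has a value of 1.0; the function name, the tolerance of 0.1, and the sample measurements are hypothetical and chosen only to show the idea. Note that DNA content alone cannot separate G2 from M.

```python
from collections import Counter


def classify_by_dna_content(dna: float, g1_content: float = 1.0,
                            tolerance: float = 0.1) -> str:
    """Assign a cell to G1, S, or G2/M from its relative DNA content.

    dna is fluorescence normalized so that an unreplicated genome equals
    g1_content; a fully replicated genome is then 2 * g1_content.
    tolerance is a hypothetical allowance for measurement noise.
    """
    if dna <= g1_content * (1 + tolerance):
        return "G1"
    if dna >= 2 * g1_content * (1 - tolerance):
        return "G2/M"
    return "S"


# Made-up measurements for illustration only.
measurements = [0.98, 1.02, 1.05, 1.4, 1.7, 1.95, 2.03, 1.0]
counts = Counter(classify_by_dna_content(m) for m in measurements)
print(counts)
```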


Figure 17-6 The morphology of budding yeast cells. In a normal
population of proliferating yeast cells, buds vary in size according to the
cell-cycle stage. Unbudded cells are in G1. Progression through the Start
transition triggers formation of a tiny bud, which grows in size during the
S and M phases until it is almost the size of the mother cell. (Courtesy of
Jeff Ubersax.)

Figure 17-7 Labeling S-phase cells. An immunofluorescence micrograph
of BrdU-labeled epithelial cells of the zebrafish gut. The fish was exposed
to BrdU, after which the tissue was fixed and prepared for labeling with
fluorescent anti-BrdU antibodies (green). All the cells are stained with a red
fluorescent dye. (Courtesy of Cécile Crosnier.)


Summary 

Cell division usually begins with duplication of the cell's contents, followed by
distribution of those contents into two daughter cells. Chromosome duplication occurs
during S phase of the cell cycle, whereas most other cell components are duplicated
continuously throughout the cycle. During M phase, the replicated chromosomes
are segregated into individual nuclei (mitosis), and the cell then splits in two
(cytokinesis). S phase and M phase are usually separated by gap phases called G1 and
G2, when various intracellular and extracellular signals regulate cell-cycle
progression. Cell-cycle organization and control have been highly conserved during
evolution, and studies in a wide range of systems have led to a unified view of eukaryotic
cell-cycle control.

THE CELL-CYCLE CONTROL SYSTEM 

For many years, cell biologists watched the puppet show of DNA synthesis,
mitosis, and cytokinesis but had no idea of what lay behind the curtain controlling
these events. It was not even clear whether there was a separate control system, or
whether the processes of DNA synthesis, mitosis, and cytokinesis somehow
controlled themselves. A major breakthrough came in the late 1980s with the
identification of the key proteins of the control system, along with the realization that
they are distinct from the proteins that perform the processes of DNA replication,
chromosome segregation, and so on.

In this section, we first consider the basic principles upon which the cell-cycle
control system operates. We then discuss the protein components of the system
and how they work together to time and coordinate the events of the cell cycle.

The Cell-Cycle Control System Triggers the Major Events of the 
Cell Cycle 

The cell-cycle control system operates much like a timer that triggers the events
of the cell cycle in a set sequence (Figure 17-9). In its simplest form (as seen in
the stripped-down cell cycles of early animal embryos, for example), the control
system is rigidly programmed to provide a fixed amount of time for the
completion of each cell-cycle event. The control system in these early embryonic
divisions is independent of the events it controls, so that its timing mechanisms
continue to operate even if those events fail. In most cells, however, the control system
does respond to information received back from the processes it controls. If some
malfunction prevents the successful completion of DNA synthesis, for example,
signals are sent to the control system to delay progression to M phase. Such delays
provide time for the machinery to be repaired and also prevent the disaster that
might result if the cycle progressed prematurely to the next stage and segregated
incompletely replicated chromosomes, for example.

The cell-cycle control system is based on a connected series of
biochemical switches, each of which initiates a specific cell-cycle event. This system of
switches possesses many important features that increase the accuracy and
reliability of cell-cycle progression. First, the switches are generally binary (on/off)
and launch events in a complete, irreversible fashion. It would clearly be
disastrous, for example, if events like chromosome condensation or nuclear-envelope
breakdown were only partially initiated or started but not completed. Second, the
cell-cycle control system is remarkably robust and reliable, partly because backup
mechanisms and other features allow the system to operate effectively under a
variety of conditions and even if some components fail. Finally, the control system
is highly adaptable and can be modified to suit specific cell types or to respond to
specific intracellular or extracellular signals.

Figure 17-8 Analysis of DNA content
with a flow cytometer. This graph shows
typical results obtained for a proliferating
cell population when the DNA content of
its individual cells is determined in a flow
cytometer. (A flow cytometer, also called a
fluorescence-activated cell sorter, or FACS,
can also be used to sort cells according to
their fluorescence; see Figure 8-2.) The
cells analyzed here were stained with a dye
that becomes fluorescent when it binds to
DNA, so that the amount of fluorescence
is directly proportional to the amount of
DNA in each cell. The cells fall into three
categories: those that have an unreplicated
complement of DNA and are therefore
in G1, those that have a fully replicated
complement of DNA (twice the G1 DNA
content) and are in G2 or M phase, and
those that have an intermediate amount of
DNA and are in S phase. The distribution
of cells indicates that there are greater
numbers of cells in G1 than in G2 + M
phase, showing that G1 is longer than
G2 + M in this population.

Figure 17-9 labels: Is all DNA replicated? Are all chromosomes attached to the spindle? Is environment favorable?


In most eukaryotic cells, the cell-cycle control system governs cell-cycle
progression at three major regulatory transitions (see Figure 17-9). The first is Start
(or the restriction point) in late G1, where the cell commits to cell-cycle entry and
chromosome duplication. The second is the G2/M transition, where the control
system triggers the early mitotic events that lead to chromosome alignment on
the mitotic spindle in metaphase. The third is the metaphase-to-anaphase
transition, where the control system stimulates sister-chromatid separation, leading
to the completion of mitosis and cytokinesis. The control system blocks
progression through each of these transitions if it detects problems inside or outside the
cell. If the control system senses problems in the completion of DNA replication,
for example, it will hold the cell at the G2/M transition until those problems are
solved. Similarly, if extracellular conditions are not appropriate for cell
proliferation, the control system blocks progression through Start, thereby preventing cell
division until conditions become favorable.
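
The three regulatory transitions can be pictured as a sequence of gates, each of which opens only when its condition is met. The sketch below is a deliberately simplified illustration, not a model of the real control system: the class CellState, its field names, and the pairing of one condition with each transition are assumptions made for the example (the spindle-attachment condition is taken from the labels of Figure 17-9).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CellState:
    """Hypothetical summary of the conditions the control system monitors."""
    environment_favorable: bool
    dna_fully_replicated: bool
    all_chromosomes_attached: bool


# Each regulatory transition paired with the condition required to pass it.
TRANSITIONS = [
    ("Start", lambda cell: cell.environment_favorable),
    ("G2/M", lambda cell: cell.dna_fully_replicated),
    ("metaphase-to-anaphase", lambda cell: cell.all_chromosomes_attached),
]


def first_blocked_transition(cell: CellState) -> Optional[str]:
    """Return the first transition at which the cycle arrests, or None."""
    for name, condition_met in TRANSITIONS:
        if not condition_met(cell):
            return name
    return None


# Example: replication is incomplete, so the cell is held at G2/M.
stalled = CellState(environment_favorable=True,
                    dna_fully_replicated=False,
                    all_chromosomes_attached=True)
print(first_blocked_transition(stalled))
```

Checking the conditions in a fixed order mirrors the fixed order of the transitions in the cycle; a cell that satisfies every condition passes all three and the function returns None.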

The Cell-Cycle Control System Depends on Cyclically Activated
Cyclin-Dependent Protein Kinases (Cdks)

Central components of the cell-cycle control system are members of a family of
protein kinases known as cyclin-dependent kinases (Cdks). The activities of these
kinases rise and fall as the cell progresses through the cycle, leading to cyclical
changes in the phosphorylation of intracellular proteins that initiate or regulate
the major events of the cell cycle. An increase in Cdk activity at the G2/M
transition, for example, increases the phosphorylation of proteins that control
chromosome condensation, nuclear-envelope breakdown, spindle assembly, and other
events that occur in early mitosis.

Cyclical changes in Cdk activity are controlled by a complex array of enzymes
and other proteins. The most important of these Cdk regulators are proteins
known as cyclins. Cdks, as their name implies, are dependent on cyclins for their
activity: unless they are bound tightly to a cyclin, they have no protein kinase
activity (Figure 17-10). Cyclins were originally named because they undergo a
cycle of synthesis and degradation in each cell cycle. The levels of the Cdk
proteins, by contrast, are constant. Cyclical changes in cyclin protein levels result in
the cyclic assembly and activation of cyclin-Cdk complexes at specific stages of
the cell cycle.
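
A toy calculation shows why oscillating cyclin levels are enough to produce oscillating kinase activity even when the amount of Cdk never changes. The sketch assumes that each complex needs one cyclin and one Cdk and that every complex formed is active; it ignores the additional layers of regulation that act on real cyclin-Cdk complexes. The function name and all numbers are made up for illustration.

```python
def cyclin_cdk_complexes(cyclin: float, cdk: float) -> float:
    """Toy model: each complex needs one cyclin and one Cdk molecule."""
    return min(cyclin, cdk)


cdk_level = 10.0  # held constant through the cycle (arbitrary units)
cyclin_levels = [0.0, 2.0, 6.0, 9.0, 1.0, 0.0]  # made-up rise and fall

# Complex level at each time point follows the cyclin, not the Cdk.
complexes = [cyclin_cdk_complexes(c, cdk_level) for c in cyclin_levels]
print(complexes)
```

Because Cdk is in excess in this example, the amount of complex simply tracks the cyclin level, rising as cyclin is synthesized and falling as it is degraded.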


Figure 17-9 The control of the cell cycle.
A cell-cycle control system triggers the
essential processes of the cycle, such as
DNA replication, mitosis, and cytokinesis.
The control system is represented here as
a central arm (the controller) that rotates
clockwise, triggering essential processes
when it reaches specific transitions on the
outer dial (yellow boxes). Information about
the completion of cell-cycle events, as well
as signals from the environment, can cause
the control system to arrest the cycle at
these transitions.


Figure 17-10 Two key components of
the cell-cycle control system. When
cyclin forms a complex with cyclin-dependent
kinase (Cdk), the protein kinase is activated
to trigger specific cell-cycle events. Without
cyclin, Cdk is inactive.

There are four classes of cyclins, each defined by the stage of the cell cycle at
which they bind Cdks and function. All eukaryotic cells require three of these
classes (Figure 17-11):

1. G1/S-cyclins activate Cdks in late G1 and thereby help trigger progression
through Start, resulting in a commitment to cell-cycle entry. Their levels
fall in S phase.

2. S-cyclins bind Cdks soon after progression through Start and help
stimulate chromosome duplication. S-cyclin levels remain elevated until
mitosis, and these cyclins also contribute to the control of some early mitotic
events.

3. M-cyclins activate Cdks that stimulate entry into mitosis at the G2/M
transition. M-cyclin levels fall in mid-mitosis.

In most cells, a fourth class of cyclins, the G1-cyclins, helps govern the
activities of the G1/S-cyclins, which control progression through Start in late G1.

In yeast cells, a single Cdk protein binds all classes of cyclins and triggers
different cell-cycle events by changing cyclin partners at different stages of the cycle.
In vertebrate cells, by contrast, there are four Cdks. Two interact with G1-cyclins,
one with G1/S- and S-cyclins, and one with S- and M-cyclins. In this chapter, we
simply refer to the different cyclin-Cdk complexes as G1-Cdk, G1/S-Cdk, S-Cdk,
and M-Cdk. Table 17-1 lists the names of the individual Cdks and cyclins.

How do different cyclin-Cdk complexes trigger different cell-cycle events?
The answer, at least in part, seems to be that the cyclin protein does not simply
activate its Cdk partner but also directs it to specific target proteins. As a result,
each cyclin-Cdk complex phosphorylates a different set of substrate proteins.
The same cyclin-Cdk complex can also induce different effects at different times
in the cycle, probably because the accessibility of some Cdk substrates changes
during the cell cycle. Certain proteins that function in mitosis, for example, may
become available for phosphorylation only in G2.


Figure 17-11 Cyclin-Cdk complexes of the cell-cycle control system. The
concentrations of the three major cyclin types oscillate during the cell cycle, while
the concentrations of Cdks (not shown) do not change and exceed cyclin amounts.
In late G1, rising G1/S-cyclin levels lead to the formation of G1/S-Cdk complexes
that trigger progression through the Start transition. S-Cdk complexes form
at the start of S phase and trigger DNA replication, as well as some early mitotic
events. M-Cdk complexes form during G2 but are held in an inactive state; they
are activated at the end of G2 and trigger entry into mitosis at the G2/M transition.
A separate regulatory protein complex, the APC/C, initiates the
metaphase-to-anaphase transition, as we discuss later.


Table 17-1 The Major Cyclins and Cdks of Vertebrates and Budding Yeast

                     Vertebrates                  Budding yeast
Cyclin-Cdk complex   Cyclin      Cdk partner      Cyclin          Cdk partner
G1-Cdk               Cyclin D*   Cdk4, Cdk6       Cln3            Cdk1**
G1/S-Cdk             Cyclin E    Cdk2             Cln1, 2         Cdk1
S-Cdk                Cyclin A    Cdk2, Cdk1**     Clb5, 6         Cdk1
M-Cdk                Cyclin B    Cdk1             Clb1, 2, 3, 4   Cdk1

* There are three D cyclins in mammals (cyclins D1, D2, and D3).

** The original name of Cdk1 was Cdc2 in both vertebrates and fission yeast, and Cdc28 in
budding yeast.
[Figure 17-12 panels: (A) inactive; (B) partly active; (C) fully active]


Studies of the three-dimensional structures of Cdk and cyclin proteins have
revealed that, in the absence of cyclin, the active site in the Cdk protein is partly
obscured by a protein loop, like a stone blocking the entrance to a cave (Figure
17-12A). Cyclin binding causes the loop to move away from the active site, resulting
in partial activation of the Cdk enzyme (Figure 17-12B). Full activation of
the cyclin-Cdk complex then occurs when a separate kinase, the Cdk-activating
kinase (CAK), phosphorylates an amino acid near the entrance of the Cdk active
site. This causes a small conformational change that further increases the activity
of the Cdk, allowing the kinase to phosphorylate its target proteins effectively and
thereby induce specific cell-cycle events (Figure 17-12C).

Cdk Activity Can Be Suppressed By Inhibitory Phosphorylation 
and Cdk Inhibitor Proteins (CKIs) 

The rise and fall of cyclin levels is the primary determinant of Cdk activity during
the cell cycle. Several additional mechanisms, however, help control Cdk activity
at specific stages of the cycle.

Phosphorylation at a pair of amino acids in the roof of the kinase active site
inhibits the activity of a cyclin-Cdk complex. Phosphorylation of these sites by
a protein kinase known as Wee1 inhibits Cdk activity, while dephosphorylation
of these sites by a phosphatase known as Cdc25 increases Cdk activity (Figure
17-13). We will see later that this regulatory mechanism is particularly important
in the control of M-Cdk activity at the onset of mitosis.

Binding of Cdk inhibitor proteins (CKIs) inactivates cyclin-Cdk complexes.
The three-dimensional structure of a cyclin-Cdk-CKI complex reveals that CKI
binding stimulates a large rearrangement in the structure of the Cdk active site,
rendering it inactive (Figure 17-14). Cells use CKIs primarily to help govern the
activities of G1/S- and S-Cdks early in the cell cycle.
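
The layers of Cdk regulation described so far (cyclin binding, the activating phosphate added by CAK, the inhibitory phosphates added by Wee1 and removed by Cdc25, and CKI binding) can be summarized as a simple decision rule. The sketch below is only an illustration under the assumption that each input is all-or-none; the class name `CdkState`, its fields, and the function `cdk_activity` are hypothetical names invented for this example, and real Cdk activity is graded rather than a three-valued label.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CdkState:
    """Hypothetical, simplified description of one Cdk molecule."""
    cyclin_bound: bool = False
    activating_phosphate: bool = False   # added by CAK
    inhibitory_phosphate: bool = False   # added by Wee1, removed by Cdc25
    cki_bound: bool = False              # e.g. p27 bound to the complex


def cdk_activity(state: CdkState) -> str:
    """Return a coarse activity label for the given regulatory state."""
    # Without cyclin, the active site is blocked and the kinase is inactive.
    if not state.cyclin_bound:
        return "inactive"
    # Inhibitory phosphorylation or CKI binding switches the complex off,
    # even when the activating phosphate is present.
    if state.inhibitory_phosphate or state.cki_bound:
        return "inactive"
    # Cyclin alone gives partial activity; CAK phosphorylation gives full activity.
    return "fully active" if state.activating_phosphate else "partly active"


if __name__ == "__main__":
    print(cdk_activity(CdkState(cyclin_bound=True)))
    print(cdk_activity(CdkState(cyclin_bound=True, activating_phosphate=True)))
    print(cdk_activity(CdkState(cyclin_bound=True, activating_phosphate=True,
                                inhibitory_phosphate=True)))
```

Treating inhibition as dominant over activation mirrors the description of Wee1-phosphorylated complexes being held inactive despite carrying the activating phosphate.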

Regulated Proteolysis Triggers the Metaphase-to-Anaphase 
Transition 

Whereas activation of specific cyclin-Cdk complexes drives progression through
the Start and G2/M transitions (see Figure 17-11), progression through the
metaphase-to-anaphase transition is triggered not by protein phosphorylation but by
protein destruction, leading to the final stages of cell division.

The key regulator of the metaphase-to-anaphase transition is the
anaphase-promoting complex, or cyclosome (APC/C), a member of the ubiquitin ligase
family of enzymes. As discussed in Chapter 3, these enzymes are used in numerous
cell processes to stimulate the proteolytic destruction of specific regulatory
proteins. They polyubiquitylate specific target proteins, resulting in their destruction
in proteasomes. Other ubiquitin ligases mark proteins for purposes other
than destruction (discussed in Chapter 3).


Figure 17-12 The structural basis of Cdk activation. These drawings are
based on three-dimensional structures of human Cdk2 and cyclin A, as determined
by x-ray crystallography. The location of the bound ATP is indicated. The enzyme
is shown in three states. (A) In the inactive state, without cyclin bound, the active
site is blocked by a region of the protein called the T-loop (red). (B) The binding of
cyclin causes the T-loop to move out of the active site, resulting in partial activation of
the Cdk2. (C) Phosphorylation of Cdk2 (by CAK) at a threonine residue in the T-loop
further activates the enzyme by changing the shape of the T-loop, improving the
ability of the enzyme to bind its protein substrates (Movie 17.1).


[Figure 17-13 labels: cyclin; Cdk; inhibitory phosphate; activating phosphate; ACTIVE; INACTIVE]

Figure 17-13 The regulation of Cdk activity by phosphorylation. The active
cyclin-Cdk complex is turned off when the kinase Wee1 phosphorylates two
closely spaced sites above the active site. Removal of these phosphates by the
phosphatase Cdc25 activates the cyclin-Cdk complex. For simplicity, only one
inhibitory phosphate is shown. CAK adds the activating phosphate, as shown
in Figure 17-12.
Figure 17-14 The inhibition of a cyclin-Cdk complex by a CKI. This drawing is based on
the three-dimensional structure of the human cyclin A-Cdk2 complex bound to the CKI p27, as 
determined by x-ray crystallography. The p27 binds to both the cyclin and Cdk in the complex, 
distorting the active site of the Cdk. It also inserts into the ATP-binding site, further inhibiting the 
enzyme activity. 


The APC/C catalyzes the ubiquitylation and destruction of two major types
of proteins. The first is securin, which protects the protein linkages that hold
sister-chromatid pairs together in early mitosis. Destruction of securin in metaphase
activates a protease that separates the sisters and unleashes anaphase, as
described later. The S- and M-cyclins are the second major targets of the APC/C.
Destroying these cyclins inactivates most Cdks in the cell (see Figure 17-11). As
a result, the many proteins phosphorylated by Cdks from S phase to early mitosis
are dephosphorylated by various phosphatases in the anaphase cell. This
dephosphorylation of Cdk targets is required for the completion of M phase, including the
final steps in mitosis and then cytokinesis. Following its activation in mid-mitosis,
the APC/C remains active in G1 to provide a stable period of Cdk inactivity. When
G1/S-Cdk is activated in late G1, the APC/C is turned off, thereby allowing cyclin
accumulation in the next cell cycle.

The cell-cycle control system also uses another ubiquitin ligase called SCF (see
Figure 3-71). It has many functions in the cell, but its major role in the cell cycle is
to ubiquitylate certain CKI proteins in late G1, thereby helping to control the
activation of S-Cdks and DNA replication. SCF is also responsible for the destruction
of G1/S-cyclins in early S phase.

The APC/C and SCF are both large, multisubunit complexes with some related
components (see Figure 3-71), but they are regulated differently. APC/C activity
changes during the cell cycle, primarily as a result of changes in its association
with an activating subunit, either Cdc20 in mid-mitosis or Cdh1 from late
mitosis through early G1. These subunits help the APC/C recognize its target proteins
(Figure 17-15A). SCF activity depends on substrate-binding subunits called F-box
proteins. Unlike APC/C activity, however, SCF activity is constant during the cell
cycle. Ubiquitylation by SCF is controlled instead by changes in the
phosphorylation state of its target proteins, as F-box subunits recognize only specifically
phosphorylated proteins (Figure 17-15B).
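
The contrast between the two ligases can be put as two small rules: APC/C is regulated through the ligase itself (is an activating subunit bound?), whereas SCF is regulated through the substrate (is the target phosphorylated?). The following sketch is a deliberately crude illustration; the function names and boolean parameters are hypothetical and were chosen for this example only.

```python
# Activating subunits named in the text; everything else here is illustrative.
APC_C_ACTIVATORS = {"Cdc20", "Cdh1"}


def apc_c_ubiquitylates(bound_activator, target_recognized):
    """APC/C acts only when an activating subunit is bound and that
    subunit recognizes the target protein."""
    return bound_activator in APC_C_ACTIVATORS and target_recognized


def scf_ubiquitylates(target_phosphorylated, f_box_matches_target=True):
    """SCF is treated as always active; its F-box subunit is assumed to
    bind the target only when the target is phosphorylated."""
    return target_phosphorylated and f_box_matches_target


if __name__ == "__main__":
    # No activator bound (for example, after APC/C is turned off in late G1).
    print(apc_c_ubiquitylates(None, True))
    # Activator bound in mitosis.
    print(apc_c_ubiquitylates("Cdc20", True))
    # A CKI before and after phosphorylation.
    print(scf_ubiquitylates(False), scf_ubiquitylates(True))
```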

Cell-Cycle Control Also Depends on Transcriptional Regulation

In the simple cell cycles of early animal embryos, gene transcription does not
occur. Cell-cycle control depends exclusively on post-transcriptional mechanisms
that involve the regulation of Cdks and ubiquitin ligases and their target proteins.
In the more complex cell cycles of most cell types, however, transcriptional
control provides an important additional level of regulation. Changes in cyclin gene
transcription, for example, help control cyclin levels in most cells.

A variety of methods discussed in Chapter 8 have been used to analyze changes
in the expression of all genes in the genome as the cell progresses through the cell
cycle. The results of these studies are surprising. In budding yeast, for example,
about 10% of the genes encode mRNAs whose levels oscillate during the cell cycle.
Some of these genes encode proteins with known cell-cycle functions, but the
functions of many others are unknown.
The Cell-Cycle Control System Functions as a Network of
Biochemical Switches

Table 17-2 summarizes some of the major components of the cell-cycle control
system. These proteins are functionally linked to form a robust network, which
operates essentially autonomously to activate a series of biochemical switches,
each of which triggers a specific cell-cycle event.

When conditions for cell proliferation are right, various external and internal
signals stimulate the activation of G1-Cdk, which in turn stimulates the expression
of genes encoding G1/S- and S-cyclins (Figure 17-16). The resulting activation of
G1/S-Cdk then drives progression through the Start transition. By mechanisms
we discuss later, G1/S-Cdks unleash a wave of S-Cdk activity, which initiates


[Figure 17-15 panels: (A) control of proteolysis by APC/C, with labels for the activating
subunit (Cdc20), Cdk, ubiquitin, and ubiquitylation enzymes; (B) control of proteolysis by
SCF, with labels for the active SCF complex, a Cdk inhibitor protein (CKI), and
ubiquitylation enzymes]

Figure 17-15 The control of proteolysis by APC/C and SCF during the cell cycle. (A) The
APC/C is activated in mitosis by association with Cdc20, which recognizes specific amino acid
sequences on M-cyclin and other target proteins. With the help of two additional proteins called
E1 and E2, the APC/C assembles polyubiquitin chains on the target protein. The polyubiquitylated
target is then recognized and degraded in a proteasome. (B) The activity of the ubiquitin ligase SCF
depends on substrate-binding subunits called F-box proteins, of which there are many different
types. The phosphorylation of a target protein, such as the CKI shown, allows the target to be
recognized by a specific F-box subunit.
Table 17-2 Summary of the Major Cell Cycle Regulatory Proteins

General name                  Functions and comments

Protein kinases and protein phosphatases that modify Cdks

Cdk-activating kinase (CAK)   Phosphorylates an activating site in Cdks

Wee1 kinase                   Phosphorylates inhibitory sites in Cdks; primarily involved in
                              suppressing Cdk1 activity before mitosis

Cdc25 phosphatase             Removes inhibitory phosphates from Cdks; three family members
                              (Cdc25A, B, C) in mammals; primarily involved in controlling
                              Cdk1 activation at the onset of mitosis

Cdk inhibitor proteins (CKIs)

Sic1 (budding yeast)          Suppresses Cdk1 activity in G1; phosphorylation by Cdk1 at the
                              end of G1 triggers its destruction

p27 (mammals)                 Suppresses G1/S-Cdk and S-Cdk activities in G1; helps cells
                              withdraw from cell cycle when they terminally differentiate;
                              phosphorylation by Cdk2 triggers its ubiquitylation by SCF

p21 (mammals)                 Suppresses G1/S-Cdk and S-Cdk activities following DNA damage

p16 (mammals)                 Suppresses G1-Cdk activity in G1; frequently inactivated in cancer

Ubiquitin ligases and their activators

APC/C                         Catalyzes ubiquitylation of regulatory proteins involved primarily
                              in exit from mitosis, including securin and S- and M-cyclins;
                              regulated by association with activating subunits Cdc20 or Cdh1

Cdc20                         APC/C-activating subunit in all cells; triggers initial activation
                              of APC/C at metaphase-to-anaphase transition; stimulated by
                              M-Cdk activity

Cdh1                          APC/C-activating subunit that maintains APC/C activity after
                              anaphase and throughout G1; inhibited by Cdk activity

SCF                           Catalyzes ubiquitylation of regulatory proteins involved in G1
                              control, including some CKIs (Sic1 in budding yeast, p27 in
                              mammals); phosphorylation of target protein usually required
                              for this activity


chromosome duplication in S phase and also contributes to some early events of
mitosis. M-Cdk activation then triggers progression through the G2/M transition
and the events of early mitosis, leading to the alignment of sister-chromatid pairs
at the equator of the mitotic spindle. Finally, the APC/C, together with its
activator Cdc20, triggers the destruction of securin and cyclins, thereby unleashing
sister-chromatid separation and segregation and the completion of mitosis. When
mitosis is complete, multiple mechanisms collaborate to suppress Cdk activity,
resulting in a stable G1 period. We are now ready to discuss these cell-cycle stages
in more detail, starting with S phase.


[Figure 17-16 labels (inputs shown at top): favorable extracellular environment; DNA damage;
unreplicated DNA; DNA damage; chromosome unattached to spindle]

Figure 17-16 An overview of the cell-cycle control system. The core of the
cell-cycle control system consists of a series of cyclin-Cdk complexes (yellow). The
activity of each complex is also influenced by various inhibitory mechanisms, which
provide information about the extracellular environment, cell damage, and incomplete
cell-cycle events (top). These inhibitory mechanisms are not present in all cell
types; many are missing in early embryonic cell cycles, for example.
Summary

The cell-cycle control system triggers the events of the cell cycle and ensures that they
are properly timed and coordinated with each other. The control system responds
to various intracellular and extracellular signals and arrests the cycle when the cell
either fails to complete an essential cell-cycle process or encounters unfavorable
environmental or intracellular conditions.

Central components of the control system are the cyclin-dependent protein
kinases (Cdks), which depend on cyclin subunits for their activity. Oscillations in the
activities of different cyclin-Cdk complexes control various cell-cycle events. Thus,
activation of S-phase cyclin-Cdk complexes (S-Cdk) initiates S phase, whereas
activation of M-phase cyclin-Cdk complexes (M-Cdk) triggers mitosis. The mechanisms
that control the activities of cyclin-Cdk complexes include phosphorylation of the
Cdk subunit, binding of Cdk inhibitor proteins (CKIs), proteolysis of cyclins, and
changes in the transcription of genes encoding Cdk regulators. The cell-cycle control
system also depends crucially on two additional enzyme complexes, the APC/C and
SCF ubiquitin ligases, which catalyze the ubiquitylation and consequent destruction
of specific regulatory proteins that control critical events in the cycle.

S PHASE 

The linear chromosomes of eukaryotic cells are vast and dynamic assemblies of
DNA and protein, and their duplication is a complex process that takes up a major
fraction of the cell cycle. Not only must the long DNA molecule of each chromosome
be duplicated accurately (a remarkable feat in itself), but the protein packaging
surrounding each region of that DNA must also be reproduced, ensuring
that the daughter cells inherit all features of chromosome structure.

The central event of chromosome duplication, DNA replication, poses two
problems for the cell. First, replication must occur with extreme accuracy to minimize
the risk of mutations in the next cell generation. Second, every nucleotide in
the genome must be copied once, and only once, to prevent the damaging effects
of gene amplification. In Chapter 5, we discuss the sophisticated protein machinery
that performs DNA replication with astonishing speed and accuracy. In this
section, we consider the elegant mechanisms by which the cell-cycle control system
initiates the replication process and, at the same time, prevents it from happening
more than once per cycle.

S-Cdk Initiates DNA Replication Once Per Cycle

DNA replication begins at origins of replication, which are scattered at numerous
locations in every chromosome. During S phase, DNA replication is initiated at
these origins when a DNA helicase unwinds the double helix and DNA replication
enzymes are loaded onto the two single-stranded templates. This leads to the
elongation phase of replication, when the replication machinery moves outward
from the origin at two replication forks (discussed in Chapter 5).

To ensure that chromosome duplication occurs only once per cell cycle, the
initiation phase of DNA replication is divided into two distinct steps that occur at
different times in the cell cycle (Figure 17-17). The first step occurs in late mitosis
and early G1, when a pair of inactive DNA helicases is loaded onto the replication
origin, forming a large complex called the prereplicative complex or preRC. This
step is sometimes called licensing of replication origins because initiation of DNA
synthesis is permitted only at origins containing a preRC. The second step occurs
in S phase, when the DNA helicases are activated, resulting in DNA unwinding
and the initiation of DNA synthesis. Once a replication origin has been fired in
this way, the two helicases move out from the origin with the replication forks, and
that origin cannot be reused until a new preRC is assembled there at the end of
mitosis. As a result, origins can be activated only once per cell cycle.
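
The two-step logic (license first, fire later, and never both at the same time) behaves like a small state machine. The sketch below is a toy model of a single origin; the class `ReplicationOrigin`, its methods, and the helper `run_cycle` are hypothetical names made up for this illustration, and gating licensing purely on low Cdk activity is a simplifying assumption of the sketch.

```python
class ReplicationOrigin:
    """Toy model of one replication origin (illustrative names only)."""

    def __init__(self):
        self.has_prerc = False   # True once inactive helicases are loaded
        self.times_fired = 0

    def try_license(self, cdk_activity_high):
        # Assumption: a preRC can be assembled only while Cdk activity is low
        # (late mitosis and early G1).
        if not cdk_activity_high and not self.has_prerc:
            self.has_prerc = True
            return True
        return False

    def try_fire(self, s_cdk_active):
        # Firing needs both S-Cdk activity and an existing preRC. Firing
        # consumes the preRC, because the helicases leave with the forks.
        if s_cdk_active and self.has_prerc:
            self.has_prerc = False
            self.times_fired += 1
            return True
        return False


def run_cycle(origin):
    """Walk one origin through one cell cycle and record what succeeded."""
    return [
        ("late M / early G1: license", origin.try_license(cdk_activity_high=False)),
        ("S phase: fire", origin.try_fire(s_cdk_active=True)),
        ("S phase: re-license attempt", origin.try_license(cdk_activity_high=True)),
        ("S phase: second fire attempt", origin.try_fire(s_cdk_active=True)),
    ]


if __name__ == "__main__":
    origin = ReplicationOrigin()
    for step, succeeded in run_cycle(origin):
        print(step, succeeded)
    print("times fired:", origin.times_fired)
```

Because licensing and firing require opposite Cdk states in this model, the origin cannot fire twice within one pass through `run_cycle`, which is the point of separating the two steps in time.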

Figure 17-18 illustrates some of the molecular details underlying the control
of the two steps in the initiation of DNA replication. A key player is a large
multiprotein complex called the origin recognition complex (ORC), which binds to
Figure 17-17 Control of chromosome duplication. Preparations for DNA
replication begin in late mitosis and G1, when the DNA helicases are loaded by
multiple proteins at the replication origin, forming the prereplicative complex (preRC).
S-Cdk activation leads to activation of the DNA helicases, which unwind the DNA
at origins to initiate DNA replication. Two replication forks move out from each origin
until the entire chromosome is duplicated. Duplicated chromosomes are then
segregated in M phase. S-Cdk activation in S phase also prevents assembly of new
preRCs at any origin until the following G1, thereby ensuring that each origin is
activated only once in each cell cycle.

[Figure 17-17 labels: M-Cdk activation; CHROMOSOME SEGREGATION; G1; APC/C activation;
Cdk inactivation; ASSEMBLY OF NEW PREREPLICATIVE COMPLEXES AT ORIGINS]



replication origins throughout the cell cycle. In late mitosis and early G1, the
proteins Cdc6 and Cdt1 collaborate with the ORC to load the inactive DNA helicases
around the DNA next to the origin. The resulting large complex is the preRC, and
the origin is now licensed for replication.

At the onset of S phase, S-Cdk triggers origin activation by phosphorylating
specific initiator proteins, which then nucleate the assembly of a large protein
complex that activates the DNA helicase and recruits the DNA synthesis machinery.
Another protein kinase called DDK is also activated in S phase and helps
drive origin activation by phosphorylating specific subunits of the DNA helicase.

At the same time as S-Cdk initiates DNA replication, several mechanisms
prevent assembly of new preRCs. S-Cdk phosphorylates and thereby inhibits the
ORC and Cdc6 proteins. Inactivation of the APC/C in late G1 also helps turn off
preRC assembly. In late mitosis and early G1, the APC/C triggers the destruction
of a Cdt1 inhibitor called geminin, thereby allowing Cdt1 to be active. When the
APC/C is turned off in late G1, geminin accumulates and inhibits the Cdt1 that
is not associated with DNA. Also, the association of Cdt1 with a protein at active
replication forks stimulates Cdt1 destruction. In these various ways, preRC
formation is prevented from S phase to mitosis, thereby ensuring that each origin
is fired only once per cell cycle. How, then, is the cell-cycle control system reset
to allow replication in the next cell cycle? At the end of mitosis, APC/C activation
leads to the inactivation of Cdks and the destruction of geminin. ORC and Cdc6
are dephosphorylated and Cdt1 is activated, allowing preRC assembly to prepare
the cell for the next S phase.

Chromosome Duplication Requires Duplication of Chromatin 
Structure 

The DNA of the chromosomes is extensively packaged in a variety of protein
components, including histones and various regulatory proteins involved in the control
of gene expression (discussed in Chapter 4). Thus, duplication of a chromosome is
not simply a matter of replicating the DNA at its core but also requires the
duplication of these chromatin proteins and their proper assembly on the DNA.

The production of chromatin proteins increases during S phase to provide the
raw materials needed to package the newly synthesized DNA. Most importantly,
S-Cdks stimulate a large increase in the synthesis of the four histone subunits that
form the histone octamers at the core of each nucleosome. These subunits are
assembled into nucleosomes on the DNA by nucleosome assembly factors, which
typically associate with the replication fork and distribute nucleosomes on both
strands of the DNA as they emerge from the DNA synthesis machinery.

Chromatin packaging helps to control gene expression. In some parts of the
chromosome, the chromatin is highly condensed and is called heterochromatin,
whereas in other regions it has a more open structure and is called euchromatin
(discussed in Chapter 4). These differences in chromatin structure depend on a
variety of mechanisms, including modification of histone tails and the presence of
non-histone proteins. Because these differences are important in gene regulation,
Figure 17-18 Control of the initiation of DNA replication. The replication origin
is bound by the ORC throughout the cell cycle. In early G1, Cdc6 associates with
the ORC, and these proteins bind the DNA helicase, which contains six closely
related subunits called Mcm proteins. The helicase also associates with a protein
called Cdt1. Using energy provided by ATP hydrolysis, the ORC and Cdc6 proteins
load two copies of the DNA helicase, in an inactive form, around the DNA next to the
origin, thereby forming the prereplicative complex (preRC). At the onset of S
phase, S-Cdk stimulates the assembly of several initiator proteins on each DNA
helicase, while another protein kinase, DDK, phosphorylates subunits of the DNA
helicase. As a result, the DNA helicases are activated and unwind the DNA. DNA
polymerase and other replication proteins are recruited to the origin, and DNA
replication begins. The ORC is displaced by the replication machinery and then
rebinds. S-Cdk and other mechanisms also inactivate the preRC components
ORC, Cdc6, and Cdt1, thereby preventing formation of new preRCs at the origins until
the end of mitosis (see text).
[Figure 17-19 label: (C) sister chromatids]

Figure 17-19 Cohesin. Cohesin is a protein complex with four subunits. (A) Two
subunits, Smc1 and Smc3, are coiled-coil proteins with an ATPase domain at one
end; (B) two additional subunits, Scc1 and Scc3, connect the ATPase head domains,
forming a ring structure that may encircle the sister chromatids as shown in (C). The
ATPase domains are required for cohesin loading on the DNA.


it is crucial that chromatin structure, like the DNA within, is reproduced
accurately during S phase. How chromatin structure is reproduced is not well
understood, however. During DNA synthesis, histone-modifying enzymes and various
non-histone proteins are probably deposited onto the two new DNA strands as
they emerge from the replication fork, and these proteins are thought to
reproduce the local chromatin structure of the parent chromosome (see Figure 4-45).

Cohesins Hold Sister Chromatids Together 

At the end of S phase, each replicated chromosome consists of a pair of identical
sister chromatids glued together along their length. This sister-chromatid cohesion
sets the stage for a successful mitosis because it greatly facilitates the attachment
of the two sister chromatids to opposite poles of the mitotic spindle. Imagine
how difficult it would be to achieve this bipolar attachment if sister chromatids
were allowed to drift apart after S phase. Indeed, defects in sister-chromatid
cohesion (in yeast mutants, for example) lead inevitably to major errors in
chromosome segregation.

Sister-chromatid cohesion depends on a large protein complex called cohesin,
which is deposited at many locations along the length of each sister chromatid as
the DNA is replicated in S phase. Two of the subunits of cohesin are members of a
large family of proteins called SMC proteins (for Structural Maintenance of
Chromosomes). Cohesin forms giant ringlike structures, and it has been proposed that
these surround the two sister chromatids (Figure 17-19).

Sister-chromatid cohesion also results, at least in part, from DNA catenation,
the intertwining of sister DNA molecules that occurs when two replication forks
meet during DNA synthesis. The enzyme topoisomerase II gradually disentangles
the catenated sister DNAs between S phase and early mitosis by cutting one DNA
molecule, passing the other through the break, and then resealing the cut DNA
(see Figure 5-22). Once the catenation has been removed, sister-chromatid
cohesion depends primarily on cohesin complexes. The sudden and synchronous loss
of sister cohesion at the metaphase-to-anaphase transition therefore depends
primarily on disruption of these complexes, as we describe later.

Summary 

Duplication of the chromosomes in S phase involves the accurate replication of the
entire DNA molecule in each chromosome, as well as the duplication of the
chromatin proteins that associate with the DNA and govern various aspects of
chromosome function. Chromosome duplication is triggered by the activation of S-Cdk,
which activates proteins that unwind the DNA and initiate its replication at
replication origins. Once a replication origin is activated, S-Cdk also inhibits proteins
that are required to allow that origin to initiate DNA replication again. Thus, each
origin is fired once and only once in each S phase and cannot be reused until the
next cell cycle.
MITOSIS

Following the completion of S phase and transition through G2, the cell
undergoes the dramatic upheaval of M phase. This begins with mitosis, during which
the sister chromatids are separated and distributed (segregated) to a pair of
identical daughter nuclei, each with its own copy of the genome. Mitosis is
traditionally divided into five stages (prophase, prometaphase, metaphase, anaphase, and
telophase) defined primarily on the basis of chromosome behavior as seen in a
microscope. As mitosis is completed, the second major event of M phase,
cytokinesis, divides the cell into two halves, each with an identical nucleus. Panel 17-1
summarizes the major events of M phase (Movie 17.2, Movie 17.3, Movie 17.4,
and Movie 17.5).

From a regulatory point of view, mitosis can be divided into two major parts,
each governed by distinct components of the cell-cycle control system. First,
an abrupt increase in M-Cdk activity at the G2/M transition triggers the events
of early mitosis (prophase, prometaphase, and metaphase). During this period,
M-Cdk and several other mitotic protein kinases phosphorylate a variety of
proteins, leading to the assembly of the mitotic spindle and its attachment to the
sister-chromatid pairs. The second major part of mitosis begins at the
metaphase-to-anaphase transition, when the APC/C triggers the destruction of securin,
liberating a protease that cleaves cohesin and thereby initiates separation of the sister
chromatids. The APC/C also promotes the destruction of cyclins, which leads to
Cdk inactivation and the dephosphorylation of Cdk targets, which is required for
all events of late M phase, including the completion of anaphase, the disassembly
of the mitotic spindle, and the division of the cell by cytokinesis.

In this section, we describe the key mechanical events of mitosis and how
M-Cdk and the APC/C orchestrate them.

M-Cdk Drives Entry Into Mitosis 

One of the most remarkable features of cell-cycle control is that a single protein
kinase, M-Cdk, brings about all of the diverse and complex cell rearrangements
that occur in the early stages of mitosis. At a minimum, M-Cdk must induce the
assembly of the mitotic spindle and ensure that each sister chromatid in a pair
is attached to the opposite pole of the spindle. It also triggers chromosome
condensation, the large-scale reorganization of the intertwined sister chromatids into
compact, rodlike structures. In animal cells, M-Cdk also promotes the breakdown
of the nuclear envelope and rearrangements of the actin cytoskeleton and the
Golgi apparatus. Each of these processes is thought to be initiated when M-Cdk
phosphorylates specific proteins involved in the process, although most of these
proteins have not yet been identified.

M-Cdk does not act alone to phosphorylate key proteins involved in early
mitosis. Two additional families of protein kinases, the Polo-like kinases and the
Aurora kinases, also make important contributions to the control of early mitotic
events. The Polo-like kinase Plk, for example, is required for the normal assembly
of a bipolar mitotic spindle, in part because it phosphorylates proteins involved
in separation of the spindle poles early in mitosis. The Aurora kinase Aurora-A
also helps control proteins that govern the assembly and stability of the spindle,
whereas Aurora-B controls attachment of sister chromatids to the spindle, as we
discuss later.

Déphosphorylation Activâtes M-Cdk at the Onset of Mitosis 

M-Cdk activation begins with the accumulation of M-cyclin (cyclin B in vertebrate
cells; see Table 17-1). In embryonic cell cycles, the synthesis of M-cyclin is
constant throughout the cell cycle, and M-cyclin accumulation results from the high
stability of the protein in interphase. In most cell types, however, M-cyclin
synthesis increases during G2 and M, owing primarily to an increase in M-cyclin gene
transcription. The increase in M-cyclin protein leads to a corresponding
accumulation of M-Cdk (the complex of Cdk1 and M-cyclin) as the cell approaches
mitosis. Although the Cdk in these complexes is phosphorylated at an activating
site by the Cdk-activating kinase (CAK), as discussed earlier, the protein kinase
Wee1 holds it in an inactive state by inhibitory phosphorylation at two neighboring
sites (see Figure 17-13). Thus, by the time the cell reaches the end of G2, it
contains an abundant stockpile of M-Cdk that is primed and ready to act but is
suppressed by phosphates that block the active site of the kinase.

What, then, triggers the activation of the M-Cdk stockpile? The crucial event
is the activation of the protein phosphatase Cdc25, which removes the inhibitory
phosphates that restrain M-Cdk (Figure 17-20). At the same time, the inhibitory
activity of the kinase Wee1 is suppressed, further ensuring that M-Cdk activity
increases. The mechanisms that unleash Cdc25 activity in early mitosis are not
well understood. One possibility is that the S-Cdks that are active in G2 and early
prophase stimulate Cdc25.

Interestingly, Cdc25 can also be activated, at least in part, by its target, M-Cdk.
M-Cdk may also inhibit the inhibitory kinase Wee1. The ability of M-Cdk to
activate its own activator (Cdc25) and inhibit its own inhibitor (Wee1) suggests that
M-Cdk activation in mitosis involves positive feedback loops (see Figure 17-20).
According to this attractive model, the partial activation of Cdc25 (perhaps by
S-Cdk) leads to the partial activation of a subpopulation of M-Cdk complexes,
which then phosphorylate Cdc25 and Wee1 molecules. This leads to more M-Cdk
activation, and so on. Such a mechanism would quickly promote the activation of
all M-Cdk complexes in the cell. As mentioned earlier, similar molecular switches
operate at various points in the cell cycle to promote the abrupt and complete
transition from one cell-cycle state to the next.
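
The feedback logic described above can be made concrete with a toy calculation. The Python sketch below is illustrative only: the rate constants, the Hill-type response curve, and the function name simulate_mcdk are assumptions chosen for the example, not measured values or a published model. It follows the active fraction of M-Cdk when active M-Cdk stimulates Cdc25 and inhibits Wee1, and compares this with a hypothetical cell that lacks both feedback loops.

```python
def simulate_mcdk(feedback: bool, trigger: float = 0.02,
                  steps: int = 5000, dt: float = 0.01) -> list[float]:
    """Return the active fraction of M-Cdk (0 to 1) at each time step.

    All numbers are arbitrary illustrative units (assumptions, not data).
    """
    active = 0.0  # the M-Cdk stockpile starts fully inhibited
    trace = []
    for _ in range(steps):
        if feedback:
            # Hypothetical Hill-type response of Cdc25 and Wee1 to active M-Cdk.
            response = active**2 / (0.1**2 + active**2)
            cdc25 = trigger + response        # M-Cdk activates its activator
            wee1 = 0.5 * (1.0 - response)     # M-Cdk inhibits its inhibitor
        else:
            cdc25 = trigger                   # only the small initial trigger
            wee1 = 0.5                        # Wee1 stays fully active
        # Cdc25 converts inactive M-Cdk to active; Wee1 converts it back.
        active += dt * (cdc25 * (1.0 - active) - wee1 * active)
        trace.append(active)
    return trace


if __name__ == "__main__":
    for feedback in (False, True):
        final = simulate_mcdk(feedback)[-1]
        print(f"feedback={feedback}: final active fraction = {final:.3f}")
```

With the same small trigger (standing in for partial activation of Cdc25, perhaps by S-Cdk), the version with feedback switches almost completely on, whereas the version without feedback settles at a low level of activity.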

Condensin Helps Configure Duplicated Chromosomes for
Separation

At the end of S phase, the immensely long DNA molecules of the sister chromatids
are tangled in a mass of partially catenated DNA and proteins. Any attempt to
pull the sisters apart in this state would undoubtedly lead to breaks in the
chromosomes. To avoid this disaster, the cell devotes a great deal of energy in early
mitosis to gradually reorganizing the sister chromatids into relatively short,
distinct structures that can be pulled apart more easily in anaphase. These
chromosomal changes involve two processes: chromosome condensation, in which the
chromatids are dramatically compacted; and sister-chromatid resolution, whereby
the two sisters are resolved into distinct, separable units (Figure 17-21).
Resolution results from the decatenation of the sister DNAs, accompanied by the partial
removal of cohesin molecules along the chromosome arms. As a result, when the
cell reaches metaphase, the sister chromatids appear in the microscope as
compact, rodlike structures that are joined tightly at their centromeric regions and
only loosely along their arms.


Figure 17-20 The activation of M-Cdk.
Cdk1 associates with M-cyclin as the levels
of M-cyclin gradually rise. The resulting
M-Cdk complex is phosphorylated on an
activating site by the Cdk-activating kinase
(CAK) and on a pair of inhibitory sites by
the Wee1 kinase. The resulting inactive
M-Cdk complex is then activated at the
end of G2 by the phosphatase Cdc25.
Cdc25 is further stimulated by active
M-Cdk, resulting in positive feedback.
This feedback is enhanced by the ability of
M-Cdk to inhibit Wee1.

Figure 17-21 The mitotic chromosome.
Scanning electron micrograph of a
human mitotic chromosome, consisting
of two sister chromatids joined along their
length. The constricted regions are the
centromeres. (Courtesy of Terry D. Allen.)

PANEL 17-1: The Principal Stages of M Phase (Mitosis and Cytokinesis) in an Animal Cell



1 PROPHASE

At prophase, the replicated chromosomes, each consisting of two closely
associated sister chromatids, condense. Outside the nucleus, the mitotic spindle
assembles between the two centrosomes, which have replicated and moved apart.
For simplicity, only three chromosomes are shown. In diploid cells, there would be
two copies of each chromosome present. In the fluorescence micrograph,
chromosomes are stained orange and microtubules are green.

Figure labels: centrosome; intact nuclear envelope; forming mitotic spindle;
kinetochore; condensing replicated chromosome, consisting of two sister
chromatids held together along their length.

2 PROMETAPHASE

Prometaphase starts abruptly with the breakdown of the nuclear envelope.
Chromosomes can now attach to spindle microtubules via their kinetochores and
undergo active movement.

Figure labels: centrosome at spindle pole; fragments of nuclear envelope;
kinetochore microtubule; chromosome in active motion.

3 METAPHASE

At metaphase, the chromosomes are aligned at the equator of the spindle, midway
between the spindle poles. The kinetochore microtubules attach sister chromatids
to opposite poles of the spindle.

Figure labels: centrosome at spindle pole; kinetochore microtubule.

4 ANAPHASE

At anaphase, the sister chromatids synchronously separate to form two daughter
chromosomes, and each is pulled slowly toward the spindle pole it faces. The
kinetochore microtubules get shorter, and the spindle poles also move apart; both
processes contribute to chromosome segregation.

Figure labels: daughter chromosomes; shortening kinetochore microtubule;
spindle pole moving outward.

5 TELOPHASE

During telophase, the two sets of daughter chromosomes arrive at the poles of
the spindle and decondense. A new nuclear envelope reassembles around each
set, completing the formation of two nuclei and marking the end of mitosis. The
division of the cytoplasm begins with contraction of the contractile ring.

Figure labels: set of daughter chromosomes at spindle pole; contractile ring
starting to contract; interpolar microtubules; centrosome; nuclear envelope
reassembling around individual chromosomes.

6 CYTOKINESIS

During cytokinesis, the cytoplasm is divided in two by a contractile ring of
actin and myosin filaments, which pinches the cell in two to create two daughters,
each with one nucleus.

Figure labels: re-formation of interphase array of microtubules nucleated by
the centrosome.

(Micrographs courtesy of Julie Canman and Ted Salmon.)

Figure 17-22 Condensin. (A) Condensin is a
five-subunit protein complex that resembles
cohesin (see Figure 17-19). The ATPase head
domains of its two major subunits, Smc2 and
Smc4, are held together by three additional
subunits. (B) It is not clear how condensin
catalyzes the restructuring and compaction
of chromosome DNA, but it may form a ring
structure that encircles loops of DNA within each
sister chromatid.



The condensation and resolution of sister chromatids depend, at least in
part, on a five-subunit protein complex called condensin. Condensin structure
is related to that of the cohesin complex that holds sister chromatids together
(see Figure 17-19). It contains two SMC subunits like those of cohesin, plus three
non-SMC subunits (Figure 17-22). Condensin may form a ringlike structure that
somehow uses the energy provided by ATP hydrolysis to promote the compaction
and resolution of sister chromatids. Condensin is able to change the coiling of
DNA molecules in a test tube, and this coiling activity is thought to be important
for chromosome condensation during mitosis. Interestingly, phosphorylation
of condensin subunits by M-Cdk stimulates this coiling activity, providing one
mechanism by which M-Cdk may promote chromosome restructuring in early
mitosis.

The Mitotic Spindle Is a Microtubule-Based Machine 

The central event of mitosis, chromosome segregation, depends in all eukaryotes
on a complex and beautiful machine called the mitotic spindle (see Panel
17-1). The spindle is a bipolar array of microtubules, which pulls sister chromatids
apart in anaphase, thereby segregating the two sets of chromosomes to opposite
ends of the cell, where they are packaged into daughter nuclei (Movie 17.6).
M-Cdk triggers the assembly of the spindle early in mitosis, in parallel with the
chromosome restructuring just described. Before we consider how the spindle
assembles and how its microtubules attach to sister chromatids, we briefly review
the basic features of spindle structure.

The core of the mitotic spindle is a bipolar array of microtubules, the minus
ends of which are focused at the two spindle poles, and the plus ends of which
radiate outward from the poles (Figure 17-23). The plus ends of some microtubules,
called the interpolar microtubules, overlap with the plus ends of microtubules
from the other pole, resulting in an antiparallel array in the spindle midzone.
The plus ends of other microtubules, the kinetochore microtubules, are
attached to sister-chromatid pairs at large protein structures called kinetochores,
which are located at the centromere of each sister chromatid. Finally, many spindles
also contain astral microtubules that radiate outward from the poles and
contact the cell cortex, helping to position the spindle in the cell.

In most somatic animal cells, each spindle pole is focused at a protein organelle
called the centrosome (see Figures 16-47 and 16-48). Each centrosome
consists of a cloud of amorphous material (called the pericentriolar matrix) that
surrounds a pair of centrioles (Figure 17-24). The pericentriolar matrix nucleates
a radial array of microtubules, with their fast-growing plus ends projecting outward
and their minus ends associated with the centrosome. The matrix contains a
variety of proteins, including microtubule-dependent motor proteins, coiled-coil
proteins that link the motors to the centrosome, structural proteins, and components
of the cell-cycle control system. Most important, it contains γ-tubulin ring
complexes, which are the components mainly responsible for nucleating microtubules
(see Figure 16-46).

Some cells, notably the cells of higher plants and the oocytes of many vertebrates,
do not have centrosomes, and microtubule-dependent motor proteins
and other proteins associate with microtubule minus ends to organize and focus
the spindle poles.

Figure 17-23 The metaphase mitotic
spindle in an animal cell. The plus ends
of the microtubules project away from the
spindle pole, while the minus ends are
anchored at the spindle poles, which in this
example are organized by centrosomes.
Kinetochore microtubules connect the
spindle poles with the kinetochores of sister
chromatids, while interpolar microtubules
from the two poles interdigitate at the
spindle equator. Astral microtubules radiate
out from the poles into the cytoplasm.


Microtubule-Dependent Motor Proteins Govern Spindle Assembly 
and Function 

The function of the mitotic spindle depends on numerous microtubule-dependent
motor proteins. As discussed in Chapter 16, these proteins belong to two
families: the kinesin-related proteins, which usually move toward the plus end
of microtubules, and dyneins, which move toward the minus end. In the mitotic
spindle, these motor proteins generally operate at or near the ends of the
microtubules. Four major types of motor proteins (kinesin-5, kinesin-14, kinesins-4/10,
and dynein) are particularly important in spindle assembly and function (Figure
17-25).

Kinesin-5 proteins contain two motor domains that interact with the plus
ends of antiparallel microtubules in the spindle midzone. Because the two motor
domains move toward the plus ends of the microtubules, they slide the two
antiparallel microtubules past each other toward the spindle poles, pushing the
poles apart. Kinesin-14 proteins, by contrast, are minus-end directed motors
with a single motor domain and other domains that can interact with a neighboring
microtubule. They can cross-link antiparallel interpolar microtubules at the
spindle midzone and tend to pull the poles together. Kinesin-4 and kinesin-10



Figure 17-24 The centrosome. (A) Electron micrograph of an S-phase mammalian cell in culture, showing a duplicated centrosome. Each
centrosome contains a pair of centrioles; although the centrioles have duplicated, they remain together in a single complex, as shown in the drawing
of the micrograph in (B). One centriole of each centriole pair has been cut in cross section, while the other is cut in longitudinal section, indicating
that the two members of each pair are aligned at right angles to each other. The two halves of the replicated centrosome, each consisting of a
centriole pair surrounded by pericentriolar matrix, will split and migrate apart to initiate the formation of the two poles of the mitotic spindle when the
cell enters M phase. (A, from M. McGill, D.P. Highfield, T.M. Monahan, and B.R. Brinkley, J. Ultrastruct. Res. 57:43-53, 1976. With permission from
Academic Press.)

Figure 17-25 Major motor proteins of the
spindle. Four major classes of microtubule-dependent
motor proteins (yellow boxes)
contribute to spindle assembly and function
(see text). The colored arrows indicate the
direction of motor protein movement along
a microtubule: blue toward the minus end
and red toward the plus end.


proteins, also called chromokinesins, are plus-end directed motors that associate
with chromosome arms and push the attached chromosome away from the pole
(or the pole away from the chromosome). Finally, dyneins are minus-end directed
motors that, together with associated proteins, organize microtubules at various
locations in the cell. They link the plus ends of astral microtubules to components
of the actin cytoskeleton at the cell cortex, for example; by moving toward the
minus end of the microtubules, the dynein motors pull the spindle poles toward
the cell cortex and away from each other.
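
The four motor classes just described can be summarized as a small lookup table. The Python sketch below only restates the directions and effects given in the text; the class name SpindleMotor, its field names, and the helper motors_moving_toward are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpindleMotor:
    name: str
    moves_toward: str  # "plus" or "minus" end of the microtubule
    acts_at: str       # where the motor operates, as described in the text
    effect: str        # net effect on the spindle


# Hypothetical summary table; entries paraphrase the preceding paragraphs.
SPINDLE_MOTORS = (
    SpindleMotor("kinesin-5", "plus",
                 "antiparallel microtubules in the spindle midzone",
                 "pushes the poles apart"),
    SpindleMotor("kinesin-14", "minus",
                 "antiparallel interpolar microtubules in the spindle midzone",
                 "pulls the poles together"),
    SpindleMotor("kinesin-4/10", "plus",
                 "chromosome arms",
                 "pushes chromosomes away from the pole"),
    SpindleMotor("dynein", "minus",
                 "astral microtubule plus ends at the cell cortex",
                 "pulls the poles toward the cortex and away from each other"),
)


def motors_moving_toward(end: str) -> list[str]:
    """Names of the motors that walk toward the given microtubule end."""
    return [m.name for m in SPINDLE_MOTORS if m.moves_toward == end]


if __name__ == "__main__":
    for motor in SPINDLE_MOTORS:
        print(f"{motor.name}: {motor.moves_toward}-end directed, {motor.effect}")
```

Grouping the motors by direction makes the opposing roles easy to see: of the two midzone motors, kinesin-5 is plus-end directed and kinesin-14 is minus-end directed, and they act against each other.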

Multiple Mechanisms Collaborate in the Assembly of a Bipolar 
Mitotic Spindle 

The mitotic spindle must have two poles if it is to pull the two sets of sister chromatids
to opposite ends of the cell in anaphase. In most animal cells, several mechanisms
ensure the bipolarity of the spindle. One depends on centrosomes. A typical
animal cell enters mitosis with a pair of centrosomes, each of which nucleates
a radial array of microtubules. The two centrosomes provide prefabricated spindle
poles that greatly facilitate bipolar spindle assembly. The other mechanisms
depend on the ability of mitotic chromosomes to nucleate and stabilize microtubules
and on the ability of motor proteins to organize microtubules into a bipolar
array. These "self-organization" mechanisms can produce a bipolar spindle even
in cells lacking centrosomes.

We now describe the steps of spindle assembly, beginning with
centrosome-dependent assembly in early mitosis. We then consider the self-organization
mechanisms that do not require centrosomes and become particularly important
after nuclear-envelope breakdown.

Centrosome Duplication Occurs Early in the Cell Cycle 

Most animal cells contain a single centrosome that nucleates most of the cell's
cytoplasmic microtubules. The centrosome duplicates when the cell enters the
cell cycle, so that by the time the cell reaches mitosis there are two centrosomes.
Centrosome duplication begins at about the same time as the cell enters S phase.
The G1/S-Cdk (a complex of cyclin E and Cdk2 in animal cells; see Table 17-1) that
triggers cell-cycle entry also helps initiate centrosome duplication. The two centrioles
in the centrosome separate, and each nucleates the formation of a single
new centriole, resulting in two centriole pairs within an enlarged pericentriolar
matrix (Figure 17-26). This centrosome pair remains together on one side of the
nucleus until the cell enters mitosis.

There are interesting parallels between centrosome duplication and chromosome
duplication. Both use a semiconservative mechanism of duplication, in
which the two halves separate and serve as templates for construction of a new
half. Centrosomes, like chromosomes, must replicate once and only once per
cell cycle, to ensure that the cell enters mitosis with only two copies: an incorrect
number of centrosomes could lead to defects in spindle assembly and thus errors
in chromosome segregation.

The mechanisms that limit centrosome duplication to once per cell cycle are
uncertain. In many cell types, experimental inhibition of DNA synthesis blocks
centrosome duplication, providing one mechanism by which centrosome number
is kept in check. Other cell types, however, including those in the early embryos
of flies, sea urchins, and frogs, do not have such a mechanism and centrosome
duplication continues if chromosome duplication is blocked. It is not known how
such cells limit centrosome duplication to once per cell cycle.
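
The semiconservative character of centrosome duplication described above can be shown with a minimal bookkeeping sketch. It is a conceptual illustration only: the Centrosome class, the duplicate function, and the centriole names are hypothetical, and nothing here models the (uncertain) mechanism that limits duplication to once per cycle.

```python
from dataclasses import dataclass
from itertools import count
from typing import Iterator


@dataclass(frozen=True)
class Centrosome:
    older_centriole: str
    younger_centriole: str


def duplicate(centrosome: Centrosome,
              new_names: Iterator[str]) -> tuple[Centrosome, Centrosome]:
    """Semiconservative duplication: the two existing centrioles separate,
    and each one templates exactly one new centriole."""
    first = Centrosome(centrosome.older_centriole, next(new_names))
    second = Centrosome(centrosome.younger_centriole, next(new_names))
    return first, second


if __name__ == "__main__":
    names = (f"new-{i}" for i in count(1))   # hypothetical labels
    parent = Centrosome("old-A", "old-B")
    for daughter in duplicate(parent, names):
        print(daughter)
```

Each resulting centrosome keeps one pre-existing centriole and gains one newly built centriole, just as each daughter DNA duplex keeps one parental strand.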

M-Cdk Initiates Spindle Assembly in Prophase

Spindle assembly begins in early mitosis, when the two centrosomes move apart
along the nuclear envelope, pulled by dynein motor proteins that link astral
microtubules to the cell cortex (see Figure 17-25). The plus ends of the microtubules
between the centrosomes interdigitate to form the interpolar microtubules, and
kinesin-5 motor proteins associate with these microtubules and push the
centrosomes apart (see Figure 17-25). Also in early mitosis, the number of γ-tubulin ring
complexes in each centrosome increases greatly, increasing the ability of the
centrosomes to nucleate new microtubules, a process called centrosome maturation.

The balance of opposing forces generated by different types of motor proteins
determines the final length of the spindle. Dynein and kinesin-5 motors generally
promote centrosome separation and increase spindle length. Kinesin-14 proteins
do the opposite: they tend to pull the poles together (see Figure 17-25). It is not
clear how the cell regulates the balance of opposing forces to generate the
appropriate spindle length.

M-Cdk and other mitotic protein kinases are required for centrosome separation
and maturation. M-Cdk and Aurora-A phosphorylate kinesin-5 motors and
stimulate them to drive centrosome separation. Aurora-A and Plk also phosphorylate
components of the centrosome and thereby promote its maturation.

The Completion of Spindle Assembly in Animal Cells Requires 
Nuclear-Envelope Breakdown 

The centrosomes and microtubules of animal cells are located in the cytoplasm,
separated from the chromosomes by the double-membrane barrier of the nuclear
envelope (discussed in Chapter 12). Clearly, the attachment of sister-chromatid
pairs to the spindle requires the removal of this barrier. In addition, many of the
motor proteins and microtubule regulators that promote spindle assembly are
associated with the chromosomes inside the nucleus, and they require
nuclear-envelope breakdown to carry out their functions.

Nuclear-envelope breakdown is a complex, multistep process, which is
thought to begin when M-Cdk phosphorylates several subunits of the nuclear pore
complexes in the nuclear envelope. This phosphorylation initiates the disassembly
of nuclear pore complexes and their dissociation from the envelope. M-Cdk
also phosphorylates components of the nuclear lamina, the structural framework
beneath the envelope. The phosphorylation of these lamina components and of
several inner-nuclear-envelope proteins leads to disassembly of the nuclear lamina
and the breakdown of the envelope membranes into small vesicles.


Figure 17-26 Centriole replication. The
centrosome consists of a centriole pair and
associated pericentriolar matrix (green). At
a certain point in G1, the two centrioles of
the pair separate by a few micrometers.
During S phase, a daughter centriole
begins to grow near the base of each
mother centriole and at a right angle to it.
The elongation of the daughter centriole is
usually completed in G2. The two centriole
pairs remain close together in a single
centrosomal complex until the beginning
of M phase, when the complex splits in
two and the two daughter centrosomes
begin to separate. Each centrosome
now nucleates its own radial array of
microtubules (called an aster), mainly from
the mother centriole.

Microtubule Instability Increases Greatly in Mitosis 

Most animal cells in interphase contain a cytoplasmic array of microtubules
radiating out from the single centrosome. As discussed in Chapter 16, the microtubules
of this interphase array are in a state of dynamic instability, in which individual
microtubules are either growing or shrinking and stochastically switch between
the two states. The switch from growth to shrinkage is called a catastrophe, and
the switch from shrinkage to growth is called a rescue. New microtubules are
continually being created to balance the loss of those that disappear completely by
depolymerization.

Entry into mitosis signals an abrupt change in the cell's microtubules. The
interphase array of few, long microtubules radiating from the single centrosome is
converted to a larger number of shorter and more dynamic microtubules
emanating from both centrosomes. During prophase, and particularly in prometaphase
and metaphase (see Panel 17-1), the half-life of microtubules decreases
dramatically. This increase in microtubule instability, coupled with the increased ability
of centrosomes to nucleate microtubules as mentioned earlier, results in
remarkably dense and dynamic arrays of spindle microtubules that are ideally suited for
capturing sister chromatids.
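
Dynamic instability, as described above, lends itself to a simple stochastic sketch. The Python example below is a toy model under stated assumptions: the per-step catastrophe and rescue probabilities, the unit growth and shrinkage steps, and the function name mean_lifetime are all hypothetical, chosen only to show how a higher catastrophe frequency shortens microtubule lifetime. Mean lifetime is used here as a simple stand-in for the half-life mentioned in the text.

```python
import random


def mean_lifetime(p_catastrophe: float, p_rescue: float,
                  steps: int = 200_000, seed: int = 1) -> float:
    """Average number of steps between nucleation of a microtubule and its
    complete depolymerization, for one continuously re-nucleating site."""
    rng = random.Random(seed)
    length, growing, age = 0, True, 0
    lifetimes = []
    for _ in range(steps):
        age += 1
        if growing:
            length += 1                        # add one subunit
            if rng.random() < p_catastrophe:   # growth -> shrinkage
                growing = False
        else:
            length -= 1                        # lose one subunit
            if length == 0:
                # The microtubule has vanished; a new one is nucleated.
                lifetimes.append(age)
                length, growing, age = 0, True, 0
            elif rng.random() < p_rescue:      # shrinkage -> growth
                growing = True
    return sum(lifetimes) / len(lifetimes)


if __name__ == "__main__":
    # Hypothetical parameter sets: only the catastrophe frequency differs.
    print("interphase-like:", mean_lifetime(p_catastrophe=0.02, p_rescue=0.01))
    print("mitosis-like:   ", mean_lifetime(p_catastrophe=0.10, p_rescue=0.01))
```

In this sketch, raising only the catastrophe probability is enough to make individual microtubules shorter-lived, so the array turns over faster.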

Microtubule dynamics are controlled in the cell by a variety of regulatory
proteins, including microtubule-associated proteins (MAPs) that promote stability
and catastrophe factors that destabilize microtubule plus ends. Changes in the
activities of these regulatory proteins are responsible for the changes in
microtubule dynamics that occur during mitosis. Many of these changes result from
phosphorylation of specific proteins by M-Cdk and other mitotic protein kinases.

Mitotic Chromosomes Promote Bipolar Spindle Assembly 

Chromosomes are not just passive passengers in the process of spindle
assembly. By creating a local environment that favors both microtubule nucleation
and microtubule stabilization, they play an active part in spindle formation. The
influence of the chromosomes can be demonstrated by using a fine glass needle
to reposition them after the spindle has formed. For some cells in metaphase, if
a single chromosome is tugged out of alignment, a mass of new spindle
microtubules rapidly appears around the newly positioned chromosome, while the
spindle microtubules at the chromosome's former position depolymerize. This
property of the chromosomes seems to depend, at least in part, on a guanine
nucleotide exchange factor (GEF) that is bound to chromatin; the GEF stimulates
a small GTPase in the cytosol called Ran to bind GTP in place of GDP. The activated
Ran-GTP, which is also involved in nuclear transport (discussed in Chapter 12),
releases microtubule-stabilizing proteins from protein complexes in the cytosol,
thereby stimulating the local nucleation and stabilization of microtubules around
chromosomes (Figure 17-27). Local microtubule stabilization is also promoted
by the protein kinase Aurora-B, which associates with mitotic chromosomes.


Figure 17-27 Activation of the GTPase Ran around mitotic
chromosomes. The Ran protein, like other members of the small
GTPase family (discussed in Chapter 15), can exist in two conformations
depending on whether it is bound to GDP (inactive state) or GTP (active
state). The localization of active Ran in mitosis was determined using
a protein that emits fluorescence at a specific wavelength when it is
activated by Ran-GTP. In the metaphase human cell shown here, Ran
activity (yellow and red) is highest around the chromosomes, between
the poles of the mitotic spindle (indicated by asterisks). (From P. Kaláb
et al., Nature 440:697-701, 2006. With permission from Macmillan
Publishers Ltd.)

Figure 17-28 Spindle self-organization by motor proteins. Mitotic chromosomes stimulate the local activation of proteins
that nucleate and promote the formation of microtubules in the vicinity of the chromosomes. Kinesin-5 motor proteins (see
Figure 17-25) organize these microtubules into antiparallel bundles, while plus-end directed kinesins-4 and 10 link the
microtubules to chromosome arms and push minus ends away from the chromosomes. Dynein and kinesin-14 motors,
together with numerous other proteins, focus these minus ends into a pair of spindle poles.


The ability of chromosomes to stabilize and organize microtubules enables
cells to form bipolar spindles in the absence of centrosomes. Acentrosomal spindle
assembly is thought to begin with the formation of microtubules around the
chromosomes. Various motor proteins then organize the microtubules into a
bipolar spindle, as illustrated in Figure 17-28.

Cells that normally lack centrosomes, such as those of higher plants and
many animal oocytes, use this chromosome-based self-organization process to
form spindles. It is also the process used to assemble spindles in certain animal
embryos that have been induced to develop from eggs without fertilization (that
is, parthenogenetically); as the sperm normally provides the centrosome when it
fertilizes an egg, the mitotic spindles in these parthenogenetic embryos develop
without centrosomes (Figure 17-29). Even in cells that normally contain
centrosomes, the chromosomes help organize the spindle microtubules and, with the
help of various motor proteins, can promote the assembly of a bipolar mitotic
spindle if the centrosomes are removed. Although the resulting acentrosomal
spindle can segregate chromosomes normally, it lacks astral microtubules, which
are responsible for positioning the spindle in animal cells; as a result, the spindle
is often mispositioned in the cell.

Kinetochores Attach Sister Chromatids to the Spindle 

Following the assembly of a bipolar microtubule array, the second major step
in spindle formation is the attachment of the array to the sister-chromatid pairs.
Spindle microtubules become attached to each chromatid at its kinetochore,
a giant, multilayered protein structure that is built at the centromeric region of
the chromatid (Figure 17-30; also see Chapter 4). In metaphase, the plus ends
of kinetochore microtubules are embedded head-on in specialized
microtubule-attachment sites within the outer region of the kinetochore, furthest from the
DNA. The kinetochore of an animal cell can bind 10-40 microtubules, whereas
a budding yeast kinetochore can bind only one. Attachment of each microtubule
depends on multiple copies of a rod-shaped protein complex called the Ndc80
complex, which is anchored in the kinetochore at one end and interacts with the
sides of the microtubule at the other, thereby linking the microtubule to the
kinetochore while still allowing the addition or removal of tubulin subunits at this end
(Figure 17-31). Regulation of plus-end polymerization and depolymerization at
the kinetochore is critical for the control of chromosome movement on the
spindle, as we discuss later.

Kinetochore attachment to the spindle occurs by a complex sequence of
events. At the end of prophase in animal cells, the centrosomes of the growing
spindle generally lie on opposite sides of the nuclear envelope. Thus, when the
envelope breaks down, the sister-chromatid pairs are bombarded by microtubule
plus ends coming from two directions. However, the kinetochores do not instantly
achieve the correct 'end-on' microtubule attachment to both spindle poles.
Instead, detailed studies with light and electron microscopy show that most initial
attachments are unstable lateral attachments, in which a kinetochore attaches to
the side of a passing microtubule, with assistance from kinesin motor proteins in
the outer kinetochore. Soon, however, the dynamic microtubule plus ends capture
the kinetochores in the correct end-on orientation (Figure 17-32).

Figure 17-29 Bipolar spindle assembly
without centrosomes in parthenogenetic
embryos of the insect Sciara (or fungus
gnat). The microtubules are stained
green, the chromosomes red. The top
fluorescence micrograph shows a normal
spindle formed with centrosomes in a
normally fertilized Sciara embryo. The
bottom micrograph shows a spindle
formed without centrosomes in an
embryo that initiated development without
fertilization. Note that the spindle with
centrosomes has an aster at each pole of
the spindle, whereas the spindle formed
without centrosomes does not. Both
types of spindles are able to segregate
the replicated chromosomes. (From B. de
Saint Phalle and W. Sullivan, J. Cell Biol.
141:1383-1391, 1998. With permission
from The Rockefeller University Press.)

Another attachment mechanism also plays a part, particularly in the absence
of centrosomes. Careful microscopic analysis suggests that short microtubules
in the vicinity of the chromosomes become embedded in the plus-end-binding
sites of the kinetochore. Polymerization at these plus ends then results in growth
of the microtubules away from the kinetochore. The minus ends of these
kinetochore microtubules are eventually cross-linked to other minus ends and focused
by motor proteins at the spindle pole (see Figure 17-28).

Bi-orientation Is Achieved by Trial and Error 

The success of mitosis demands that sister chromatids in a pair attach to opposite
poles of the mitotic spindle, so that they move to opposite ends of the cell when


Figure 17-30 The kinetochore.
(A) A fluorescence micrograph of a
metaphase chromosome stained with a
DNA-binding fluorescent dye and with
human autoantibodies that react with
specific kinetochore proteins. The two
kinetochores, one associated with each
sister chromatid, are stained red.
(B) A drawing of a metaphase chromosome
showing its two sister chromatids
attached to the plus ends of kinetochore
microtubules. Each kinetochore forms a
plaque on the surface of the centromere.
(C) Electron micrograph of an anaphase
chromatid with microtubules attached to
its kinetochore. While most kinetochores
have a trilaminar structure, the one shown
here (from a green alga) has an unusually
complex structure with additional layers.
(A, courtesy of B.R. Brinkley; C, from
J.D. Pickett-Heaps and L.C. Fowke,
Aust. J. Biol. Sci. 23:71-92, 1970. With
permission from CSIRO.)



100 nm 

Figure 17-31 Microtubule attachment sites in the kinetochore. (A) In this electron micrograph of a mammalian kinetochore, the chromosome
is on the right, and the plus ends of multiple microtubules are embedded in the outer kinetochore on the left. (B) Electron tomography (discussed
in Chapter 9) was used to construct a low-resolution three-dimensional image of the outer kinetochore in (A). Several microtubules (in multiple
colors) are embedded in fibrous material of the kinetochore, which is thought to be composed of the Ndc80 complex and other proteins. (C) Each
microtubule is attached to the kinetochore by interactions with multiple copies of the Ndc80 complex (blue). This complex binds to the sides of the
microtubule near its plus end, allowing polymerization and depolymerization to occur while the microtubule remains attached to the kinetochore.

(A and B, from Y. Dong et al., Nature Cell Biol. 9:516-522, 2007. With permission from Macmillan Publishers Ltd.) 
Figure 17-32 Chromosome attachment to the mitotic spindle in animal cells. (A) In late prophase of most animal cells, the
mitotic spindle poles have moved to opposite sides of the nuclear envelope, with an array of overlapping microtubules between
them. (B) Following nuclear envelope breakdown, the sister-chromatid pairs are exposed to the large number of dynamic plus
ends of microtubules radiating from the spindle poles. In most cases, the kinetochores are first attached to the sides of these
microtubules, while at the same time the arms of the chromosomes are pushed outward from the spindle interior, preventing
the arms from blocking microtubule access to the kinetochores. (C) Eventually, the laterally attached sister chromatids are
arranged in a ring around the outside of the spindle. Most of the microtubules are concentrated in this ring, so that the spindle
is relatively hollow inside. (D) Dynamic microtubule plus ends eventually encounter the kinetochores in an end-on orientation and
are captured and stabilized. (E) Stable end-on attachment to both poles results in bi-orientation. Additional microtubules are
attached to the kinetochore, resulting in a kinetochore fiber containing 10-40 microtubules.


they separate in anaphase. How is this mode of attachment, called bi-orientation,
achieved? What prevents the attachment of both kinetochores to the same
spindle pole or the attachment of one kinetochore to both spindle poles? Part of
the answer is that sister kinetochores are constructed in a back-to-back orientation
that reduces the likelihood that both kinetochores can face the same spindle
pole. Nevertheless, incorrect attachments do occur, and elegant regulatory
mechanisms have evolved to correct them.

Incorrect attachments are corrected by a system of trial and error that is based
on a simple principle: incorrect attachments are highly unstable and do not last,
whereas correct attachments become locked in place. How does the kinetochore
sense a correct attachment? The answer appears to be tension (Figure 17-33).
When a sister-chromatid pair is properly bi-oriented on the spindle, the two
kinetochores are pulled in opposite directions by strong poleward forces.
Sister-chromatid cohesion resists these poleward forces, creating high levels of tension within
the kinetochores. When chromosomes are incorrectly attached (when both sister
chromatids are attached to the same spindle pole, for example), tension is low


Figure 17-33 Alternative forms of kinetochore attachment to the spindle poles. (A) Initially, a single microtubule
from a spindle pole binds to one kinetochore in a sister-chromatid pair. Additional microtubules can then bind to
the chromosome in various ways. (B) A microtubule from the same spindle pole can attach to the other sister
kinetochore, or (C) microtubules from both spindle poles can attach to the same kinetochore. These incorrect attachments
are unstable, however, so that one of the two microtubules tends to dissociate. (D) When a microtubule from the opposite
pole binds to the second kinetochore, the sister kinetochores are thought to sense tension across their
microtubule-binding sites. This triggers an increase in microtubule binding affinity, thereby locking
the correct attachment in place.




and the kinetochore sends an inhibitory signal that loosens the grip of its microtubule
attachment site, allowing detachment to occur. When bi-orientation occurs,
the high tension at the kinetochore shuts off the inhibitory signal, strengthening
microtubule attachment. In animal cells, tension not only increases the affinity of
the attachment site but also leads to the attachment of additional microtubules
to the kinetochore. This results in the formation of a thick kinetochore fiber
composed of multiple microtubules.

The tension-sensing mechanism depends on the protein kinase Aurora-B,
which is associated with the kinetochore and is thought to generate the inhibitory
signal that reduces the strength of microtubule attachment in the absence
of tension. It phosphorylates several components of the microtubule attachment
site, including the Ndc80 complex, decreasing the site's affinity for a microtubule
plus end. When bi-orientation occurs, the resulting tension somehow reduces
phosphorylation by Aurora-B, thereby increasing the affinity of the attachment
site (Figure 17-34).
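
The trial-and-error principle described above can be illustrated with a small stochastic sketch. It is only a toy model under stated assumptions, not a description of the real kinetics: the detachment probability is an arbitrary illustrative value, each kinetochore is assumed to bind microtubules from only one pole at a time (so the attachment of one kinetochore to both poles is ignored), and the names `is_bioriented` and `steps_to_biorient` are hypothetical.

```python
import random

# Assumed, illustrative probability that a low-tension attachment is released
# in one time step (standing in for the destabilizing signal from Aurora-B).
P_DETACH_LOW_TENSION = 0.5


def is_bioriented(k1, k2):
    """True when both sister kinetochores are attached, and to opposite poles."""
    return k1 is not None and k2 is not None and k1 != k2


def steps_to_biorient(rng, max_steps=10_000):
    """Count the time steps until one sister-chromatid pair becomes bi-oriented."""
    attached_to = [None, None]  # pole ("L" or "R") bound by each kinetochore
    for step in range(1, max_steps + 1):
        # Trial: every unattached kinetochore captures a microtubule
        # from a randomly chosen pole.
        for i in range(2):
            if attached_to[i] is None:
                attached_to[i] = rng.choice("LR")
        # Correct attachment: tension is high, so the attachment is locked in.
        if is_bioriented(*attached_to):
            return step
        # Error: tension is low, so each attachment may be released and retried.
        for i in range(2):
            if rng.random() < P_DETACH_LOW_TENSION:
                attached_to[i] = None
    raise RuntimeError("pair did not bi-orient within max_steps")


if __name__ == "__main__":
    rng = random.Random(0)
    trials = [steps_to_biorient(rng) for _ in range(1000)]
    print("mean steps to bi-orientation:", sum(trials) / len(trials))
```

Because only incorrect attachments are released, every pair eventually reaches the bi-oriented state in this sketch; changing `P_DETACH_LOW_TENSION` changes how quickly errors are corrected, not the final outcome.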

Following their attachment to the two spindle poles, the chromosomes are
tugged back and forth, eventually assuming a position equidistant between the
two poles, a position called the metaphase plate. In vertebrate cells, the
chromosomes then oscillate gently at the metaphase plate, awaiting the signal for the
sister chromatids to separate. The signal is produced, with a predictable lag time,
after the bi-oriented attachment of the last of the chromosomes.


Figure 17-34 How tension might increase microtubule attachment to the kinetochore. These diagrams illustrate
one speculative mechanism by which bi-orientation might increase microtubule attachment to the kinetochore. A single
kinetochore is shown for clarity; the spindle pole is on the right. (A) When a sister-chromatid pair is unattached to the spindle
or attached to just one spindle pole, there is little tension between the outer and inner kinetochores. The protein kinase Aurora-B
is tethered to the inner kinetochore and phosphorylates the microtubule attachment sites, including the Ndc80 complex
(blue), in the outer kinetochore as shown, thereby reducing the affinity of microtubule binding. Microtubules therefore associate
and dissociate rapidly, and attachment is unstable. (B) When bi-orientation is achieved, the forces pulling the kinetochore
toward the spindle pole are resisted by forces pulling the other sister kinetochore toward the opposite pole, and the resulting
tension pulls the outer kinetochore away from the inner kinetochore. As a result, Aurora-B is unable to reach the outer
kinetochore, and microtubule attachment sites are not phosphorylated. Microtubule binding affinity is therefore increased,
resulting in the stable attachment of multiple microtubules to both kinetochores. The dephosphorylation of outer kinetochore
proteins depends on a phosphatase that is not shown here.


Multiple Forces Act on Chromosomes in the Spindle 

Multiple mechanisms generate the forces that move chromosomes back and forth
after they are attached to the spindle, and produce the tension that is so important 
for the stabilization of correct attachments. In anaphase, similar forces pull the 
separated chromatids to opposite ends of the spindle. Three major spindle forces 
are particularly critical, although their strength and importance vary at different 
stages of mitosis. 

The first major force pulls the kinetochore and its associated chromatid along
the kinetochore microtubule toward the spindle pole. It is produced by proteins at
the kinetochore itself. By an uncertain mechanism, depolymerization at the plus
end of the microtubule generates a force that pulls the kinetochore poleward. This
force pulls on chromosomes during prometaphase and metaphase but is particularly
important for moving sister chromatids toward the poles after they separate
in anaphase. Interestingly, this kinetochore-generated poleward force does not
require ATP or motor proteins. This might seem implausible at first, but it has
been shown that purified kinetochores in a test tube, with no ATP present, can
remain attached to depolymerizing microtubules and thereby move. The energy
that drives the movement is stored in the microtubule and is released when the
microtubule depolymerizes; it ultimately comes from the hydrolysis of GTP that
occurs after a tubulin subunit adds to the end of a microtubule (discussed in
Chapter 16).
How does plus-end depolymerization drive the kinetochore toward the pole?
As we discussed earlier (see Figure 17-31C), Ndc80 complexes in the kinetochore
make multiple low-affinity attachments along the side of the microtubule.
Because the attachments are constantly breaking and re-forming at new sites, the
kinetochore remains attached to a microtubule even as the microtubule depolymerizes.
In principle, this could move the kinetochore toward the spindle pole.

A second poleward force is provided in some cell types by microtubule flux,
whereby the microtubules themselves are pulled toward the spindle poles and
dismantled at their minus ends. The mechanism underlying this poleward movement
is not clear, although it might depend on forces generated by motor proteins
and minus-end depolymerization at the spindle pole. In metaphase, the addition
of new tubulin at the plus end of a microtubule compensates for the loss of tubulin
at the minus end, so that microtubule length remains constant despite the movement
of microtubules toward the spindle pole (Figure 17-35). Any kinetochore
that is attached to a microtubule undergoing such flux experiences a poleward
force, which contributes to the generation of tension at the kinetochore in metaphase.
Together with the kinetochore-based forces discussed above, flux also contributes
to the poleward forces that move sister chromatids after they separate in
anaphase.
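
The bookkeeping behind flux at constant length can be made explicit with a short calculation. The sketch below is only illustrative: the flux rate of 0.75 μm/min is the value reported for newt lung cells in Figure 17-35, whereas the 10 μm starting length, the 4-minute interval, and the function names are assumptions chosen for the example.

```python
def microtubule_length(initial_um, plus_end_gain, minus_end_loss, minutes):
    """Length (um) after `minutes`, given subunit gain at the plus end and
    loss at the minus end, both expressed in um/min."""
    return initial_um + (plus_end_gain - minus_end_loss) * minutes


def speckle_displacement(flux_rate, minutes):
    """Poleward distance (um) traveled by a mark on the microtubule lattice."""
    return flux_rate * minutes


if __name__ == "__main__":
    flux = 0.75  # um/min, from Figure 17-35
    # Metaphase: plus-end addition balances minus-end loss.
    print(microtubule_length(10.0, flux, flux, 4))  # length is unchanged
    print(speckle_displacement(flux, 4))            # yet the lattice has moved
```

The point of the example is that the two quantities are independent: a speckle moves poleward at the flux rate even though the length of the microtubule does not change, as long as gain at the plus end matches loss at the minus end.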

A third force acting on chromosomes is the polar ejection force, or polar wind.
Plus-end directed kinesin-4 and kinesin-10 motors on chromosome arms interact with
interpolar microtubules and transport the chromosomes away from the spindle
poles (see Figure 17-25). This force is particularly important in prometaphase and
metaphase, when it helps push chromosome arms out from the spindle. This force
might also help align the sister-chromatid pairs at the metaphase plate (Figure
17-36).

One of the most striking aspects of mitosis in vertebrate cells is the continuous
oscillatory movement of the chromosomes in prometaphase and metaphase.
When studied by video microscopy in newt lung cells, the movements are seen to
switch between two states: a poleward state, when the chromosomes are pulled
toward the pole, and an away-from-the-pole, or neutral, state, when the poleward
forces are turned off and the polar ejection force pushes the chromosomes away
Figure 17-35 Microtubule flux in the metaphase spindle. (A) To observe microtubule flux, a very small amount of fluorescent tubulin is injected
into living cells so that individual microtubules form with a very small proportion of fluorescent tubulin. Such microtubules have a speckled
appearance when viewed by fluorescence microscopy. (B) Fluorescence micrograph of a mitotic spindle in a living newt lung epithelial cell. The
chromosomes are colored brown, and the tubulin speckles are red. (C) The movement of individual speckles can be followed by time-lapse video
microscopy. Images of the thin vertical boxed region (arrow) in (B), taken at sequential times, show that individual speckles move toward the poles
at a rate of about 0.75 μm/min, indicating that the microtubules are moving poleward. (D) Microtubule length in the metaphase spindle does not
change significantly because new tubulin subunits are added at the microtubule plus end at the same rate as tubulin subunits are removed from the
minus end. (B and C, from T.J. Mitchison and E.D. Salmon, Nat. Cell Biol. 3:E17-21, 2001. With permission from Macmillan Publishers Ltd.)
from the pole. The switch between the two states may depend on the degree of
tension in the kinetochore. It has been proposed, for example, that, as chromosomes
move toward a spindle pole, an increasing polar ejection force generates
tension in the kinetochore nearest the pole, triggering a switch to the
away-from-the-pole state and gradually resulting in the accumulation of chromosomes at the
equator of the spindle.


Figure 17-36 How opposing forces may drive chromosomes to the metaphase plate. (A) Evidence for a polar ejection
force that pushes chromosomes away from the spindle poles toward the spindle equator. In this experiment, a laser beam
severs a prometaphase chromosome that is attached to a single pole by a kinetochore microtubule. The part of
the severed chromosome without a kinetochore is pushed rapidly away from the pole, whereas the part with
the kinetochore moves toward the pole, reflecting a decreased repulsion. (B) A model of how two opposing forces may
cooperate to move chromosomes to the metaphase plate. Plus-end-directed motor proteins (kinesin-4 and kinesin-10) on the
chromosome arms are thought to interact with microtubules to generate the polar ejection force, which pushes chromosomes
toward the spindle equator (see Figure 17-25). Poleward forces generated by depolymerization at the kinetochore,
together with microtubule flux, are thought to pull chromosomes toward the pole.


The APC/C Triggers Sister-Chromatid Separation and the
Completion of Mitosis

After M-Cdk has triggered the complex processes leading up to metaphase, the
cell cycle reaches its climax with the separation of the sister chromatids at the
metaphase-to-anaphase transition (Figure 17-37). Although M-Cdk activity sets
the stage for this event, the anaphase-promoting complex (APC/C) discussed earlier
throws the switch that initiates sister-chromatid separation by ubiquitylating
several mitotic regulatory proteins and thereby triggering their destruction (see
Figure 17-15A).

During metaphase, cohesins holding the sister chromatids together resist the
poleward forces that pull the sister chromatids apart. Anaphase begins with the
sudden loss of sister-chromatid cohesion, which allows the sisters to separate
and move to opposite poles of the spindle. The APC/C initiates the process by
targeting the inhibitory protein securin for destruction. Before anaphase, securin
Figure 17-37 Sister-chromatid separation at anaphase. In the transition from metaphase (A) to anaphase (B), sister chromatids suddenly
and synchronously separate and move toward opposite poles of the mitotic spindle, as shown in these light micrographs of Haemanthus (lily)
endosperm cells that were stained with gold-labeled antibodies against tubulin. (Courtesy of Andrew Bajer.)
Figure 17-38 The initiation of sister-chromatid separation by the APC/C. The activation of APC/C by Cdc20 leads
to the ubiquitylation and destruction of securin, which normally holds separase in an inactive state. The destruction of
securin allows separase to cleave Scc1, a subunit of the cohesin complex holding the sister chromatids together (see Figure
17-19). The pulling forces of the mitotic spindle then pull the sister chromatids apart. In animal cells, phosphorylation by
Cdks also inhibits separase (not shown). Thus, Cdk inactivation in anaphase (resulting from cyclin destruction) also
promotes separase activation by allowing its dephosphorylation.


binds to and inhibits the activity of a protease called separase. The destruction 
of securin at the end of metaphase releases separase, which is then free to cleave 
one of the subunits of cohesin. The cohesins fall away, and the sister chromatids 
separate (Figure 17-38). 

In addition to securin, the APC/C also targets the S- and M-cyclins for destruction,
leading to the loss of most Cdk activity in anaphase. Cdk inactivation allows
phosphatases to dephosphorylate the many Cdk target substrates in the cell, as 
required for the completion of mitosis and cytokinesis. 

If the APC/C triggers anaphase, what activates the APC/C? The answer is
only partly known. As mentioned earlier, APC/C activation requires binding to
the protein Cdc20 (see Figure 17-15A). At least two processes regulate Cdc20
and its association with the APC/C. First, Cdc20 synthesis increases as the cell
approaches mitosis, owing to an increase in the transcription of its gene. Second,
phosphorylation of the APC/C helps Cdc20 bind to the APC/C, thereby helping to
create an active complex. Among the kinases that phosphorylate and thus activate
the APC/C is M-Cdk. Thus, M-Cdk not only triggers the early mitotic events leading
up to metaphase, but it also sets the stage for progression into anaphase. The
ability of M-Cdk to promote Cdc20-APC/C activity creates a negative feedback
loop: M-Cdk sets in motion a regulatory process that leads to cyclin destruction
and thus its own inactivation.
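
The logic of this negative feedback loop can be sketched with a two-variable toy model. Everything quantitative here is an assumption made for illustration: the rate constants are arbitrary, M-Cdk activity is taken to be proportional to the M-cyclin level, Cdc20-APC/C is represented by a single active fraction between 0 and 1, and the function name `simulate_feedback` is hypothetical. The model is not fitted to any measurement and omits the other controls discussed in this chapter.

```python
def simulate_feedback(steps=200, dt=0.1, synthesis=1.0,
                      k_act=1.0, k_deg=5.0, k_off=0.2):
    """Euler integration of a toy M-Cdk / APC/C negative feedback loop.

    Returns the M-cyclin level at each time step (arbitrary units).
    """
    cyclin = 0.0  # M-cyclin level; M-Cdk activity is assumed to track it
    apc = 0.0     # fraction of APC/C in the active, Cdc20-bound form
    history = []
    for _ in range(steps):
        m_cdk = cyclin
        # Cyclin is made at a constant rate and destroyed by active APC/C.
        d_cyclin = synthesis - k_deg * apc * cyclin
        # M-Cdk activates APC/C; active APC/C slowly reverts to inactive.
        d_apc = k_act * m_cdk * (1.0 - apc) - k_off * apc
        cyclin += d_cyclin * dt
        apc += d_apc * dt
        history.append(cyclin)
    return history


if __name__ == "__main__":
    levels = simulate_feedback()
    print("peak cyclin level:", max(levels))
    print("final cyclin level:", levels[-1])
```

In this sketch the cyclin level first rises, then falls as the APC/C that it activated begins to destroy it, which is the qualitative behavior the text describes: M-Cdk promotes its own inactivation.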

Unattached Chromosomes Block Sister-Chromatid Separation:
The Spindle Assembly Checkpoint

Drugs that destabilize microtubules, such as colchicine or vinblastine (discussed
in Chapter 16), arrest cells in mitosis for hours or even days. This observation led
to the identification of a spindle assembly checkpoint mechanism that is activated
by the drug treatment and blocks progression through the metaphase-to-anaphase
transition. The checkpoint mechanism ensures that cells do not enter
anaphase until all chromosomes are correctly bi-oriented on the mitotic spindle.

The spindle assembly checkpoint depends on a sensor mechanism that
monitors the strength of microtubule attachment at the kinetochore, possibly
by sensing tension as described earlier (see Figure 17-34). Any kinetochore
that is not properly attached to the spindle sends out a diffusible negative signal
that blocks Cdc20-APC/C activation throughout the cell and thus blocks the
metaphase-to-anaphase transition. When the last sister-chromatid pair is properly
bi-oriented, this block is removed, allowing sister-chromatid separation to
occur.
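
The decision rule of the checkpoint is simple enough to state as a few lines of code. The sketch below captures only the logic given above (a single improperly attached kinetochore is enough to block anaphase); it says nothing about the molecular mechanism, and the function names and the number of kinetochores in the usage example are hypothetical.

```python
def checkpoint_signal_active(properly_attached):
    """True while at least one kinetochore is not properly attached.

    `properly_attached` holds one boolean per kinetochore in the cell.
    """
    return not all(properly_attached)


def anaphase_permitted(properly_attached):
    """Cdc20-APC/C can be activated only when no kinetochore signals 'wait'."""
    return not checkpoint_signal_active(properly_attached)


if __name__ == "__main__":
    kinetochores = [True] * 91 + [False]     # one kinetochore still unattached
    print(anaphase_permitted(kinetochores))  # False: the block persists
    kinetochores[-1] = True                  # the last pair becomes bi-oriented
    print(anaphase_permitted(kinetochores))  # True: the block is removed
```

The rule is an all-or-none test over every kinetochore, which is why the signal from a single unattached kinetochore can hold the whole cell in metaphase.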

The negative checkpoint signal depends on several proteins, including
Mad2, which are recruited to unattached kinetochores (Figure 17-39). Detailed
structural analyses of Mad2 suggest that the unattached kinetochore acts like
an enzyme that catalyzes a change in the conformation of Mad2, so that Mad2,
together with other proteins, can bind and inhibit Cdc20-APC/C.

In mammalian somatic cells, the spindle assembly checkpoint determines
the normal timing of anaphase. The destruction of securin in these cells begins
moments after the last sister-chromatid pair becomes bi-oriented on the spindle,
and anaphase begins about 20 minutes later. Experimental inhibition of the
checkpoint mechanism causes premature sister-chromatid separation and anaphase.
Surprisingly, the normal timing of anaphase does not depend on the spindle
assembly checkpoint in some cells, such as yeasts and the cells of early frog
and fly embryos. Other mechanisms, as yet unknown, must determine the timing
of anaphase in these cells.

Chromosomes Segregate in Anaphase A and B 

The sudden loss of sister-chromatid cohesion at the onset of anaphase leads to
sister-chromatid separation, which allows the forces of the mitotic spindle to pull the
sisters to opposite poles of the cell, a process called chromosome segregation. The
chromosomes move by two independent and overlapping processes. The first, anaphase
A, is the initial poleward movement of the chromosomes, which is accompanied
by shortening of the kinetochore microtubules. The second, anaphase B, is the
separation of the spindle poles themselves, which begins after the sister chromatids
have separated and the daughter chromosomes have moved some distance
apart (Figure 17-40).

Chromosome movement in anaphase A depends on a combination of the two
major poleward forces described earlier. The first is the force generated by microtubule
depolymerization at the kinetochore, which results in the loss of tubulin
subunits at the plus end as the kinetochore moves toward the pole. The second
is provided by microtubule flux, which is the poleward movement of the microtubules
toward the spindle pole, where minus-end depolymerization occurs. The
relative importance of these two forces during anaphase varies in different cell
types: in embryonic cells, chromosome movement depends mainly on microtubule
flux, for example, whereas movement in yeast and vertebrate somatic cells
results primarily from forces generated at the kinetochore.

Spindle-pole separation during anaphase B depends on motor-driven mechanisms
similar to those that separate the two centrosomes in early mitosis.
Plus-end directed kinesin-5 motor proteins, which cross-link the overlapping plus
ends of the interpolar microtubules, push the poles apart. In addition, dynein
motors that anchor astral microtubule plus ends to the cell cortex pull the poles
apart (see Figure 17-25).

Although sister-chromatid separation initiates the chromosome movements
of anaphase A, other mechanisms also ensure correct chromosome movements
in anaphase A and spindle elongation in anaphase B. Most importantly,
the completion of a normal anaphase depends on the dephosphorylation of Cdk
substrates, which in most cells results from the APC/C-dependent destruction of
cyclins. If M-cyclin destruction is prevented (by the production of a mutant form
that is not recognized by the APC/C, for example), sister-chromatid separation
generally occurs, but the chromosome movements and microtubule behavior of
anaphase are abnormal.

The relative contributions of anaphase A and anaphase B to chromosome segregation
vary greatly, depending on the cell type. In mammalian cells, anaphase
B begins shortly after anaphase A and stops when the spindle is about twice its
metaphase length; in contrast, the spindles of yeasts and certain protozoa primarily
use anaphase B to separate the chromosomes at anaphase, and their spindles
elongate to up to 15 times their metaphase length.
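
The difference in anaphase B between cell types is easiest to appreciate as a worked calculation. In the sketch below, the fold-elongation values (about 2 for mammalian cells, up to 15 for yeasts and certain protozoa) come from the text, whereas the metaphase spindle lengths and the function name are assumptions picked only to make the arithmetic concrete.

```python
def final_spindle_length(metaphase_length_um, fold_elongation):
    """Spindle length (um) at the end of anaphase B."""
    return metaphase_length_um * fold_elongation


if __name__ == "__main__":
    # Metaphase lengths are assumed values for illustration only.
    mammalian = final_spindle_length(10.0, 2)  # about twice metaphase length
    yeast = final_spindle_length(2.0, 15)      # up to 15 times metaphase length
    print("mammalian spindle:", mammalian, "um")
    print("yeast spindle (upper bound):", yeast, "um")
```

With these assumed starting lengths, a small spindle that elongates 15-fold ends up longer than a large spindle that only doubles, which illustrates why anaphase B can dominate chromosome segregation in some organisms.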



Figure 17-39 Mad2 protein on unattached kinetochores. This fluorescence micrograph shows a
mammalian cell in prometaphase, with the mitotic spindle in green and the sister chromatids in blue. One sister-chromatid
pair is attached to only one pole of the spindle. Staining with anti-Mad2 antibodies indicates that Mad2 is bound to the
kinetochore of the unattached sister chromatid (red dot, indicated by red arrow). A small amount of Mad2 is
associated with the kinetochore of the sister chromatid that is attached to the spindle pole (pale dot, indicated by white
arrow). (From J.C. Waters et al., J. Cell Biol. 141:1181-1191, 1998. With permission from the authors.)
Segregated Chromosomes Are Packaged in Daughter Nuclei at
Telophase

By the end of anaphase, the daughter chromosomes have segregated into two
equal groups at opposite ends of the cell. In telophase, the final stage of mitosis,
the two sets of chromosomes are packaged into a pair of daughter nuclei. The first
major event of telophase is the disassembly of the mitotic spindle, followed by
the re-formation of the nuclear envelope. Initially, nuclear membrane fragments
associate with the surface of individual chromosomes. These membrane fragments
fuse to partly enclose clusters of chromosomes and then coalesce to re-form
the complete nuclear envelope. Nuclear pore complexes are incorporated
into the envelope, the nuclear lamina re-forms, and the envelope once again
becomes continuous with the endoplasmic reticulum. Once the nuclear envelope
has re-formed, the pore complexes pump in nuclear proteins, the nucleus
expands, and the mitotic chromosomes are reorganized into their interphase
state, allowing gene transcription to resume. A new nucleus has been created, and
mitosis is complete. All that remains is for the cell to complete its division into two.

We saw earlier that phosphorylation of various proteins by M-Cdk promotes
spindle assembly, chromosome condensation, and nuclear-envelope breakdown
in early mitosis. It is thus not surprising that the dephosphorylation of these same
proteins is required for spindle disassembly and the re-formation of daughter
nuclei in telophase. In principle, these dephosphorylations and the completion
of mitosis could be triggered by the inactivation of Cdks, the activation of
phosphatases, or both. Although Cdk inactivation (resulting primarily from cyclin
destruction) is mainly responsible in most cells, some cells also rely on activation
of phosphatases. In budding yeast, for example, the completion of mitosis
depends on the activation of a phosphatase called Cdc14, which dephosphorylates
a subset of Cdk substrates involved in anaphase and telophase.


Figure 17-40 The two processes of anaphase in mammalian cells. Separated
sister chromatids move toward the poles in anaphase A. In anaphase B, the two
spindle poles move apart.


Summary 

M-Cdk triggers the events of early mitosis, including chromosome condensation,
assembly of the mitotic spindle, and bipolar attachment of the sister-chromatid
pairs to microtubules of the spindle. Spindle formation in animal cells depends
largely on the ability of mitotic chromosomes to stimulate local microtubule nucleation
and stability, as well as on the ability of motor proteins to organize microtubules
into a bipolar array. Many cells also use centrosomes to facilitate spindle
assembly. Anaphase is triggered by the APC/C, which stimulates the destruction of
the proteins that hold the sister chromatids together. APC/C also promotes cyclin
destruction and thus the inactivation of M-Cdk. The resulting dephosphorylation of
Cdk targets is required for the events that complete mitosis, including the disassembly
of the spindle and the re-formation of the nuclear envelope.

CYTOKINESIS 

The final step in the cell cycle is cytokinesis, the division of the cytoplasm in two.
In most cells, cytokinesis follows every mitosis, although some cells, such as early
Drosophila embryos and some mammalian hepatocytes and heart muscle cells,
undergo mitosis without cytokinesis and thereby acquire multiple nuclei. In most
animal cells, cytokinesis begins in anaphase and ends shortly after the completion
of mitosis in telophase.

The first visible change of cytokinesis in an animal cell is the sudden appearance
of a pucker, or cleavage furrow, on the cell surface. The furrow rapidly deepens
and spreads around the cell until it completely divides the cell in two. The
structure underlying this process is the contractile ring, a dynamic assembly
composed of actin filaments, myosin II filaments, and many structural and regulatory
proteins. During anaphase, the ring assembles just beneath the plasma membrane
(Figure 17-41; see also Panel 17-1). The ring gradually contracts, and, at
the same time, fusion of intracellular vesicles with the plasma membrane inserts
new membrane adjacent to the ring. This addition of membrane compensates for
the increase in surface area that accompanies cytoplasmic division. When ring
contraction is completed, membrane insertion and fusion seal the gap between
the daughter cells.

Actin and Myosin II in the Contractile Ring Generate the Force for 
Cytokinesis 

In interphase cells, actin and myosin II filaments form a cortical network underlying
the plasma membrane. In some cells, they also form large cytoplasmic bundles
called stress fibers (discussed in Chapter 16). As cells enter mitosis, these arrays
of actin and myosin disassemble; much of the actin reorganizes, and myosin II
filaments are released. As the sister chromatids separate in anaphase, actin and
myosin II begin to accumulate in the rapidly assembling contractile ring (Figure
17-42), which also contains numerous other proteins that provide structural support
or assist in ring assembly. Assembly of the contractile ring results in part from
the local formation of new actin filaments, which depends on formin proteins that






Figure 17-41 Cytokinesis. (A) The actin-myosin bundles of the contractile ring are oriented as shown, so that their contraction pulls the membrane
inward. (B) In this low-magnification scanning electron micrograph of a cleaving frog egg, the cleavage furrow is especially prominent, as the cell
is unusually large. The furrowing of the cell membrane is caused by the activity of the contractile ring underneath it. (C) The surface of a furrow at 
higher magnification. (B and C, from H.W. Beams and R.G. Kessel, Am. Sci. 64:279-290, 1976. With permission from Sigma Xi.) 
nucleate the assembly of parallel arrays of linear, unbranched actin filaments (discussed
in Chapter 16). After anaphase, the overlapping arrays of actin and myosin
II filaments contract to generate the force that divides the cytoplasm in two. Once
contraction begins, the ring exerts a force large enough to bend a fine glass needle
that is inserted in its path. As the ring constricts, it maintains the same thickness,
suggesting that its total volume and the number of filaments it contains decrease
steadily. Moreover, unlike actin in muscle, the actin filaments in the ring are
highly dynamic, and their arrangement changes continually during cytokinesis.

The contractile ring is finally dispensed with altogether when cleavage ends,
as the plasma membrane of the cleavage furrow narrows to form the midbody.
The midbody persists as a tether between the two daughter cells and contains
the remains of the central spindle, a large protein structure derived from the antiparallel
interpolar microtubules of the spindle midzone, packed tightly together
within a dense matrix material (Figure 17-43). After the daughter cells separate
completely, some of the components of the residual midbody often remain on the
inside of the plasma membrane of each cell, where they may serve as a mark on
the cortex that helps to orient the spindle in the subsequent cell division.

Local Activation of RhoA Triggers Assembly and Contraction 
of the Contractile Ring 

RhoA, a small GTPase of the Ras superfamily (see Table 15-5), controls the assembly
and function of the contractile ring at the site of cleavage. RhoA is activated at
the cell cortex at the future division site, where it promotes actin filament formation,
myosin II assembly, and ring contraction. It stimulates actin filament formation
by activating formins, and it promotes myosin II assembly and contractions
by activating multiple protein kinases, including the Rho-activated kinase Rock
(Figure 17-44). These kinases phosphorylate the regulatory myosin light chain,
a subunit of myosin II, thereby stimulating bipolar myosin II filament formation
and motor activity.

RhoA is thought to be activated by a guanine nucleotide exchange factor (RhoGEF),
which is found at the cell cortex at the future division site and stimulates
the release of GDP and binding of GTP to RhoA (see Figure 17-44). We know little
about how the RhoGEF is localized or activated at the division site, although the
microtubules of the anaphase spindle seem to be involved, as we discuss next.

The Microtubules of the Mitotic Spindle Determine the Plane of
Animal Cell Division

The central problem in cytokinesis is how to ensure that division occurs at the
right time and in the right place. Cytokinesis must occur only after the two sets of
chromosomes are fully segregated from each other, and the site of division must
be placed between the two sets of daughter chromosomes, thereby ensuring that
each daughter cell receives a complete set. The correct timing and positioning
of cytokinesis in animal cells are achieved by mechanisms that depend on the
mitotic spindle. During anaphase, the spindle generates signals that initiate furrow
formation at a position midway between the spindle poles, thereby ensuring
that division occurs between the two sets of separated chromosomes. Because
these signals originate in the anaphase spindle, this mechanism also contributes
to the correct timing of cytokinesis in late mitosis. Cytokinesis also occurs
at the correct time because dephosphorylation of Cdk substrates, which depends
on cyclin destruction in metaphase and anaphase, initiates cytokinesis. We now
describe these regulatory mechanisms in more detail, with an emphasis on cytokinesis
in animal cells.

Figure 17-42 The contractile ring. (A) A drawing of the cleavage furrow in a
dividing cell. (B) An electron micrograph of the ingrowing edge of a cleavage furrow
of a dividing animal cell. (C) Fluorescence micrographs of a dividing slime mold
amoeba stained for actin (red) and myosin II (green). Whereas all of the visible
myosin II has redistributed to the contractile ring, only some of the actin has done
so; the rest remains in the cortex of the nascent daughter cells. (B, from H.W. Beams
and R.G. Kessel, Am. Sci. 64:279-290, 1976. With permission from Sigma Xi;
C, courtesy of Yoshio Fukui.)

Studies of the fertilized eggs of marine invertebrates first revealed the importance
of spindle microtubules in determining the placement of the contractile
ring. After fertilization, these embryos cleave rapidly without intervening periods
of growth. In this way, the original egg is progressively divided into smaller and
smaller cells. Because the cytoplasm is clear, the spindle can be observed in real
time with a microscope. If the spindle is tugged into a new position with a fine
glass needle in early anaphase, the incipient cleavage furrow disappears, and a
new one develops in accord with the new spindle site, supporting the idea that
signals generated by the spindle induce local furrow formation.

How does the mitotic spindle specify the site of division? Three general mechanisms
have been proposed, and most cells appear to employ a combination of
these (Figure 17-45). The first is termed the astral stimulation model, in which the
astral microtubules carry furrow-inducing signals, which are somehow focused
in a ring on the cell cortex, halfway between the spindle poles. Evidence for this
model comes from ingenious experiments in large embryonic cells, which demonstrate
that a cleavage furrow forms midway between two asters, even when
the two centrosomes nucleating the asters are not connected to each other by a
mitotic spindle (Figure 17-46).

Figure 17-43 The midbody. (A) A scanning electron micrograph of a cultured
animal cell dividing; the midbody still joins the two daughter cells. (B) A conventional
electron micrograph of the midbody of a dividing animal cell. Cleavage is almost
complete, but the daughter cells remain attached by this thin strand of cytoplasm
containing the remains of the central spindle. (A, courtesy of Guenter
Albrecht-Buehler; B, courtesy of J.M. Mullins.)

Figure 17-44 Regulation of the contractile ring by the GTPase RhoA.
Like other Rho family GTPases, RhoA is activated by a RhoGEF protein and
inactivated by a Rho GTPase-activating protein (RhoGAP). The active GTP-bound
form of RhoA is focused at the future cleavage site. By binding formins,
activated RhoA promotes the assembly of actin filaments in the contractile ring. By
activating Rho-activated protein kinases, such as Rock, it stimulates myosin II
filament formation and activity, thereby promoting contraction of the ring.

Figure 17-45 Three current models of how the microtubules of the anaphase
spindle generate signals that influence the positioning of the contractile
ring. No single model explains all the observations, and furrow positioning is
probably determined by a combination of these mechanisms, with the importance
of the different mechanisms varying in different organisms. See text for details.

A second possibility, called the central spindle stimulation model, is that the
spindle midzone, or central spindle, generates a furrow-inducing signal that specifies
the site of furrow formation at the cell cortex (see Figure 17-45). The overlapping
interpolar microtubules of the central spindle associate with numerous
signaling proteins, including proteins that may stimulate RhoA (Figure 17-47).
Defects in the functions of these proteins (in Drosophila mutants, for example)
result in failure of cytokinesis.

A third model proposes that, in some cell types, the astral microtubules promote
the local relaxation of actin-myosin bundles at the cell cortex. According to
this astral relaxation model, the cortical relaxation is minimal at the spindle equator,
thus promoting cortical contraction at that site (see Figure 17-45). In the early
embryos of Caenorhabditis elegans, for example, treatments that result in the loss
of astral microtubules lead to increased contractile activity throughout the cell
cortex, consistent with this model.

In some cell types, the site of ring assembly is chosen before mitosis. In budding
yeasts, for example, a ring of proteins called septins assembles in late G1 at
the future division site. The septins are thought to form a scaffold onto which
other components of the contractile ring, including myosin II, assemble. In plant
cells, an organized band of microtubules and actin filaments, called the preprophase
band, assembles just before mitosis and marks the site where the cell wall
will assemble and divide the cell in two, as we now discuss.

Figure 17-46 An experiment demonstrating the influence of the position of
microtubule asters on the subsequent plane of cleavage in a large egg cell.
If the mitotic spindle is mechanically pushed to one side of the cell with a glass
bead, the membrane furrowing is incomplete, failing to occur on the opposite side
of the cell. Subsequent cleavages occur not only at the midzone of each of the two
subsequent mitotic spindles (yellow arrowheads), but also between the two adjacent
asters that are not linked by a mitotic spindle but, in this abnormal cell, share
the same cytoplasm (red arrowhead). Apparently, the contractile ring that produces
the cleavage furrow in these cells always forms in the region midway between two
asters, suggesting that the asters somehow alter the adjacent region of cell cortex
to induce furrow formation between them.

The Phragmoplast Guides Cytokinesis in Higher Plants

In most animal cells, the inward movement of the cleavage furrow depends on an
increase in the surface area of the plasma membrane. New membrane is added
at the inner edge of the cleavage furrow and is generally provided by small membrane
vesicles that are transported on microtubules from the Golgi apparatus to
the furrow.

Membrane deposition is particularly important for cytokinesis in higher-plant
cells. These cells are enclosed by a semirigid cell wall. Rather than a contractile
ring dividing the cytoplasm from the outside in, the cytoplasm of the plant cell is
partitioned from the inside out by the construction of a new cell wall, called the
cell plate, between the two daughter nuclei (Figure 17-48). The assembly of the
cell plate begins in late anaphase and is guided by a structure called the phragmoplast,
which contains microtubules derived from the mitotic spindle. Motor
proteins transport small vesicles along these microtubules from the Golgi apparatus
to the cell center. These vesicles, filled with polysaccharide and glycoproteins
required for the synthesis of the new cell wall, fuse to form a disc-like,
membrane-enclosed structure called the early cell plate. The plate expands outward
by further vesicle fusion until it reaches the plasma membrane and the original cell
wall and divides the cell in two. Later, cellulose microfibrils are laid down within
the matrix of the cell plate to complete the construction of the new cell wall
(Figure 17-49).

Figure 17-47 Localization of cytokinesis regulators at the central spindle of the
human cell. (A) At center is a cultured human cell at the beginning of cytokinesis,
showing the locations of the GTPase RhoA (red) and a protein called Cyk4 (green),
which is one of several regulatory proteins that form complexes at the overlapping
plus ends of interpolar microtubules. These proteins are thought to generate
signals that help control RhoA activity at the cell cortex. (B) When the same
three-dimensional image is viewed in the plane of the contractile ring, as shown
here, RhoA (red) is seen as a ring beneath the cell surface, while the central spindle
protein Cyk4 (green) is associated with microtubule bundles scattered throughout
the equatorial plane of the cell. (Courtesy of Alisa Piekny and Michael Glotzer.)

Figure 17-48 Cytokinesis in a plant cell in telophase. In this light micrograph, the early cell plate
(between the two arrowheads) has formed in a plane perpendicular to the plane of the page. The
microtubules of the spindle are stained with gold-labeled antibodies against tubulin, and the DNA
in the two sets of daughter chromosomes is stained with a fluorescent dye. Note that there are no
astral microtubules, because there are no centrosomes in higher-plant cells. (Courtesy of Andrew
Bajer.)

Membrane-Enclosed Organelles Must Be Distributed to Daughter 
Cells During Cytokinesis 

The process of mitosis ensures that each daughter cell receives a full complement
of chromosomes. When a eukaryotic cell divides, however, each daughter
cell must also inherit all of the other essential cell components, including the
membrane-enclosed organelles. As discussed in Chapter 12, organelles such as
mitochondria and chloroplasts cannot be assembled de novo from their individual
components; they can arise only by the growth and division of the preexisting
organelles. Similarly, cells cannot make a new endoplasmic reticulum (ER) unless
some part of it is already present.

How, then, do the various membrane-enclosed organelles segregate when a
cell divides? Organelles such as mitochondria and chloroplasts are usually present
in large enough numbers to be safely inherited if, on average, their numbers
roughly double once each cycle. The ER in interphase cells is continuous with
the nuclear membrane and is organized by the microtubule cytoskeleton. Upon
entry into M phase, the reorganization of the microtubules and breakdown of
the nuclear envelope release the ER. In most cells, the ER remains largely intact
and is cut in two during cytokinesis. The Golgi apparatus is reorganized and fragmented
during mitosis. Golgi fragments associate with the spindle poles and are
thereby distributed to opposite ends of the spindle, ensuring that each daughter
cell inherits the materials needed to reconstruct the Golgi in telophase.
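
The statement that abundant organelles are "safely inherited" can be made concrete with a small calculation. The sketch below assumes, purely for illustration, that each of n organelle copies ends up in either daughter independently with probability 1/2; real cells may partition organelles more evenly than this. Under that assumption, the probability that at least one daughter receives no copy is 2 × (1/2)^n for n ≥ 1.

```python
def prob_some_daughter_empty(n_copies: int) -> float:
    """Chance that one of the two daughters inherits zero copies.

    Assumption (illustrative only): each copy goes to either daughter
    independently with probability 1/2.
    """
    if n_copies < 0:
        raise ValueError("n_copies must be non-negative")
    if n_copies == 0:
        return 1.0  # nothing to inherit, so both daughters are empty
    # For n >= 1 the two "empty daughter" outcomes cannot both happen,
    # so their probabilities simply add.
    return 2 * 0.5 ** n_copies


for n in (1, 2, 10, 100):
    print(n, prob_some_daughter_empty(n))
```

With a single copy, one daughter always misses out; with ten copies the risk is already about 0.2%, and with a hundred it is negligible.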

Some Cells Reposition Their Spindle to Divide Asymmetrically 

Most animal cells divide symmetrically: the contractile ring forms around the
equator of the parent cell, producing two daughter cells of equal size and with
the same components. This symmetry results from the placement of the mitotic
spindle, which in most cases tends to center itself in the cytoplasm. Astral microtubules
and motor proteins that either push or pull on these microtubules contribute
to the centering process.

There are many instances in development, however, when cells divide asymmetrically
to produce two cells that differ in size, in the cytoplasmic contents they
inherit, or in both. Usually, the two different daughter cells are destined to develop
along different pathways. To create daughter cells with different fates in this way,
the mother cell must first segregate certain components (called cell fate determinants)
to one side of the cell and then position the plane of division so that the
appropriate daughter cell inherits these components (Figure 17-50). To position
the plane of division asymmetrically, the spindle has to be moved in a controlled
manner within the dividing cell. It seems likely that changes in local regions of
the cell cortex direct such spindle movements and that motor proteins localized
there pull one of the spindle poles, via its astral microtubules, to the appropriate
region. Genetic analyses in C. elegans and Drosophila have identified some of the
proteins required for such asymmetric divisions, and some of these proteins seem
to have a similar role in vertebrates.

Figure 17-49 The special features of cytokinesis in a higher-plant cell. The
division plane is established before M phase by a band of microtubules and
actin filaments (the preprophase band) at the cell cortex. At the beginning of
telophase, after the chromosomes have segregated, a new cell wall starts to
assemble inside the cell at the equator of the old spindle. The interpolar microtubules
of the mitotic spindle remaining at telophase form the phragmoplast. The
plus ends of these microtubules no longer overlap but end at the cell equator.
Golgi-derived vesicles, filled with cell-wall material, are transported along these
microtubules and fuse to form the new cell wall, which grows outward to reach the
plasma membrane and original cell wall. The plasma membrane and the membrane
surrounding the new cell wall fuse, separating the two daughter cells.


Mitosis Can Occur Without Cytokinesis 

Although nuclear division is usually followed by cytoplasmic division, there are
exceptions. Some cells undergo multiple rounds of nuclear division without intervening
cytoplasmic division. In the early Drosophila embryo, for example, the
first 13 rounds of nuclear division occur without cytoplasmic division, resulting in
the formation of a single large cell containing several thousand nuclei, arranged
in a monolayer near the surface. A cell in which multiple nuclei share the same
cytoplasm is called a syncytium. This arrangement greatly speeds up early development,
as the cells do not have to take the time to go through all the steps of
cytokinesis for each division. After these rapid nuclear divisions, membranes are
created around each nucleus in one round of coordinated cytokinesis called cellularization.
The plasma membrane extends inward and, with the help of an actin-myosin
ring, pinches off to enclose each nucleus (Figure 17-51).
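
The figure of "several thousand nuclei" follows from repeated doubling. As an upper bound, and assuming (as an idealization) that every nucleus divides in each of the 13 synchronous rounds, a single starting nucleus yields 2^13 nuclei:

```python
def nuclei_after(rounds: int, start: int = 1) -> int:
    """Upper bound on nuclei after synchronous divisions without cytokinesis.

    Idealization: every nucleus divides in every round.
    """
    if rounds < 0 or start < 0:
        raise ValueError("rounds and start must be non-negative")
    return start * 2 ** rounds


print(nuclei_after(13))  # 8192
```

The true count in an embryo can be lower if some nuclei drop out of the synchronous cycles, which this sketch does not model.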

Nuclear division without cytokinesis also occurs in some types of mammalian
cells. Megakaryocytes, which produce blood platelets, and some hepatocytes and
heart muscle cells, for example, become multinucleated in this way.

After cytokinesis, most cells enter G1, in which Cdks are mostly inactive. We
end this section by discussing how this state is achieved at the end of M phase.


The G1 Phase Is a Stable State of Cdk Inactivity

A key regulatory event in late M phase is the inactivation of Cdks, which is driven
primarily by APC/C-dependent cyclin destruction. As described earlier, the inactivation
of Cdks in late M phase has many functions: it triggers the events of late
mitosis, promotes cytokinesis, and enables the synthesis of prereplicative complexes
at DNA replication origins. It also provides a mechanism for resetting the
cell-cycle control system to a state of Cdk inactivity as the cell prepares to enter a
new cell cycle. In most cells, this state of Cdk inactivity generates a G1 gap phase,
during which the cell grows and monitors its environment before committing to
a new cell cycle.

Figure 17-50 An asymmetric cell division segregating cytoplasmic
components to only one daughter cell. These light micrographs illustrate the
controlled asymmetric segregation of specific cytoplasmic components to
one daughter cell during the first division of a fertilized egg of the nematode
C. elegans. The fertilized egg is shown in the left micrographs and the two daughter
cells in the right micrographs. The cells above have been stained with a blue,
DNA-binding, fluorescent dye to show the nucleus (and polar bodies); they are viewed
by both differential-interference-contrast and fluorescence microscopy. The cells
below are the same cells stained with an antibody against P-granules and viewed
by fluorescence microscopy. These small granules are made of RNA and proteins
and determine which cells become germ cells. They are distributed randomly
throughout the cytoplasm of the unfertilized egg (not shown) but become segregated to
the posterior pole of the fertilized egg. The cleavage plane is oriented to ensure that
only the posterior daughter cell receives the P-granules when the egg divides.
The same segregation process is repeated in several subsequent cell
divisions, so that the P-granules end up only in cells that give rise to eggs and
sperm. (Courtesy of Susan Strome.)

Figure 17-51 Mitosis without cytokinesis in the early Drosophila
embryo. (A) The first 13 nuclear divisions occur synchronously and without
cytoplasmic division to create a large syncytium. Most of the nuclei migrate
to the cortex, and the plasma membrane extends inward and pinches
off to surround each nucleus to form individual cells in a process called
cellularization. (B) Fluorescence micrograph of multiple mitotic spindles in
a Drosophila embryo before cellularization. The microtubules are stained
green and the centrosomes red. Note that all the nuclei go through the
cycle synchronously; here, they are all in metaphase, with the unlabeled
chromosomes seen as a dark band at the spindle equator. (B, courtesy of
Kristina Yu and William Sullivan.)

In early animal embryos, the inactivation of M-Cdk in late mitosis is due
almost entirely to the action of Cdc20-APC/C, discussed earlier. Recall, however,
that M-Cdk stimulates Cdc20-APC/C activity. Thus, the destruction of M-cyclin
in late mitosis soon leads to the inactivation of all APC/C activity in an embryonic
cell. This APC/C inactivation immediately after mitosis is especially useful
in rapid embryonic cell cycles, as it allows the cell to quickly begin accumulating
new M-cyclin for the next cycle (Figure 17-52A).

Rapid cyclin accumulation immediately after mitosis is not useful, however, for
cells in which a G1 phase is needed to allow control of entry into the next cell cycle.
These cells employ several mechanisms to prevent Cdk reactivation after mitosis.
One mechanism uses another APC/C-activating protein called Cdh1, mentioned
earlier as a close relative of Cdc20 (see Table 17-2). Although both Cdh1
and Cdc20 bind to and activate the APC/C, they differ in one important respect.
Whereas M-Cdk activates the Cdc20-APC/C complex, it inhibits the Cdh1-APC/C
complex by directly phosphorylating Cdh1. As a result of this relationship,
Cdh1-APC/C activity increases in late mitosis after the Cdc20-APC/C complex has
initiated the destruction of M-cyclin. M-cyclin destruction therefore continues after
mitosis: although Cdc20-APC/C activity has declined, Cdh1-APC/C activity is
high (Figure 17-52B).

A second mechanism that suppresses Cdk activity in G1 depends on the
increased production of CKIs, the Cdk inhibitor proteins discussed earlier. Budding
yeast cells, in which this mechanism is best understood, contain a CKI protein
called Sic1, which binds to and inactivates M-Cdk in late mitosis and G1 (see
Table 17-2). Like Cdh1, Sic1 is inhibited by M-Cdk, which phosphorylates Sic1
during mitosis and thereby promotes its ubiquitylation by SCF. Thus, Sic1 and
M-Cdk, like Cdh1 and M-Cdk, inhibit each other. As a result, the decline in M-Cdk
activity that occurs in late mitosis causes the Sic1 protein to accumulate, and this
CKI helps keep M-Cdk activity low after mitosis. A CKI protein called p27 (see
Figure 17-14) may serve similar functions in animal cells.

Figure 17-52 The creation of a G1 phase by stable Cdk inhibition after
mitosis. (A) In early embryonic cell cycles, Cdc20-APC/C activity rises at
the end of metaphase, triggering M-cyclin destruction. Because M-Cdk
activity stimulates Cdc20-APC/C activity, the loss of M-cyclin leads to APC/C
inactivation after mitosis, which allows M-cyclins to begin accumulating again.
(B) In cells that have a G1 phase, the drop in M-Cdk activity in late mitosis
leads to the activation of Cdh1-APC/C (as well as to the accumulation of Cdk
inhibitor proteins; not shown). This ensures a continued suppression of Cdk
activity after mitosis, as required for a G1 phase.

In most cells, decreased transcription of M-cyclin genes also inactivates
M-Cdks in late mitosis. In budding yeast, for example, M-Cdk promotes the
expression of these genes, resulting in a positive feedback loop. This loop is turned
off as cells exit from mitosis: the inactivation of M-Cdk by Cdh1 and Sic1 leads
to decreased M-cyclin gene transcription and thus decreased M-cyclin synthesis.
Gene regulatory proteins that promote the expression of G1/S- and S-cyclins are
also inhibited during G1.

Thus, Cdh1-APC/C activation, CKI accumulation, and decreased cyclin gene
expression act together to ensure that the early G1 phase is a time when essentially
all Cdk activity is suppressed. As in many other aspects of cell-cycle control,
the use of multiple regulatory mechanisms allows the system to operate with reasonable
efficiency even if one mechanism fails. So how does the cell escape from
this stable G1 state to initiate a new cell cycle? The answer is that G1/S-Cdk activity,
which rises in late G1, releases all the braking mechanisms that suppress Cdk
activity, as we describe later, in the last section of this chapter.
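
The idea that mutual inhibition between M-Cdk and its antagonists (Cdh1-APC/C, Sic1) yields a stable low-Cdk state can be illustrated with a generic toy model. The sketch below is not a model from the text: the equations, the Hill exponent `n`, and the strength `a` are hypothetical choices for a symmetric two-component mutual-inhibition circuit, integrated with a simple forward-Euler step. It shows only that such a circuit can settle into either of two self-maintaining states, depending on where it starts.

```python
def simulate(cdk0, inhibitor0, a=4.0, n=4, dt=0.01, steps=5000):
    """Integrate a toy mutual-inhibition circuit with forward Euler.

    Hypothetical equations (illustrative, not measured kinetics):
      d(cdk)/dt       = a / (1 + inhibitor**n) - cdk
      d(inhibitor)/dt = a / (1 + cdk**n)       - inhibitor
    "inhibitor" stands in for Cdh1-APC/C or Sic1 activity.
    """
    cdk, inhibitor = cdk0, inhibitor0
    for _ in range(steps):
        d_cdk = a / (1.0 + inhibitor ** n) - cdk
        d_inhibitor = a / (1.0 + cdk ** n) - inhibitor
        cdk += dt * d_cdk
        inhibitor += dt * d_inhibitor
    return cdk, inhibitor


# Start with M-Cdk dominant (mitosis-like) or inhibitor dominant (G1-like).
mitotic = simulate(cdk0=3.0, inhibitor0=0.1)
g1_like = simulate(cdk0=0.1, inhibitor0=3.0)
print("M-Cdk dominant start ->", mitotic)
print("inhibitor dominant start ->", g1_like)
```

In this toy picture, the G1-like state persists until something else tips the balance, which is the role the text assigns to rising G1/S-Cdk activity.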

Summary 

After mitosis completes the formation of a pair of daughter nuclei, cytokinesis finishes
the cell cycle by dividing the cell itself. Cytokinesis depends on a ring of actin
and myosin filaments that contracts in late mitosis at a site midway between the
segregated chromosomes. In animal cells, the positioning of the contractile ring is
determined by signals emanating from the microtubules of the anaphase spindle.
Dephosphorylation of Cdk targets, which results from Cdk inactivation in anaphase,
triggers cytokinesis at the correct time after anaphase. After cytokinesis, the
cell enters a stable G1 state of low Cdk activity, where it awaits signals to enter a new
cell cycle.

MEIOSIS 

Most eukaryotic organisms reproduce sexually: the genomes of two parents mix
to generate offspring that are genetically distinct from either parent. The cells of
these organisms are generally diploid: that is, they contain two slightly different
copies, or homologs, of each chromosome, one from each parent. Sexual reproduction
depends on a specialized nuclear division process called meiosis, which
produces haploid cells carrying only a single copy of each chromosome. In many
organisms, the haploid cells differentiate into specialized reproductive cells called
gametes (eggs and sperm in most species). In these species, the reproductive cycle
ends when a sperm and egg fuse to form a diploid zygote, which has the potential
to form a new individual. In this section, we consider the basic mechanisms
and regulation of meiosis, with an emphasis on how they compare with those of
mitosis.

Meiosis Includes Two Rounds of Chromosome Segregation

Meiosis reduces the chromosome number by half using many of the same molecular
machines and control systems that operate in mitosis. As in the mitotic cell
cycle, the cell begins the meiotic program by duplicating its chromosomes in meiotic
S phase, resulting in pairs of sister chromatids that are tightly linked along
their entire lengths by cohesin complexes. Unlike mitosis, however, two successive
rounds of chromosome segregation then occur (Figure 17-53). The first of
these divisions (meiosis I) solves the problem, unique to meiosis, of segregating
the homologs. The duplicated paternal and maternal homologs pair up alongside
each other and become physically linked by the process of genetic recombination.
These pairs of homologs, each containing a pair of sister chromatids, then line
up on the first meiotic spindle. In the first meiotic anaphase, duplicated homologs
rather than sister chromatids are pulled apart and segregated into the two
daughter nuclei. Only in the second division (meiosis II), which occurs without
further DNA replication, are the sister chromatids pulled apart and segregated (as
in mitosis) to produce haploid daughter nuclei. In this way, each diploid nucleus
that enters meiosis produces four haploid nuclei, each of which contains either
the maternal or paternal copy of each chromosome, but not both (Movie 17.7).
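
The bookkeeping of the two divisions can be sketched in a few lines. This is a deliberately simplified illustration: it ignores crossing-over, represents each chromatid as a `(chromosome, parent)` tuple, and assumes that each homolog pair orients on the first meiotic spindle at random and independently of the others. The function name and data layout are hypothetical.

```python
import random


def meiosis(n_pairs, rng):
    """Return four haploid nuclei from one diploid nucleus (no crossing-over)."""
    first_division = ([], [])
    for chrom in range(n_pairs):
        # After meiotic S phase, each homolog is a pair of sister chromatids.
        maternal = [(chrom, "maternal"), (chrom, "maternal")]
        paternal = [(chrom, "paternal"), (chrom, "paternal")]
        homologs = [maternal, paternal]
        rng.shuffle(homologs)  # assumed random orientation of each pair
        # Meiosis I: homologs, not sisters, go to opposite nuclei.
        first_division[0].append(homologs[0])
        first_division[1].append(homologs[1])
    haploid_nuclei = []
    for nucleus in first_division:
        # Meiosis II: sister chromatids separate, with no further replication.
        haploid_nuclei.append([sisters[0] for sisters in nucleus])
        haploid_nuclei.append([sisters[1] for sisters in nucleus])
    return haploid_nuclei


nuclei = meiosis(n_pairs=3, rng=random.Random(0))
for nucleus in nuclei:
    print(nucleus)
```

Each of the four nuclei ends up with exactly one copy of every chromosome, either maternal or paternal, which is the outcome the paragraph above describes.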

Figure 17-53 Comparison of
meiosis and mitosis. For clarity, 
only one pair of homologous 
chromosomes (homologs) is shown. 
(A) Meiosis is a form of nuclear 
division in which a single round of 
chromosome duplication (meiotic 
S phase) is followed by two rounds 
of chromosome segregation. The
duplicated homologs, each consisting 
of tightly bound sister chromatids, 
pair up and are segregated into 
different daughter nuclei in meiosis I; 
the sister chromatids are segregated 
in meiosis II. As indicated by the 
formation of chromosomes that are 
partly red and partly gray, homolog 
pairing in meiosis leads to genetic 
recombination (crossing-over) during 
meiosis I. Each diploid cell that enters 
meiosis therefore produces four 
genetically different haploid nuclei, 
which are distributed by cytokinesis 
into haploid cells that differentiate into 
gametes. (B) In mitosis, by contrast, 
homologs do not pair up, and the 
sister chromatids are segregated 
during the single division. Thus, each 
diploid cell that divides by mitosis 
produces two genetically identical 
diploid daughter nuclei, which are 
distributed by cytokinesis into a pair 
of daughter cells.

Duplicated Homologs Pair During Meiotic Prophase

During mitosis in most organisms, homologous chromosomes behave independently
of each other. During meiosis I, however, it is crucial that homologs recognize
each other and associate physically in order for the maternal and paternal
homologs to be bi-oriented on the first meiotic spindle. Special mechanisms
mediate these interactions.

The gradual juxtaposition of homologs occurs during a prolonged period
called meiotic prophase (or prophase I), which can take hours in yeasts, days in
mice, and weeks in higher plants. Like their mitotic counterparts, duplicated meiotic
prophase chromosomes first appear as long threadlike structures, in which
the sister chromatids are so tightly glued together that they appear as one. It is
during early prophase I that the homologs begin to associate along their length in
a process called pairing, which, in some organisms at least, begins with interactions
between complementary DNA sequences (called pairing sites) in the two
homologs. As prophase progresses, the homologs become more closely juxtaposed,
forming a four-chromatid structure called a bivalent (Figure 17-54A). In
most species, homolog pairs are then locked together by homologous recombination:
DNA double-strand breaks are formed at several locations in each sister
chromatid, resulting in large numbers of DNA recombination events between the
homologs (as described in Chapter 5). Some of these events lead to reciprocal
DNA exchanges called crossovers, where the DNA of a chromatid crosses over to
become continuous with the DNA of a homologous chromatid (Figure 17-54B;
also see Figure 5-54).


Homolog Pairing Culminates in the Formation of a Synaptonemal
Complex

The paired homologs are brought into close juxtaposition, with their structural
axes (axial cores) about 400 nm apart, by a mechanism that depends in most
species on the double-strand DNA breaks that occur in sister chromatids. What
pulls the axes together? One possibility is that the large protein machine, called
a recombination complex, which assembles on a double-strand break in a chromatid,
binds the matching DNA sequence in the nearby homolog and helps reel
in this partner. This so-called presynaptic alignment of the homologs is followed
by synapsis, in which the axial core of a homolog becomes tightly linked to the
axial core of its partner by a closely packed array of transverse filaments to create
a synaptonemal complex, which bridges the gap, now only 100 nm, between
the homologs (Figure 17-55). Although crossing-over begins before the synaptonemal
complex assembles, the final steps occur while the DNA is held in the
complex.

The morphological changes that occur during homolog pairing are the basis 
for dividing meiotic prophase into five sequential stages—leptotene, zygotene, 
pachytene, diplotene, and diakinesis (Figure 17-56). Prophase starts with lep¬ 
totene, when homologs condense and pair and genetic recombination begins. 
At zygotene, the synaptonemal complex begins to assemble at sites where the 
homologs are closely associated and recombination events are occurring. At 
pachytene, the assembly process is complété, and the homologs are synapsed 


Figure 17-54 Homolog pairing and crossing-over. (A) The structure 
formed by two closely aligned duplicated homologs is called a bivalent. 

As in mitosis, the sister chromatids in each homolog are tightly connected 
along their entire lengths, as well as at their centromeres. At this stage, the 
homologs are usually joined by a protein complex called the synaptonemal 
complex (not shown; see Figure 17-55). (B) A later-stage bivalent in which 
a single crossover has occurred between nonsister chromatids. It is only 
when the synaptonemal complex disassembles and the paired homologs 
separate a little at the end of prophase I, as shown, that the crossover is 
seen microscopically as a thin connection between the homologs called a 
chiasma. 
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Figure 17-55 Simplifiée! schematic 
drawing of a synaptonemal complex. 

Each homolog is organized around a 
protein axial core, and the synaptonemal 
complex forms when these homolog 
axes are linked by rod-shaped transverse 
filaments. The axial core of each homolog 
also interacts with the cohesin complexes 
that hold the sister chromatids together 
(see Figure 9-35). (Modified from 
K. Nasmyth, Annu. Rev. Genet. 35:673- 
745, 2001.) 


along their entire lengths (see Figure 9-35). The pachytene stage can persist for 
days or longer, until desynapsis begins at diplotene with the disassembly of the 
synaptonemal complexes and the concomitant condensation and shortening of 
the chromosomes. It is only at this stage, after the complexes have disassembled,
that the individual crossover events between nonsister chromatids can be seen
as inter-homolog connections called chiasmata (singular chiasma), which now
play a crucial part in holding the compact homologs together (Figure 17-57). The
homologs are now ready to begin the process of segregation.

Figure 17-56 Homolog synapsis and desynapsis during the different stages of prophase I. (A) A single bivalent is shown schematically. At
leptotene, the two sister chromatids coalesce, and their chromatid loops extend out from a common axial core. Assembly of the synaptonemal
complex begins in early zygotene and is complete in pachytene. The complex disassembles in diplotene. (B) An electron micrograph of a
synaptonemal complex from a meiotic cell at pachytene in a lily flower. (C and D) Immunofluorescence micrographs of prophase I cells of the fungus
Sordaria. Partially synapsed bivalents at zygotene are shown in (C) and fully synapsed bivalents are shown in (D). Red arrowheads in (C) point to
regions where synapsis is still incomplete. (B, courtesy of Brian Wells; C and D, from A. Storlazzi et al., Genes Dev. 17:2675-2687, 2003. With
permission from Cold Spring Harbor Laboratory Press.)

Figure 17-57 A bivalent with three chiasmata resulting from three
crossover events. (A) Light micrograph of a grasshopper bivalent.
(B) Drawing showing the arrangement of the crossovers in (A). Note that
chromatid 1 has undergone an exchange with chromatid 3, and chromatid
2 has undergone exchanges with chromatids 3 and 4. Note also how
the combination of the chiasmata and the tight attachment of the sister
chromatid arms to each other (mediated by cohesin complexes) holds the
two homologs together after the synaptonemal complex has disassembled;
if either the chiasmata or the sister-chromatid cohesion failed to form, the
homologs would come apart at this stage and not be segregated properly in
meiosis I. (A, courtesy of Bernard John.)

Homolog Segregation Depends on Several Unique Features of
Meiosis I

A fundamental difference between meiosis I and mitosis (and meiosis II) is that
in meiosis I homologs rather than sister chromatids separate and then segregate
(see Figure 17-53). This difference depends on three features of meiosis I that
distinguish it from mitosis (Figure 17-58).

First, both sister kinetochores in a homolog must attach stably to the same
spindle pole. This type of attachment is normally avoided during mitosis (see
Figure 17-33). In meiosis I, however, the two sister kinetochores are fused into a
single microtubule-binding unit that attaches to just one pole (see Figure 17-58A).
The fusion of sister kinetochores is achieved by a complex of proteins that is

Figure 17-58 Comparison of chromosome behavior in meiosis I, meiosis II, and mitosis. Chromosomes behave similarly in mitosis and meiosis
II, but they behave very differently in meiosis I. (A) In meiosis I, the two sister kinetochores are located side-by-side on each homolog and attach to
microtubules from the same spindle pole. The proteolytic cleavage of cohesin along the sister-chromatid arms unglues the arms and resolves the
crossovers, allowing the duplicated homologs to separate at anaphase I, while the residual cohesin at the centromeres keeps the sisters together.
Cleavage of centromeric cohesin allows the sister chromatids to separate at anaphase II. (B) In mitosis, by contrast, the two sister kinetochores
attach to microtubules from different spindle poles, and the two sister chromatids come apart at the start of anaphase and segregate into separate
daughter nuclei.

localized at the kinetochores in meiosis I, but we do not know in any detail how
these proteins work. They are removed from kinetochores after meiosis I, so that
in meiosis II the sister-chromatid pairs can be bi-oriented on the spindle as they 
are in mitosis. 

Second, crossovers generate a strong physical linkage between homologs, 
allowing their bi-orientation at the equator of the spindle—much like cohesion
between sister chromatids is important for their bi-orientation in mitosis (and
meiosis II). Crossovers hold homolog pairs together only because the arms of
the sister chromatids are connected by sister-chromatid cohesion (see Figure
17-58A). 

Third, cohesion is removed in anaphase I only from chromosome arms and
not from the regions near the centromeres, where the kinetochores are located.
The loss of arm cohesion triggers homolog separation at the onset of anaphase I.
This process depends on APC/C activation, which leads to securin destruction,
separase activation, and cohesin cleavage along the arms (see Figure 17-38). 

Cohesins near the centromeres are protected from separase in meiosis I by 
a kinetochore-associated protein called shugoshin (from the Japanese word 
for “guardian spirit”). Shugoshin acts by recruiting a protein phosphatase that 
removes phosphates from centromeric cohesins. Cohesin phosphorylation is 
normally required for separase to cleave cohesin; thus, removal of this
phosphorylation near the centromere prevents cohesin cleavage. Sister-chromatid pairs
therefore remain linked through meiosis I, allowing their correct bi-orientation
on the spindle in meiosis II. Shugoshin is inactivated after meiosis I. At the onset
of anaphase II, APC/C activation triggers centromeric cohesin cleavage and
sister-chromatid separation—much as it does in mitosis. Following anaphase II, nuclear
envelopes form around the chromosomes to produce four haploid nuclei, after
which cytokinesis and other differentiation processes lead to the production of
haploid gametes. 

Crossing-Over Is Highly Regulated 

Crossing-over has two distinct functions in meiosis: it helps hold homologs 
together so that they are properly segregated to the two daughter nuclei produced 
by meiosis I, and it contributes to the genetic diversification of the gametes that 
are eventually produced. As might be expected, therefore, crossing-over is highly 
regulated: the number and location of double-strand breaks along each
chromosome is controlled, as is the likelihood that a break will be converted into a
crossover. On average, the result of this regulation is that each pair of human homologs
is linked by about two or three crossovers (Figure 17-59).

Although the double-strand breaks that occur in meiosis I can be located 
almost anywhere along the chromosome, they are not distributed uniformly: they 
cluster at “hot spots,” where the DNA is accessible, and occur only rarely in “cold
spots,” such as the heterochromatin regions around centromeres and telomeres.



Figure 17-59 Crossovers between
homologs in the human testis. In
these immunofluorescence micrographs,
antibodies have been used to stain the
synaptonemal complexes (red), the
centromeres (blue), and the sites of
crossing-over (green). Note that all of the
bivalents have at least one crossover and
none have more than four. (Modified from
A. Lynn et al., Science 296:2222-2225,
2002. With permission from AAAS.)

At least two kinds of regulation influence the location and number of
crossovers that form, neither of which is well understood. Both operate before the
synaptonemal complex assembles. One ensures that at least one crossover forms
between the members of each homolog pair, as is necessary for normal homolog
segregation in meiosis I. In the other, called crossover interference, the presence
of one crossover event inhibits another from forming close by, perhaps by locally
depleting proteins required for converting a double-strand DNA break into a
stable crossover.

Meiosis Frequently Goes Wrong 

The sorting of chromosomes that takes place during meiosis is a remarkable feat
of intracellular bookkeeping. In humans, each meiosis requires that the starting
cell keep track of 92 chromatids (46 chromosomes, each of which has duplicated),
distributing one complete set of each type of autosome to each of the four
haploid progeny. Not surprisingly, mistakes can occur in allocating the chromosomes
during this elaborate process. Mistakes are especially common in human female
meiosis, which arrests for years after diplotene: meiosis I is completed only at
ovulation, and meiosis II only after the egg is fertilized. Indeed, such chromosome
segregation errors during egg development are the most common cause of both
spontaneous abortion (miscarriage) and mental retardation in humans.

When homologs fail to separate properly—a phenomenon called
nondisjunction—the result is that some of the resulting haploid gametes lack a particular
chromosome, while others have more than one copy of it. Upon fertilization,
these gametes form abnormal embryos, most of which die. Some survive,
however. Down syndrome in humans, for example, which is the leading cause of
mental retardation, is caused by an extra copy of chromosome 21, usually resulting
from nondisjunction during meiosis I in the female ovary. Segregation errors
during meiosis I increase greatly with advancing maternal age.
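
The outcome of nondisjunction can be made concrete with a minimal sketch. The function names below are hypothetical, and the model is deliberately crude: it follows the copy number of a single chromosome (chromosome 21, say) through both meiotic divisions, assumes that any error occurs in meiosis I, and assumes that meiosis II then proceeds normally.

```python
def gamete_copy_numbers(nondisjunction_in_meiosis_I):
    """Copy number of one chromosome in each of the four haploid products.

    Illustrative model only: one homolog pair, each homolog consisting of
    two sister chromatids after DNA replication.
    """
    homologs = ["maternal", "paternal"]

    if nondisjunction_in_meiosis_I:
        # Both homologs travel to the same pole; the other nucleus gets none.
        nuclei_after_meiosis_I = [homologs, []]
    else:
        # Normal meiosis I: one homolog to each nucleus.
        nuclei_after_meiosis_I = [[homologs[0]], [homologs[1]]]

    gametes = []
    for nucleus in nuclei_after_meiosis_I:
        # Meiosis II: sister chromatids separate, so each of the two
        # daughters receives one chromatid per homolog present.
        gametes.extend([len(nucleus), len(nucleus)])
    return gametes


def zygote_copy_number(gamete_copies, partner_copies=1):
    """Copies in the zygote after fusion with a (by assumption normal) gamete."""
    return gamete_copies + partner_copies


normal = gamete_copy_numbers(False)      # every gamete carries one copy
abnormal = gamete_copy_numbers(True)     # two gametes carry two copies, two carry none
trisomy = zygote_copy_number(max(abnormal))
```

Under these assumptions a gamete carrying two copies yields a zygote with three copies after fertilization by a normal gamete, which is the situation described above for chromosome 21.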

Summary 

Haploid gametes are produced by meiosis, in which a diploid nucleus undergoes
two successive cell divisions after one round of DNA replication. Meiosis is
dominated by a prolonged prophase. At the start of prophase, the chromosomes have
replicated and consist of two tightly joined sister chromatids. Homologous
chromosomes then pair up and become progressively more closely juxtaposed as prophase
proceeds. The tightly aligned homologs undergo genetic recombination, forming
crossovers that help hold each pair of homologs together during metaphase I.
Meiosis-specific, kinetochore-associated proteins help ensure that both sister chromatids
in a homolog attach to the same spindle pole; other kinetochore-associated proteins
ensure that the homologs remain connected at their centromeres during anaphase
I, so that homologs rather than sister chromatids are segregated in meiosis I. After
meiosis I, meiosis II follows rapidly, without DNA replication, in a process that
resembles mitosis, in that sister chromatids are pulled apart at anaphase.

CONTROL OF CELL DIVISION AND CELL GROWTH 

A fertilized mouse egg and a fertilized human egg are similar in size, yet they
produce animals of very different sizes. What factors in the control of cell behavior in
humans and mice are responsible for these size differences? The same
fundamental question can be asked for each organ and tissue in an animal's body. What
factors determine the length of an elephant's trunk or the size of its brain or its liver?
These questions are largely unanswered, but it is nevertheless possible to say what
the ingredients of an answer must be.

The size of an organ or organism depends on its total cell mass, which depends
on both the total number of cells and their size. Cell number, in turn, depends
on the amounts of cell division and cell death. Organ and body size are therefore
determined by three fundamental processes: cell growth, cell division, and cell
survival. Each is tightly regulated—both by intracellular programs and by
extracellular signal molecules that control these programs.

The extracellular signal molecules that regulate cell growth, division, and
survival are generally soluble secreted proteins, proteins bound to the surface of
cells, or components of the extracellular matrix. They can be divided
operationally into three major classes:

1. Mitogens, which stimulate cell division, primarily by triggering a wave of
G1/S-Cdk activity that relieves intracellular negative controls that
otherwise block progress through the cell cycle.

2. Growth factors, which stimulate cell growth (an increase in cell mass) by 
promoting the synthesis of proteins and other macromolecules and by 
inhibiting their degradation.

3. Survival factors, which promote cell survival by suppressing the form of 
programmed cell death known as apoptosis. 

Many extracellular signal molecules promote all of these processes, while
others promote one or two of them. Indeed, the term growth factor is often used
inappropriately to describe a factor that has any of these activities. Even worse, the
term cell growth is often used to mean an increase in cell number, or cell
proliferation.

In addition to these three classes of stimulating signals, there are extracellular
signal molecules that suppress cell proliferation, cell growth, or both; in general,
less is known about them. There are also extracellular signal molecules that
activate apoptosis.

In this section, we focus primarily on how mitogens and other factors, such as 
DNA damage, control the rate of cell division. We then turn to the important but 
poorly understood problem of how a proliferating cell coordinates its growth with 
cell division so as to maintain its appropriate size. We discuss the control of cell 
survival and cell death by apoptosis in Chapter 18. 

Mitogens Stimulate Cell Division 

Unicellular organisms tend to grow and divide as fast as they can, and their rate of
proliferation depends largely on the availability of nutrients in the environment.
The cells of a multicellular organism, however, divide only when the organism
needs more cells. Thus, for an animal cell to proliferate, it must receive
stimulatory extracellular signals, in the form of mitogens, from other cells, usually its
neighbors. Mitogens overcome intracellular braking mechanisms that block
progress through the cell cycle.

One of the first mitogens to be identified was platelet-derived growth factor
(PDGF), and it is typical of many others discovered since. The path to its
isolation began with the observation that fibroblasts in a culture dish proliferate when
provided with serum but not when provided with plasma. Plasma is prepared by
removing the cells from blood without allowing clotting to occur; serum is
prepared by allowing blood to clot and taking the cell-free liquid that remains. When
blood clots, platelets incorporated in the clot are stimulated to release the
contents of their secretory vesicles (Figure 17-60). The superior ability of serum to
support cell proliferation suggested that platelets contain one or more mitogens.
This hypothesis was confirmed by showing that extracts of platelets could serve
instead of serum to stimulate fibroblast proliferation. The crucial factor in the
extracts was shown to be a protein, which was subsequently purified and named
PDGF. In the body, PDGF liberated from blood clots helps stimulate cell division
during wound healing.

PDGF is only one of over 50 animal proteins that are known to act as mitogens.
Most of these proteins have a broad specificity. PDGF, for example, can stimulate
many types of cells to divide, including fibroblasts, smooth muscle cells, and
neuroglial cells. Similarly, epidermal growth factor (EGF) acts not only on epidermal
cells but also on many other cell types, including both epithelial and
nonepithelial cells. Some mitogens, however, have a narrow specificity; erythropoietin, for

Figure 17-60 A platelet. Platelets are
miniature cells without a nucleus. They
circulate in the blood and help stimulate
blood clotting at sites of tissue damage,
thereby preventing excessive bleeding.
They also release various factors that
stimulate wound healing. The platelet
shown here has been cut in half to show its
secretory vesicles, some of which contain
platelet-derived growth factor (PDGF).

example, only induces the proliferation of red blood cell precursors. Many
mitogens, including PDGF, also have actions other than the stimulation of cell division:
they can stimulate cell growth, survival, differentiation, or migration, depending
on the circumstances and the cell type.

In some tissues, inhibitory extracellular signal proteins oppose the positive 
regulators and thereby inhibit organ growth. The best-understood inhibitory 
signal proteins are transforming growth factor-β (TGFβ) and its relatives. TGFβ
inhibits the proliferation of several cell types, mainly by blocking cell-cycle
progression in G1.

Cells Can Enter a Specialized Nondividing State 

In the absence of a mitogenic signal to proliferate, Cdk inhibition in G1 is
maintained by the multiple mechanisms discussed earlier, and progression into a new
cell cycle is blocked. In some cases, cells partly disassemble their cell-cycle
control system and withdraw from the cycle to a specialized nondividing state called
G0.

Most cells in our body are in G0, but the molecular basis and reversibility of
this state vary in different cell types. Most of our neurons and skeletal muscle cells,
for example, are in a terminally differentiated G0 state, in which their cell-cycle
control system is completely dismantled: the expression of the genes encoding
various Cdks and cyclins is permanently turned off, and cell division rarely occurs.
Some cell types withdraw from the cell cycle only transiently and retain the ability
to reassemble the cell-cycle control system quickly and re-enter the cycle. Most
liver cells, for example, are in G0, but they can be stimulated to divide if the liver
is damaged. Still other types of cells, including fibroblasts and some lymphocytes,
withdraw from and re-enter the cell cycle repeatedly throughout their lifetime.

Almost all the variation in cell-cycle length in the adult body occurs during the
time the cell spends in G1 or G0. By contrast, the time a cell takes to progress from
the beginning of S phase through mitosis is usually brief (typically 12-24 hours in
mammals) and relatively constant, regardless of the interval from one division to
the next.

Mitogens Stimulate G1-Cdk and G1/S-Cdk Activities

For the vast majority of animal cells, mitogens control the rate of cell division by
acting in the G1 phase of the cell cycle. As discussed earlier, multiple mechanisms
act during G1 to suppress Cdk activity. Mitogens release these brakes on Cdk
activity, thereby allowing entry into a new cell cycle.

As we discuss in Chapter 15, mitogens interact with cell-surface receptors to 
trigger multiple intracellular signaling pathways. One major pathway acts through 
the monomeric GTPase Ras, which leads to the activation of a mitogen-activated 
protein kinase (MAP kinase) cascade (see Figure 15-49). This leads to an increase 
in the production of transcription regulatory proteins, including Myc. Myc is 
thought to promote cell-cycle entry by several mechanisms, one of which is to 
increase the expression of genes encoding G1 cyclins (D cyclins), thereby
increasing G1-Cdk (cyclin D-Cdk4) activity. Myc also has a major role in stimulating the
transcription of genes that increase cell growth.

The key function of G1-Cdk complexes in animal cells is to activate a group
of gene regulatory factors called the E2F proteins, which bind to specific DNA
sequences in the promoters of a wide variety of genes that encode proteins required
for S-phase entry, including G1/S-cyclins, S-cyclins, and proteins involved in DNA
synthesis and chromosome duplication. In the absence of mitogenic stimulation,
E2F-dependent gene expression is inhibited by an interaction between E2F and
members of the retinoblastoma protein (Rb) family. When cells are stimulated
to divide by mitogens, active G1-Cdk accumulates and phosphorylates Rb family
members, reducing their binding to E2F. The liberated E2F proteins then activate
expression of their target genes (Figure 17-61).

This transcriptional control system, like so many other control systems that
regulate the cell cycle, includes feedback loops that ensure that entry into the

Figure 17-61 Mitogen stimulation of cell-cycle
entry. As discussed in Chapter 15,
mitogens bind to cell-surface receptors to
initiate intracellular signaling pathways. One
of the major pathways involves activation
of the small GTPase Ras, which activates a
MAP kinase cascade, leading to increased
expression of numerous immediate early
genes, including the gene encoding the
transcription regulatory protein Myc. Myc
increases the expression of many
delayed-response genes, including some that lead
to increased G1-Cdk activity (cyclin D-Cdk4),
which triggers the phosphorylation
of members of the Rb family of proteins.
This inactivates the Rb proteins, freeing
the gene regulatory protein E2F to activate
the transcription of G1/S genes, including
the genes for a G1/S-cyclin (cyclin E) and
S-cyclin (cyclin A). The resulting G1/S-Cdk
and S-Cdk activities further enhance Rb
protein phosphorylation, forming a positive
feedback loop. E2F proteins also stimulate
the transcription of their own genes,
forming another positive feedback loop.

cell cycle is complete and irreversible. The liberated E2F proteins, for example,
increase the transcription of their own genes. In addition, E2F-dependent
transcription of G1/S-cyclin (cyclin E) and S-cyclin (cyclin A) genes leads to increased
G1/S-Cdk and S-Cdk activities, which in turn increase Rb protein phosphorylation
and promote further E2F release (see Figure 17-61).
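
How positive feedback can make a decision complete and irreversible is easier to see in a toy calculation. The sketch below is not a published model of the Rb-E2F system: the rate law, the parameter values (`k_fb`, `K`, `d`), and the function names are all assumptions chosen for illustration. The variable `e` stands for the fraction of E2F that is free of Rb; mitogen-driven G1-Cdk activity releases E2F at a rate proportional to `mitogen`, free E2F promotes its own further release (standing in for the G1/S-Cdk and S-Cdk loop), and E2F is recaptured by Rb at rate `d`.

```python
def e2f_rate(e, mitogen, k_fb=1.0, K=0.5, d=0.1):
    """Rate of change of the free-E2F fraction e (0 <= e <= 1).

    All parameters are arbitrary illustrative values, not measurements.
    """
    # Positive feedback: free E2F raises Cdk activity, which releases more E2F.
    feedback = k_fb * e**2 / (K**2 + e**2)
    release = (mitogen + feedback) * (1.0 - e)   # acts on Rb-bound E2F
    recapture = d * e                            # rebinding by Rb
    return release - recapture


def simulate(schedule, e0=0.0, dt=0.01):
    """Integrate with the forward Euler method.

    schedule is a list of (mitogen_level, duration) pairs applied in order.
    """
    e = e0
    for mitogen, duration in schedule:
        for _ in range(int(round(duration / dt))):
            e += dt * e2f_rate(e, mitogen)
    return e


no_mitogen = simulate([(0.0, 40.0)])                    # stays at the low state
brief_pulse = simulate([(0.05, 0.2), (0.0, 40.0)])      # too short to commit
sustained_pulse = simulate([(0.05, 20.0), (0.0, 40.0)]) # commits, stays high
```

With these assumed parameters the system has two stable states when mitogen is absent. A sufficiently long mitogen pulse pushes free E2F past the threshold between them, after which the feedback alone keeps E2F high even when the mitogen is withdrawn; a very brief pulse does not.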

The central member of the Rb family, the Rb protein itself, was identified
originally through studies of an inherited form of eye cancer in children, known as
retinoblastoma (discussed in Chapter 20). The loss of both copies of the Rb gene
leads to excessive proliferation of some cells in the developing retina, suggesting
that the Rb protein is particularly important for restraining cell division in this
tissue. The complete loss of Rb does not immediately cause increased proliferation
of retinal or other types of cells, in part because Cdh1 and CKIs also help inhibit
progression through G1 and in part because other cell types contain Rb-related
proteins that provide backup support in the absence of Rb. It is also likely that
other proteins, unrelated to Rb, help to regulate the activity of E2F.

Additional layers of control promote an overwhelming increase in S-Cdk
activity at the beginning of S phase. We mentioned earlier that the APC/C
activator Cdh1 suppresses cyclin levels after mitosis. In animal cells, however, G1-
and G1/S-cyclins are resistant to Cdh1-APC/C and can therefore act unopposed
by the APC/C to promote Rb protein phosphorylation and E2F-dependent gene
expression. S-cyclin, by contrast, is not resistant, and its level is initially restrained
by Cdh1-APC/C activity. However, G1/S-Cdk also phosphorylates and inactivates
Cdh1-APC/C, thereby allowing the accumulation of S-cyclin, further
promoting S-Cdk activation. G1/S-Cdk also inactivates CKI proteins that suppress S-Cdk
activity. The overall effect of all these interactions is the rapid and complete
activation of the S-Cdk complexes required for S-phase initiation.

DNA Damage Blocks Cell Division: The DNA Damage Response 

Progression through the cell cycle, and thus the rate of cell proliferation, is
controlled not only by extracellular mitogens but also by other extracellular and
intracellular signals. One of the most important influences is DNA damage, which can
occur as a result of spontaneous chemical reactions in DNA, errors in DNA
replication, or exposure to radiation or certain chemicals (discussed in Chapter 5). It is
essential that damaged chromosomes are repaired before attempting to duplicate
or segregate them. The cell-cycle control system can readily detect DNA damage
and arrest the cycle at either of two transitions—one at Start, which prevents entry
into the cell cycle and into S phase, and one at the G2/M transition, which
prevents entry into mitosis (see Figure 17-16).

DNA damage initiates a signaling pathway by activating one of a pair of related
protein kinases called ATM and ATR, which associate with the site of damage and
phosphorylate various target proteins, including two other protein kinases called
Chk1 and Chk2. These various kinases phosphorylate other target proteins that
lead to cell-cycle arrest. A major target is the gene regulatory protein p53, which
stimulates transcription of the gene encoding p21, a CKI protein; p21 binds to
G1/S-Cdk and S-Cdk complexes and inhibits their activities, thereby helping to
block entry into the cell cycle (Figure 17-62 and Movie 17.8).

DNA damage activates p53 by an indirect mechanism. In undamaged cells, p53
is highly unstable and is present at very low concentrations. This is largely because
it interacts with another protein, Mdm2, which acts as a ubiquitin ligase that
targets p53 for destruction by proteasomes. Phosphorylation of p53 after DNA
damage reduces its binding to Mdm2. This decreases p53 degradation, which results
in a marked increase in p53 concentration in the cell. In addition, the decreased
binding to Mdm2 enhances the ability of p53 to stimulate gene transcription (see
Figure 17-62).

The protein kinases Chk1 and Chk2 also block cell-cycle progression by
phosphorylating members of the Cdc25 family of protein phosphatases, thereby
inhibiting their function. As described earlier, these phosphatases are
particularly important in the activation of M-Cdk at the beginning of mitosis (see Figure
17-20). Chk1 and Chk2 phosphorylate Cdc25 at inhibitory sites that are distinct
from the phosphorylation sites that stimulate Cdc25 activity. The inhibition of
Cdc25 activity by DNA damage helps block entry into mitosis (see Figure 17-16).

The DNA damage response can also be activated by problems that arise when
a replication fork fails during DNA replication. When nucleotides are depleted, for
example, replication forks stall during the elongation phase of DNA synthesis. To
prevent the cell from attempting to segregate partially replicated chromosomes,
the same mechanisms that respond to DNA damage detect the stalled replication
forks and block entry into mitosis until the problems are resolved.

A low level of DNA damage occurs in the normal life of any cell, and this
damage accumulates in the cell's progeny if the DNA damage response is not
functioning. Over the long term, the accumulation of genetic damage in cells lacking
the DNA damage response leads to an increased frequency of cancer-promoting
mutations. Indeed, mutations in the p53 gene occur in at least half of all human
cancers (discussed in Chapter 20). This loss of p53 function allows the cancer cell

Figure 17-62 How DNA damage
arrests the cell cycle in G1. When DNA
is damaged, various protein kinases
are recruited to the site of damage and
initiate a signaling pathway that causes
cell-cycle arrest. The first kinase at
the damage site is either ATM or ATR,
depending on the type of damage.
Additional protein kinases, called Chk1
and Chk2, are then recruited and
activated, resulting in the phosphorylation
of the transcription regulatory protein
p53. Mdm2 normally binds to p53 and
promotes its ubiquitylation and destruction
in proteasomes. Phosphorylation of p53
blocks its binding to Mdm2; as a result,
p53 accumulates to high levels and
stimulates transcription of numerous genes,
including the gene that encodes the CKI
protein p21. The p21 binds and inactivates
G1/S-Cdk and S-Cdk complexes, arresting
the cell in G1. In some cases, DNA damage
also induces either the phosphorylation of
Mdm2 or a decrease in Mdm2 production,
which causes a further increase in p53 (not
shown).

to accumulate mutations more readily. Similarly, a rare genetic disease known as
ataxia telangiectasia is caused by a defect in ATM, one of the protein kinases that
are activated in response to x-ray-induced DNA damage; patients with this
disease are very sensitive to x-rays and suffer from increased rates of cancer.

What happens if DNA damage is so severe that repair is not possible? The
answer differs in different organisms. Unicellular organisms such as budding
yeast arrest their cell cycle to try to repair the damage, but the cycle resumes even
if the repair cannot be completed. For a single-celled organism, life with
mutations is apparently better than no life at all. In multicellular organisms, however,
the health of the organism takes precedence over the life of an individual cell.
Cells that divide with severe DNA damage threaten the life of the organism, since
genetic damage can often lead to cancer and other diseases. Thus, animal cells
with severe DNA damage do not attempt to continue division, but instead
commit suicide by undergoing apoptosis. Unless the DNA damage is repaired,
therefore, the DNA damage response can lead to either cell-cycle arrest or cell death. DNA
damage-induced apoptosis often depends on the activation of p53. Indeed, it is
this apoptosis-promoting function of p53 that is apparently most important in
protecting us against cancer.

Many Human Cells Have a Built-In Limitation on the Number of
Times They Can Divide

Many human cells divide a limited number of times before they stop and undergo
a permanent cell-cycle arrest. Fibroblasts taken from normal human tissue, for
example, go through only about 25-50 population doublings when cultured in
a standard mitogenic medium. Toward the end of this time, proliferation slows
down and finally halts, and the cells enter a nondividing state from which they
never recover. This phenomenon is called replicative cell senescence.

Replicative cell senescence in human fibroblasts seems to be caused by
changes in the structure of the telomeres, the repetitive DNA sequences and
associated proteins at the ends of chromosomes. As discussed in Chapter 5, when a
cell divides, telomeric DNA sequences are not replicated in the same manner as
the rest of the genome but instead are synthesized by the enzyme telomerase.
Telomerase also promotes the formation of protein cap structures that protect the
chromosome ends. Because human fibroblasts, and many other human somatic
cells, do not produce telomerase, their telomeres become shorter with every cell
division, and their protective protein caps progressively deteriorate.
Eventually, the exposed chromosome ends are sensed as DNA damage, which activates
a p53-dependent cell-cycle arrest (see Figure 17-62). Rodent cells, by contrast,
maintain telomerase activity when they proliferate in culture and therefore do not
have such a telomere-dependent mechanism for limiting proliferation. The forced
expression of telomerase in normal human fibroblasts, using genetic
engineering techniques, blocks this form of senescence. Unfortunately, most cancer cells
have regained the ability to produce telomerase and therefore maintain telomere
function as they proliferate; as a result, they do not undergo replicative cell
senescence.
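
The counting logic behind replicative senescence can be sketched in a few lines. This is a deliberately simplified illustration: the function name is hypothetical, and the starting telomere length, the loss per division, and the critical length are placeholder numbers picked so that the result falls within the 25-50 doublings quoted above, not measured values. The model also ignores the deterioration of the protein cap and treats arrest as a sharp threshold.

```python
def doublings_until_senescence(
    telomere_bp=10_000,        # assumed starting telomere length
    loss_per_division_bp=100,  # assumed shortening per division without telomerase
    critical_bp=6_000,         # assumed length at which ends are sensed as DNA damage
    telomerase_active=False,
    max_divisions=1_000,       # stop counting here if the cell never arrests
):
    """Number of divisions before telomere-dependent arrest, or None if none occurs."""
    divisions = 0
    while divisions < max_divisions:
        if telomere_bp <= critical_bp:
            # Exposed ends trigger the p53-dependent cell-cycle arrest.
            return divisions
        if not telomerase_active:
            # Without telomerase the telomere shortens at every division.
            telomere_bp -= loss_per_division_bp
        divisions += 1
    return None


normal_fibroblast = doublings_until_senescence()
with_telomerase = doublings_until_senescence(telomerase_active=True)
```

Under these assumptions the cell without telomerase arrests after a fixed number of divisions, whereas the cell with active telomerase (as in forced telomerase expression or in most cancer cells) never reaches the threshold.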

Abnormal Proliferation Signals Cause Cell-Cycle Arrest or
Apoptosis, Except in Cancer Cells

Many of the components of mitogenic signaling pathways are encoded by genes
that were originally identified as cancer-promoting genes, because mutations in
them contribute to the development of cancer. The mutation of a single amino
acid in the small GTPase Ras, for example, causes the protein to become
permanently overactive, leading to constant stimulation of Ras-dependent signaling
pathways, even in the absence of mitogenic stimulation. Similarly, mutations that
cause an overexpression of Myc stimulate excessive cell growth and proliferation
and thereby promote the development of cancer (discussed in Chapter 20).

Surprisingly, however, when a hyperactivated form of Ras or Myc is
experimentally overproduced in most normal cells, the result is not excessive
proliferation but the opposite: the cells undergo either permanent cell-cycle arrest or
apoptosis. The normal cell seems able to detect abnormal mitogenic stimulation,
and it responds by preventing further division. Such responses help prevent the
survival and proliferation of cells with various cancer-promoting mutations.

Although it is not known how a cell detects excessive mitogenic stimulation,
such stimulation often leads to the production of a cell-cycle inhibitor protein
called Arf, which binds and inhibits Mdm2. As discussed earlier, Mdm2
normally promotes p53 degradation. Activation of Arf therefore causes p53 levels to
increase, inducing either cell-cycle arrest or apoptosis (Figure 17-63).

How do cancer cells ever arise if these mechanisms block the division or
survival of mutant cells with overactive proliferation signals? The answer is that the
protective system is often inactivated in cancer cells by mutations in the genes
that encode essential components of the blocking mechanisms, such as Arf or p53
or the proteins that help activate them.
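
The chain of inhibition described above (excessive mitogenic signal, Arf, Mdm2, p53) can be written as a small piece of Boolean logic. This is only a sketch: real protein levels are graded rather than on or off, and the function and parameter names are hypothetical.

```python
def response_to_mitogenic_signal(
    excessive: bool,
    arf_functional: bool = True,
    p53_functional: bool = True,
) -> str:
    """Boolean caricature of the Arf-Mdm2-p53 safeguard.

    Simplifying assumption: each protein is either active or inactive.
    """
    # Excessive stimulation leads to production of Arf (if the gene is intact).
    arf_active = excessive and arf_functional
    # Arf binds and inhibits Mdm2; without Arf, Mdm2 stays active.
    mdm2_active = not arf_active
    # Mdm2 promotes p53 degradation, so p53 accumulates only when Mdm2 is off.
    p53_high = p53_functional and not mdm2_active
    if p53_high:
        return "cell-cycle arrest or apoptosis"
    return "no p53-mediated block"


if __name__ == "__main__":
    print(response_to_mitogenic_signal(True))
    print(response_to_mitogenic_signal(True, arf_functional=False))
    print(response_to_mitogenic_signal(True, p53_functional=False))
```

Losing either Arf or p53 removes the block in this sketch, which mirrors why mutations in these genes allow cells with overactive proliferation signals to survive.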

Cell Proliferation Is Accompanied by Cell Growth

If cells proliferated without growing, they would get progressively smaller and
there would be no net increase in total cell mass. In most proliferating cell
populations, therefore, cell growth accompanies cell division. In single-celled organisms
such as yeasts, both cell growth and cell division require only nutrients. In
animals, by contrast, both cell growth and cell proliferation depend on extracellular
signal molecules, produced by other cells, which we call growth factors and
mitogens, respectively.

Figure 17-63 Cell-cycle arrest or apoptosis induced by excessive
stimulation of mitogenic pathways. Abnormally high levels of Myc cause the
activation of Arf, which binds and inhibits Mdm2 and thereby increases p53 levels
(see Figure 17-62). Depending on the cell type and extracellular conditions, p53 then
causes either cell-cycle arrest or apoptosis.

Like mitogens, the extracellular growth factors that stimulate animal cell
growth bind to receptors on the cell surface and activate intracellular signaling
pathways. These pathways stimulate the accumulation of proteins and other
macromolecules, and they do so by both increasing their rate of synthesis and
decreasing their rate of degradation. They also trigger increased uptake of nutrients and
production of the ATP required to fuel the increased protein synthesis. One of the
most important intracellular signaling pathways activated by growth factor
receptors involves the enzyme phosphoinositide 3-kinase (PI 3-kinase), which adds
a phosphate from ATP to the 3' position of inositol phospholipids in the plasma
membrane (discussed in Chapter 15). The activation of PI 3-kinase leads to the
activation of a kinase called TOR, which lies at the heart of cell growth regulatory
pathways in all eukaryotes. TOR activates many targets in the cell that stimulate
metabolic processes, including protein synthesis. One target is a protein kinase
called S6 kinase (S6K), which phosphorylates ribosomal protein S6, increasing the
ability of ribosomes to translate a subset of mRNAs that mostly encode ribosomal
components. TOR also indirectly activates a translation initiation factor called
eIF4E and directly activates transcription regulators that promote the increased
expression of genes encoding ribosomal subunits (Figure 17-64).

Figure 17-64 Stimulation of cell growth by extracellular growth factors and
nutrients. The occupation of cell-surface receptors by growth factors leads to
the activation of PI 3-kinase, which promotes protein synthesis through a
complex signaling pathway that leads to the activation of the protein kinase
TOR; extracellular nutrients such as amino acids also help activate TOR.
TOR phosphorylates multiple proteins to stimulate protein synthesis, as shown;
it also inhibits protein degradation (not shown). Growth factors also stimulate
increased production of the transcription regulatory protein Myc (not shown),
which activates the transcription of various genes that promote cell metabolism
and growth. 4E-BP is an inhibitor of the translation initiation factor eIF4E.
PI(4,5)P2, phosphatidylinositol 4,5-bisphosphate; PI(3,4,5)P3,
phosphatidylinositol 3,4,5-trisphosphate.



























Figure 17-65 Potential mechanisms for coordinating cell growth and division. In proliferating cells, cell size is maintained
by mechanisms that coordinate rates of cell division and cell growth. Numerous alternative coupling mechanisms are thought to
exist, and different cell types appear to employ different combinations of these mechanisms. (A) In many cell types, particularly
yeast, the rate of cell division is governed by the rate of cell growth, so that division occurs only when growth rate achieves
some minimal threshold; in yeasts, it is mainly the levels of extracellular nutrients that regulate the rate of cell growth and thereby
the rate of cell division. (B) In some animal cell types, growth and division can each be controlled by separate extracellular
factors (growth factors and mitogens, respectively), and cell size depends on the relative levels of the two types of factors.
(C) Some extracellular factors can stimulate both cell growth and cell division by simultaneously activating signaling pathways
that promote growth and other pathways that promote cell-cycle progression.


Proliferating Cells Usually Coordinate Their Growth and Division 

For proliferating cells to maintain a constant size, they must coordinate their
growth with cell division to ensure that cell size doubles with each division: if cells
grow too slowly, they will get smaller with each division, and if they grow too fast,
they will get larger with each division. It is not clear how cells achieve this
coordination, but it is likely to involve multiple mechanisms that vary in different
organisms and even in different cell types of the same organism (Figure 17-65).
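
The requirement that size must double between divisions can be illustrated with a toy calculation. The sketch below assumes that a cell multiplies its size by a fixed factor during each cycle and then splits into two equal daughters; the function name and the example factors (1.8, 2.0, 2.2) are hypothetical choices, not measured values.

```python
def size_after_divisions(initial_size: float, growth_per_cycle: float, divisions: int) -> list[float]:
    """Track daughter-cell size over successive cycles of growth and division.

    Assumes growth by a constant factor per cycle and equal division in two.
    """
    size = initial_size
    history = [size]
    for _ in range(divisions):
        size = size * growth_per_cycle / 2  # grow, then halve at division
        history.append(size)
    return history


if __name__ == "__main__":
    for factor in (1.8, 2.0, 2.2):
        sizes = size_after_divisions(1.0, factor, 10)
        print(f"growth x{factor} per cycle: size after 10 divisions = {sizes[-1]:.3f}")
```

Only a growth factor of exactly 2 keeps size constant in this model; any persistent mismatch compounds geometrically, which is why some coordinating mechanism is needed.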

Animal cell growth and division are not always coordinated, however. In many
cases, they are completely uncoupled to allow growth without division or division
without growth. Muscle cells and nerve cells, for example, can grow dramatically
after they have permanently withdrawn from the cell cycle. Similarly, the eggs of
many animals grow to an extremely large size without dividing; after fertilization,
however, this relationship is reversed, and many rounds of division occur without
growth.

Compared to cell division, there has been surprisingly little study of how cell
size is controlled in animals. As a result, it remains a mystery how cell size is
determined and why different cell types in the same animal grow to be so different in
size. One of the best-understood cases in mammals is the adult sympathetic
neuron, which has permanently withdrawn from the cell cycle. Its size depends on the
amount of nerve growth factor (NGF) secreted by the target cells it innervates; the
greater the amount of NGF the neuron has access to, the larger it becomes. It seems
likely that the genes a cell expresses set limits on the size it can be, while
extracellular signal molecules and nutrients regulate the size within these limits. The
challenge is to identify the relevant genes and signal molecules for each cell type.


WHAT WE DON'T KNOW

• Progression through the cell cycle depends on the phosphorylation
of hundreds of different proteins by cyclin-Cdk complexes. What are the
molecular mechanisms ensuring that these proteins are phosphorylated at
precisely the right time and place?

• During S phase, how are histones and their modifying enzymes
controlled to replicate chromatin structure on the duplicated DNA?

• What is the structural basis of chromosome condensation, and
how is the process stimulated during mitosis?

• What are the mechanisms by which microtubule attachment and tension
are sensed at the kinetochore by the components of the spindle assembly
checkpoint?

• How is cell growth coordinated with cell division to ensure that cell size
remains constant?


Summary

In multicellular animals, cell size, cell division, and cell survival are carefully
controlled to ensure that the organism and its organs achieve and maintain an
appropriate size. Mitogens stimulate the rate of cell division by removing intracellular
molecular brakes that restrain cell-cycle progression in G1. Growth factors promote
cell growth (an increase in cell mass) by stimulating the synthesis and inhibiting
the degradation of macromolecules. To maintain a constant cell size, proliferating
cells employ multiple mechanisms to ensure that cell growth is coordinated with
cell division.


PROBLEMS

Which statements are true? Explain why or why not. 

17-1 Since there are about 10^13 cells in an adult human,
and about 10^10 cells die and are replaced each day, we
become new people every three years.

17-2 In order for proliferating cells to maintain a relatively
constant size, the length of the cell cycle must match
the time it takes for the cell to double in size.

17-3 While other proteins come and go during the
cell cycle, the proteins of the origin recognition complex
remain bound to the DNA throughout.

17-4 Chromosomes are positioned on the metaphase 
plate by equal and opposite forces that pull them toward 
the two poles of the spindle.

17-5 Meiosis segregates the paternal homologs into 
sperm and the maternal homologs into eggs.

17-6 If we could turn on telomerase activity in all our
cells, we could prevent aging. 

Discuss the following problems. 

17-7 Many cell-cycle genes from human cells function 
perfectly well when expressed in yeast cells. Why do you 
suppose that is considered remarkable? After all, many
human genes encoding enzymes for metabolic reactions
also function in yeast, and no one thinks that is remarkable.

17-8 Hoechst 33342 is a membrane-permeant dye that 
fluoresces when it binds to DNA. When a population of 
cells is incubated briefly with Hoechst dye and then sorted 
in a flow cytometer, which measures the fluorescence of 
each cell, the cells display various levels of fluorescence as 
shown in Figure Q17-1. 

A. Which cells in Figure Q17-1 are in the G1, S, G2,
and M phases of the cell cycle? Explain the basis for your 
answer. 

B. Sketch the sorting distributions you would expect 
for cells that were treated with inhibitors that block the cell
cycle in the G1, S, or M phase. Explain your reasoning.

Figure Q17-1 Analysis of Hoechst 33342
fluorescence in a population of cells sorted
in a flow cytometer (Problem 17-8).
[Plot axis: relative fluorescence per cell.]



17-9 The yeast cohesin subunit Scc1, which is essential
for sister-chromatid cohesion, can be artificially
regulated for expression at any point in the cell cycle. If
expression is turned on at the beginning of S phase, all the
cells divide satisfactorily and survive. By contrast, if Scc1
expression is turned on only after S phase is completed,
the cells fail to divide and they die, even though Scc1
accumulates in the nucleus and interacts efficiently with
chromosomes. Why do you suppose that cohesin must
be present during S phase for cells to divide normally?

17-10 High doses of caffeine interfere with the DNA
damage response in mammalian cells. Why then do you
suppose the Surgeon General has not yet issued an
appropriate warning to heavy coffee and cola drinkers? A typical
cup of coffee (150 mL) contains 100 mg of caffeine (196 g/mole).
How many cups of coffee would you have to drink
to reach the dose (10 mM) required to interfere with the
DNA damage response? (A typical adult contains about 40
liters of water.)
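
One way to set up the arithmetic for Problem 17-10 is sketched below. It uses only the figures given in the problem and assumes, as a simplification, that caffeine distributes evenly through body water and is neither metabolized nor excreted while drinking; the function name is hypothetical.

```python
def cups_needed(
    target_mM: float = 10.0,      # concentration that interferes with the response
    body_water_L: float = 40.0,   # water content of a typical adult
    molar_mass_g: float = 196.0,  # g/mole, as given in the problem
    mg_per_cup: float = 100.0,    # caffeine per 150 mL cup
) -> float:
    """Cups of coffee required to reach the target caffeine concentration."""
    moles = (target_mM / 1000.0) * body_water_L  # mol of caffeine required
    milligrams = moles * molar_mass_g * 1000.0   # convert g to mg
    return milligrams / mg_per_cup


if __name__ == "__main__":
    cups = cups_needed()
    print(f"cups required: about {cups:.0f}")
    print(f"volume of coffee: about {cups * 0.150:.0f} L")
```

Under these assumptions the answer comes to roughly 784 cups, a volume of coffee several times larger than the body's entire water content.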

17-11 How many kinetochores are there in a human cell 
at mitosis? 

17-12 A living cell from the lung epithelium of a newt 
is shown at different stages in M phase in Figure Q17-2. 
Order these light micrographs into the correct sequence 
and identify the stage in M phase that each represents. 






Figure Q17-2 Light micrographs of a single cell at different stages of 
M phase (Problem 17-12). (Courtesy of Conly L. Rieder.) 


17-13 Down syndrome (trisomy 21) and Edwards syndrome
(trisomy 18) are the most common autosomal trisomies
seen in human infants. Does this fact mean that these
chromosomes are the most difficult to segregate properly
during meiosis?

17-14 The human genome consists of 23 pairs of chromosomes
(22 pairs of autosomes and one pair of sex chromosomes).
During meiosis, the maternal and paternal sets
of homologs pair, and then are separated into gametes, so
that each contains 23 chromosomes. If you assume that
the chromosomes in the paired homologs are randomly
assorted to daughter cells, how many potential combinations
of paternal and maternal homologs can be generated
during meiosis? (For the purposes of this calculation,
assume that no recombination occurs.)
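
The counting in Problem 17-14 can be set up as a short calculation. The sketch assumes, as the problem states, independent assortment with no recombination, so that each homolog pair contributes either its maternal or its paternal copy to a gamete; the function name is hypothetical.

```python
def homolog_combinations(chromosome_pairs: int = 23) -> int:
    """Number of distinct maternal/paternal homolog sets in a gamete.

    Each pair offers two independent choices, giving 2 ** pairs combinations.
    """
    return 2 ** chromosome_pairs


if __name__ == "__main__":
    print(f"{homolog_combinations():,} possible combinations")
```

With 23 pairs this gives about 8.4 million combinations from assortment alone; recombination, which the problem excludes, would increase the number much further.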
The growth, development, and maintenance of multicellular organisms depend
not only on the production of cells but also on mechanisms to destroy them. The
maintenance of tissue size, for example, requires that cells die at the same rate as
they are produced. During development, carefully orchestrated patterns of cell
death help determine the size and shape of limbs and other tissues. Cells also die
when they become damaged or infected, ensuring that they are removed before
they threaten the health of the organism. In these and most other cases, cell death
is not a random process but occurs by a programmed sequence of molecular
events, in which the cell systematically destroys itself from within and is then
eaten by other cells, leaving no trace. In most cases, this programmed cell death
occurs by a process called apoptosis, from the Greek word meaning “falling off,”
as leaves from a tree.

Cells dying by apoptosis undergo characteristic morphological changes. They
shrink and condense, the cytoskeleton collapses, the nuclear envelope
disassembles, and the nuclear chromatin condenses and breaks up into fragments (Figure
18-1A). The cell surface often bulges outward and, if the cell is large, it breaks
up into membrane-enclosed fragments called apoptotic bodies. The surface of the
cell or apoptotic bodies becomes chemically altered, so that a neighboring cell
or a macrophage (a specialized phagocytic cell, discussed in Chapter 22) rapidly
engulfs them, before they can spill their contents (Figure 18-1B). In this way, the
cell dies neatly and is rapidly cleared away, without causing a damaging
inflammatory response. Because the cells are eaten and digested so quickly, there are
usually few dead cells to be seen, even when large numbers of cells have died by
apoptosis. This is probably why biologists overlooked apoptosis for many years
and still might underestimate its extent.

In contrast to apoptosis, animal cells that die in response to an acute insult,
such as trauma or a lack of blood supply, usually do so by a process called cell
necrosis. Necrotic cells swell and burst, spilling their contents over their neighbors
and eliciting an inflammatory response (Figure 18-1C). In most cases, necrosis is
likely to be caused by energy depletion, which leads to metabolic defects and loss
of the ionic gradients that normally exist across the cell membrane. One form of
necrosis, called necroptosis, is a form of programmed cell death that is triggered by
a specific regulatory signal from other cells, although we are only just beginning to
understand the underlying mechanisms.

Some form of programmed cell death occurs in many organisms, but apoptosis
is found primarily in animals. This chapter focuses on the major functions of
apoptosis, its mechanism and regulation, and how excessive or insufficient
apoptosis can contribute to human disease.

Apoptosis Eliminates Unwanted Cells

The amount of apoptotic cell death that occurs in developing and adult animal
tissues is astonishing. In the developing vertebrate nervous system, for
example, more than half of many types of nerve cells normally die soon after they are
formed. It seems remarkably wasteful for so many cells to die, especially as the
vast majority are perfectly healthy at the time they kill themselves. What purposes
does this massive cell death serve?

In some cases, the answer is clear. Cell death helps sculpt hands and feet
during embryonic development: they start out as spade-like structures, and the
individual digits separate only as the cells between them die, as illustrated for a mouse
paw in Figure 18-2. In other cases, cells die when the structure they form is no
longer needed. When a tadpole changes into a frog at metamorphosis, the cells
in the tail die, and the tail, which is not needed in the frog, disappears.
Apoptosis also functions as a quality-control process in development, eliminating cells
that are abnormal, misplaced, nonfunctional, or potentially dangerous to the
animal. Striking examples occur in the vertebrate adaptive immune system, where
apoptosis eliminates developing T and B lymphocytes that either fail to produce
potentially useful antigen-specific receptors or produce self-reactive receptors
that make the cells potentially dangerous (discussed in Chapter 24); it also
eliminates most of the lymphocytes activated by an infection, after they have helped
destroy the responsible microbes.

In adult tissues that are neither growing nor shrinking, cell death and cell
division must be tightly regulated to ensure that they are exactly in balance. If part of
the liver is removed in an adult rat, for example, liver cell proliferation increases
to make up the loss. Conversely, if a rat is treated with the drug phenobarbital,
which stimulates liver cell division (and thereby liver enlargement), and then the
phenobarbital treatment is stopped, apoptosis in the liver greatly increases until
the liver has returned to its original size, usually within a week or so. Thus, the liver
is kept at a constant size through the regulation of both the cell death rate and the
cell birth rate. The control mechanisms responsible for such regulation are largely
unknown.

Animal cells can recognize damage in their various organelles and, if the
damage is great enough, they can kill themselves by undergoing apoptosis. An
important example is DNA damage, which can produce cancer-promoting mutations
if not repaired. Cells have various ways of detecting DNA damage, and undergo
apoptosis if they cannot repair it.

Apoptosis Depends on an Intracellular Proteolytic Cascade That Is
Mediated by Caspases

Apoptosis is triggered by members of a family of specialized intracellular
proteases, which cleave specific sequences in numerous proteins inside the cell,
thereby bringing about the dramatic changes that lead to cell death and
engulfment. These proteases have a cysteine at their active site and cleave their target
proteins at specific aspartic acids; they are therefore called caspases (c for
cysteine and asp for aspartic acid). Caspases are synthesized in the cell as inactive
precursors and are activated only during apoptosis. There are two major classes of
apoptotic caspases: initiator caspases and executioner caspases.


Figure 18-1 Two distinct forms of cell
death. These electron micrographs show
cells that have died by apoptosis (A and
B) or by necrosis (C). The cells in (A) and
(C) died in a culture dish, whereas the cell
in (B) died in a developing tissue and has
been engulfed by a phagocytic cell. Note
that the cells in (A) and (B) have condensed
but seem relatively intact, whereas the cell
in (C) seems to have exploded. The large
vacuoles visible in the cytoplasm of the cell
in (A) are a variable feature of apoptosis.
(Courtesy of Julia Burne.)





Figure 18-2 Sculpting the digits in the
developing mouse paw by apoptosis.
(A) The paw in this mouse fetus has been
stained with a dye that specifically labels
cells that have undergone apoptosis.
The apoptotic cells appear as bright
green dots between the developing digits.
(B) The interdigital cell death has eliminated
the tissue between the developing digits,
as seen one day later, when there are very
few apoptotic cells. (From W. Wood et
al., Development 127:5245-5252, 2000.
With permission from The Company of
Biologists.)

Figure 18-3 Caspase activation during
apoptosis. An initiator caspase contains
a protease domain in its carboxy-terminal
region and a small protein interaction
domain near its amino terminus. It is initially
made in an inactive, monomeric form,
sometimes called procaspase. Apoptotic
signals trigger the assembly of adaptor
proteins carrying multiple binding sites
for the caspase amino-terminal domain.
Upon binding to the adaptor proteins, the
initiator caspases dimerize and are thereby
activated, leading to cleavage of a specific
site in their protease domains. Each
protease domain is then rearranged into
a large and small subunit. In some cases
(not shown), the adaptor-binding domain
of the initiator caspase is also cleaved (see
Figure 18-5). Executioner caspases are
initially formed as inactive dimers. Upon
cleavage at a site in the protease domain
by an initiator caspase, the executioner
caspase dimer undergoes an activating
conformational change. The executioner
caspases then cleave a variety of key
proteins, leading to the controlled death of
the cell.


Initiator caspases, as their name implies, begin the apoptotic process. They
normally exist as inactive, soluble monomers in the cytosol. An apoptotic signal
triggers the assembly of large protein platforms that bring multiple initiator
caspases together into large complexes. Within these complexes, pairs of caspases
associate to form dimers, resulting in protease activation (Figure 18-3). Each
caspase in the dimer then cleaves its partner at a specific site in the protease domain,
which stabilizes the active complex and is required for the proper function of the
enzyme in the cell.

The major function of the initiator caspases is to activate the executioner
caspases. These normally exist as inactive dimers. When they are cleaved by an
initiator caspase at a site in the protease domain, the active site is rearranged from
an inactive to an active conformation. One initiator caspase complex can activate
many executioner caspases, resulting in an amplifying proteolytic cascade. Once
activated, executioner caspases catalyze the widespread protein cleavage events
that kill the cell.

Various experimental approaches have led to the identification of over a
thousand proteins that are cleaved by caspases during apoptosis. Only a few of these
proteins have been studied in any detail. These include the nuclear lamins, the
cleavage of which causes the irreversible breakdown of the nuclear lamina
(discussed in Chapter 12). Another target is a protein that normally holds a
DNA-degrading endonuclease in an inactive form; its cleavage frees the endonuclease
to cut up the DNA in the cell nucleus (Figure 18-4). Other target proteins include
components of the cytoskeleton and cell-cell adhesion proteins that attach cells
to their neighbors; the cleavage of these proteins helps the apoptotic cell to round
up and detach from its neighbors, making it easier for a neighboring cell to engulf
it, or, in the case of an epithelial cell, for the neighbors to extrude the apoptotic cell
from the cell sheet. The caspase cascade is not only destructive and
self-amplifying but also irreversible, so that once a cell starts out along the path to destruction,
it cannot turn back.

How is the initiator caspase first activated in response to an apoptotic signal?
The two best-understood activation mechanisms in mammalian cells are called
the extrinsic pathway and the intrinsic, or mitochondrial, pathway. Each uses its
own initiator caspase and activation system, as we now discuss.

Figure 18-4 DNA fragmentation during apoptosis. (A) In healthy cells, the endonuclease CAD associates with its inhibitor,
iCAD. Activation of executioner caspases in the cell leads to cleavage of iCAD, which unleashes the nuclease. Activated CAD
cuts the chromosomal DNA between nucleosomes, resulting in the production of DNA fragments that form a ladder pattern
(see B) upon gel electrophoresis. (B) Mouse thymus lymphocytes were treated with an antibody against the cell-surface death
receptor Fas (discussed in the text), inducing the cells to undergo apoptosis. DNA was extracted at the times indicated above
the figure, and the fragments were separated by size by electrophoresis in an agarose gel and stained with ethidium bromide.
Because the cleavages occur in the linker regions between nucleosomes, the fragments separate into a characteristic ladder
pattern on these gels. Note that in gel electrophoresis, smaller molecules are more widely separated in the lower part of the gel,
so that removal of a single nucleosome has a greater apparent effect on their gel mobility. (C) Apoptotic nuclei can be detected
using a technique that adds a fluorescent label to DNA ends. In the image shown here, this technique was used in a tissue
section of a developing chick leg bud; this cross section through the skin and underlying tissue is from a region between two
developing digits, as indicated in the underlying drawing. The procedure is called the TUNEL (TdT-mediated dUTP nick end
labeling) technique because the enzyme terminal deoxynucleotidyl transferase (TdT) adds chains of labeled deoxynucleotide
(dUTP) to the 3'-OH ends of DNA fragments. The presence of large numbers of DNA fragments therefore results in bright
fluorescent dots in apoptotic cells. (B, from D. McIlroy et al., Genes Dev. 14:549-558, 2000. With permission from Cold Spring
Harbor Laboratory Press; C, from V. Zuzarte-Luis and J.M. Hurlé, Int. J. Dev. Biol. 46:871-876, 2002. With permission from
UBC Press.)
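
The remark about ladder spacing in the legend to Figure 18-4 can be made concrete with a short calculation. The sketch rests on two assumptions that are not stated in the legend: a nucleosome repeat of roughly 200 base pairs, and migration distance that varies approximately with the logarithm of fragment length over the range of interest. The names are hypothetical.

```python
import math

REPEAT_BP = 200  # assumed approximate nucleosome repeat length, in base pairs


def ladder_sizes(max_nucleosomes: int) -> list[int]:
    """Fragment lengths for DNA spanning 1..max_nucleosomes nucleosomes."""
    return [n * REPEAT_BP for n in range(1, max_nucleosomes + 1)]


def log_spacing(n: int) -> float:
    """Difference in log10(length) between the n- and (n-1)-nucleosome bands.

    Under the log-mobility assumption this is proportional to the distance
    between the two adjacent bands on the gel. Requires n >= 2.
    """
    return math.log10(n * REPEAT_BP) - math.log10((n - 1) * REPEAT_BP)


if __name__ == "__main__":
    print("ladder (bp):", ladder_sizes(6))
    for n in (2, 5, 10):
        print(f"bands {n - 1} and {n}: log spacing = {log_spacing(n):.3f}")
```

The spacing shrinks as fragments get longer, so under these assumptions the loss of one nucleosome shifts a short fragment much further than a long one, which is why the rungs are most clearly resolved near the bottom of the gel.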

Cell-Surface Death Receptors Activate the Extrinsic Pathway of 
Apoptosis 

Extracellular signal proteins binding to cell-surface death receptors trigger the 
extrinsic pathway of apoptosis. Death receptors are transmembrane proteins 
containing an extracellular ligand-binding domain, a single transmembrane 
domain, and an intracellular death domain, which is required for the receptors
to activate the apoptotic program. The receptors are homotrimers and belong to 
the tumor necrosis factor (TNF) receptor family, which includes a receptor for TNF 
itself and the Fas death receptor. The ligands that activate the death receptors are 
also homotrimers; they are structurally related to one another and belong to the 
TNF family of signal proteins. 

A well-understood example of how death receptors trigger the extrinsic
pathway of apoptosis is the activation of Fas on the surface of a target cell by Fas ligand
on the surface of a killer (cytotoxic) lymphocyte. When activated by the binding
of Fas ligand, the death domains on the cytosolic tails of the Fas death receptors
bind intracellular adaptor proteins, which in turn bind initiator caspases
(primarily caspase-8), forming a death-inducing signaling complex (DISC). Once
dimerized and activated in the DISC, the initiator caspases cleave their partners
and then activate downstream executioner caspases to induce apoptosis (Figure
18-5). In some cells, the extrinsic pathway recruits the intrinsic apoptotic
pathway to amplify the caspase cascade and kill the cell.

Many cells produce inhibitory proteins that act to restrain the extrinsic
pathway. For example, some cells produce the protein FLIP, which resembles an
initiator caspase but has no protease activity because it lacks the key cysteine in its
active site. FLIP dimerizes with caspase-8 in the DISC; although caspase-8 appears
to be active in these heterodimers, it is not cleaved at the site required for its stable
activation, and the apoptotic signal is blocked. Such inhibitory mechanisms help
prevent the inappropriate activation of the extrinsic pathway of apoptosis.

The Intrinsic Pathway of Apoptosis Depends on Mitochondria

Cells can also activate their apoptosis program from inside the cell, often in
response to stresses, such as DNA damage, or in response to developmental
signals. In vertebrate cells, these responses are governed by the intrinsic, or
mitochondrial, pathway of apoptosis, which depends on the release into the
cytosol of mitochondrial proteins that normally reside in the intermembrane space
of these organelles (see Figure 12-19). Some of the released proteins activate a
caspase proteolytic cascade in the cytoplasm, leading to apoptosis.

A key protein in the intrinsic pathway is cytochrome c, a water-soluble
component of the mitochondrial electron-transport chain. When released into the
cytosol (Figure 18-6), it takes on a new function: it binds to an adaptor protein
called Apaf1 (apoptotic protease activating factor-1), causing the Apaf1 to
oligomerize into a wheel-like heptamer called an apoptosome. The Apaf1 proteins in
the apoptosome then recruit initiator caspase-9 proteins, which are thought to
be activated by proximity in the apoptosome, just as caspase-8 is activated in the
DISC. The activated caspase-9 molecules then activate downstream executioner
caspases to induce apoptosis (Figure 18-7).

Bcl2 Proteins Regulate the Intrinsic Pathway of Apoptosis 

The intrinsic pathway of apoptosis is tightly regulated to ensure that cells kill
themselves only when it is appropriate. A major class of intracellular regulators of
the intrinsic pathway is the Bcl2 family of proteins, which, like the caspase family,
has been conserved in evolution from worms to humans; a human Bcl2 protein,
for example, can suppress apoptosis when expressed in the worm Caenorhabditis
elegans.


Figure 18-5 The extrinsic pathway of 
apoptosis activated through Fas death 
receptors. Trimeric Fas ligands on the 
surface of a killer lymphocyte interact with 
trimeric Fas receptors on the surface of the 
target cell, leading to clustering of several 
ligand-bound receptor trimers (only one 
trimer is shown here for clarity). Receptor 
clustering activâtes death domains on the 
receptor tails, which interact with similar 
domains on the adaptor protein FADD 
(FADD stands for Fas-associated death 
domain). Each FADD protein then recruits 
an initiator caspase (caspase-8) via a 
death effector domain on both FADD and 
the caspase, forming a death-inducing 
signaling complex (DISC). Within the DISC, 
two adjacent initiator caspases interact and 
cleave one another to form an activated 
protease dimer, which then cleaves itself 
in the region linking the protease to the
death effector domain. This stabilizes and 
releases the active caspase dimer into 
the cytosol, where it activates executioner
caspases by cleaving them. 

Mammalian Bcl2 family proteins regulate the intrinsic pathway of apoptosis
mainly by controlling the release of cytochrome c and other intermembrane
mitochondrial proteins into the cytosol. Some Bcl2 family proteins are
pro-apoptotic and promote apoptosis by enhancing the release, whereas others are
anti-apoptotic and inhibit apoptosis by blocking the release. The pro-apoptotic and
anti-apoptotic proteins can bind to each other in various combinations to form
heterodimers in which the two proteins inhibit each other's function. The
balance between the activities of these two functional classes of Bcl2 family proteins
largely determines whether a mammalian cell lives or dies by the intrinsic
pathway of apoptosis.

As illustrated in Figure 18-8, the anti-apoptotic Bcl2 family proteins, including
Bcl2 itself (the founding member of the Bcl2 family) and BclXL, share four
distinctive Bcl2 homology (BH) domains (BH1-4). The pro-apoptotic Bcl2 family proteins


Figure 18-6 Release of cytochrome
c from mitochondria in the intrinsic
pathway of apoptosis. Fluorescence
micrographs of human cancer cells
in culture. (A) The control cells were
transfected with a gene encoding a fusion
protein consisting of cytochrome c linked to
green fluorescent protein
(cytochrome-c-GFP); they were also treated with a red
dye that accumulates in mitochondria.
The overlapping distribution of the green
and red indicates that the
cytochrome-c-GFP is located in mitochondria.
(B) Cells expressing cytochrome-c-GFP
were irradiated with ultraviolet (UV) light to
induce the intrinsic pathway of apoptosis
and were photographed 5 hours later.
The six cells in the bottom half of this
micrograph have released their cytochrome
c from mitochondria into the cytosol,
whereas the cells in the upper half of the
micrograph have not yet done so
(Movie 18.1). (From J.C. Goldstein et al.,
Nat. Cell Biol. 2:156-162, 2000. With
permission from Macmillan Publishers Ltd.)

Figure 18-7 The intrinsic pathway of apoptosis. Intracellular apoptotic stimuli cause mitochondria to release cytochrome c, which interacts
with Apaf1. The binding of cytochrome c causes Apaf1 to unfold partly, exposing a domain that interacts with the same domain in other activated
Apaf1 molecules. Seven activated Apaf1 proteins form a large ring complex called the apoptosome. Each Apaf1 protein contains a caspase
recruitment domain (CARD), and these are clustered above the central hub of the apoptosome. The CARDs bind similar domains in multiple
caspase-9 molecules, which are thereby recruited into the apoptosome and activated. The mechanism of caspase-9 activation is not clear: it
probably results from dimerization and cleavage of adjacent caspase-9 proteins, but it might also depend on interactions between caspase-9 and
Apaf1. Once activated, caspase-9 cleaves and thereby activates downstream executioner caspases. Note that the CARD is related in structure
and function to the death effector domain of caspase-8 (see Figure 18-5). Some scientists use the term "apoptosome" to refer to the complex
containing caspase-9.

Figure 18-8 The three classes of Bcl2
family proteins. Note that the BH3 domain
is the only BH domain shared by all Bcl2
family members; it mediates the direct
interactions between pro-apoptotic and
anti-apoptotic family members.


consist of two subfamilies: the effector Bcl2 family proteins and the BH3-only
proteins. The main effector proteins are Bax and Bak, which are structurally similar to
Bcl2 but lack the BH4 domain. The BH3-only proteins share sequence homology
with Bcl2 in only the BH3 domain.

When an apoptotic stimulus triggers the intrinsic pathway, the pro-apoptotic
effector Bcl2 family proteins become activated and aggregate to form oligomers
in the mitochondrial outer membrane, inducing the release of cytochrome c and
other intermembrane proteins by an unknown mechanism (Figure 18-9). In
mammalian cells, Bax and Bak are the main effector Bcl2 family proteins, and
at least one of them is required for the intrinsic pathway of apoptosis to operate:
mutant mouse cells that lack both proteins are resistant to all pro-apoptotic
signals that normally activate this pathway. Whereas Bak is bound to the
mitochondrial outer membrane even in the absence of an apoptotic signal, Bax is mainly
located in the cytosol and translocates to the mitochondria only after an
apoptotic signal activates it. As we discuss below, the activation of Bax and Bak usually
depends on activated pro-apoptotic BH3-only proteins.

The anti-apoptotic Bcl2 family proteins such as Bcl2 itself and BclXL are also
located on the cytosolic surface of the outer mitochondrial membrane, where
they help prevent inappropriate release of intermembrane proteins. The
anti-apoptotic Bcl2 family proteins inhibit apoptosis mainly by binding to and
inhibiting pro-apoptotic Bcl2 family proteins, either on the mitochondrial membrane
or in the cytosol. On the outer mitochondrial membrane, for example, they bind
to Bak and prevent it from oligomerizing, thereby inhibiting the release of
cytochrome c and other intermembrane proteins. There are at least five mammalian
anti-apoptotic Bcl2 family proteins, and every mammalian cell requires at least
one to survive. Moreover, a number of these proteins must be inhibited for the
intrinsic pathway to induce apoptosis; the BH3-only proteins mediate the
inhibition.

The BH3-only proteins are the largest subclass of Bcl2 family proteins. The cell
either produces or activates them in response to an apoptotic stimulus, and they
are thought to promote apoptosis mainly by inhibiting anti-apoptotic Bcl2 family



Figure 18-9 The role of pro-apoptotic
effector Bcl2 family proteins (mainly Bax 
and Bak) in the release of mitochondrial 
intermembrane proteins in the intrinsic 
pathway of apoptosis. When activated 
by an apoptotic stimulus, the effector Bcl2 
family proteins aggregate on the outer 
mitochondrial membrane and release 
cytochrome c and other proteins from the 
intermembrane space into the cytosol by 
an unknown mechanism. 

proteins. Their BH3 domain binds to a long hydrophobic groove on
anti-apoptotic Bcl2 family proteins, neutralizing their activity. This binding and inhibition
enables the aggregation of Bax and Bak on the surface of mitochondria, which
triggers the release of the intermembrane mitochondrial proteins that induce
apoptosis (Figure 18-10). Some BH3-only proteins may bind directly to Bax and
Bak to help stimulate their aggregation.

BH3-only proteins provide the crucial link between apoptotic stimuli and the
intrinsic pathway of apoptosis, with different stimuli activating different
BH3-only proteins. Some extracellular survival signals, for example, block
apoptosis by inhibiting the synthesis or activity of certain BH3-only proteins (see
Figure 18-12B). Similarly, in response to DNA damage that cannot be repaired, the
tumor suppressor protein p53 accumulates (discussed in Chapters 17 and 20) and
activates the transcription of genes that encode the BH3-only proteins Puma and
Noxa. These BH3-only proteins then trigger the intrinsic pathway, thereby
eliminating a potentially dangerous cell that could otherwise become cancerous.

As mentioned earlier, in some cells the extrinsic apoptotic pathway recruits
the intrinsic pathway to amplify the caspase cascade to kill the cell. The BH3-only
protein Bid is the link between the two pathways. Bid is normally inactive.
However, when death receptors activate the extrinsic pathway in some cells, the
initiator caspase, caspase-8, cleaves Bid, producing an active form of Bid that
translocates to the outer mitochondrial membrane and inhibits anti-apoptotic Bcl2
family proteins, thereby amplifying the death signal.

Figure 18-10 How pro-apoptotic BH3-
only and anti-apoptotic Bcl2 family 
proteins regulate the intrinsic pathway 
of apoptosis. (A) In the absence of 
an apoptotic stimulus, anti-apoptotic 
Bcl2 family proteins bind to and inhibit 
the effector Bcl2 family proteins on the 
mitochondrial outer membrane (and in the 
cytosol—not shown). (B) In the presence of 
an apoptotic stimulus, BH3-only proteins 
are activated and bind to the anti-apoptotic 
Bcl2 family proteins so that they can 
no longer inhibit the effector Bcl2 family 
proteins; the latter then become activated, 
aggregate in the outer mitochondrial 
membrane, and promote the release of 
intermembrane mitochondrial proteins 
into the cytosol. Some activated BH3- 
only proteins may stimulate mitochondrial 
protein release more directly by binding 
to and activating the effector Bcl2 family 
proteins. Although not shown, the anti- 
apoptotic Bcl2 family proteins are bound to 
the mitochondrial surface. 
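
The logic of Figure 18-10 can be caricatured as a simple titration. The Python sketch below is only an illustration under strong assumptions that are not in the text: each protein class is represented by a single molecule count, binding is one-to-one and complete, BH3-only proteins act only by neutralizing anti-apoptotic proteins (direct activation of Bax and Bak is ignored), and any effector left unbound counts as free to aggregate. The function name and all numbers are hypothetical.

```python
def free_effectors(effector, anti_apoptotic, bh3_only):
    """Toy 1:1 titration; all quantities are arbitrary molecule counts (assumed)."""
    # Assumption: each active BH3-only protein neutralizes one anti-apoptotic protein.
    available_anti = max(0, anti_apoptotic - bh3_only)
    # Assumption: each remaining anti-apoptotic protein restrains one effector (Bax/Bak).
    return max(0, effector - available_anti)


# No apoptotic stimulus: no active BH3-only proteins, so effectors stay restrained.
print(free_effectors(effector=100, anti_apoptotic=120, bh3_only=0))
# Apoptotic stimulus: BH3-only proteins neutralize most anti-apoptotic proteins.
print(free_effectors(effector=100, anti_apoptotic=120, bh3_only=90))
```

In this cartoon the cell's fate depends on the balance of the three counts rather than on any one of them, which is the qualitative point of the figure; real binding affinities and partner specificities are not modeled.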

IAPs Help Control Caspases

Because activation of a caspase cascade leads to certain death, the cell employs 
multiple robust mechanisms to ensure that these proteases are activated only 
when appropriate. One line of defense is provided by a family of proteins called 
inhibitors of apoptosis (IAPs). These proteins were first identified in certain 
insect viruses (baculoviruses), which encode IAP proteins to prevent a host cell 
that is infected by the virus from killing itself by apoptosis. It is now known that 
most animal cells also make IAP proteins.

All IAPs have one or more BIR (baculovirus IAP repeat) domains, which enable
them to bind to and inhibit activated caspases. Some IAPs also polyubiquitylate
caspases, marking the caspases for destruction by proteasomes. In this way, the
IAPs set an inhibitory threshold that caspases must overcome to trigger apoptosis.

In Drosophila at least, the inhibitory barrier provided by IAPs can be
neutralized by anti-IAP proteins, which are produced in response to various apoptotic
stimuli. There are numerous anti-IAPs in flies, including Reaper, Grim, and Hid,
and their only structural similarity is their short, N-terminal, IAP-binding motif,
which binds to the BIR domain of IAPs, preventing the domain from binding to
a caspase. Deletion of the three genes encoding Reaper, Grim, and Hid blocks
apoptosis in flies. Conversely, inactivation of one of the two genes that encode
IAPs in Drosophila causes all of the cells in the developing fly embryo to undergo
apoptosis. Clearly, the balance between IAPs and anti-IAPs is tightly regulated
and is crucial for controlling apoptosis in the fly.

The role of mammalian IAP and anti-IAP proteins in apoptosis is less clear.
Anti-IAPs are released from the mitochondrial intermembrane space when the
intrinsic pathway of apoptosis is activated, blocking IAPs in the cytosol and thereby
promoting apoptosis. However, mice appear to develop normally if they are
missing either the major mammalian IAP (called XIAP) or the two known mammalian
anti-IAPs (called Smac/Diablo and Omi). Worms do not even contain a
caspase-inhibiting IAP protein. Apparently, the tight control of caspase activity is achieved
by different mechanisms in different animals.

Extracellular Survival Factors Inhibit Apoptosis in Various Ways 

Intercellular signals regulate most activities of animal cells, including apoptosis.
These extracellular signals are part of the normal "social" controls that ensure that
individual cells behave for the good of the organism as a whole: in this case, by
surviving when they are needed and killing themselves when they are not. Some
extracellular signal molecules stimulate apoptosis, whereas others inhibit it. We
have discussed signal proteins such as Fas ligand that activate death receptors
and thereby trigger the extrinsic pathway of apoptosis. Other extracellular signal
molecules that stimulate apoptosis are especially important during vertebrate
development: a surge of thyroid hormone in the bloodstream, for example, signals
cells in the tadpole tail to undergo apoptosis at metamorphosis. In mice, locally
produced signal proteins stimulate cells between developing fingers and toes to
kill themselves (see Figure 18-2). Here, however, we focus on extracellular signal
molecules that inhibit apoptosis, which are collectively called survival factors.

Most animal cells require continuous signaling from other cells to avoid
apoptosis. This surprising arrangement apparently helps ensure that cells survive only
when and where they are needed. Nerve cells, for example, are produced in excess
in the developing nervous system and then compete for limited amounts of
survival factors that are secreted by the target cells that they normally connect to (see
Figure 21-81). Nerve cells that receive enough survival signals live, while the
others die. In this way, the number of surviving neurons is automatically adjusted
so that it is appropriate for the number of target cells they connect with (Figure
18-11). A similar competition for limited amounts of survival factors produced by
neighboring cells is thought to control cell numbers in other tissues, both during
development and in adulthood.

Figure 18-11 The role of survival factors
and cell death in adjusting the number
of developing nerve cells to the amount
of target tissue. More nerve cells are
produced than can be supported by the
limited amount of survival factors released
by the target cells. Therefore, some nerve
cells receive an insufficient amount of
survival factors to avoid apoptosis. This
strategy of overproduction followed by
culling helps ensure that all target cells are
contacted by nerve cells and that the extra
nerve cells are automatically eliminated.
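
The overproduction-and-culling strategy of Figure 18-11 can be sketched as a toy simulation. This Python example is an illustration only, resting on assumptions that are not in the text: survival factor comes in discrete units, each target cell releases a fixed number of units, one unit is enough to keep one neuron alive, and units are captured by neurons at random. The function and parameter names are hypothetical.

```python
import random


def surviving_neurons(n_neurons, n_targets, units_per_target=1, seed=0):
    """Toy model of competition for survival factor (all parameters assumed).

    Returns the sorted indices of the neurons that capture a unit of
    survival factor; every other neuron is taken to die by apoptosis.
    """
    rng = random.Random(seed)
    neurons = list(range(n_neurons))
    rng.shuffle(neurons)  # random order stands in for random competition
    supply = n_targets * units_per_target  # total survival-factor units released
    # Only as many neurons as there are units survive (all survive if supply is ample).
    return sorted(neurons[:supply])


# 100 neurons compete for the factor released by 40 target cells.
survivors = surviving_neurons(n_neurons=100, n_targets=40)
print(len(survivors))
```

Whatever the size of the initial excess, the number of survivors in this sketch is set by the supply from the target cells, which mirrors the automatic matching of neuron number to target tissue described above.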


Survival factors usually bind to cell-surface receptors, which activate
intracellular signaling pathways that suppress the apoptotic program, often by regulating
members of the Bcl2 family of proteins. Some survival factors, for example,
stimulate the synthesis of anti-apoptotic Bcl2 family proteins such as Bcl2 itself or BclXL
(Figure 18-12A). Others act by inhibiting the function of pro-apoptotic BH3-only
proteins such as Bad (Figure 18-12B). In Drosophila, some survival factors
act by phosphorylating and inactivating anti-IAP proteins such as Hid, thereby
enabling IAP proteins to suppress apoptosis (Figure 18-12C). Some developing
neurons, like those illustrated in Figure 18-11, use an ingenious alternative
approach: survival-factor receptors stimulate apoptosis (by an unknown
mechanism) when they are not occupied, and then stop promoting death when survival
factor binds. The end result in all these cases is the same: cell survival depends on
survival factor binding.


Phagocytes Remove the Apoptotic Cell 

Apoptotic cell death is a remarkably tidy process: the apoptotic cell and its
fragments do not break open and release their contents, but instead remain intact
as they are efficiently eaten, or phagocytosed, by neighboring cells, leaving no
trace and therefore triggering no inflammatory response (see Figure 18-1B and
Movie 13.5). This engulfment process depends on chemical changes on the
surface of the apoptotic cell, which displays signals that recruit phagocytic cells. An
especially important change occurs in the distribution of the negatively charged
phospholipid phosphatidylserine on the cell surface. This phospholipid is
normally located exclusively in the inner leaflet of the lipid bilayer of the plasma
membrane (see Figure 10-15), but it flips to the outer leaflet in apoptotic cells. The


[Figure 18-12 panel labels: (A) increased production of anti-apoptotic Bcl2 family
protein; (B) inactivation of pro-apoptotic BH3-only protein; (C) inactivation of
anti-IAPs (active Hid, inactive Hid). Each panel shows a survival factor.]

Figure 18-12 Three ways that
extracellular survival factors can inhibit
apoptosis. (A) Some survival factors
suppress apoptosis by stimulating the
transcription of genes that encode
anti-apoptotic Bcl2 family proteins such as Bcl2
itself or BclXL. (B) Many others activate the
serine/threonine protein kinase Akt, which,
among many other targets, phosphorylates
and inactivates the pro-apoptotic BH3-only
protein Bad (see Figure 15-53). When not
phosphorylated, Bad promotes apoptosis
by binding to and inhibiting Bcl2; once
phosphorylated, Bad dissociates, freeing
Bcl2 to suppress apoptosis. Akt also
suppresses apoptosis by phosphorylating
and inactivating transcription regulatory
proteins that stimulate the transcription
of genes encoding proteins that promote
apoptosis (not shown). (C) In Drosophila,
some survival factors inhibit apoptosis by
stimulating the phosphorylation of the
anti-IAP protein Hid. When not phosphorylated,
Hid promotes cell death by inhibiting
IAPs. Once phosphorylated, Hid no longer
inhibits IAPs, which become active and
block apoptosis. MAP kinase,
mitogen-activated protein kinase.

underlying mechanism is poorly understood, but the external exposure of
phosphatidylserine is likely to depend on caspase cleavage of some protein involved in
phospholipid distribution in the membrane. A variety of soluble "bridging"
proteins interact with the exposed phosphatidylserine on the apoptotic cell. These
bridging proteins also interact with specific receptors on the surface of a
neighboring cell or macrophage, triggering cytoskeletal and other changes that initiate
the engulfment process.

Macrophages do not phagocytose healthy cells in the animal, despite the fact
that healthy cells normally expose some phosphatidylserine on their surfaces.
Healthy cells express signal proteins on their surface that interact with inhibitory
receptors on macrophages that block phagocytosis. Thus, in addition to
expressing cell-surface signals such as phosphatidylserine that stimulate phagocytosis,
apoptotic cells must lose or inactivate these "don't eat me" signals that block
phagocytosis.

Either Excessive or Insufficient Apoptosis Can Contribute to Disease

There are many human disorders in which excessive numbers of cells undergo
apoptosis and thereby contribute to tissue damage. Among the most dramatic
examples are heart attacks and strokes. In these acute conditions, many cells die
by necrosis as a result of ischemia (inadequate blood supply), but some of the
less affected cells die by apoptosis. It is hoped that, in the future, drugs that block
apoptosis (such as specific caspase inhibitors) will prove useful in saving such
cells.

There are other conditions where too few cells die by apoptosis. Mutations
in mice and humans, for example, that inactivate the genes that encode the Fas
death receptor or the Fas ligand prevent the normal death of some lymphocytes,
causing these cells to accumulate in excessive numbers in the spleen and lymph
glands. In many cases, this leads to autoimmune disease, in which the
lymphocytes react against the individual's own tissues.

Decreased apoptosis also makes an important contribution to many tumors, as
cancer cells often regulate their apoptotic program abnormally. The Bcl2 gene, for
example, was first identified in a common form of lymphocyte cancer in humans,
where a chromosome translocation causes excessive production of the Bcl2
protein; indeed, Bcl2 gets its name from this B cell lymphoma. The high level of Bcl2
protein in the lymphocytes that carry the translocation promotes the
development of cancer by inhibiting apoptosis, thereby prolonging lymphocyte survival
and increasing their number; it also decreases the cells' sensitivity to anticancer
drugs, which commonly work by causing cancer cells to undergo apoptosis.

Similarly, the gene encoding the tumor suppressor protein p53 is mutated in
about 50% of human cancers so that it no longer promotes apoptosis or cell-cycle
arrest in response to DNA damage. The lack of p53 function therefore enables the
cancer cells to survive and proliferate even when their DNA is damaged; in this
way, the cells accumulate more mutations, some of which make the cancer more
malignant (discussed in Chapter 20). As many anticancer drugs induce apoptosis
(and cell-cycle arrest) by a p53-dependent mechanism (discussed in Chapters 17
and 20), the loss of p53 function also makes cancer cells less sensitive to these
drugs.

If decreased apoptosis contributes to many cancers, then we might be able to
treat those cancers with drugs that stimulate apoptosis. This line of thinking has
recently led to the development of small chemicals that interfere with the function
of anti-apoptotic Bcl2 family proteins such as Bcl2 and BclXL. These chemicals
bind with high affinity to the hydrophobic groove on anti-apoptotic Bcl2 family
proteins, blocking their function in essentially the same way that BH3-only
proteins do (Figure 18-13). The intrinsic pathway of apoptosis is thereby stimulated,
which in certain tumors increases the amount of cell death.

Most human cancers arise in epithelial tissues such as those in the lung,
intestinal tract, breast, and prostate. Such cancer cells display many abnormalities in
their behavior, including a decreased ability to adhere to the extracellular matrix
and to one another at specialized cell-cell junctions. In the next chapter, we
discuss the remarkable structures and functions of the extracellular matrix and cell
junctions.

Summary 

Animal cells can activate an intracellular death program and kill themselves in
a controlled way when they are irreversibly damaged, no longer needed, or are
a threat to the organism. In most cases, these deaths occur by apoptosis: the cells
shrink, condense, and frequently fragment, and neighboring cells or macrophages
rapidly phagocytose the cells or fragments before there is any leakage of
cytoplasmic contents. Apoptosis is mediated by proteolytic enzymes called caspases, which
cleave specific intracellular proteins to help kill the cell. Caspases are present in
all nucleated animal cells as inactive precursors. Initiator caspases are activated
when brought into proximity in activation complexes: once activated, they cleave
and thereby activate downstream executioner caspases, which then cleave various
target proteins in the cell, producing an amplifying, irreversible proteolytic cascade.

Cells use at least two distinct pathways to activate initiator caspases and
trigger a caspase cascade leading to apoptosis: the extrinsic pathway is activated by
extracellular ligands binding to cell-surface death receptors; the intrinsic pathway
is activated by intracellular signals generated when cells are stressed. Each
pathway uses its own initiator caspases, which are activated in distinct activation
complexes: in the extrinsic pathway, the death receptors recruit caspase-8 via adaptor
proteins to form the DISC; in the intrinsic pathway, cytochrome c released from the
intermembrane space of mitochondria activates Apaf1, which assembles into an
apoptosome and recruits and activates caspase-9.

Intracellular Bcl2 family proteins and IAP proteins tightly regulate the apoptotic
program to ensure that cells kill themselves only when it benefits the animal. Both
anti-apoptotic and pro-apoptotic Bcl2 family proteins regulate the intrinsic
pathway by controlling the release of mitochondrial intermembrane proteins, while IAP
proteins inhibit activated caspases and promote their degradation.


Figure 18-13 How the chemical
ABT-737 inhibits anti-apoptotic Bcl2 family
proteins. As shown in Figure 18-10B,
an apoptotic signal results in activation of
BH3-only proteins, which interact with a
long hydrophobic groove in anti-apoptotic
Bcl2 family proteins, thereby preventing
them from blocking apoptosis. Using
the crystal structure of the groove, the
drug shown in (A), called ABT-737, was
designed and synthesized to bind tightly in
the groove, as shown for the anti-apoptotic
Bcl2 family protein, BclXL, in (B). By
inhibiting the activity of these proteins, the
drug promotes apoptosis in any cell that
depends on them for survival. (PDB code:
2YXJ.)


WHAT WE DON'T KNOW


• How many forms of programmed
cell death exist? What are the
underlying mechanisms and benefits
of each?

• Thousands of caspase substrates
have been identified. Which ones
are the critical proteins that must
be cleaved to trigger the major
cell remodeling events underlying
apoptosis?

• How did the intrinsic pathway of
apoptosis evolve, and what is the
advantage of having mitochondria
play such a central role in regulating
apoptosis?

• How are "don't eat me" signals
eliminated or inactivated during
apoptosis to allow the cells to be
phagocytosed?

PROBLEMS

Which statements are true? Explain why or why not. 

18-1 In normal adult tissues, cell death usually balances
cell division.

18-2 Mammalian cells that do not have cytochrome c
should be resistant to apoptosis induced by DNA damage.

Discuss the following problems. 

18-3 One important role of Fas and Fas ligand is to
mediate the elimination of tumor cells by killer lymphocytes.
In a study of 35 primary lung and colon tumors, half the
tumors were found to have amplified and overexpressed a
gene for a secreted protein that binds to Fas ligand. How do
you suppose that overexpression of this protein might
contribute to the survival of these tumor cells? Explain your
reasoning.

18-4 Development of the nematode Caenorhabditis
elegans generates exactly 959 somatic cells; it also produces
an additional 131 cells that are later eliminated by
apoptosis. Classical genetic experiments in C. elegans isolated
mutants that led to the identification of the first genes
involved in apoptosis. Of the many mutations affecting
apoptosis in the nematode, none have ever been found in
the gene for cytochrome c. Why do you suppose that such
a central effector molecule in apoptosis was not found in
the many genetic screens for "death" genes that have been
carried out in C. elegans?

18-5 Imagine that you could microinject cytochrome c
into the cytosol of wild-type mammalian cells and of cells
that were doubly defective for Bax and Bak. Would you
expect one, both, or neither type of cell to undergo
apoptosis? Explain your reasoning.

18-6 In contrast to their similar brain abnormalities,
newborn mice deficient in Apaf1 or caspase-9 have
distinctive abnormalities in their paws. Apaf1-deficient mice
fail to eliminate the webs between their developing digits,
whereas caspase-9-deficient mice have normally formed
digits (Figure Q18-1). If Apaf1 and caspase-9 function in
the same apoptotic pathway, how is it possible for these
deficient mice to differ in web-cell apoptosis?


[Figure Q18-1 panel labels: Apaf1 +/- and -/-; Casp9 +/+ and -/-]

Figure Q18-1 Appearance of paws in
Apaf1-/- and Casp9-/- newborn mice
relative to normal newborn mice (Problem
18-6). (From H. Yoshida et al., Cell 94:739-
750, 1998. With permission from Elsevier.)


18-7 When human cancer cells are exposed to
ultraviolet (UV) light at 90 mJ/cm², most of the cells undergo
apoptosis within 24 hours. Release of cytochrome c from
mitochondria can be detected as early as 6 hours after
exposure of a population of cells to UV light, and it
continues to increase for more than 10 hours thereafter. Does this
mean that individual cells slowly release their cytochrome
c over this time period? Or, alternatively, do individual
cells release their cytochrome c rapidly but with different
cells being triggered over the longer time period?

To answer this fundamental question, you have
fused the gene for green fluorescent protein (GFP) to the
gene for cytochrome c, so that you can observe the
behavior of individual cells by confocal fluorescence microscopy.
In cells that are expressing the cytochrome c-GFP fusion,
fluorescence shows the punctate pattern typical of
mitochondrial proteins. You then irradiate these cells with UV
light and observe individual cells for changes in the
punctate pattern. Two such cells (outlined in white) are shown
in Figure Q18-2A and B. Release of cytochrome c-GFP is
detected as a change from a punctate to a diffuse pattern
of fluorescence. Times after UV exposure are indicated as
hours:minutes below the individual panels.

Which model for cytochrome c release do these
observations support? Explain your reasoning.

[Figure Q18-2 panel time stamps (hours:minutes): (A) 10:09, 10:15; (B) 17:10, 17:18]

Figure Q18-2 Time-lapse video fluorescence microscopic analysis
of cytochrome c-GFP release from mitochondria of individual cells
(Problem 18-7). (A) Cells observed for 6 minutes, 10 hours after
UV irradiation. (B) Cells observed for 8 minutes, 17 hours after UV
irradiation. One cell in (A) and one in (B), each outlined in white,
have released their cytochrome c-GFP during the time frame of the
observation, which is shown as hours:minutes below each panel. (From
J.C. Goldstein et al., Nat. Cell Biol. 2:156-162, 2000. With permission
from Macmillan Publishers Ltd.)

18-8 Fas ligand is a trimeric, extracellular protein that
binds to its receptor, Fas, which is composed of three
identical transmembrane subunits (Figure Q18-3). The
binding of Fas ligand alters the conformation of Fas so that it
binds an adaptor protein, which then recruits and
activates caspase-8, triggering a caspase cascade that leads to
cell death. In humans, the autoimmune lymphoproliferative
syndrome (ALPS) is associated with dominant
mutations in Fas that include point mutations and C-terminal
truncations. In individuals that are heterozygous for such
mutations, lymphocytes do not die at their normal rate and
accumulate in abnormally large numbers, causing a
variety of clinical problems. In contrast to these patients,
individuals that are heterozygous for mutations that eliminate
Fas expression entirely have no clinical symptoms.

A. Assuming that the normal and dominant forms 
of Fas are expressed to the same level and bind Fas ligand 
equally, what fraction of Fas-Fas ligand complexes on a 
lymphocyte from a heterozygous ALPS patient would be 
expected to be composed entirely of normal Fas subunits? 

B. In an individual heterozygous for a mutation that 
eliminates Fas expression, what fraction of Fas-Fas ligand
complexes would be expected to be composed entirely of 
normal Fas subunits? 

C. Why are the Fas mutations that are associated with 
ALPS dominant, while those that eliminate expression of 
Fas are recessive?



Figure Q18-3 The binding of trimeric Fas ligand to Fas (Problem 18-8). 

CELLS IN THEIR SOCIAL CONTEXT


CHAPTER 19

Cell Junctions and the Extracellular Matrix


Of all the social interactions between cells in a multicellular organism, the most
fundamental are those that hold the cells together. Cells may be linked by direct
interactions, or they may be held together within the extracellular matrix, a
complex network of proteins and polysaccharide chains that the cells secrete. By one
means or another, cells must cohere if they are to form an organized multicellular
structure that can withstand and respond to the various external forces that try to
pull it apart.

The mechanisms of cohesion govern the architecture of the body: its shape,
its strength, and the arrangement of its different cell types. The making and breaking
of the attachments between cells and the modeling of the extracellular matrix
govern the way cells move within the organism, guiding them as the body grows,
develops, and repairs itself. Attachments to other cells and to extracellular matrix
control the orientation and behavior of the cell's cytoskeleton, thereby allowing
cells to sense and respond to changes in the mechanical features of their environment.
Thus, the apparatus of cell junctions and the extracellular matrix is critical
for every aspect of the organization, function, and dynamics of multicellular
structures. Defects in this apparatus underlie an enormous variety of diseases.

The key features of cell junctions and the extracellular matrix are best illustrated
by considering two broad categories of tissues that are found in all animals
(Figure 19-1). Connective tissues, such as bone or tendon, are formed from an
extracellular matrix produced by cells that are distributed sparsely in the matrix.
It is the matrix, rather than the cells, that bears most of the mechanical stress to
which the tissue is subjected. Direct attachments between one cell and another
are relatively rare, but the cells have important attachments to the matrix. These
cell-matrix junctions link the cytoskeleton to the matrix, allowing the cells to move
through the matrix and monitor changes in its mechanical properties.

In epithelial tissues, such as the lining of the gut or the epidermal covering of
the skin, cells are tightly bound together into sheets called epithelia. The extracellular
matrix is less pronounced, consisting mainly of a thin mat called the basal
lamina (or basement membrane) underlying the sheet. Within the epithelium,
cells are attached to each other directly by cell-cell junctions, where cytoskeletal
filaments are anchored, transmitting stresses across the interiors of the cells, from
adhesion site to adhesion site. The cytoskeleton of epithelial cells is also linked to
the basal lamina through cell-matrix junctions.

[Figure 19-1 labels: collagen fibers; mechanical stresses are transmitted from
cell to cell by cytoskeletal filaments anchored to cell-matrix and cell-cell
adhesion sites; extracellular matrix directly bears mechanical stresses of
tension and compression]

Figure 19-2 provides a closer view of epithelial cells to illustrate the major
types of cell-cell and cell-matrix junctions that we will discuss in this chapter. The
diagram shows the typical arrangement of junctions in a simple columnar epithelium
such as the lining of the small intestine of a vertebrate. Here, a single layer
of tall cells stands on a basal lamina, with the cells' uppermost surface, or apex,
free and exposed to the extracellular medium. On their sides, or lateral surfaces,
the cells make junctions with one another. Two types of anchoring junctions link
the cytoskeletons of adjacent cells: adherens junctions are anchorage sites for
actin filaments; desmosomes are anchorage sites for intermediate filaments. Two
additional types of anchoring junctions link the cytoskeleton of the epithelial cells
to the basal lamina: actin-linked cell-matrix junctions anchor actin filaments to
the matrix, while hemidesmosomes anchor intermediate filaments to it.


Figure 19-1 Two main ways in which
animal cells are bound together. In
connective tissue, the main stress-bearing
component is the extracellular matrix. In
epithelial tissue, it is the cytoskeletons
of the cells themselves, linked from cell
to cell by adhesive junctions. Cell-matrix
attachments bond epithelial tissue to the
connective tissue beneath it.



tight junction seals gap between epithelial cells

adherens junction connects actin filament bundle in one cell with that in the next cell

desmosome connects intermediate filaments in one cell to those in the next cell

gap junction allows the passage of small water-soluble molecules from cell to cell

actin-linked cell-matrix junction anchors actin filaments in cell to extracellular matrix

hemidesmosome anchors intermediate filaments in a cell to extracellular matrix

Figure 19-2 A summary of the various cell junctions found in a vertebrate epithelial cell, classified according to their primary functions.

In the most apical portion of the cell, the relative positions of the junctions are the same in nearly all vertebrate epithelia. The tight junction occupies
the most apical position, followed by the adherens junction (adhesion belt) and then by a special parallel row of desmosomes; together these form
a structure called a junctional complex. Gap junctions and additional desmosomes are less regularly organized. Two types of cell-matrix anchoring
junctions tether the basal surface of the cell to the basal lamina. The drawing is based on epithelial cells of the small intestine.



Figure 19-3 Transmembrane adhesion 
proteins link the cytoskeleton to 
extracellular structures. The external 
linkage may be either to other cells (cell-cell 
junctions, mediated typically by cadherins) 
or to extracellular matrix (cell-matrix 
junctions, mediated typically by integrins). 
The internal linkage to the cytoskeleton is
generally indirect, via intracellular adaptor 
proteins, to be discussed later. 


Two other types of cell-cell junction are shown in Figure 19-2. Tight junctions 
hold the cells closely together near the apex, sealing the gap between the cells and 
thereby preventing molecules from leaking across the epithelium. Near the basal
end of the cells are channel-forming junctions, called gap junctions, that create
passageways linking the cytoplasms of adjacent cells. 

Each of the four major anchoring junction types depends on transmembrane
adhesion proteins that span the plasma membrane, with one end linking to the
cytoskeleton inside the cell and the other end linking to other structures outside it
(Figure 19-3). These cytoskeleton-linked transmembrane proteins fall neatly into
two superfamilies, corresponding to the two basic kinds of external attachment.
Proteins of the cadherin superfamily chiefly mediate attachment of cell to cell
(Movie 19.1). Proteins of the integrin superfamily chiefly mediate attachment of
cells to matrix. There is specialization within each family: some cadherins link to
actin and form adherens junctions, while others link to intermediate filaments
and form desmosomes; likewise, some integrins link to actin and form actin-linked
cell-matrix junctions, while others link to intermediate filaments and form
hemidesmosomes (Table 19-1).


Table 19-1 Anchoring Junctions

| Junction | Transmembrane adhesion protein | Extracellular ligand | Intracellular cytoskeletal attachment | Intracellular adaptor proteins |
|---|---|---|---|---|
| Cell-Cell | | | | |
| Adherens junction | Classical cadherins | Classical cadherin on neighboring cell | Actin filaments | α-Catenin, β-catenin, plakoglobin (γ-catenin), p120-catenin, vinculin |
| Desmosome | Nonclassical cadherins (desmoglein, desmocollin) | Desmoglein and desmocollin on neighboring cell | Intermediate filaments | Plakoglobin (γ-catenin), plakophilin, desmoplakin |
| Cell-Matrix | | | | |
| Actin-linked cell-matrix junction | Integrin | Extracellular matrix proteins | Actin filaments | Talin, kindlin, vinculin, paxillin, focal adhesion kinase (FAK), numerous others |
| Hemidesmosome | α6β4 Integrin, type XVII collagen | | | |

There are some exceptions to these rules. Some integrins, for example, mediate
cell-cell rather than cell-matrix attachment. Moreover, there are other types
of cell adhesion molecules that can provide transient cell-cell attachments more
flimsy than anchoring junctions, but sufficient to stick cells together in special
circumstances.

We begin the chapter with a discussion of the major forms of cell-cell junctions.
We then consider in turn the extracellular matrix of animals, the structure
and function of integrin-mediated cell-matrix junctions, and, finally, the plant
cell wall, a special form of extracellular matrix.

CELL-CELL JUNCTIONS 

Cell-cell junctions come in many forms and can be regulated by a variety of
mechanisms. The best understood and most common are the two types of cell-cell
anchoring junctions, which employ cadherins to link the cytoskeleton of one
cell with that of its neighbor. Their primary function is to resist the external forces
that pull cells apart. The epithelial cells of your skin, for example, must remain
tightly linked when they are stretched, pinched, or poked. Cell-cell anchoring
junctions must also be dynamic and adaptable, so that they can be altered or rearranged
when tissues are remodeled or repaired, or when there are changes in the
forces acting on them.

In this section, we focus primarily on the cadherin-based anchoring junctions. 
We then briefly describe tight junctions and gap junctions. Finally, we consider 
the more transient cell-cell adhesion mechanisms employed by some cells in the 
bloodstream. 

Cadherins Form a Diverse Family of Adhesion Molecules

Cadherins are present in all multicellular animals whose genomes have been
analyzed. They are also present in the choanoflagellates, which can exist either as
free-living unicellular organisms or as multicellular colonies and are thought to
be representatives of the group of protists from which all animals evolved. Other
eukaryotes, including fungi and plants, lack cadherins, and they are also absent
from bacteria and archaea. Cadherins therefore seem to be part of the essence of
what it is to be an animal.

The cadherins take their name from their dependence on Ca²⁺ ions: removing
Ca²⁺ from the extracellular medium causes adhesions mediated by cadherins to
come apart. The first three cadherins to be discovered were named according to
the main tissues in which they were found: E-cadherin is present on many types
of epithelial cells; N-cadherin on nerve, muscle, and lens cells; and P-cadherin on
cells in the placenta and epidermis. All are also found in other tissues. These and
other classical cadherins are closely related in sequence throughout their extracellular
and intracellular domains.

There are also a large number of nonclassical cadherins that are more distantly
related in sequence, with more than 50 expressed in the brain alone. The
nonclassical cadherins include proteins with known adhesive function, such as
the diverse protocadherins found in the brain, and the desmocollins and desmogleins
that form desmosomes (see Table 19-1). Other family members are involved
primarily in signaling. Together, the classical and nonclassical cadherin proteins
constitute the cadherin superfamily (Figure 19-4), with more than 180 members
in humans.

Cadherins Mediate Homophilic Adhesion

Anchoring junctions between cells are usually symmetrical: if the linkage is to
actin in the cell on one side of the junction, it will be to actin in the cell on the
other side. In fact, the binding between cadherins is generally homophilic (like-to-like,
Figure 19-5): cadherin molecules of a specific subtype on one cell bind to
cadherin molecules of the same or closely related subtype on adjacent cells.

The spacing between the cell membranes at an anchoring junction is precisely
defined and depends on the structure of the participating cadherin molecules. All
the members of the superfamily, by definition, have an extracellular portion consisting
of several copies of the extracellular cadherin (EC) domain. Homophilic
binding occurs at the N-terminal tips of the cadherin molecules, that is, at the cadherin
domains that lie furthest from the membrane. These terminal domains each form
a knob and a nearby pocket, and the cadherin molecules protruding from opposite
cell membranes bind by insertion of the knob of one domain into the pocket
of the other (Figure 19-6A).

Each cadherin domain forms a more-or-less rigid unit, joined to the next cadherin
domain by a hinge. Ca²⁺ ions bind to sites near each hinge and prevent it
from flexing, so that the whole string of cadherin domains behaves as a rigid and
slightly curved rod. When Ca²⁺ is removed, the hinges can flex, and the structure
becomes floppy (Figure 19-6B). At the same time, the conformation at the N-terminus
is thought to change slightly, weakening the binding affinity for the matching
cadherin molecule on the opposite cell.

Unlike receptors for soluble signal molecules, which bind their specific ligand
with high affinity, cadherins (and most other cell-cell adhesion proteins) typically
bind to their partners with relatively low affinity. Strong attachments result
from the formation of many such weak bonds in parallel. When binding to oppositely
oriented partners on another cell, cadherin molecules are often clustered
side-to-side with many other cadherin molecules on the same cell (Figure 19-6C).
The strength of this junction is far greater than that of any individual intermolecular
bond, and yet regulatory mechanisms can easily disassemble the junction by
separating the molecules sequentially, just as two pieces of fabric can be joined
strongly by Velcro and yet easily peeled apart from the sides. A similar “Velcro
principle” also operates at cell-cell and cell-matrix adhesions formed by other
types of transmembrane adhesion proteins.
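
The arithmetic behind the Velcro principle can be sketched with a toy calculation. The sketch below assumes, purely for illustration, that each weak bond is independently unbound a fixed fraction of the time; the function name and the value 0.5 are hypothetical and are not measurements of cadherin binding. Under that assumption, the chance that every bond in a cluster is unbound at the same instant (so that the cells could separate all at once) falls off exponentially with the number of bonds, whereas breaking the bonds one at a time, as in peeling, never requires that coincidence.

```python
def prob_all_unbound(p_unbound: float, n_bonds: int) -> float:
    """Probability that all n independent bonds are unbound at the same moment.

    p_unbound is the (assumed) fraction of time a single weak bond is unbound.
    Independence between bonds is a simplifying assumption of this sketch.
    """
    if not 0.0 <= p_unbound <= 1.0:
        raise ValueError("p_unbound must be between 0 and 1")
    if n_bonds < 0:
        raise ValueError("n_bonds must be non-negative")
    return p_unbound ** n_bonds


if __name__ == "__main__":
    # Illustrative value only: each bond is unbound half of the time.
    for n in (1, 10, 100):
        print(n, prob_all_unbound(0.5, n))
```

Even with individually weak bonds, a cluster of many bonds in parallel is therefore very unlikely to release spontaneously all at once, which is consistent with the qualitative argument in the text.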


Figure 19-4 The cadherin superfamily.
The diagram shows some of the diversity
among cadherin superfamily members.
These proteins all have extracellular
portions containing multiple copies of the
extracellular cadherin domain (green ovals).
In the classical cadherins of vertebrates
there are 5 of these domains, and in
desmogleins and desmocollins there are
4 or 5, but some nonclassical cadherins
have more than 30. The intracellular
portions are more varied, reflecting
interactions with a wide variety of
intracellular ligands, including signaling
molecules and adaptor proteins
that connect the cadherin to the
cytoskeleton. In some cases, such as
T-cadherin, a transmembrane domain
is not present and the protein is
attached to the plasma membrane by a
glycosylphosphatidylinositol (GPI) anchor.
The differently colored motifs in Fat,
Flamingo, and Ret represent conserved
domains that are also found in other protein
families.





Figure 19-5 Homophilic versus 
heterophilic binding. Cadherins in general 
bind homophilically; some other cell 
adhesion molecules, discussed later, bind
heterophilically.

Figure 19-6 Cadherin structure and function. (A) The extracellular region of
a classical cadherin contains five copies of the extracellular cadherin domain
(see Figure 19-4) separated by flexible hinge regions. Ca²⁺ ions (red dots) bind
in the neighborhood of each hinge, preventing it from flexing. As a result, the
extracellular region forms a rigid, curved structure as shown here. To generate
cell-cell adhesion, the cadherin domain at the N-terminal tip of one cadherin
molecule binds the N-terminal domain from a cadherin molecule on another cell.
The structure was determined by x-ray diffraction of the crystallized C-cadherin
extracellular region. (B) In the absence of Ca²⁺, increased flexibility in the hinge
regions results in a floppier molecule that is no longer oriented correctly to
interact with a cadherin on another cell, and adhesion fails. (C) At a typical
cell-cell junction, an organized array of cadherin molecules functions like Velcro
to hold cells together. Cadherins on the same cell are thought to be coupled by
side-to-side interactions between their N-terminal head regions, resulting in a
linear array like the alternating green and light green cadherins on the lower cell
shown here. These arrays are thought to interact with similar linear arrays on an
adjacent cell (blue cadherin molecules, top cell). The linear arrays on one cell are
perpendicular to those on the other cell, as indicated by the red arrows. Multiple
perpendicular arrays on both cells interact to form a tight-knit mat of cadherin
proteins. (A, based on T.J. Boggon et al., Science 296:1308-1313, 2002;
C, based on O.J. Harrison et al., Structure 19:244-256, 2011.)


Cadherin-Dependent Cell-Cell Adhesion Guides the Organization
of Developing Tissues 


Cadherins form specific homophilic attachments, explaining why there are so
many different family members. Cadherins are not like glue, making cell surfaces
generally sticky. Rather, they mediate highly selective recognition, enabling cells
of a similar type to stick together and to stay segregated from other types of cells.

Selectivity in the way that animal cells consort with one another was first demonstrated
in the 1950s, long before the discovery of cadherins, in experiments in
which amphibian embryos were dissociated into single cells. These cells were then
mixed up and allowed to reassociate. Remarkably, the dissociated cells often reassembled
into structures resembling those of the original embryo (Figure 19-7).
These experiments, together with numerous more recent experiments, reveal that
selective cell-cell recognition systems make cells of the same differentiated tissue
preferentially adhere to one another.



Figure 19-7 Sorting out. Cells from 
different layers of an early amphibian 
embryo will sort out according to their 
origins. In the classical experiment shown 
here, mesoderm cells (green), neural plate 
cells (blue), and epidermal cells (red) have
been disaggregated and then reaggregated 
in a random mixture. They sort out into 
an arrangement reminiscent of a normal 
embryo, with a “neural tube” internally, 
epidermis externally, and mesoderm in 
between. (Modified from P.L. Townes and 
J. Holtfreter, J. Exp. Zool. 128:53-120,
1955. With permission from Wiley-Liss.)

Figure 19-8 Changing patterns of
cadherin expression during construction
of the vertebrate nervous system. The
figure shows cross sections of the early
chick embryo, as the neural tube detaches
from the ectoderm and then as neural
crest cells detach from the neural tube.
(A, B) Immunofluorescence micrographs
showing the developing neural tube labeled
with antibodies against (A) E-cadherin (blue)
and (B) N-cadherin (yellow). (C) As the
patterns of gene expression change, the
different groups of cells segregate from one
another according to the cadherins they
express. (Micrographs courtesy of Miwako
Nomura and Masatoshi Takeichi.)

Cadherins play a crucial part in these cell-sorting processes during development.
The appearance and disappearance of specific cadherins correlate with
steps in embryonic development where cells regroup and change their contacts
to create new tissue structures. In the vertebrate embryo, for example, changes in
cadherin expression are seen when the neural tube forms and pinches off from
the overlying ectoderm: neural tube cells lose E-cadherin and acquire other cadherins,
including N-cadherin, while the cells in the overlying ectoderm continue
to express E-cadherin (Figure 19-8A and B). Then, when the neural crest cells
migrate away from the neural tube, these cadherins become scarcely detectable,
and another cadherin (cadherin 7) appears that helps hold the migrating cells
together as loosely associated cell groups (Figure 19-8C). Finally, when the cells
aggregate to form a ganglion, they switch on expression of N-cadherin again. If
N-cadherin is artificially overexpressed in the emerging neural crest cells, the cells
fail to escape from the neural tube.

Studies with cultured cells further support the idea that the homophilic binding
of cadherins controls these processes of tissue segregation. In a line of cultured
fibroblasts called L cells, for example, cadherins are not expressed and the cells
do not adhere to one another. When these cells are transfected with DNA encoding
E-cadherin, E-cadherins on one cell bind to E-cadherins on another, resulting
in cell-cell adhesion. If L cells expressing different cadherins are mixed together,
they sort out and aggregate separately, indicating that different cadherins preferentially
bind to their own type (Figure 19-9A), mimicking what happens when
cells derived from tissues that express different cadherins are mixed together. A
similar segregation of cells occurs if L cells expressing different amounts of the
same cadherin are mixed together (Figure 19-9B). It therefore seems likely that
both qualitative and quantitative differences in the expression of cadherins have
a role in organizing tissues.

Figure 19-9 Cadherin-dependent cell
sorting. Cells in culture can sort themselves
out according to the type and level of
cadherins they express. This can be
visualized by labeling different populations of
cells with dyes of different colors. (A) Cells
expressing N-cadherin sort out from cells
expressing E-cadherin. (B) Cells expressing
high levels of E-cadherin sort out from cells
expressing low levels of E-cadherin. The
cells expressing high levels adhere more
strongly and end up internally.

Epithelial-Mesenchymal Transitions Depend on Control of
Cadherins

The assembly of cells into an epithelium is a reversible process. By switching on
expression of adhesion molecules, dispersed unattached mesenchymal cells, such
as fibroblasts, can come together to form an epithelium. Conversely, epithelial
cells can change their character, disassemble, and migrate away from their parent
epithelium as separate cells. Such epithelial-mesenchymal transitions play an
important part in normal embryonic development; the origin of the neural crest
is one example. These transitions depend in part on transcription regulatory proteins
called Slug, Snail, and Twist. Increased expression of Twist, for example,
converts epithelial cells to a mesenchymal character, and switching it off does the
opposite. Twist exerts its effects, in part, by inhibiting expression of cadherins,
including E-cadherin, that hold epithelial cells together.

Epithelial-mesenchymal transitions also occur as pathological events during
adult life, in cancer. Most cancers originate in epithelia, but become dangerously
prone to spread (that is, malignant) only when the cancer cells escape from the
epithelium of origin and invade other tissues. Experiments with malignant breast
cancer cells in culture show that blocking expression of Twist can convert the cells
back toward a nonmalignant character. Conversely, by forcing Twist expression,
one can make normal epithelial cells undergo an epithelial-mesenchymal transition
and behave like malignant cells. Mutations that disrupt the production or
function of E-cadherin are often found in cancer cells and are thought to help
make them malignant.

Catenins Link Classical Cadherins to the Actin Cytoskeleton 

The extracellular domains of cadherins mediate homophilic binding at adherens
junctions. The intracellular domains of typical cadherins, including all classical
and some nonclassical ones, interact with filaments of the cytoskeleton: actin at
adherens junctions and intermediate filaments at desmosomes (see Table 19-1).
These cytoskeletal linkages are essential for efficient cell-cell adhesion, as cadherins
that lack their cytoplasmic domains cannot stably hold cells together.

The linkage of cadherins to the cytoskeleton is indirect and depends on adaptor
proteins that assemble on the cytoplasmic tail of the cadherin. At adherens
junctions, the cadherin tail binds two such proteins: β-catenin and a distant relative
called p120-catenin; a third protein called α-catenin interacts with β-catenin
and recruits a variety of other proteins to provide a dynamic linkage to actin
filaments (Figure 19-10). At desmosomes, cadherins are linked to intermediate
filaments through other adaptor proteins, including a β-catenin-related protein
called plakoglobin, as we discuss later.

In their mature form, adherens junctions are enormous protein complexes
containing hundreds to thousands of cadherin molecules, packed into dense, regular
arrays that are linked on the extracellular side by lateral interactions between
cadherin domains, as we discussed earlier (see Figure 19-6C). On the cytoplasmic
side, a complex network of catenins, actin regulators, and contractile actin bundles
holds the cluster of cadherins together and links it to the actin cytoskeleton.
Assembling a structure of this complexity is not a simple task, and it involves a
complex sequence of events controlled by the actin-regulatory proteins discussed
in Chapter 16. The general features of the assembly process are summarized in
Figure 19-11.

Adherens Junctions Respond to Forces Generated by the Actin 
Cytoskeleton 

Most adherens junctions are linked to contractile bundles of actin filaments and
non-muscle myosin II. These junctions are therefore subjected to pulling forces
generated by the attached actin. The pulling forces are important for junction
assembly and maintenance: disruption of myosin activity, for example, results in
the disassembly of many adherens junctions. Furthermore, the contractile forces
acting on a junction in one cell are balanced by contractile forces at the junction
of the opposite cell, so that no cell pulls others toward it and thereby disrupts the
uniform distribution of cells in the tissue.

Figure 19-10 The linkage of classical
cadherins to actin filaments. The
cadherins are coupled indirectly to actin
filaments through an adaptor protein
complex containing p120-catenin,
β-catenin, and α-catenin. Other proteins,
including vinculin, associate with α-catenin
and help provide the linkage to actin.
β-Catenin has a second, and very
important, function in intracellular signaling,
as we discuss in Chapter 15 (see Figure
15-60). For clarity, this diagram does not
show the cadherin of the adjacent cell in
the junction.

We do not understand the mechanisms responsible for maintaining this balance.
Adherens junctions seem to sense the forces acting on them and modify
local actin and myosin behavior to balance the forces on both sides of the junction.
Evidence for these mechanisms comes from studies of pairs of cultured
mammalian cells connected by adherens junctions. If contractile activity in one
cell is increased experimentally, the adherens junctions linking the two cells
increase in size, and the contractile activity of the second cell increases to match
that of the first, resulting in a balance of forces across the junction. These and
other experiments suggest that adherens junctions are not simply passive sites
of protein-protein binding but are dynamic tension sensors that regulate their
behavior in response to changing mechanical conditions. This ability to transduce
a mechanical signal into a change in junctional behavior is an example of mechanotransduction.
We will see later that it is also important at cell-matrix junctions.

The mechanotransduction at cell-cell junctions is thought to depend, at least
in part, on proteins in the cadherin complex that alter their shape when stretched
by tension. The protein α-catenin, for example, is stretched from a folded to an
extended conformation when contractile activity increases at the junction. The
unfolding exposes a cryptic binding site for another protein, vinculin, which promotes
the recruitment of more actin to the junction (Figure 19-12). By mechanisms
such as this, pulling on a junction makes it stronger. Furthermore, as noted
above, pulling on a junction in one cell will increase the contractile force generated
in the attached cell.

In some cell types, actin contractility reduces cell-cell adhesion, particularly if
large forces are involved. Large actin-based contractile forces might, in some tissues,
pull sufficiently hard on the edges of cell-cell adhesions to peel them apart,
particularly if contraction is coupled to additional regulatory mechanisms that
weaken the adhesion. This mechanism might be important in certain forms of
tissue remodeling during development, as we describe next.

Tissue Remodeling Depends on the Coordination of Actin-Mediated
Contraction With Cell-Cell Adhesion

Adherens junctions are an essential part of the machinery for modeling the shapes
of multicellular structures in the animal body. By indirectly linking the actin filaments
in one cell to those in its neighbors, they enable the cells in the tissue to use
their actin cytoskeletons in a coordinated way.

Adherens junctions occur in various forms. In many nonepithelial tissues, 
they appear as small punctate or linear attachments that connect the cortical 


Figure 19-11 Assembly of an adherens
junction. (A) Assembly begins when
two unattached epithelial cell precursors
explore their surroundings with membrane
protrusions, generated by local nucleation
of actin networks. When the cells make
contact, small cadherin and catenin
clusters take shape at the contact sites and
associate with actin, leading to activation
of the small monomeric GTPase Rac (not
shown), an important actin regulator (see
Figure 16-85). (B) Rac promotes additional
actin protrusions in the vicinity, expanding
the size of the contact zone and thereby
promoting further recruitment of cadherins
and their associated catenin proteins.
(C) Eventually, Rac is inactivated and
replaced by the related GTPase Rho (not
shown), which shifts actin remodeling
toward the assembly of linear, contractile
filament bundles. Rho also promotes
the assembly of myosin II filaments that
associate with bundles of actin filaments to
generate contractile activity. This contractile
activity generates tension that stimulates
further actin recruitment and expansion
of the junction, in part through the
mechanisms illustrated in Figure 19-12.

(A) NO TENSION (B) TENSION



Figure 19-12 Mechanotransduction
in an adherens junction. (A) Cell-cell
junctions are able to sense increased
tension and respond by strengthening
their actin linkages. Tension sensing is
thought to depend in part on α-catenin
(see Figure 19-10). (B) When actin
filaments are pulled from within the cell by
non-muscle myosin II, the resulting force
unfolds a domain in α-catenin, thereby
exposing an otherwise hidden binding site
for the adaptor protein vinculin. Vinculin
then promotes additional actin recruitment,
strengthening the linkages between the
junction and the cytoskeleton.


actin filaments beneath the plasma membranes of two interacting cells. In heart
muscle, they anchor the actin bundles of the contractile apparatus and act in parallel
with desmosomes to link the contractile cells end-to-end. But the prototypical
examples of adherens junctions occur in epithelia, where they often form a
continuous adhesion belt (or zonula adherens) just beneath the apical face of the
epithelium, encircling each of the interacting cells in the sheet (Figure 19-13).
Within each cell, a contractile bundle of actin filaments and myosin II lies adjacent
to the adhesion belt, oriented parallel to the plasma membrane and tethered to

Figure 19-13 Adherens junctions
between epithelial cells in the small
intestine. These cells are specialized
for absorption of nutrients; at their apex,
facing the lumen of the gut, they have
many microvilli (protrusions that increase 
the absorptive surface area). The adherens 
junction takes the form of an adhesion 
belt, encircling each of the interacting cells. 
Its most obvious feature is a contractile 
bundle of actin filaments running along 
the cytoplasmic surface of the junctional 
plasma membrane. The actin filament 
bundles are tethered by intracellular 
proteins to cadherins, which bind to 
cadherins on the adjacent cell. In this way, 
the actin filament bundles in adjacent cells 
are tied together. For clarity, this drawing 
does not show most of the other cell-cell
and cell-matrix junctions of epithelial cells
(see Figure 19-2).

Figure 19-14 The folding of an epithelial
sheet to form an epithelial tube. The
oriented contraction of the bundles of
actin and myosin filaments running along
adhesion belts causes the epithelial cells to
narrow at their apex and helps the epithelial
sheet to roll up into a tube. An example
is the formation of the neural tube in early
vertebrate development (see Figure 19-8).






it by the cadherins and their associated intracellular adaptor proteins. The actin-myosin
bundles are thus linked, via the cadherins, into an extensive transcellular
network. Coordinated contraction of this network provides the motile force for
a fundamental process in animal morphogenesis—the folding of epithelial cell
sheets into tubes, vesicles, and other related structures (Figure 19-14).

The coordination of cell-cell adhesion and actin contractility is beautifully
illustrated by cellular rearrangements that occur early in the development of the
fruit fly Drosophila melanogaster. Soon after gastrulation, the outer epithelium
of the embryo is elongated by a process called germ-band extension, in which the
cells converge inward toward the dorsal-ventral axis and extend along the
anterior-posterior axis (Figure 19-15). Actin-dependent contraction along specific
cell boundaries is coordinated with a loss of specific adherens junctions to allow
cells to insert themselves between other cells (a process called intercalation),
resulting in a longer and narrower epithelium. The mechanisms underlying the
loss of adhesion along specific cell boundaries are not clear, but they depend in
part on increased degradation of β-catenin, due to its phosphorylation by a
protein kinase that is localized specifically at those boundaries.

Desmosomes Give Epithelia Mechanical Strength 

Desmosomes are structurally similar to adherens junctions but contain specialized
cadherins that link to intermediate filaments instead of actin filaments. Their
main function is to provide mechanical strength. Desmosomes are important






Figure 19-15 Remodeling of cell-cell 
adhesions in embryonic Drosophila 
epithelium. Depicted at left is a group of 
cells in the outer epithelium of a Drosophila 
embryo. During germ-band extension, cells 
converge toward each other (middle) on 
the dorsal-ventral axis and then extend 
(right) along the anterior-posterior axis. 

The result is intercalation: cells that were
originally far apart along the dorsal-ventral
axis (dark green) are inserted between
the cells (light green) that separated
them. These rearrangements depend
on the spatial regulation of actin-myosin
contractile bundles, which are localized 
primarily at the vertical cell boundaries 
(red, left). Contraction of these bundles is 
accompanied by removal of E-cadherin 
(not shown) at the same cell boundaries, 
resulting in shrinkage and loss of adhesion 
along the vertical axis (middle). New 
cadherin-based adhesions (blue, right) 
then form and expand along horizontal 
boundaries, resulting in extension of the 
cells in the anterior-posterior dimension.

Figure 19-16 Desmosomes. (A) The structural components of a desmosome. On the cytoplasmic surface of each interacting
plasma membrane is a dense plaque composed of a mixture of intracellular adaptor proteins. A bundle of keratin intermediate
filaments is attached to the surface of each plaque. Transmembrane nonclassical cadherins bind to the plaques and interact
through their extracellular domains to hold the adjacent membranes together. (B) Some of the molecular components of a
desmosome. Desmoglein and desmocollin are nonclassical cadherins. Their cytoplasmic tails bind plakoglobin (γ-catenin)
and plakophilin (a distant relative of p120-catenin), which in turn bind to desmoplakin. Desmoplakin binds to the sides of
intermediate filaments, thereby tying the desmosome to these filaments. (C) An electron micrograph of desmosome junctions
between three epidermal cells in the skin of a baby mouse. (D) Part of the same tissue at higher magnification, showing a single
desmosome, with intermediate filaments attached to it. (C and D, from W. He, P. Cowin and D.L. Stokes, Science 302:109-113,
2003. With permission from AAAS.)


in vertebrates but are not found, for example, in Drosophila. They are present in
most mature vertebrate epithelia and are particularly plentiful in tissues that are
subject to high levels of mechanical stress, such as heart muscle and the epidermis,
the epithelium that forms the outer layer of the skin.

Figure 19-16A shows the general structure of a desmosome, and Figure
19-16B shows some of the proteins that form it. Desmosomes typically appear as
buttonlike spots of adhesion, riveting the cells together (Figure 19-16C). Inside
the cell, the bundles of ropelike intermediate filaments that are anchored to
the desmosomes form a structural framework of great tensile strength (Figure
19-16D), with linkage to similar bundles in adjacent cells, creating a network that
extends throughout the tissue (Figure 19-17). The particular type of intermediate
filaments attached to the desmosomes depends on the cell type: they are keratin
filaments in most epithelial cells, for example, and desmin filaments in heart
muscle cells.

The importance of desmosomes is demonstrated by some forms of the potentially
fatal skin disease pemphigus. Affected individuals make antibodies against
one of their own desmosomal cadherin proteins. These antibodies bind to and
disrupt the desmosomes that hold their epidermal cells (keratinocytes) together.
This results in a severe blistering of the skin, with leakage of body fluids into the
loosened epithelium.

Tight Junctions Form a Seal Between Cells and a Fence Between
Plasma Membrane Domains

Sheets of epithelial cells enclose and partition the animal body, lining all its
surfaces and cavities, and creating internal compartments where specialized
processes occur. The epithelial sheet seems to be one of the inventions that lie at the
origin of animal evolution, diversifying in a huge variety of ways but retaining an
organization based on a set of conserved molecular mechanisms.

Essentially all epithelia are anchored to other tissue on one side—the basal
side—and free of such attachment on their opposite side—the apical side. A basal
lamina lies at the interface with the underlying tissue, mediating the attachment,
while the apical surface of the epithelium is generally bathed by extracellular fluid.
Thus, all epithelia are structurally polarized, and so are their individual cells: the
basal end of a cell, adherent to the basal lamina below, differs from the apical end,
exposed to the medium above.

Correspondingly, all epithelia have at least one function in common: they
serve as selective permeability barriers, separating the fluid that permeates the
tissue on their basal side from fluid with a different chemical composition on their
apical side. This barrier function requires that the adjacent cells be sealed together
by tight junctions, so that molecules cannot leak freely across the cell sheet.

The epithelium of the small intestine provides a good illustration of tight-junction
structure and function (see Figure 19-2). This epithelium has a simple columnar
structure; that is, it consists of a single layer of tall (columnar) cells. These are
of several differentiated types, but the majority are absorptive cells, specialized for
uptake of nutrients from the internal cavity, or lumen, of the gut. The absorptive
cells have to transport selected nutrients across the epithelium from the lumen
into the extracellular fluid on the other side. From there, these nutrients diffuse
into small blood vessels to provide nourishment to the organism. This transcellular
transport depends on two sets of transport proteins in the plasma membrane
of the absorptive cell. One set is confined to the apical surface of the cell (facing
the lumen) and actively transports selected molecules into the cell from the gut.
The other set is confined to the basolateral (basal and lateral) surfaces of the cell,
and it allows the same molecules to leave the cell by passive transport into the
extracellular fluid on the other side of the epithelium. For this transport activity
to be effective, the spaces between the epithelial cells must be tightly sealed, so
that the transported molecules cannot leak back into the gut lumen through these
spaces (Figure 19-18). Moreover, the transport proteins must be correctly distributed
in the plasma membranes: the apical transporters must be delivered to the
cell apex and must not be allowed to drift to the basolateral membrane, and the
basolateral transporters must be delivered to and remain in the basolateral membrane.
Tight junctions, besides sealing the gaps between the cells, also function
as “fences” that help prevent apical or basolateral proteins from diffusing into the
wrong region.

The sealing function of tight junctions is easy to demonstrate experimentally: a
low-molecular-weight tracer added to one side of an epithelium will generally not
pass beyond the tight junction (Figure 19-19). This seal is not absolute, however.
Although all tight junctions are impermeable to macromolecules, their permeability
to ions and other small molecules varies. Tight junctions in the epithelium
lining the small intestine, for example, are 10,000 times more permeable to inorganic
ions, such as Na⁺, than the tight junctions in the epithelium lining the urinary
bladder. The movement of ions and other molecules between epithelial cells
is called paracellular transport, and tissue-specific differences in transport rates
generally result from differences in the proteins that form tight junctions.

Tight Junctions Contain Strands of Transmembrane Adhesion 
Proteins 

When tight junctions are visualized by freeze-fracture electron microscopy, they
are seen as a branching network of sealing strands that completely encircles the




Figure 19-17 Desmosomes, 
hemidesmosomes, and the intermediate 
filament network. The keratin intermediate 
filament networks of adjacent cells—in 
this example, epithelial cells of the small
intestine—are indirectly connected to one
another through desmosomes, and to the
basal lamina through hemidesmosomes.

Figure 19-18 The role of tight junctions
in transcellular transport. For clarity, only
the tight junctions are shown. Transport
proteins are confined to different regions of
the plasma membrane in epithelial cells of
the small intestine. This segregation permits
a vectorial transfer of nutrients across the
epithelium from the gut lumen to the blood.
In the example shown, glucose is actively
transported into the cell by Na⁺-driven
glucose transporters at its apical surface, 
and it leaves the cell through passive 
glucose transporters in its basolateral 
membrane. Tight junctions are thought 
to confine the transport proteins to their 
appropriate membrane domains by 
acting as diffusion barriers, or “fences,” 
within the lipid bilayer of the plasma 
membrane; these junctions also block 
the backflow of glucose from the basal 
side of the epithelium into the gut lumen 
(see Movie 11.2). 


apical end of each cell in the epithelial sheet (Figure 19-20A and B). In conventional
electron micrographs, the outer leaflets of the two interacting plasma membranes
are tightly apposed where sealing strands are present (Figure 19-20C).
Each sealing strand is composed of a long row of transmembrane homophilic
adhesion proteins embedded in each of the two interacting plasma membranes.
The extracellular domains of these proteins adhere directly to one another to
occlude the intercellular space (Figure 19-21).

The main transmembrane proteins forming these strands are the claudins,
which are essential for tight-junction formation and function. Mice that lack the
claudin-1 gene, for example, fail to make tight junctions between the cells in the
epidermal layer of the skin; as a result, the baby mice lose water rapidly by evaporation
through the skin and die within a day after birth. Conversely, if nonepithelial
cells such as fibroblasts are artificially caused to express claudin genes, they



Figure 19-19 The role of tight junctions
in allowing epithelia to serve as barriers
to solute diffusion. (A) The drawing shows
how a small extracellular tracer molecule
added on one side of an epithelium is
prevented from crossing the epithelium by
the tight junctions that seal adjacent cells
together. Adherens junctions and other
cell junctions are not shown for clarity.
(B) Electron micrographs of cells in an
epithelium in which a small, extracellular,
electron-dense tracer molecule has been
added to either the apical side (on the left)
or the basolateral side (on the right). The
tight junction blocks passage of the tracer
in both directions. (B, courtesy of Daniel
Friend.)

Figure 19-20 The structure of a tight junction between epithelial cells of the small intestine. The junctions are shown
(A) schematically, (B) in a freeze-fracture electron micrograph, and (C) in a conventional electron micrograph. In (B), the plane of
the micrograph is parallel to the plane of the membrane, and the tight junction appears as a band of branching sealing strands
that encircle each cell in the epithelium (see Figure 19-21A). In (C), the junction is seen in cross section as a series of focal
connections between the outer leaflets of the two interacting plasma membranes, each connection corresponding to a sealing
strand in cross section. (B and C, from N.B. Gilula, in Cell Communication [R.P. Cox, ed.], pp. 1-29. New York: Wiley, 1974.)


will form tight-junctional connections with one another. Normal tight junctions 
also contain a second major transmembrane protein called occludin, which is not 
essential for the assembly or structure of the tight junction but is important for 
limiting junctional permeability. A third transmembrane protein, tricellulin, is 
required to seal cell membranes together and prevent transepithelial leakage at 
the points where three cells meet. 

The claudin protein family has many members (24 in humans), and these are
expressed in different combinations in different epithelia to confer particular permeability
properties on the epithelial sheet. They are thought to form paracellular
pores—selective channels allowing specific ions to cross the tight-junctional barrier,
from one extracellular space to another. A specific claudin found in kidney
epithelial cells, for example, is needed to let Mg²⁺ pass between the cells of the
kidney tubules so that this ion can be resorbed from the urine into the blood. A
mutation in the gene encoding this claudin results in excessive loss of Mg²⁺ in the
urine.

Scaffold Proteins Organize Junctional Protein Complexes 

Like the cadherin molecules of an adherens junction, the claudins and occludins
of a tight junction interact with each other on their extracellular sides to promote
junction assembly. Also as in adherens junctions, the organization of adhesion
proteins in a tight junction depends on additional proteins that bind the cytoplasmic
side of the adhesion proteins. The key organizational proteins at tight junctions
are the zonula occludens (ZO) proteins. The three major members of the ZO
family—ZO-1, ZO-2, and ZO-3—are large scaffold proteins that provide a structural
support on which the tight junction is built. These intracellular molecules

Figure 19-21 A model of a tight junction.

(A) The sealing strands hold adjacent 
plasma membranes together. The strands 
are composed of transmembrane proteins 
that make contact across the intercellular 
space and create a seal. (B) The molecular 
composition of a sealing strand. The 
major extracellular components of the 
tight junction are members of a family of 
proteins with four transmembrane domains. 
One of these proteins, claudin, is the most 
important for the assembly and structure 
of the sealing strands, whereas the related 
protein occludin has the less critical role of
determining junction permeability. The two 
termini of these proteins are both on the 
cytoplasmic side of the membrane, where 
they interact with large scaffolding proteins 
that organize the sealing strands and link 
the tight junction to the actin cytoskeleton 
(not shown here, but see Figure 19-22). 


consist of strings of protein-binding domains, typically including several PDZ
domains—segments about 80 amino acids long that can recognize and bind the
C-terminal tails of specific partner proteins (Figure 19-22). One domain of these
scaffold proteins can attach to a claudin protein, while others can attach to occludin
or the actin cytoskeleton. Moreover, one molecule of scaffold protein can bind
to another. In this way, the cell can assemble a mat of intracellular proteins that
organizes and positions the sealing strands of the tight junction.

The tight-junctional network of sealing strands usually lies just apical to adherens
and desmosome junctions that bond the cells together mechanically; the
whole assembly is called a junctional complex (see Figure 19-2). The parts of this
junctional complex depend on each other for their formation. For example,
anti-cadherin antibodies that block the formation of adherens junctions also block the
formation of tight junctions.


Gap Junctions Couple Cells Both Electrically and Metabolically 

Tight junctions block the passageways through the gaps between epithelial cells,
preventing extracellular molecules from leaking from one side of an epithelium to
the other. Another type of junctional structure has a radically different function:
it bridges gaps between adjacent cells so as to create direct channels from the

Figure 19-22 Scaffold proteins at the
tight junction. The scaffold proteins ZO-1, 
ZO-2, and ZO-3 are concentrated beneath 
the plasma membrane at tight junctions. 
Each of the proteins contains multiple 
protein-binding domains, including three 
PDZ domains, an SH3 domain, and a GK 
domain, linked together like beads on a 
flexible string. These domains enable the 
proteins to interact with each other and 
with numerous other partners, as indicated 
here, to generate a tightly woven protein 
network that organizes the sealing strands 
of the tight junction and links them to 
the actin cytoskeleton. Scaffold proteins 
with similar structure help organize other 
junctional complexes, including those at 
neural synapses.

Figure 19-23 Gap junctions as seen
in the electron microscope. (A) Thin-section
and (B) freeze-fracture electron
micrographs of a large and a small
gap-junction plaque between fibroblasts in
culture. In (B), each gap junction is seen as 
a cluster of homogeneous intramembrane 
particles. Each intramembrane particle 
corresponds to a connexon (see Figure 
19-25). (From N.B. Gilula, in Cell 
Communication [R.P. Cox, ed.], 
pp. 1-29. New York: Wiley, 1974.) 


cytoplasm of one to that of the other. These channels are called gap junctions. 

Gap junctions are present in most animal tissues, including connective tissues
as well as epithelia and heart muscle. Each gap junction appears in conventional
electron micrographs as a patch where the membranes of two adjacent cells are
separated by a uniform narrow gap of about 2-4 nm (Figure 19-23). The gap is
spanned by channel-forming proteins, of which there are two distinct families,
called the connexins and the innexins. Connexins are the predominant gap-junction
proteins in vertebrates, with 21 isoforms in humans. Innexins are found in the
gap junctions of invertebrates.

Gap junctions have a pore size of about 1.4 nm, which allows the exchange
of inorganic ions and other small water-soluble molecules, but not of macromolecules
such as proteins or nucleic acids (Figure 19-24). An electric current
injected into one cell through a microelectrode causes an electrical disturbance
in the neighboring cell, due to the flow of ions carrying electric charge through
gap junctions. This electrical coupling via gap junctions serves an obvious purpose
in tissues containing electrically excitable cells: action potentials can spread
rapidly from cell to cell, without the delay that occurs at chemical synapses. In
vertebrates, for example, electrical coupling through gap junctions synchronizes
the contractions of heart muscle cells as well as those of the smooth muscle cells
responsible for the peristaltic movements of the intestine. Gap junctions also
occur in many tissues whose cells are not electrically excitable. In principle, the
sharing of small metabolites and ions provides a mechanism for coordinating the
activities of individual cells in such tissues and for smoothing out random fluctuations
in small-molecule concentrations in different cells.
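
The size selectivity described above can be illustrated with a short Python sketch. It applies the approximate 1000-dalton cutoff reported in Figure 19-24 to a handful of molecules. The molecular weights are rounded standard values supplied here for illustration, not figures from this chapter, and the function names are hypothetical. A real channel does not have a perfectly sharp cutoff, so this is a simplification.

```python
# Approximate molecular weights in daltons (rounded standard values,
# supplied for illustration; they do not come from the chapter).
MOLECULES_DA = {
    "Ca2+": 40,
    "glucose": 180,
    "cyclic AMP": 329,
    "inositol trisphosphate": 420,
    "ATP": 507,
    "insulin": 5808,
    "green fluorescent protein": 27000,
}

# Cutoff from Figure 19-24: molecules under about 1000 daltons pass.
GAP_JUNCTION_CUTOFF_DA = 1000


def passes_gap_junction(mw_da, cutoff_da=GAP_JUNCTION_CUTOFF_DA):
    """Return True if a molecule of this weight is below the cutoff."""
    return mw_da < cutoff_da


def shared_molecules(molecules, cutoff_da=GAP_JUNCTION_CUTOFF_DA):
    """Names of molecules that coupled cells would share, lightest first."""
    passing = [(mw, name) for name, mw in molecules.items()
               if passes_gap_junction(mw, cutoff_da)]
    return [name for mw, name in sorted(passing)]


if __name__ == "__main__":
    for name in shared_molecules(MOLECULES_DA):
        print(name)
```

Under this simple threshold model, ions, sugars, nucleotides, and small signaling molecules are shared between coupled cells, whereas the two proteins in the list are retained.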



A Gap-Junction Connexon Is Made of Six Transmembrane 
Connexin Subunits 

Connexins are four-pass transmembrane proteins, six of which assemble to form 
a hemichannel, or connexon. When the connexons in the plasma membranes of 
two cells in contact are aligned, they form a continuous aqueous channel that 
connects the two cell interiors (Figure 19-25). A gap junction consists of many 
such connexon pairs in parallel, forming a sort of molecular sieve. Not only does 
this sieve provide a communication channel between cells, but it also provides a 
form of cell-cell adhesion that supplements the cadherin- and claudin-mediated
adhesions we discussed earlier. 


Figure 19-24 Determining the size of a
gap-junction channel. When fluorescent
molecules of various sizes are injected into
one of two cells coupled by gap junctions,
molecules with a molecular weight (MW) of
less than about 1000 daltons can pass into
the other cell, but larger molecules cannot.
Thus, the coupled cells share their small
molecules (such as inorganic ions, sugars,
amino acids, nucleotides, vitamins, and the
intracellular signaling molecules cyclic AMP
and inositol trisphosphate) but not their
macromolecules (proteins, nucleic acids,
and polysaccharides).

Figure 19-25 Gap junctions. (A) A drawing
of the interacting plasma membranes 
of two adjacent cells connected by gap 
junctions. Each lipid bilayer is shown as 
a pair of red sheets. Protein assemblies 
called connexons (green), each of which 
is formed by six connexin subunits, 
penetrate the apposed lipid bilayers. Two 
connexons join across the intercellular 
gap to form a continuous aqueous 
channel connecting the two cells. (B) The 
organization of connexins into connexons, 
and connexons into intercellular channels. 
The connexons can be homomeric or 
heteromeric, and the intercellular channels 
can be homotypic or heterotypic. (C) The 
high-resolution structure of a homomeric 
gap-junction channel, determined by x-ray 
crystallography of human connexin 26. In 
this view, we are looking down on the pore, 
formed from six connexin subunits. The 
structure illustrates the general features of
the channel and suggests a pore size of
about 1.4 nm, as predicted from studies
of gap-junction permeability with molecules
of various sizes (see Figure 19-24). 

(PDB code: 2ZW3.) 


Gap junctions in different tissues can have different properties because they
are formed from different combinations of connexins, creating channels that differ
in permeability and regulation. Most cell types express more than one type of
connexin, and two different connexin proteins can assemble into a heteromeric
connexon, with its own distinct properties. Moreover, adjacent cells expressing
different connexins can form intercellular channels in which the two aligned
half-channels are different (see Figure 19-25B).
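
To see how quickly this combinatorial variety grows, the following Python sketch enumerates the possible subunit compositions of a six-subunit connexon. It rests on simplifying assumptions that are not from the text: the order of subunits around the ring is ignored, every mixture is treated as able to assemble, and the connexin labels "A" and "B" are placeholders rather than real isoforms. In cells, only some connexin combinations are compatible.

```python
from collections import Counter
from itertools import combinations_with_replacement

SUBUNITS_PER_CONNEXON = 6  # from the text: six connexins per connexon


def connexon_compositions(connexin_types):
    """All distinct subunit compositions of one connexon.

    Ring order is ignored, so each composition is simply a count of
    subunits per connexin type.
    """
    return [Counter(combo) for combo in combinations_with_replacement(
        sorted(connexin_types), SUBUNITS_PER_CONNEXON)]


def is_homomeric(composition):
    """A homomeric connexon contains a single connexin type."""
    return len(composition) == 1


def channel_kind(connexon_a, connexon_b):
    """Homotypic if the two docked connexons have the same composition."""
    return "homotypic" if connexon_a == connexon_b else "heterotypic"


if __name__ == "__main__":
    compositions = connexon_compositions(["A", "B"])  # placeholder labels
    homomeric = [c for c in compositions if is_homomeric(c)]
    print(len(compositions), "compositions,", len(homomeric), "homomeric")
    print(channel_kind(compositions[0], compositions[-1]))
```

With only two connexin types, this counting scheme gives seven compositions, two homomeric and five heteromeric, before any pairing into intercellular channels is considered.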

Like conventional ion channels (discussed in Chapter 11), individual gap-junction
channels do not remain open all the time; instead, they flip between open
and closed states. These changes are triggered by a variety of stimuli, including the
voltage difference between the two connected cells, the membrane potential of
each cell, and various chemical properties of the cytoplasm, including the pH and
concentration of free Ca²⁺. Some subtypes of gap junctions can also be regulated
by extracellular signals such as neurotransmitters. We are only just beginning to
understand the physiological functions and structural basis of these various gating
mechanisms.

Each gap-junctional plaque is a dynamic structure that can readily assemble,
disassemble, or be remodeled, and it can contain a cluster of a few to many
thousands of connexons (see Figure 19-23B). Studies with fluorescently labeled
connexins in living cells show that new connexons are continually added around
the periphery of an existing junctional plaque, while old connexons are removed
from the middle of it and destroyed (Figure 19-26). This turnover is rapid: the
connexin molecules have a half-life of only a few hours.
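
A rough sense of what such a half-life implies can be had from the Python sketch below, which estimates the fraction of original connexin molecules remaining after the 4- and 8-hour incubations used in Figure 19-26. Two assumptions are made that the text does not state: loss is treated as simple first-order (exponential) decay, and the half-life is set to a hypothetical 2 hours, since the text says only "a few hours." Real plaques lose old connexons from the center outward, so this is a simplification rather than a model of the mechanism.

```python
def fraction_remaining(elapsed_hours, half_life_hours):
    """Fraction of the original molecules left, assuming first-order decay."""
    return 0.5 ** (elapsed_hours / half_life_hours)


# Hypothetical value chosen for illustration; the text gives no exact figure.
ASSUMED_HALF_LIFE_HOURS = 2.0

if __name__ == "__main__":
    for hours in (4, 8):  # incubation times used in Figure 19-26
        old = fraction_remaining(hours, ASSUMED_HALF_LIFE_HOURS)
        print(f"{hours} h: fraction old = {old:.4f}, fraction new = {1 - old:.4f}")
```

Under these assumptions, a quarter of the original molecules would remain after 4 hours and about one-sixteenth after 8 hours, consistent in direction with the shrinking patch of old connexins seen at the longer incubation time.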

The mechanism of removal of old connexons from the middle of the plaque
is not known, but the route of delivery of new connexons to its periphery seems
clear: they are inserted into the plasma membrane by exocytosis, like other integral
membrane proteins, and then diffuse in the plane of the membrane until
they bump into the periphery of a connexon plaque and become trapped. This
has a corollary: the plasma membrane away from the gap junction should contain
connexons—hemichannels—that have not yet paired with their counterparts on
another cell. It is thought that these unpaired hemichannels are normally held
in a closed conformation, preventing the cell from losing its small molecules by

Figure 19-26 Connexin turnover at a gap junction. Cells were transfected
with a slightly modified connexin gene, coding for a connexin with a short 
amino acid tag containing four cysteines in the sequence Cys-Cys-X-X-Cys-Cys
(where X denotes an arbitrary amino acid). This tetracysteine tag
can bind strongly to certain small fluorescent dye molecules, which can be
added to the culture medium and will readily enter cells by diffusing across
the plasma membrane. In the experiment shown, a green dye was added
first to label all the connexin molecules in the cells, and the cells were then
washed and incubated for 4 or 8 hours. At the end of this time, a red dye
was added to the medium and the cells were washed again and fixed.
Connexin molecules already present at the beginning of the experiment are
labeled green (and take up no red dye because their tetracysteine tags are
already saturated with green dye), while connexins synthesized subsequently,
during the 4- or 8-hour incubation, are labeled red. The fluorescence images
show gap junctions between pairs of cells treated in this way. The central
part of the gap-junction plaque is green, indicating that it consists of old
connexin molecules, while the periphery is red, indicating that it consists
of connexins synthesized during the previous 4 or 8 hours. The longer the
time of incubation, the smaller the green central patch of old molecules, and
the larger the peripheral ring of new molecules that have been recruited to
replace the old ones. (From G. Gaietta et al., Science 296:503-507, 2002. 
With permission from AAAS.) 


leakage through them. But there is also evidence that in some circumstances they
can open and serve as channels for the release of small signal molecules.


In Plants, Plasmodesmata Perform Many of the Same Functions
as Gap Junctions

The tissues of a plant are organized on different principles from those of an animal.
This is because plant cells are imprisoned within tough cell walls composed
of an extracellular matrix rich in cellulose and other polysaccharides, as we discuss
later. The cell walls of adjacent cells are firmly cemented to those of their
neighbors, which eliminates the need for anchoring junctions to hold the cells
in place. But a need for direct cell-cell communication remains. Thus, plant cells
have only one class of intercellular junctions, plasmodesmata. Like gap junctions,
they directly connect the cytoplasms of adjacent cells.

In plants, the cell wall between a typical pair of adjacent cells is at least 0.1 μm
thick, and so a structure very different from a gap junction is required to mediate
communication across it. Plasmodesmata solve the problem. With a few specialized
exceptions, every living cell in a higher plant is connected to its living
neighbors by these structures, which form fine cytoplasmic channels through the
intervening cell walls. As shown in Figure 19-27A, the plasma membrane of one
cell is continuous with that of its neighbor at each plasmodesma, which connects
the cytoplasms of the two cells by a roughly cylindrical channel with a diameter
of 20-40 nm.

Running through the center of the channel in most plasmodesmata is a narrower
cylindrical structure, the desmotubule, which is continuous with elements
of the smooth endoplasmic reticulum (ER) in each of the connected cells (Figure
19-27B-D). Between the outside of the desmotubule and the inner face of
the cylindrical channel formed by plasma membrane is an annulus of cytosol
through which small molecules can pass from cell to cell. As each new cell wall
is assembled during the cytokinesis phase of cell division, plasmodesmata are
created within it. They form around elements of smooth ER that become trapped
across the developing cell plate (discussed in Chapter 17). They can also be
inserted de novo through preexisting cell walls, where they are commonly found
in dense clusters called pit fields. When no longer required, plasmodesmata can
be removed.

In spite of the radical difference in structure between plasmodesmata and gap
junctions, they seem to function in remarkably similar ways. Evidence obtained
by injecting tracer molecules of different sizes suggests that plasmodesmata
allow the passage of molecules with a molecular weight of less than about 800,
which is similar to the molecular-weight cutoff for gap junctions. As with gap
junctions, transport through plasmodesmata is regulated. Dye-injection
experiments, for example, show that there can be barriers to the movement of
even low-molecular-weight molecules between certain cells, or groups of cells,
that are connected by apparently normal plasmodesmata; the mechanisms that
restrict communication in these cases are not understood.

Figure 19-27 Plasmodesmata. (A) The cytoplasmic channels of
plasmodesmata pierce the plant cell wall and connect cells in a plant
together. (B) Each plasmodesma is lined with plasma membrane that is
common to the two connected cells. It usually also contains a fine tubular
structure, the desmotubule, derived from smooth endoplasmic reticulum.
(C) Electron micrograph of a longitudinal section of a plasmodesma from
a water fern. The plasma membrane lines the pore and is continuous from
one cell to the next. Endoplasmic reticulum and its association with the
central desmotubule can also be seen. (D) A similar plasmodesma seen in
cross section. (C and D, from R. Overall, J. Wolfe and B.E.S. Gunning, in
Protoplasma 111, pp. 134-150. Heidelberg: Springer-Verlag, 1982.)





Selectins Mediate Transient Cell-Cell Adhesions in the
Bloodstream

We now complete our overview of cell-cell junctions and adhesion by briefly
describing some of the more specialized adhesion mechanisms used in some
tissues. In addition to those we have already discussed, at least three other
superfamilies of cell-cell adhesion proteins are important: the integrins, the
selectins, and the adhesive immunoglobulin (Ig) superfamily members. We shall
discuss integrins in more detail later: their main function is in cell-matrix
adhesion, but a few of them mediate cell-cell adhesion in specialized
circumstances. Ca²⁺ dependence provides one simple way to distinguish among
these classes of adhesion proteins experimentally. Selectins, like cadherins
and integrins, require Ca²⁺ for their adhesive function; Ig superfamily members
do not.
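
The Ca²⁺ test described above can be written down as a small lookup. The following Python sketch is only an illustration: the table restates the dependence given in the text, while the function name and its boolean argument are hypothetical and stand in for the outcome of an adhesion assay performed without Ca²⁺.

```python
# Ca2+ requirement of each adhesion-protein family, as stated in the text.
CA_DEPENDENT = {
    "cadherin": True,
    "selectin": True,
    "integrin": True,
    "Ig superfamily": False,
}


def candidate_families(adhesion_survives_without_calcium):
    """Return the families consistent with a (hypothetical) assay result.

    If adhesion persists when Ca2+ is removed, only Ca2+-independent
    families remain candidates; if adhesion is lost, only Ca2+-dependent
    families remain.
    """
    return [
        family
        for family, needs_calcium in CA_DEPENDENT.items()
        if needs_calcium != adhesion_survives_without_calcium
    ]


if __name__ == "__main__":
    print(candidate_families(True))
    print(candidate_families(False))
```

The test narrows the candidates but does not identify a single family when adhesion is lost, since three of the four classes require Ca²⁺.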

Selectins are cell-surface carbohydrate-binding proteins (lectins) that
mediate a variety of transient cell-cell adhesion interactions in the
bloodstream. Their main role, in vertebrates at least, is in governing the
traffic of white blood cells into normal lymphoid organs and any inflamed
tissues. White blood cells lead a nomadic life, roving between the bloodstream
and the tissues, and this necessitates special adhesive behavior. The selectins
control the binding of white blood cells to the endothelial cells lining blood
vessels, thereby enabling the blood cells to migrate out of the bloodstream
into a tissue.

Each selectin is a transmembrane protein with a conserved lectin domain that
binds to a specific oligosaccharide on another cell (Figure 19-28A). There are
at least three types: L-selectin on white blood cells, P-selectin on blood
platelets and on endothelial cells that have been locally activated by an
inflammatory response, and E-selectin on activated endothelial cells. In a
lymphoid organ, such as a lymph node or the spleen, the endothelial cells
express oligosaccharides that are recognized by L-selectin on lymphocytes,
causing the lymphocytes to loiter and become trapped. At sites of inflammation,
the roles are reversed: the endothelial cells switch on expression of selectins
that recognize the oligosaccharides on white blood cells and platelets,
flagging the cells down to help deal with the local emergency. Selectins do not
act alone, however; they collaborate with integrins, which strengthen the
binding of the blood cells to the endothelium. The cell-cell adhesions mediated
by both selectins and integrins are heterophilic; that is, the binding is to a
molecule of a different type: selectins bind to specific oligosaccharides on
glycoproteins and glycolipids, while integrins bind to specific Ig-family
proteins.

Selectins and integrins act in sequence to let white blood cells leave the
bloodstream and enter tissues (Figure 19-28B). The selectins mediate a weak
adhesion because the binding of the lectin domain of the selectin to its
carbohydrate ligand is of low affinity. This allows the white blood cell to
adhere weakly and reversibly to the endothelium, rolling along the surface of
the blood vessel, propelled by the flow of blood. The rolling continues until
the blood cell activates its integrins. As we discuss later, these
transmembrane molecules can be switched into an adhesive conformation that
enables them to latch onto specific macromolecules external to the cell: in the
present case, proteins on the surfaces of the endothelial cells. Once it has
attached in this way, the white blood cell escapes from the bloodstream into
the tissue by crawling out of the blood vessel between adjacent endothelial
cells.
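
The order of events in this paragraph can be summarized as a small state machine. The Python sketch below is a deliberately simplified illustration, not a model of the underlying biology: the stage names, the function `next_stage`, and its two boolean inputs are all hypothetical labels introduced here, and the only rules encoded are those stated in the text (weak, reversible selectin binding first; firm attachment only after integrin activation; emigration only after firm attachment).

```python
from enum import Enum


class Stage(Enum):
    FLOWING = "free in the bloodstream"
    ROLLING = "rolling on the endothelium (weak selectin binding)"
    ARRESTED = "firmly attached (activated integrins)"
    EMIGRATED = "crawled out between endothelial cells"


def next_stage(stage, selectin_bound, integrin_active):
    """Advance a white blood cell by one step of the adhesion sequence."""
    if stage is Stage.FLOWING and selectin_bound:
        return Stage.ROLLING
    if stage is Stage.ROLLING:
        if not selectin_bound:
            # Selectin binding is low affinity and reversible.
            return Stage.FLOWING
        if integrin_active:
            return Stage.ARRESTED
    if stage is Stage.ARRESTED:
        return Stage.EMIGRATED
    # No rule applies: the cell stays in its current stage.
    return stage


if __name__ == "__main__":
    stage = Stage.FLOWING
    for selectin_bound, integrin_active in [(True, False), (True, False),
                                            (True, True), (True, True)]:
        stage = next_stage(stage, selectin_bound, integrin_active)
        print(stage.name)
```

Note that integrin activation alone does not move a free-flowing cell forward in this sketch; this reflects the statement that the two protein classes act in sequence.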


Figure 19-28 The structure and function of selectins. (A) The structure
of P-selectin. The selectin attaches to the actin cytoskeleton through adaptor
proteins that are still poorly characterized. (B) How selectins and integrins
mediate the cell-cell adhesions required for a white blood cell to migrate out
of the bloodstream into a tissue. First, selectins on endothelial cells bind to
oligosaccharides on the white blood cell, so that it becomes loosely attached
and rolls along the vessel wall. Then the white blood cell activates a
cell-surface integrin called LFA1, which binds to a protein called ICAM1
(belonging to the Ig superfamily) on the membrane of the endothelial cell. The
white blood cell adheres to the vessel wall and then crawls out of the vessel
by a process that requires another immunoglobulin superfamily member called
PECAM1 (or CD31), not shown (Movie 19.2). EGF, epidermal growth factor.


Members of the Immunoglobulin Superfamily Mediate
Ca²⁺-Independent Cell-Cell Adhesion

The chief endothelial cell proteins that are recognized by the white blood cell
integrins are called ICAMs (intercellular cell adhesion molecules) or VCAMs
(vascular cell adhesion molecules). They are members of another large and
ancient family of cell-surface molecules, the immunoglobulin (Ig) superfamily.
These contain one or more extracellular Ig-like domains that are characteristic
of antibody molecules. They have many functions outside the immune system that
are unrelated to immune defenses.

While ICAMs and VCAMs on endothelial cells both mediate heterophilic binding to
integrins, many other Ig superfamily members appear to mediate homophilic
binding. An example is the neural cell adhesion molecule (NCAM), which is
expressed by various cell types, including most nerve cells, and can take
different forms, generated by alternative splicing of an RNA transcript
produced from a single gene (Figure 19-29). Some forms of NCAM carry an
unusually large quantity of sialic acid (with chains containing hundreds of
repeating sialic acid units). By virtue of their negative charge, the long
polysialic acid chains can interfere with cell adhesion (because like charges
repel one another); thus, these forms of NCAM can serve to inhibit adhesion,
rather than cause it.

Figure 19-29 Two members of the Ig superfamily of cell-cell adhesion
molecules. NCAM is expressed on neurons and many other cell types, and
mediates homophilic binding. ICAM is expressed on endothelial cells and some
other cell types and binds heterophilically to an integrin on white blood
cells. Both NCAM and ICAM are glycoproteins, but their attached carbohydrate
chains are not shown.

A cell of a given type generally uses an assortment of different adhesion
proteins to interact with other cells, just as each cell uses an assortment of
different receptors to respond to the many soluble extracellular signal
molecules in its environment. Although cadherins and Ig superfamily members are
frequently expressed on the same cells, the adhesions mediated by cadherins are
much stronger, and they are largely responsible for holding cells together,
segregating cell collectives into discrete tissues, and maintaining tissue
integrity. Molecules such as NCAM seem to contribute more to the fine-tuning of
these adhesive interactions during development and regeneration, playing a part
in various specialized adhesive phenomena, such as that discussed for blood
cells and endothelial cells. Thus, while mutant mice that lack N-cadherin die
early in development, those that lack NCAM develop relatively normally but show
some mild abnormalities in the development of certain specific tissues,
including parts of the nervous system.

Summary 

In epithelia, as well as in some other types of tissue, cells are directly
attached to one another through strong cell-cell adhesions, mediated by
transmembrane proteins called cadherins, which are anchored intracellularly to
the cytoskeleton. Cadherins generally bind to one another homophilically: the
head of one cadherin molecule binds to the head of a similar cadherin on an
opposite cell. This selectivity enables mixed populations of cells of different
types to sort out from one another according to the specific cadherins they
express, and it helps to control cell rearrangements during development.

The “classical” cadherins at adherens junctions are linked to the actin
cytoskeleton by intracellular adaptor proteins called catenins. These form an
anchoring complex on the intracellular tail of the cadherin molecule, and are
involved not only in physical anchorage but also in the detection of and
response to tension and other regulatory signals at the junction.

Tight junctions seal the gaps between cells in epithelia, creating a barrier to
the diffusion of molecules across the cell sheet and also helping to separate
the populations of proteins in the apical and basolateral plasma membrane
domains of the epithelial cell. Claudins are the major transmembrane proteins
forming tight junctions. Intracellular scaffold proteins organize the claudins
and other junctional proteins into a complex protein network that is linked to
the actin cytoskeleton.

The cells of many animal tissues are coupled by gap junctions, which take the
form of plaques of clustered connexons, which usually allow molecules smaller
than about 1000 daltons to pass directly from the inside of one cell to the
inside of the next. Cells connected by gap junctions share many of their
inorganic ions and other small molecules and are therefore chemically and
electrically coupled.

Three additional classes of transmembrane adhesion proteins mediate more
transient cell-cell adhesion: selectins, immunoglobulin (Ig) superfamily
members, and integrins. Selectins are expressed on white blood cells, blood
platelets, and endothelial cells; they bind heterophilically to carbohydrate
groups on cell surfaces, helping to mediate the adhesive interactions between
these cells. Ig superfamily proteins also play a part in these interactions, as
well as in many other adhesive processes; some of them bind homophilically,
some heterophilically. Integrins, though they mainly serve to attach cells to
the extracellular matrix, can also mediate cell-cell adhesion by binding to
specific Ig superfamily proteins.

THE EXTRACELLULAR MATRIX OF ANIMALS 

Tissues are not made up solely of cells. They also contain a remarkably complex
and intricate network of macromolecules constituting the extracellular matrix.
This matrix is composed of many different proteins and polysaccharides that are
secreted locally and assembled into an organized meshwork in close association
with the surfaces of the cells that produce them.

The classes of macromolecules constituting the extracellular matrix in
different animal tissues are broadly similar, but variations in the relative
amounts of these different classes of molecules and in the ways in which they
are organized give rise to an amazing diversity of materials. The matrix can
become calcified to form the rock-hard structures of bone or teeth, or it can
form the transparent substance of the cornea, or it can adopt the ropelike
organization that gives tendons their enormous tensile strength. It forms the
jelly in a jellyfish. Covering the body of a beetle or a lobster, it forms a
rigid carapace. Moreover, the extracellular matrix is more than a passive
scaffold to provide physical support. It has an active and complex role in
regulating the behavior of the cells that touch it, inhabit it, or crawl
through its meshes, influencing their survival, development, migration,
proliferation, shape, and function.

In this section, we describe the major features of the extracellular matrix in
animal tissues, with an emphasis on vertebrates. We begin with an overview of
the major classes of macromolecules in the matrix, after which we turn to the
structure and function of the basal lamina, the thin layer of specialized
extracellular matrix that lies beneath all epithelial cells. In the sections
that follow, we then describe the varied types of cell-matrix junctions through
which cells are connected to the matrix.

The Extracellular Matrix Is Made and Oriented by the Cells Within It 

The macromolecules that constitute the extracellular matrix are mainly produced
locally by cells in the matrix. As we discuss later, these cells also help to
organize the matrix: the orientation of the cytoskeleton inside the cell can
control the orientation of the matrix produced outside. In most connective
tissues, the matrix macromolecules are secreted by cells called fibroblasts
(Figure 19-30). In certain specialized types of connective tissues, such as
cartilage and bone, however, they are secreted by cells of the fibroblast
family that have more specific names: chondroblasts, for example, form
cartilage, and osteoblasts form bone.

The extracellular matrix is constructed from three major classes of
macromolecules: (1) glycosaminoglycans (GAGs), which are large and highly
charged polysaccharides that are usually covalently linked to protein in the
form of proteoglycans; (2) fibrous proteins, which are primarily members of the
collagen family; and (3) a large class of noncollagen glycoproteins, which
carry conventional asparagine-linked oligosaccharides (described in
Chapter 12). All three classes of macromolecule have many members and come in a
great variety of shapes and sizes (Figure 19-31). Mammals are thought to have
almost 300 matrix proteins, including about 36 proteoglycans, about 40
collagens, and over 200 glycoproteins, which usually contain multiple
subdomains and self-associate to form multimers. Add to this the large number
of matrix-associated proteins and enzymes that can modify matrix behavior by
cross-linking, degradation, or other mechanisms, and one begins to see that the
matrix is an almost infinitely variable material. Each tissue contains its own
unique blend of matrix components, resulting in an extracellular matrix that is
specialized for the needs of that tissue.

Figure 19-30 Fibroblasts in connective tissue. This scanning electron
micrograph shows tissue from the cornea of a rat. The extracellular matrix
surrounding the fibroblasts is here composed largely of collagen fibrils. The
glycoproteins, hyaluronan, and proteoglycans, which normally form a hydrated
gel filling the interstices of the fibrous network, have been removed by enzyme
and acid treatment. (Courtesy of T. Nishida.)

The proteoglycan molecules in connective tissue typically form a highly
hydrated, gel-like “ground substance” in which collagens and glycoproteins are
embedded. The polysaccharide gel resists compressive forces on the matrix while
permitting the rapid diffusion of nutrients, metabolites, and hormones between
the blood and the tissue cells. The collagen fibers strengthen and help
organize the matrix, while other fibrous proteins, such as the rubberlike
elastin, give it resilience. Finally, the many matrix glycoproteins help cells
migrate, settle, and differentiate in the appropriate locations.

Glycosaminoglycan (GAG) Chains Occupy Large Amounts of 
Space and Form Hydrated Gels 

Glycosaminoglycans (GAGs) are unbranched polysaccharide chains composed of
repeating disaccharide units. One of the two sugars in the repeating
disaccharide is always an amino sugar (N-acetylglucosamine or
N-acetylgalactosamine), which in most cases is sulfated. The second sugar is
usually a uronic acid (glucuronic or iduronic). Because there are sulfate or
carboxyl groups on most of their sugars, GAGs are highly negatively charged
(Figure 19-32). Indeed, they are the most anionic molecules produced by animal
cells. Four main groups of GAGs are distinguished by their sugars, the type of
linkage between the sugars, and the number and location of sulfate groups:
(1) hyaluronan, (2) chondroitin sulfate and dermatan sulfate, (3) heparan
sulfate, and (4) keratan sulfate.

Polysaccharide chains are too stiff to fold into compact globular structures,
and they are strongly hydrophilic. Thus, GAGs tend to adopt highly extended
conformations that occupy a huge volume relative to their mass (Figure 19-33),
and they form hydrated gels even at very low concentrations. The weight of GAGs
in connective tissue is usually less than 10% of the weight of proteins, but
GAG chains fill most of the extracellular space. Their high density of negative
charges attracts a cloud of cations, especially Na⁺, that are osmotically
active, causing large amounts of water to be sucked into the matrix. This
creates a swelling pressure, or turgor, that enables the matrix to withstand
compressive forces (in contrast to collagen fibrils, which resist stretching
forces). The cartilage matrix that lines the knee joint, for example, can
support pressures of hundreds of atmospheres in this way.

Defects in the production of GAGs can affect many different body systems. In
one rare human genetic disease, for example, there is a severe deficiency in
the synthesis of dermatan sulfate disaccharide. The affected individuals have a
short stature, a prematurely aged appearance, and generalized defects in their
skin, joints, muscles, and bones.

Figure 19-31 The comparative shapes and sizes of some of the major
extracellular matrix macromolecules. Protein is shown in green, and
glycosaminoglycan (GAG) in red.

Figure 19-32 The repeating disaccharide sequence of a heparan sulfate
glycosaminoglycan (GAG) chain. These chains can consist of as many as 200
disaccharide units, but are typically less than half that size. There is a high
density of negative charges along the chain due to the presence of both
carboxyl and sulfate groups. The molecule is shown here with its maximal number
of sulfate groups. In vivo, the proportion of sulfated and nonsulfated groups
is variable. Heparin typically has >70% sulfation, while heparan sulfate has
<50%.

Hyaluronan Acts as a Space Filler During Tissue Morphogenesis
and Repair

Hyaluronan (also called hyaluronic acid or hyaluronate) is the simplest of the
GAGs (Figure 19-34). It consists of a regular repeating sequence of up to
25,000 disaccharide units, is found in variable amounts in all tissues and
fluids in adult animals, and is especially abundant in early embryos.
Hyaluronan is not a typical GAG because it contains no sulfated sugars, all its
disaccharide units are identical, its chain length is enormous, and it is not
generally linked covalently to any core protein. Moreover, whereas other GAGs
are synthesized inside the cell and released by exocytosis, hyaluronan is spun
out directly from the cell surface by an enzyme complex embedded in the plasma
membrane.

Hyaluronan is thought to have a role in resisting compressive forces in tissues
and joints. It is also important as a space filler during embryonic
development, where it can be used to force a change in the shape of a
structure, as a small quantity expands with water to occupy a large volume.
Hyaluronan synthesized locally from the basal side of an epithelium can deform
the epithelium by creating a cell-free space beneath it, into which cells
subsequently migrate. In the developing heart, for example, hyaluronan
synthesis helps in this way to drive formation of the valves and septa that
separate the heart's chambers. Similar processes occur in several other organs.
When cell migration ends, the excess hyaluronan is generally degraded by the
enzyme hyaluronidase. Hyaluronan is also produced in large quantities during
wound healing, and it is an important constituent of joint fluid, in which it
serves as a lubricant.

Proteoglycans Are Composed of GAG Chains Covalently Linked
to a Core Protein

Except for hyaluronan, all GAGs are covalently attached to protein as
proteoglycans, which are produced by most animal cells. Membrane-bound
ribosomes make the polypeptide chain, or core protein, of a proteoglycan, which
is then threaded into the lumen of the endoplasmic reticulum. The
polysaccharide chains are mainly assembled on this core protein in the Golgi
apparatus before delivery to the exterior of the cell by exocytosis. First, a
special linkage tetrasaccharide is attached to a serine side chain on the core
protein to serve as a primer for polysaccharide growth; then, one sugar at a
time is added by specific glycosyl transferases (Figure 19-35). While still in
the Golgi apparatus, many of the polymerized sugars are covalently modified by
a sequential and coordinated series of reactions. Epimerizations alter the
configuration of the substituents around individual carbon atoms in the sugar
molecule; sulfations increase the negative charge.

Proteoglycans are clearly distinguished from other glycoproteins by the nature,
quantity, and arrangement of their sugar side chains. By definition, at least
one of the sugar side chains of a proteoglycan must be a GAG. Whereas
glycoproteins generally contain relatively short, branched oligosaccharide
chains that contribute only a small fraction of their weight, proteoglycans can
contain as much as 95% carbohydrate by weight, mostly in the form of long,
unbranched GAG chains, each typically about 80 sugars long.

Figure 19-33 The relative dimensions and volumes occupied by various
macromolecules. Several proteins, a glycogen granule, and a single hydrated
molecule of hyaluronan are shown.

Figure 19-34 The repeating disaccharide sequence in hyaluronan, a relatively
simple GAG. This ubiquitous molecule in vertebrates consists of a single long
chain of up to 25,000 sugar monomers. Note the absence of sulfate groups.

Figure 19-35 The linkage between a GAG chain and its core protein in a
proteoglycan molecule. A specific link tetrasaccharide is first assembled on a
serine side chain. The rest of the GAG chain, consisting mainly of a repeating
disaccharide unit, is then synthesized, with one sugar being added at a time.
In chondroitin sulfate, the disaccharide is composed of D-glucuronic acid and
N-acetyl-D-galactosamine; in heparan sulfate, it is either D-glucuronic acid or
L-iduronic acid and N-acetyl-D-glucosamine; in keratan sulfate, it is
D-galactose and N-acetyl-D-glucosamine.

In principle, proteoglycans have the potential for almost limitless
heterogeneity. Even a single type of core protein can carry highly variable
numbers and types of attached GAG chains. Moreover, the underlying repeating
sequence of disaccharides in each GAG can be modified by a complex pattern of
sulfate groups. The core proteins, too, are diverse, though many of them share
some characteristic domains such as the LINK domain, involved in binding to
GAGs.

Proteoglycans can be huge. The proteoglycan aggrecan, for example, which is a
major component of cartilage, has a mass of about 3 × 10⁶ daltons with over 100
GAG chains. Other proteoglycans are much smaller and have only 1-10 GAG chains;
an example is decorin, which is secreted by fibroblasts and has a single GAG
chain (Figure 19-36). Decorin binds to collagen fibrils and regulates fibril
assembly and fibril diameter; mice that cannot make decorin have fragile skin
that has reduced tensile strength. The GAGs and proteoglycans of these various
types can associate to form even larger polymeric complexes in the
extracellular matrix. Molecules of aggrecan, for example, assemble with
hyaluronan in cartilage matrix to form aggregates that are as big as a
bacterium (Figure 19-37). Moreover, besides associating with one another, GAGs
and proteoglycans associate with fibrous matrix proteins such as collagen and
with protein meshworks such as the basal lamina, creating extremely complex
composites (Figure 19-38).
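
As a rough consistency check on these sizes, the following Python sketch multiplies the aggrecan mass quoted above by the approximate number of monomers per aggregate given in the legend to Figure 19-37, and then divides by the bacterium-sized volume quoted there. It is a back-of-the-envelope estimate only: it assumes the aggregate contains nothing but aggrecan (the hyaluronan chain and link proteins are ignored), and the variable names are illustrative.

```python
# Values quoted in the text and in the legend to Figure 19-37.
AGGRECAN_MASS_DA = 3e6           # mass of one aggrecan molecule, daltons
MONOMERS_PER_AGGREGATE = 100     # "about 100 aggrecan monomers"
AGGREGATE_VOLUME_CM3 = 2e-12     # volume of a bacterium, cm^3

# Physical constant: mass of one dalton in grams.
GRAMS_PER_DALTON = 1.66053907e-24

# Assumption: ignore the hyaluronan backbone and the link proteins.
aggregate_mass_da = AGGRECAN_MASS_DA * MONOMERS_PER_AGGREGATE
aggregate_mass_g = aggregate_mass_da * GRAMS_PER_DALTON

# Mean mass of macromolecule per unit of occupied volume.
mean_density_g_per_cm3 = aggregate_mass_g / AGGREGATE_VOLUME_CM3

if __name__ == "__main__":
    print(f"aggregate mass: {aggregate_mass_da:.1e} daltons")
    print(f"mean density:   {mean_density_g_per_cm3:.1e} g/cm^3")
```

The product is 3 × 10⁸ daltons, in line with the "10⁸ daltons or more" given in the figure legend, and the mean density comes out several thousand times lower than that of water, which illustrates how little mass is needed to fill such a large hydrated volume.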

Not all proteoglycans are secreted components of the extracellular matrix. Some
are integral components of plasma membranes and have their core protein either
inserted across the lipid bilayer or attached to the lipid bilayer by a
glycosylphosphatidylinositol (GPI) anchor. Among the best-characterized plasma
membrane proteoglycans are the syndecans, which have a membrane-spanning core
protein whose intracellular domain is thought to interact with the actin
cytoskeleton and with signaling molecules in the cell cortex. Syndecans are
located on the surface of many types of cells, including fibroblasts and
epithelial cells. In fibroblasts, syndecans can be found in cell-matrix
adhesions, where they modulate integrin function by interacting with
fibronectin on the cell surface and with cytoskeletal and signaling proteins
inside the cell. As we discuss later, syndecans and other proteoglycans also
interact with soluble peptide growth factors, influencing their effects on cell
growth and proliferation.

Figure 19-36 Examples of a small (decorin) and a large (aggrecan)
proteoglycan found in the extracellular matrix. The figure compares these two
proteoglycans with a typical secreted glycoprotein molecule, pancreatic
ribonuclease B. All three are drawn to scale. The core proteins of both
aggrecan and decorin contain oligosaccharide chains as well as the GAG chains,
but these are not shown. Aggrecan typically consists of about 100 chondroitin
sulfate chains and about 30 keratan sulfate chains linked to a serine-rich core
protein of almost 3000 amino acids. Decorin “decorates” the surface of collagen
fibrils, hence its name.

Figure 19-37 An aggrecan aggregate from fetal bovine cartilage. (A) An
electron micrograph of an aggrecan aggregate shadowed with platinum. Many free
aggrecan molecules are also visible. (B) A drawing of the giant aggrecan
aggregate shown in (A). It consists of about 100 aggrecan monomers (each like
the one shown in Figure 19-36) noncovalently bound through the N-terminal
domain of the core protein to a single hyaluronan chain. A link protein binds
both to the core protein of the proteoglycan and to the hyaluronan chain,
thereby stabilizing the aggregate. The link proteins are members of a family of
hyaluronan-binding proteins, some of which are cell-surface proteins. The
molecular mass of such a complex can be 10⁸ daltons or more, and it occupies a
volume equivalent to that of a bacterium, which is about 2 × 10⁻¹² cm³.
(A, courtesy of Lawrence Rosenberg.)

Collagens Are the Major Proteins of the Extracellular Matrix 

The collagens are a family of fibrous proteins found in all multicellular
animals. They are secreted in large quantities by connective-tissue cells, and
in smaller quantities by many other cell types. As a major component of skin
and bone, collagens are the most abundant proteins in mammals, where they
constitute 25% of the total protein mass.

The primary feature of a typical collagen molecule is its long, stiff,
triple-stranded helical structure, in which three collagen polypeptide chains,
called α chains, are wound around one another in a ropelike superhelix
(Figure 19-39). Collagens are extremely rich in proline and glycine, both of
which are important in the formation of the triple-stranded helix.

Figure 19-38 Proteoglycans in the extracellular matrix of rat cartilage.
The tissue was rapidly frozen at -196°C, and fixed and stained while still
frozen (a process called freeze substitution) to prevent the GAG chains from
collapsing. In this electron micrograph, the proteoglycan molecules are seen to
form a fine filamentous network in which a single striated collagen fibril is
embedded. The more darkly stained parts of the proteoglycan molecules are the
core proteins; the faintly stained threads are the GAG chains. (Reproduced from
E.B. Hunziker and R.K. Schenk, J. Cell Biol. 98:277-282, 1984. With permission
from The Rockefeller University Press.)

Figure 19-39 The structure of a typical collagen molecule. (A) A model of
part of a single collagen α chain, in which each amino acid is represented by
a sphere. The chain is about 1000 amino acids long. It is arranged as a
left-handed helix, with three amino acids per turn and with glycine as every
third amino acid. Therefore, an α chain is composed of a series of triplet
Gly-X-Y sequences, in which X and Y can be any amino acid (although X is
commonly proline and Y is commonly hydroxyproline, a form of proline that is
chemically modified during collagen synthesis in the cell). (B) A model of part
of a collagen molecule, in which three α chains, each shown in a different
color, are wrapped around one another to form a triple-stranded helical rod.
Glycine is the only amino acid small enough to occupy the crowded interior of
the triple helix. Only a short length of the molecule is shown; the entire
molecule is 300 nm long. (From a model by B.L. Trus.)
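
The numbers given in the legend to Figure 19-39 can be combined in a short calculation. The Python sketch below builds an idealized chain of repeating Gly-X-Y triplets and derives the glycine count and the average length contributed by each amino acid along the rod. It is a simplified illustration: it assumes that the whole chain of about 1000 amino acids is in the triple-helical repeat and spans the full 300 nm, and `build_chain` is a hypothetical helper, not part of any library.

```python
# Numbers taken from the legend to Figure 19-39.
CHAIN_LENGTH_RESIDUES = 1000    # "about 1000 amino acids long"
MOLECULE_LENGTH_NM = 300.0      # "the entire molecule is 300 nm long"

# Idealized repeat: glycine at every third position.
TRIPLET = ("Gly", "X", "Y")


def build_chain(n_residues):
    """Return an idealized alpha chain as a list of residue labels."""
    return [TRIPLET[i % 3] for i in range(n_residues)]


chain = build_chain(CHAIN_LENGTH_RESIDUES)
glycine_count = chain.count("Gly")
full_triplets = CHAIN_LENGTH_RESIDUES // 3

# Assumption: the entire chain is helical and spans the whole rod.
rise_per_residue_nm = MOLECULE_LENGTH_NM / CHAIN_LENGTH_RESIDUES

if __name__ == "__main__":
    print("complete Gly-X-Y triplets:", full_triplets)
    print("glycine residues:", glycine_count)
    print("average rise per residue (nm):", rise_per_residue_nm)
```

Under these assumptions roughly one-third of the residues are glycine, which is why the text describes collagens as extremely rich in this amino acid.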

The human genome contains 42 distinct genes coding for different collagen a 
chains. Different combinations of these genes are expressed in different tissues. 
Although in principle thousands of types of triple-stranded collagen molécules 
could be assembled from various combinations of the 42 a chains, only a limited 
number of triple-helical combinations are possible, and roughly 40 types of col¬ 
lagen molécules hâve been found. Type I is by far the most common, being the 
principal collagen of skin and bone. It belongs to the class of fibrillar collagens, 
or fibril-forming collagens: after being secreted into the extracellular space, they 
assemble into higher-order polymers called collagen fibrils, which are thin struc¬ 
tures (10-300 nm in diameter) many hundreds of micrometers long in mature 
tissues, where they are clearly visible in électron micrographs (Figure 19-40; see 
also Figure 19-38). Collagen fibrils often aggregate into larger, cablelike bundles, 
several micrometers in diameter, that are visible in the light microscope as colla¬ 
gen fibers. 
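
The gap between the combinations that are possible in principle and the roughly 40 collagen types actually found can be made concrete with a quick count. The sketch below assumes, purely for illustration, that any three of the 42 α chains could combine and that the order of the chains within the triple helix does not matter; neither assumption comes from the text, and the function name is hypothetical.

```python
from math import comb

N_ALPHA_CHAINS = 42       # distinct collagen α-chain genes (figure from the text)
CHAINS_PER_MOLECULE = 3   # a collagen molecule is a triple helix

def unordered_triples(n_chains: int, k: int = CHAINS_PER_MOLECULE) -> int:
    """Count multisets of size k drawn from n_chains (repeats allowed).

    Illustrative assumption: chain order within the helix is ignored.
    """
    return comb(n_chains + k - 1, k)

theoretical = unordered_triples(N_ALPHA_CHAINS)
print(theoretical)  # 13244
```

Under these assumptions the count runs to many thousands, consistent with the "thousands of types" mentioned above, whereas only about 40 collagen types are observed.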

Collagen types IX and XII are called fibril-associated collagens because they
decorate the surface of collagen fibrils. They are thought to link these fibrils to
one another and to other components in the extracellular matrix. Type IV is a
network-forming collagen, forming a major part of basal laminae, while type VII
molecules form dimers that assemble into specialized structures called anchoring
fibrils. Anchoring fibrils help attach the basal lamina of multilayered epithelia
to the underlying connective tissue and therefore are especially abundant in the
skin. There are also a number of "collagen-like" proteins containing short
collagen-like segments. These include collagen type XVII, which has a transmembrane
domain and is found in hemidesmosomes, and type XVIII, the core protein of a
proteoglycan in basal laminae.

Many proteins appear to have evolved by repeated duplications of an original
DNA sequence, giving rise to a repetitive pattern of amino acids. The genes that
encode the α chains of most of the fibrillar collagens provide a good example: they
are very large (up to 44 kilobases in length) and contain about 50 exons. Most of
Figure 19-40 A fibroblast surrounded by
collagen fibrils in the connective tissue
of embryonic chick skin. In this electron
micrograph, the fibrils are organized into
bundles that run approximately at right
angles to one another. Therefore, some
bundles are oriented longitudinally, whereas
others are seen in cross section. The
collagen fibrils are produced by fibroblasts.
(From C. Ploetz, E.I. Zycband and
D.E. Birk, J. Struct. Biol. 106:73-81, 1991.
With permission from Elsevier.)
Table 19-2 Some Types of Collagen and Their Properties

| Class | Type | Polymerized form | Tissue distribution | Mutant phenotype |
|---|---|---|---|---|
| Fibril-forming (fibrillar) | I | Fibril | Bone, skin, tendons, ligaments, cornea, internal organs (accounts for 90% of body collagen) | Severe bone defects, fractures (osteogenesis imperfecta) |
| | II | Fibril | Cartilage, intervertebral disc, notochord, vitreous humor of the eye | Cartilage deficiency, dwarfism (chondrodysplasia) |
| | III | Fibril | Skin, blood vessels, internal organs | Fragile skin, loose joints, blood vessels prone to rupture (Ehlers-Danlos syndrome) |
| | V | Fibril (with type I) | As for type I | Fragile skin, loose joints, blood vessels prone to rupture |
| | XI | Fibril (with type II) | As for type II | Myopia, blindness |
| Fibril-associated | IX | Lateral association with type II fibrils | Cartilage | Osteoarthritis |
| Network-forming | IV | Sheetlike network | Basal lamina | Kidney disease (glomerulonephritis), deafness |
| | VII | Anchoring fibrils | Beneath stratified squamous epithelia | Skin blistering |
| Transmembrane | XVII | Nonfibrillar | Hemidesmosomes | Skin blistering |
| Proteoglycan core protein | XVIII | Nonfibrillar | Basal lamina | Myopia, detached retina, hydrocephalus |

Note that types I, IV, V, IX, and XI are each composed of two or three types of α chains (distinct, nonoverlapping sets in each
case), whereas types II, III, VII, XVII, and XVIII are composed of only one type of α chain each.


the exons are 54, or multiples of 54, nucleotides long, suggesting that these
collagens originated through multiple duplications of a primordial gene containing 54
nucleotides and encoding exactly six Gly-X-Y repeats (see Figure 19-39).
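
The arithmetic behind the 54-nucleotide unit is simple: three nucleotides per codon gives 18 amino acids, and three amino acids per Gly-X-Y triplet gives six repeats. The following sketch spells this out and also checks the "glycine at every third position" rule on a peptide string. The function names and the example peptide are hypothetical, chosen only for illustration.

```python
NUCLEOTIDES_PER_CODON = 3
RESIDUES_PER_REPEAT = 3  # one Gly-X-Y triplet

def gly_xy_repeats(exon_length_nt: int) -> int:
    """Number of whole Gly-X-Y repeats encoded by an exon of this length."""
    codons, leftover = divmod(exon_length_nt, NUCLEOTIDES_PER_CODON)
    if leftover or codons % RESIDUES_PER_REPEAT:
        raise ValueError("exon length does not encode whole Gly-X-Y repeats")
    return codons // RESIDUES_PER_REPEAT

def is_gly_xy(peptide: str) -> bool:
    """True if the one-letter peptide has glycine (G) at every third position."""
    if not peptide or len(peptide) % RESIDUES_PER_REPEAT:
        return False
    return all(peptide[i] == "G" for i in range(0, len(peptide), RESIDUES_PER_REPEAT))

print(gly_xy_repeats(54))   # 6
print(gly_xy_repeats(108))  # 12

# Made-up 18-residue peptide, the length one 54-nucleotide exon would encode.
example = "GPAGPPGEKGPSGARGPQ"
print(is_gly_xy(example))   # True
```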

Table 19-2 provides additional details for some of the collagen types discussed 
in this chapter. 

Secreted Fibril-Associated Collagens Help Organize the Fibrils 

In contrast to GAGs, which resist compressive forces, collagen fibrils form
structures that resist tensile forces. The fibrils have various diameters and are
organized in different ways in different tissues. In mammalian skin, for example, they
are woven in a wickerwork pattern so that they resist tensile stress in multiple
directions; leather consists of this material, suitably preserved. In tendons,
collagen fibrils are organized in parallel bundles aligned along the major axis of
tension. In mature bone and in the cornea, they are arranged in orderly plywoodlike
layers, with the fibrils in each layer lying parallel to one another but nearly at right
angles to the fibrils in the layers on either side. The same arrangement occurs in
tadpole skin (Figure 19-41).

The connective-tissue cells themselves determine the size and arrangement of
the collagen fibrils. The cells can express one or more genes for the different types
of fibrillar collagen molecules. But even fibrils composed of the same mixture of
collagens have different arrangements in different tissues. How is this achieved?
Part of the answer is that cells can regulate the disposition of the collagen

Figure 19-41 Collagen fibrils in the tadpole skin. This electron micrograph
shows the plywoodlike arrangement of the fibrils: successive layers of fibrils
are laid down nearly at right angles to each other. This organization is also
found in mature bone and in the cornea. (Courtesy of Jerome Gross.)
Figure 19-42 Type IX collagen. (A) Type IX collagen molecules binding
in a periodic pattern to the surface of a fibril containing type II collagen.
(B) Electron micrograph of a rotary-shadowed type-II-collagen-containing
fibril in cartilage, decorated by type IX collagen molecules. (C) An individual
type IX collagen molecule. (B and C, from L. Vaughan et al., J. Cell Biol.
106:991-997, 1988. With permission from The Rockefeller University Press.)


molecules after secretion by guiding collagen fibril formation near the plasma
membrane. In addition, cells can influence this organization by secreting, along
with their fibrillar collagens, different kinds and amounts of other matrix
macromolecules. In particular, they secrete the fibrous protein fibronectin, as we
discuss later, and this precedes the formation of collagen fibrils and helps guide
their organization.

Fibril-associated collagens, such as types IX and XII collagens, are thought
to be especially important in organizing collagen fibrils. They differ from
fibrillar collagens in the following ways. First, their triple-stranded helical structure is
interrupted by one or two short nonhelical domains, which makes the molecules
more flexible than fibrillar collagen molecules. Second, they do not aggregate
with one another to form fibrils in the extracellular space. Instead, they bind in
a periodic manner to the surface of fibrils formed by the fibrillar collagens. Type
IX molecules bind to type-II-collagen-containing fibrils in cartilage, the cornea,
and the vitreous of the eye (Figure 19-42), whereas type XII molecules bind to
type-I-collagen-containing fibrils in tendons and various other tissues.

Fibril-associated collagens are thought to mediate the interactions of collagen
fibrils with one another and with other matrix macromolecules to help determine
the organization of the fibrils in the matrix.


Cells Help Organize the Collagen Fibrils They Secrete by Exerting 
Tension on the Matrix 

Cells interact with the extracellular matrix mechanically as well as chemically,
and studies in culture suggest that the mechanical interaction can have dramatic
effects on the architecture of connective tissue. Thus, when fibroblasts are mixed
with a meshwork of randomly oriented collagen fibrils that form a gel in a culture
dish, the fibroblasts tug on the meshwork, drawing in collagen from their
surroundings and thereby causing the gel to contract to a small fraction of its initial
volume. By similar activities, a cluster of fibroblasts surrounds itself with a capsule
of densely packed and circumferentially oriented collagen fibers.

If two small pieces of embryonic tissue containing fibroblasts are placed far
apart on a collagen gel, the intervening collagen becomes organized into a
compact band of aligned fibers that connect the two explants (Figure 19-43). The
fibroblasts subsequently migrate out from the explants along the aligned collagen
fibers. Thus, the fibroblasts influence the alignment of the collagen fibers, and the
collagen fibers in turn affect the distribution of the fibroblasts.

Fibroblasts may have a similar role in organizing the extracellular matrix inside
the body. First they synthesize the collagen fibrils and deposit them in the correct
orientation. Then they work on the matrix they have secreted, crawling over it and
tugging on it so as to create tendons and ligaments and the tough, dense layers of
connective tissue that surround and bind together most organs.



Figure 19-43 The shaping of the
extracellular matrix by cells. This
micrograph shows a region between two
pieces of embryonic chick heart (rich in
fibroblasts as well as heart muscle cells)
that were cultured on a collagen gel for
4 days. A dense tract of aligned collagen
fibers has formed between the explants,
presumably as a result of the fibroblasts
in the explants tugging on the collagen.
(From D. Stopak and A. K. Harris, Dev. Biol.
90:383-398, 1982. With permission from
Academic Press.)









Figure 19-44 Elastic fibers. These
scanning electron micrographs show
(A) a low-power view of a segment of a
dog's aorta and (B) a high-power view
of the dense network of longitudinally
oriented elastic fibers in the outer layer
of the same blood vessel. All the other
components have been digested away with
enzymes and formic acid. (From K.S. Haas
et al., Anat. Rec. 230:86-96, 1991. With
permission from Wiley-Liss.)


Elastin Gives Tissues Their Elasticity 

Many vertebrate tissues, such as skin, blood vessels, and lungs, need to be both
strong and elastic in order to function. A network of elastic fibers in the
extracellular matrix of these tissues gives them the resilience to recoil after transient
stretch (Figure 19-44). Elastic fibers are at least five times more extensible than a
rubber band of the same cross-sectional area. Long, inelastic collagen fibrils are
interwoven with the elastic fibers to limit the extent of stretching and prevent the
tissue from tearing.

The main component of elastic fibers is elastin, a highly hydrophobic protein
(about 750 amino acids long), which, like collagen, is unusually rich in proline and
glycine but, unlike collagen, is not glycosylated. Soluble tropoelastin (the
biosynthetic precursor of elastin) is secreted into the extracellular space and assembled
into elastic fibers close to the plasma membrane, generally in cell-surface
infoldings. After secretion, the tropoelastin molecules become highly cross-linked to
one another, generating an extensive network of elastin fibers and sheets.

The elastin protein is composed largely of two types of short segments that
alternate along the polypeptide chain: hydrophobic segments, which are
responsible for the elastic properties of the molecule; and alanine- and lysine-rich
α-helical segments, which are cross-linked to adjacent molecules by covalent
attachment of lysine residues. Each segment is encoded by a separate exon. There is
still uncertainty concerning the conformation of elastin molecules in elastic fibers
and how the structure of these fibers accounts for their rubberlike properties.
However, it seems that parts of the elastin polypeptide chain, like the polymer
chains in ordinary rubber, adopt a loose "random coil" conformation, and it is the
random coil nature of the component molecules cross-linked into the elastic fiber
network that allows the network to stretch and recoil like a rubber band (Figure
19-45).

Elastin is the dominant extracellular matrix protein in arteries, comprising
50% of the dry weight of the largest artery, the aorta (see Figure 19-44).
Mutations in the elastin gene causing a deficiency of the protein in mice or humans
result in narrowing of the aorta and other arteries and excessive proliferation of
smooth muscle cells in the arterial wall. Apparently, the normal elasticity of an
artery is required to restrain the proliferation of these cells.

Elastic fibers do not consist solely of elastin. The elastin core is covered with a
sheath of microfibrils, each of which has a diameter of about 10 nm. The
microfibrils appear before elastin in developing tissues and seem to provide scaffolding
to guide elastin deposition. Arrays of microfibrils are elastic in their own right,
and in some places they persist in the absence of elastin: they help to hold the
lens in its place in the eye, for example. Microfibrils are composed of a number
of distinct glycoproteins, including the large glycoprotein fibrillin, which binds to
Figure 19-45 Stretching a network of
elastin molecules. The molecules are
joined together by covalent bonds (red)
to generate a cross-linked network. In
this model, each elastin molecule in the
network can extend and contract in a
manner resembling a random coil, so that
the entire assembly can stretch and recoil
like a rubber band.


elastin and is essential for the integrity of elastic fibers. Mutations in the fibrillin
gene result in Marfan's syndrome, a relatively common human disorder. In the
most severely affected individuals, the aorta is prone to rupture; other common
effects include displacement of the lens and abnormalities of the skeleton and
joints. Affected individuals are often unusually tall and lanky: Abraham Lincoln is
suspected to have had the condition.

Fibronectin and Other Multidomain Glycoproteins Help Organize 
the Matrix 

In addition to proteoglycans, collagens, and elastic fibers, the extracellular matrix
contains a large and varied assortment of glycoproteins that typically have
multiple domains, each with specific binding sites for other matrix macromolecules
and for receptors on the surface of cells (Figure 19-46). These proteins therefore
contribute to both organizing the matrix and helping cells attach to it. Like the
proteoglycans, they also guide cell movements in developing tissues, by serving
Figure 19-46 Complex glycoproteins
of the extracellular matrix. Many matrix
glycoproteins are large scaffold proteins
containing multiple copies of specific
protein-interaction domains. Each domain
is folded into a discrete globular structure,
and many such domains are arrayed
along the protein like beads on a string.
This diagram shows four representative
proteins among the roughly 200 matrix
glycoproteins that are found in mammals.
Each protein contains multiple repeat
domains, with the names listed in the key
at the bottom. Fibronectin, for example,
contains numerous copies of three different
fibronectin repeats (types I-III, labeled
here as FN1, FN2, and FN3). Two type
III repeats near the C-terminus contain
important binding sites for cell-surface
integrins, whereas other FN repeats are
involved in binding fibrin, collagen, and
heparin, as indicated (see Figure 19-47).
Other matrix proteins contain repeated
sequences resembling those of epidermal
growth factor (EGF), a major regulator
of cell growth and proliferation; these
repeats might serve a similar signaling
function in matrix proteins. Other proteins
contain domains, such as the insulin-like
growth factor-binding protein (IGFBP)
repeat, that bind and regulate the function
of soluble growth factors. To add more
structural diversity, many of these proteins
are encoded by RNA transcripts that
can be spliced in different ways, adding
or removing exons, such as those in
fibronectin. Finally, the scaffolding and
regulatory functions of many matrix
proteins are further expanded by assembly
into multimeric forms, as shown at the
right: fibronectin forms dimers linked
at the C-termini, whereas tenascin and
thrombospondin form N-terminally linked
hexamers and trimers, respectively.
Other domains include four repeats from
thrombospondin (TSPN, TSP1, TSP3,
TSP_C). VWC, von Willebrand type C;
FBG, fibrinogen-like. (Adapted from
R.O. Hynes and A. Naba, Cold Spring
Harb. Perspect. Biol. 4:a004903, 2012.)
as tracks along which cells can migrate or as repellents that keep cells out of
forbidden areas. They can also bind and thereby influence the function of peptide
growth factors and other small molecules produced by nearby cells.

The best-understood member of this class of matrix proteins is fibronectin,
a large glycoprotein found in all vertebrates and important for many
cell-matrix interactions. Mutant mice that are unable to make fibronectin die early in
embryogenesis because their endothelial cells fail to form proper blood vessels.
The defect is thought to result from abnormalities in the interactions of these cells
with the surrounding extracellular matrix, which normally contains fibronectin.

Fibronectin is a dimer composed of two very large subunits joined by disulfide
bonds at their C-terminal ends. Each subunit contains a series of small repeated
domains, or modules, separated by short stretches of flexible polypeptide chain
(Figure 19-47). Each domain is usually encoded by a separate exon, suggesting
that the fibronectin gene, like the genes encoding many matrix proteins, evolved
by multiple exon duplications. In the human genome, there is only one fibronectin
gene, containing about 50 exons of similar size, but the transcripts can be spliced
in different ways to produce multiple fibronectin isoforms (see Figure 19-46).
The major repeat domain in fibronectin is called the type III fibronectin repeat,
which is about 90 amino acids long and occurs at least 15 times in each subunit.
This repeat is among the most common of all protein domains in vertebrates.

Fibronectin Binds to Integrins 


Figure 19-47 The structure of
a fibronectin dimer. (A) Electron
micrographs of individual fibronectin dimer
molecules shadowed with platinum; red
arrows mark the joined C-termini. (B) The
two polypeptide chains are similar but
generally not identical (being made from
the same gene but from differently spliced
mRNAs). They are joined by two disulfide
bonds near the C-termini. Each chain is
almost 2500 amino acids long and is folded
into multiple domains (see Figure 19-46).
As indicated, some domains are specialized
for binding to a particular molecule. For
simplicity, not all of the known binding
sites are shown. (C) The three-dimensional
structure of the ninth and tenth type III
fibronectin repeats, as determined by
x-ray crystallography. Both the Arg-Gly-Asp
(RGD) and the "synergy" sequences
shown in red are important for binding to
integrins on cell surfaces. (A, from J. Engel
et al., J. Mol. Biol. 150:97-120, 1981. With
permission from Academic Press; C, from
Daniel J. Leahy, Annu. Rev. Cell Dev. Biol.
13:363-393, 1997. With permission from
Annual Reviews.)


One way to analyze a complex multifunctional protein molecule such as
fibronectin is to synthesize individual regions of the protein and test their ability to bind
other proteins. By these and other methods, it was possible to show that one
region of fibronectin binds to collagen, another to proteoglycans, and another
to specific integrins on the surface of various types of cells (see Figure 19-47B).
Synthetic peptides corresponding to different segments of the integrin-binding
domain were then used to show that binding depends on a specific tripeptide
sequence (Arg-Gly-Asp, or RGD) that is found in one of the type III repeats (see
Figure 19-47C). Even very short peptides containing this RGD sequence can
compete with fibronectin for the binding site on cells, thereby inhibiting the
attachment of the cells to a fibronectin matrix.
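
Locating a short motif such as RGD in a protein sequence is a simple string search. The sketch below is a minimal illustration, assuming sequences written in the one-letter amino acid code (R = Arg, G = Gly, D = Asp); the function name and the toy peptide are hypothetical and do not represent real fibronectin sequence data.

```python
def find_motif(sequence: str, motif: str = "RGD") -> list[int]:
    """Return the 0-based start position of every occurrence of motif."""
    positions = []
    start = sequence.find(motif)
    while start != -1:
        positions.append(start)
        # Resume one residue later so overlapping matches are not missed.
        start = sequence.find(motif, start + 1)
    return positions

# Made-up peptide for illustration only.
toy_peptide = "AVTGRGDSPASSK"
print(find_motif(toy_peptide))  # [4]
```

A hit from such a search only marks a candidate site. As the text goes on to explain, tight integrin binding requires more than the RGD sequence alone.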

Several extracellular proteins besides fibronectin also have an RGD sequence
that mediates cell-surface binding. Many of these proteins are components of
the extracellular matrix, while others are involved in blood clotting. Peptides
Figure 19-48 Tension-sensing by
fibronectin. Some type III fibronectin
repeats are thought to unfold when
fibronectin is stretched. The unfolding
exposes cryptic binding sites that interact
with other fibronectin molecules, resulting
in the formation of fibronectin filaments like
those shown in Figure 19-49. (From
V. Vogel and M. Sheetz, Nat. Rev. Mol. Cell
Biol. 7:265-275, 2006. With permission
from Macmillan Publishers Ltd.)


containing the RGD sequence have been useful in the development of
anti-clotting drugs. Some snakes use a similar strategy to cause their victims to bleed:
they secrete RGD-containing anti-clotting proteins called disintegrins into their
venom.

The cell-surface receptors that bind RGD-containing proteins are members of
the integrin family, which we describe in detail later. Each integrin specifically
recognizes its own small set of matrix molecules, indicating that tight binding
requires more than just the RGD sequence. Moreover, RGD sequences are not the
only sequence motifs used for binding to integrins: many integrins recognize and
bind to other motifs instead.


Tension Exerted by Cells Regulates the Assembly of Fibronectin
Fibrils

Fibronectin can exist both in a soluble form, circulating in the blood and other
body fluids, and as insoluble fibronectin fibrils, in which fibronectin dimers are
cross-linked to one another by additional disulfide bonds and form part of the
extracellular matrix. Unlike fibrillar collagen molecules, however, which can
self-assemble into fibrils in a test tube, fibronectin molecules assemble into
fibrils only on the surface of cells, and only where those cells possess
appropriate fibronectin-binding proteins, in particular integrins. The integrins provide a
linkage from the fibronectin outside the cell to the actin cytoskeleton inside it. The
linkage transmits tension to the fibronectin molecules (provided that they also
have an attachment to some other structure) and stretches them, exposing
cryptic binding sites in the fibronectin molecules (Figure 19-48). This allows them
to bind directly to one another and to recruit additional fibronectin molecules
to form a fibril (Figure 19-49). This dependence on tension and interaction with
cell surfaces ensures that fibronectin fibrils assemble where there is a mechanical
need for them and not in inappropriate locations such as the bloodstream.

Many other extracellular matrix proteins contain multiple copies of the type 
III fibronectin repeat (see Figure 19-46), and it is possible that tension exerted 
on these proteins also uncovers cryptic binding sites and thereby influences their 
behavior. 

The Basal Lamina Is a Specialized Form of Extracellular Matrix 

Thus far in this section we have reviewed the general principles underlying the
structure and function of the major classes of extracellular matrix components.
We now describe how some of these components are assembled into a specialized
type of extracellular matrix called the basal lamina (also known as the basement
membrane). This exceedingly thin, tough, flexible sheet of matrix molecules is an
essential underpinning of all epithelia. Although small in volume, it has a
critical role in the architecture of the body. Like the cadherins, it seems to be one of
the defining features common to all multicellular animals, and it seems to have
appeared very early in their evolution. The major molecular components of the
basal lamina are among the most ancient extracellular matrix macromolecules.

Basal laminae are typically 40-120 nm thick. A sheet of basal lamina not only
lies beneath epithelial cells but also surrounds individual muscle cells, fat cells,



Figure 19-49 Organization of fibronectin
into fibrils at the cell surface. This
fluorescence micrograph shows the front
end of a migrating mouse fibroblast.
Extracellular fibronectin is stained green
and intracellular actin filaments are stained
red. The fibronectin is initially present as
small dotlike aggregates near the leading
edge of the cell. It accumulates at focal
adhesions (sites of anchorage of actin
filaments, discussed later) and becomes
organized into fibrils parallel to the actin
filaments. Integrin molecules spanning the
cell membrane link the fibronectin outside
the cell to the actin filaments inside it (see
Figure 19-55). Tension exerted on the
fibronectin molecules through this linkage is
thought to stretch them, exposing binding
sites that promote fibril formation. (Courtesy
of Roumen Pankov and Kenneth Yamada.)
and Schwann cells (which wrap around peripheral nerve cell axons to form
myelin). The basal lamina thus separates these cells and epithelia from the
underlying or surrounding connective tissue and forms the mechanical connection
between them. In other locations, such as the kidney glomerulus, a basal
lamina lies between two cell sheets and functions as a selective filter (Figure 19-50).
Basal laminae have more than simple structural and filtering roles, however. They
are able to determine cell polarity; influence cell metabolism; organize the
proteins in adjacent plasma membranes; promote cell survival, proliferation, or
differentiation; and serve as highways for cell migration.

The mechanical role is nevertheless essential. In the skin, for example, the
epithelial outer layer (the epidermis) depends on the strength of the basal lamina
to keep it attached to the underlying connective tissue (the dermis). In people
with genetic defects in certain basal lamina proteins or in a special type of
collagen that anchors the basal lamina to the underlying connective tissue, the
epidermis becomes detached from the dermis. This causes a blistering disease called
junctional epidermolysis bullosa, a severe and sometimes lethal condition.

Laminin and Type IV Collagen Are Major Components of the Basal 
Lamina 

The basal lamina is synthesized by the cells on each side of it: the epithelial cells
contribute one set of basal lamina components, while cells of the underlying bed
of connective tissue (called the stroma, Greek for "bedding") contribute another
set (Figure 19-51). Although the precise composition of the mature basal lamina
varies from tissue to tissue and even from region to region in the same lamina, it
Figure 19-50 Three ways in which
basal laminae are organized. Basal
laminae (yellow) surround certain cells
(such as skeletal muscle cells), underlie
epithelia, and are interposed between two
cell sheets (as in the kidney glomerulus).
Note that, in the kidney glomerulus, both
cell sheets have gaps in them, and the
basal lamina has a filtering as well as a
supportive function, helping to determine
which molecules will pass into the urine
from the blood. The filtration also depends
on other protein-based structures, called
slit diaphragms, that span the intercellular
gaps in the epithelial sheet.


Figure 19-51 The basal lamina in
the cornea of a chick embryo. In this
scanning electron micrograph, some of
the epithelial cells have been removed to
expose the upper surface of the matlike
basal lamina. A network of collagen fibrils in
the underlying connective tissue interacts
with the lower face of the lamina. (Courtesy
of Robert Trelstad.)
typically contains the glycoproteins laminin, type IV collagen, and nidogen, along
with the proteoglycan perlecan. Other common basal lamina components are
fibronectin and type XVIII collagen (an atypical member of the collagen family,
forming the core protein of a proteoglycan).

Laminin is the primary organizer of the sheet structure, and, early in
development, basal laminae consist mainly of laminin molecules. Laminins comprise
a large family of proteins, each composed of three long polypeptide chains (α, β,
and γ) held together by disulfide bonds and arranged in the shape of an
asymmetric bouquet, like a bunch of three flowers whose stems are twisted together
at the foot but whose heads remain separate (Figure 19-52). These heterotrimers
can self-assemble in vitro into a network, largely through interactions between
their heads, although interaction with cells is needed to organize the network into
an orderly sheet. Since there are several isoforms of each type of chain, and these
can associate in different combinations, many different laminins can be
produced, creating basal laminae with distinctive properties. The laminin γ1 chain is,
however, a component of most laminin heterotrimers; mice lacking it die during
embryogenesis because they are unable to make basal laminae.
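
The three-digit laminin naming scheme described in the caption of Figure 19-52 (laminin-111 contains α1, β1, and γ1 subunits) lends itself to a small worked example. The sketch below uses the isoform counts given in that caption (five α, four β, three γ) to enumerate every name the scheme could in principle produce. The function names are hypothetical, and the enumeration is only an upper bound: the text does not say that every combination actually assembles in cells.

```python
from itertools import product

# Isoform counts quoted in the caption of Figure 19-52.
ALPHA_CHAINS = range(1, 6)   # α1..α5
BETA_CHAINS = range(1, 5)    # β1..β4
GAMMA_CHAINS = range(1, 4)   # γ1..γ3

def laminin_name(alpha: int, beta: int, gamma: int) -> str:
    """Build a name such as 'laminin-332' from the three subunit numbers."""
    return f"laminin-{alpha}{beta}{gamma}"

def parse_laminin(name: str) -> dict:
    """Split a name such as 'laminin-332' into its α, β, and γ subunit numbers."""
    prefix, _, digits = name.partition("-")
    if prefix != "laminin" or len(digits) != 3 or not digits.isdigit():
        raise ValueError(f"not a three-digit laminin name: {name!r}")
    alpha, beta, gamma = (int(d) for d in digits)
    return {"alpha": alpha, "beta": beta, "gamma": gamma}

# Every name the scheme allows; an upper bound, not a list of known laminins.
candidates = [
    laminin_name(a, b, g)
    for a, b, g in product(ALPHA_CHAINS, BETA_CHAINS, GAMMA_CHAINS)
]
print(len(candidates))               # 60
print(parse_laminin("laminin-332"))  # {'alpha': 3, 'beta': 3, 'gamma': 2}
```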

Type IV collagen is a second essential component of mature basal laminae,
and it, too, exists in several isoforms. Like the fibrillar collagens that constitute
the bulk of the protein in connective tissues such as bone or tendon, type IV
collagen molecules consist of three separately synthesized long protein chains that
twist together to form a ropelike superhelix; however, they differ from the fibrillar
collagens in that the triple-stranded helical structure is interrupted in more than
20 regions, allowing multiple bends. Type IV collagen molecules interact via their
terminal domains to assemble extracellularly into a flexible, feltlike network that
gives the basal lamina tensile strength.

Laminin and type IV collagen interact with other basal lamina components,
such as the glycoprotein nidogen and the proteoglycan perlecan, resulting in a
highly cross-linked network of proteins and proteoglycans (Figure 19-53). The
laminin molecules that generate the initial sheet structure first join to each other
while bound to receptors on the surface of the cells that produce laminin. The
cell-surface receptors are primarily members of the integrin family, but another
important type of laminin receptor is dystroglycan, a proteoglycan with a core
protein that spans the cell membrane, dangling its GAG chains in the extracellular
space. Together, these receptors organize basal lamina assembly: they hold the
laminin molecules by their feet, leaving the laminin heads positioned to interact
so as to form a two-dimensional network. This laminin network then coordinates
the assembly of the other basal lamina components.

Basal Laminae Have Diverse Functions

In the kidney glomerulus, an unusually thick basal lamina acts as one of the layers
of a molecular filter, helping to prevent the passage of macromolecules from the
blood into the urine as urine is formed (see Figure 19-50). The proteoglycan in
Figure 19-52 The structure of laminin.
(A) The best-understood family member
is laminin-111, shown here with some
of its binding sites for other molecules
(yellow boxes). Laminins are multidomain
glycoproteins composed of three
polypeptides (α, β, and γ) that are
disulfide-bonded into an asymmetric crosslike
structure. Each of the polypeptide chains
is more than 1500 amino acids long. Five
types of α chains, four types of β chains,
and three types of γ chains are known, and
various combinations of these subunits can
assemble to form a large variety of different
laminins, which are named according
to numbers assigned to each of their
three subunits: laminin-111, for example,
contains α1, β1, and γ1 subunits. Each
isoform tends to have a specific tissue
distribution: laminin-332 is found in skin,
laminin-211 in muscle, and laminin-411 in
endothelial cells of blood vessels. Through
their binding sites for other proteins, laminin
molecules play a central part in organizing
basal laminae and anchoring them to
cells. (B) Electron micrographs of laminin
molecules shadowed with platinum.
(B, from J. Engel et al., J. Mol. Biol.
150:97-120, 1981. With permission
from Academic Press.)
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the basal lamina is important for this function: when its GAG chains are removed 
by spécifie enzymes, the filtering properties of the lamina are destroyed. Type IV 
collagen also has a rôle: in a human hereditary lcidney disorder (Alport syndrome), 
mutations in a type IV collagen gene resuit in an irregularly thiclcened and dys- 
functional glomerular filter. Laminin mutations, too, can disrupt the function of 
the kidney filter, but in a different way—by interfering with the différentiation of 
the cells that contact it and support it. 
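
The laminin naming convention given in the legend to Figure 19-52 can be made concrete with a short sketch. It assumes only the chain counts stated there (five α, four β, and three γ chain types); the function names are hypothetical, and the count of conceivable trimers is an upper bound, since the text says only that "various combinations" of subunits actually assemble.

```python
# Illustrative sketch only: chain counts come from the Figure 19-52 legend;
# the function names are hypothetical and not part of any library.

N_ALPHA, N_BETA, N_GAMMA = 5, 4, 3  # known α, β, and γ chain types


def laminin_name(alpha: int, beta: int, gamma: int) -> str:
    """Build a laminin name from its α, β, and γ chain numbers."""
    if not (1 <= alpha <= N_ALPHA and 1 <= beta <= N_BETA and 1 <= gamma <= N_GAMMA):
        raise ValueError("chain number outside the known range")
    return f"laminin-{alpha}{beta}{gamma}"


def conceivable_trimers() -> int:
    """Upper bound on αβγ combinations; not all of them necessarily occur."""
    return N_ALPHA * N_BETA * N_GAMMA


if __name__ == "__main__":
    print(laminin_name(1, 1, 1))   # α1, β1, γ1
    print(laminin_name(3, 3, 2))   # the isoform the legend places in skin
    print(conceivable_trimers())
```

Read this way, laminin-211 (muscle) and laminin-411 (endothelial cells of blood vessels) differ only in their α chain.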

The basal lamina can act as a selective barrier to the movement of cells, as
well as a filter for molecules. The lamina beneath an epithelium, for example,
usually prevents fibroblasts in the underlying connective tissue from making contact
with the epithelial cells. It does not, however, stop macrophages, lymphocytes, or
nerve processes from passing through it, using specialized protease enzymes to
cut a hole for their transit. The basal lamina is also important in tissue
regeneration after injury. When cells in tissues such as muscles, nerves, and epithelia
are damaged or killed, the basal lamina often survives and provides a scaffold along
which regenerating cells can migrate. In this way, the original tissue architecture
is readily reconstructed.

A particularly striking example of the role of the basal lamina in regeneration
comes from studies of the neuromuscular junction, the site where the nerve
terminals of a motor neuron form a chemical synapse with a skeletal muscle cell
(discussed in Chapter 11). In vertebrates, the basal lamina that surrounds the
muscle cell separates the nerve and muscle cell plasma membranes at the
synapse, and the synaptic region of the lamina has a distinctive chemical character,
with special isoforms of type IV collagen and laminin and a proteoglycan called
agrin. After a nerve or muscle injury, the basal lamina at the synapse has a central
role in reconstructing the synapse at the correct location (Figure 19-54). Defects
in components of the basal lamina at the synapse are responsible for some forms
of muscular dystrophy, in which muscles develop normally but then degenerate
later in life.

Figure 19-53 A model of the molecular structure of a basal lamina. (A) The
basal lamina is formed by specific interactions (B) between the proteins
laminin, type IV collagen, and nidogen, and the proteoglycan perlecan. Arrows
in (B) connect molecules that can bind directly to each other. There are various
isoforms of type IV collagen and laminin, each with a distinctive tissue distribution.
Transmembrane laminin receptors (integrins and dystroglycan) in the plasma
membrane are thought to organize the assembly of the basal lamina; only the
integrins are shown. (Based on H. Colognato and P.D. Yurchenco, Dev. Dyn.
218:213-234, 2000. With permission from Wiley-Liss.)


Figure 19-54 Regeneration experiments demonstrating the special character
of the junctional basal lamina at a neuromuscular junction. If a frog muscle
and its motor nerve are destroyed, the basal lamina around each muscle
cell remains intact and the sites of the old neuromuscular junctions are still
recognizable. When the nerve, but not the muscle, is allowed to regenerate (upper
right), the junctional basal lamina directs the regenerating nerve to the original
synaptic site. When the muscle, but not the nerve, is allowed to regenerate (lower
right), the junctional basal lamina causes newly made acetylcholine receptors (blue)
to accumulate at the original synaptic site. These experiments show that the
junctional basal lamina controls the localization of synaptic components on both
sides of the lamina. Some of the molecules responsible for these effects have been
identified. Motor neuron axons, for example, deposit agrin in the junctional basal
lamina, where it regulates the assembly of acetylcholine receptors and other proteins
in the junctional plasma membrane of the muscle cell. Reciprocally, muscle cells
deposit a particular isoform of laminin in the junctional basal lamina, and this
molecule is likely to interact with specific ion channels on the presynaptic membrane
of the neuron.


Cells Have to Be Able to Degrade Matrix, as Well as Make It

The ability of cells to degrade and destroy extracellular matrix is as important as
their ability to make it and bind to it. Rapid matrix degradation is required in
processes such as tissue repair, and even in the seemingly static extracellular matrix
of adult animals there is a slow, continuous turnover, with matrix macromolecules
being degraded and resynthesized. This allows bone, for example, to be
remodeled so as to adapt to changes in the stresses on it.

From the point of view of individual cells, the ability to cut through matrix is
crucial in two ways: it enables them to divide while embedded in matrix, and it
enables them to travel through it. Cells in connective tissues generally need to be
able to stretch out in order to divide. If a cell lacks the enzymes needed to degrade
the surrounding matrix, it is strongly inhibited from dividing, as well as being
hindered from migrating.

Localized degradation of matrix components is also required wherever cells
have to escape from confinement by a basal lamina. It is needed during normal
branching growth of epithelial structures such as glands, for example, to allow the
population of epithelial cells to increase, and needed also when white blood cells
migrate across the basal lamina of a blood vessel into tissues in response to
infection or injury. Matrix degradation is important both for the spread of cancer cells
through the body and for their ability to proliferate in the tissues that they invade
(discussed in Chapter 20).

In general, matrix components are degraded by extracellular proteolytic
enzymes (proteases) that act close to the cells that produce them. Many of these
proteases belong to one of two general classes. The largest group, with about 50
members in vertebrates, is the matrix metalloproteases, which depend on bound
Ca²⁺ or Zn²⁺ for activity. The second group is the serine proteases, which have a
highly reactive serine in their active site. Together, metalloproteases and serine
proteases cooperate to degrade matrix proteins such as collagen, laminin, and
fibronectin. Some metalloproteases, such as the collagenases, are highly specific,
cleaving particular proteins at a small number of sites. In this way, the structural
integrity of the matrix is largely retained, while the limited amount of proteolysis
that occurs is sufficient for cell migration. Other metalloproteases may be less
specific, but, because they are anchored to the plasma membrane, they can act
just where they are needed; it is this type of matrix metalloprotease that is crucial
for a cell's ability to divide when embedded in matrix.

Clearly, the activities of the proteases that degrade the matrix must be tightly
controlled, if the fabric of the body is not to collapse in a heap. Numerous
mechanisms are therefore employed to ensure that matrix proteases are activated only
at the correct time and place. Protease activity is generally confined to the cell
surface by specific anchoring proteins, by membrane-associated activators, and
by the production of specific protease inhibitors in regions where protease
activity is not needed.

Matrix Proteoglycans and Glycoproteins Regulate the Activities of 
Secreted Proteins 

The physical properties of extracellular matrix are important for its fundamental
roles as a scaffold for tissue structure and as a substrate for cell anchorage and
migration. The matrix also has an important impact on cell signaling. Cells
communicate with each other by secreting signal molecules that diffuse through the
extracellular fluid to influence other cells (discussed in Chapter 15). En route
to their targets, the signal molecules encounter the tightly woven meshwork of
the extracellular matrix, which contains a high density of negative charges and
protein-interaction domains that can interact with the signal molecules, thereby
altering their function in a variety of ways.

The highly charged heparan sulfate chains of proteoglycans, for example,
interact with numerous secreted signal molecules, including fibroblast growth
factors (FGFs) and vascular endothelial growth factor (VEGF), which (among
other effects) stimulate a variety of cell types to proliferate. By providing a dense
array of growth factor binding sites, proteoglycans are thought to generate large
local reservoirs of these factors, limiting their diffusion and focusing their actions
on nearby cells. Similarly, proteoglycans might help generate steep growth factor
gradients in an embryo, which can be important in the patterning of tissues
during development. FGF activity can also be enhanced by proteoglycans, which
oligomerize the FGF molecules, enabling them to cross-link and activate their
cell-surface receptors more effectively.

The importance of proteoglycans as regulators of the distribution and activity
of signal molecules is illustrated by the severe developmental defects that can
occur when specific proteoglycans are inactivated by mutation. In Drosophila,
for example, the function of several signal proteins during development is
governed by interactions with the membrane-associated proteoglycans Dally and
Dally-like. These members of the glypican family are thought to concentrate signal
proteins in specific locations and act as co-receptors that collaborate with the
conventional cell-surface receptor proteins; as a result, they promote signaling in
the correct location and prevent it in the wrong locations. In the Drosophila ovary,
for example, Dally is partly responsible for the restricted localization and function
of a signaling protein called Dpp, which blocks differentiation of the germ-line
stem cells: when the gene encoding Dally is mutated, Dpp activity is greatly
reduced and oocyte development is abnormal.

Several matrix proteins also interact with signal proteins. The type IV collagen
of basal laminae interacts with Dpp in Drosophila, for example. Fibronectin
contains a type III fibronectin repeat that interacts with VEGF, and another domain
that interacts with another growth factor called hepatocyte growth factor (HGF),
thereby promoting the activities of these factors. As discussed earlier, many matrix
glycoproteins contain extensive arrays of binding domains, and the arrangement
of these domains is likely to influence the presentation of signal proteins to their
target cells (see Figure 19-46).

Finally, many matrix glycoproteins contain domains that bind directly to specific
cell-surface receptors, thereby generating signals that influence the behavior
of the cells, as we describe in the next section.

Summary 

Cells are embedded in an intricate extracellular matrix, which not only binds the
cells together but also influences their survival, development, shape, polarity, and
migratory behavior. The matrix contains various protein fibers interwoven in a
network of glycosaminoglycan (GAG) chains. GAGs are negatively charged
polysaccharide chains that (except for hyaluronan) are covalently linked to protein to form
proteoglycan molecules. GAGs attract water and occupy a large volume of
extracellular space. Proteoglycans are also found on the surface of cells, where they often
function as co-receptors to help cells respond to secreted signal proteins.
Fiber-forming proteins give the matrix strength and resilience. The fibrillar collagens (types I,
II, III, V, and XI) are ropelike, triple-stranded helical molecules that aggregate into
long fibrils in the extracellular space, thereby providing tensile strength. They also
form structures to which cells can be anchored, often via large multidomain
glycoproteins, such as laminin and fibronectin, that bind to integrins on the cell surface.
Elasticity is provided by elastin molecules, which form an extensive cross-linked
network of fibers and sheets that can stretch and recoil.

The basal lamina is a specialized form of extracellular matrix that underlies
epithelial cells or is wrapped around certain other cell types, such as muscle cells.
Basal laminae are organized on a framework of laminin molecules, which are
linked together by their side-arms and bind to integrins and other receptors in the
basal plasma membrane of overlying epithelial cells. Type IV collagen molecules,
together with the protein nidogen and the large heparan sulfate proteoglycan
perlecan, assemble into a sheetlike mesh that is an essential component of all mature
basal laminae. Basal laminae provide mechanical support for epithelia; they form
the interface and attachment between epithelia and connective tissue; they serve as
filters in the kidney; they act as barriers to keep cells in their proper compartments;
they influence cell polarity and cell differentiation; and they guide cell migration
during development and tissue regeneration.

CELL-MATRIX JUNCTIONS 

Cells make extracellular matrix, organize it, and degrade it. The matrix in its turn
exerts powerful influences on the cells. The influences are exerted chiefly through
transmembrane cell adhesion proteins that act as matrix receptors. These proteins
tie the matrix outside the cell to the cytoskeleton inside it, but their role goes far
beyond simple passive mechanical attachment. Through them, components of
the matrix can affect almost any aspect of a cell's behavior. The matrix receptors
have a crucial role in epithelial cells, mediating their interactions with the basal
lamina beneath them. They are no less important in connective-tissue cells,
mediating the cells' interactions with the matrix that surrounds them.

Several types of molecules can function as matrix receptors or co-receptors,
including the transmembrane proteoglycans. But the principal receptors on animal
cells for binding most extracellular matrix proteins are the integrins. Like the
cadherins and the key components of the basal lamina, integrins are part of the
fundamental architectural toolkit that is characteristic of multicellular animals.
The members of this large family of homologous transmembrane adhesion
molecules have a remarkable ability to transmit signals in both directions across the
plasma membrane. The binding of a matrix component to an integrin can send a
message into the interior of the cell, and conditions in the cell interior can send a
signal outward to control binding of the integrin to the matrix. Tension applied to
an integrin can cause it to tighten its grip on intracellular and extracellular
structures, and loss of tension can loosen its hold, so that molecular signaling
complexes fall apart on either side of the membrane. In this way, integrins can serve
not only to transmit mechanical and molecular signals, but also to convert one
type of signal into the other.

Integrins Are Transmembrane Heterodimers That Link the
Extracellular Matrix to the Cytoskeleton

There are many varieties of integrins, but they all conform to a common plan.
An integrin molecule is composed of two noncovalently associated glycoprotein
subunits called α and β. Both subunits span the cell membrane, with short
intracellular C-terminal tails and large N-terminal extracellular domains (Figure
19-55). The extracellular domains bind to specific amino acid sequence motifs
in extracellular matrix proteins or, in some cases, in proteins on the surfaces of
other cells. The best-understood binding site for integrins is the RGD sequence
mentioned earlier (see Figure 19-47), which is found in fibronectin and other
extracellular matrix proteins. Some integrins bind a Leu-Asp-Val (LDV) sequence
in fibronectin and other proteins. Additional integrin-binding sequences, as yet
poorly defined, exist in laminins and collagens.

Humans contain 24 types of integrins, formed from the products of 8 different
β-chain genes and 18 different α-chain genes, dimerized in different
combinations. Each integrin dimer has distinctive properties and functions. Moreover,
because the same integrin molecule in different cell types can have different
ligand-binding specificities, it seems that additional cell-type-specific factors can
interact with integrins to modulate their binding activity. The binding of integrins
to their matrix ligands is also affected by the concentration of Ca²⁺ and Mg²⁺ in the
extracellular medium, reflecting the presence of divalent cation-binding domains
in the α and β subunits. The divalent cations can influence both the affinity and
the specificity of the binding of an integrin to its extracellular ligands.
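
The arithmetic behind this combinatorial diversity can be sketched in a few lines. The sketch assumes only the gene and dimer counts stated above (18 α-chain genes, 8 β-chain genes, 24 integrins); the function names are hypothetical. It suggests that subunit pairing is selective, since far fewer dimers exist than free pairing would allow.

```python
# Illustrative sketch only: the counts are taken from the text; the function
# and constant names are hypothetical.

N_ALPHA_GENES = 18       # human α-chain genes
N_BETA_GENES = 8         # human β-chain genes
N_KNOWN_INTEGRINS = 24   # human integrin heterodimers


def possible_pairings(n_alpha: int, n_beta: int) -> int:
    """Number of αβ heterodimers if every α chain could pair with every β chain."""
    return n_alpha * n_beta


def fraction_realized(n_known: int, n_alpha: int, n_beta: int) -> float:
    """Fraction of the conceivable αβ pairings that are actually observed."""
    return n_known / possible_pairings(n_alpha, n_beta)


if __name__ == "__main__":
    total = possible_pairings(N_ALPHA_GENES, N_BETA_GENES)
    share = fraction_realized(N_KNOWN_INTEGRINS, N_ALPHA_GENES, N_BETA_GENES)
    print(f"conceivable pairings: {total}")
    print(f"known integrins: {N_KNOWN_INTEGRINS} ({share:.0%} of conceivable)")
```

With these counts, 18 × 8 = 144 pairings are conceivable, and the 24 known integrins make up one-sixth of them.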

The intracellular portion of an integrin dimer binds to a complex of several
different proteins, which together form a linkage to the cytoskeleton. For all but
one of the 24 varieties of human integrins, this intracellular linkage is to actin
filaments. These linkages depend on proteins that assemble at the short cytoplasmic
tails of the integrin subunits (see Figure 19-55). A large adaptor protein called
talin is a component of the linkage in many cases, but numerous additional
proteins are also involved. Like the actin-linked cell-cell junctions formed by
cadherins, the actin-linked cell-matrix junctions formed by integrins may be small,
inconspicuous, and transient, or large, prominent, and durable. Examples of the
latter are the focal adhesions that form when fibroblasts have sufficient time to
establish strong attachments to the rigid surface of a culture dish, and the
myotendinous junctions that attach muscle cells to their tendons.


Figure 19-55 The subunit structure of an active integrin molecule, linking
extracellular matrix to the actin cytoskeleton. The N-terminal heads of
the integrin chains attach directly to an extracellular protein such as fibronectin;
the C-terminal intracellular tail of the integrin β subunit binds to adaptor
proteins that interact with filamentous actin. The best-understood adaptor is
a giant protein called talin, which contains a string of multiple domains for binding
actin and other proteins, such as vinculin, that help reinforce and regulate the linkage
to actin filaments. One end of talin binds to a specific site on the integrin β subunit
cytoplasmic tail; other regulatory proteins, such as kindlin, bind at another site on
the tail.

In epithelia, the most prominent cell-matrix attachment sites are the
hemidesmosomes, where a specific type of integrin anchors the cells to laminin in the
basal lamina. Here, uniquely, the intracellular attachment is to keratin intermediate
filaments, via the intracellular adaptor proteins plectin and BP230 (Figure
19-56).


Figure 19-56 Hemidesmosomes. (A) Hemidesmosomes spot-weld epithelial
cells to the basal lamina, linking laminin outside the cell to keratin filaments
inside it. (B) Molecular components of a hemidesmosome. A specialized integrin
(α6β4 integrin) spans the membrane, attaching to keratin filaments intracellularly
via adaptor proteins called plectin and BP230, and to laminin extracellularly. The
adhesive complex also contains, in parallel with the integrin, an unusual collagen
family member known as collagen type XVII; this has a membrane-spanning
domain attached to its extracellular collagenous portion. Defects in any of
these components can give rise to a blistering disease of the skin. One such
disease, called bullous pemphigoid, is an autoimmune disease in which the immune
system develops antibodies against collagen XVII or BP230.


Integrin Defects Are Responsible for Many Genetic Diseases 

Although there is some overlap in the activities of the different integrins (at least
five bind laminin, for example), it is the diversity of integrin functions that is more
remarkable. Table 19-3 lists some varieties of integrins and the problems that
result when individual integrin α or β chains are defective.

The β1 subunit forms dimers with at least 12 distinct α subunits and is found
on almost all vertebrate cells: α5β1 is a fibronectin receptor and α6β1 is a laminin
receptor on many types of cells. Mutant mice that cannot make any β1 integrins
die early in embryonic development. Mice that are only unable to make the α7
subunit (the partner for β1 in muscle) survive but develop muscular dystrophy (as
do mice that cannot make the laminin ligand for the α7β1 integrin).

Table 19-3 Some Types of Integrins

| Integrin | Ligand* | Distribution | Phenotype when α subunit is mutated | Phenotype when β subunit is mutated |
|---|---|---|---|---|
| α5β1 | Fibronectin | Ubiquitous | Death of embryo; defects in blood vessels, somites, neural crest | Early death of embryo (at implantation) |
| α6β1 | Laminin | Ubiquitous | Severe skin blistering; defects in other epithelia also | Early death of embryo (at implantation) |
| α7β1 | Laminin | Muscle | Muscular dystrophy; defective myotendinous junctions | Early death of embryo (at implantation) |
| αLβ2 (LFA1) | Ig superfamily counterreceptors (ICAM1) | White blood cells | Impaired recruitment of leukocytes | Leukocyte adhesion deficiency (LAD); impaired inflammatory responses; recurrent life-threatening infections |
| αIIbβ3 | Fibrinogen | Platelets | Bleeding; no platelet aggregation (Glanzmann's disease) | Bleeding; no platelet aggregation (Glanzmann's disease); mild osteopetrosis |
| α6β4 | Laminin | Hemidesmosomes in epithelia | Severe skin blistering; defects in other epithelia also | Severe skin blistering; defects in other epithelia also |

*Not all ligands are listed.

The β2 subunit forms dimers with at least four types of α subunit and is
expressed exclusively on the surface of white blood cells, where it has an essential
role in enabling these cells to fight infection. The β2 integrins mainly mediate
cell-cell rather than cell-matrix interactions, binding to specific ligands on another
cell, such as an endothelial cell. The ligands are members of the Ig superfamily of
cell-cell adhesion molecules. We have already described an example earlier in the
chapter: an integrin of this class (αLβ2, also known as LFA1) on white blood cells
enables them to attach firmly to the Ig family protein ICAM1 on vascular
endothelial cells at sites of infection (see Figure 19-28B). People with the genetic disease
called leukocyte adhesion deficiency fail to synthesize functional β2 subunits. As
a consequence, their white blood cells lack the entire family of β2 receptors, and
they suffer repeated bacterial infections.

The β3 integrins are found on blood platelets (as well as various other cells),
and they bind several matrix proteins, including the blood clotting factor
fibrinogen. Platelets have to interact with fibrinogen to mediate normal blood clotting,
and humans with Glanzmann's disease, who are genetically deficient in β3
integrins, suffer from defective clotting and bleed excessively.

Integrins Can Switch Between an Active and an Inactive 
Conformation 

A cell crawling through a tissue (a fibroblast or a macrophage, for example, or an
epithelial cell migrating along a basal lamina) has to be able both to make and to
break attachments to the matrix, and to do so rapidly if it is to travel quickly.
Similarly, a circulating white blood cell has to be able to switch on or off its tendency to
bind to endothelial cells in order to crawl out of a blood vessel at a site of
inflammation. Furthermore, if force is to be applied where it is needed, the making and
breaking of the extracellular attachments in all these cases has to be coupled to
the prompt assembly and disassembly of cytoskeletal attachments inside the cell.
The integrin molecules that span the membrane and mediate the attachments
cannot simply be passive, rigid objects with sticky patches at their two ends. They
must be able to switch between an active state, where they readily form
attachments, and an inactive state, where they do not.

Structural studies, using a combination of electron microscopy and x-ray
crystallography, suggest that integrins exist in multiple structural conformations that
reflect different states of activity (Figure 19-57). In the inactive state, the external
segments of the integrin dimer are folded together into a compact structure that
cannot bind matrix proteins. In this state, the cytoplasmic tails of the dimer are
hooked together, preventing their interaction with cytoskeletal linker proteins.
In the active state, the two integrin subunits are unhooked at the membrane to
expose the intracellular binding sites for cytoplasmic adaptor proteins, and the
external domains unfold and extend, like a pair of legs, to expose a high-affinity
matrix-binding site at the tips of the subunits. Thus, the switch from inactive
to active states depends on a major conformational change that simultaneously
exposes the external and internal ligand-binding sites at the ends of the integrin
molecule. External matrix binding and internal cytoskeleton linkages are thereby
coupled.

Figure 19-57 Integrins exist in two major activity states. Inactive (folded)
and active (extended) structures of an integrin molecule, based on data from x-ray
crystallography and other methods. [Figure labels: ACTIVE; strong ligand binding;
strong adaptor-protein binding.]

Figure 19-58 Activation of integrins by intracellular signaling. Signals received from outside the cell can act through various
intracellular mechanisms to stimulate integrin activation. In platelets, as illustrated here, the extracellular signal protein thrombin
activates a G-protein-coupled receptor on the cell surface, thereby initiating a signaling pathway that leads to activation of
Rap1, a member of the monomeric GTPase family. Activated Rap1 interacts with the protein RIAM, which then recruits talin to
the plasma membrane. Together with another protein called kindlin, talin interacts with the integrin β chain to trigger integrin
activation. Talin then interacts with adaptor proteins such as vinculin, resulting in the formation of an actin linkage (see Figure
19-55).

Talin regulation depends in part on an interaction between its flexible C-terminal rod domain and the N-terminal head domain
that contains the integrin-binding site. This interaction is thought to maintain talin in an inactive state when it is free in the
cytoplasm. When talin is recruited by RIAM to the plasma membrane, the talin head domain interacts with a phosphoinositide
called PI(4,5)P2 (not shown here, but see Figure 15-28), resulting in dissociation of the rod domain. Talin unfolds to expose its
binding sites for integrin and other proteins.

Switching between the inactive and active states is regulated by a variety of
mechanisms that vary, depending on the needs of the cell. In some cases,
activation occurs by an "outside-in" mechanism: the binding of an external matrix
protein, such as the RGD sequence of fibronectin, can drive some integrins to switch
from the low-affinity inactive state to the high-affinity active state. As a result,
binding sites for talin and other cytoplasmic adaptor proteins are exposed on the
tail of the β chain. The binding of these adaptor proteins then leads to attachment
of actin filaments to the intracellular end of the integrin molecule (see Figure
19-55). In this way, when the integrin catches hold of its ligand outside the cell,
the cell reacts by tying the integrin molecule to the cytoskeleton, so that force can
be applied at the point of cell attachment.

The chain of cause and effect can also operate in reverse, from inside to
outside. This "inside-out" integrin-activation process generally depends on
intracellular regulatory signals that stimulate the ability of talin and other proteins to
interact with the β chain of the integrin. Talin competes with the integrin α chain
for its binding site on the tail of the β chain. Thus, when talin binds to the β chain,
it blocks the intracellular α-β linkage, allowing the two legs of the integrin
molecule to spring apart.

The regulation of "inside-out" integrin activation is particularly well
understood in platelets, where an extracellular signal protein called thrombin binds
to a specific G-protein-coupled receptor (GPCR) on the cell surface and thereby
activates an intracellular signaling pathway that leads to integrin activation
(Figure 19-58). It is likely that similar signaling pathways govern integrin activation in
numerous other cell types.

Integrins Cluster to Form Strong Adhesions

Integrins, like other cell adhesion molecules, differ from cell-surface receptors for
hormones and for other extracellular soluble signal molecules in that they usually
bind their ligand with lower affinity and are present at a 10-100-fold higher
concentration on the cell surface. The Velcro principle, mentioned earlier in the
context of cadherin adhesion (see Figure 19-6C), operates here too. Following their
activation, integrins cluster together to create a dense plaque in which many
integrin molecules are anchored to cytoskeletal filaments. The resulting protein
structure can be remarkably large and complex, as seen in the focal adhesion made by
a fibroblast on the fibronectin-coated surface of a culture dish.

The assembly of mature cell-matrix junctional complexes depends on the
recruitment of dozens of different scaffolding and signaling proteins. Talin is a
major component of many cell-matrix complexes, but numerous other proteins
also make important contributions. These include the integrin-linked kinase (ILK)
and its binding partners pinch and parvin, which together form a trimeric
complex that serves as an organizing hub at many junctions. Cell-matrix junctions
also employ several actin-binding proteins, such as vinculin, zyxin, VASP, and
α-actinin, to promote the assembly and organization of actin filaments. Another
critical component of many cell-matrix junctions is the focal adhesion kinase
(FAK), which interacts with multiple components in the junction and serves an
important function in signaling, as we describe next.

Extracellular Matrix Attachments Act Through Integrins to Control
Cell Proliferation and Survival

Like other transmembrane cell adhesion proteins, integrins do more than just
create attachments. They also activate intracellular signaling pathways and thereby
allow control of almost any aspect of the cell's behavior according to the nature of
the surrounding matrix and the state of the cell's attachments to it.

Many cells will not grow or proliferate in culture unless they are attached to
extracellular matrix; nutrients and soluble growth factors in the culture medium
are not enough. For some cell types, including epithelial, endothelial, and
muscle cells, even cell survival depends on such attachments. When these cells lose
contact with the extracellular matrix, they undergo apoptosis. This dependence of
cell growth, proliferation, and survival on attachment to a substratum is known as
anchorage dependence, and it is mediated mainly by integrins and the
intracellular signals they generate. Mutations that disrupt or override this form of control,
allowing cells to escape from anchorage dependence, occur in cancer cells and
play a major part in their invasive behavior.

Our understanding of anchorage dependence has come mainly from studies
of cells living on the surface of matrix-coated culture dishes. For connective-tissue
cells that are normally surrounded by matrix on all sides, this is a far cry from the
natural environment. Walking over a two-dimensional plain is very different from
clambering through a three-dimensional jungle. The types of contacts that cells
make with a rigid substratum are not the same as those, much less well studied,
that they make with the deformable web of fibers of the extracellular matrix, and
there are substantial differences in cell behavior in the two contexts. Nevertheless,
it is likely that the same basic principles apply. Both in vitro and in vivo,
intracellular signals generated at cell-matrix adhesion sites are crucial for cell
proliferation and survival.

Integrins Recruit Intracellular Signaling Proteins at Sites of 
Cell-Matrix Adhesion 

The mechanisms by which integrins signal into the cell interior are complex,
involving several pathways, and integrins and conventional signaling receptors
often influence one another and work together to regulate cell behavior, as we
have already emphasized. The Ras/MAP kinase pathway (see Figure 15-49), for
example, can be activated both by conventional signaling receptors and by
integrins, but cells often need both kinds of stimulation of this pathway at the same
time to give sufficient activation to induce cell proliferation. Integrins and
conventional signaling receptors also cooperate to promote cell survival (discussed
in Chapters 15 and 18).

Figure 19-59 Tyrosine phosphorylation at focal adhesions. A fibroblast cultured
on a fibronectin-coated substratum and stained with fluorescent antibodies: actin
filaments are stained green and activated proteins that contain phosphotyrosine
are red, giving orange where the two components overlap. The actin filaments
terminate at focal adhesions, where the cell attaches to the substratum by
means of integrins. Proteins containing phosphotyrosine are also concentrated at
these sites, reflecting the local activation of FAK and other protein kinases. Signals
generated at such adhesion sites help regulate cell division, growth, and survival.
(Courtesy of Keith Burridge.)

One of the best-studied modes of integrin signaling depends on a cytoplasmic
protein tyrosine kinase called focal adhesion kinase (FAK). In studies of cells
cultured on plastic dishes, focal adhesions are often prominent sites of tyrosine
phosphorylation (Figure 19-59), and FAK is one of the major tyrosine-phosphorylated
proteins found at these sites. When integrins cluster at cell-matrix contacts,
FAK is recruited to the integrin β subunit by intracellular adaptor proteins such
as talin or paxillin (which binds to one type of integrin α subunit). The clustered
FAK molecules phosphorylate each other on a specific tyrosine, creating a
phosphotyrosine docking site for members of the Src family of cytoplasmic tyrosine
kinases. In addition to phosphorylating other proteins at the adhesion sites, these
kinases then phosphorylate FAK on additional tyrosines, creating docking sites
for a variety of additional intracellular signaling proteins. In this way, outside-in
signaling from integrins, via FAK and Src family kinases, is relayed into the cell in
much the same way as receptor tyrosine kinases generate signals (as discussed in
Chapter 15).

Cell-Matrix Adhesions Respond to Mechanical Forces 

Like the cell-cell junctions we described earlier, cell-matrix junctions can sense
and respond to the mechanical forces that act on them. Most cell-matrix
junctions, for example, are connected to a contractile actin network that tends to pull
the junctions inward, away from the matrix. When cells are attached to a rigid
matrix that strongly resists such pulling forces, the cell-matrix junction is able to
sense the resulting high tension and trigger a response in which it recruits
additional integrins and other proteins to increase the junction's ability to withstand
that tension. Cell attachment to a relatively soft matrix generates less tension
and therefore a less robust response. These mechanisms allow cells to sense and
respond to differences in the rigidity of extracellular matrices in different tissues.

We saw earlier that mechanotransduction at cadherin-based cell-cell
junctions likely depends on junctional proteins that change their structure when
the junction is stretched by tension (see Figure 19-12). The same is true for
cell-matrix junctions. The long C-terminal tail domain of talin, for example, includes
a large number of binding sites for the actin-regulatory protein vinculin. Many of
these sites are hidden inside folded protein domains but are exposed when those
domains are unfolded by stretching the protein (Figure 19-60). The N-terminal
end of talin binds integrin and the C-terminal end binds actin (see Figure 19-55);
thus, when actin filaments are pulled by myosin motors inside the cell, the
resulting tension stretches the talin rod, thereby exposing vinculin-binding sites. The
vinculin molecules then recruit and organize additional actin filaments. Tension
thereby increases the strength of the junction.

Summary 

Integrins are the principal cell-surface receptors used by animal cells to bind to the
extracellular matrix: they function as transmembrane linkers between the
extracellular matrix and the cytoskeleton. Most integrins connect to actin filaments, while
those at hemidesmosomes bind to intermediate filaments. Integrin molecules are
heterodimers, and the binding of extracellular matrix ligands or intracellular
activator proteins such as talin results in a dramatic conformational switch from an
inactive to an active state. This creates an allosteric coupling between binding to
matrix outside the cell and binding to the cytoskeleton inside it, allowing the
integrin to convey signals in both directions across the plasma membrane. Complex
assemblies of proteins become organized around the intracellular tails of activated
integrins, producing intracellular signals that can influence almost any aspect of
cell behavior, from proliferation and survival, as in the phenomenon of anchorage
dependence, to cell polarity and guidance of migration. Integrin-based cell-matrix
junctions are also capable of mechanotransduction: they can sense and respond to
mechanical forces acting across the junction.


Figure 19-60 Talin is a tension sensor at cell-matrix junctions. Tension
across cell-matrix junctions stimulates the local recruitment of vinculin and
other actin-regulatory proteins, thereby strengthening the junction's attachment
to the cytoskeleton. The experiments presented here tested the hypothesis that
tension is sensed by the talin adaptor protein that links integrins to actin filaments
(see Figure 19-55). (A) The long, flexible, C-terminal region of talin is divided into a
series of folded domains, some of which contain vinculin-binding sites (dark green
lines) that are thought to be hidden and therefore inaccessible. One domain near
the N-terminus, for example, comprises a folded bundle of 12 α helices containing
five vinculin-binding sites. (B) This experiment tested the hypothesis that
tension stretches the 12-helix domain, thereby exposing vinculin-binding sites.
A fragment of talin containing this domain was attached to an apparatus in which the
domain could be stretched, as shown here. The fragment was labeled at its N-terminus
with a tag that sticks to the surface of a glass slide on a microscope stage. The
C-terminal end of the fragment was bound to a tiny magnetic bead, so the talin
fragment could be stretched using a small magnetic electrode. The solution around
the protein contained fluorescently tagged vinculin proteins. After the talin protein was
stretched, excess vinculin solution was washed away, and the microscope was
used to determine if any fluorescent vinculin proteins were bound to the talin protein.
In the absence of stretching (top), most talin molecules did not bind vinculin. When
the protein was stretched (bottom), two or three vinculin molecules were bound (only
one is shown here for clarity). (Adapted from A. del Rio et al., Science 323:638-641,
2009.)


THE PLANT CELL WALL 

Each cell in a plant deposits, and is in turn completely enclosed by, an elaborate
extracellular matrix called the plant cell wall. It was the thick cell walls of cork,
visible in a primitive microscope, that in 1663 enabled Robert Hooke to distinguish
and name cells for the first time. The walls of neighboring plant cells, cemented
together to form the intact plant (Figure 19-61), are generally thicker, stronger,
and, most important of all, more rigid than the extracellular matrix produced by
animal cells. In evolving relatively rigid walls, which can be up to many
micrometers thick, early plant cells forfeited the ability to crawl about and adopted a
sedentary lifestyle that has persisted in all present-day plants.

Figure 19-61 Plant cell walls. (A) Electron micrograph of the root tip of a rush,
showing the organized pattern of cells that results from an ordered sequence of
cell divisions in cells with relatively rigid cell walls. In this growing tissue, the cell
walls are still relatively thin, appearing as fine black lines between the cells in the
micrograph. (B) Section of a typical cell wall separating two adjacent plant cells. The
two dark transverse bands correspond to plasmodesmata that span the wall (see
Figure 19-27). (A, courtesy of C. Busby and B. Gunning, Eur. J. Cell Biol. 21:214-223,
1980. With permission from Elsevier; B, courtesy of Jeremy Burgess.)

The Composition of the Cell Wall Depends on the Cell Type

All cell walls in plants have their origin in dividing cells, as the cell plate forms
during cytokinesis to create a new partition wall between the daughter cells
(discussed in Chapter 17). The new cells are usually produced in special regions
called meristems, and they are generally small in comparison with their final size.
To accommodate subsequent cell growth, the walls of the newborn cells, called
primary cell walls, are thin and extensible, although tough. Once cell growth
stops, the wall no longer needs to be extensible: sometimes the primary wall is
retained without major modification, but, more commonly, a rigid secondary cell
wall is produced by depositing new layers of matrix inside the old ones. These new
layers generally have a composition that is significantly different from that of the
primary wall. The most common additional polymer in secondary walls is lignin,
a complex network of covalently linked phenolic compounds found in the walls of
the xylem vessels and fiber cells of woody tissues.

Although the cell walls of higher plants vary in both composition and
organization, they are all constructed, like animal extracellular matrices, using a
structural principle common to all fiber-composites, including fiberglass and
reinforced concrete. One component provides tensile strength, while another,
in which the first is embedded, provides resistance to compression. While the
principle is the same in plants and animals, the chemistry is different. Unlike the
animal extracellular matrix, which is rich in protein and other nitrogen-containing
polymers, the plant cell wall is made almost entirely of polymers that contain
no nitrogen, including cellulose and lignin. For a sedentary organism that depends
on CO₂, H₂O, and sunlight, these two abundant biopolymers represent "cheap,"
carbon-based structural materials, helping to conserve the scarce fixed nitrogen
available in the soil that generally limits plant growth. Thus trees, for example,
make a huge investment in the cellulose and lignin that comprise the bulk of their
biomass.

In the cell walls of higher plants, the tensile fibers are made from the
polysaccharide cellulose, the most abundant organic macromolecule on Earth, tightly
linked into a network by cross-linking glycans. In primary cell walls, the matrix
in which the cross-linked cellulose network is embedded is composed of pectin,
a highly hydrated network of polysaccharides rich in galacturonic acid.
Secondary cell walls contain additional molecules to make them rigid and permanent;
lignin, in particular, forms a hard, waterproof filler in the interstices between the
other components. All of these molecules are held together by a combination of
covalent and noncovalent bonds to form a highly complex structure, whose
composition, thickness, and architecture depend on the cell type.

The plant cell wall thus has a "skeletal" role in supporting the structure of the
plant as a whole, a protective role as an enclosure for each cell individually, and
a transport role, helping to form channels for the movement of fluid in the plant.
When plant cells become specialized, they generally adopt a specific shape and
produce specially adapted types of walls, according to which the different types
of cells in a plant can be recognized and classified. We focus here, however, on
the primary cell wall and the molecular architecture that underlies its remarkable
combination of strength, resilience, and plasticity, as seen in the growing parts of
a plant.

The Tensile Strength of the Cell Wall Allows Plant Cells to Develop 
Turgor Pressure 

The aqueous extracellular environment of a plant cell consists of the fluid
contained in the walls that surround the cell. Although the fluid in the plant cell wall
contains more solutes than does the water in the plant's external milieu (for
example, soil), it is still hypotonic in comparison with the cell interior. This osmotic
imbalance causes the cell to develop a large internal hydrostatic pressure, or
turgor pressure, which pushes outward on the cell wall, just as an inner tube pushes
outward on a tire. The turgor pressure increases just to the point where the cell is
in osmotic equilibrium, with no net influx of water despite the salt imbalance. The
turgor pressure generated in this way may reach 10 or more atmospheres, about
five times that in the average car tire. This pressure is vital to plants because it is the
main driving force for cell expansion during growth, and it provides much of the
mechanical rigidity of living plant tissues. Compare the wilted leaf of a dehydrated
plant, for example, with the turgid leaf of a well-watered one. It is the mechanical
strength of the cell wall that allows plant cells to sustain this internal pressure.
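
To get a feel for the numbers, the osmotic balance described above can be estimated with the van 't Hoff relation for an ideal dilute solution, in which the osmotic pressure equals the solute concentration difference multiplied by RT. That relation, the choice of 25 °C, and the function names below are assumptions introduced for this sketch; real cytosol is far from an ideal solution, so the result is only an order-of-magnitude guide.

```python
# Gas constant in L·atm/(mol·K); a standard physical constant.
R_L_ATM = 0.082057


def osmotic_pressure_atm(delta_c_molar: float, temp_kelvin: float = 298.15) -> float:
    """Ideal (van 't Hoff) osmotic pressure for a solute concentration difference.

    delta_c_molar is the excess concentration of dissolved particles inside
    the cell relative to the wall fluid, in mol/L.
    """
    return delta_c_molar * R_L_ATM * temp_kelvin


def concentration_for_pressure(pressure_atm: float, temp_kelvin: float = 298.15) -> float:
    """Invert the relation: concentration difference that balances a given pressure."""
    return pressure_atm / (R_L_ATM * temp_kelvin)


if __name__ == "__main__":
    # Turgor pressure of 10 atm, the figure quoted in the text.
    delta_c = concentration_for_pressure(10.0)
    print(f"Concentration difference for 10 atm: {delta_c:.2f} mol/L")
```

Under these idealized assumptions, a turgor pressure of 10 atmospheres corresponds to an excess of roughly 0.4 mol/L of dissolved particles inside the cell relative to the fluid in the wall.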

The Primary Cell Wall Is Built from Cellulose Microfibrils Interwoven 
with a Network of Pectic Polysaccharides 

Cellulose gives the primary cell wall tensile strength. Each cellulose molecule
consists of a linear chain of at least 500 glucose residues that are covalently linked to
one another to form a ribbonlike structure, which is stabilized by hydrogen bonds
within the chain (Figure 19-62). In addition, hydrogen bonds between adjacent
cellulose molecules cause them to stick together in overlapping parallel arrays,
forming bundles of about 40 cellulose chains, all of which have the same polarity.
These highly ordered crystalline aggregates, many micrometers long, are called
cellulose microfibrils, and they have a tensile strength comparable to that of
steel. Sets of microfibrils are arranged in layers, or lamellae, with each microfibril
about 20-40 nm from its neighbors and connected to them by long cross-linking
glycan molecules, which are attached by hydrogen bonds to the surface of the
microfibrils. The primary cell wall consists of several such lamellae arranged in a
plywoodlike network (Figure 19-63).

Figure 19-62 Cellulose. Cellulose molecules are long, unbranched chains of
β1,4-linked glucose units. Each glucose residue is inverted with respect to its
neighbors, and the resulting disaccharide repeat occurs hundreds of times in a single
cellulose molecule. About 16 individual cellulose molecules assemble to form
a strong, hydrogen-bonded cellulose microfibril.

Figure 19-63 Scale model of a portion of a primary plant cell wall showing the
two major polysaccharide networks. The orthogonally arranged layers of cellulose
microfibrils (green) are tied into a network by the cross-linking glycans (red) that form
hydrogen bonds with the microfibrils. This network is coextensive with a network of
pectin polysaccharides (blue). The network of cellulose and cross-linking glycans
provides tensile strength, while the pectin network resists compression. Cellulose,
cross-linking glycans, and pectin are typically present in roughly equal amounts
in a primary cell wall. The middle lamella is especially rich in pectin, and it cements
adjacent cells together.

The cross-linking glycans are a heterogeneous group of branched
polysaccharides that bind tightly to the surface of each cellulose microfibril and thereby help
to cross-link the microfibrils into a complex network. There are many classes of
cross-linking glycans, but they all have a long linear backbone composed of one
type of sugar (glucose, xylose, or mannose) from which short side chains of other
sugars protrude. It is the backbone sugar molecules that form hydrogen bonds
with the surface of cellulose microfibrils, cross-linking them in the process. Both
the backbone and the side-chain sugars vary according to the plant species and
its stage of development.

Coextensive with this network of cellulose microfibrils and cross-linking
glycans is another cross-linked polysaccharide network based on pectins (see Figure
19-63). Pectins are a heterogeneous group of branched polysaccharides that
contain many negatively charged galacturonic acid units. Because of their negative
charge, pectins are highly hydrated and associated with a cloud of cations,
resembling the glycosaminoglycans of animal cells in the large amount of space they
occupy (see Figure 19-33). When Ca²⁺ is added to a solution of pectin molecules,
it cross-links them to produce a semirigid gel (it is pectin that is added to fruit
juice to make jam set). Certain pectins are particularly abundant in the middle
lamella, the specialized region that cements together the walls of adjacent cells
(see Figure 19-63); here, Ca²⁺ cross-links are thought to help hold cell wall
components together. Although covalent bonds also play a part in linking the
components, very little is known about their nature. Regulated separation of cells at
the middle lamella underlies such processes as the ripening of tomatoes and the
abscission (detachment) of leaves in the fall.

In addition to the two polysaccharide-based networks that form the bulk of all
plant primary cell walls, proteins are present, contributing up to about 5% of the
wall's dry mass. Many of these proteins are enzymes, responsible for wall
turnover and remodeling, particularly during growth. Another class of wall proteins,
like collagen, contains high levels of hydroxyproline. These proteins are thought
to strengthen the wall, and they are produced in greatly increased amounts as a
local response to attack by pathogens. From the genome sequence of Arabidopsis,
it has been estimated that more than 700 genes are required to synthesize,
assemble, and remodel the plant cell wall.

Oriented Cell Wall Deposition Controls Plant Cell Growth

Once a plant cell has left the meristem where it is generated, it can grow
dramatically, commonly by more than a thousand times in volume. The manner of this
expansion determines the final shape of each cell, and hence the final form of the
plant as a whole. Turgor pressure inside the cell drives the expansion, but it is the
behavior of the cell wall that governs its direction and extent. Complex
wall-remodeling activities are required, as well as the deposition of new wall materials.
Because of their crystalline structure, the individual cellulose microfibrils in the
wall are unable to stretch, and this gives them a crucial role in the process. For the
cell wall to stretch or deform, the microfibrils must either slide past one another
or become more widely separated, or both. The orientation of the microfibrils in
the innermost layers of the wall governs the direction in which the cell expands.
Cells in plants therefore anticipate their future morphology by controlling the
orientation of the cellulose microfibrils that they deposit in the wall (Figure 19-64).
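
A small geometric sketch shows why the direction of expansion matters so much for final shape. It treats the cell as a rectangular box and spreads a given fold-increase in volume equally over whichever axes are free to extend; axes held in check by inextensible microfibrils are left unchanged. The box geometry, the equal-sharing rule, and the function name are simplifying assumptions made only for illustration.

```python
from typing import Sequence, Tuple


def expand_cell(dims: Sequence[float], volume_fold: float,
                free_axes: Sequence[int]) -> Tuple[float, ...]:
    """Scale a box-shaped cell so that its volume grows by volume_fold.

    dims       -- edge lengths of the box (any consistent unit)
    free_axes  -- indices of the axes along which the wall can extend
    Growth is shared equally among the free axes; other axes keep their length.
    """
    free = set(free_axes)
    if volume_fold <= 0:
        raise ValueError("volume_fold must be positive")
    if not free:
        raise ValueError("at least one axis must be free to extend")
    # Each free axis is stretched by the same factor, chosen so the
    # product of the factors equals the required volume increase.
    per_axis = volume_fold ** (1.0 / len(free))
    return tuple(d * per_axis if i in free else d for i, d in enumerate(dims))


if __name__ == "__main__":
    start = (10.0, 10.0, 10.0)
    # Microfibrils wrapped around the cell: only the long axis (index 2) can extend.
    print(expand_cell(start, 1000.0, free_axes=(2,)))
    # No directional constraint: the same volume increase in all directions.
    print(expand_cell(start, 1000.0, free_axes=(0, 1, 2)))
```

For the same thousandfold increase in volume, a cell confined to one axis becomes a thousand times longer at constant width, whereas an unconstrained cell grows only about tenfold in each dimension.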

Unlike most other matrix macromolecules, which are made in the endoplasmic
reticulum and Golgi apparatus and are secreted, cellulose is spun out from
the surface of the cell by a plasma-membrane-bound enzyme complex (cellulose
synthase), which uses as its substrate the sugar nucleotide UDP-glucose supplied
from the cytosol. Each enzyme complex, or rosette, has a sixfold symmetry (see
Figure 19-65) and contains the protein products of three separate cellulose
synthase (CESA) genes. Each CESA protein is essential for the production of a
cellulose microfibril. Three CESA genes are required for primary cell wall synthesis and
a different three for secondary cell wall synthesis.

As they are being synthesized, the nascent cellulose chains assemble into
microfibrils. These are spun out on the extracellular surface of the plasma
membrane, forming a layer, or lamella, in which all the microfibrils have more or less
the same alignment (see Figure 19-63). Each new lamella is deposited internally
to the previous one, so that the wall consists of concentrically arranged lamellae,
with the oldest on the outside. The most recently deposited microfibrils in
elongating cells commonly lie perpendicular to the axis of cell elongation, although
the orientation of the microfibrils in the outer lamellae that were laid down earlier
may be different (see Figure 19-64B and C).

Figure 19-64 Cellulose microfibrils influence the direction of cell elongation.
(A) The orientation of cellulose microfibrils in the primary cell wall of an elongating carrot
cell is shown in this electron micrograph of a shadowed replica from a rapidly frozen
and deep-etched cell wall. The cellulose microfibrils are aligned parallel to one
another and perpendicular to the axis of cell elongation. The microfibrils are
cross-linked by, and interwoven with, a complex web of matrix molecules (compare with
Figure 19-63). (B, C) The cells in (B) and (C) start off with identical shapes (shown
here as cubes) but with different net orientations of cellulose microfibrils in their
walls. Although turgor pressure is uniform in all directions, cell wall loosening allows
each cell to elongate only in a direction perpendicular to the orientation of the
innermost layer of microfibrils, which have great tensile strength. Cell expansion
occurs in concert with the insertion of new wall material. The final shape of an organ,
such as a shoot, is determined in part by the direction in which its component cells
can expand. (A, courtesy of Brian Wells and Keith Roberts.)

Microtubules Orient Cell Wall Deposition

An important clue to the mechanism that dictates microfibril orientation came
from observations of the microtubules in plant cells. These are frequently arranged
in the cortical cytoplasm with the same orientation as the cellulose microfibrils
that are currently being deposited in the cell wall in that region. These cortical
microtubules form a cortical array close to the cytosolic face of the plasma
membrane, held there by poorly characterized proteins. The congruent orientation of
the cortical array of microtubules (lying just inside the plasma membrane) and
cellulose microfibrils (lying just outside) is seen in many types and shapes of plant
cells and is present during both primary and secondary cell wall deposition,
suggesting a causal relationship.

This suggestion can be tested by treating a plant tissue with a
microtubule-depolymerizing drug so as to disassemble the entire system of cortical
microtubules. The consequences for subsequent cellulose deposition, however, are not
as straightforward as might be expected. The drug treatment does not disrupt the
production of new cellulose microfibrils, and in some cases cells can continue to
deposit new microfibrils in the preexisting orientation. Any developmental switch
in the orientation of the microfibril pattern that would normally occur between
successive lamellae, however, is invariably blocked. It seems that a preexisting
orientation of microfibrils can be propagated even in the absence of
microtubules, but any change in the deposition of cellulose microfibrils requires that
intact microtubules be present to determine the new orientation.

These observations are consistent with the following model. The
cellulose-synthesizing rosettes embedded in the plasma membrane spin out long cellulose
molecules. As the synthesis of cellulose molecules and their self-assembly into
microfibrils proceeds, the distal end of each microfibril presumably forms
indirect cross-links to the previous layer of wall material, orienting the new
microfibril in parallel with the old ones as it becomes integrated into the texture of the
wall. Since the microfibril is stiff, the rosette at its growing, proximal end has to
move as it deposits the new material. Traveling in the plane of the membrane,
the rosette moves in the direction defined by the way in which the far end of the
microfibril is anchored in the existing wall. In this way, each layer of microfibrils
would tend to be spun out from the membrane in the same orientation as the
layer laid down previously, with the rosettes following the direction of the
preexisting oriented microfibrils outside the cell. Oriented microtubules inside the cell,
however, can force a change in the direction in which the rosettes move: they can
create boundaries in the plasma membrane that act like the banks of a canal to
constrain rosette movement (Figure 19-65). In this view, cellulose synthesis can
occur independently of microtubules; but it is constrained spatially when cortical
microtubules are present to define membrane microdomains within which the
enzyme complex can move.


Figure 19-65 One model of how the orientation of newly deposited cellulose
microfibrils might be determined by the orientation of cortical microtubules.
(A) The large cellulose synthase complexes, or rosettes, are integral membrane
proteins that continuously synthesize cellulose microfibrils on the outer face of
the plasma membrane. The distal ends of the stiff microfibrils become integrated
into the texture of the wall, and their elongation at the proximal end pushes the
synthase complex along in the plane of the membrane. Because the cortical array
of microtubules is attached to the plasma membrane in a way that confines this
complex to defined membrane channels, the orientation of these microtubules,
when they are present, determines the axis along which the new microfibrils
are laid down. (B, C) Two electron micrographs show the tight association of
the cortical microtubules with the plasma membrane. One shows the microtubules
in cross section while the other shows a microtubule in longitudinal section. Both
emphasize the constant gap of about 20 nm between membrane and microtubule;
the connecting molecules responsible remain obscure. (B and C, courtesy of
Andrew Staehelin.)

In this way, plant cells can change their direction of expansion by a sudden
change in the orientation of their cortical array of microtubules. Because plant
cells cannot move (being constrained by their walls), the entire morphology of
a multicellular plant presumably depends on a coordinated, highly patterned
deployment of cortical microtubule orientations during plant development. It is
not known how these orientations are controlled, although it has been shown that
the microtubules can reorient rapidly in response to extracellular stimuli,
including plant growth regulators such as ethylene and auxins (discussed in Chapter 15).

Microtubules are not, however, the only cytoskeletal elements that influence
wall deposition. Local foci of cortical actin filaments can also direct the deposition
of new wall material at specific sites on the cell surface, contributing to the
elaborate final shaping of many differentiated plant cells.


Summary

Plant cells are surrounded by a tough extracellular matrix, or cell wall, which is
responsible for many of the unique features of a plant's lifestyle. The wall is
composed of a network of cellulose microfibrils and cross-linking glycans, embedded in
a highly cross-linked matrix of pectin polysaccharides. In secondary cell walls,
lignin may be deposited to make them waterproof, hard, and woody. A cortical array
of microtubules can control the orientation of newly deposited cellulose
microfibrils, which in turn determine the direction of cell expansion and therefore the final
shape of the cell and, ultimately, of the plant as a whole.


WHAT WE DON'T KNOW


• What are the regulatory mechanisms
that control the rearrangement of
cell-cell junctions in epithelia during
early development? What roles do
mechanical force and tension play in
these rearrangements?

• How do extracellular matrix proteins
and carbohydrates influence the
localization and actions of extracellular
signal molecules or their cell-surface
receptors?

• How do intracellular adaptor proteins
coordinate the activation of integrin
proteins and their interactions with
cytoskeletal components and their
response to changes in mechanical
force acting on cell-matrix junctions?

• Given that extracellular matrix
molecules have the ability to present
ordered arrays of signals to cells,
might the exact spatial relationships
between such signals carry a message
beyond that of the individual signals
themselves?


PROBLEMS


Which statements are true? Explain why or why not. 

19-1 Given the numerous processes inside cells that
are regulated by changes in Ca²⁺ concentration, it seems
likely that Ca²⁺-dependent cell-cell adhesions are also
regulated by changes in Ca²⁺ concentration.

19-2 Tight junctions perform two distinct functions: 
they seal the space between cells to restrict paracellular 
flow and they fence off plasma membrane domains to 
prevent the mixing of apical and basolateral membrane 
proteins. 

19-3 The elasticity of elastin derives from its high content
of α helices, which act as molecular springs.

19-4 Integrins can convert mechanical signals into
intracellular molecular signals.

Discuss the following problems. 

19-5 Comment on the following (1922) quote from 
Warren Lewis, who was one of the pioneers of cell biology. 
“Were the various types of cells to lose their stickiness for 
one another and for the supporting extracellular matrix, 
our bodies would at once disintegrate and flow off into the
ground in a mixed stream of cells.” 

19-6 Cell adhesion molecules were originally identified
using antibodies raised against cell-surface components
to block cell aggregation. In the adhesion-blocking
assays, the researchers found it necessary to use antibody
fragments, each with a single binding site (so-called Fab
fragments), rather than intact IgG antibodies, which are
Y-shaped molecules with two identical binding sites. The
Fab fragments were generated by digesting the IgG antibodies
with papain, a protease, to separate the two binding
sites (Figure Q19-1). Why do you suppose it was necessary
to use Fab fragments to block cell aggregation?

Figure Q19-1 Production of Fab fragments from IgG antibodies by
digestion with papain (Problem 19-6).

19-7 The food-poisoning bacterium Clostridium perfringens
makes a toxin that binds to members of the claudin
family of proteins, which are the main constituents of
tight junctions. When the C-terminus of the toxin is bound
to a claudin, the N-terminus can insert into the adjacent
cell membrane, forming holes that kill the cell. The portion
of the toxin that binds to the claudins has proven to be
a valuable reagent for investigating the properties of tight
junctions. MDCK cells are a common choice for studies
of tight junctions because they can form an intact epithelial
sheet with high transepithelial resistance. MDCK cells
express two claudins: claudin-1, which is not bound by
the toxin, and claudin-4, which is.

When an intact MDCK epithelial sheet is incubated
with the C-terminal toxin fragment, claudin-4
disappears, becoming undetectable within 24 hours. In
the absence of claudin-4, the cells remain healthy and
the epithelial sheet appears intact. The mean number
of strands in the tight junctions that link the cells also
decreases over 24 hours from about four to about two, and
they are less highly branched. A functional assay for the
integrity of the tight junctions shows that transepithelial
resistance decreases dramatically in the presence of the
toxin, but the resistance can be restored by washing out
the toxin (Figure Q19-2A). Curiously, the toxin produces
these effects only when it is added to the basolateral side
of the sheet; it has no effect when added to the apical surface
(Figure Q19-2B).

Figure Q19-2 Effects of Clostridium toxin on the barrier function
of MDCK cells (Problem 19-7). (A) Addition of toxin from the
basolateral side of the epithelial sheet. (B) Addition of toxin from
the apical side of the epithelial sheet. For a given voltage, a higher
resistance (ohms cm²) gives less paracellular current.

A. How can it be that two tight-junction strands
remain, even though all of the claudin-4 has disappeared?

B. Why do you suppose the toxin works when it is
added to the basolateral side of the epithelial sheet, but
not when added to the apical side?

19-8 It is not an easy matter to assign particular functions
to specific components of the basal lamina, since
the overall structure is a complicated composite material
with both mechanical and signaling properties. Nidogen,
for example, cross-links two central components of the
basal lamina by binding to the laminin γ-1 chain and to
type IV collagen. Given such a key role, it was surprising
that mice with a homozygous knockout of the gene for
nidogen-1 were entirely healthy, with no abnormal phenotype.
Similarly, mice homozygous for a knockout of the
gene for nidogen-2 also appeared completely normal. By
contrast, mice that were homozygous for a defined mutation
in the gene for laminin γ-1, which eliminated just the
binding site for nidogen, died at birth with severe defects
in lung and kidney formation. The mutant portion of the
laminin γ-1 chain is thought to have no other function
than to bind nidogen, and does not affect laminin structure
or its ability to assemble into the basal lamina. How
would you explain these genetic observations, which are
summarized in Table Q19-1? What would you predict
would be the phenotype of a mouse that was homozygous
for knockouts of both nidogen genes?


Table Q19-1 Phenotypes of mice with genetic defects
in components of the basal lamina (Problem 19-8).

Protein        Genetic defect                         Phenotype
Nidogen-1      Gene knockout (-/-)                    None
Nidogen-2      Gene knockout (-/-)                    None
Laminin γ-1    Nidogen binding-site deletion (+/-)    None
Laminin γ-1    Nidogen binding-site deletion (-/-)    Dead at birth

+/- stands for heterozygous, -/- stands for homozygous.


19-9 Discuss the following statement: "The basal lamina
of muscle fibers serves as a molecular bulletin board,
in which adjoining cells can post messages that direct the
differentiation and function of the underlying cells."

19-10 The affinity of integrins for matrix components can
be modulated by changes to their cytoplasmic domains:
a process known as inside-out signaling. You have identified
a key region in the cytoplasmic domains of αIIbβ3
integrin that seems to be required for inside-out signaling
(Figure Q19-3). Substitution of alanine for either D723
in the β chain or R995 in the α chain leads to a high level
of spontaneous activation, under conditions where the
wild-type chains are inactive. Your advisor suggests that
you convert the aspartate in the β chain to an arginine
(D723R) and the arginine in the α chain to an aspartate
(R995D). You compare all three α chains (R995, R995A,
and R995D) against all three β chains (D723, D723A, and
D723R). You find that all pairs have a high level of spontaneous
activation, except D723 vs R995 (the wild type) and
D723R vs R995D, which have low levels. Based on these
results, how do you think the αIIbβ3 integrin is held in its
inactive state?

[Figure Q19-3: schematic with regions labeled extracellular space, plasma membrane,
and cytoplasm, showing the sequences WKLLITIHDRKEF- (position 723 marked)
and WKVGFFKRNRP (position 995 marked).]

Figure Q19-3 Schematic representation of αIIbβ3 integrin
(Problem 19-10). The D723 and R995 residues are indicated.
(From P.E. Hughes et al., J. Biol. Chem. 271:6571-6574, 1996.
With permission from American Society for Biochemistry and
Molecular Biology.)

19-11 The glycosaminoglycan polysaccharide chains
that are linked to specific core proteins to form the proteoglycan
components of the extracellular space are
highly negatively charged. How do you suppose these
negatively charged polysaccharide chains help to establish
a hydrated gel-like environment around the cell? How
would the properties of these molecules differ if the polysaccharide
chains were uncharged?

19-12 At body temperature, L-aspartate in proteins racemizes
to D-aspartate at an appreciable rate. Most proteins
in the body have a very low level of D-aspartate, if it
can be detected at all. Elastin, however, has a fairly high
level of D-aspartate. Moreover, the amount of D-aspartate
increases in direct proportion to the age of the person from
whom the sample was taken. Why do you suppose that
most proteins have little if any D-aspartate, while elastin
has levels of D-aspartate that increase steadily with age?

19-13 Your boss is coming to dinner! All you have for a
salad is some wilted, day-old lettuce. You vaguely recall
that there is a trick to rejuvenating wilted lettuce, but you
cannot remember what it is. Should you soak the lettuce in
salt water, soak it in tap water, or soak it in sugar water, or
maybe just shine a bright light on it and hope that photosynthesis
will perk it up?


19-14 A plant must be able to respond to changes in the
water status of its surroundings. It does so by the flow of
water molecules through water channels called aquaporins.
The hydraulic conductivity of a single aquaporin is
4.4 × 10⁻²² m³ per second per MPa (megapascal) of pressure.
What does this correspond to in terms of water molecules
per second at atmospheric pressure? [Atmospheric
pressure is 0.1 MPa (1 bar) and the concentration of water
is 55.5 M.]
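
The unit conversion that Problem 19-14 asks for can be sketched as follows. This is only an illustrative calculation: Avogadro's number and the function name are not given in the problem and are assumptions introduced here.

```python
AVOGADRO = 6.022e23  # molecules per mole (assumed constant, not stated in the problem)


def water_molecules_per_second(conductivity_m3_per_s_mpa, pressure_mpa, water_molar):
    """Convert a hydraulic conductivity into a flow of water molecules per second."""
    volume_m3_per_s = conductivity_m3_per_s_mpa * pressure_mpa  # m^3 of water per second
    liters_per_s = volume_m3_per_s * 1000.0                     # 1 m^3 = 1000 L
    moles_per_s = liters_per_s * water_molar                    # (L/s) x (mol/L)
    return moles_per_s * AVOGADRO


rate = water_molecules_per_second(4.4e-22, 0.1, 55.5)
print(f"{rate:.2e} water molecules per second")
```

Under these assumptions the flow through one channel comes out on the order of a million water molecules per second.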
Cancer


CHAPTER 

20 


About one in five of us will die of cancer, but that is not why we devote a chapter 
to this disease. Cancer cells break the most basic rules of cell behavior by which 
multicellular organisms are built and maintained, and they exploit every kind of 
opportunity to do so. These transgressions help to reveal what the normal rules 
are and how they are enforced. As a result, cancer research helps to illuminate
the fundamentals of cell biology—especially cell signaling (Chapter 15), the cell 
cycle and cell growth (Chapter 17), programmed cell death (apoptosis, Chapter 
18), and the control of tissue architecture (Chapters 19 and 22). Of course, with 
a deeper understanding of these normal processes, we also gain a deeper
understanding of the disease and better tools to treat it.

In this chapter, we first consider what cancer is and describe the natural
history of the disease from a cellular standpoint. We then discuss the molecular
changes that make a cell cancerous. And we end the chapter by considering
how our enhanced understanding of the molecular basis of cancer is leading to
improved methods for its prevention and treatment.


IN THIS CHAPTER 

CANCER AS A
MICROEVOLUTIONARY
PROCESS

CANCER-CRITICAL GENES: 
HOW THEY ARE FOUND AND 
WHAT THEY DO 

CANCER PREVENTION AND 
TREATMENT: PRESENT AND 
FUTURE 


CANCER AS A MICROEVOLUTIONARY PROCESS

The body of an animal operates as a society or ecosystem, whose individual members
are cells that reproduce by cell division and organize themselves into collaborative
assemblies called tissues. This ecosystem is very peculiar, however, because
self-sacrifice, as opposed to survival of the fittest, is the rule. Ultimately, all of
the somatic cell lineages in animals are committed to die: they leave no progeny
and instead dedicate their existence to the support of the germ cells, which alone
have a chance of continued survival (discussed in Chapter 21). There is no mystery
in this, for the body is a clone derived from a fertilized egg, and the genome
of the somatic cells is the same as that of the germ-cell lineage that gives rise to
sperm or eggs. By their self-sacrifice for the sake of the germ cells, the somatic
cells help to propagate copies of their own genes.

Thus, unlike free-living cells such as bacteria, which compete to survive, the
cells of a multicellular organism are committed to collaboration. To coordinate
their behavior, the cells send, receive, and interpret an elaborate set of extracellular
signals that serve as social controls, directing cells how to act (discussed in
Chapter 15). As a result, each cell behaves in a socially responsible manner (resting,
growing, dividing, differentiating, or dying) as needed for the good of the
organism.

Molecular disturbances that upset this harmony mean trouble for a multicellular
society. In a human body with more than 10¹⁴ cells, billions of cells experience
mutations every day, potentially disrupting the social controls. Most dangerously,
a mutation may give one cell a selective advantage, allowing it to grow and
divide slightly more vigorously and survive more readily than its neighbors and
in this way to become a founder of a growing mutant clone. A mutation that promotes
such selfish behavior by individual members of the cooperative can jeopardize
the future of the whole enterprise. Over time, repeated rounds of mutation,
competition, and natural selection operating within the population of somatic
cells can cause matters to go from bad to worse. These are the basic ingredients
of cancer: it is a disease in which an individual mutant clone of cells begins by
prospering at the expense of its neighbors. In the end, as the clone grows, evolves,
and spreads, it can destroy the entire cellular society (Movie 20.1).

In this section, we discuss the development of cancer as a microevolutionary
process that takes place within the course of a human life-span in a subpopulation
of cells in the body. But the process depends on the same principles of mutation
and natural selection that have driven the evolution of living organisms on Earth
for billions of years.

Cancer Cells Bypass Normal Proliferation Controls and Colonize
Other Tissues

Cancer cells are defined by two heritable properties: (1) they reproduce in defiance
of the normal restraints on cell growth and division, and (2) they invade
and colonize territories normally reserved for other cells. It is the combination of
these properties that makes cancers particularly dangerous. An abnormal cell that
grows (increases in mass) and proliferates (divides) out of control will give rise to
a tumor, or neoplasm (literally, a new growth). As long as the neoplastic cells have
not yet become invasive, however, the tumor is said to be benign. For most types
of such neoplasms, removing or destroying the mass locally usually achieves a
complete cure. A tumor is considered a true cancer if it is malignant; that is, when
its cells have acquired the ability to invade surrounding tissue. Invasiveness is an
essential characteristic of cancer cells. It allows them to break loose, enter blood
or lymphatic vessels, and form secondary tumors called metastases at other sites
in the body (Figure 20-1). In general, the more widely a cancer spreads, the harder
it becomes to eradicate. It is generally metastases that kill the cancer patient.

Cancers are traditionally classified according to the tissue and cell type from
which they arise. Carcinomas are cancers arising from epithelial cells, and they
are by far the most common cancers in humans. They account for about 80% of
cases, perhaps because most of the cell proliferation in adults occurs in epithelia.
In addition, epithelial tissues are the most likely to be exposed to the various
forms of physical and chemical damage that favor the development of cancer.
Sarcomas arise from connective tissue or muscle cells. Cancers that do not fit in
either of these two broad categories include the various leukemias and lymphomas,
derived from white blood cells and their precursors (hemopoietic cells), as
well as cancers derived from cells of the nervous system. Figure 20-2 shows the
types of cancers that are common in the United States, together with their incidence
and death rates. Each broad category has many subdivisions according to
the specific cell type, the location in the body, and the microscopic appearance of
the tumor.

In parallel with the set of names for malignant tumors, there is a related set of 
names for benign tumors: an adenoma, for example, is a benign epithelial tumor
with a glandular organization; the corresponding type of malignant tumor is an 
adenocarcinoma (Figure 20-3). Similarly, a chondroma and a chondrosarcoma 
are, respectively, benign and malignant tumors of cartilage. 

Most cancers have characteristics that reflect their origin. Thus, for example,
the cells of a basal-cell carcinoma, derived from a keratinocyte stem cell in the skin,
generally continue to synthesize cytokeratin intermediate filaments, whereas the
cells of a melanoma, derived from a pigment cell in the skin, will often (but not
always) continue to make pigment granules. Cancers originating from different
cell types are, in general, very different diseases. Basal-cell carcinomas of the skin,
for example, are only locally invasive and rarely metastasize, whereas melanomas
can become much more malignant and often form metastases. Basal-cell carcinomas
are readily cured by surgery or local irradiation, whereas malignant melanomas,
once they have metastasized widely, are usually fatal.

Later, we shall see that there is also a different way to classify cancers, one that 
cuts across the traditional classification by site of origin: we can classify them in 
terms of the mutations that make the tumor cells cancerous. The final section 
of the chapter will show how this information can be crucial to the design and 
choice of treatments. 




Figure 20-1 Metastasis. Malignant tumors 
typically give rise to metastases, making
the cancer hard to eradicate. Shown in 
this fusion image is a whole-body scan of 
a patient with metastatic non-Hodgkin’s 
lymphoma (NHL). The background image 
of the body’s tissues was obtained by CT 
(computed x-ray tomography) scanning. 
Overlaid on this image, a PET (positron 
emission tomography) scan reveals the
tumor tissue (yellow), detected by its 
unusually high uptake of radioactively 
labeled fluorodeoxyglucose (FDG). High 
FDG uptake occurs in cells with unusually 
active glucose uptake and metabolism, 
which is a characteristic of cancer cells 
(see Figure 20-12). The yellow spots 
in the abdominal region reveal multiple
metastases. (Courtesy of S. Gambhir.)

Figure 20-2 Cancer incidence and
mortality in the United States. The total 
number of new cases diagnosed in 2012 
in the United States was 1,665,540, and 
total cancer deaths were 585,720. Note 
that deaths reflect cases diagnosed at 
many different times and that somewhat 
less than half of the people who develop
cancer die of it. In the world as a whole, the 
five most common cancers are those of the 
lung, stomach, breast, colon/rectum, and 
uterine cervix (included in the figure under 
the heading of reproductive tract), and the 
total number of new cancer cases recorded 
per year is just over 6 million. Skin cancers 
other than melanomas are not included in 
these figures, since almost all are cured
easily and many are unrecorded. 

The data for the United Kingdom are 
similar. However, incidences are different 
in some other parts of the world, reflecting 
widespread exposures to different 
infectious agents and environmental toxins. 
(Data from American Cancer Society, 
Cancer Facts and Figures, 2014.) 


Most Cancers Derive from a Single Abnormal Cell

Even when a cancer has metastasized, we can usually trace its origins to a single
primary tumor, arising in a specific organ. The primary tumor is thought to derive
by cell division from a single cell that initially experienced some heritable change.
Subsequently, additional changes accumulate in some of the descendants of this
cell, allowing them to outgrow, out-divide, and often outlive their neighbors. By
the time it is first detected, a typical human cancer will have been developing for
many years and will already contain a billion cancer cells or more (Figure 20-4).
Tumors will usually also contain a variety of other cell types; for example, fibroblasts
will be present in the supporting connective tissue associated with a carcinoma,
in addition to inflammatory and vascular endothelial cells. How can we
be sure that the cancer cells are the clonal descendants of a single abnormal cell?

One way of proving clonal origin is through molecular analysis of the chromosomes
in tumor cells. In almost all patients with chronic myelogenous leukemia
(CML), for example, we can distinguish the leukemic white blood cells from
the patient's normal cells by a specific chromosomal abnormality: the so-called
Philadelphia chromosome, created by a translocation between the long arms of
chromosomes 9 and 22 (Figure 20-5). When the DNA at the site of translocation
is cloned and sequenced, it is found that the site of breakage and rejoining of the
translocated fragments is identical in all the leukemic cells in any given patient,
but that this site differs slightly (by a few hundred or thousand base pairs) from
one patient to another. This is the expected result if, and only if, the cancer in each
patient arises from a unique accident occurring in a single cell. We will see later
how this particular translocation promotes the development of CML by creating a
novel hybrid gene encoding a protein that promotes cell proliferation.

Figure 20-3 Benign versus malignant
tumors. A benign glandular tumor (pink
cells; an adenoma) remains inside the basal
lamina (yellow) that marks the boundary
of the normal structure (a duct, in this
example). In contrast, a malignant glandular
tumor (red cells; an adenocarcinoma) can
develop from a benign tumor cell, and
it destroys the integrity of the tissue, as
shown. There are many different forms that
such tumors may take.

Figure 20-4 The growth of a typical human tumor, such as a tumor of
the breast. The diameter of the tumor is plotted on a logarithmic scale. Years
may elapse before the tumor becomes noticeable. The doubling time of a
typical breast tumor, for example, is about 100 days. However, particularly
virulent tumors may grow much more rapidly.
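
The timescale illustrated in Figure 20-4 can be checked with a rough calculation. The sketch below assumes, purely for illustration, that a tumor starts from one cell, doubles at a constant rate, and loses no cells; real tumors violate all three assumptions, and the function names are hypothetical.

```python
import math


def doublings_to_reach(cell_count):
    """Number of population doublings needed to grow from 1 cell to cell_count cells."""
    return math.log2(cell_count)


def years_to_reach(cell_count, doubling_time_days=100.0):
    """Time in years to reach cell_count, given a constant doubling time in days."""
    return doublings_to_reach(cell_count) * doubling_time_days / 365.25


doublings = doublings_to_reach(1e9)  # a billion cells, as for a tumor at first detection
years = years_to_reach(1e9)          # 100-day doubling time, as quoted for a breast tumor
print(f"{doublings:.1f} doublings, about {years:.1f} years")
```

With a 100-day doubling time, the roughly 30 doublings needed to reach a billion cells take about eight years, consistent with the statement that a cancer has usually been developing for many years before it is detected.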

Many other lines of evidence, from a variety of cancers, point to the same conclusion:
most cancers originate from a single aberrant cell.

Cancer Cells Contain Somatic Mutations 

If a single abnormal cell is to give rise to a tumor, it must pass on its abnormality
to its progeny: the aberration has to be heritable. Thus, the development of
a clone of cancer cells depends on genetic changes. The tumor cells contain
somatic mutations: they have one or more shared detectable abnormalities in
their DNA sequence that distinguish them from the normal cells surrounding the
tumor, as in the example of CML just described. (The mutations are called somatic
because they occur in the soma, or body cells, not in the germ line.) Cancers are
also driven by epigenetic changes: persistent, heritable changes in gene expression
that result from modifications of chromatin structure without alteration of
the cell's DNA sequence. But somatic mutations that alter DNA sequence appear
to be a fundamental and universal feature, and cancer is in this sense a genetic
disease.

Factors that cause genetic changes tend to provoke the development of cancer.
Thus, carcinogenesis (the generation of cancer) can be linked to mutagenesis
(the production of a change in the DNA sequence). This correlation is particularly
clear for two classes of external agents: (1) chemical carcinogens (which typically
cause simple local changes in the nucleotide sequence), and (2) radiation such as
x-rays (which typically cause chromosome breaks and translocations) or ultraviolet
(UV) light (which causes specific DNA base alterations).

As would be expected, people who have inherited a genetic defect in one of
several DNA repair mechanisms, causing their cells to accumulate mutations at an
elevated rate, run a heightened risk of cancer. Those with the disease xeroderma
pigmentosum, for example, have defects in the system that repairs DNA damage
induced by UV light, and they have a greatly increased incidence of skin cancers.

A Single Mutation Is Not Enough to Change a Normal Cell into a 
Cancer Cell 

An estimated 10¹⁶ cell divisions occur in a normal human body in the course of
a typical lifetime; in a mouse, with its smaller number of cells and its shorter life-span,
the number is about 10¹². Even in an environment that is free of mutagens,
mutations would occur spontaneously at an estimated rate of about 10⁻⁶ mutations
per gene per cell division, a value set by fundamental limitations on the accuracy
of DNA replication and repair (see pp. 237-238). Thus, in a typical lifetime, every
single gene is likely to have undergone mutation on about 10¹⁰ separate occasions
in a human, or on about 10⁶ occasions in a mouse. Among the resulting mutant
cells, we might expect a large number that have sustained deleterious mutations
in genes that regulate cell growth and division, causing the cells to disobey the
normal restrictions on cell proliferation. From this point of view, the problem of
cancer seems to be not why it occurs, but why it occurs so infrequently.
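
The arithmetic behind these estimates is a simple product, sketched below. The function name is illustrative, and the calculation assumes, as the text does, a single uniform mutation rate per gene per cell division.

```python
def lifetime_mutations_per_gene(cell_divisions, rate_per_gene_per_division=1e-6):
    """Expected number of times a given gene mutates somewhere in the body in a lifetime."""
    return cell_divisions * rate_per_gene_per_division


human = lifetime_mutations_per_gene(1e16)  # about 10^16 divisions per human lifetime
mouse = lifetime_mutations_per_gene(1e12)  # about 10^12 divisions per mouse lifetime
print(f"human: {human:.0e} mutations per gene, mouse: {mouse:.0e} mutations per gene")
```

The four-orders-of-magnitude gap between the two species comes entirely from the difference in the number of cell divisions, since the assumed mutation rate is the same.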

Clearly, if a mutation in a single gene were enough to convert a typical healthy
cell into a cancer cell, we would not be viable organisms. Many lines of evidence
indicate that the development of a cancer typically requires that a substantial
number of independent, rare genetic and epigenetic accidents occur in the lineage
that emanates from a single cell. One such indication comes from epidemiological
studies of the incidence of cancer as a function of age (Figure 20-6). If a
single mutation were responsible for cancer, occurring with a fixed probability per
year, the chance of developing cancer in any given year of life should be independent
of age. In fact, for most types of cancer, the incidence rises steeply with age,
as would be expected if cancer is caused by a progressive, random accumulation
of a set of mutations in a single lineage of cells.

Figure 20-5 The translocation between
chromosomes 9 and 22 responsible
for chronic myelogenous leukemia.
The normal structures of chromosomes
9 and 22 are shown at the left. When a
translocation occurs between them at the
indicated site, the result is the abnormal
pair at the right. The smaller of the two
resulting abnormal chromosomes (22q⁻) is
called the Philadelphia chromosome, after
the city where the abnormality was first
recorded.

Figure 20-6 Cancer incidence as a function of age. The number of newly
diagnosed cases of colon cancer in women in England and Wales in 1 year
is plotted as a function of age at diagnosis, relative to the total number
of individuals in each age group. The incidence of cancer rises steeply
as a function of age. If only a single mutation were required to trigger the
cancer and this mutation had an equal chance of occurring at any time, the
incidence of this cancer would be the same at all ages. Analyses of this type
suggest that the development of a solid tumor instead requires five to eight
independent accidents ("hits") that occur randomly over time. This calculation
assumes that the mutation rate remains constant as a cancer evolves, whereas
in fact it often increases (see p. 1097). (Data from C. Muir et al., Cancer
Incidence in Five Continents, Vol. V. Lyon: International Agency for Research
on Cancer, 1987.)
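
The age argument can be made concrete with a simplified multistage model, which is an assumption introduced here rather than a calculation taken from the text: if a cancer needs k independent rare hits, each occurring at a constant rate, the incidence at age t is roughly proportional to t raised to the power k - 1. The function names below are hypothetical.

```python
def relative_incidence(age, hits):
    """Incidence at a given age, up to a constant factor, when `hits` events are required."""
    return age ** (hits - 1)


def fold_increase(age_young, age_old, hits):
    """How many times higher the incidence is at age_old than at age_young."""
    return relative_incidence(age_old, hits) / relative_incidence(age_young, hits)


# Compare ages 40 and 80 for a single-hit cancer and for five to eight hits.
for k in (1, 5, 6, 7, 8):
    print(k, fold_increase(40, 80, k))
```

In this sketch a single-hit cancer shows no dependence on age, whereas a requirement for several hits makes the incidence climb steeply between ages 40 and 80, which is the qualitative pattern described for most cancers.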

As discussed later, these indirect arguments have now been confirmed by systematically
sequencing the genomes of the tumor cells from individual cancer
patients and cataloging the mutations that they contain.

Cancers Develop Gradually from Increasingly Aberrant Cells

For those cancers known to have a specific external cause, the disease does
not usually become apparent until long after exposure to the causal agent. The
incidence of lung cancer, for example, does not begin to rise steeply until after
decades of heavy smoking (Figure 20-7). Similarly, the incidence of leukemias
in Hiroshima and Nagasaki did not show a marked rise until about 5 years after
the explosion of the atomic bombs, and industrial workers exposed for a limited
period to chemical carcinogens do not usually develop the cancers characteristic
of their occupation until 10, 20, or even more years after the exposure. During
this long incubation period, the prospective cancer cells undergo a succession
of changes, and the same presumably applies to cancers where the initial genetic
lesion has no such obvious external cause.

The concept that the development of a cancer requires a gradual accumulation
of mutations in a number of different genes helps to explain the well-known phenomenon
of tumor progression, whereby an initial mild disorder of cell behavior
evolves gradually into a full-blown cancer. Chronic myelogenous leukemia again
provides a clear example. It begins as a disorder characterized by a nonlethal
overproduction of white blood cells and continues in this form for several years
before changing into a much more rapidly progressing illness that usually ends
in death within a few months. In the early chronic phase, the leukemic cells are
distinguished mainly by the chromosomal translocation (the Philadelphia chromosome)
mentioned previously, although there may well be other, less visible
genetic or epigenetic changes. In the subsequent acute phase, cells that show not
only the translocation but also several other chromosomal abnormalities overrun
the hemopoietic (blood-forming) system. It appears that cells from the initial
mutant clone have undergone further mutations that make them proliferate even
more vigorously, so that they come to outnumber both the normal blood cells and
their ancestors with the primary chromosomal translocation.

Figure 20-7 Smoking and the onset of
lung cancer. A major increase in cigarette
smoking (red line) has caused a dramatic
rise in lung cancer deaths (green line), with
a lag time of about 35 years. Because
global cigarette smoking peaked in 1990,
global lung cancer deaths are expected to
decline after a similar lag. (Data from R.N.
Proctor, Nat. Rev. Cancer 1:82-86, 2001.)

Figure 20-8 Stages of progression in the
development of cancer of the epithelium
of the uterine cervix. Pathologists use
standardized terminology to classify the
types of disorders they see, so as to guide
the choice of treatment. (A) In a stratified
squamous epithelium, dividing cells are
confined to the basal layer. (B) In this
low-grade intraepithelial neoplasia (right
half of image), dividing cells can be found
throughout the lower third of the epithelium;
the superficial cells are still flattened and
show signs of differentiation, but this is
incomplete. (C) In high-grade intraepithelial
neoplasia, cells in all the epithelial layers
are proliferating and exhibit defective
differentiation. (D) True malignancy begins
when the cells move through or destroy the
basal lamina that underlies the basal layer
of epithelium and invade the underlying
connective tissue. (Photographs courtesy
of Andrew J. Connolly.)

Carcinomas and other solid tumors evolve in a similar way (Figure 20-8). 
Although many such cancers in humans are not diagnosed until a relatively late 
stage, in some cases it is possible to observe the earlier steps and, as we shall see 
later, to relate them to specific genetic changes.

Tumor Progression Involves Successive Rounds of Random 
Inherited Change Followed by Natural Selection

From all the evidence, therefore, it seems that cancers arise by a process in which
an initial population of slightly abnormal cells (descendants of a single abnormal
ancestor) evolve from bad to worse through successive cycles of random
inherited change followed by natural selection. Correspondingly, tumors grow in
fits and starts, as additional advantageous inherited changes arise and the cells
bearing them flourish. Tumor progression involves a large element of chance and
usually takes many years, which may be why the majority of us will die of causes
other than cancer.

At each stage of progression, some individual cell acquires an additional mutation
or epigenetic change that gives it a selective advantage over its neighbors,
making it better able to thrive in its environment—an environment that, inside
a tumor, may be harsh, with low levels of oxygen, scarce nutrients, and the natural
barriers to growth presented by the surrounding normal tissues. The larger
the number of tumor cells, the higher the chance that at least one of them will
undergo a change that favors it over its neighbors. Thus, as the tumor grows,
progression accelerates. The offspring of the best-adapted cells continue to divide,
eventually producing the dominant clones in the developing lesion (Figure 20-9).
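
The scaling argument above can be made concrete with a small calculation. The sketch below is illustrative only. It assumes that each of N tumor cells independently acquires an advantageous change with the same small probability p, so that the chance of at least one such change is 1 - (1 - p)^N. This independence assumption, the function name, and the numerical values are ours and are not taken from measurements.

```python
import math


def chance_of_at_least_one_change(n_cells: int, p_per_cell: float) -> float:
    """Probability that at least one of n_cells acquires an advantageous change.

    Hypothetical model: cells change independently, each with the same
    probability p_per_cell, which gives 1 - (1 - p)^N.
    """
    if n_cells < 0:
        raise ValueError("n_cells must be non-negative")
    if not 0.0 <= p_per_cell < 1.0:
        raise ValueError("p_per_cell must be in [0, 1)")
    # log1p/expm1 keep precision when p is tiny and N is large.
    return -math.expm1(n_cells * math.log1p(-p_per_cell))


if __name__ == "__main__":
    p = 1e-9  # illustrative value only, not a measured rate
    for n in (10**6, 10**8, 10**10):
        print(f"N = {n:.0e}: {chance_of_at_least_one_change(n, p):.4f}")
```

Under these assumptions the probability rises steeply with population size, which is one way to read the statement that progression accelerates as the tumor grows.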

Just as in the evolution of plants and animals, a kind of speciation often occurs:
the original cancer cell lineage can diversify to give many genetically different
vigorous subclones of cells. These may coexist in the same mass of tumor tissue; or
they may migrate and colonize separate environments suited to their individual
quirks, where they settle, thrive, and progress as independently evolving
metastases. As new mutations arise within each tumor mass, different subclones may
gain an advantage and come to predominate, only to be overtaken by others or
outgrown by their own sub-subclones. The increasing genetic diversity as a cancer
progresses is one of the chief factors that make cures difficult.

Figure 20-9 Clonal evolution. In this schematic diagram, a tumor develops
through repeated rounds of mutation and proliferation, giving rise eventually
to a clone of fully malignant cancer cells. At each step, a single cell
undergoes a mutation that either enhances cell proliferation or decreases
cell death, so that its progeny become the dominant clone in the tumor.
Proliferation of each clone hastens the occurrence of the next step of tumor
progression by increasing the size of the cell population that is at risk of
undergoing an additional mutation. The final step depicted here is invasion
through the basement membrane, an initial step in metastasis. In reality, there
are more than the three steps shown here, and a combination of genetic and
epigenetic changes are involved. Not shown here is the fact that, over time, a
variety of competing subclones will often arise in a tumor. As we will discuss
later, this heterogeneity complicates cancer therapies (see Figure 20-30).

Human Cancer Cells Are Genetically Unstable

Most human cancer cells accumulate genetic changes at an abnormally rapid 
rate and are said to be genetically unstable. The extent of this instability and its 
molecular origins differ from cancer to cancer and from patient to patient, as we 
shall discuss in a later section. The basic phenomenon was evident even before
modern molecular analyses. For example, the cells of many cancers show grossly
abnormal sets of chromosomes, with duplications, deletions, and translocations
that are visible at mitosis (Figure 20-10). When the cells are maintained in culture,
these patterns of chromosomal disruption can often be seen to evolve rapidly
and in a seemingly haphazard way. And for many years, pathologists have
used an abnormal appearance of the cell nucleus to identify and classify cancer
cells in tumor biopsies; in particular, cancer cells can contain an unusually large
amount of heterochromatin—a condensed form of interphase chromatin that
silences genes (see pp. 194-195). This suggested that epigenetic changes of chromatin
structure can also contribute to the cancer cell phenotype, as recently confirmed
by molecular analysis.

The genetic instability observed in cancer cells can arise from defects in the 
ability to repair DNA damage or to correct replication errors of various kinds.
These alterations lead to changes in DNA sequence and produce rearrangements
such as DNA translocations and duplications. Also common are defects in chromosome
segregation during mitosis, which provide another possible source of
chromosome instability and changes in karyotype.

From an evolutionary perspective, none of this should be a surprise: anything 
that increases the probability of random changes in gene function heritable from 
one cell generation to the next—and that is not too deleterious—is likely to speed
the evolution of a clone of cells toward malignancy, thereby causing this property
to be selected for during tumor progression. 




Figure 20-10 Chromosomes from a 
breast tumor displaying abnormalities 
in structure and number. Chromosomes 
were prepared from a breast tumor cell in 
metaphase, spread on a glass slide, and 
stained with (A) a general DNA stain or 
(B) a combination of fluorescently labeled 
DNA molecules that color each normal
human chromosome differently (see Figure
4-10). The staining (displayed in false color)
shows multiple translocations, including a
doubly translocated chromosome (white
arrow) that is made up of two pieces of
chromosome 8 (green-brown) and a piece 
of chromosome 17 (purple). The karyotype 
also contains 48 chromosomes, instead 
of the normal 46. (Courtesy of Joanne 
Davidson and Paul Edwards.) 











Cancer Cells Display an Altered Control of Growth 

Mutability and large cell population numbers create the opportunities for mutations
to occur, but the driving force for development of a cancer has to come from
some sort of selective advantage possessed by the mutant cells. Most obviously,
a mutation or epigenetic change can confer such an advantage by increasing the
rate at which a clone of cells proliferates or by enabling it to continue proliferating
when normal cells would stop. Cancer cells that can be grown in culture, or cultured
cells artificially engineered to contain the types of mutations encountered
in cancers, typically show a transformed phenotype. They are abnormal in their
shape, their motility, their responses to growth factors in the culture medium,
and, most characteristically, in the way they react to contact with the substratum
and with one another. Normal cells will not divide unless they are attached to the
substratum; transformed cells will often divide even if held in suspension. Normal
cells become inhibited from moving and dividing when the culture reaches confluence
(where the cells are touching one another); transformed cells continue
moving and dividing even after confluence, and so pile up in layer upon layer in
the culture dish (Figure 20-11). In addition, transformed cells no longer require
all of the positive signals from their surroundings that normal cells require.

Their behavior in culture gives a hint of the ways in which cancer cells may 
misbehave in their natural environment, embedded in a tissue. But cancer cells in 
the body show other peculiarities that mark them out from normal cells, beyond 
those just described. 

Cancer Cells Have an Altered Sugar Metabolism

Given sufficient oxygen, normal adult tissue cells will generally fully oxidize
almost all the carbon in the glucose they take up to CO2, which is lost from the
body as a waste product. A growing tumor needs nutrients in abundance to provide
the building blocks to make new macromolecules. Correspondingly, most
tumors have a metabolism more similar to that of a growing embryo than to that
of normal adult tissue. Tumor cells consume glucose avidly, importing it from the
blood at a rate that can be as much as 100 times higher than neighboring normal
cells. Moreover, only a small fraction of this imported glucose is used for
production of ATP by oxidative phosphorylation. Instead, a great deal of lactate 
is produced, and many of the remaining carbon atoms derived from glucose are 
diverted for use as raw materials for synthesis of the proteins, nucleic acids, and 
lipids required for tumor growth (Figure 20-12). 

This tendency of tumor cells to de-emphasize oxidative phosphorylation even 
when oxygen is plentiful, while at the same time taking up large quantities of 
glucose, can be shown to promote cancer cell growth and is called the Warburg

Figure 20-11 Loss of contact inhibition
by cancer cells in cell culture. Most
normal cells stop proliferating once they
have carpeted the dish with a single layer
of cells: proliferation seems to depend
on contact with the dish, and to be
inhibited by contacts with other cells—a
phenomenon known as “contact inhibition.”
Cancer cells, in contrast, usually disregard 
these restraints and continue to grow, so 
that they pile up on top of one another, 
as shown (Movie 20.2). (A) Schematic 
drawing. (B and C) Light micrographs of 
normal (B) and transformed (C) fibroblasts. 
(B and C, courtesy of Lan Bo Chen.)

Figure 20-12 The Warburg effect in
tumor cells reflects a dramatic change in
glucose uptake and sugar metabolism.
(A) Cells that are not proliferating will
normally oxidize nearly all of the glucose
that they import from the blood to produce 
ATP through the oxidative phosphorylation 
that takes place in their mitochondria. Only 
when deprived of oxygen will these cells 
generate most of their ATP from glycolysis, 
converting the pyruvate produced to lactate 
in order to regenerate the NAD+ that they 
need to keep glycolysis going (see Figure 
2-47). (B) Tumor cells, by contrast, will 
generally produce abundant lactate even 
in the presence of oxygen. This results 
from a greatly increased rate of glycolysis 
that is fed by a very large increase in the 
rate of glucose import. In this way, tumor 
cells resemble the rapidly proliferating 
cells in embryos (and during tissue repair), 
which likewise require for biosynthesis a 
large supply of the small-molecule building 
blocks that can be produced from imported 
glucose (see also Figure 20-26). 


Cancer Cells Have an Abnormal Ability to Survive Stress and DNA
Damage 

In a large multicellular organism, there are powerful safety mechanisms that 
guard against the trouble that can be caused by damaged and deranged cells. For 
example, internal disorder gives rise to danger signals in the faulty cell, activating
protective devices that can eventually lead to apoptosis (see Chapter 18). To
survive, cancer cells require additional mutations to elude or break through these
defenses against cellular misbehavior. 

Cancer cells are found to contain mutations that drive the cell into an abnormal
state, where metabolic processes may be unbalanced and essential cell components
may be produced in ill-matched proportions. States of this type, where the
cell's homeostatic mechanisms are inadequate to cope with an imposed disturbance,
are loosely referred to as states of cell stress. As one example, chromosome
breakage and other forms of DNA damage are commonly observed during the
development of cancer, reflecting the genetic instability that cancer cells display.
Thus, to survive and divide without limit, a prospective cancer cell must accumulate
mutations that disable the normal safety mechanisms that would otherwise
induce a cell that is stressed, in this or in other ways, to commit suicide. In fact, 
one of the most important properties of many types of cancer cells is that they fail 
to undergo apoptosis when a normal cell would do so (Figure 20-13). 

While cancer cells tend to avoid apoptosis, this does not mean that they rarely 
die. On the contrary, in the interior of a large solid tumor, cell death often occurs 
on a massive scale: living conditions are difficult, with severe competition among
the cancer cells for oxygen and nutrients. Many die, but typically much more by 
necrosis than by apoptosis (Figure 20-14). The tumor grows because the cell birth 
rate outpaces the cell death rate, but often by only a small margin. For this reason, 
the time that a tumor takes to double in size can be far longer than the cell-cycle 
time of the tumor cells. 
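
The relationship between cell-cycle time and tumor doubling time can be illustrated with a short calculation. This is a minimal sketch under assumptions of our own: the cell population grows exponentially, every cell divides once per cell cycle, cells die at a constant rate, and tumor size is proportional to cell number. Under those assumptions the doubling time is ln 2 / (b - d), where b and d are the birth and death rates. The function name and the example values are hypothetical.

```python
import math


def tumor_doubling_time(cycle_time_days: float, death_fraction: float) -> float:
    """Doubling time (days) of a cell population under simple exponential growth.

    cycle_time_days: cell-cycle time; every cell is assumed to divide once per cycle.
    death_fraction:  death rate expressed as a fraction of the birth rate (0 <= f < 1).
    """
    if cycle_time_days <= 0:
        raise ValueError("cycle_time_days must be positive")
    if not 0.0 <= death_fraction < 1.0:
        raise ValueError("death_fraction must be in [0, 1) for the tumor to grow")
    birth_rate = math.log(2) / cycle_time_days   # divisions per cell per day
    death_rate = death_fraction * birth_rate     # deaths per cell per day
    return math.log(2) / (birth_rate - death_rate)


if __name__ == "__main__":
    # Illustrative values only: a 1-day cell cycle with 90% of births offset by deaths.
    print(tumor_doubling_time(1.0, 0.9))
```

In this toy model, when deaths offset nine-tenths of births the population takes about ten cell cycles to double, consistent with the small margin described above.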

effect—so named because Otto Warburg first noticed the phenomenon in the early
twentieth century. It is this abnormally high glucose uptake that allows tumors to
be selectively imaged in whole-body scans (see Figure 20-1), thereby providing a
way to monitor cancer progression and responses to treatment.

Figure 20-13 Both increased cell
division and decreased apoptosis can
contribute to tumorigenesis. In normal
tissues, apoptosis balances cell division
to maintain homeostasis (see Movie 18.1).
During the development of cancer, either
an increase in cell division or an inhibition
of apoptosis can lead to the increased
cell numbers important for tumorigenesis.
The cells fated to undergo apoptosis are
gray in this diagram. Both an increase in
cell division and a decrease in apoptosis
normally contribute to tumor growth.

Human Cancer Cells Escape a Built-in Limit to Cell Proliferation

Many normal human cells have a built-in limit to the number of times they can
divide when stimulated to proliferate in culture: they permanently stop dividing
after a certain number of population doublings (25-50 for human fibroblasts,
for example). This cell-division-counting mechanism is termed replicative
cell senescence, and it generally depends on the progressive shortening of the
telomeres at the ends of chromosomes, a process that eventually changes their
structure (discussed in Chapter 17). As discussed in Chapter 5, the replication of
telomere DNA during S phase depends on the enzyme telomerase, which maintains
a special telomeric DNA sequence that promotes the formation of protein
cap structures to protect chromosome ends. Because many proliferating human
cells (stem cells being an exception) are deficient in telomerase, their telomeres
shorten with every division, and their protective caps deteriorate, creating a DNA
damage signal. Eventually, the altered chromosome ends can trigger a permanent
cell-cycle arrest, causing a normal cell to die.
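
The division-counting idea can be sketched as a toy model. Everything numerical here is an assumption made for illustration: a fixed loss of telomeric DNA per division, a fixed critical length below which arrest is triggered, and telomerase modeled as simply holding the length constant. The function and parameter names are hypothetical, and real telomere dynamics are more variable than this.

```python
from typing import Optional


def divisions_before_arrest(telomere_bp: int,
                            loss_per_division_bp: int,
                            critical_bp: int,
                            telomerase_active: bool = False) -> Optional[int]:
    """Count the divisions a cell lineage completes before telomeres become critical.

    Returns None when telomerase is active, because in this toy model the
    telomere length is then maintained and no division limit applies.
    """
    if loss_per_division_bp <= 0:
        raise ValueError("loss_per_division_bp must be positive")
    if telomerase_active:
        return None
    divisions = 0
    # Divide only while the next division keeps the telomere at or above the threshold.
    while telomere_bp - loss_per_division_bp >= critical_bp:
        telomere_bp -= loss_per_division_bp
        divisions += 1
    return divisions


if __name__ == "__main__":
    # Illustrative numbers only, chosen to give a limit inside the 25-50 range cited above.
    print(divisions_before_arrest(10_000, 100, 5_000))
    print(divisions_before_arrest(10_000, 100, 5_000, telomerase_active=True))
```

With these made-up values the lineage stops after 50 divisions, whereas the telomerase-positive case has no limit in the model.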

Human cancer cells avoid replicative cell senescence in one of two ways. They
can maintain the activity of telomerase as they proliferate, so that their telomeres
do not shorten or become uncapped, or they can evolve an alternate mechanism
based on homologous recombination (called ALT) for elongating their chromosome
ends. Regardless of the strategy used, the result is that the cancer cells continue
to proliferate under conditions when normal cells would stop.

The Tumor Microenvironment Influences Cancer Development 

While the cancer cells in a tumor are the bearers of dangerous mutations and
are often grossly abnormal, the other cells in the tumor—especially those of the
supporting connective tissue, or stroma—are far from passive bystanders. The
development of a tumor relies on a two-way communication between the tumor
cells and the tumor stroma, just as the normal development of epithelial organs
relies on communication between epithelial cells and mesenchymal cells (discussed
in Chapter 22).

Figure 20-14 Cross-section of a colon
adenocarcinoma that has metastasized
to the lung. This tissue slice shows
well-differentiated colorectal cancer cells
forming cohesive glands in the lung.
The metastasis has central pink areas of
necrosis where dying cancer cells have
outgrown their blood supply. Such anoxic
regions are common in the interior of large
tumors. (Courtesy of Andrew J. Connolly.)

The stroma provides a framework for the tumor. It is composed of normal connective
tissue containing fibroblasts and inflammatory white blood cells, as well
as the endothelial cells that form blood and lymphatic vessels with their attendant
pericytes and smooth muscle cells (Figure 20-15). As a carcinoma progresses, the
cancer cells induce changes in the stroma by secreting signal proteins that alter
the behavior of the stromal cells, as well as proteolytic enzymes that modify the
extracellular matrix. The stromal cells in turn act back on the tumor cells, secreting
signal proteins that stimulate cancer cell growth and division as well as proteases
that further remodel the extracellular matrix. In these ways, the tumor and
its stroma evolve together, like weeds and the ecosystem that they invade, and
the tumor becomes dependent on its particular stromal cells. Experiments using
mice indicate that the growth of some transplanted carcinomas depends on the
tumor-associated fibroblasts, and normal fibroblasts will not do. Such environmental
requirements help to protect us from cancer, as we discuss next in considering
the critical phenomenon called metastasis.

Cancer Cells Must Survive and Proliferate in a Foreign Environment 

Cancer cells generally need to spread and multiply at new sites in the body in 
order to kill us, through a process called metastasis. This is the most deadly—and 
least understood—aspect of cancer, being responsible for 90% of cancer-associated
deaths. By spreading through the body, a cancer becomes almost impossible
to eradicate by either surgery or local irradiation. Metastasis is itself a multistep
process: the cancer cells first have to invade local tissues and vessels, move
through the circulation, leave the vessels, and then establish new cellular colonies 
at distant sites (Figure 20-16). Each of these events is complex, and most of the 
molecular mechanisms involved are not yet clear. 

For a cancer cell to become dangerous, it must break free of constraints that
keep normal cells in their proper places and prevent them from invading neighboring
tissues. Invasiveness is thus one of the defining properties of malignant
tumors, which show a disorganized pattern of growth and ragged borders, with
extensions into the surrounding tissue (see, for example, Figure 20-8). Although
the underlying molecular changes are not well understood, invasiveness almost
certainly requires a disruption of the adhesive mechanisms that normally keep
cells tethered to their proper neighbors and to the extracellular matrix. For carcinomas,
this change resembles the epithelial-mesenchymal transition (EMT) that
occurs in some epithelial tissues during normal development (see p. 1042).

The next step in metastasis—the establishment of colonies in distant organs—
begins with entry into the circulation: the invasive cancer cells must penetrate the
wall of a blood or lymphatic vessel. Lymphatic vessels, being larger and having
more flimsy walls than blood vessels, allow cancer cells to enter in small clumps;
such clumps may then become trapped in lymph nodes, giving rise to lymph-node
metastases. The cancer cells that enter blood vessels, in contrast, seem to
do so singly. With modern techniques for sorting cells according to their surface
properties, it has become possible in some cases to detect these circulating tumor
cells (CTCs) in samples of blood from cancer patients, even though they are only a
minute fraction of the total blood-cell population. These cells, in principle at least,
provide a useful sample of the tumor-cell population for genetic analysis.

Figure 20-15 The tumor
microenvironment plays a role in
tumorigenesis. Tumors consist of
many cell types, including cancer cells,
endothelial cells, pericytes (vascular
smooth muscle cells), fibroblasts,
and inflammatory white blood cells.
Communication among these and other
cell types plays an important part in tumor
development. Note, however, that only the
cancer cells are thought to be genetically
abnormal in a tumor.

Figure 20-16 Steps in the process of metastasis. This example illustrates the spread of a tumor from an organ such as the
bladder to the liver. Tumor cells may enter the bloodstream directly by crossing the wall of a blood vessel, as diagrammed here,
or, more commonly perhaps, by crossing the wall of a lymphatic vessel that ultimately discharges its contents (lymph) into the
bloodstream. Tumor cells that have entered a lymphatic vessel often become trapped in lymph nodes along the way, giving rise
to lymph-node metastases.

Studies in animals show that typically far fewer than one in every thousand malignant tumor cells that enter the bloodstream
will colonize a new tissue so as to produce a detectable tumor at a new site.
(Panel labels: cells grow as benign tumor in epithelium; cells become invasive and enter capillary.)

Of the cancer cells that enter the lymphatics or bloodstream, only a tiny proportion
succeed in making their exit, settling in new sites, and surviving and proliferating
there as founders of metastases. Experiments show that fewer than one
in thousands, perhaps one in millions, manage this feat. The final step of colonization
seems to be the most difficult: like the Vikings who landed on the inhospitable
shores of Greenland, the migrant cells may fail to survive in the alien environment;
or they may only thrive there for a short while to found a little colony—a
micrometastasis—that then dies out (Movie 20.3).

Many cancers are discovered before they have managed to found metastatic
colonies and can be cured by destruction of the primary tumor. But on occasion,
an undetected micrometastasis will remain dormant for many years, only to
reveal its presence by erupting into growth to form a large secondary tumor long
after the primary tumor has been removed.

Many Properties Typically Contribute to Cancerous Growth 

Clearly, to produce a cancer, a cell must acquire a range of aberrant properties—a 
collection of subversive new skills—as it evolves. Different cancers require different
combinations of these properties. Nevertheless, cancers all share some common
features. By definition, they all ignore or misinterpret normal social controls
so as to proliferate and spread where normal cells would not. These defining properties
are commonly combined with other features that help the miscreants to
arise and thrive. A list of the key attributes of cancer cells in general would include
the following, all of which we have just discussed:

1. They grow (biosynthesize) when they should not, aided by a metabolism
shifted from oxidative phosphorylation toward aerobic glycolysis.

2. They go through the cell-division cycle when they should not.

3. They escape from their home tissues (that is, they are invasive) and survive
and proliferate in foreign sites (that is, they metastasize).

4. They have abnormal stress responses, enabling them to survive and continue
dividing in conditions of stress that would arrest or kill normal cells,
and they are less prone than normal cells to commit suicide by apoptosis.

5. They are genetically and epigenetically unstable.

6. They escape replicative cell senescence, either by producing telomerase or
by acquiring another way of stabilizing their telomeres.

In the next section of the chapter, we examine the mutations and molecular 
mechanisms that underlie these and other properties of cancer cells. 

Summary 

Cancer cells, by definition, grow and proliferate in defiance of normal controls (that
is, they are neoplastic) and are able to invade surrounding tissues and colonize distant
organs (that is, they are malignant). By giving rise to secondary tumors, or
metastases, they become difficult to eradicate by surgery or local irradiation. Cancers
are thought to originate from a single cell that has experienced an initial mutation,
but the progeny of this cell must undergo many further changes, requiring
additional mutations and epigenetic events, to become cancerous. Tumor progression
usually takes many years and reflects the operation of a Darwinian-like process
of evolution, in which somatic cells undergo mutation and epigenetic changes
accompanied by natural selection.

Cancer cells acquire a variety of special properties as they evolve, multiply, and
spread. Their mutant genomes enable them to grow and divide in defiance of the
signals that normally keep cell proliferation under tight control. As part of the evolutionary
process of tumor progression, cancer cells acquire a collection of additional
abnormalities, including defects in the controls that permanently stop cell
division or induce apoptosis in response to cell stress or DNA damage, and in the
mechanisms that normally keep cells from straying from their proper place. All of
these changes increase the ability of cancer cells to survive, grow, and divide in their
original tissue and then to metastasize, founding new colonies in foreign environments.
The evolution of a tumor also depends on other cells present in the tumor
microenvironment, collectively called stromal cells, that the cancer attracts and
manipulates.

Since many changes are needed to confer this collection of asocial behaviors, it is
not surprising that most cancer cells are genetically and/or epigenetically unstable.
This instability is thought to be selected for in the clones of aberrant cells that are
able to produce tumors, because it greatly accelerates the accumulation of the further
genetic and epigenetic changes that are required for tumor progression.

CANCER-CRITICAL GENES: HOW THEY ARE FOUND
AND WHAT THEY DO

As we have seen, cancer depends on the accumulation of inherited changes in
somatic cells. To understand it at a molecular level we need to identify the mutations
and epigenetic changes involved and to discover how they give rise to cancerous
cell behavior. Finding the relevant cells is often easy; they are favored by
natural selection and call attention to themselves by giving rise to tumors. But
how do we identify those genes with the cancer-promoting changes among all
the other genes in the cancerous cells? A typical cancer depends on a whole set
of mutations and epigenetic changes—usually a somewhat different set in each
individual patient. In addition, a given cancer cell will also contain a large number
of somatic mutations that are accidental by-products—so-called passengers
rather than drivers—of its genetic instability, and it can be difficult to distinguish
these meaningless changes from those changes that have a causative role in the
disease. Despite these difficulties, many of the genes that are repeatedly altered
in human cancers have been identified over the past 40 years. We will call such
genes, for want of a better term, cancer-critical genes, meaning all genes whose
alteration contributes to the causation or evolution of cancer by driving tumorigenesis.

In this section, we shall first discuss how cancer-critical genes are identified. 
We shall then examine their functions and the parts they play in conferring on 
cancer cells the properties outlined in the first part of the chapter. We shall end 
the section by discussing colon cancer as an extended example, showing how a 
succession of changes in cancer-critical genes enables a tumor to evolve from one 
pattern of bad behavior to another that is worse. 

The Identification of Gain-of-Function and Loss-of-Function 
Cancer Mutations Has Traditionally Required Different Methods 

Cancer-critical genes are grouped into two broad classes, according to whether the 
cancer risk arises from too much activity of the gene product or too little. Genes of
the first class, in which a gain-of-function mutation can drive a cell toward cancer,
are called proto-oncogenes; their mutant, overactive or overexpressed forms are
called oncogenes. Genes of the second class, in which a loss-of-function mutation
can contribute to cancer, are called tumor suppressor genes. In either case,
the mutation may lead toward cancer directly (by causing cells to proliferate
when they should not) or indirectly—for example, by causing genetic or epigenetic
instability and so hastening the occurrence of other inherited changes that
directly stimulate tumor growth. Those genes whose alteration results in genomic 
instability represent a subclass of cancer-critical genes that are sometimes called 
genome maintenance genes. 

As we shall see, mutations in oncogenes and tumor suppressor genes can
have similar effects in promoting the development of cancer; overproduction of a
signal for cell proliferation, for example, can result from either kind of mutation.
Thus, from the point of view of a cancer cell, oncogenes and tumor suppressor 
genes—and the mutations that affect them—are flip sides of the same coin. The 
techniques that led to the discovery of these two categories of genes, however, are 
quite different. 

The mutation of a single copy of a proto-oncogene that converts it to an oncogene
has a dominant, growth-promoting effect on a cell (Figure 20-17A). Thus, we
can identify the oncogene by its effect when it is added—by DNA transfection, for
example, or through infection with a viral vector—to the genome of a suitable type
of tester cell or experimental animal. In the case of the tumor suppressor gene, on
the other hand, the cancer-causing alleles produced by the change are generally
recessive: often (but not always) both copies of the normal gene must be removed
or inactivated in the diploid somatic cell before an effect is seen (Figure 20-17B).
This calls for a different experimental approach, one focusing on discovering what
is missing in the cancer cell.

We begin by discussing some examples of each class of cancer-critical genes
to illustrate basic principles. These examples are chosen also for their historical
importance: the experiments that led to their discovery—at different times and by
different methods—marked turning points in the understanding of cancer.
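
The dominant and recessive behavior of the two gene classes described above can be summarized as a simple rule. The sketch below is a deliberate simplification for a diploid cell: it treats a proto-oncogene as contributing once any one copy carries an activating mutation, and a tumor suppressor gene as contributing only when both copies are inactivated. As noted above, the second rule holds often but not always. The function name and the string labels are hypothetical.

```python
def can_contribute_to_cancer(gene_class: str, altered_copies: int) -> bool:
    """Simplified dominant/recessive rule for a gene in a diploid somatic cell.

    gene_class:     "proto-oncogene" (copies altered by gain-of-function mutation)
                    or "tumor suppressor" (copies lost or inactivated).
    altered_copies: how many of the two gene copies are altered (0, 1, or 2).
    """
    if altered_copies not in (0, 1, 2):
        raise ValueError("a diploid cell has two copies; expected 0, 1, or 2")
    if gene_class == "proto-oncogene":
        # Dominant: one activated copy is enough.
        return altered_copies >= 1
    if gene_class == "tumor suppressor":
        # Recessive (simplified): both copies must be lost or inactivated.
        return altered_copies == 2
    raise ValueError(f"unknown gene class: {gene_class!r}")


if __name__ == "__main__":
    for gene_class in ("proto-oncogene", "tumor suppressor"):
        for copies in (0, 1, 2):
            print(gene_class, copies, can_contribute_to_cancer(gene_class, copies))
```

The asymmetry at one altered copy is what lets an oncogene be found by adding it to a tester cell, whereas a tumor suppressor gene has to be found by looking for what is missing.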

Retroviruses Can Act as Vectors for Oncogenes That Alter Cell 
Behavior 

The search for the genetic causes of human cancer took a devious route, beginning
with clues that came from the study of tumor viruses. Although viruses are
involved only in a minority of human cancers, a set of viruses that infect animals
provided critical early tools for studying cancer. 

One of the first animal viruses to be implicated in cancer was discovered over
100 years ago in chickens, when an infectious agent that causes connective-tissue
tumors, or sarcomas, was characterized as a virus—the Rous sarcoma virus. Like
all the other RNA tumor viruses discovered since, it is a retrovirus. When it infects
a cell, its RNA genome is copied into DNA by reverse transcription, and the DNA
is inserted into the host genome, where it can persist and be inherited by subsequent
generations of cells. Something in the DNA inserted by the Rous sarcoma
virus made the host cells cancerous, but what was it? The answer was a surprise.
It turned out to be a piece of DNA that was unnecessary for the virus's own survival
or reproduction; instead, it was a passenger, a gene called v-Src, that the virus
had picked up on its travels. v-Src was unmistakably similar, but not identical, to a
gene—c-Src—that was discovered in the normal vertebrate genome. c-Src had evidently
been caught up accidentally by the retrovirus from the genome of a previously
infected host cell, and it had undergone mutation in the process to become
an oncogene (v-Src).

This Nobel Prize-winning finding was followed by a flood of discoveries of
other viral oncogenes carried by retroviruses that cause cancer in nonhuman animals.
Each such oncogene turned out to have a counterpart proto-oncogene in
the normal vertebrate genome. As was the case for Src, these other oncogenes
generally differed from their normal counterparts, either in structure or in level of
expression. But how did this relate to typical human cancers, most of which are
not infectious and in which retroviruses play no part?


Figure 20-17 Cancer-critical mutations
fall into two readily distinguishable
categories, dominant and recessive.
In this diagram, activating mutations
are represented by solid red boxes,
inactivating mutations by hollow red
boxes. (A) Oncogenes act in a dominant
manner: a gain-of-function mutation in a
single copy of the cancer-critical gene can
drive a cell toward cancer. (B) Mutations
in tumor suppressor genes, on the other
hand, generally act in a recessive manner:
the function of both alleles of the cancer-critical
gene must be lost to drive a cell
toward cancer. Although in this diagram
the second allele of the tumor suppressor
gene is inactivated by mutation, it is often
inactivated instead by loss of the second
chromosome. Not shown is the fact that
mutation of some tumor suppressor genes
can have an effect even when only one of
the two gene copies is damaged.

Different Searches for Oncogenes Converged on the Same
Gene—Ras

In an attempt to answer the above question, other researchers searched directly
for oncogenes in the genomes of human cancer cells. They did this by searching
for DNA fragments from cancer cells that could provoke uncontrolled proliferation
when introduced into noncancerous cell lines. As tester cells for the assay,
cell lines derived from mouse fibroblasts were used. These cells had been previously
selected for their ability to proliferate indefinitely in culture, and they
are thought to already contain alterations that take them part of the way toward
malignancy. For this reason, the addition of a single oncogene can sometimes be
enough to produce a dramatic effect.

When DNA was extracted from the human tumor cells, broken into fragments,
and introduced into the cultured cells, occasional colonies of abnormally proliferating
cells began to appear in the culture dish. These cells showed a transformed
phenotype, outgrowing the untransformed cells in the culture and piling up in
layer upon layer (see Figure 20-11). Each colony was a clone originating from a
single cell that had incorporated a DNA fragment that drove cancerous behavior.
This fragment, which carried markers of its human origin, could be isolated from
the transformed cultured mouse cells. And once isolated and sequenced, it could
be recognized: it contained a human version of a gene already known from study
of a retrovirus that caused tumors in rats—an oncogene called v-Ras.

The newly discovered oncogene was clearly derived by mutation from a normal
human gene, one of a small family of proto-oncogenes called Ras. This discovery
in the early 1980s of the same oncogene in human tumor cells and in an
animal tumor virus was electrifying. The implication that cancers are caused by
mutations in a limited number of cancer-critical genes transformed our understanding
of the molecular biology of cancer.

As discussed in Chapter 15, normal Ras proteins are monomeric GTPases that
help transmit signals from cell-surface receptors to the cell interior (see Movie
15.7). The Ras oncogenes isolated from human tumors contain point mutations
that create a hyperactive Ras protein that cannot shut itself off by hydrolyzing its
bound GTP to GDP. Because this makes the protein hyperactive, its effect is dominant—that
is, only one of the cell's two gene copies needs to change to have an
effect. One or another of the three human Ras family members is mutated in perhaps
30% of all human cancers. Ras genes are thus among the most important of
all cancer-critical genes.

Genes Mutated in Cancer Can Be Made Overactive in Many Ways 

Figure 20-18 summarizes the types of accidents that can convert a proto-oncogene
into an oncogene. (1) A small change in DNA sequence such as a point
mutation or deletion may produce a hyperactive protein when it occurs within a
protein-coding sequence, or lead to protein overproduction when it occurs within
a regulatory region for that gene. (2) Gene amplification events, such as those that
can be caused by errors in DNA replication, may produce extra gene copies; this
can lead to overproduction of the protein. (3) A chromosomal rearrangement—
involving the breakage and rejoining of the DNA helix—may either change the
protein-coding region, resulting in a hyperactive fusion protein, or alter the control
regions for a gene so that a normal protein is overproduced.

Figure 20-18 The types of accidents that can convert a proto-oncogene into an oncogene.

Figure 20-19 Mutation of the epidermal
growth factor (EGF) receptor can make
it active even in the absence of EGF, and
consequently oncogenic. Only one of the
possible types of activating mutations is
illustrated here.

As one example, the receptor for the extracellular signal protein epidermal
growth factor (EGF) can be activated by a deletion that removes part of its extracellular
domain, causing it to be active even in the absence of EGF (Figure 20-19). It
thus produces an inappropriate stimulatory signal, like a faulty doorbell that rings
even when nobody is pressing the button. Mutations of this type are frequently
found in the most common type of human brain tumor, called glioblastoma.

As another example, the Myc protein, which acts in the nucleus to stimulate
cell growth and division (see Chapter 17), generally contributes to cancer by
being overproduced in its normal form. In some cases, the gene is amplified—
that is, errors of DNA replication lead to the creation of large numbers of gene
copies in a single cell. Or a point mutation can stabilize the protein, which normally
turns over very rapidly. More commonly, the overproduction appears to
be due to a change in a regulatory element that acts on the gene. For example, a
chromosomal translocation can inappropriately bring powerful gene regulatory
sequences next to the Myc protein-coding sequence, so as to produce unusually
large amounts of Myc mRNA. Thus, in Burkitt's lymphoma, a translocation brings
the Myc gene under the control of sequences that normally drive the expression of
antibody genes in B lymphocytes. As a result, the mutant B cells tend to proliferate
excessively and form a tumor. Different specific chromosome translocations are
common in other cancers.

Studies of Rare Hereditary Cancer Syndromes First Identified 
Tumor Suppressor Genes 

Identifying a gene that has been inactivated in the genome of a cancer cell requires
a different strategy from finding a gene that has become hyperactive: one cannot,
for example, use a cell transformation assay to identify something that simply is
not there. The key insight that led to the discovery of the first tumor suppressor
gene came from studies of a rare type of human cancer, retinoblastoma, which
arises from cells in the retina of the eye that are converted to a cancerous state by
an unusually small number of mutations. As often happens in biology, the discovery
arose from examination of a special case, but it turned out to reveal a gene of
widespread importance.

Retinoblastoma occurs in childhood, and tumors develop from neural precursor
cells in the immature retina. About one child in 20,000 is afflicted. One
form of the disease is hereditary, and the other is not. In the hereditary form,
multiple tumors usually arise independently, affecting both eyes; in the nonhereditary
form, only one eye is affected, and by only one tumor. A few individuals
with retinoblastoma have a visibly abnormal karyotype, with a deletion of a specific
band on chromosome 13 that, if inherited, predisposes an individual to the
disease. Deletions of this same region are also encountered in tumor cells from
some patients with the nonhereditary disease, which suggested that the cancer
was caused by loss of a critical gene in that location.

Using the location of this chromosomal deletion, it was possible to clone and
sequence the Rb gene. It was then discovered that those who suffer from the
hereditary form of the disease have a deletion or loss-of-function mutation present
in one copy of the Rb gene in every somatic cell. These cells are predisposed
to becoming cancerous, but do not do so if they retain one good copy of the gene.
The retinal cells that are cancerous are defective in both copies of Rb because of a
somatic event that has eliminated the function of the previously good copy.

In patients with the nonhereditary form of the disease, by contrast, the noncancerous
cells show no defect in either copy of Rb, while the cancerous cells have
become defective in both copies. These nonhereditary retinoblastomas are very
rare because they require two independent events that inactivate the same gene
on two chromosomes in a single retinal cell lineage (Figure 20-20).
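
The scaling behind this two-hit argument can be sketched numerically. The Python fragment below is an illustration only: the number of retinal precursor cells at risk and the probability of inactivating one Rb copy are hypothetical placeholders, not measured values, and the hits are assumed to be independent.

```python
import math

def expected_tumors(n_cells: float, p_hit: float, hits_needed: int) -> float:
    """Expected number of cell lineages that lose every functional Rb copy.

    Assumes each hit is an independent event with the same probability p_hit.
    """
    return n_cells * p_hit ** hits_needed

def prob_at_least_one(expected: float) -> float:
    """Poisson approximation for the chance of one or more tumors."""
    return 1.0 - math.exp(-expected)

# Hypothetical values chosen only to illustrate the scaling.
N_CELLS = 1e7   # retinal precursor cells at risk (assumption)
P_HIT = 1e-7    # chance of inactivating one Rb copy per lineage (assumption)

# Hereditary form: one copy is already defective, so one somatic hit suffices.
hereditary = expected_tumors(N_CELLS, P_HIT, hits_needed=1)
# Nonhereditary form: both copies must be hit in the same lineage.
sporadic = expected_tumors(N_CELLS, P_HIT, hits_needed=2)

print(f"hereditary:    expected = {hereditary:.3g}, P(>=1) = {prob_at_least_one(hereditary):.3g}")
print(f"nonhereditary: expected = {sporadic:.3g}, P(>=1) = {prob_at_least_one(sporadic):.3g}")
```

With these placeholder numbers, a single hit in a predisposed retina is expected about once, whereas two coincident hits in one lineage are many orders of magnitude less likely. Only the contrast between the two cases is meaningful here, not the absolute values.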

The Rb gene is also missing in several common types of sporadic cancer,
including carcinomas of lung, breast, and bladder. These more common cancers
arise by a more complex series of genetic changes than does retinoblastoma, and
they make their appearance much later in life. But in all of them, it seems, loss of
Rb function is frequently a major step in the progression toward malignancy.

The Rb gene encodes the Rb protein, which is a universal regulator of the cell
cycle present in almost all cells of the body (see Figure 17-61). It acts as one of
the main brakes on progress through the cell-division cycle, and its loss can allow
cells to enter the cell cycle inappropriately, as we discuss later.


Figure 20-20 The genetic mechanisms
that cause retinoblastoma. In the
hereditary form, all cells in the body lack
one of the normal two functional copies of
the Rb tumor suppressor gene, and tumors
occur where the remaining copy is lost
or inactivated by a somatic event (either
mutation or epigenetic silencing). In the
nonhereditary form, all cells initially contain
two functional copies of the gene, and the
tumor arises because both copies are lost
or inactivated through the coincidence of
two somatic events in a single line of cells.


Both Genetic and Epigenetic Mechanisms Can Inactivate Tumor 
Suppressor Genes 

For tumor suppressor genes, it is their inactivation that is dangerous. This inactivation
can occur in many ways, with different combinations of mishaps serving
to eliminate or cripple both gene copies. The first copy may, for example, be lost
by a small chromosomal deletion or inactivated by a point mutation. The second
copy is commonly eliminated by a less specific and more probable mechanism:
the chromosome carrying the remaining normal copy may be lost from the cell
through errors in chromosome segregation; or the normal gene, along with
neighboring genetic material, may be replaced by a mutant version through
either a mitotic recombination event or a gene conversion that accompanies it (see
p. 286).

Figure 20-21 Six ways of losing the
remaining good copy of a tumor
suppressor gene through a change in
DNA sequences. A cell that is defective
in only one of its two copies of a tumor
suppressor gene—for example, the Rb
gene—usually behaves as a normal,
healthy cell; the diagrams below show how
this cell may lose the function of the other
gene copy as well and thereby progress
toward cancer. A seventh possibility,
frequently encountered with some tumor
suppressors, is that the gene may be
silenced by an epigenetic change, without
alteration of the DNA sequence,
as illustrated in Figure 20-22. (After
W.K. Cavenee et al., Nature 305:779-784,
1983. With permission from Macmillan
Publishers Ltd.)

Figure 20-21 summarizes the range of ways in which the remaining good copy
of a tumor suppressor gene can be lost through a DNA sequence change, using the
Rb gene as an example. It is important to note that, except for the point mutation
mechanism illustrated at the far right, these pathways all produce cells that carry
only a single type of DNA sequence in the chromosomal region containing their
Rb genes—a sequence that is identical to the sequence in the original mutant
chromosome.

Epigenetic changes provide another important way to permanently inactivate
a tumor suppressor gene. Most commonly, the gene may become packaged into
heterochromatin and/or the C nucleotides in CG sequences in its promoter may
become methylated in a heritable manner (see pp. 404-405). These mechanisms
can irreversibly silence the gene in a cell and in all of its progeny. Analysis of methylation
patterns in cancer genomes shows that epigenetic gene silencing is a frequent
event in tumor progression, and epigenetic mechanisms are now thought
to help inactivate several different tumor suppressor genes in most human cancers
(Figure 20-22).

Systematic Sequencing of Cancer Cell Genomes Has Transformed 
Our Understanding of the Disease 

Methods such as those we have described above shone a spotlight on a set of cancer-critical
genes that were identified in a piecemeal fashion. Meanwhile, the rest
of the cancer cell genome remained in darkness: it was a mystery how many other
mutations might lurk there, of what types, in which varieties of cancer, at what
frequencies, with what variations from patient to patient, and with what consequences.
With the sequencing of the human genome and the dramatic advances
in DNA sequencing technology (see Panel 8-1, pp. 478-481), it has become possible
to see the whole picture—to view cancer cell genomes in their entirety. This
transforms our understanding of the disease.

Cancer cell genomes can be scanned systematically in several different ways.
At one extreme—the most costly, but no longer prohibitively so—one can determine
a tumor's complete genome sequence. More cheaply, one can focus just on
the 21,000 or so genes in the human genome that code for protein (the so-called
exome), looking for mutations in the cancer cell DNA that alter the amino acid
sequence of the product or prevent its synthesis (Figure 20-23). There are also
efficient techniques to survey the genome for regions that have undergone
deletion or duplication, without the need for complete sequence information.
The genome can be scanned for epigenetic changes. And finally, alterations in
levels of gene expression can be systematically determined by analysis of mRNAs
(see Figure 7-3). These approaches generally involve comparing cancer cells with
normal controls—ideally, noncancerous cells originating in the same tissue and
from the same patient.

Figure 20-22 The pathways leading to
loss of tumor suppressor gene function
in cancer involve both genetic and
epigenetic changes. (A) As indicated, the
changes that silence tumor suppressor
genes can occur in any order. Both DNA
methylation and the packaging of a gene
into condensed chromatin can prevent
its expression in a way that is inherited
when a cell divides (see Figure 4-44).
(B) The frequency of gene silencing by
hypermethylation observed in four different
types of cancer. The five genes listed at the
top can all function as tumor suppressor
genes; BRCA1 and hMLH1 affect genome
stability and are in the subclass known as
genome maintenance genes. ND, no data.
(Adapted from M. Esteller et al., Cancer
Res. 61:3225-3229, 2001.)

Figure 20-23 The distinct types of DNA
sequence changes found in oncogenes
compared to tumor suppressor
genes. In this diagram, mutations that
change an amino acid are denoted by
blue arrowheads, whereas mutations
that truncate the polypeptide chain are
marked by yellow arrowheads. (A) As in
this example, oncogene mutations can
be detected by the fact that the same
nucleotide change is repeatedly found
among the missense mutations in a
gene. (B) For tumor suppressor genes, by
contrast, nonsense mutations that abort
protein synthesis by creating stop codons
predominate. (Adapted from B. Vogelstein
et al., Science 339:1546-1558, 2013.)
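
The contrast drawn in Figure 20-23 suggests a simple heuristic for sorting genes by their mutation pattern. The Python sketch below is an illustration only: the `Mutation` record, the `classify_gene` function, and both 0.2 cutoffs are assumptions introduced here, not values taken from the text.

```python
from collections import Counter
from dataclasses import dataclass

@dataclass(frozen=True)
class Mutation:
    position: int     # codon position in the protein
    truncating: bool  # True if the change creates a stop codon

def classify_gene(mutations, recurrent_cutoff=0.2, truncating_cutoff=0.2):
    """Guess whether a gene's mutation pattern resembles panel (A) or (B)."""
    if not mutations:
        return "unclassified"
    total = len(mutations)
    truncating = sum(m.truncating for m in mutations)
    # Count amino acid changes at positions hit in more than one tumor.
    missense_positions = Counter(m.position for m in mutations if not m.truncating)
    recurrent = sum(n for n in missense_positions.values() if n > 1)
    if truncating / total >= truncating_cutoff:
        return "tumor-suppressor-like"
    if recurrent / total >= recurrent_cutoff:
        return "oncogene-like"
    return "unclassified"

# Toy data: changes piled up at one codon versus scattered truncations.
hotspot_gene = [Mutation(12, False)] * 8 + [Mutation(61, False), Mutation(146, False)]
scattered_gene = [Mutation(p, True) for p in (35, 120, 210, 388, 512)] + [Mutation(77, False)]

print(classify_gene(hotspot_gene))
print(classify_gene(scattered_gene))
```

A gene whose mutations cluster at a single position is labeled oncogene-like, and one dominated by scattered truncating changes is labeled tumor-suppressor-like. Real analyses would also need to account for gene length and for the background mutation rate.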

Many Cancers Have an Extraordinarily Disrupted Genome

Cancer genome analysis reveals, first of all, the scale of gross genetic disruption
in cancer cells. This varies greatly from one type of cancer and one cancer patient
to another, both in severity and in character. In some cases, the karyotype—the
set of chromosomes as they appear at mitosis—is normal or nearly so, but many
point mutations are detected in individual genes, suggesting a failure of the
repair mechanisms that normally correct local errors in the replication or maintenance
of DNA sequences. Often, however, the karyotype is severely disordered,
with many chromosome breaks and rearrangements. In some breast cancers, for
example, genome sequencing reveals an astonishing scene of genetic chaos (Figure
20-24), with hundreds of chromosome breaks and translocations, resulting in
many deletions, duplications, and amplifications of parts of the genome. In such
cells, the normal machinery for avoidance or repair of DNA double-strand breaks
is evidently somehow defective, destabilizing the genome by giving rise to broken
chromosomes whose fragments then rejoin in random combinations. From the
pattern of changes, one can infer that this disruptive process has occurred repeatedly
during the evolution of the tumor, with a progressive increase of genetic disorder.
Breast cancers showing the most extreme chromosome disorder are usually
hard to treat and have a gloomy prognosis.

One survey of more than 3000 individual cancer specimens showed that on
average 24 separate blocks of genetic material were duplicated in each tumor,
amounting to 17% of the normal genome, and 18 blocks were deleted, amounting
to 16% of the normal genome. Many of these changes were found repeatedly,
suggesting that they contain cancer-critical genes whose loss (tumor suppressor
genes) or gain (oncogenes) confers a selective advantage.

Whole-genome analysis also helps to explain some cancers that seem, at first
sight, to be exceptions to the general rules. An example is retinoblastoma, with
its early onset during childhood. If cancers in general require an accumulation of
many genetic changes and are thus diseases of old age, what makes retinoblastoma
different? Whole-genome sequencing confirms that in retinoblastoma, the
tumor cells contain loss-of-function mutations in the Rb gene; but, astonishingly,
they contain practically no mutations or genome rearrangements that affect any
other oncogene or tumor suppressor gene. Instead, they contain many epigenetic
modifications, which alter the level of expression of many known cancer-critical
genes—as many as 15 in one well-analyzed case.

Many Mutations in Tumor Cells are Merely Passengers 

Cancer cells generally contain many mutations in addition to gross chromosome
abnormalities: point mutations can be scattered over the genome as a whole at
a rate of about one per million nucleotide pairs, in addition to the abnormalities
attributed to chromosome breakage and rejoining. Systematic surveys of the protein-coding
genes in common solid tumors—such as those of the breast, colon,
brain, or pancreas—have revealed that an average of 33 to 66 genes have undergone
somatic mutation affecting the sequence of their protein product. Mutations
in noncoding regions of the genome are much more numerous, as one would
expect from the much larger fraction of the genome that noncoding DNA represents.
But they are considerably more difficult to interpret.

Figure 20-24 The chromosomal
rearrangements in breast cancer
cells. The results of an extensive DNA
sequencing analysis performed on two
different primary tumors are displayed
as “Circos plots.” In each plot, the
reference DNA sequences of the 22
autosomes and single sex chromosome
(X) of a normal human female (3.2 billion
nucleotide pairs) are aligned end-to-end
to form a circle. Colored lines within
the circle are then used to indicate the
chromosome alterations found in the
particular primary tumor. As indicated,
purple lines connect sites at which two
different chromosomes have become
joined to create an interchromosomal
rearrangement, while green lines connect
the sites of rearrangements found within a
single chromosome. The intrachromosomal
rearrangements can be seen to
predominate, and most join neighboring
sections of DNA that were originally located
within 2 million nucleotide pairs of each
other. The increases in copy number,
shown in blue, reveal the amplified DNA
sequences (see the highly amplified regions
indicated). (Adapted from P.J. Stephens et
al., Nature 462:1005-1010, 2009.)

The high frequency of mutations testifies to the genetic instability of many
cancer cells, but it leaves us with a difficult problem. How can we discover which
of the mutations are drivers of cancer—that is, causal factors in the development
of the disease—and which are merely passengers—mutations that happen to
have occurred in the same cell as the driver mutations, thanks to genetic instability,
but are irrelevant to the development of the disease? A simple criterion is
based on frequency of occurrence. Driver mutations affecting a gene that plays a
part in the disease will be seen repeatedly, in many different patients. In contrast,
passenger mutations, occurring at more-or-less random locations in the genome
and conferring no selective advantage on the cancer cell, are unlikely to be found
in the same genes in different patients.

Figure 20-25 shows the results of an analysis of this sort for a large sample
of colorectal cancers. The different sites in the genome are laid out on a two-dimensional
array, with chromosome serial number along one axis and position
within each chromosome along the other. The frequency with which mutations
are encountered is shown by height above this plane, creating a mutation “landscape”
with mountains (sites where mutations are found in a large proportion of
the tumors in the sample), hills (where mutations are found less frequently but
still more often than would be expected for a random scattering over the genome),
and hillocks (sites of occasional mutations, occurring at a frequency no higher
than would be expected for mutations scattered at random in each individual
tumor). The mountains and the hills are strong candidates to be the sites of driver
mutations—in other words, sites of cancer-critical genes; the hillocks are likely to
correspond to passengers. Indeed, many of the mountains and hills turn out to be
sites of known oncogenes or tumor suppressor genes, whereas the hillocks mostly
correspond to genes that have no known or probable role in causation of cancer.
Of course, some hillocks may correspond to genes that are mutated in only a few
rare patients but are nevertheless cancer-critical for them.
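
The frequency criterion can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function names, the uniform `background_rate`, the `alpha` threshold, and the placeholder genes `GENE_A` to `GENE_E` are all hypothetical, and the toy sample is invented rather than taken from the study shown in Figure 20-25.

```python
from collections import Counter
from math import comb

def binomial_tail(n: int, k: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

def candidate_drivers(tumors, background_rate, alpha=0.01):
    """Return genes mutated in more tumors than random scattering predicts.

    tumors: list of sets, each holding the genes mutated in one tumor.
    background_rate: assumed chance that any given gene carries a passenger
        mutation in one tumor (hypothetical, and the same for every gene).
    """
    counts = Counter(gene for tumor in tumors for gene in tumor)
    n = len(tumors)
    return {
        gene: k
        for gene, k in counts.items()
        if binomial_tail(n, k, background_rate) < alpha
    }

# Toy sample of six tumors (invented data).
tumors = [
    {"APC", "K-Ras", "GENE_A"},
    {"APC", "p53", "GENE_B"},
    {"APC", "K-Ras", "p53"},
    {"APC", "GENE_C"},
    {"K-Ras", "p53", "GENE_D"},
    {"APC", "p53", "GENE_E"},
]

print(candidate_drivers(tumors, background_rate=0.05))
```

Genes that recur across tumors (the mountains and hills) pass the threshold, while genes seen once (the hillocks) do not. A realistic analysis would also correct for gene length, local mutation rate, and the number of genes tested.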

About One Percent of the Genes in the Human Genome Are 
Cancer-Critical 

From studies such as the one just described, it is estimated that the number of
driver mutations for an individual case of cancer (the sum of meaningful epigenetic
and genetic changes in both coding sequences and regulatory regions) is typically
on the order of 10, explaining why cancer progression generally involves an
increase in genetic and/or epigenetic instability that enhances the rate of such
changes.

Figure 20-25 The mutation landscape in
colorectal cancer. In this two-dimensional
representation of the human genome,
the green surface depicts the 22 human
autosomes plus the X sex chromosome
as being laid out side-by-side in numerical
order from left to right, with the DNA
sequence of each chromosome running
from back to front. The mountains
represent the locations of genes
mutated with high frequency in different,
independent tumors. As indicated, these
are suspected driver mutations in the
adenomatous polyposis coli (APC), K-Ras,
p53, phosphoinositide 3-kinase (PIK3CA),
and ubiquitin ligase (FBXW7) proteins.
(Adapted from L.D. Wood et al., Science
318:1108-1113, 2007.)

By compiling the data for different types of cancer, each with its own range
of identified driver mutations, we can develop a comprehensive catalog of genes
that are strongly suspected to be cancer-critical. Current estimates put the total
number of such genes at about 300, about 1% of the genes in the human genome.
These cancer-critical genes are amazingly diverse. Their products include
secreted signal proteins, transmembrane receptors, GTP-binding proteins, protein
kinases, transcription regulators, chromatin modifiers, DNA repair enzymes,
cell-cell adhesion molecules, cell-cycle controllers, apoptosis regulators, scaffold
proteins, metabolic enzymes, components of the RNA splicing machinery, and
more besides. All these are susceptible to mutations that can contribute, in one
way or another, in one tissue or another, to the evolution of cells with the cancerous
properties that we listed earlier on page 1103.

Clearly, the molecular changes that cause cancer are complex. As we now 
explain, however, the complexity is not quite as daunting as it may initially seem. 

Disruptions in a Handful of Key Pathways Are Common to Many 
Cancers 

Some genes, like Rb and Ras, are mutated in many cases of cancer and in cancers 
of many different types. The involvement of genes such as Rb and Ras in cancer 
is no surprise, now that we understand their normal functions: they control fun- 
damental processes of cell division and growth. But even these common culprits 
feature in considerably less than half of individual cases. What is happening to 
the control of these processes in the many cases of cancer where, for example, Rb 
is intact or Ras is not mutated? What part do mutations in the hundreds of other 
cancer-critical genes play in the development of the disease? With our increas- 
ing knowledge of the normal functions of the genes in the human genome, it is 
becoming easier to see patterns in the cataloged driver mutations and to give 
some simplifying answers to these questions. 

Glioblastoma—the commonest type of human brain tumor—provides a good
example. Analysis of the genomes of tumor cells from 91 patients identified a total
of at least 79 genes that were mutated in more than one individual. The normal
functions of most of these genes were known or could be guessed, allowing them
to be assigned to specific biochemical or regulatory pathways. Three functional
groupings stood out, accounting for a total of 21 of the recurrently mutated genes.
One of these groupings consisted of genes in the Rb pathway (that is, Rb itself,
along with genes that directly regulate Rb); this pathway governs initiation of the
cell-division cycle. Another consisted of genes in the same regulatory subnetwork
as Ras—a more loosely defined system of genes referred to as the RTK/Ras/PI3K
pathway, after three of its core components; this pathway serves to transmit signals
for cell growth and cell division from the cell exterior into the heart of the
cell. The third grouping consisted of genes in a pathway regulating responses to
stress and DNA damage—the p53 pathway. We shall have more to say about each
of these pathways below.

Out of all tumors, 74% had identifiable mutations in all three pathways. If one
were to trace these three pathways further upstream and include all the components,
known and unknown, on which they depend, this percentage would
almost certainly be even higher. In other words, in almost every case of glioblastoma,
there are mutations that disrupt each of three fundamental controls: the
control of cell growth, the control of cell division, and the control of responses to
stress and DNA damage.

Strikingly, in any given tumor-cell clone, there is a strong tendency for no more
than one gene to be mutated in each pathway. Evidently, what matters for tumor
evolution is the disruption of the control mechanism, and not the genetic means
by which that is achieved. Thus, for example, in a patient whose tumor cells have
no mutation in Rb itself, there is generally a mutation in some other component
of the Rb pathway, producing a similar biological effect.
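
This pathway-level view can be sketched as a simple bookkeeping exercise in Python. The sketch is illustrative only: the pathway membership shown is deliberately simplified, `Rb_regulator_X` and `p53_regulator_Y` are hypothetical placeholder names, and the four tumors are invented data rather than results from the glioblastoma study.

```python
# Simplified, partly hypothetical pathway membership.
PATHWAYS = {
    "Rb": {"Rb", "Rb_regulator_X"},
    "RTK/Ras/PI3K": {"Ras", "EGFR", "PTEN"},
    "p53": {"p53", "p53_regulator_Y"},
}

def pathways_hit(mutated_genes):
    """Map each disrupted pathway to the mutated genes that belong to it."""
    hits = {}
    for name, members in PATHWAYS.items():
        found = members & set(mutated_genes)
        if found:
            hits[name] = found
    return hits

def fraction_with_all_pathways(tumors):
    """Fraction of tumors carrying at least one mutation in every pathway."""
    complete = sum(len(pathways_hit(t)) == len(PATHWAYS) for t in tumors)
    return complete / len(tumors)

# Invented tumors: different genes, but often the same set of pathways.
tumors = [
    {"Rb", "Ras", "p53"},
    {"Rb_regulator_X", "PTEN", "p53"},
    {"Rb", "EGFR", "p53_regulator_Y"},
    {"Ras", "p53"},
]

for tumor in tumors:
    print(sorted(tumor), "->", sorted(pathways_hit(tumor)))
print(fraction_with_all_pathways(tumors))
```

Grouping mutations by pathway rather than by gene makes the first three toy tumors look alike even though no two of them share the same set of mutated genes.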

Similar patterns are seen in other types of cancers. A survey of many specimens
of the major variety of ovarian cancer, for example, identified 67% of patients as
having mutations in the Rb pathway, 45% in the Ras/PI3K pathway (defined more
narrowly than in the glioblastoma study), and more than 96% in the p53 pathway.
Allowing for additional pathway components not included in the analysis,
it seems that most cases of this type of cancer, too, have mutations disrupting the
same three controls, leading to misregulated cell growth, misregulated cell proliferation,
and abnormal disregard of stress and DNA damage. It seems that these
three fundamental controls are subverted in one way or another in virtually every
type of cancer.

We have devoted an entire chapter to the cell cycle and growth controls (Chapter
17). Some important details of the other two control pathways are reviewed
next.


Mutations in the PI3K/Akt/mTOR Pathway Drive Cancer Cells to 
Grow 

Cell proliferation is not simply a matter of progression through the cell cycle; it
also requires cell growth, which involves complex anabolic processes through
which the cell synthesizes all the necessary macromolecules from small-molecule
precursors. If a cell divides inappropriately without growing first, it will get
smaller at each division and will ultimately die or become too small to divide.
Cells appear to require two separate signals to grow and divide (Figure 20-26).
Cancer depends, therefore, not only on a loss of restraints on cell-cycle progression,
but also on disrupted control of cell growth.
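
The consequence of dividing without growing can be illustrated with a short calculation. In the Python sketch below, volumes are in arbitrary units, and the starting volume, the minimum viable volume, and the cap of 1000 divisions are hypothetical values chosen only for illustration.

```python
def divisions_until_too_small(start_volume, min_volume, growth_factor=1.0):
    """Count divisions before a daughter cell drops below min_volume.

    growth_factor is the fold increase in volume between divisions:
    2.0 means the cell doubles before dividing, 1.0 means no growth.
    Returns None if the cell never becomes too small within the cap.
    """
    volume = start_volume
    for n in range(1, 1001):  # arbitrary cap on the number of divisions
        volume = volume * growth_factor / 2.0  # grow, then split in two
        if volume < min_volume:
            return n
    return None

# No growth: each division halves the volume.
print(divisions_until_too_small(1.0, 0.1))
# Balanced growth: the cell doubles before each division.
print(divisions_until_too_small(1.0, 0.1, growth_factor=2.0))
```

Without growth, the cell falls below one-tenth of its starting volume after only four divisions, whereas a cell that doubles between divisions keeps its size indefinitely.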

The phosphoinositide 3-kinase (PI 3-kinase)/Alct/mTOR intracellular signal- 
ing pathway is critical for cell growth control. As described in Chapter 15, various 
extracellular signal proteins, including insulin and insulin-like growth factors, 




Figure 20-26 Cells seem to require two types of signals to proliferate. (A) In order to multiply successfully, most normal cells are suspected to
require both extracellular signals that drive cell-cycle progression (shown here as blue mitogen) and extracellular signals that drive cell growth (shown
here as red growth factor). How mitogens activate the Rb pathway to drive entry into the cell cycle is described in Figure 17-61. (B) Diagram of
the signaling system containing Akt that drives cell growth through greatly stimulating glucose uptake and utilization, including a conversion of the
excess citric acid produced from sugar intermediates in mitochondria into the acetyl CoA that is needed in the cytosol for lipid synthesis and new
membrane production. As indicated, protein synthesis is also increased. This system becomes abnormally activated early in tumor progression. TCA
cycle indicates the tricarboxylic acid cycle (citric acid cycle).
normally activate this pathway. In cancer cells, however, the pathway is activated
by mutation so that the cell can grow in the absence of such signals. The resulting
abnormal activation of the protein kinases Akt and mTOR not only stimulates
protein synthesis (see Figure 17-64), but also greatly increases both glucose uptake
and the production of the acetyl CoA in the cytosol required for cell lipid
synthesis, as outlined in Figure 20-26B.

The abnormal activation of the PI 3-kinase/Akt/mTOR pathway, which normally
occurs early in the process of tumor progression, helps to explain the excessive
rate of glycolysis that is observed in tumor cells, known as the Warburg effect,
as discussed earlier (see Figure 20-12). As expected from our previous discussion,
cancers can activate this pathway in many different ways. Thus, for example, a
growth factor receptor can become abnormally activated, as in Figure 20-19. Also
very common in cancers is the loss of the PTEN phosphatase, an enzyme that normally
suppresses the PI 3-kinase/Akt/mTOR pathway by dephosphorylating the
PI(3,4,5)P3 molecules that the PI 3-kinase forms (see pp. 859-861). PTEN is thus
a common tumor-suppressor gene.

Of course, mutation is not the only way to overactivate the pathway: high levels
of insulin in the circulation can have a similar effect. This may explain why the
risk of cancer is significantly increased, by a factor of two or more, in people who
are obese or have type 2 diabetes. Their insulin levels are abnormally high,
driving cancer cell growth without need of mutation in the PI 3-kinase/Akt/mTOR
pathway.

Mutations in the p53 Pathway Enable Cancer Cells to Survive and 
Proliferate Despite Stress and DNA Damage 

That cancer cells must break the normal rules governing cell growth and cell
division is obvious: that is part of the definition of cancer. It is not so obvious why
cancer cells should also be abnormal in their response to stress and DNA damage,
and yet this too is an almost universal feature. The gene that lies at the center
of this response, the p53 gene, is mutated in about 50% of all cases of cancer—
a higher proportion than for any other known cancer-critical gene. When we
include with p53 the other genes that are closely involved in its function, we find
that most cases of cancer harbor mutations in the p53 pathway. Why should this
be? To answer, we must first consider the normal function of this pathway.

In contrast to Rb, most cells in the body have very little p53 protein under normal
conditions: although the protein is synthesized, it is rapidly degraded. Moreover,
p53 is not essential for normal development. Mice in which both copies of
the gene have been deleted or inactivated typically appear normal in all respects
except one—they universally develop cancer before 10 months of age. These
observations suggest that p53 has a function that is required only in special
circumstances. In fact, cells raise their concentration of p53 protein in response to
a whole range of conditions that have only one obvious thing in common: they
are, from the cell's point of view, pathological, putting the cell in danger of death
or serious injury. These conditions include DNA damage, putting the cell at risk
from a faulty genome; telomere loss or shortening (see p. 1016), also dangerous
to the integrity of the genome; hypoxia, depriving the cell of the oxygen it needs
to keep its metabolism going; osmotic stress, causing the cell to swell or shrivel;
and oxidative stress, generating dangerous levels of highly reactive free radicals.

Yet another form of stress that can activate the p53 pathway arises, it seems,
when regulatory signals are so intense or uncoordinated as to drive the cell
beyond its normal limits and into a danger zone where its mechanisms of control
and coordination break down, as in an engine driven badly or too fast. The p53
concentration rises, for example, when Myc is overexpressed to oncogenic levels.

All these circumstances call for desperate action, which may take either of two
forms: the cell can block any further progress through the division cycle in order
to take time out to repair or recover from the pathological condition; or it can
accept that it must die, and do so in a way that minimizes damage to the organism.
A good death, from this point of view, is a death by apoptosis. In apoptosis,
Figure 20-27 Modes of action of the p53 tumor suppressor. The p53 protein
is a cellular stress sensor. In response to hyperproliferative signals, DNA damage,
hypoxia, telomere shortening, and various other stresses, the p53 levels in the cell
rise. As indicated, this may either arrest cell cycling in a way that allows the cell to
adjust and survive, trigger cell suicide by apoptosis, or cause cell “senescence”—an
irreversible cell-cycle arrest that stops damaged cells from dividing.


the cell is phagocytosed by its neighbors and its contents are efficiently recycled. 
A bad death is a death by necrosis. In necrosis, the cell bursts or disintegrates and 
its contents are spilled into the extracellular space, inducing inflammation. 

The p53 pathway, therefore, behaves as a sort of antenna, sensing the presence
of a wide range of dangerous conditions, and when any are detected, triggering
appropriate action—either a temporary or permanent arrest of cell cycling
(senescence), or suicide by apoptosis (Figure 20-27). These responses serve to
prevent deranged cells from proliferating. Cancer cells are indeed generally
deranged, and their survival and proliferation thus depend on inactivation of the
p53 pathway. If the p53 pathway were active in them, they would be halted in their
tracks or die (Movie 20.4).

The p53 protein performs its job mainly by acting as a transcription regulator
(see Movie 17.8). Indeed, the most common mutations observed in p53 in human
tumors are in its DNA-binding domain, where they cripple the ability of p53 to
bind to its DNA target sequences. Because p53 binds to DNA as a tetramer, a single
mutant subunit within a tetrameric complex can be enough to block its function.
Thus, mutations in p53 can have a dominant negative effect, causing loss of
p53 function even when the cell also contains a wild-type version of the gene. For
this reason, in contrast with other tumor suppressor genes such as Rb, the development
of cancer does not always require that both copies of p53 be knocked out.
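
A rough calculation shows why one mutant allele can have such a large effect. The sketch below is illustrative only and rests on assumptions that are not stated in the text: the mutant and wild-type alleles are expressed equally, subunits assemble into tetramers at random, and a single mutant subunit fully inactivates a tetramer. The function name `tetramer_fractions` is made up for this example.

```python
from math import comb


def tetramer_fractions(mutant_fraction, subunits=4):
    """Probability that a complex contains k mutant subunits, for k = 0..subunits.

    Assumes random, independent assembly (a binomial model), which is a
    simplification used here only for illustration.
    """
    q = mutant_fraction
    return [
        comb(subunits, k) * q**k * (1 - q) ** (subunits - k)
        for k in range(subunits + 1)
    ]


# Heterozygous cell: half of the p53 subunits are assumed to be mutant.
fractions = tetramer_fractions(0.5)
all_wild_type = fractions[0]  # (1/2)**4 = 0.0625
print(f"Tetramers with no mutant subunit: {all_wild_type:.4f}")
```

Under these assumptions only about 6% of tetramers would consist entirely of wild-type subunits, so most p53 activity would be lost even though one good copy of the gene remains.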

As discussed in Chapter 17, the p53 protein exerts its inhibitory effects on the
cell cycle, in part at least, by inducing the transcription of p21, which encodes a
protein that binds to and inhibits the cyclin-dependent kinase (Cdk) complexes
required for progression through the cell cycle. By blocking the kinase activity of
these Cdk complexes, the p21 protein prevents the cell from progressing through
S phase and replicating its DNA.

The mechanism by which p53 induces apoptosis includes stimulation of the 
expression of many pro-apoptotic genes, and it will be described in Chapter 18. 

Genome Instability Takes Different Forms in Different Cancers 

If the p53 pathway is functional, a cell with unrepaired DNA damage will stop
dividing or die; it cannot proliferate. Mutations in the p53 pathway are, therefore,
generally present in cancer cells showing genome instability—which is to say,
the majority. But how does this genome instability originate? Here too, cancer
genome studies are illuminating.

In ovarian cancers, for example, chromosome breaks, translocations, and
deletions are very common, and these aberrations correlate with a high frequency
of mutations and epigenetic silencing in the genes needed for repair of DNA
double-strand breaks by homologous recombination, especially Brca1 and Brca2 (see
pp. 281-282). In a subset of colorectal cancers with DNA mismatch repair defects,
on the other hand, one instead finds many point mutations scattered throughout
the genome (see pp. 250-251). In both kinds of cancer, the genome is commonly
destabilized, but different types of mutations can bring this about.
Cancers of Specialized Tissues Use Many Different Routes to
Target the Common Core Pathways of Cancer 

Mutations in core components of the machinery that regulates cell growth,
division, and survival, such as Rb, Ras, PTEN, or p53, are not the only way to pervert
the control of these processes. Specialized tissues depend on a variety of pathways,
as discussed in Chapter 15, to relay environmental signals to the core control
machinery, and each pathway lays the cells open to subversion in a different
set of ways. Thus, in different cancers, we can find examples of driver mutations
in practically all the major signaling pathways through which cells communicate
during development and tissue maintenance (discussed in Chapters 21 and 22).

In glioblastoma, for example, most patients have mutations in one or other of
a set of cell-surface receptor tyrosine kinases, especially the EGF receptor
mentioned earlier (linking into the Ras/PI3K pathway), suggesting that the cells from
which the cancer originates are normally controlled by this route. The cells of the
prostate gland, on the other hand, respond to the androgen hormone testosterone,
and in prostate cancer, components of the androgen receptor signaling pathway
(a variety of nuclear hormone receptor signaling; see Chapter 15) are often
mutated. In the normal gut lining, Wnt signaling is critical, and Wnt pathway
mutations are present in most colorectal cancers. Pancreatic cancers generally
have mutations in the transforming growth factor-β (TGFβ) signaling pathway.
Activating mutations in the Notch pathway are present in more than 50% of T cell
acute lymphocytic leukemias, and so on.

Cells are generally regulated by several different types of external signals that
must act in combination, representing a “fail-safe” control mechanism that protects
the organism as a whole from cancer. These signals are different in different
tissues. As expected, therefore, the corresponding cancers often have mutations
in several signaling pathways concurrently. This is true of the examples we have
just listed, which commonly have mutations in other signaling pathways in addition
to the ones that we have singled out.

Studies Using Mice Help to Define the Functions of Cancer-Critical 
Genes 

The ultimate test of a gene's role in cancer has to come from investigations in the
intact, mature organism. The most favored organism for such studies, apart from
humans themselves, is the mouse. To explore the function of a candidate oncogene
or tumor suppressor gene, one can make a transgenic mouse that overexpresses
it or a knockout mouse that lacks it. Using the techniques described in
Chapter 8, one can engineer mice in which the misexpression or deletion of the
gene is restricted to a specific set of cells, or in which expression of the gene can
be switched on at will at a chosen point in time, or both, to see whether and how
tumors develop. Moreover, to follow the growth of tumors from day to day in the
living organism, the cells of interest can be genetically marked and made visible
by expression of a fluorescent or luminescent reporter (Figure 20-28). In these
ways, one can begin to clarify the part that each cancer-critical gene plays in cancer
initiation or progression.
Figure 20-28 Monitoring tumor growth and metastasis in a mouse with a
luminescent reporter. A mouse was genetically engineered in a way that allows
both copies of its PTEN tumor suppressor gene to be inactivated in the prostate
gland, simultaneously with the prostate-specific activation of a gene engineered
to produce the enzyme luciferase (derived from fireflies). After an injection of luciferin
(the substrate molecule for luciferase) into the mouse’s bloodstream, the cells in the
prostate emit light and can be detected by their bioluminescence in a live mouse,
as seen in the 67-day-old animal at the left. Cells lacking the PTEN phosphatase
enzyme contain elevated amounts of the Akt activator, PI(3,4,5)P3, and this causes
the prostate cells to proliferate abnormally, progressing over time to form a cancer. In
this way, the process of metastasis could be followed in the same animal over the
course of a year. The light intensity in these experiments is proportional to the number
of prostate-cell descendants, increasing from light blue to green, to yellow, to red
in this representation. (Adapted from C.-P. Liao et al., Cancer Res. 67:7525-7533, 2007.)
Figure 20-29 Oncogene collaboration in transgenic mice. The graphs
show the incidence of tumors in three types of transgenic mouse strains, one
carrying a Myc oncogene, one carrying a Ras oncogene, and one carrying
both oncogenes. For these experiments, two lines of transgenic mice were
first generated. One carries an inserted copy of an oncogene created by
fusing the proto-oncogene Myc with the mouse mammary tumor virus
regulatory DNA (which then drives Myc overexpression in the mammary
gland). The other line carries an inserted copy of the Ras oncogene under
control of the same regulatory element. Both strains of mice develop tumors
much more frequently than normal, most often in the mammary or salivary
glands. Mice that carry both oncogenes together are obtained by crossing
the two strains. These hybrids develop tumors at a far higher rate still,
much greater than the sum of the rates for the two oncogenes separately.
Nevertheless, the tumors arise only after a delay and only from a small
proportion of the cells in the tissues where the two genes are expressed.
Further accidental changes, in addition to the two oncogenes, are apparently
required for the development of cancer. (After E. Sinn et al., Cell 49:465-475,
1987. With permission from Elsevier.)



Transgenic mouse studies confirm, for example, that a single oncogene is generally
not enough to turn a normal cell into a cancer cell. Thus, in mice engineered
to express a Myc or Ras oncogenic transgene, some of the tissues that express the
oncogene may show enhanced cell proliferation, and, over time, occasional cells
will undergo further changes to give rise to cancers. Most cells expressing the
oncogene, however, do not give rise to cancers. Nevertheless, from the point of
view of the whole animal, the inherited oncogene is a serious menace because
it creates a high risk that a cancer will arise somewhere in the body. Mice that
express both Myc and Ras oncogenes (bred by mating a transgenic mouse carrying
a Myc oncogene with one carrying a Ras oncogene) develop cancers earlier
and at a much higher rate than either parental strain (Figure 20-29); but, again,
the cancers originate as scattered, isolated tumors among noncancerous cells.
Thus, even cells expressing these two oncogenes must undergo further, randomly
generated changes to become cancerous. This strongly suggests that multiple
mutations are required for tumorigenesis, as supported by a great deal of other
evidence discussed earlier. Experiments using mice with deletions of tumor
suppressor genes lead to similar conclusions.

Cancers Become More and More Heterogeneous as They 
Progress 

From simple histology, looking at stained tissue sections, it is clear that some
tumors contain distinct sectors, all clearly cancerous, but differing in appearance
because they differ genetically: the cancer cell population is heterogeneous.
Evidently, within the initial clone of cancerous cells, additional mutations have
arisen and thrived, creating diverse subclones. Today, the ability to analyze cancer
genomes lets us look much deeper into the process.

One approach involves taking samples from different regions of a primary
tumor and from the metastases that it has spawned. With modern methods, it is
even possible to take representative single cells and analyze their genomes. Such
studies reveal a classic picture of Darwinian evolution, occurring on a time scale
of months or years rather than millions of years, but governed by the same rules of
natural selection (Figure 20-30).

One such investigation compared the genomes of 100 individual cells from different
regions of a primary tumor of the breast. A large fraction—just over half—of
the chosen cells was genetically normal or nearly so: these were connective-tissue
cells and other cell types, such as those of the immune system, that were mixed
up with the cancer cells. The cancer cells themselves were distinguished by their
severely disrupted genomes. The detailed pattern of gene deletions and amplifications
in each such cell revealed how closely it was related to the others, and
from these data one could draw up a family tree (Figure 20-30B). In this case, three
main branches of the tree were seen; that is, the cancer consisted of three major
subclones. From the shared abnormalities, one could deduce that their last common
ancestor—the presumed founder of the cancer—was already very different
from a normal cell, but that the first split between branches occurred early, when
the tumor was small. This was followed by a large amount of additional change
within each branch. A hint of the future could be seen in the smallest of the three
major subclones: its cells were distinguished by a massive amplification of a Ras
oncogene. Given more time, perhaps they would have out-competed the other
cancer cells and taken over the whole tumor.
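
The idea of grouping cells by their shared deletions and amplifications can be illustrated with a toy calculation. The sketch below is not the method used in the study described above; it is a minimal example under stated assumptions. The copy-number profiles are invented, the distance measure (sum of absolute copy-number differences) and the `threshold` are arbitrary choices, and the names `distance` and `group_cells` are hypothetical.

```python
from itertools import combinations

# Invented copy-number profiles: one integer per genomic segment (2 = normal).
profiles = {
    "cell_1": [2, 2, 2, 2, 2, 2],  # genetically normal (e.g., a stromal cell)
    "cell_2": [2, 2, 2, 2, 2, 2],
    "cell_3": [1, 4, 2, 2, 3, 2],  # first cancer subclone
    "cell_4": [1, 4, 2, 2, 3, 3],
    "cell_5": [1, 4, 6, 2, 1, 2],  # second subclone, with a large amplification
    "cell_6": [1, 4, 6, 2, 1, 1],
}


def distance(a, b):
    """Sum of absolute copy-number differences across segments."""
    return sum(abs(x - y) for x, y in zip(a, b))


def group_cells(profiles, threshold):
    """Single-linkage grouping: cells closer than `threshold` share a group."""
    parent = {name: name for name in profiles}

    def find(name):
        while parent[name] != name:
            parent[name] = parent[parent[name]]  # path compression
            name = parent[name]
        return name

    for a, b in combinations(profiles, 2):
        if distance(profiles[a], profiles[b]) <= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in profiles:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())


groups = group_cells(profiles, threshold=1)
for group in groups:
    print(group)
```

In this toy data set the two cancer subclones share the changes in the first two segments, which is the kind of shared abnormality from which a common ancestor can be inferred; a real analysis would build a full tree rather than flat groups.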

Similar results have been obtained with other cancers. Clearly, cancer cells are
constantly mutating, multiplying, competing, evolving, and diversifying as they
exploit new ecological niches and react to the treatments that are used against
them (Figure 20-30C). Diversification accelerates as they metastasize and colonize
new territories, where they encounter new selection pressures. The longer
the evolutionary process continues, the harder it becomes to catch them all in the
same net and kill them.


Figure 20-30 How cancers progress as a series of subclones. (A) Schematic
illustration of the pattern of mutation and natural selection in a clone of tumor cells.
(B) A family tree of cancer cells sampled from different regions of a single breast
tumor, showing how the cells have evolved and diversified from a common ancestor,
the cancer founder cell. The genome of each of the indicated 100 cells from a
human breast tumor was sequenced to produce an evolutionary tree. About half
of these cells were normal cells from the stroma (blue cells). The red cells have
greatly amplified their K-Ras gene. Note that many subclones appear to have
died out, including the one that contained the founder cells for the three subclones
that survive. (C) A depiction of how driver mutations are thought to cause cancer
progression over long periods of time, before producing a large enough clone of
proliferating cells to be detected as a tumor. The data indicate that driver mutations
occur only rarely in a background of long-lived subclones of cells that continually
accumulate passenger mutations without gaining a growth advantage. (A, adapted
from M. Greaves, Semin. Cancer Biol. 20:65-70, 2010; B, adapted from N. Navin
et al., Nature 472:90-94, 2011; C, adapted from S. Nik-Zainal et al., Cell
149:994-1007, 2012.)


The Changes in Tumor Cells That Lead to Metastasis Are Still 
Largely a Mystery 

Perhaps the most significant gap in our understanding of cancer concerns
invasiveness and metastasis. For a start, it is not clear exactly what new properties a
cancer cell must acquire to become metastatic. In some cases, it is possible that
invasion and metastasis require no further genetic changes beyond those needed
to violate the normal controls on cell growth, cell division, and cell death. On the
other hand, it may be that, for some cancers, metastasis requires a large number
of additional mutations and epigenetic changes. Clues are coming from comparisons
of the genomes of cells of primary tumors with the cells of metastases that
they have spawned. The results appear complex and variable from one cancer to
another. Nevertheless, some general principles have emerged.

As we discussed earlier, it is helpful to distinguish three phases of tumor progression
required for a carcinoma to metastasize (see Figure 20-16). First, the cells
must escape the normal confines of their parent epithelium and begin to invade
the tissue immediately beneath. Second, they must travel via the blood or lymph
to lodge in distant sites. Third, they must survive there and multiply. It is the first
and last steps in this sequence that are the most difficult to accomplish for most
cancers (Figure 20-31).

The first step, local invasiveness, requires a relaxation of the mechanisms that
normally hold epithelial cells together. As mentioned earlier, this step resembles
the normal developmental process known as the epithelial-mesenchymal transition
(EMT), in which epithelial cells undergo a shift in character, becoming less
adhesive and more migratory (discussed in Chapter 19). A key part of the EMT
process involves switching off expression of the E-cadherin gene. The primary
function of the transmembrane E-cadherin protein is in cell-cell adhesion, binding
epithelial cells together through adherens junctions (see Figure 19-13). In
some carcinomas of the stomach and of the breast, E-cadherin has been identified
as a tumor suppressor gene, and a loss of E-cadherin may promote cancer development
by facilitating local invasiveness.

The initial entry of tumor cells into the circulation is helped by the presence of
a dense supply of blood vessels and sometimes lymphatic vessels, which tumors
attract to themselves as they grow larger and become hypoxic in their interior.
This process, called angiogenesis, is caused by the secretion of angiogenic factors
that promote the growth of blood vessels, such as vascular endothelial growth
factor (VEGF; see Figure 22-26). An abnormal fragility and leakiness of the new
vessels that form may help the cells that have become invasive to enter and then
move through the circulation with relative ease.

The remaining steps in metastasis, involving exit from a blood or lymphatic
vessel and the effective colonization of remote sites, are much harder to study. To
discover which of the later steps in metastasis present cancer cells with the greatest
difficulties, one can label the cells with a fluorescent dye or green fluorescent
protein (GFP), inject them into the bloodstream of a mouse, and then monitor
their fate (Movie 20.5). In such experiments, one observes that many cells survive
in the circulation, lodge in small vessels, and exit into the surrounding tissue,
regardless of whether they come from a tumor that metastasizes or one that
does not. Some cells die immediately after they enter foreign tissue; others survive
entry into the foreign tissue but fail to proliferate. Still others divide a few times
and then stop, forming micrometastases containing ten to several thousand cells.
Very few establish full-blown metastases.

What, if anything, distinguishes the survivors from the failures? A clue may
come from the fact that in many types of tumors, the cancer cells show a kind
of heterogeneity that resembles the heterogeneity seen among the cells of those
normal tissues that renew themselves continually by a stem-cell strategy, as we
discuss next.


Figure 20-31 The barriers to metastasis. Studies of labeled tumor cells leaving a
tumor site, entering the circulation, and establishing metastases show which steps
in the metastatic process, outlined in Figure 20-16, are difficult or “inefficient,” in the
sense that they are steps in which large numbers of cells fail and are lost. It is in
these difficult steps that cells from highly metastatic tumors are observed to have
much greater success than cells from a nonmetastatic source. It seems that the
ability to escape from the parent tissue, and an ability to survive and grow in the foreign
tissue, are key properties that cells must acquire to become metastatic. (Adapted
from A.F. Chambers et al., Breast Cancer Res. 2:400-407, 2000. With permission
from BioMed Central Ltd.)


A Small Population of Cancer Stem Cells May Maintain Many 
Tumors 

Self-renewing tissues, where cell division continues throughout life, are the breeding
ground for the great majority of human cancers. They include the epidermis
(the outer epithelial layer of the skin), the lining of the digestive and reproductive
tracts, and the bone marrow, where blood cells are generated (see Chapter 22).
In almost all these tissues, renewal depends on the presence of stem cells, which
divide to give rise to terminally differentiated cells, which do not divide. This creates
a mixture of cells that are genetically identical and closely related by lineage,
but are in different states of differentiation. Many tumors seem likewise to consist
of cells in varied states of differentiation, with different capacities for cell division
and self-renewal.

To see the implications, it is helpful to consider how normal stem-cell systems
operate. When a normal stem cell divides, each daughter cell has a choice—it
can remain a stem cell, or it can commit to a pathway leading to differentiation.
A stem-cell daughter remains in place to generate more cells in the future. A
committed daughter typically undergoes some rounds of cell proliferation (as a
so-called transit amplifying cell) but then stops dividing, terminally differentiates,
and eventually is discarded and replaced (it may die by apoptosis, with recycling
of its materials, or be shed from the body). On average, the two fates—stem cell or
differentiating cell—normally occur with equal probability, so that half the daughters
of stem-cell divisions take the one path and half take the other. In a healthy
body, feedback controls regulate the process, adjusting this balance of cell-fate
choices to correct for any departure from the proper cell population numbers.
Thus, the number of stem cells remains approximately constant, and the terminally
differentiated cells are continually replaced at a steady rate. Because of the
divisions undergone by the transit amplifying cells, the stem cells may be vastly
outnumbered by the cells that are committed to terminal differentiation and have
lost the capacity for self-renewal. But the stem cells, though few and far between
and often relatively slowly dividing, carry the whole responsibility for maintenance
of the tissue in the long term.

Some cancers seem to be organized in a similar way: they consist of rare cancer
stem cells capable of dividing indefinitely, together with much larger numbers
of dividing transit amplifying cells that are derived from the cancer stem cells
but have a limited capacity for self-renewal (Figure 20-32). These non-stem cells
appear to constitute the great majority of the cell population in some tumors.


Figure 20-32 Cancer stem cells can be 
responsible for tumor growth and yet 
remain only a small part of the tumor- 
cell population. (A) How stem 
cells produce transit amplifying cells. 

(B) How a small proportion of cancer stem 
cells can maintain a tumor. Suppose, for 
example, that each daughter of a cancer 
stem cell has a probability slightly greater 
than 50% of retaining stem-cell potential 
and a probability slightly less than 50% of 
becoming a transit amplifying cell that is 
committed to a program of cell divisions 
that stops after 10 division cycles. While 
the number of cancer stem cells will 
increase slowly but steadily to give a 
growing tumor, the non-stem cells that they 
give rise to will always outnumber the stem 
cells by a large factor—in this example, by 
a factor of about 1000. (If the cell-division- 
cycle and survival times for the two classes 
of cells are equal.) 
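
The numbers in this example can be checked with a back-of-envelope calculation. The sketch below is only one simplified reading of the figure, with assumptions of its own: the value 0.51 stands in for "slightly greater than 50%", stem-cell numbers are treated as expected values rather than simulated cell by cell, and the function names are hypothetical.

```python
def descendants_of_committed_cell(division_cycles):
    """Cells produced when one committed cell completes the given number of divisions."""
    return 2**division_cycles


def expected_stem_cells(initial, p_stem, generations):
    """Expected stem-cell number after a number of rounds of stem-cell division.

    Each division yields two daughters, and each daughter is assumed to stay
    a stem cell with probability p_stem, so the expected number is multiplied
    by 2 * p_stem per round.
    """
    return initial * (2 * p_stem) ** generations


# One transit amplifying cell that stops after 10 division cycles.
print(descendants_of_committed_cell(10))  # 1024, i.e. a factor of about 1000

# With p_stem exactly 0.5 the stem-cell pool is steady; slightly above 0.5 it grows slowly.
print(expected_stem_cells(1, 0.50, 100))
print(round(expected_stem_cells(1, 0.51, 100), 2))
```

The first result shows where a factor of roughly 1000 can come from: ten doublings of each committed daughter. The second shows how a small bias toward self-renewal lets the stem-cell pool, and with it the tumor, expand slowly but steadily.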


The Cancer Stem-Cell Phenomenon Adds to the Difficulty of 
Curing Cancer 

Evidence for the cancer stem-cell phenomenon comes chiefly from experiments
in which individual cells from a cancer are tested for their ability to give rise to fresh
tumors: a standard assay is to implant the cells into an immunodeficient mouse
(Figure 20-33). It has been known for half a century that there is usually only a
small chance—typically much less than 1%—that a tumor cell chosen at random
and tested in this way will generate a new tumor. This by itself does not prove that
the tumor cells are heterogeneous: like seeds scattered on difficult ground, each
of them may have only a small chance of finding a spot where it can survive and
grow. Modern technologies for sorting cells have shown, however, that in some
cancers at least, the rate of success in founding new tumors is even lower than it
would otherwise be because the cancer cells are heterogeneous in their state of
differentiation, and only a small subset of them—the cancer stem cells—have the
special properties needed for tumor propagation. For example, in several types
of cancer, including breast cancers and leukemias, one can fractionate the tumor
cells using monoclonal antibodies that recognize a particular cell-surface marker
that is present on the normal stem cells in the tissue of origin of the cancer. The
purified cancer cells expressing this marker are found to have a greatly enhanced
ability to found new tumors. And the new tumors consist of mixtures of cells that
express the marker and cells that do not, all generated from the same founder cell
that expressed the marker.

Experiments with breast cancer cells have revealed that, instead of following a
rigid program from stem cell to transit amplifying cell to terminally differentiated
cell, these cancer cells can randomly switch to and fro—with a certain low transition
probability—between different states of differentiation that express different
molecular markers. In one state, they behave like stem cells, dividing slowly but
capable of founding new tumors; in other states, they behave like transit amplifying
cells, dividing rapidly but unable to found new tumors in a standard transplant
assay. But a single cell in any of these states—given time in culture, or a congenial
environment in the body—will give rise to a mixed population that includes all the
other states as well.
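
Random switching with fixed, low transition probabilities behaves like a simple Markov chain, and a small calculation shows why any starting state ends up as the same mixture. The sketch below is a deliberately reduced model, not a description of the experiments: it uses only two states, the switching probabilities in `TRANSITION` are invented, and it ignores the different division rates of the two states and any cell death.

```python
def step(distribution, transition):
    """Advance the state distribution by one round of random switching."""
    n = len(distribution)
    return [
        sum(distribution[i] * transition[i][j] for i in range(n))
        for j in range(n)
    ]


def evolve(distribution, transition, steps):
    """Apply `step` repeatedly and return the final distribution."""
    for _ in range(steps):
        distribution = step(distribution, transition)
    return distribution


# Hypothetical per-round switching probabilities (each row sums to 1).
# State 0 = stem-like, state 1 = transit-amplifying-like.
TRANSITION = [
    [0.90, 0.10],  # a stem-like cell switches with probability 0.10
    [0.05, 0.95],  # a transit-amplifying-like cell switches back with probability 0.05
]

from_stem = evolve([1.0, 0.0], TRANSITION, 200)
from_transit = evolve([0.0, 1.0], TRANSITION, 200)
print([round(x, 4) for x in from_stem])
print([round(x, 4) for x in from_transit])
```

With these made-up probabilities, both starting populations settle at one-third stem-like and two-thirds transit-amplifying-like, illustrating how a pure population of either kind can regenerate the whole mixture.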

The cancer stem-cell phenomenon, whatever its basis, implies that even when
the tumor cells are genetically similar, they are phenotypically diverse. A treatment
that wipes out those in one state is likely to allow survival of others that
remain a danger. Radiotherapy or a cytotoxic drug, for example, may selectively
kill off the rapidly dividing cells, reducing the tumor volume to almost nothing,
and yet spare a few slowly dividing cells that go on to resurrect the disease. This
greatly adds to the difficulty of cancer therapy, and it is part of the reason why
treatments that seem at first to succeed often end in relapse and disappointment.

Colorectal Cancers Evolve Slowly Via a Succession of Visible 
Changes 

At the beginning of this chapter, we saw that most cancers develop gradually from
a single aberrant cell, progressing from benign to malignant tumors by the
accumulation of a number of independent genetic and epigenetic changes. We have
discussed what some of these changes are in molecular terms and seen how they
contribute to cancerous behavior. We now examine one of the common human
cancers more closely, using it to illustrate and enlarge upon some of the general
principles and molecular mechanisms we have introduced. We take colorectal
cancer as our example.

Colorectal cancers arise from the epithelium lining the colon (the large
intestine) and rectum (the terminal segment of the gut). The organization of this tissue
is broadly similar to that of the small intestine, discussed in detail in Chapter 22
(pp. 1217-1221). For both the small and large intestine, the epithelium is renewed
at an extraordinarily rapid rate, taking about a week to completely replace most of
the epithelial sheet. In both regions, the renewal depends on stem cells that lie in
deep pockets of the epithelium, called intestinal crypts. The signals that maintain
the stem cells and control the normal organization and renewal of the epithelium
are beginning to be quite well understood, as explained in Chapter 22. Mutations
that disrupt these signals begin the process of tumor progression for most
colorectal cancers (Movie 20.6).

Colorectal cancers are common, currently causing nearly 60,000 deaths a year
in the United States, or about 10% of total deaths from cancer. Like most
cancers, they are not usually diagnosed until late in life (90% occur after the age of
55). However, routine examination of normal adults with a colonoscope (a fiber
optic device for viewing the interior of the colon and rectum) often reveals a small
benign tumor, or adenoma, of the gut epithelium in the form of a protruding mass
of tissue called a polyp (see Figure 22-4). These adenomatous polyps are believed
to be the precursors of a large proportion of colorectal cancers. Because the
progression of the disease is usually very slow, there is typically a period of about
10 years in which the slowly growing tumor is detectable but has not yet turned
malignant. Thus, when people are screened by colonoscopy in their fifties and the
polyps are removed through the colonoscope (a quick and easy surgical
procedure), the subsequent incidence of colorectal cancer is much lower: according to
some studies, less than a quarter of what it would be otherwise.

Figure 20-33 An immunodeficient mouse, as used in transplantation
assays to test human cancer cells for their ability to found new tumors. This
nude mouse has a mutation that blocks development of the thymus and, as a
side effect, robs it of hair. Because it has practically no T cells, it tolerates grafts of
cells even from other species. (Courtesy of Harlan Sprague Dawley.)

In microscopic sections of polyps smaller than 1 cm in diameter, the cells and
their arrangement in the epithelium usually appear almost normal. The larger the
polyp, the more likely it is to contain cells that look abnormally undifferentiated
and form abnormally organized structures. Sometimes, two or more distinct areas
can be distinguished within a single polyp, with the cells in one area appearing
relatively normal and those in the other appearing clearly cancerous, as though
they have arisen as a mutant subclone within the original clone of adenomatous
cells. At later stages in the disease, some tumor cells become invasive in a small
fraction of the polyps, first breaking through the epithelial basal lamina, then
spreading through the layer of muscle that surrounds the gut, and finally
metastasizing to lymph nodes via lymphatic vessels and to liver, lung, and other organs
via blood vessels.

A Few Key Genetic Lesions Are Common to a Large Fraction of
Colorectal Cancers

What are the mutations that accumulate with time to produce this chain of 
events? Of those genes so far discovered to be involved in colorectal cancer, three 
stand out as most frequently mutated: the proto-oncogene K-Ras (a member of 
the Ras gene family), in about 40% of cases; p53, in about 60% of cases; and the 
tumor suppressor gene Apc (discussed below), in more than 80% of cases. Others 
are involved in smaller numbers of colon cancers, and some of these are listed in 
Table 20-1. 

The role of Apc first came to light through study of certain families
showing a rare type of hereditary predisposition to colorectal cancer, called familial


Table 20-1 Some Genetic Abnormalities Detected in Colorectal Cancer Cells

Gene                             Class                  Pathway affected                      Human colon cancers (%)
K-Ras                            Oncogene               Receptor tyrosine kinase signaling    40
β-Catenin¹                       Oncogene               Wnt signaling                         5-10
Apc¹                             Tumor suppressor       Wnt signaling                         >80
p53                              Tumor suppressor       Response to stress and DNA damage     60
TGFβ receptor II²                Tumor suppressor       TGFβ signaling                        10
Smad4²                           Tumor suppressor       TGFβ signaling                        30
MLH1 and other DNA mismatch      Tumor suppressor       DNA mismatch repair                   15
  repair genes (often silenced   (genetic stability)
  by DNA methylation)

¹,² The genes with the same superscript numeral act in the same pathway, and therefore only
one of the components is mutated in an individual cancer.



















Figure 20-34 Colon of familial adenomatous polyposis coli patient 
compared with normal colon. (A) The normal colon wall is a gently 
undulating but smooth surface. (B) The polyposis colon is completely covered 
by hundreds of projecting polyps, each resembling a tiny cauliflower when 
viewed with the naked eye. (Courtesy of Andrew Wyllie and Mark Arends.) 


adenomatous polyposis coli (FAP). In this syndrome, hundreds or thousands of
polyps develop along the length of the colon (Figure 20-34). These polyps start
to appear in early adult life, and if they are not removed, one or more will almost
always progress to become malignant; the average time from the first detection of
polyps to the diagnosis of cancer is 12 years. The disease can be traced to a
deletion or inactivation of the tumor suppressor gene Apc, named after the syndrome.
Individuals with FAP have inactivating mutations or deletions of one copy of the
Apc gene in all their cells and show loss of heterozygosity in tumors, even in the
benign polyps. Most patients with colorectal cancer do not have the hereditary
condition. Nevertheless, in more than 80% of the cases, their cancer cells (but not
their normal cells) have inactivated both copies of the Apc gene through
mutations acquired during the patient's lifetime. Thus, by a route similar to that which
we discussed for retinoblastoma, mutation of the Apc gene was identified as one
of the central ingredients of colorectal cancer.

The Apc protein, as we now know, is an inhibitory component of the Wnt
signaling pathway (discussed in Chapter 15). It binds to the β-catenin protein,
another component of the Wnt pathway, and helps to induce the protein's
degradation. By inhibiting β-catenin in this way, Apc prevents the β-catenin from
migrating to the nucleus, where it would act as a transcriptional regulator to drive
cell proliferation and maintain the stem-cell state (see Figure 15-60). Loss of Apc
results in an excess of free β-catenin and thus leads to an uncontrolled expansion
of the stem-cell population. This causes a massive increase in the number and size
of the intestinal crypts (see Figure 22-4).

When the β-catenin gene was sequenced in a collection of colorectal tumors,
it was discovered that many of the tumors that did not have Apc mutations had
activating mutations in β-catenin instead. Thus, it is excessive activity in the Wnt
signaling pathway that is critical for the initiation of this cancer, rather than any
single oncogene or tumor suppressor gene that the pathway contains.

This being so, why is the Apc gene in particular the most common
culprit in colorectal cancer? The Apc protein is large and it interacts not only with
β-catenin but also with various other cell components, including microtubules.
Loss of Apc appears to increase the frequency of mitotic spindle defects, leading
to chromosome abnormalities when cells divide. This additional, independent
cancer-promoting effect could explain why Apc mutations feature so prominently
in the causation of colorectal cancer.






Some Colorectal Cancers Have Defects in DNA Mismatch Repair

In addition to the hereditary disease (FAP) associated with Apc mutations, there
is a second, more common kind of hereditary predisposition to colon carcinoma
in which the course of events differs from the one we have described for FAP. In
this more common condition, called hereditary nonpolyposis colorectal cancer
(HNPCC), the probability of colon cancer is increased without any increase in the
number of colorectal polyps (adenomas). Moreover, the cancer cells are unusual,
in that they have a normal (or almost normal) karyotype. The majority of
colorectal tumors in non-HNPCC patients, in contrast, have gross chromosomal
abnormalities, with multiple translocations, deletions, and other aberrations, as well as
having many more chromosomes than normal (Figure 20-35).

The mutations that predispose HNPCC individuals to colorectal cancer occur
in one of several genes that code for central components of the DNA mismatch
repair system. These genes are homologous in structure and function to the MutL
and MutS genes in bacteria and yeast (see Figure 5-19). Only one of the two
copies of the involved gene is defective, so the repair system is still able to remove
the inevitable DNA replication errors that occur in the patient's cells. However, as
discussed previously, these individuals are at risk, because the accidental loss or
inactivation of the remaining good gene copy will immediately elevate the
spontaneous mutation rate by a hundredfold or more (discussed in Chapter 5). These
genetically unstable cells then can presumably speed through the standard
processes of mutation and natural selection that allow clones of cells to progress to
malignancy.

This particular type of genetic instability produces invisible changes in the
chromosomes, most notably changes in individual nucleotides and short
expansions and contractions of mono- and dinucleotide repeats such as AAAA... or
CACACA.... Once the defect in HNPCC patients was recognized, the epigenetic
silencing or mutation of mismatch repair genes was found in about 15% of the
colorectal cancers occurring in people with no inherited predisposing mutation.

Thus, the genetic instability found in many colorectal cancers can be acquired
in at least two ways. The majority of the cancers display a form of chromosomal
instability that leads to visibly altered chromosomes, whereas in the others the
instability occurs on a much smaller scale and reflects a defect in DNA mismatch
repair. Indeed, many carcinomas show either chromosomal instability or
defective mismatch repair, but rarely both. These findings clearly demonstrate that
genetic instability is not an accidental by-product of malignant behavior but a
contributory cause, and that cancer cells can acquire this instability in multiple
ways.

The Steps of Tumor Progression Can Often Be Correlated with
Specific Mutations

In what order do K-Ras, p53, Apc, and the other identified colorectal
cancer-critical genes mutate, and what contribution does each of them make to the asocial
behavior of the cancer cell? There is no single answer, because colorectal cancer
can arise by more than one route: thus, we know that in some cases, the first
mutation can be in a DNA mismatch repair gene; in others, it can be in a gene
regulating cell proliferation. Moreover, as previously discussed, a general feature such as
genetic instability or a tendency to proliferate abnormally can arise in a variety of
ways, through mutations in different genes.

Nevertheless, certain sets of mutations are particularly common in colorectal
cancer, and they occur in a characteristic order. Thus, in most cases, mutations
inactivating the Apc gene appear to be the first, or at least a very early step, as they
are detected at the same high frequency in small benign polyps as in large
malignant tumors. Changes that lead to genetic and epigenetic instability are likely also
to arise early in tumor progression, since they are needed to drive the later steps.

Activating mutations in the K-Ras gene occur later, as they are rare in small
polyps but common in larger ones that show disturbances in cell differentiation
and histological pattern.

Inactivating mutations in p53 are thought to come later still, as they are rare
in polyps but common in carcinomas (Figure 20-36). We have seen that loss
of p53 function allows cancer cells to endure stress and to avoid apoptosis and
cell-cycle arrest. Additionally, loss of p53 is related to the heightened activation
of oncogenes such as Ras. Experiments in mice show that an initial low level of
oncogene activation can give rise to a slowly growing tumor even while p53 is
functional: genes such as Ras are, after all, part of the normal machinery of growth
control, and moderate activation is not stressful for a cell and does not call the p53
protein into play. Progression of a tumor from slow to rapid, malignant growth,
however, involves activation of oncogenes beyond normal physiological limits to
a higher, stressful level. If the p53 protein is present and functional, this should
lead to cell-cycle arrest or death. Only by losing p53 function can the cancer cells
with hyperactive oncogenes survive and progress.

Figure 20-35 Chromosome complements (karyotypes) of colon cancers
showing different kinds of genetic instability. (A) The karyotype of a typical
cancer shows many gross abnormalities in chromosome number and structure.
Considerable variation can also exist from cell to cell (not shown). (B) The
karyotype of a tumor that has a stable chromosome complement with few
chromosomal anomalies; the genetic abnormalities are mostly invisible, having
been created by defects in DNA mismatch repair. All of the chromosomes in this
figure were stained as in Figure 4-10, the DNA of each human chromosome being
marked with a different combination of fluorescent dyes. (Courtesy of Wael
Abdel-Rahman and Paul Edwards.)

The steps we have just described are only part of the picture. It is important to
emphasize that each case of colorectal cancer is different, with its own detailed
combination of mutations, and that even for the mutations that are commonly
shared, the sequence of occurrence may vary. The same is true for cancers in
general.

Advances in molecular biology have recently provided the tools to find out
precisely which genes are amplified, deleted, mutated, or misregulated by
epigenetic mechanisms in the tumor cells of any given patient. As we discuss in the
next section, such information promises to become as important for the diagnosis
and treatment of cancer as was the breakthrough of being able to identify
microorganisms for the treatment of infectious diseases.


Figure 20-36 Suggested typical sequence of genetic changes underlying
the development of a colorectal carcinoma. This oversimplified diagram
provides a general idea of the way mutation and tumor development are related.
But many other mutations are generally involved, and different colon cancers can
progress through different sequences of mutations (and/or epigenetic changes).


Summary 

The molecular analysis of cancer cells reveals two classes of cancer-critical genes:
oncogenes and tumor suppressor genes. A set of these genes becomes altered by a
combination of genetic and epigenetic accidents to drive tumor progression. Many
cancer-critical genes code for components of the social control pathways that
regulate when cells grow, divide, differentiate, or die. In addition, a subclass of tumor
suppressors can be categorized as “genome maintenance genes,” because their
normal role is to help maintain genome integrity.

The inactivation of the p53 pathway, which occurs in nearly all human cancers,
allows genetically damaged cells to escape apoptosis and continue to proliferate.
Inactivation of the Rb pathway also occurs in most human cancers, illustrating how
fundamental each of these pathways is for protecting us against cancer.

The sequencing of cancer cell genomes reveals that, except for the cancers of
childhood, many cancers acquire 10 or so driver mutations over the long course
of tumor progression, along with a considerably larger number of passenger
mutations of no consequence. The same methods reveal how subclones of cells arise and
die out as a tumor ages. Tumors thus contain a heterogeneous mixture of cells,
some (the so-called cancer stem cells) being much more dangerous than others.

We can often correlate the steps of tumor progression with mutations that
activate specific oncogenes and inactivate specific tumor suppressor genes, with
colon cancer providing a good example. But different combinations of mutations
and epigenetic changes are found in different types of cancer, and even in different
patients with the same type of cancer, reflecting the random way in which these
inherited changes arise. Nevertheless, many of the same changes are
encountered repeatedly, suggesting that there are a limited number of ways to breach our
defenses against cancer.

CANCER PREVENTION AND TREATMENT: PRESENT
AND FUTURE

We can apply the growing understanding of the molecular biology of cancer
to sharpen our attack on the disease at three levels: prevention, diagnosis, and
treatment. Prevention is always better than cure, and indeed many cancers can
be prevented, especially by avoiding smoking. Highly sensitive molecular assays
promise new opportunities for earlier and more precise diagnosis, with the aim of
detecting primary tumors while they are still small and have not yet metastasized.
Cancers caught at these early stages can often be nipped in the bud by surgery or
radiotherapy, as we saw for colorectal polyps. Nevertheless, full-blown malignant
disease will continue to be common for many years to come, and cancer
treatments will continue to be needed.

In this section, we first examine the preventable causes of cancer and then 
consider how advances in our understanding at a molecular level are beginning 
to transform the treatment of the disease. 

Epidemiology Reveals That Many Cases of Cancer Are 
Preventable 

A certain irreducible background incidence of cancer is to be expected regardless
of circumstances. As discussed in Chapter 5, mutations can never be absolutely
avoided because they are an inescapable consequence of fundamental
limitations on the accuracy of DNA replication and repair. If a person could live long
enough, it is inevitable that at least one of his or her cells would eventually
accumulate a set of mutations sufficient for cancer to develop.

Nevertheless, environmental factors seem to play a large part in determining
the risk for cancer. This is demonstrated most clearly by a comparison of cancer
incidence in different countries: for almost every cancer that is common in one
country, there is another country where the incidence is much lower. Because
migrant populations tend to adopt the pattern of cancer incidence typical of their
new host country, the differences are thought to be due mostly to environmental,
not genetic, factors. From such findings, it has been suggested that 80-90% of
cancers should be avoidable, or at least postponable (Figure 20-37).

Unfortunately, different cancers have different environmental risk factors, and
a population that escapes one such danger is usually exposed to another. This is
not, however, inevitable. There are some human subgroups whose way of life
substantially reduces the total cancer death rate among individuals of a given age.
Under the current conditions in the United States and Europe, approximately one
in five people will die of cancer. But the incidence of cancer among strict
Mormons in Utah (who avoid alcohol, coffee, cigarettes, drugs, and casual sex) is
only about half the incidence for non-practicing members of the same family or
for Americans in general. Cancer incidence is also low in certain relatively
affluent populations in Africa.

Although such observations on human populations indicate that cancer can
often be avoided, it has been difficult in most cases (with tobacco as a striking
exception) to pinpoint the specific environmental factors responsible for these
large population differences or to establish how they act. Nevertheless, several
important classes of environmental cancer risk factors have been identified
(Figure 20-37B). One thinks first of mutagens. But there are also many other
influences, including the amount of food we eat, the hormones that circulate in our
bodies, and the irritations, infections, and damage to which we expose our
tissues, that are no less important and favor development of the disease in other
ways.

Sensitive Assays Can Detect Those Cancer-Causing Agents that 
Damage DNA 

Many quite disparate chemicals are carcinogenic when they are fed to
experimental animals or painted repeatedly on their skin. Examples include a range
of aromatic hydrocarbons and derivatives of them such as aromatic amines,
nitrosamines, and alkylating agents such as mustard gas. Although these
chemical carcinogens are diverse in structure, a large proportion of them have at least
one shared property: they cause mutations. In one common test for
mutagenicity (the Ames test), the carcinogen is mixed with an activating extract prepared
from rat liver cells (to mimic the biochemical processing that occurs in an intact
animal). The mixture is then added to a culture of specially designed test
bacteria and the bacterial mutation rate measured. Most of the compounds scored as
mutagenic by this rapid and convenient assay in bacteria also cause mutations or
chromosome aberrations when tested on mammalian cells.

Figure 20-37 Cancer incidence is related to environmental influences.
(A) This map of the world shows the rates of cancer increasing (red
arrows) or decreasing (blue arrows) when specific populations move from
one location to another. Such observations suggest the importance of
environmental factors, including diet, in dictating cancer risk. (B) Some
estimated effects of environment and lifestyle on cancer in the United
States (US). The table shows both the yearly deaths in the US attributable
to each cancer and the estimated percentage of that cancer that could be
eliminated through prevention. (B, data from G.A. Colditz, K.Y. Wolin and
S. Gehlert, Sci. Transl. Med. 4:127rv4, 2012.)

(B)
cause                            cancers caused        number of deaths     magnitude of reduction
                                 (percent of total)    in US (annual)       possible (percent)
smoking                          33                    189,000              75
diet, overweight, and obesity    25                    143,000              50
lack of exercise                 5                     28,600               85
viruses                          5                     28,600               100
alcohol                          3                     17,200               50
UV and ionizing radiation        2                     11,400               50
occupational carcinogens         5                     28,600               50

A few of these carcinogens act directly on DNA. But generally the more potent
ones are relatively inert chemically; these chemicals become damaging only after
they have been converted to a more reactive molecule by metabolic processes in
the liver, catalyzed by a set of intracellular enzymes known as the cytochrome P-450
oxidases. These enzymes normally help to convert ingested toxins into harmless
and easily excreted compounds. Unhappily, their activity on certain chemicals
generates products that are highly mutagenic. Examples of carcinogens activated
in this way include benzo[a]pyrene, a cancer-causing chemical present in coal tar
and tobacco smoke, and the fungal toxin aflatoxin B1 (Figure 20-38).

Fifty Percent of Cancers Could Be Prevented by Changes in 
Lifestyle 

Tobacco smoke is the most important carcinogen in the world today. Even though
many other chemical carcinogens have been identified, none of these appear to
be responsible for anything like the same numbers of human cancer deaths. It is
sometimes thought that the main environmental causes of cancer are the
products of a highly industrialized way of life (the rise in pollution, the enhanced use
of food additives, and so on), but there is little evidence to support this view. The
idea may have come in part from the identification of some highly carcinogenic
materials used in industry, such as 2-naphthylamine and asbestos. Except for the
increase in cancers caused by smoking, however, age-adjusted death rates for
most common human cancers have stayed much the same over the past
half-century, or, in some cases, have declined significantly (Figure 20-39). Survival rates,
moreover, have improved. Thirty years ago, less than 50% of patients lived more
than five years from the time of diagnosis; now, more than two-thirds do so.

Figure 20-38 Some known carcinogens. (A) Carcinogen activation.
A metabolic transformation must activate many chemical carcinogens before
they will cause mutations by reacting with DNA. The compound illustrated
here is aflatoxin B1, a toxin from a mold (Aspergillus flavus oryzae) that grows
on grain and peanuts when they are stored under humid tropical conditions.
Aflatoxin is an important cause of liver cancer in the tropics. (B) Different
carcinogens cause different types of cancer. (B, data from Cancer and the
Environment: Gene Environment Interactions, National Academies Press, 2002.)
[Panel A labels: AFLATOXIN; AFLATOXIN-2,3-EPOXIDE; CARCINOGEN BOUND TO GUANINE IN DNA]


Most of the carcinogenic factors that are known to be significant are by no
means specific to the modern world. The most potent known carcinogen, by
certain assays at least, is aflatoxin B1 (see Figure 20-38). It is produced by fungi that
naturally contaminate foods such as tropical peanuts and is an important cause of
liver cancer in Africa and Asia.

Except for tobacco, chemical toxins and mutagens are of lesser importance
as contributory causes of cancer than other factors that are more a matter of
personal choice. One important factor is the quantity of food we eat: as mentioned
earlier, the risk of cancer is greatly increased in people who are obese. In fact, it is
estimated that as many as 50% of all cancers could be avoided by simple,
identifiable changes in lifestyle (see Figure 20-37B).
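
The 50% estimate can be checked roughly against the numbers in Figure 20-37B.
The sketch below multiplies each cause's share of all cancers by the fraction of
those cases judged preventable and adds the results. It assumes the listed
causes do not overlap and that their contributions simply add, which is a
simplification made for this example; the function name is likewise only
illustrative.

```python
# Rows copied from Figure 20-37B:
# (cause, percent of all cancers caused, percent reduction judged possible)
FIGURE_20_37B = [
    ("smoking", 33, 75),
    ("diet, overweight, and obesity", 25, 50),
    ("lack of exercise", 5, 85),
    ("viruses", 5, 100),
    ("alcohol", 3, 50),
    ("UV and ionizing radiation", 2, 50),
    ("occupational carcinogens", 5, 50),
]


def preventable_percent(rows):
    """Percent of all cancers avoidable, assuming causes are additive and non-overlapping."""
    return sum(caused * reducible / 100 for _, caused, reducible in rows)


if __name__ == "__main__":
    for cause, caused, reducible in FIGURE_20_37B:
        print(f"{cause}: {caused * reducible / 100:.2f}% of all cancers")
    print(f"total: {preventable_percent(FIGURE_20_37B):.1f}%")
```

Under these assumptions the total comes to 51.5% of all cancers, in line with
the figure of about 50% quoted in the text, with smoking alone accounting for
nearly half of the preventable share.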


(B)
• VINYL CHLORIDE: liver angiosarcoma
• BENZENE: acute leukemias
• ARSENIC: skin carcinomas, bladder cancer
• ASBESTOS: mesothelioma
• RADIUM: osteosarcoma


Viruses and Other Infections Contribute to a Significant Proportion 
of Human Cancers 


Cancer in humans is not an infectious disease, and most human cancers do not
have any infectious cause. However, a small but significant proportion of human
cancers, perhaps 15% in the world as a whole, are thought to arise by mechanisms
that involve viruses, bacteria, or parasites. Evidence for their involvement comes
partly from the detection of viruses in cancer patients and partly from
epidemiology. Thus, cancer of the uterine cervix is associated with infection with a
papillomavirus, while liver cancer is very common in parts of the world (Africa and
Southeast Asia) where hepatitis-B viral infections are common. Chronic infection
with hepatitis-C virus, which has infected 170 million people worldwide, is also
clearly associated with the development of liver cancer.

Figure 20-39 Age-adjusted cancer death rates, United States, 1930-2008.
Selected death rates, adjusted to the age distribution of the US population,
are plotted for (A) females and (B) males. Note the dramatic rise in lung cancer
for both sexes, following the pattern of tobacco smoking, and the fall in deaths
from stomach cancer, thought to be related to a fall in rates of infection with
Helicobacter pylori. Recent reductions in other cancer death rates may correspond
to improvements in detection and treatment. Age-adjusted data like these are
needed to compensate for the inevitable increase in cancer as people live longer,
on average. (Adapted from Cancer Facts and Figures, 2012. Data from U.S. Mortality
Volumes 1930 to 1959, U.S. Mortality Data 1960 to 2008, National Center for Health
Statistics, Centers for Disease Control and Prevention. © 2012, American Cancer
Society, Inc., Surveillance Research.)

Table 20-2 Viruses Associated with Human Cancers

Virus                                                   Associated cancer                               Areas of high incidence
DNA viruses
  Papovavirus family
    Papillomavirus (many distinct strains)              Warts (benign)                                  Worldwide
                                                        Carcinoma of the uterine cervix                 Worldwide
  Hepadnavirus family
    Hepatitis-B virus                                   Liver cancer (hepatocellular carcinoma)         Southeast Asia, tropical Africa
  Herpesvirus family
    Epstein-Barr virus                                  Burkitt's lymphoma (cancer of B lymphocytes)    West Africa, Papua New Guinea
                                                        Nasopharyngeal carcinoma                        Southern China, Greenland
    Human herpesvirus 8                                 Kaposi's sarcoma                                Central and Southern Africa
RNA viruses
  Retrovirus family
    Human T-cell leukemia virus type 1 (HTLV-1)         Adult T-cell leukemia/lymphoma                  Japan, West Indies
    Human immunodeficiency virus (HIV, the AIDS virus)  Kaposi's sarcoma (via human herpesvirus 8)      Central and Southern Africa
  Flavivirus family
    Hepatitis-C virus                                   Liver cancer (hepatocellular carcinoma)         Worldwide

For all these viruses, the number of people infected is much larger than the number who
develop cancer: the viruses must act in conjunction with other factors. As described in the text,
different viruses contribute to cancer in different ways.

The main culprits, as shown in Table 20-2, are the DNA viruses. The DNA
tumor viruses cause cancer by the most direct route: by interfering with controls
of the cell cycle and apoptosis. To understand this type of viral carcinogenesis, it
is important to review the life history of viruses. Many DNA viruses use the host
cell's DNA replication machinery to replicate their own genomes. However, to
produce a large number of infectious virus particles within a single host cell, the
DNA virus has to commandeer this machinery and drive it hard, breaking through
the normal constraints on DNA replication and usually killing the host cell in the
process. Many DNA viruses reproduce only in this way. But some have a second
option: they can propagate their genome as a quiet, well-behaved passenger in
the host cell, replicating in parallel with the host cell's DNA (either integrated into
the host genome, or as an extrachromosomal plasmid) in the course of ordinary
cell-division cycles. These viruses will switch between two modes of existence
according to circumstances, remaining latent and harmless for a long time, but
then proliferating in occasional cells in a process that kills the host cell and
generates large numbers of infectious particles.

Neither of these conditions converts the host cell to a cancerous character,
nor is it in the interest of the virus to do so. But for viruses with a latent phase,
accidents can occur that prematurely activate some of the viral proteins that the
virus would normally use in its replicative phase to allow the viral DNA to
replicate independently of the cell cycle. As described in the example below, this type
of accident can switch on the persistent proliferation of the host cell itself, leading
to cancer.


Cancers of the Uterine Cervix Can Be Prevented by Vaccination 
Against Human Papillomavirus 

The papillomaviruses are a prime example of DNA tumor viruses. They are
responsible for human warts and are especially important as a cause of carcinoma
of the uterine cervix: this is the second commonest cancer of women in the world
as a whole, representing about 6% of all human cancers. Human papillomaviruses
(HPV) infect the cervical epithelium and maintain themselves in a latent phase
in the basal layer of cells as extrachromosomal plasmids, which replicate in step
with the chromosomes. Infectious virus particles are generated through a switch
to a replicative phase in the outer epithelial layers, as progeny of these cells begin
to differentiate before being sloughed from the surface. Here, cell division should
normally stop, but the virus interferes with this cell-cycle arrest so as to allow
replication of its own genome. Usually, the effect is restricted to the outer layers
of cells and is relatively harmless, as in a wart. Occasionally, however, a genetic
accident causes the viral genes that encode the proteins that prevent cell-cycle
arrest to integrate into the host chromosome and become active in the basal layer,
where the stem cells of the epithelium reside (see Figure 22-10). This can lead to
cancer, with the viral genes acting as oncogenes (Figure 20-40).

The whole process, from initial infection to invasive cancer, is slow, taking
many years. It involves a long intermediate stage when the affected patch of
cervical epithelium is visibly disordered but the cells have not yet begun to invade the
underlying connective tissue, a phenomenon called intraepithelial neoplasia.
Many such lesions regress spontaneously. Moreover, at this stage, it is still easy to
cure the condition by destroying or surgically removing the abnormal tissue.
Fortunately, the presence of such lesions can be detected by scraping off a sample of
cells from the surface of the cervix and viewing it under the microscope (the “Pap
smear” technique).

Better still, a vaccine has now been developed that protects against infection 
with the relevant strains of human papillomavirus. This vaccine, given to girls 
before puberty and thus before they become sexually active, has been shown 
to greatly reduce their risk of ever developing cervical cancer. Because the virus 
spreads through sexual activity, it is now recommended that both young males 
and young females be routinely vaccinated. Mass immunization programs have
begun in several countries. 


Figure 20-40 How certain papillomaviruses are thought to give rise
to cancer of the uterine cervix. Papillomaviruses have double-stranded
circular DNA chromosomes of about 8000 nucleotide pairs. These
chromosomes are normally stably maintained in the basal cells of the
epithelium as plasmids (red circles), whose replication is regulated so as to
keep step with the chromosomes of the host. (A) Normally, the virus perturbs
the host cell cycle only when the virus is programmed to produce infectious
progeny, in the outer layers of an epithelium. This is relatively harmless.
(B) Rare accidents can cause the integration of a fragment of such a plasmid
into a chromosome of the host, altering the environment of the viral genes in
the basal cells of an epithelium. This can disrupt the normal control of viral
gene expression. The unregulated production of certain viral proteins (E6 and
E7) interferes with the control of cell division in the basal cells, thereby helping
to generate a cancer (bottom).


Infectious Agents Can Cause Cancer in a Variety of Ways

In papillomaviruses, the viral genes that are mainly to blame are called E6 and E7.
The protein products of these viral oncogenes interact with many host-cell
proteins, but, in particular, they bind to two key tumor suppressor proteins of the host
cell, putting them both out of action and so permitting the cell to replicate its DNA
and divide in an uncontrolled way. One of these host proteins is Rb; the other
is p53. Other DNA tumor viruses use similar mechanisms to inhibit Rb and p53,
underlining the central importance of inactivating both of these tumor suppressor
pathways if a cell is to escape the normal constraints on proliferation.

In other cancers, viruses have indirect tumor-promoting actions. The
hepatitis-B and C viruses, for example, favor the development of liver cancer by causing
chronic inflammation (hepatitis), which stimulates an extensive cell division in
the liver that promotes the eventual evolution of tumor cells. In AIDS, the human
immunodeficiency virus (HIV) promotes development of an otherwise rare
cancer called Kaposi's sarcoma by destroying the immune system, thereby
permitting a secondary infection with a human herpesvirus (HHV-8) that has a direct
carcinogenic action. By causing severe inflammation, chronic infection with
parasites and bacteria can also promote the development of some cancers. For
example, chronic infection of the stomach with the bacterium Helicobacter pylori,
which causes ulcers, appears to be a major cause of stomach cancer; dramatic
falls in the incidence of stomach cancer over the last half-century (see Figure
20-39) correlate with a decline in the incidence of Helicobacter infections.

The Search for Cancer Cures Is Difficult but Not Hopeless 

The difficulty of curing a cancer is similar to the difficulty of getting rid of weeds.
Cancer cells can be removed surgically or destroyed with toxic chemicals or
radiation, but it is hard to eradicate every single one of them. Surgery can rarely ferret
out every metastasis, and treatments that kill cancer cells are generally toxic to
normal cells as well. Moreover, unlike normal cells, cancer cells can mutate
rapidly and will often evolve resistance to the poisons and irradiation used against
them.

In spite of these difficulties, effective cures using anticancer drugs (alone or in
combination with other treatments) have already been found for some formerly
highly lethal cancers, including Hodgkin's lymphoma, testicular cancer,
choriocarcinoma, and some leukemias and other cancers of childhood. Even for types
of cancer where a cure at present seems beyond our reach, there are treatments
that will prolong life or at least relieve distress. But what prospect is there of doing
better and finding cures for the most common forms of cancer, which still cause
great suffering and so many deaths?

Traditional Therapies Exploit the Genetic Instability and Loss of
Cell-Cycle Checkpoint Responses in Cancer Cells

Anticancer therapies need to take advantage of some molecular peculiarity of
cancer cells that distinguishes them from normal cells. One such property is
genetic instability, reflecting deficiencies in chromosome maintenance,
cell-cycle checkpoints, and/or DNA repair. Remarkably, the most widely used cancer
therapies seem to work by exploiting these abnormalities, although this was not
known by the scientists who first developed the treatments. Ionizing radiation
and most anticancer drugs damage DNA or interfere with chromosome
segregation at mitosis, and they preferentially kill cancer cells because cancer cells have a
diminished ability to survive the damage. Normal cells treated with radiation, for
example, arrest their cell cycle until they have repaired the damage to their DNA,
thanks to the cell-cycle checkpoint responses discussed in Chapter 17. Because
cancer cells generally have defects in their checkpoint responses, they may
continue to divide after irradiation, only to die after a few days because the genetic
damage remains unrepaired. More generally, most cancer cells are
physiologically deranged to a stressful degree: they live dangerously. Even though the cells
in a tumor have evolved to be unusually tolerant of minor DNA damage, they are
hypersensitive to the much greater amount of damage that can be created by
radiation and by DNA-damaging drugs. A small increase of genetic damage can be
enough to tip the balance between proliferation and death.

Unfortunately, while the molecular defects present in cancer cells often
enhance their sensitivity to cytotoxic agents, they can also increase their
resistance. For example, where a normal cell might die by apoptosis in response to
DNA damage, thanks to the stress response mediated by p53, a cancer cell may
escape apoptosis because its p53 is lacking. Cancers vary widely in their
sensitivity to cytotoxic treatments, some responding to one drug, some to another,
probably reflecting the particular kinds of defects that a particular cancer has in DNA
repair, cell-cycle checkpoints, and the control of apoptosis.

New Drugs Can Kill Cancer Cells Selectively by Targeting Specific
Mutations

Radiotherapy and traditional cytotoxic drugs are rather weakly selective: they hurt
normal cells as well as the cancer cells, and the safety margin is narrow. The dose
often cannot be raised high enough to kill all the cancer cells, because this would
kill the patient, and curative treatments, where achievable, generally require a
combination of several cytotoxic agents. The side effects can be harsh and hard to
endure. How can we do better?

An ideal treatment is one that is cell-lethal in combination with some lesion
that is present in the cancer cells, but harmless to cells where this lesion is absent.
Such a treatment is said to be synthetic-lethal (from the original sense of the word
synthesis, meaning “putting together”): it kills only in partnership with the
cancer-specific mutation. As we become increasingly able to pinpoint the specific
alterations in cancer cells that make them different from their normal neighbors,
new opportunities for such precisely targeted treatments are coming into view.
We end this chapter with some examples of new treatments of this type that are
already being put into practice.

PARP Inhibitors Kill Cancer Cells That Have Defects in Brca1 or
Brca2 Genes

As we have emphasized, the genetic instability of cancer cells makes the cells
both dangerous and vulnerable: dangerous because of the enhancement in their
ability to evolve and proliferate, and vulnerable because treatment that leads to
still more extreme genetic disruption can take them over the brink and kill them.
In some cancers, genetic instability results from an identified fault in one of the
many devices on which normal cells depend for DNA repair and maintenance.
In this case, a drug that is tailored to block a complementary part of the DNA repair
machinery can lead to such severe genetic damage that the cancer cells die.

Detailed studies of the mechanisms for DNA maintenance discussed in
Chapter 5 reveal a surprising amount of apparent redundancy. Thus, knocking out a
particular pathway for DNA repair is generally less disastrous than one might
expect, because alternate repair pathways exist. For example, stalled DNA
replication forks can arise when the fork encounters a single-strand break in a
template strand, but cells can avoid the disaster that would otherwise result either by
directly repairing these single-strand breaks, or, if that fails, repairing the broken
fork that results by homologous recombination (see Figure 5-50). Suppose that the
cells in a particular cancer have become genetically unstable by acquiring a
mutation that reduces their ability to repair broken replication forks by homologous
recombination. Might it be possible to eradicate that cancer by treating it with a
drug that inhibits the repair of single-strand breaks, thereby greatly increasing the
number of forks that break? The consequences of such drug treatment might be
expected to be relatively harmless for normal cells, but lethal for the cancer.

This strategy appears to work to kill the cells in at least one class of cancers:
those that have inactivated both copies of either their Brca1 or their Brca2 tumor

Figure 20-41 How a tumor’s genetic instability can be exploited for cancer
therapy. As explained in Chapter 5, the maintenance of DNA sequences is so
critical for life that cells have evolved multiple pathways for repairing DNA
damage and reducing DNA replication errors. As illustrated, a DNA replication
fork will stall whenever it encounters a break in a DNA template strand. In this
example, normal cells have two different repair pathways that help them to avoid
the problem, pathways 1 and 2. They are therefore not harmed by treatment with a
drug that blocks repair pathway 1. But, because the inactivation of repair pathway
2 was selected for during the evolution of the tumor cell, the tumor cells are killed by
the same drug treatment.

In the actual case that underlies this example, the function of repair pathway 1
(requiring the PARP protein discussed in the text) is to remove persistent, accidental
breaks in a DNA single strand before they are encountered by a moving replication
fork. Pathway 2 is the recombination-dependent process (requiring the Brca2
and Brca1 proteins) for repairing stalled replication forks illustrated in Figure 5-50.
PARP inhibitors have promise for treating cancers with defective Brca2 or Brca1
tumor suppressor genes.


suppressor genes. As described in Chapter 5, Brca2 is an accessory protein that
interacts with the Rad51 protein (the RecA analog in humans) in the repair of
DNA double-strand breaks by homologous recombination. Brca1 is another
protein that is also required for this repair process. Like Rb, the Brca1 and Brca2
genes were discovered as mutations that predispose humans to cancer, in this
case chiefly cancers of the breast and ovaries (though unlike Rb, they seem to be
involved in only a small proportion of such cancers). Individuals who inherit one
mutant copy of Brca1 or Brca2 develop tumors that have inactivated the second
copy of the same gene, presumably because this change makes the cells
genetically unstable and speeds tumor progression.

While Brca1 and Brca2 are needed for the repair of DNA double-strand breaks,
single-strand breaks are repaired by other machinery, involving an enzyme called
PARP (polyADP-ribose polymerase). This understanding of the basic
mechanisms of DNA repair led to a striking discovery: drugs that block PARP activity kill
Brca-deficient cells with extraordinary selectivity. At the same time, PARP
inhibition has very little effect on normal cells; in fact, mice that have been engineered
to lack PARP1 (the major PARP family member involved in DNA repair) remain
healthy under laboratory conditions. This result suggests that, while the repair
pathway requiring PARP provides a first line of defense against persistent breaks in
a DNA strand, these breaks can be repaired efficiently by a genetic recombination
pathway in normal cells. In contrast, tumor cells that have acquired their genetic
instability by the loss of Brca1 or Brca2 have lost this second line of defense, and
they are therefore uniquely sensitive to PARP inhibitors (Figure 20-41).
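
The logic of this synthetic-lethal interaction can be written out as a small truth table. The Python sketch below is only an illustration of the reasoning in the text, not a model of real cell behavior: it assumes, as a deliberate simplification, that a cell copes with persistent single-strand breaks whenever at least one of the two repair routes (the PARP-dependent route or the Brca-dependent recombination route) is working. The function names are hypothetical.

```python
from itertools import product


def cell_survives(parp_active: bool, brca_functional: bool) -> bool:
    """Simplifying assumption: one working repair route is enough to survive."""
    return parp_active or brca_functional


def synthetic_lethal_table():
    """Enumerate all combinations of Brca status and PARP-inhibitor treatment."""
    rows = []
    for brca_functional, parp_inhibitor in product([True, False], repeat=2):
        parp_active = not parp_inhibitor  # the drug blocks the PARP route
        survives = cell_survives(parp_active, brca_functional)
        rows.append((brca_functional, parp_inhibitor, survives))
    return rows


if __name__ == "__main__":
    print("Brca functional | PARP inhibitor | cell survives")
    for brca_functional, parp_inhibitor, survives in synthetic_lethal_table():
        print(f"{brca_functional!s:15} | {parp_inhibitor!s:14} | {survives}")
```

Only the row that combines a Brca defect with the PARP inhibitor gives a dead cell, which is the sense in which neither lesion alone, but the two together, is lethal.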

PARP inhibitors are still under clinical trial, but they have produced some
striking results, causing tumors to regress in many Brca-deficient patients and
delaying progression of their disease, with relatively few disagreeable side effects. These
drugs also appear to be applicable to cancers with other mutations that cause
defects in the cell's homologous recombination machinery: a small, though
significant, proportion of cancer cases.

PARP inhibition provides an example of the type of rational, highly selective
approach to cancer therapy that is beginning to be possible. Along with other new
treatments to be discussed below, it raises high hopes for treating many other
cancers.

Small Molecules Can Be Designed to Inhibit Specific Oncogenic
Proteins

An obvious tactic for treating cancer is to attack a tumor expressing an oncogene
with a drug designed to specifically block the function of the protein that the
oncogene produces. But how can such a treatment avoid hurting the normal cells
that depend on the function of the proto-oncogene from which the oncogene has
evolved, and why should the drug kill the cancer cells, rather than simply calm
them down? One answer may lie in the phenomenon of oncogene dependence.
Once a cancer cell has undergone an oncogenic mutation, it will often undergo
further mutations, epigenetic changes, or physiological adaptations that make it
reliant on the hyperactivity of the initial oncogene, just as drug addicts become
reliant on high doses of their drug. Blocking the activity of the oncogenic protein
may then kill the cancer cell without significantly harming its normal neighbors.
Some remarkable successes have been achieved in this way.

As we saw earlier, chronic myelogenous leukemia (CML) is usually associated
with a particular chromosomal translocation, visible as the Philadelphia
chromosome (see Figure 20-5). This results from chromosome breakage and rejoining
at the sites of two specific genes, Abl and Bcr. The fusion of these genes creates
a hybrid gene, called Bcr-Abl, that codes for a chimeric protein consisting of the
N-terminal fragment of Bcr fused to the C-terminal portion of Abl (Figure 20-42).
Abl is a tyrosine kinase involved in cell signaling. The substitution of the Bcr
fragment for the normal N-terminus of Abl makes it hyperactive, so that it stimulates
inappropriate proliferation of the hemopoietic precursor cells that contain it and
prevents these cells from dying by apoptosis, which many of them would
normally do. As a result, excessive numbers of white blood cells accumulate in the
bloodstream, producing CML.

The chimeric Bcr-Abl protein is an obvious target for therapeutic attack.
Searches for synthetic drug molecules that can inhibit the activity of tyrosine
kinases discovered one, called imatinib (trade name Gleevec®), that blocks
Bcr-Abl (Figure 20-43). When the drug was first given to patients with CML, nearly
all of them showed a dramatic response, with an apparent disappearance of the
cells carrying the Philadelphia chromosome in over 80% of patients. The response
appears relatively durable: after years of continuous treatment, many patients
have not progressed to later stages of the disease, although imatinib-resistant
cancers emerge with a probability of about 5% per year during the early years.
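
To get a feel for what a risk of about 5% per year means over several years, one can compound it. The sketch below is a back-of-the-envelope calculation only: it assumes that the annual probability is constant and independent from year to year, which the text does not claim (it gives the figure only for the early years of treatment). The function name is hypothetical.

```python
def cumulative_resistance(annual_probability: float, years: int) -> float:
    """Probability that resistance has emerged at least once within `years`,
    assuming a constant, independent risk each year (an assumption, not data)."""
    return 1.0 - (1.0 - annual_probability) ** years


if __name__ == "__main__":
    for years in range(1, 6):
        p = cumulative_resistance(0.05, years)
        print(f"after {years} year(s): {p:.1%}")
```

Under this assumption, the chance that a resistant cancer has emerged is 1 - 0.95^n after n years.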
Figure 20-42 The conversion of the Abl proto-oncogene into an oncogene
in patients with chronic myelogenous leukemia. The chromosome translocation
responsible joins the Bcr gene on chromosome 22 to the Abl gene from
chromosome 9, thereby generating a Philadelphia chromosome (see Figure
20-5). The resulting fusion protein has the N-terminus of the Bcr protein joined to
the C-terminus of the Abl tyrosine protein kinase; in consequence, the Abl kinase
domain becomes inappropriately active, driving excessive proliferation of a clone of
hemopoietic cells in the bone marrow.

Figure 20-43 How imatinib (Gleevec) blocks the activity of Bcr-Abl protein and halts chronic myelogenous leukemia.
(A) Imatinib sits in the ATP-binding pocket of the tyrosine kinase domain of Bcr-Abl and thereby prevents Bcr-Abl from
transferring a phosphate group from ATP onto a tyrosine residue in a substrate protein. This blocks transmission of a signal for
cell proliferation and survival. (B) The structure of the complex of imatinib (solid blue object) with the tyrosine kinase domain of
the Abl protein (ribbon diagram), as determined by x-ray crystallography. (C) The chemical structure of the drug. It can be given
by mouth; it has side effects, but they are usually quite tolerable. (B, from T. Schindler et al., Science 289:1938-1942, 2000.
With permission from AAAS.)


Results are not so good for those patients who have already progressed to the
more acute phase of myeloid leukemia, known as blast crisis, where genetic
instability has set in and the march of the disease is far more rapid. These patients show
a response at first and then relapse because the cancer cells develop a resistance
to imatinib. This resistance is usually associated with secondary mutations in the
part of the Bcr-Abl gene that encodes the kinase domain, disrupting the ability
of imatinib to bind to Bcr-Abl kinase. Second-generation inhibitors that function
effectively against a whole range of imatinib-resistant mutants have now been
developed. By combining one or more of these new inhibitors with imatinib as
the initial therapy (see below), it seems that CML, at least in the chronic (early)
stage, may be on its way to becoming a curable disease.

Despite the complications with resistance, the extraordinary success of
imatinib is enough to drive home an important principle: once we understand
precisely what genetic lesions have occurred in a cancer, we can begin to design
effective rational methods to treat it. This success story has fueled efforts to identify
small-molecule inhibitors for other oncogenic protein kinases and to use them
to attack the appropriate cancer cells. Increasing numbers are being developed.
These include molecules that target the EGF receptor and are currently approved
for the treatment of some lung cancers, as well as drugs that specifically target the
B-Raf oncoprotein in melanomas.

Protein kinases have been relatively easy to inhibit with small molecules
like imatinib, and many kinase inhibitors are being produced by
pharmaceutical companies in the hope that they can be effective as drugs for some forms of
cancer. Many cancers lack an oncogenic mutation in a protein kinase. But most
tumors contain inappropriately activated signaling pathways, for which a target

Figure 20-44 Some anticancer drugs and drug targets in the Ras-MAP-kinase
signaling pathway. Each of the signaling proteins in this diagram has been identified
as a product of a cancer-critical gene, with the exception of Raf1 and Erk. This
Ras-MAP-kinase signaling pathway is triggered by a variety of receptor tyrosine kinases
(RTKs), including the EGF receptor (see Figures 15-47 and 15-49). Those drugs
that are antibodies end in “mab,” while those that are small molecules end in “nib.”
(Adapted from B. Vogelstein et al., Science 339:1546-1558, 2013.)


somewhere in the pathway can hopefully be found (Movie 20.7). As an example,
Figure 20-44 displays some of the anticancer drugs and drug targets that are
currently being tested for a pathway frequently activated in cancers.

Many Cancers May Be Treatable by Enhancing the Immune
Response Against the Specific Tumor

Cancers have complex interactions with the immune system, and its various
components may sometimes help as well as hinder tumor progression. But for more
than a century it has been a dream of cancer researchers to somehow harness
the immune system in a controlled and efficient way to exterminate cancer cells,
just as it exterminates infectious organisms. There are finally signs that this dream
may one day be realized, at least for some forms of cancer.

The simplest type of immunological therapy, conceptually at least, is to inject 
the patient with antibodies that target the cancer cells. This approach has had 
some successes. About 25% of breast cancers, for example, express unusually high 
levels of the Her2 protein, a receptor tyrosine kinase related to the EGF receptor 
that plays a part in the normal development of mammary epithelium. A
monoclonal antibody called trastuzumab (trade name Herceptin®) that binds to Her2 and
inhibits its function slows the growth of breast tumors in humans that overexpress 
Her2, and it is now an approved therapy for these cancers (see Figure 20-44). A 
related approach uses antibodies to deliver poisons to the cancer cells. Antibodies 
against proteins that are abundant on the surface of a particular type of cancer cell 
but rare on normal cells can be armed with a toxin that kills those cells that bind 
the antibody molecule.

A great deal of current excitement centers around a different type of approach,
based on the relatively recent recognition that the microenvironment in a tumor is
highly immunosuppressive. As a result, the cancer victim's immune system is
prevented from destroying the tumor cells. Recall that, from the thousands of genome
sequences thus far determined, we know that a typical cancer cell will contain on
the order of 50 proteins with a mutation that alters an amino acid sequence, most
of these being “passenger” mutations, as previously explained (see p. 1104). Many
of these mutant proteins will be recognized by the patient's immune system as
foreign, but, to allow the cancer cells to survive throughout the course of tumor
progression, the cancer cells have evolved a set of anti-immune defenses. These
defenses include the expression on the cancer cell surface of one or more proteins
that bind to inhibitory receptors on activated T cells.

The normal immune system is subject to complex controls that keep its activity
within safe bounds and prevent autoimmunity from developing. The inhibitory
receptors that are expressed on the surface of activated T cells have an important
normal function: they control the immune response by down-regulating the T
cell response under appropriate circumstances. But in the context of a tumor, the
down-regulation is inappropriate, because it prevents the organism from killing
the cancer cells that are threatening its survival.

In its attack on infectious organisms, the natural immune system usually
eliminates every last trace of infection and maintains this immunity in the long term.
The challenge is to find ways of recruiting the immune system to attack cancers
with similar efficiency and specificity, hunting the cancer cells down by virtue of
the tumor-specific antigens that they express. With this aim, a new type of
anticancer therapy focuses on overcoming the immunosuppressive environment in
a tumor through the use of specific antibodies that prevent the tumor cells from
engaging with the inhibitory receptors on T cells. As illustrated in Figure 20-45A,
blocking the action of the immune suppressors with such treatments should
unleash an immune attack on the cancer cells. Importantly, multiple antigens
are recognized as foreign; thus, the cancer cells cannot escape through the
mutational loss of a single antigen, making it difficult for the tumor to escape from the
T cell attack.

This is a potentially dangerous strategy. If one provokes the immune system to
recognize the cancer cells as targets for destruction, there is a risk of autoimmune
side effects with dire consequences for normal tissues of the body, since the
cancer cells and the normal cells are close cousins and share most of their molecular
features. Nevertheless, several recent successes seem to hold great promise for
the future.

One of the many molecules involved in keeping the activity of the normal
immune system within safe bounds is a protein called CTLA4 (cytotoxic
T-lymphocyte-associated protein 4), which functions as an inhibitory receptor on the
surface of T cells. If the function of CTLA4 is blocked, the T cells become more
reactive and may mount an attack on cells that they would otherwise leave
in peace. In particular, the T cells may attack tumor cells that are recognizably
abnormal but whose presence was previously tolerated. With this in mind, cancer
immunologists developed a monoclonal antibody, called ipilimumab, that binds
to CTLA4 and blocks its action. Injected repeatedly into patients with metastatic
melanoma, this antibody increases their median lifespan by several months and,
in one large trial, enabled as many as a quarter of them to survive for five years

Figure 20-45 Therapies designed to remove the immunosuppressive
microenvironment in tumors. (A) The cells in tumors will produce many mutant
proteins. As described in Chapter 24, peptides from these proteins will be
displayed on MHC complexes on the tumor-cell surface and would normally
activate a T cell response that destroys the tumor (see Figure 24-42). However, as
schematically illustrated, during the course of tumor progression, the cancer cells have
evolved immunosuppressive mechanisms that protect them from such killing. (B) The
cells in tumors often protect themselves from immune attack by expressing
proteins on their surface that bind to and thereby activate the inhibitory receptors
on T cells. As indicated, this makes the tumor susceptible to specific antibody
therapies. In this diagram, two such inhibitory receptors are shown, PD1 and a
hypothetical protein X. Different tumors are thought to protect themselves by activating
different members of a large set of T cell inhibitory receptors, some of which are not
yet well characterized.

or more, far beyond expectations for comparable patients without this
treatment. Even more promising are recent clinical trials using a combination of two
antibodies, one against CTLA4 and the other against PD1, a second cell-surface
receptor on T cells that normally restrains their activity.

In clinical trials using such techniques, a substantial fraction of the patients
can respond in a dramatic way, with their cancer being driven into remission
for years, while the treatment fails to help others with the same type of cancer.
One possible explanation is that, while most tumors express proteins that protect
them from T-cell attack, these proteins are different for different tumors. Thus,
while some tumors will respond dramatically when treated with an antibody that
blocks a particular immunosuppressive agent, many others will not. If true, one
can foresee an era of personalized immunotherapy, in which each patient's tumor
is molecularly analyzed to determine its particular mechanisms of
immunosuppression. The patient would then be treated with a specific cocktail of antibodies
designed to remove these blocks (see Figure 20-45).

Cancers Evolve Resistance to Therapies

High hopes have to be tempered with sobering realities. We have seen that genetic
instability can provide an Achilles heel that cancer therapies can exploit, but at the
same time it can make eradicating the disease more difficult by allowing the cancer
cells to evolve resistance to therapeutic drugs, often at an alarming rate. This
applies even to the drugs that target genetic instability itself. Thus, PARP inhibitors
give valuable remission of illness, but in the long term the disease generally comes
back. For example, Brca-deficient cancers can sometimes develop resistance to
PARP inhibitors by undergoing a second mutation in an affected Brca gene that
restores its function. By then, the cancer is already out of control and it may be too
late to affect the course of the disease with additional treatments.

There are many different strategies by which cancers can evolve resistance to
anticancer drugs. Often, a cancer will be dramatically reduced in size by an initial
drug treatment, with all of the detectable tumor cells seeming to disappear. But
months or years later the cancer will reappear in an altered form that is resistant
to the drug that was at first so successful. In such cases, the initial drug treatment
has evidently failed to destroy some tiny fraction of cells in the original tumor-cell
population. These cells have escaped death because they carry a protective
mutation or epigenetic change, or perhaps simply because they were lurking in 
a protected environment. They eventually regenerate the cancer by continuing to 
proliferate, mutating and evolving still further as they do so. 

In some cases, cells that are exposed to one anticancer drug evolve a resistance
not only to that drug but also to other drugs to which they have never been
exposed. This phenomenon of multidrug resistance frequently correlates with
amplification of a part of the genome that contains a gene called Mdr1 or Abcb1.
This gene encodes a plasma-membrane-bound transport ATPase of the ABC
transporter superfamily (discussed in Chapter 11), which pumps lipophilic drugs
out of the cell (see Movie 11.5). The overproduction of this protein (or some of its
other family members) by a cancer cell can prevent the intracellular accumulation
of many cytotoxic drugs, making the cell insensitive to them.

In the to-and-fro struggle between advanced metastatic cancer and the therapist,
as current practice stands, the cancer usually wins in the end. Does it have
to be so? As we discuss below, there is reason to think that by attacking a cancer
with many weapons at once—instead of using them one after another, each until 
it fails—it may be possible to do much better. 

Combination Therapies May Succeed Where Treatments with
One Drug at a Time Fail 

Nowadays, cancers caught at an early stage can often be cured, by surgery, radiation,
or drugs. For most cancers that have progressed and metastasized widely,
however, cure is still beyond us. Treatments such as those described above can
give valuable remissions, but sooner or later these are typically followed by relapse.

Figure 20-46 Why multidrug treatments can be more effective than sequential
treatments for cancer therapy. (A) Because tumor cells are hypermutable,
two single-drug treatments that are given sequentially often allow for the selection of
mutant cell clones that are resistant to both drugs. (B) Simultaneous treatment with
both drugs can be more effective.

Nevertheless, for some relatively rare forms of advanced cancer, curative therapies
have been developed. These generally involve a cocktail of several different
anticancer agents: by trial and error, certain combinations of cytotoxic drugs have
been found to wipe out the cancer completely. Discovering such combinations
has hitherto involved a long, hard search. But now, armed with our new tools for
identifying the specific genetic lesions that cancer cells contain, the prospects are
better.

The logic of combination therapies is the same as that behind the current treatment
of HIV-AIDS with a cocktail of three different antiretroviral drugs: whereas
there may always be some cells in the initial population carrying the rare mutations
that confer resistance to any one drug treatment, there should be no cell
carrying the whole set of rare mutations that would confer resistance to several
different drugs delivered simultaneously. In contrast, sequential drug treatments
will allow the few cells resistant to the first drug to multiply to large numbers.
Within this large population of cells resistant to the first drug, a small number
of cells are likely to have arisen that are resistant to the next drug also; and so on
(Figure 20-46).
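
The arithmetic behind this argument can be made concrete with a rough sketch. The tumor size (10^9 cells) and the frequency of cells resistant to any one drug (10^-6) are illustrative assumptions, not values given in the text, and resistance to each drug is assumed to arise independently. The function name is likewise hypothetical.

```python
def expected_resistant(n_cells: float, p_resist: float, n_drugs: int) -> float:
    """Expected number of cells already resistant to all n_drugs drugs,
    assuming each resistance arises independently at frequency p_resist."""
    return n_cells * p_resist ** n_drugs


N_CELLS = 1e9     # assumed tumor size (illustrative, not from the text)
P_RESIST = 1e-6   # assumed frequency of cells resistant to one drug (illustrative)

# Simultaneous treatment: a surviving cell must already resist both drugs.
simultaneous = expected_resistant(N_CELLS, P_RESIST, 2)

# Sequential treatment: the few survivors of drug 1 regrow to the original
# tumor size, so every cell now resists drug 1 and only resistance to
# drug 2 is still rare in the regrown population.
regrown_population = N_CELLS
sequential = expected_resistant(regrown_population, P_RESIST, 1)

print(f"simultaneous: {simultaneous:.0e} expected doubly resistant cells")
print(f"sequential:   {sequential:.0f} expected doubly resistant cells")
```

Under these assumptions, a tumor treated with both drugs at once is very unlikely to contain even one doubly resistant cell, whereas the sequential schedule is expected to leave on the order of a thousand.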

We Now Have the Tools to Devise Combination Therapies
Tailored to the Individual Patient 

Efficient, rational combination drug therapy requires three things. First, we have
to identify multiple peculiarities of cancer cells that make them vulnerable in ways
that normal cells are not. Second, we have to devise drugs (or other treatments)
that target each of these vulnerabilities. Third, we have to match the combination
of drugs to the specific set of peculiarities present in the cancer cells of the individual
patient.

The first requirement is already partially met: we now have large catalogs of
cancer-critical genes that are commonly mutated in cancer cells. The second
requirement is harder, but attainable: we have described some remarkable recent
successes, and for cancer researchers there is excitement in the air. It is becoming
increasingly possible to use our growing knowledge of cell and molecular biology
to design new drugs against designated targets. At the same time, efficient,
high-throughput automated methods are available to screen large libraries of
chemicals for any that may be effective against cells with a given cancer-related
defect. In such searches, the goal is synthetic lethality: a cell death that occurs
when and only when a particular drug is put together with a particular cancer
cell abnormality. Through these and other approaches, the repertoire of precisely
targeted anticancer drugs is rapidly increasing.
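
The selection criterion in such a screen can be sketched in a few lines. Everything here is hypothetical: the compound names, the viability numbers, and the two thresholds are invented for illustration, and a real screen would involve replicates, dose series, and statistics.

```python
from typing import NamedTuple


class ScreenResult(NamedTuple):
    compound: str
    viability_normal: float  # fraction of normal cells surviving treatment
    viability_defect: float  # fraction of cells with the cancer-related defect surviving


def synthetic_lethal_hits(results, kill_below=0.2, spare_above=0.8):
    """Keep compounds that kill cells carrying the defect while sparing normal
    cells. Both thresholds are arbitrary illustrative cutoffs."""
    return [
        r.compound
        for r in results
        if r.viability_defect < kill_below and r.viability_normal > spare_above
    ]


# Hypothetical screening data, not taken from any real experiment.
results = [
    ScreenResult("compound_A", 0.95, 0.05),  # kills only cells with the defect
    ScreenResult("compound_B", 0.10, 0.08),  # kills both: generally toxic
    ScreenResult("compound_C", 0.97, 0.93),  # kills neither: inactive
]

hits = synthetic_lethal_hits(results)
print(hits)
```

Only the first compound satisfies the "when and only when" condition: it is lethal in combination with the defect and harmless without it.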

This brings us to the third requirement: the therapy (the choice of drugs to
be given in combination) must be tailored to the individual patient. Here, too,
the prospects are bright. Cancers evolve by a fundamentally random process,
and each patient is different; but modern methods of genome analysis now let us
characterize the cells from a tumor biopsy in exhaustive detail so as to discover
which cancer-critical genes are affected in a particular case. Admittedly, this is
not straightforward: the tumor cells in an individual patient are heterogeneous
and do not all contain the same genetic lesions. With increased understanding of
the pathways of cancer evolution, however, and with the experience gained from
many different cases, it should become possible to make good guesses at the optimal
therapies to use.

From the perspective of the patient, the pace of advance in cancer research 
can seem frustratingly slow. Each new drug has to be tested in the clinic, first for 
safety and then for efficacy, before it can be released for general use. And if the 
drug is to be used in combination with others, the combination therapy must then 
go through the same long process. Strict ethical rules constrain the conduct of 
trials, which means that they take time, typically several years. But slow and cautious
steps, taken systematically in the right direction, can lead to great advances.
There is still far to go, but the examples that we have discussed provide proof of
principle and grounds for optimism. 

From the cancer research effort, we have learned a great deal of what we know
about the molecular biology of the normal cell. Now, more and more, we are discovering
how to put that knowledge to use in the battle with cancer itself.


WHAT WE DON'T KNOW


• What is required to enable a cancer 
cell to metastasize? 

• How can the molecular analysis of 
an individual tumor be more effectively 
used to design effective therapies to
kill it? 

• Can we identify general features
common to all cancer cells (such as
their production of misfolded, mutated
proteins) that can be used for the
targeted destruction of many different
types of cancers?

• Can sensitive and reliable blood tests 
be devised to detect cancers very 
early, before they have grown to a size
where treatment with a single drug will
generally be defeated by the survival of
a preexisting resistant variant?


Summary 

Our growing understanding of the cell biology of cancers has already begun to lead
to better ways of preventing, diagnosing, and treating these diseases. Anticancer
therapies can be designed to destroy cancer cells preferentially by exploiting the
properties that distinguish cancer cells from normal cells, including the cancer cells'
dependence on oncogenic proteins and the defects they harbor in their DNA repair
mechanisms. We now have good evidence that, by increasing our understanding
of normal cell control mechanisms and exactly how they are subverted in specific
cancers, we can eventually devise drugs to kill cancers precisely by attacking specific
molecules critical for the growth and survival of the cancer cells. In addition, great
progress has recently been made through sophisticated immunological approaches
to cancer therapy. And, as we become better able to determine which genes are
altered in the cells of any given tumor, we can begin to tailor treatments more accurately
to each individual patient.


• How can the observed 
environmental effects on cancer rates 
be exploited to reduce avoidable 
cancers? 

• Can new technologies be devised 
to reveal exactly how a quiescent 
micrometastasis converts to a full- 
blown metastatic tumor? 


PROBLEMS 

Which statements are true? Explain why or why not. 

20-1 The chemical carcinogen dimethylbenz[a]anthracene
(DMBA) must be an extraordinarily specific mutagen
since 90% of the skin tumors it causes have an A-to-T alteration
at exactly the same site in the mutant Ras gene.

20-2 In the cellular regulatory pathways that control 
cell growth and proliferation, the products of oncogenes
are stimulatory components and the products of tumor 
suppressor genes are inhibitory components. 

20-3 Cancer therapies directed solely at killing the rapidly
dividing cells that make up the bulk of a tumor are
unlikely to eliminate the cancer from many patients. 


20-4 The main environmental causes of cancer are the 
products of our highly industrialized way of life such as 
pollution and food additives. 

Discuss the following problems. 

20-5 In contrast to colon cancer, whose incidence
increases dramatically with age, incidence of osteosarcoma
(a tumor that occurs most commonly in the long
bones) peaks during adolescence. Osteosarcomas are relatively
rare in young children (up to age 9) and in adults
(over 20). Why do you suppose that the incidence of osteosarcoma
does not show the same sort of age-dependence
as colon cancer?

Figure Q20-1 Cumulative risk of lung cancer mortality
for nonsmokers, smokers, and former smokers (Problem
20-6). Cumulative risk is the running total of deaths, as a
percentage, for each group. Thus, for continuing smokers,
1% died of lung cancer between ages 45 and 55; an
additional 4% died between 55 and 65 (giving a cumulative
risk of 5%); and 11% more died between 65 and 75 (for a
cumulative risk of 16%).
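
The running total described in the caption can be reproduced as a short sketch. The per-interval percentages are the ones quoted above for continuing smokers; the variable names are only illustrative.

```python
from itertools import accumulate

# Percentage of continuing smokers who died of lung cancer in each age
# interval, as quoted in the caption of Figure Q20-1.
deaths_per_interval = {"45-55": 1, "55-65": 4, "65-75": 11}

# Cumulative risk is the running total of the per-interval percentages.
cumulative_risk = dict(
    zip(deaths_per_interval, accumulate(deaths_per_interval.values()))
)

for interval, risk in cumulative_risk.items():
    print(f"by the end of ages {interval}: {risk}% cumulative risk")
```

The totals after each interval are 1%, 5%, and 16%, matching the figures given in the caption.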

20-8 Virtually all cancer treatments are designed to kill
cancer cells, usually by inducing apoptosis. However, one
particular cancer, acute promyelocytic leukemia (APL),
has been successfully treated with all-trans-retinoic acid,
which causes the promyelocytes to differentiate into neutrophils.
How might a change in the state of differentiation
of APL cancer cells help the patient?

20-9 One major goal of modern cancer therapy is to
identify small molecules (anticancer drugs) that can
be used to inhibit the products of specific cancer-critical
genes. If you were searching for such molecules, would
you design inhibitors for the products of oncogenes or 
the products of tumor suppressor genes? Explain why you 
would (or would not) select each type of gene. 


20-6 Mortality due to lung cancer was followed in 
groups of males in the United Kingdom for 50 years. Figure 
Q20-1 shows the cumulative risk of dying from lung cancer
as a function of age and smoking habits for four groups
of males: those who never smoked, those who stopped at
age 30, those who stopped at age 50, and those who continued
to smoke. These data show clearly that individuals can
substantially reduce their cumulative risk of dying from 
lung cancer by stopping smoking. What do you suppose is 
the biological basis for this observation? 

20-7 A small fraction (2 to 3%) of all cancers, across
many subtypes, displays a quite remarkable phenomenon:
tens to hundreds of rearrangements that primarily
involve a single chromosome, or chromosomal region.
The breakpoints can be tightly clustered, with several in a
few kilobases; the junctions of the rearrangements often
involve segments of DNA that were not originally close
together on the chromosome. The copy number of various
segments within the rearranged chromosome was found
to be either zero, indicating deletion, or one, indicating
retention.

You can imagine two ways in which such multiple,
localized rearrangements might happen: a progressive
rearrangements model with ongoing inversions, deletions,
and duplications involving a localized area, or a catastrophe
model in which the chromosome is shattered into
fragments that are stitched back together in random order
by nonhomologous end joining (Figure Q20-2).

A. Which of the two models in Figure Q20-2 accounts 
more readily for the features of these highly rearranged 
chromosomes? Explain your reasoning. 

B. For whichever model you choose, suggest how 
such multiple rearrangements might arise. (The true 
mechanism is not known.) 

C. Do you suppose such rearrangements are likely
to be causative events in the cancers in which they are
found, or are they probably just passenger events that are
unrelated to the cancer? If you think they could be driver
events, suggest how such rearrangements might activate
an oncogene or inactivate a tumor suppressor gene.

Figure Q20-2 Two models to explain the multiple, localized
chromosome rearrangements found in some cancers (Problem
20-7). The progressive rearrangements model shows a sequence
of rearrangements that disrupts the chromosome, generating
increasingly complex chromosomal configurations. The chromosome
catastrophe model shows the chromosome being fragmented and
then reassembled randomly, with some pieces left out.

20-10 PolyADP-ribose polymerase (PARP) plays a key
role in the repair of DNA single-strand breaks. In the presence
of the PARP inhibitor olaparib, single-strand breaks
accumulate. When a replication fork encounters a single-strand
break, it converts it to a double-strand break,
which in normal cells is then repaired by homologous
recombination. In cells defective for homologous recombination,
however, inhibition of PARP triggers cell death.

Patients who have only one functional copy of the
Brca1 gene, which is required for homologous recombination,
are at much higher risk for cancer of the breast and
ovary. Cancers that arise in these tissues in these patients
can be treated successfully with olaparib. Explain how it is
that treatment with olaparib kills the cancer cells in these
patients, but does not harm their normal cells.

20-11 The Tasmanian devil, a carnivorous Australian
marsupial, is threatened with extinction by the spread
of a fatal disease in which a malignant oral-facial tumor
interferes with the animal's ability to feed. You have been
called in to analyze the source of this unusual cancer. It
seems clear to you that the cancer is somehow spread from
devil to devil, very likely by their frequent fighting, which
is accompanied by biting around the face and mouth. To
uncover the source of the cancer, you isolate tumors from
11 devils captured in widely separated regions and examine
them. As might be expected, the karyotypes of the
tumor cells are highly rearranged relative to that of the
wild-type devil (Figure Q20-3). Surprisingly, you find that
the karyotypes from all 11 tumor samples are very similar.
Moreover, one of the Tasmanian devils has an inversion on
chromosome 5 that is not present in its facial tumor. How
do you suppose this cancer is transmitted from devil to
devil? Is it likely to arise as a consequence of an infection
by a virus or microorganism? Explain your reasoning.

Figure Q20-3 Karyotypes of cells from Tasmanian devils (Problem
20-11). (A) A Tasmanian devil. (B) Normal karyotype for a male
Tasmanian devil. The karyotype has 14 chromosomes, including XY.
(C) Karyotype of cancer cells found in each of the 11 facial tumors
studied. The karyotype has 13 chromosomes, no sex chromosomes,
no chromosome 2 pair, one chromosome 6, two chromosomes 1 with
deleted long arms, and four highly rearranged marker chromosomes
(M1-M4). (A, reproduced courtesy of Museum Victoria; B and C, from
A.M. Pearse and K. Swift, Nature 439:549, 2006. With permission from
Macmillan Publishers Ltd.)

Development of Multicellular Organisms


An animal or plant starts its life as a single cell—a fertilized egg, or zygote. During 
development, this cell divides repeatedly to produce many different kinds of cells, 
arranged in a final pattern of spectacular complexity and precision. The goal of
developmental cell biology is to understand the cellular and molecular mechanisms
that direct this amazing transformation (Movie 21.1).

Plants and animals have very different ways of life, and they use different
developmental strategies; in this chapter, we focus mainly on animals. Four processes
are fundamental to animal development: (1) cell proliferation, which produces
many cells from one; (2) cell-cell interactions, which coordinate the behavior
of each cell with that of its neighbors; (3) cell specialization, or differentiation,
which creates cells with different characteristics at different positions; and (4) cell
movement, which rearranges the cells to form structured tissues and organs (Figure
21-1). It is on the fourth point that plant development differs radically: plant
cells are unable to migrate or move independently through the embryo because
each one is contained within a cell wall, through which it is cemented to its neighbors,
as discussed in Chapter 19.

In a developing animal embryo, the four fundamental processes are happening
in a kaleidoscopic variety of ways, as they give rise to different parts of the
organism. Like the members of an orchestra, the cells in the embryo have to play
their individual parts in a highly coordinated manner. In the embryo, however,
there is no conductor, no central authority, to direct the performance. Instead,
development is a self-assembly process in which the cells, as they grow and proliferate,
organize themselves into increasingly complex structures. Each of the millions
of cells has to choose for itself how to behave, selectively utilizing the genetic
instructions in its chromosomes.

At each stage in its development, the cell is presented with a limited set of
options, so that its developmental pathway branches repeatedly, reflecting a large
set of sequential choices. Like the decisions we make in our own lives, the choices
made by the cell are based on its internal state (which largely reflects its history)
and on current influences from other cells, especially its close neighbors.
To understand development, we need to know how each choice is controlled and
how it depends on previous choices. Beyond that, we need to understand how
the choices, once made, influence the cell's chemistry and behavior, and how cell
behaviors act synergistically to determine the structure and function of the body.

Figure 21-1 The four essential cell processes that allow a multicellular organism to be made.

As cells become specialized they change not only their chemistry but also their
shape and their attachments to other cells and to the extracellular matrix. They
move and rearrange themselves to create the complex architecture of the body,
with all its tissues and organs, each structured precisely and defined in size. To
understand this process of form generation, or morphogenesis, we will need to
take account of the mechanical, as well as the biochemical, interactions between
the cells.

At first glance, one would no more expect the worm, the flea, the eagle, and
the giant squid all to be generated by the same developmental mechanisms than
one would suppose that the same methods were used to make a shoe and an airplane.
Remarkably, however, research in the past 30 years has revealed that much
of the basic machinery of development is essentially the same in all animals: not
just in all vertebrates, but in all the major phyla of invertebrates too. Recognizably
similar, evolutionarily related molecules define the specialized animal cell types,
mark the differences between body regions, and help create the animal body pattern.
Homologous proteins are often functionally interchangeable between very
different species. Thus, a human protein produced artificially in a fly, for example,
can perform the same function as the fly's own version of that protein (Figure
21-2). Thanks to an underlying unity of mechanisms, developmental biologists
have been making great strides toward a coherent understanding of animal
development.

We begin this chapter with an overview of some of the basic mechanisms that 
operate in animal development. We then discuss, in sequence, how cells in the 
embryo diversify to form patterns in space, how the timing of developmental 
events is controlled, how cell movements contribute to morphogenesis, and how 
the size of an animal is regulated. We end by considering the most challenging
aspect of development: the mechanisms that enable a highly complex nervous
system to form.

Figure 21-2 Homologous proteins can function interchangeably. (A-C) The
Eyeless protein (also called Pax6) controls eye development in Drosophila and, when
misexpressed during development, can cause an eye to form in an abnormal
site, such as a wing (B) or a leg (C). The scanning electron micrographs show
a patch of eye tissue on the leg of a fly resulting from misexpression of Drosophila
Eyeless (E) and of squid Pax6 (F). The homologous protein from a human or
practically any animal possessing eyes, when similarly misexpressed in a transgenic
fly, has the same effect. The entire eye of a normal Drosophila is shown for comparison
in (A) and (D). (B-C, courtesy of Georg Halder; D-F, from S. I. Tomarev, et al. Proc.
Natl Acad. Sci. USA 94:2421-2426, 1997. With permission from National Academy of
Sciences.)

Animals live by eating other organisms. Thus, despite their remarkable diversity,
animals as different as worms, mollusks, insects, and vertebrates share anatomical
features that are fundamental to this way of life. Epidermal cells form a protective
outer layer; gut cells absorb nutrients from ingested food; muscle cells
allow movement; and neurons and sensory cells control behavior. These diverse
cell types are organized into tissues and organs, forming a sheet of skin covering
the exterior, a mouth for feeding, and an internal gut tube to digest food, with
muscles, nerves, and other tissues arranged in the space between the skin and
the gut tube. Many animals have clearly defined axes: an anteroposterior axis,
with mouth and brain anterior and anus posterior; a dorsoventral axis, with back
dorsal and belly ventral; and a left-right axis. In this section, we discuss some fundamental
mechanisms underlying animal development, beginning with how the
basic animal body plan is established.


Conserved Mechanisms Establish the Basic Animal Body Plan 


The shared anatomical features of animals develop through conserved mechanisms.
After fertilization, the zygote usually divides rapidly, or cleaves, to form
many smaller cells; during this cleavage, the embryo, which cannot yet feed, does
not grow. This phase of development is initially driven and controlled entirely by
the material deposited in the egg by the mother. The embryonic genome remains
inactive until a point is reached when maternal mRNAs and proteins rather
abruptly begin to be degraded. The embryo's genome is activated, and the cells
cohere to form a blastula, typically a solid or a hollow fluid-filled ball of cells.
Complex cell rearrangements called gastrulation (from the Greek "gaster," meaning
"belly") then transform the blastula into a multilayered structure containing
a rudimentary internal gut (Figure 21-3). Some cells of the blastula remain external,
constituting the ectoderm, which will give rise to the epidermis and the nervous
system; other cells invaginate, forming the endoderm, which will give rise to
the gut tube and its appendages, such as lung, pancreas, and liver. Another group
of cells moves into the space between ectoderm and endoderm and forms the
mesoderm, which will give rise to muscles, connective tissues, blood, kidney, and
various other components. Further cell movements and accompanying cell differentiations
create and refine the embryo's architecture.

Figure 21-3 The early stages of
development, as exemplified by a frog.

(A) A fertilized egg divides to produce a
blastula, a sheet of epithelial cells often
surrounding a cavity. During gastrulation,
some of the cells tuck into the interior to
form the mesoderm (green) and endoderm
(yellow). Ectodermal cells (blue) remain on
the outside. (B) A cross section through
the trunk of an amphibian embryo shows
the basic animal body plan, with a sheet
of ectoderm on the outside, a tube of
endoderm on the inside, and mesoderm
sandwiched between them. The endoderm
forms the epithelial lining of the gut, from
the mouth to the anus. It gives rise not only
to the pharynx, esophagus, stomach, and
intestines, but also to many associated
structures. The salivary glands, liver,
pancreas, trachea, and lungs, for example,
all develop from the wall of the digestive
tract and grow to become systems of
branching tubes that open into the gut or
pharynx. The endoderm forms only the
epithelial components of these structures
(the lining of the gut and the secretory
cells of the pancreas, for example). The
supporting muscular and fibrous elements
arise from the mesoderm.

The mesoderm gives rise to the
connective tissues: at first, to the loose
mesh of cells in the embryo known as
mesenchyme, and ultimately to cartilage,
bone, and fibrous tissue, including the
dermis (the inner layer of the skin). The
mesoderm also forms the muscles, the
entire vascular system (including the heart,
blood vessels, and blood cells) and the
tubules, ducts, and supporting tissues of
the kidneys and gonads. The notochord
forms from the mesoderm and serves
as the core of the future backbone and
the source of signals that coordinate the
development of surrounding tissues.

The ectoderm will form the epidermis
(the outer, epithelial layer of the skin) and
epidermal appendages such as hair,
sweat glands, and mammary glands. It will
also give rise to the whole of the nervous
system, central and peripheral, including
not only neurons and glia but also the
sensory cells of the nose, the ear, the eye,
and other sense organs. (B, after T. Mohun
et al., Cell 22:9-15, 1980. With permission
from Elsevier.)

The ectoderm, mesoderm, and endoderm formed during gastrulation constitute
the three germ layers of the early embryo. Many later developmental transformations
will produce the elaborately structured organs. But the basic body
plan and axes set up in miniature during gastrulation are preserved into adult life,
when the organism may be billions of times larger (Movie 21.2).

The Developmental Potential of Cells Becomes Progressively 
Restricted 

Concomitant with the refinement of the body plan, the individual cells become 
more and more restricted in their developmental potential. During the blastula 
stages, cells are often totipotent or pluripotent: they have the potential to give
rise to all or almost all of the cell types of the adult body. The pluripotency is lost
as gastrulation proceeds: a cell located in the endodermal germ layer, for example,
can give rise to the cell types that will line the gut or form gut-derived organs
such as the liver or pancreas, but it no longer has the potential to form mesoderm-derived
structures such as skeleton, heart, or kidney. Such a cell is said
to be determined for an endodermal fate. Thus, cell determination starts early
and progressively narrows the options as the cell steps through a programmed
series of intermediate states, guided at each step by its genome, its history, and
its interactions with neighbors. The process reaches its limit when a cell undergoes
terminal differentiation to form one of the highly specialized cell types of
the adult body (Figure 21-4). Although there are cell types in the adult that retain
some degree of pluripotency, their range of options is generally narrow (discussed 
in Chapter 22). 
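
The progressive narrowing of options can be pictured as a walk down a tree of cell states. The sketch below is a deliberately tiny, hypothetical lineage map: the state names are drawn from the tissues mentioned in this section and from Figure 21-4, but the tree itself is a simplification made up for illustration, not a real fate map.

```python
# Hypothetical, highly simplified lineage map: each cell state lists the
# states it can give rise to. States with no entry are terminal fates.
LINEAGE = {
    "blastomere": ["ectoderm cell", "mesoderm cell", "endoderm cell"],
    "ectoderm cell": ["epidermal cell", "neuron"],
    "mesoderm cell": ["skeleton cell", "heart cell", "kidney cell"],
    "endoderm cell": ["gut lining cell", "liver cell", "pancreatic bud cell"],
    "pancreatic bud cell": ["endocrine pancreas cell"],
    "endocrine pancreas cell": ["beta-islet cell"],
}


def terminal_fates(cell: str) -> set[str]:
    """Return every terminal cell type still reachable from the given state."""
    children = LINEAGE.get(cell, [])
    if not children:
        return {cell}  # already terminally differentiated
    fates: set[str] = set()
    for child in children:
        fates |= terminal_fates(child)
    return fates


# Follow one path through the map and watch the options shrink.
for state in ["blastomere", "endoderm cell", "pancreatic bud cell"]:
    print(f"{state}: {len(terminal_fates(state))} reachable fates")
```

In this toy map the blastomere can still reach every terminal type, the endoderm cell has lost all mesodermal and ectodermal fates, and the pancreatic bud cell has only one fate left.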

Cell Memory Underlies Cell Decision-Making 

Underlying the richness and astonishingly complex outcomes of development is 
cell memory (see p. 404). Both the genes a cell expresses and the way it behaves 
depend on the cell's past, as well as on its present circumstances. The cells of our
body (the muscle cells, the neurons, the skin cells, the gut cells, and so on) maintain
their specialized characters largely because they retain a record of the extracellular
signals their ancestors received during development, rather than because
they continually receive such instructions from their surroundings. Despite their
radically different phenotypes, they retain the same complete genome that was
present in the zygote; their differences arise instead from differential gene expression.
We have discussed the molecular mechanisms of gene regulation, cell memory,
cell division, cell signaling, and cell movement in previous chapters. In this
chapter, we shall see how these basic processes are collectively deployed to create 
an animal. 


blastomere → endoderm cell → pancreatic bud cell → endocrine pancreas cell → β-islet cell

Figure 21-4 The lineage from
blastomere to differentiated cell type.

As development proceeds, cells become
more and more specialized. Blastomeres
have the potential to give rise to most or all
cell types. Under the influence of signaling
molecules and gene regulatory factors,
cells acquire more restricted fates until
they differentiate into highly specialized cell
types, such as the pancreatic β-islet cells
that secrete the hormone insulin.


Several Model Organisms Have Been Crucial for Understanding
Development

The anatomical features that animals share have undergone many extreme modifications
in the course of evolution. As a result, the differences between species
are usually more striking to our human eye than the similarities. But at the level
of the underlying molecular mechanisms and the macromolecules that mediate
them, the reverse is true: the similarities among all animals are profound and
extensive. Through more than half a billion years of evolutionary divergence, all
animals have retained unmistakably similar sets of genes and proteins that are
responsible for generating their body plans and for forming their specialized cells
and organs.

This astonishing degree of evolutionary conservation was discovered not by
broad surveys of animal diversity, but through intensive study of a small number
of representative species: the model organisms discussed in Chapter 1. For
animal developmental biology, the most important have been the fly Drosophila
melanogaster, the frog Xenopus laevis, the roundworm Caenorhabditis elegans,
the mouse Mus musculus, and the zebrafish Danio rerio. In discussing the mechanisms
of development, we shall draw our examples mainly from these few species.

Genes Involved in Cell-Cell Communication and Transcriptional
Control Are Especially Important for Animal Development

What are the genes that animals share with one another but not with other kingdoms
of life? These would be expected to include genes required specifically for
animal development but not needed for unicellular existence. Comparison of
animal genomes with the genome of budding yeast, a unicellular eukaryote,
suggests that three classes of genes are especially important for multicellular
organization. The first class includes genes that encode proteins used for cell-cell
adhesion and cell signaling; hundreds of human genes encode signal proteins,
cell-surface receptors, cell adhesion proteins, or ion channels that are either not
present in yeast or present in much smaller numbers. The second class includes
genes encoding proteins that regulate transcription and chromatin structure:
more than 1000 human genes encode transcription regulators, but only about 250
yeast genes do so. As we shall see, the development of animals is dominated by
cell-cell interactions and by differential gene expression. The third class, that of
noncoding RNAs, has a more uncertain status: it includes genes that encode
microRNAs (miRNAs), of which there are at least 500 in humans. Along with the
regulatory proteins, they play a significant part in controlling gene expression during
animal development, but the full extent of their importance is still unclear. The
loss of individual miRNA genes in C. elegans, where their functions have been well
studied, rarely leads to obvious phenotypes, suggesting that the roles of miRNAs
during animal development are often subtle, serving to fine-tune the developmental
machinery rather than to form its core structures.

Regulatory DNA Seems Largely Responsible for the Differences
Between Animal Species

As discussed in Chapter 7, each gene in a multicellular organism is associated
with many thousands of nucleotides of noncoding DNA that contains regulatory
elements. These regulatory elements determine when, where, and how strongly
the gene is to be expressed, according to the transcription regulators and chromatin
structures that are present in the particular cell (Figure 21-5). Consequently,
a change in the regulatory DNA, even without any change in the coding DNA, can
alter the logic of the gene-regulatory network and change the outcome of
development.

As discussed in Chapter 4, when we compare the genomes of different animal
species, we find that evolution has altered the coding and regulatory DNA to
different extents. The coding DNA, for the most part, has been highly conserved,
the noncoding regulatory DNA much less so. It seems that changes in regulatory
DNA are largely responsible for the dramatic differences between one class of
animals and another (see p. 227). We can view the protein products of the coding
sequences as a conserved kit of common molecular parts, and the regulatory
DNA as instructions for assembly: with different instructions, the same kit of parts
can be used to make a whole variety of different body structures. We will return to
this important concept later.

Figure 21-5 Regulatory DNA defines the gene expression patterns in
development. The genome is the same in a muscle cell as in a skin cell, but different
genes are active because these cells express different transcription regulators
that bind to gene regulatory elements. For example, transcription regulators in skin
cells recognize a regulatory element in gene 1, leading to its activation, whereas
a different set of regulators is present in muscle cells, binding to and activating gene
3. Transcriptional regulators that activate the expression of gene 2 are present in
both cell types.

Small Numbers of Conserved Cell-Cell Signaling Pathways
Coordinate Spatial Patterning

Spatial patterning of a developing animal requires that cells become different
according to their positions in the embryo, which means that cells must respond
to extracellular signals produced by other cells, especially their neighbors. In what
is probably the commonest mode of spatial patterning, a group of cells starts out
with the same developmental potential, and a signal from cells outside the group
then induces one or more members of the group to change their character. This
process is called inductive signaling. Generally, the inductive signal is limited in
time and space so that only a subset of the cells capable of responding (the cells
close to the source of the signal) take on the induced character (Figure 21-6).
Some inductive signals depend on cell-cell contact; others act over a longer range
and are mediated by molecules that diffuse through the extracellular medium or
are transported in the bloodstream (see Figure 15-2).

Most of the known inductive events in animal development are governed by
a small number of highly conserved signaling pathways, including transforming
growth factor-β (TGFβ), Wnt, Hedgehog, Notch, and receptor tyrosine kinase
(RTK) pathways (discussed in Chapter 15). The discovery of the limited vocabulary
that developing cells use for intercellular communication has emerged over
the past 25 years as one of the great simplifying features of developmental biology.

Through Combinatorial Control and Cell Memory, Simple Signals
Can Generate Complex Patterns

But how can this small number of signaling pathways generate the huge diversity
of cells and patterns? Three kinds of mechanisms are responsible. First, through
gene duplication, the basic components of a pathway often come to be encoded
by small families of closely related homologous genes. This allows for diversity in
the operation of the pathway, according to which family member is employed in
a given situation. Notch signaling, for example, may be mediated by Notch1 in
one tissue, but by its homolog Notch4 in another. Second, the response of a cell
to a given signal protein depends on the other signals that the cell is receiving
concurrently (Figure 21-7A). As a result, different combinations of signals can
generate a large variety of different responses. Third, and most fundamental, the
effect of activating a signaling pathway depends on the previous experiences of
the responding cell: past influences leave a lasting mark, registered in the state
of the cell's chromatin and the selection of transcription regulatory proteins and
RNA molecules that the cell contains. This cell memory enables cells with different
histories to respond to the same signals differently (Figure 21-7B). Thus, the same
few signaling pathways can be used repeatedly at different times and places with
different outcomes, so as to generate patterns of unlimited complexity.

Figure 21-6 Inductive signaling. [Diagram labels: inductive signal; cells
directed to new developmental pathway.]

Figure 21-7 Two mechanisms for generating different responses to the
same inductive signal. (A) In combinatorial signaling, the effect of a signal
depends on the presence of other signals received at the same time. (B) Through
cell memory, previous signals (or other events) can leave a lasting trace that alters
the response to the current signal (see Figure 7-54). The memory trace is
represented here in the coloring of the cell nucleus.

Figure 21-8 Gradient formation and interpretation. A gradient forms by
localized production of an inducer (a morphogen) that diffuses away from
its source. Different concentrations of morphogen (or different durations of
exposure) induce different gene expression patterns and cell fates in responding
cells. Diffusive transport can generate gradients only over short distances, and
morphogens generally act over distances of 1 mm or less. [Scale bar: 0.1 mm.]

Morphogens Are Long-Range Inductive Signals That Exert Graded
Effects

Signal molecules often govern simple yes-no choices: one outcome when
their concentration is high, another when it is low. In many cases, however, the
responses are more finely graded: a high concentration of a signal molecule may,
for example, direct cells into one developmental pathway, an intermediate
concentration into another, and a low concentration into yet another.

One common way to generate such different concentrations of a signal molecule
is for the molecule to diffuse out from a localized signaling source, creating
a concentration gradient. Cells at different distances from the source are driven
to behave in a variety of different ways, according to the signal concentration that
they experience (Figure 21-8). A signal molecule that imposes a pattern on a
whole field of cells in this way is called a morphogen. In the simplest case, a
specialized group of cells produces a morphogen at a steady rate, and the morphogen
is then degraded as it diffuses away from this source. The speed of diffusion and
the half-life of the morphogen will together determine the range and steepness of
its resulting gradient (Figure 21-9).
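
The way diffusion speed and half-life together set the range of a gradient can be
made concrete with a rough calculation. The sketch below assumes the simplest
standard model, which this chapter does not spell out: diffusion along one axis
with first-order degradation, for which the steady-state concentration falls off as
exp(-x/lambda), with lambda = sqrt(D/k) and k = ln 2 / half-life. The function names
are hypothetical; the example values (D = 1 µm² sec⁻¹, half-life 170 minutes) are
those quoted for Figure 21-9A.

```python
import math

def decay_length_um(D_um2_per_s, half_life_min):
    """Characteristic length (in µm) of a steady-state gradient formed by
    diffusion plus first-order degradation: lambda = sqrt(D / k)."""
    k = math.log(2) / (half_life_min * 60.0)   # degradation rate, per second
    return math.sqrt(D_um2_per_s / k)

def relative_concentration(x_um, lam_um):
    """Steady-state concentration at distance x, relative to the source,
    in the assumed one-dimensional case: c(x)/c(0) = exp(-x / lambda)."""
    return math.exp(-x_um / lam_um)

lam = decay_length_um(1.0, 170.0)          # values quoted for Figure 21-9A
lam_fast = decay_length_um(3.0, 170.0)     # threefold faster diffusion
print(f"decay length: {lam:.0f} um; with 3x D: {lam_fast:.0f} um")
print(f"fraction left at 0.5 mm: {relative_concentration(500.0, lam):.3f}")
```

Under these assumptions the decay length is roughly 120 µm, and a threefold
increase in the diffusion constant lengthens it by a factor of √3. This fits the
observation that morphogens generally act over distances of 1 mm or less.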

This simple mechanism can be modified in various ways. Receptors on the
surface of cells along the way, for example, may trap the diffusing morphogen and
cause it to be endocytosed and degraded, shortening its effective half-life.
Alternatively, the morphogen may bind to molecules in the extracellular matrix such as
heparan sulfate proteoglycan (discussed in Chapter 19), thereby greatly reducing
its diffusion rate.

Lateral Inhibition Can Generate Patterns of Different Cell Types

Morphogen gradients, and other kinds of inductive signal, exploit an existing
asymmetry in the embryo to create further asymmetries and differences between
cells: already, at the outset, some cells are specialized to produce the morphogen
and thereby impose a pattern on another class of cells that are sensitive to it. But
what if there is no clear initial asymmetry? Can a regular pattern arise spontaneously
within a set of cells that are initially all alike?

The answer is yes. The fundamental principle underlying such de novo pattern
formation is positive feedback: cells can exchange signals in such a way that
any small initial discrepancy between cells at different sites becomes self-amplifying,
driving the cells toward different fates. This is most clearly illustrated in the
phenomenon of lateral inhibition, a form of cell-cell interaction that forces close
neighbors to become different and thereby generates fine-grained patterns of
different cell types.

Figure 21-9 Setting up a signal gradient by diffusion. [Panels (A), (B), and (C);
horizontal axis: distance from source (mm).] (A-C) Each graph shows six
successive stages in the buildup of the concentration of a signal molecule that is
produced at a steady rate at the origin, with production starting at time 0. In all
cases, the molecule undergoes degradation as it diffuses away from the source, and
the graphs are calculated on the assumption that diffusion is occurring along two
axes in space (for example, radially from a source in an epithelial sheet). (A) The
pattern of the morphogen assuming that the molecule has a half-life of 170 minutes,
and that it diffuses with an effective diffusion constant of D = 1 µm² sec⁻¹, typical
of a small protein molecule in extracellular tissues. Note that the gradient is already
close to its steady-state form within an hour and that the concentration at steady
state falls off exponentially with distance. (B) A threefold increase in the diffusion
constant of the morphogen extends its range but lowers its concentration next to
the source, whereas (C) a threefold increase in morphogen half-life increases its
concentration throughout the tissue. Effects of the morphogen will depend not just
on its concentration at some critical moment, but also on how each target cell
integrates its response over time. (Courtesy of Patrick Müller.)

Figure 21-10 Genesis of asymmetry through lateral inhibition and
positive feedback. In this example, two cells interact, each producing
a substance X that acts on the other cell to inhibit its production of X, an
effect known as lateral inhibition. An increase of X in one of the cells leads
to a positive feedback that tends to increase X in that cell still further, while
decreasing X in its neighbor. This can create a runaway instability, making
the two cells become radically different. Ultimately, the system comes to rest
in one or the other of two opposite stable states. The final choice of state
represents a form of memory: the small influence that initially directed the
choice is no longer required to maintain it.

Consider a pair of adjacent cells that start off in a similar state. Each of these
cells can both produce and respond to a certain signal molecule X, with the added
rule that the stronger the signal a cell receives, the weaker the signal it generates
(Figure 21-10). If one cell produces more X, the other is forced to produce
less. This gives rise to a positive feedback loop that tends to amplify any initial
difference between the two adjacent cells. Such a difference may arise from a
bias imposed by some present or past external factor, or it may simply originate
from spontaneous random fluctuations, or "noise," an inevitable feature of the
genetic control circuitry in cells (discussed in Chapter 7). In either case, lateral
inhibition means that if cell 1 makes a little more of X, it will thereby cause cell 2 to
make less; and because cell 2 makes less X, it delivers less inhibition to cell 1 and
so allows the production of X in cell 1 to rise higher still; and so on, until a steady
state is reached where cell 1 produces a lot of X and cell 2 produces very little. In
the standard case, the signal molecule X acts in the receiving cell by regulating
gene transcription, and the result is that the two cells are driven along different
pathways of differentiation.
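
The runaway described above can be illustrated with a toy numerical model of the
two cells. This is only a sketch under stated assumptions: each cell makes X at a
rate that falls steeply with the amount of X made by its neighbor, and X is lost at a
constant fractional rate. The inhibition curve, the parameter values K and h, and
the function names are all illustrative choices, not taken from the text.

```python
def production(received, K=0.5, h=4):
    """Rate at which a cell makes X, falling as the X it receives rises.
    K (half-maximal inhibition) and h (steepness) are illustrative values."""
    return 1.0 / (1.0 + (received / K) ** h)

def simulate(x1, x2, dt=0.01, steps=5000):
    """Euler integration of dx1/dt = production(x2) - x1 and the mirror
    equation for cell 2. Returns the final X levels of the two cells."""
    for _ in range(steps):
        dx1 = production(x2) - x1
        dx2 = production(x1) - x2
        x1, x2 = x1 + dt * dx1, x2 + dt * dx2
    return x1, x2

x1_final, x2_final = simulate(0.51, 0.50)   # cell 1 starts slightly ahead
print(f"cell 1: {x1_final:.3f}   cell 2: {x2_final:.3f}")
```

With these illustrative settings, a small initial advantage in cell 1 grows until cell 1
makes X at close to its maximal rate while cell 2 makes very little. Giving the same
small advantage to cell 2 instead produces the mirror-image outcome, and the
initial bias is no longer needed once either state has been reached.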

In almost all tissues, a balanced mixture of different cell types is required.
Lateral inhibition provides a common way to generate the mixture. As we
shall see, lateral inhibition is very often mediated by exchange of signals at
cell-cell contacts via the Notch signaling pathway, driving cell diversification by
enabling individual cells that express one set of genes to direct their immediate
neighbors to express a different set, in exactly the way we have described (see also
Figure 15-58).


[Figure 21-10 diagram labels: cell 1; cell 2; transient bias creates slight asymmetry.]



Short-Range Activation and Long-Range Inhibition Can Generate
Complex Cellular Patterns

Lateral inhibition mediated by the Notch pathway is not the only example of pattern
generation through positive feedback: there are other ways in which, through
the same basic principle, a system that starts off homogeneous and symmetrical
can pattern itself spontaneously, even in the absence of an external morphogen.
Positive feedback processes mediated by diffusible signal molecules can operate
over broad arrays of cells to create many types of spatial patterns. Mechanisms of
this sort are called reaction-diffusion systems. For example, a substance A (a
short-range activator) may stimulate its own production in the cells that contain it and
in their immediate neighbors, while also causing these cells to produce a signal
I (a long-range inhibitor) that diffuses widely and inhibits the production of A in
cells farther away. If the cells all start the same, but one group gains a slight
advantage by making a little more A than the rest, the asymmetry can be self-amplifying
(Figure 21-11). Such short-range activation combined with long-range inhibition
can account for the formation of clusters of cells within an initially homogeneous
tissue that become specialized as localized signaling centers.

Figure 21-11 Pattern generation by a reaction-diffusion system. From
(A) a uniform field of cells, (B) local positive feedback and (C) long-range
inhibition can (D) generate patterns within the initially uniform field. The
patterns can be complex, resembling the spots of a leopard (as shown) or
the stripes of a zebra; or they can be simple, with creation of a single cluster
of specialized cells that can, for example, go on to serve as the source of a
morphogen gradient.
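
Why the inhibitor must spread farther than the activator can be checked with a
small calculation. The sketch below is an assumption-laden illustration rather than
a model from the text: it takes a linearized activator-inhibitor system with made-up
interaction strengths (a, b, c, d) and asks whether a small spatial ripple of
wavenumber q grows or dies away. The function name and all numerical values are
hypothetical.

```python
import cmath

def growth_rate(q, D_act, D_inh, a=1.0, b=-1.0, c=2.0, d=-1.5):
    """Largest real part of the eigenvalues of the linearized system
        dA/dt = a*A + b*I + D_act * d2A/dx2
        dI/dt = c*A + d*I + D_inh * d2I/dx2
    for a small ripple of wavenumber q. A positive value means it grows.
    a > 0: A stimulates itself; b < 0: I inhibits A; c > 0: A induces I;
    d < 0: I is removed. All four values are illustrative."""
    m11 = a - D_act * q * q
    m22 = d - D_inh * q * q
    trace = m11 + m22
    det = m11 * m22 - b * c
    root = cmath.sqrt(trace * trace - 4.0 * det)
    return max(((trace + root) / 2).real, ((trace - root) / 2).real)

qs = [0.02 * i for i in range(151)]                       # wavenumbers 0 to 3
long_range = max(growth_rate(q, 1.0, 20.0) for q in qs)   # inhibitor spreads 20x faster
same_range = max(growth_rate(q, 1.0, 1.0) for q in qs)    # equal spreading
print(long_range > 0, same_range > 0)                     # prints: True False
```

In this sketch the uniform state is stable against uniform disturbances (q = 0) in
both cases, but ripples of intermediate wavelength grow only when the inhibitor
diffuses much faster than the activator. That is the condition under which an
initially homogeneous field can break up into spaced clusters.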

Asymmetric Cell Division Can Also Generate Diversity 

Cell diversification does not always depend on extracellular signals: in some
cases, daughter cells are born different as a result of an asymmetric cell division,
in which some important molecule or molecules are distributed unequally
between the two daughters. This asymmetric inheritance ensures that the two
daughter cells develop differently (Figure 21-12). Asymmetric division is a common
feature of early development, where the fertilized egg already has an internal
pattern and cleavage of this large cell segregates different determinants into
separate blastomeres. We shall see that asymmetric division also plays a part in some
later developmental processes.

Initial Patterns Are Established in Small Fields of Cells and Refined 
by Sequential Induction as the Embryo Grows 

The signals that organize the spatial pattern of cells in an embryo generally act
over short distances and govern relatively simple choices. A morphogen, for
example, typically acts over a distance of less than 1 mm (an effective range for
diffusion) and directs choices between several developmental options for the
cells on which it acts. Yet the organs that eventually develop are much larger and
more complex than this.

The cell proliferation that follows the initial specification accounts for the size
increase, while the refinement of the initial pattern is explained by a series of local
inductions plus other interactions that add successive levels of detail on an initially
simple sketch. For example, as soon as two types of cells are present in a
developing tissue, one of them can produce a signal that induces a subset of the
neighboring cells to specialize in a third way. The third cell type can in turn signal
back to the other two cell types nearby, generating a fourth and a fifth cell type,
and so on (Figure 21-13).

Figure 21-12 Two ways of making sister cells different.
1. asymmetric division: sister cells born different
2. symmetric division: sister cells become different as a result of
influences acting on them after their birth

Figure 21-13 Patterning by sequential induction. A series of inductive interactions
can generate many types of cells, starting from only a few. [Diagram labels: C is
induced by signal from B acting on A; D and E are induced by signal from C acting
on A and B, respectively.]

This strategy for generating a progressively more complicated pattern is called
sequential induction. It is chiefly through sequential inductions that the body
plan of a developing animal, after being first roughed out in miniature, becomes
elaborated with finer and finer details as development proceeds.
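
The logic of sequential induction can be mimicked with a deliberately simple toy:
a row of cells, written as letters, in which a signaling cell type converts any
responsive cell that touches it into a new type. The function name, the one-letter
cell labels, and the rule that signals reach only immediate neighbors are all
assumptions made for illustration; the order of inductions follows the scheme of
Figure 21-13.

```python
def induce(cells, signal, target, new_type):
    """Return a new row of cells in which every `target` cell that touches
    a `signal` cell has switched to `new_type`. Cells are single letters."""
    result = list(cells)
    for i, cell in enumerate(cells):
        neighbors = cells[max(i - 1, 0):i] + cells[i + 1:i + 2]
        if cell == target and signal in neighbors:
            result[i] = new_type
    return "".join(result)

row = "AAAABBBB"                      # two initial cell types
row = induce(row, "B", "A", "C")      # signal from B acting on A induces C
row = induce(row, "C", "A", "D")      # signal from C acting on A induces D
row = induce(row, "C", "B", "E")      # signal from C acting on B induces E
print(row)                            # prints: AADCEBBB
```

Three rounds of purely local signaling turn a two-type row into a five-type pattern,
with each new type arising only where the types created in the previous round meet
their neighbors.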

Developmental Biology Provides Insights into Disease and Tissue 
Maintenance 

The rapid progress in understanding animal development has been one of the
great success stories in biology over the last few decades, and it has important
practical implications. Some 2 to 5% of all human babies are born with anatomical
abnormalities, such as heart malformations, truncated limbs, cleft palate, or spina
bifida. Advances in developmental biology help us understand how these defects
arise, even if we cannot yet prevent or cure most of them.

Less obvious, but even more important from a practical point of view, is that
developmental biology provides insights into the workings of cells and tissues
in the adult body. Developmental processes do not halt at birth; they continue
throughout life, as tissues are maintained and repaired. The fundamental mechanisms
of cell growth and division, cell-cell signaling, cell memory, cell adhesion,
and cell movement are involved in adult tissue maintenance and repair, just as
they are in embryo development.

Embryos are simpler than adults, and they allow us to analyze such basic processes
more easily. Studies of the early Drosophila embryo, for example, were crucial
to the discovery of several conserved signaling pathways, including the Wnt,
Hedgehog, and Notch pathways. They also provided the key to understanding the
central role of these pathways in the maintenance of normal adult human tissues
and in diseases such as cancer.

In Chapter 22, we shall consider how other developmental mechanisms operate
in the normal adult body, especially in tissues that are continually renewed by
means of stem cells, including the gut, skin, and the hematopoietic system. But
now, we must look more closely at the way in which an early embryo generates its
spatial pattern of specialized cells, beginning with the transformations that create
the adult body plan.

Summary 

Animal development is a self-assembly process, in which the cells of the embryo
become different from one another and organize themselves into increasingly complex
structures. The process begins with a single large cell, the fertilized egg. This
cell cleaves to form many smaller cells, producing a blastula. The blastula undergoes
gastrulation to generate the three germ layers of the embryo (ectoderm, mesoderm,
and endoderm), consisting of cells determined for different fates. As development
continues, the cells become more and more narrowly specialized according
to their locations and their interactions with one another. Through cell memory,
these cell-cell interactions, even though transient, can have lasting effects on each
cell's internal state. Thus, a succession of simple cues that a cell receives at different
times can direct it along a complex developmental pathway. At each step, the cell
becomes further restricted in the range of final states open to it. The process reaches
its limit when the cell differentiates to form one of the specialized cell types of the
adult body.

Differences between developing cells arise in various ways and have to be properly
coordinated in space. In one common strategy, initially similar cells within
a group become different by exposure to different levels of an inductive signal or
morphogen emanating from a source outside the group. Neighboring cells can also
become different by lateral inhibition, in which a cell signals to its neighbors not to
follow the same fate. These cell-cell interactions are mediated by a small number of
highly conserved signaling pathways, which are used repeatedly in different organisms
and at different times during development. Not all cell diversification arises
by cell-cell interactions, however: daughter cells can be born different as a result of
asymmetric cell division.

Regulators of transcription and chromatin structure bind to regulatory DNA
and determine the fate of each cell. Differences of body plan seem to arise to a large
extent from differences in the regulatory DNA associated with each gene. This DNA
has a central role in defining the sequential program of development, calling genes
into action at specific times and places according to the pattern of gene expression
that was present in each cell at the previous developmental stage.

Development has been most thoroughly studied in a handful of model organisms.
But most of the genes and mechanisms thereby identified are used in all animals
and repeatedly at different stages of development. Thus, insights from worms,
flies, fish, frogs, and mice deeply inform our understanding of embryology, birth
defects, and adult tissue maintenance in humans.

MECHANISMS OF PATTERN FORMATION 

A developing multicellular organism has to create a pattern in fields of cells where
there was little or none before. Some of the early microscopists imagined the
entire shape and structure of the human body to be already present in the sperm
as a "homunculus," a miniature human; after fertilization, the homunculus would
simply grow and generate a full-sized human. We now know that this view is
incorrect and that development is a progression from simple to complex, through
a gradual refinement of an animal's anatomy. To see how the whole sequence of
events of spatial patterning and cell determination is set in train, we must return
to the egg and the early embryo.

Different Animals Use Different Mechanisms to Establish Their
Primary Axes of Polarization

Surprisingly, the earliest steps of animal development are among the most variable,
even within a phylum. A frog, a chicken, and a mammal, for example, even
though they develop in similar ways later, make eggs that differ radically in size
and structure, and they begin their development with different sequences of cell
divisions and cell specializations. Gastrulation occurs in all animal embryos, but
the details of its timing, of the associated pattern of cell movements, and of the
shape and size of the embryo as gastrulation proceeds are highly variable. Likewise,
there is great variation in the time and manner in which the primary axes of
the body become marked out. However, this polarization of the embryo usually
becomes discernible very early, before gastrulation begins: it is the first step of
spatial patterning.

Three axes generally have to be established. The animal-vegetal (A-V) axis,
in most species, defines which parts are to become internal (through the movements
of gastrulation) and which are to remain external. (The bizarre name dates
from a century ago and has nothing to do with vegetables.) The anteroposterior
(A-P) axis specifies the locations of future head and tail. The dorsoventral (D-V)
axis specifies the future back and belly.

Figure 21-14 The frog egg and its asymmetries. (A) Side view of a Xenopus egg photographed just before fertilization.
(B) The asymmetric distribution of molecules inside the egg, and how this changes following fertilization so as to define a
dorsoventral as well as an animal-vegetal asymmetry. Fertilization, through a reorganization of the microtubule cytoskeleton,
triggers a rotation of the egg cortex (a layer a few µm deep) through about 30° relative to the core of the egg; the direction of
rotation is determined by the site of sperm entry. Some components are carried still further to the future dorsal side by active
transport along microtubules. The resulting dorsal concentration of Wnt11 mRNA leads to dorsal production of the Wnt11 signal
protein and defines the dorsoventral polarity of the future embryo. Vegetally localized VegT defines the vegetal source of signals
that will induce endoderm and mesoderm. (A, courtesy of Tony Mills.)


At one extreme, the egg is spherically symmetrical, and the axes only become
defined during embryogenesis. The mouse comes close to being an example, with
little obvious sign of polarity in the egg. Correspondingly, the blastomeres produced
by the first few cell divisions seem to be all alike and are remarkably adaptable.
If the early mouse embryo is split in two, a pair of identical twins can be
produced: two complete, normal individuals from a single cell. Similarly, if one of
the cells in a two-cell mouse embryo is destroyed by pricking it with a needle and
the resulting "half-embryo" is placed in the uterus of a foster mother to develop,
in many cases a perfectly normal mouse will emerge.

At the opposite extreme, the structure of the egg defines the future axes of the
body. This is the case for most species, including insects such as Drosophila, as we
shall see shortly. Many other organisms lie between the two extremes. The egg of
the frog Xenopus, for example, has a clearly defined A-V axis even before fertilization:
the nucleus near the top defines the animal pole, while the mass of yolk (the
embryo's food supply, destined to be incorporated in the gut) toward the bottom
defines the vegetal pole. Several types of mRNA molecules are already localized
in the vegetal cytoplasm of the egg, where they produce their protein products.
After fertilization, these mRNAs and proteins act in and on the cells in the lower
and middle part of the embryo, giving the cells there specialized characters, both
by direct effects and by stimulating the production of secreted signal proteins. For
example, mRNA encoding the transcription regulator VegT is deposited at the
vegetal pole during oogenesis. After fertilization, this mRNA is translated, and the
resulting VegT protein activates a set of genes that code for signal proteins that
induce mesoderm and endoderm, as discussed later.

The D-V axis of the Xenopus embryo, by contrast, is defined through the act of
fertilization. Following entry of the sperm, the outer cortex of the egg cytoplasm
rotates relative to the central core of the egg, so that the animal pole of the cortex
becomes slightly shifted to one side (Figure 21-14). Treatments that block
the rotation allow cleavage to occur normally but produce an embryo with a central
gut and no dorsal structures or D-V asymmetry. Thus, this cortical rotation is
required to define the D-V axis of the future body by creating the D-V axis of the
egg.


The site of sperm entry biases the direction of the cortical rotation in
Xenopus, perhaps through the centrosome that the sperm brings into the egg,
inasmuch as the rotation is associated with a reorganization of the microtubules
nucleated from the centrosome in the egg cytoplasm. The reorganization leads to
a microtubule-based transport of several cytoplasmic components, including the
mRNA coding for Wnt11, a member of the Wnt family of signal proteins, moving
it toward the future dorsal side (see Figure 21-14). This mRNA is soon translated,
and the Wnt11 protein secreted from cells that form in that region of the embryo
activates the Wnt signaling pathway (see Figure 15-60). This activation is crucial
for triggering the cascade of subsequent events that will organize the dorsoventral
axis of the body. (The A-P axis of the embryo will only become clear later, in the
process of gastrulation.)

Although different animal species use a variety of different mechanisms to
specify their axes, the outcome has been relatively well conserved in evolution:
head is distinguished from tail, back from belly, and gut from skin. It seems that it
does not much matter what tricks the embryo uses to break the initial symmetry
and set up this basic body plan.

Studies in Drosophila Have Revealed the Genetic Control
Mechanisms Underlying Development 

It is the fly Drosophila, more than any other organism, that has provided the key to
our present understanding of how genes govern development. Decades of genetic
study culminated in a large-scale genetic screen, focusing especially on the early
embryo and searching for mutations that disrupt its pattern. This revealed that
the key developmental genes fall into a relatively small set of functional classes
defined by their mutant phenotypes. The discovery of these genes and the subsequent
analysis of their functions was a famous tour de force and had a revolutionary
impact on all of developmental biology, earning its discoverers a Nobel Prize.

Some parts of the developmental machinery revealed in this way are conserved
between flies and vertebrates, some parts not. But the logic of the experimental
approach and the general strategies of genetic control that it revealed have
transformed our understanding of multicellular development in general.

To understand how the early developmental machinery operates in Drosophila,
it is important to note a peculiarity of fly development. Like the eggs of other
insects, but unlike most vertebrates, the Drosophila egg—shaped like a cucumber—
begins its development with an extraordinarily rapid series of nuclear divisions
without cell division, producing multiple nuclei in a common cytoplasm—a
syncytium. The nuclei then migrate to the cell cortex, forming a structure called
the syncytial blastoderm. After about 6000 nuclei have been produced, the plasma
membrane folds inward between them and partitions them into separate cells,
converting the syncytial blastoderm into the cellular blastoderm (Figure 21-15).

We shall see that the initial patterning of the Drosophila embryo depends on
signals that diffuse through the cytoplasm at the syncytial stage and exert their
actions on genes in the rapidly dividing nuclei, before the partitioning of the egg
into separate cells. Here, there is no need for the usual forms of cell-cell signaling;
neighboring regions of the syncytial blastoderm can communicate by means of
transcription regulatory proteins that move through the cytoplasm of the giant
multinuclear cell.

Egg-Polarity Genes Encode Macromolecules Deposited in the Egg 
to Organize the Axes of the Early Drosophila Embryo 

Figure 21-15 Development of the Drosophila egg from fertilization to the
cellular blastoderm stage.

As in most insects, the main axes of the future body of Drosophila are defined
before fertilization by a complex exchange of signals between the developing egg,
or oocyte, and the follicle cells that surround it in the ovary. In the stages before
fertilization, the anteroposterior and dorsoventral axes of the future embryo
become defined by four systems of egg-polarity genes that create landmarks—
either mRNA or protein—in the developing oocyte. Following fertilization, each
landmark serves as a beacon, providing a signal that organizes the developmental
process in its neighborhood.

The nature of the genes emerged from studies of mutants in which the patterning
of the embryo was altered. One class of mutations gave embryos with disrupted
polarity—for example, tail-end structures at both ends of the body, with
no head-end structures. This class of mutations identified the set of egg-polarity
genes. The egg-polarity gene responsible for the signal that organizes the anterior
end of the embryo is called Bicoid. A deposit of Bicoid mRNA molecules is
localized, before fertilization, at the anterior end of the egg. Upon fertilization,
the mRNA is translated to produce Bicoid protein. This protein is an intracellular
morphogen and transcription regulator that diffuses away from its source to form
a concentration gradient within the syncytial cytoplasm, with its maximum at the
head end of the embryo (Figure 21-16). The different concentrations of Bicoid
along the A-P axis help determine different cell fates by regulating the transcription
of genes in the nuclei of the syncytial blastoderm (discussed in Chapter 7).
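
The way a single gradient can specify several outcomes can be illustrated with a minimal sketch. It assumes that the Bicoid concentration falls off exponentially with distance from the anterior pole, which is the steady state of a simple source-diffusion-degradation model; the text itself does not specify the shape of the gradient. The decay length, the threshold values, and the target names are hypothetical and chosen only for illustration.

```python
import math

# Assumption: steady-state exponential gradient, C(x) = c0 * exp(-x / decay_length).
# x is the fractional egg length (0 = anterior pole, 1 = posterior pole).
# c0, decay_length, and the thresholds below are illustrative, not measured values.
def bicoid_concentration(x, c0=1.0, decay_length=0.2):
    return c0 * math.exp(-x / decay_length)

def active_targets(x, thresholds):
    """Return the hypothetical target genes whose activation threshold
    is met by the Bicoid concentration at position x."""
    c = bicoid_concentration(x)
    return [name for name, threshold in thresholds.items() if c >= threshold]

# Hypothetical targets: one needs a high Bicoid level, the other a lower level.
THRESHOLDS = {"high_threshold_target": 0.5, "low_threshold_target": 0.1}

for x in (0.0, 0.2, 0.4, 0.8):
    print(f"x={x:.1f}  Bicoid={bicoid_concentration(x):.3f}  "
          f"on={active_targets(x, THRESHOLDS)}")
```

In this sketch, nuclei near the anterior pole switch on both targets, nuclei farther back switch on only the low-threshold target, and nuclei near the posterior switch on neither, so one smooth gradient yields three distinct territories.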

Of the three other egg-polarity gene systems, two contribute to patterning the
syncytial nuclei along the A-P axis and one to patterning them along the D-V axis.
Together with the Bicoid group of genes, and acting in a broadly similar way, their
gene products mark out three fundamental partitions of body regions—head versus
rear, dorsal versus ventral, and endoderm versus mesoderm and ectoderm—
as well as a fourth partition, no less fundamental to the body plan of animals: the
distinction between germ cells and somatic cells (Figure 21-17).

The egg-polarity genes have a further special feature: they are all maternal-effect
genes, in that it is the mother's genome rather than the zygote's genome that
is critical. For example, a fly whose chromosomes are mutant in both copies of the
Bicoid gene but who is born from a mother carrying one normal copy of Bicoid
develops perfectly normally, without any defects in the head pattern. However, if
that offspring is a female, she cannot deposit any functional Bicoid mRNA into her
own eggs, which will therefore develop into headless embryos, regardless of the
father's genotype.
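
The logic of a maternal-effect gene can be written out as a short sketch. The function names and the '+'/'-' allele notation are introduced here for illustration only; the rule encoded is the one just described, namely that the head pattern depends on the mother's genotype and not on that of the father or the embryo.

```python
def mother_supplies_bicoid(mother_alleles):
    """A mother deposits functional Bicoid mRNA if she carries at least one
    functional copy. Alleles are written '+' (functional) or '-' (mutant)."""
    return "+" in mother_alleles

def embryo_phenotype(mother, father, embryo):
    # The father's and the embryo's own genotypes are accepted but deliberately
    # ignored: that is the defining feature of a maternal-effect gene.
    return "normal head" if mother_supplies_bicoid(mother) else "headless"

# Heterozygous mother, homozygous mutant embryo: develops normally.
print(embryo_phenotype(mother="+-", father="--", embryo="--"))
# Homozygous mutant mother: headless embryo, whatever the father contributes.
print(embryo_phenotype(mother="--", father="++", embryo="+-"))
```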

The egg-polarity genes act first in a hierarchy of gene systems that define a
progressively more detailed pattern of body parts. In the next few pages, we begin
with the molecular mechanisms that pattern the developing Drosophila embryo
and larva along the A-P axis, before considering the patterning along the D-V axis.

Figure 21-16 The Bicoid protein gradient. (A) Bicoid mRNA is deposited at
the anterior pole during oogenesis. (B) Local translation followed by diffusion
generates the Bicoid protein gradient. (C) Absence of the Bicoid protein gradient
in embryos from Bicoid homozygous mutant mothers. (A and B, courtesy of
Stephen Small.)

Figure 21-17 The organization of the four egg-polarity gradient systems in Drosophila. Nanos is a translational repressor that governs the
formation of the abdomen. Localized Nanos mRNA is also incorporated into the germ cells as they form at the posterior of the embryo, and Nanos
protein is necessary for germ-line development. Bicoid protein is a transcriptional activator that determines the head and thoracic regions. Toll and
Torso are receptor proteins that are distributed all over the membrane but are activated only at the sites indicated by the coloring, through localized
exposure to the extracellular ligands Spaetzle (the ligand for Toll) and Trunk (the ligand for Torso). Toll activity determines the mesoderm and Torso
activity determines the formation of terminal structures.
Panel labels: terminal system; dorsoventral system; localized mRNA (Nanos); localized mRNA (Bicoid). Determining: germ cells vs. somatic cells;
head vs. rear; body segments; ectoderm vs. mesoderm vs. endoderm; terminal structures.

Figure 21-18 The origins of the Drosophila body segments. (A) At
3 hours, the embryo (shown in side view) is at the blastoderm stage and
no segmentation is visible, although a fate map can be drawn showing the
future segmented regions (color). (B) At 10 hours, all the segments are
clearly defined (T1: first thoracic segment; A1: first abdominal segment).
See Movie 21.3. (C) The segments of the Drosophila larva and their
correspondence with regions in the embryo. (D) The segments of the
Drosophila adult and their correspondence with regions in the embryo.


Three Groups of Genes Control Drosophila Segmentation Along 
the A-P Axis 

The body of an insect is divided along its A-P axis into a series of segments. The
segments are repetitions of a theme with variations: each segment forms highly
specialized structures, but all built according to a similar fundamental plan
(Figure 21-18). The gradients of transcription regulators set up along the A-P axis in
the early embryo by the egg-polarity genes are the prelude to creation of the segments.
These regulators initiate the orderly transcription of segmentation genes,
which refine the pattern of gene expression to define the boundaries and ground
plan of the individual segments. Segmentation genes are expressed by subsets of
cells in the embryo, and their products are the first components that the embryo's
own genome contributes to embryonic development; they are therefore called
zygotic-effect genes, to distinguish them from the earlier-acting maternal-effect
genes. Mutations in segmentation genes can alter either the number of segments
or their basic internal organization.

The segmentation genes fall into three groups according to their mutant phenotypes
(Figure 21-19). It is convenient to think of the three groups as acting in
sequence, although in reality their functions overlap in time. First to be expressed
is a set of at least six gap genes, whose products mark out coarse A-P subdivisions
of the embryo. Mutations in a gap gene eliminate one or more groups of adjacent
segments: in the mutant Krüppel, for example, the larva lacks eight segments. Next
comes a set of eight pair-rule genes. Mutations in these genes cause a series of
deletions affecting alternate segments, leaving the embryo with only half as many
segments as usual; although all the mutants display this two-segment periodicity,
they differ in the precise pattern. Finally, there are at least 10 segment-polarity
genes, in which mutations produce a normal number of segments but with a part
of each segment deleted and replaced by a mirror-image duplicate of all or part of
the rest of the segment.
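
The three classes of mutant phenotype can be caricatured as three operations on a row of segments. The following sketch is only a bookkeeping device: the segment count, the "abcd" pattern within each segment, the positions of the deleted block, and the function names are all hypothetical, and treating pair-rule deletions as the loss of whole segments is a simplification.

```python
# Each segment is modelled as (label, pattern); the pattern string is read from
# anterior to posterior. All values here are illustrative assumptions.
def wild_type(n_segments=14):
    return [(i, "abcd") for i in range(1, n_segments + 1)]

def gap_mutant(segments, first, last):
    """Gap class: delete one contiguous block of segments (labels first..last)."""
    return [s for s in segments if not (first <= s[0] <= last)]

def pair_rule_mutant(segments, delete_odd=True):
    """Pair-rule class: delete alternate segments, leaving half as many."""
    keep_parity = 0 if delete_odd else 1
    return [s for s in segments if s[0] % 2 == keep_parity]

def segment_polarity_mutant(segments, cut=2):
    """Segment-polarity class: keep every segment, but replace the posterior
    part of each pattern with a mirror image of the retained anterior part."""
    return [(label, pattern[:cut] + pattern[:cut][::-1])
            for label, pattern in segments]

larva = wild_type()
print(len(gap_mutant(larva, 4, 11)))         # eight adjacent segments missing
print(len(pair_rule_mutant(larva)))          # half the usual number
print(segment_polarity_mutant(larva)[0])     # normal number, mirrored pattern
```

The two pair-rule variants (`delete_odd=True` or `False`) give the same two-segment periodicity but differ in which segments survive, echoing the point that pair-rule mutants differ in their precise pattern.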

In parallel with the segmentation process, a further set of genes—the homeotic
selector, or Hox, genes—serves to define and preserve the differences between one
segment and the next, as we describe shortly.

The phenotypes of the various segmentation mutants suggest that the segmentation
genes form a coordinated system that subdivides the embryo progressively
into smaller and smaller domains along the A-P axis, each distinguished
by a different pattern of gene expression. Molecular genetics has helped to reveal
how this system works.






A Hierarchy of Gene Regulatory Interactions Subdivides the 
Drosophila Embryo 

Like Bicoid, most of the segmentation genes encode transcription regulator proteins.
Their control by the egg-polarity genes and their actions on one another and
on still other genes can be deciphered by comparing gene expression in normal
and mutant embryos. By using appropriate probes to detect RNA transcripts or
their protein products, one can observe genes switch on and off in changing patterns.
By comparing these patterns in different mutants, one can begin to discern
the logic of the entire gene control system.

The products of the egg-polarity genes provide the global positional signals in
the early embryo (see Figure 21-17). The Bicoid protein, as we have seen, acts as
a morphogen and activates different sets of genes at different positions along the
A-P axis: some gap genes are only activated in regions with high levels of Bicoid,
others only where levels of Bicoid are lower. After the gap gene products refine
their positions by mutual repression, they provide a second tier of positional signals
that act more locally to regulate finer details of patterning. Gap genes act by
controlling the expression of yet other genes, including the pair-rule genes. The
pair-rule genes, in turn, collaborate with one another and with the gap genes to
set up a regular, periodic pattern of expression of the segment-polarity genes,
which collaborate with one another to define the internal pattern of each individual
segment (Figure 21-20).

Figure 21-19 Examples of the phenotypes of mutations affecting egg-polarity genes and the three types of
segmentation genes. In each case, the areas shaded in green on the normal larva (left) are deleted in the mutant or are
replaced by mirror-image duplicates of the unaffected regions. (Modified from C. Nüsslein-Volhard and E. Wieschaus, Nature
287:795-801, 1980. With permission from Macmillan Publishers Ltd.)

The initial steps in creation of the segmental pattern occur before cellularization
of the syncytial blastoderm and are governed by the combinatorial effects of
transcription regulators, as discussed in detail in Chapter 7 for the regulation of
the expression of the pair-rule gene Even-skipped (see pp. 394-396). After cellularization,
the segment-polarity genes further subdivide each segment into smaller
domains. A large subset of the segment-polarity genes codes for components of
two signaling pathways—the Wnt pathway and the Hedgehog pathway, including
the secreted signal proteins Wingless (the first-named member of the Wnt family)
and Hedgehog. (The Hedgehog pathway was first discovered through study
of Drosophila segmentation, and it takes its name from the prickly appearance
of the surface of the Hedgehog mutant embryo.) Wingless and Hedgehog are
synthesized in different bands of cells that serve as signaling centers within each segment.
The two proteins mutually maintain each other's expression, while regulating
the expression of genes such as Engrailed in neighboring cells (Figure 21-21).
In such a manner, a series of sequential inductions creates a fine-grained pattern
of gene expression within each segment.

Egg-Polarity, Gap, and Pair-Rule Genes Create a Transient Pattern 
That Is Remembered by Segment-Polarity and Hox Genes 

The gap genes and pair-rule genes are activated within the first few hours after
fertilization. Their mRNA products initially appear in patterns that only approximate
the final picture; then, within a short time, this fuzzy initial pattern resolves itself
into a regular, crisply defined system of stripes. But this pattern itself is unstable
and transient: as the embryo proceeds through gastrulation and beyond, the
pattern disintegrates. The genes' actions, however, have passed on an enduring
memory of their patterns of expression by inducing the expression of certain
segment-polarity genes along with Hox genes (discussed shortly). After a period of
pattern refinement mediated by cell-cell interactions, the expression patterns of
these new groups of patterning genes are stabilized to provide positional labels that
serve to maintain the segmental organization of the larva and adult fly.

Figure 21-20 An example of the regulatory hierarchy of egg-polarity,
segmentation, and Hox genes. As discussed in the text, there are three
groups of segmentation genes. The photographs show mRNA expression
patterns of representative examples of genes of each type. (Courtesy of Stephen
Small.)

The segment-polarity gene Engrailed provides a good example. Its RNA transcripts
form a series of 14 bands in the cellular blastoderm, each approximately
one-cell wide. These stripes lie immediately anterior to similar stripes of expression
of another segment-polarity gene, Wingless. As the cells in the developing
embryo continue to grow, divide, and move, a mutually reinforcing signal
between the Wingless-expressing cells and the Engrailed-expressing cells maintains
narrow stripes of their expression (see Figure 21-21). After three cell cycles,
newly expressed regulators stabilize an Engrailed expression pattern that will last
throughout the life of the fly, long after the signals that induced and refined it have
disappeared. The segment borders will form at the posterior edge of each such
Engrailed stripe (Figure 21-22).

Figure 21-21 Mutual maintenance of Hedgehog and Wingless expression.
Engrailed is a transcription regulator (blue) that drives the expression of Hedgehog.
Hedgehog encodes a secreted protein (red) that activates its signaling pathway
in neighboring cells and thereby drives them to express the Wingless gene. In
turn, Wingless encodes a secreted protein (green) that acts back on neighbors of the
Wingless-expressing cell to maintain their expression of Engrailed and Hedgehog. As
indicated, the same control loop repeats along the A-P axis of the fly. (Based on
S. Dinardo et al., Curr. Opin. Genet. Dev. 4:529-534, 1994.)

In addition to regulating the segment-polarity genes, the products of pair-rule
genes collaborate with those of gap genes to induce the precisely localized activation
of a further set of genes—originally called homeotic selector genes and now
often called Hox genes, for reasons that will become clear shortly. It is the Hox
genes that permanently distinguish one segment from another. In the next section,
we examine these important genes in detail and consider their role in cell
memory; we shall see that this role is critical in a wide range of animals, including
ourselves.

Hox Genes Permanently Pattern the A-P Axis 

As animal development proceeds, the body becomes more and more complex.
But again and again, in every species and at every level of organization, we find
that complex structures are made by repeating a few basic themes, with variations.
Thus, a limited number of basic differentiated cell types, such as muscle
cells or fibroblasts, recur with subtle individual variations in different sites. These
cell types are organized into a limited variety of tissue types, such as muscle
or tendon, which again are repeated with subtle variations in different regions
of the body. From the various tissues, organs such as teeth or digits are built—
molars and incisors, fingers and thumbs and toes—a few basic kinds of structure,
repeated with variations.

Wherever we find this phenomenon of modulated repetition, we can break
down the developmental biologist's problem into two kinds of questions: what is
the basic construction mechanism common to all the objects of the given class,
and how is this mechanism modified to give the observed variations in different
animals? The segments of the insect body provide a good example. We have
thus far sketched the way in which the rudiment of a single body segment is
constructed and how cells within each segment become different from one another.
We now consider how one segment becomes determined, or specified, to be
different from another.

The first glimpse of the answer to this problem came over 80 years ago, with
the discovery of a set of mutations in Drosophila that cause bizarre disturbances
in the organization of the adult fly. In the Antennapedia mutant, for example, legs
sprout from the head in place of antennae, whereas in the Bithorax mutant, portions
of an extra pair of wings appear where normally there should be the much
smaller appendages called halteres (Figure 21-23). These mutations transform
parts of the body into structures appropriate to other positions, and they are
called homeotic mutations (from the Greek "homoios," meaning similar) because
the transformation is between structures of a recognizably similar general type,
changing one kind of limb, or one kind of segment, into another. It was eventually
discovered that a whole set of genes, the homeotic selector genes, or Hox
genes, serve to permanently specify the A-P characters of the whole set of animal
segments. These genes are all related to one another as members of a multigene
family.

There are eight Hox genes in the fly, and they all lie in one or the other of two
gene clusters known as the Bithorax complex and the Antennapedia complex.
The genes in the Bithorax complex control the differences among the abdominal
and thoracic segments of the body, while those in the Antennapedia complex control
the differences among thoracic and head segments. Comparisons with other
species show that the same genes are present in essentially all animals, including
humans. These comparisons also reveal that the Antennapedia and Bithorax
complexes are the two halves of a single entity, called the Hox complex, that has
become split in the course of the fly's evolution, and whose members operate in
a coordinated way to exert their control over the head-to-tail pattern of the body.

Figure 21-22 The pattern of expression of Engrailed, a segment-polarity gene.
The Engrailed pattern is shown in a 10-hour embryo and an adult (whose wings
have been removed in this preparation). The pattern is revealed by constructing
a strain of Drosophila containing the control sequences of the Engrailed gene
coupled to the coding sequence of the reporter LacZ, whose product is detected
histochemically through the brown product generated by immunohistochemistry
against LacZ (10-hour embryo) or through the blue product generated by a reaction
that LacZ catalyzes (adult). Note that the Engrailed pattern, once established, is
preserved throughout the animal's life. (Courtesy of Tom Kornberg.)

Figure 21-23 Homeotic mutations. Ultrabithorax, or Ubx, is one of three
genes in the Bithorax gene complex (a Hox gene cluster). Ubx is responsible for
all of the differences between the second and third thoracic segments. (A, B) Ubx
loss-of-function mutations transform the haltere-bearing segment (A) into a
wing-bearing segment, resulting in four-winged flies (B). (C) Ubx gain-of-function in the
second thoracic segment transforms this wing-bearing segment into a
haltere-bearing segment, resulting in wingless flies. (Courtesy of Richard Mann.)

The products of the Hox genes, the Hox proteins, are transcription regulators,
all of which possess a highly conserved, 60-amino-acid-long DNA-binding
homeodomain (see p. 376). The corresponding motif in the DNA sequence is called a
“homeobox,” from which, by abbreviation, the Hox complex takes its name. There
are many homeobox-containing genes, but only those located in a Hox complex
are Hox genes.

Hox Proteins Give Each Segment Its Individuality 

The Hox proteins can be viewed as molecular address labels possessed by the cells
of each segment: these labels give the cells in each region a positional value—that
is, an intrinsic character that differs according to a cell's location. If the address
labels in a developing Drosophila segment are changed, the segment behaves
as though it were located somewhere else; if all the Hox genes in an embryo are
deleted, the body segments in the larva will all be alike.

To a first approximation, each Hox gene is normally expressed in those regions
that develop abnormally when that gene is mutated or absent. How does each
Hox protein give a segment its permanent identity? All the Hox proteins are similar
in their DNA-binding regions, but they are very different in the regions that
interact with the other proteins with which the Hox proteins form transcriptional
regulatory complexes. The different protein partners act together with the Hox
proteins to dictate which DNA binding sites will be recognized, as well as whether
the effect on transcription at those sites will be activation or repression. Acting
in this way, the Hox proteins modulate the actions of many other transcription
regulators. Hundreds of genes are under this type of Hox-modulated control,
including genes for cell-cell signaling, transcriptional regulation, cell polarity,
cell adhesion, cytoskeletal function, cell growth, and cell death, all conspiring (in
ways that are not yet understood) to give each segment its distinctive Hox-dependent
character.

Hox Genes Are Expressed According to Their Order in the 
Hox Complex 

How, then, is the expression of the Hox genes themselves regulated? The coding
sequences of the eight Hox genes in the Antennapedia and Bithorax complexes in
Drosophila are interspersed amid a much larger quantity of regulatory DNA. This
DNA includes binding sites for the products of the egg-polarity and segmentation
genes, thereby serving as an interpreter of the multiple items of spatial information
supplied to it by all these transcription regulators. The net result is that the
particular set of Hox genes transcribed is appropriate for each location along the
A-P body axis.

The pattern of Hox gene expression exhibits a remarkable regularity that suggests
an additional form of control. The sequence in which the genes are ordered
along the chromosome, in both the Antennapedia and the Bithorax complexes,
corresponds almost exactly to the order in which they are expressed along the
A-P axis of the body (Figure 21-24). This hints at some process of gene activation,
perhaps dependent on chromatin structures that propagate along the Hox complexes,
switching on one Hox gene after another according to their order along the
chromosome. The most “posterior” of the Hox genes that are expressed in a cell
generally dominates, driving down expression and activity of the “anterior” genes
and dictating the character of the segment. The gene regulatory mechanisms
underlying these phenomena are still not well understood, but their consequences
are profound. We shall see that the serial organization of gene expression
in the Hox complex is a fundamental feature that has been highly conserved in the
course of animal evolution.
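
The rule that the most "posterior" Hox gene expressed in a cell generally dominates can be stated as a small lookup. This is a sketch of the rule only, not of its mechanism, which as noted is not well understood. The gene abbreviations are not listed in the passage above and are supplied here for illustration; the function name is hypothetical.

```python
# The eight Drosophila Hox genes in chromosomal (anterior-to-posterior) order:
# five in the Antennapedia complex, then three in the Bithorax complex.
# These abbreviations are supplied for illustration and do not appear in the text.
HOX_ORDER = ["lab", "pb", "Dfd", "Scr", "Antp", "Ubx", "abd-A", "Abd-B"]

def dominant_hox(expressed):
    """Return the most 'posterior' Hox gene among those expressed in a cell,
    or None if no Hox gene is expressed."""
    present = [gene for gene in HOX_ORDER if gene in expressed]
    return present[-1] if present else None

print(dominant_hox({"Antp", "Ubx"}))   # Ubx
print(dominant_hox({"Antp"}))          # Antp
print(dominant_hox(set()))             # None
```

In this toy model, removing Ubx from a cell that expresses both Antp and Ubx leaves Antp as the dominant gene, which loosely parallels the Ubx loss-of-function transformation shown in Figure 21-23.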


Figure 21-24 The patterns of expression compared to the chromosomal
locations of the genes of the Hox complex. The diagram shows the sequence of
genes in each of the two subdivisions of the chromosomal complex. This
corresponds, with minor deviations, to the spatial sequence in which
the genes are expressed, shown in the photograph of a Drosophila
embryo at the so-called germ band retraction stage, about 10 hours after
fertilization. The embryo has been stained by in situ hybridization with
differently labeled probes to detect the mRNA products of different Hox
genes in different colors. (Photograph courtesy of William McGinnis, adapted
from D. Kosman et al., Science 305:846, 2004. With permission from
AAAS.)


Trithorax and Polycomb Group Proteins Enable the Hox 
Complexes to Maintain a Permanent Record of Positional 
Information 

The spatial pattern of expression of the genes in the Hox complex is set up by signals
acting early in development, but the consequences are long lasting. Although
the pattern of expression undergoes complex adjustments as development proceeds,
the Hox complexes serve to stamp each cell and its progeny with a permanent
record of the A-P position that the cell occupied in the early embryo. In this
way, the cells of each segment are equipped with a long-term memory of their
location along the A-P axis of the body. This memory trace is somehow imprinted
on the Hox complexes, and it governs the segment-specific identity not only of the
larval segments, but also of the structures of the adult fly.

The molecular mechanism of this memory of positional information relies on
two types of regulation. One is from the Hox genes themselves: many of the Hox
proteins autoactivate the transcription of their own genes, thereby helping to keep
the genes on indefinitely. Another crucial input is from two large, complementary
sets of proteins, called the Trithorax group and the Polycomb group, which
stamp the chromatin of the Hox complex with a heritable record of its embryonic
state of activation or repression. These are key general regulators of chromatin
structure that can be shown to be critical for cell memory: if genes of the Trithorax
or Polycomb group are defective, the pattern of expression of the Hox genes is set
up correctly at first, but it is not correctly maintained as the embryo grows older.

The two sets of regulators act in opposite ways. Trithorax group proteins are
needed to maintain the transcription of Hox genes in cells where transcription has
already been switched on. In contrast, Polycomb group proteins form stable complexes
that bind to the chromatin of the Hox complex and maintain the repressed
state in cells where Hox genes have not been activated at the critical time (Figure
21-25). How such changes in chromatin can store developmental cell memory is
discussed in Chapters 4 and 7.
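
The opposing roles of the two groups can be summarized in a toy model of what happens to an initially correct Hox pattern once the early signals have gone. The function name and the dictionary representation are hypothetical. The model assumes, as a simplification, that an active gene falls silent without Trithorax function and that a silent gene becomes active without Polycomb function; the second assumption follows the mutant described in Figure 21-25.

```python
def maintain(initial_state, trithorax=True, polycomb=True):
    """Toy model of Hox cell memory after the early patterning signals are gone.

    initial_state maps a Hox gene name to True (on) or False (off), as set up
    correctly by the early signals. The return value is the later state.
    Assumptions: an 'on' gene stays on only if Trithorax group function is
    present; an 'off' gene stays off only if Polycomb group function is present.
    """
    later = {}
    for gene, on in initial_state.items():
        if on:
            later[gene] = trithorax       # activity maintained by Trithorax
        else:
            later[gene] = not polycomb    # repression maintained by Polycomb
    return later

early = {"Ubx": True, "Abd-B": False}     # hypothetical state of one cell
print(maintain(early))                    # pattern remembered
print(maintain(early, polycomb=False))    # silent gene switches on
print(maintain(early, trithorax=False))   # active gene is lost
```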

The D-V Signaling Genes Create a Gradient of the Transcription 
Regulator Dorsal 

As with the patterning along the Drosophila A-P axis just discussed, the patterning
along the dorsoventral (D-V) axis begins with maternal gene products that define
this axis in the egg (see Figure 21-17), and it then progresses through zygotic gene
products that further subdivide the D-V axis in the embryo.

Figure 21-25 The role of genes of the Polycomb group. (A) Photograph of
a wild-type Drosophila embryo. (B) Photograph of a mutant embryo defective
for the gene Extra sex combs (Esc) and derived from a mother also lacking
this gene. The gene belongs to the Polycomb group. Essentially all segments
have been transformed to resemble the most posterior abdominal segment.
In the mutant, the pattern of expression of the homeotic selector genes,
which is roughly normal initially, is unstable in such a way that all these genes
soon become switched on all along the body axis. (From G. Struhl, Nature
293:36-41, 1981. With permission from Macmillan Publishers Ltd.)

Initially, a protein that is produced by follicle cells underneath the future ventral
region of the embryo leads to the localized activation of a transmembrane
receptor, called Toll, on the ventral side of the egg membrane. The various maternal
genes required for this process are called D-V egg-polarity genes. (Curiously,
Drosophila Toll and vertebrate Toll-like proteins also operate in innate immune
responses, as discussed in Chapter 24.) The localized activation of Toll controls
the distribution of Dorsal, a transcription regulator of the NFκB family discussed
in Chapter 15. The Toll-regulated activity of Dorsal, like that of NFκB, depends on
the translocation of Dorsal from the cytosol, where it is held in an inactive form,
to the nucleus, where it regulates gene expression (see Figure 15-62). In the newly
laid egg, both Dorsal mRNA and protein are distributed uniformly in the cytosol.
After the nuclei in the syncytial blastoderm have migrated to the surface of the
embryo, but before cellularization (see Figure 21-15), Toll receptor activation on
the ventral side induces a remarkable redistribution of the Dorsal protein. On the
dorsal side, the protein remains in the cytosol, but ventrally it becomes concentrated
in the nuclei, with a smooth gradient of nuclear localization between these
two extremes (Figure 21-26).

Once inside the nucleus, the Dorsal protein acts as a morphogen and turns on
or off the expression of different sets of genes depending on Dorsal's concentration.
The expression of each responding gene depends on its regulatory DNA:
specifically, on the number and affinity of the binding sites that this DNA contains
for Dorsal and other transcription regulators. In this way, the regulatory DNA
interprets the positional signal provided by the nuclear Dorsal protein gradient,
so as to define a D-V series of territories, distinctive bands of cells that run the
length of the embryo. Most ventrally, where the nuclear concentration of Dorsal
protein is highest, Dorsal switches on, for example, the expression of a gene called
Twist, which is specific for mesoderm. Most dorsally, where the nuclear concentration
of Dorsal protein is lowest, the cells switch on a gene called Decapentaplegic
(Dpp). And in an intermediate region, where the nuclear concentration of
Dorsal protein is high enough to repress Dpp but too low to activate Twist, the
cells switch on another set of genes, including one called Short gastrulation (Sog)
(Figure 21-27A).
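
The threshold readout described in this paragraph can be illustrated with a small Python sketch. It is a toy model only: the function name, the normalized concentration scale (0 to 1), and both threshold values are assumptions made for illustration, not measured quantities. Real responding genes integrate many binding sites for Dorsal and for other transcription regulators.

```python
def dv_territory(nuclear_dorsal, twist_threshold=0.7, dpp_repression_threshold=0.2):
    """Return the representative gene switched on at a given nuclear Dorsal level.

    nuclear_dorsal is a normalized concentration between 0 (most dorsal)
    and 1 (most ventral). Both thresholds are hypothetical values.
    """
    if nuclear_dorsal >= twist_threshold:
        return "Twist"  # high Dorsal activates Twist (mesoderm)
    if nuclear_dorsal >= dpp_repression_threshold:
        return "Sog"    # enough Dorsal to repress Dpp, too little to activate Twist
    return "Dpp"        # lowest Dorsal: Dpp is not repressed


# Sample nuclear Dorsal levels from most ventral (1.0) to most dorsal (0.0).
levels = [1.0, 0.75, 0.5, 0.25, 0.0]
territories = [dv_territory(level) for level in levels]
print(territories)
```

A single graded input thus yields three discrete territories; shifting either assumed threshold moves the corresponding boundary along the D-V axis.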

Products of the genes directly regulated by the Dorsal protein generate in turn
more local signals, which define finer subdivisions along the D-V axis. These signals
act during cellularization and take the form of conventional extracellular
signal proteins. In particular, Dpp codes for a secreted TGFβ-family protein,
which forms a local morphogen gradient in the dorsal part of the embryo. Sog
encodes another secreted protein that is produced by the neurogenic ectoderm
(which gives rise to the nervous system) and acts as an antagonist of Dpp protein.
The opposing diffusion gradients of these two signal proteins create a steep gradient
of Dpp activity: the highest Dpp activity levels, in combination with certain
other factors, cause development of the most dorsal tissue of all, an extraembryonic
membrane. Intermediate levels cause development of dorsal epidermis; and
the absence of Dpp activity allows the development of neurogenic ectoderm
(Figure 21-27B).

Figure 21-26 The concentration gradient
of Dorsal protein in the nuclei of the
blastoderm. In wild-type Drosophila
embryos, the protein is present in the
dorsal cytoplasm and absent from the
dorsal nuclei; ventrally, it is depleted in the
cytoplasm and concentrated in the nuclei.
In a mutant in which the Toll pathway is
activated everywhere and not just ventrally,
Dorsal protein is everywhere concentrated
in the nuclei; the result is a ventralized
embryo. Conversely, in a mutant in which
the Toll signaling pathway is inactivated,
Dorsal protein everywhere remains in the
cytoplasm and is absent from the nuclei;
the result is a dorsalized embryo. (From
S. Roth, D. Stein and C. Nüsslein-Volhard,
Cell 59:1189-1202, 1989. With permission
from Elsevier.)

Figure 21-27 How morphogen gradients
guide a patterning process along the
dorsoventral axis of the Drosophila
embryo. (A) Initially, a gradient of Dorsal
protein defines three broad territories
of gene expression, marked here by
the expression of three representative
genes: Dpp, Sog, and Twist. (B) Slightly
later, the cells expressing Dpp and Sog
secrete, respectively, the signal proteins
Dpp (a TGFβ family member) and Sog (an
antagonist of Dpp). These two proteins
then diffuse and interact with one another
(and with certain other factors) to create the
dorsoventral (D-V) territories shown.


A Hierarchy of Inductive Interactions Subdivides the Vertebrate 
Embryo 

The molecular genetic analysis of Drosophila development has uncovered how
a cascade of transcription regulators and signaling pathways subdivides the
embryo. The same principle of progressive pattern refinement is used during
the development of all animal embryos, including vertebrates. Remarkably,
conservation is not restricted to the general strategy of pattern formation, but also
extends to many of the molecules involved.

As mentioned previously, the earliest phases of vertebrate development are
surprisingly variable, even between closely related species, and it is even hard
to say precisely how the axes of an early fly embryo correspond to those of an
early frog or mouse embryo. Nevertheless, we shall see that amid this display of
evolutionary plasticity, some features of early development turn out to be highly
conserved. The same is true of later developmental stages also, often to an
astonishing degree. From our own anatomy, it is obvious that we are cousins to birds
and fish. But looking at molecular mechanisms, we see that we are cousins to flies
and worms too.

In the following pages, we discuss how vertebrate embryos are patterned by
the interplay of signaling molecules and transcription regulators. We begin by
discussing the formation and patterning of the embryonic axes in amphibians,
taking the frog Xenopus as our example. We have already broached this topic earlier
in the chapter. Here, we pick up the thread and draw comparisons with the fly.

As noted earlier, the origins of the embryonic axes and the three germ layers
in the frog can be traced back to the blastula (see Figure 21-3A). By labeling
individual blastomeres, we can track cells through all their divisions, transformations,
and migrations and see what they become and where they come from. The
precursors of ectoderm, mesoderm, and endoderm are arranged in order along the
animal-vegetal axis of the blastula: the endoderm derives from the most vegetal
blastomeres, the ectoderm from the most animal, and the mesoderm from a
middle set. Within each of these territories, the cells have diverse fates according
to their positions along the D-V axis of the later embryo. For ectoderm, epidermal
precursors are located ventrally, and future neurons are found dorsally; for
mesoderm, precursors for notochord, muscle, kidney, and blood are arranged
from dorsal to ventral. All this can be represented by a fate map that shows which
cell types derive from which regions of the blastula (Figure 21-28). The fate map
confronts us with the central question: how are the cells in different positions
driven toward their different fates? We have already explained how maternal factors
deposited in the developing frog egg define its animal-vegetal axis, and how
cortical rotation triggered by fertilization defines the orientation of the dorsoventral
axis (see Figure 21-14). But how does the establishment of axes lead on to the
subdivision of the embryo into the future body parts?

The maternal gene products lead to the formation of signaling centers on the
vegetal and dorsal sides of the embryo. The dorsal signaling center in particular
has a special place in the history of developmental biology. Experiments in the
early twentieth century identified it as a small cluster of cells, located on the dorsal
side of the amphibian embryo, with an extraordinary property: when the cells
were transplanted to an opposite site, they could trigger a radical reorganization
of the neighboring tissue, causing it to form a second whole-body axis (Figure
21-29). The discovery of this signaling center, called the Organizer, led the way
to a pioneering analysis of the chain of inductive interactions that establish the
framework of the vertebrate body.

In contrast to the Drosophila syncytial embryo, the fertilized frog egg undergoes
rapid cleavage divisions that result in an embryo consisting of thousands of
cells. Patterning must therefore be mediated by extracellular signal molecules
that diffuse through the embryo from cell to cell, not by transcription regulators
that move through the cytoplasm of a syncytium. Not surprisingly, the Organizer
is now known to be a major source of secreted protein signals.

Figure 21-28 Blastula fate map in a
frog embryo. The endoderm derives
from the most vegetal blastomeres (yellow),
the ectoderm from the most animal (blue),
and the mesoderm from a middle set
(green) that contributes also to endoderm
and ectoderm. Different cell types
derive from different positions along the
dorsoventral axis.

Figure 21-29 Induction of a secondary
axis by the Organizer. An amphibian
embryo receives a graft of a small cluster
of cells taken from a specific site, called
the Organizer region, on the dorsal side
of another embryo at the same stage.
Signals from the graft organize the behavior
of neighboring cells of the host embryo,
causing development of a pair of conjoined
(Siamese) twins. See Movie 21.4. [After
J. Holtfreter and V. Hamburger, in Analysis
of Development (B.H. Willier, P.A. Weiss
and V. Hamburger, eds), pp. 230-296.
Philadelphia: Saunders, 1955.]

A Competition Between Secreted Signaling Proteins Patterns the
Vertebrate Embryo

The signal molecules that pattern the frog embryo along the animal-vegetal (A-V)
axis belong to the TGFβ family: they are secreted by a signaling center at the vegetal
pole and form concentration gradients along the A-V axis. The Nodal protein
acts over a relatively short range: cells near the vegetal pole are exposed to high
levels of it and respond by switching on genes that promote the development of
endoderm; cells further away are exposed to lower levels and activate genes that
promote the formation of mesoderm. The cells at the vegetal pole that produce
Nodal also produce a more rapidly diffusing TGFβ-like protein called Lefty, which
antagonizes Nodal. The result is a high ratio of Lefty to Nodal at the animal pole,
where Lefty predominates and Nodal signaling is blocked; this causes the cells
there to develop as ectoderm (Figure 21-30A). Thus, a mid-range activation by
Nodal, combined with a long-range inhibition by Lefty, sets up the pattern of
progenitors along the A-V axis for the three germ layers: endoderm, mesoderm, and
ectoderm.
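
The interplay of a shorter-range activator and a longer-range antagonist can be sketched numerically. The following Python example is a deliberately crude illustration: the exponential gradient shapes, the decay lengths, the Lefty source strength, the subtraction used to estimate net Nodal activity, and both fate thresholds are all assumptions chosen for this sketch, not values taken from the frog embryo.

```python
import math


def germ_layer(x, nodal_range=0.3, lefty_range=2.0, lefty_source=0.1,
               endoderm_threshold=0.5, mesoderm_threshold=0.05):
    """Assign a germ layer at position x (0 = vegetal pole, 1 = animal pole).

    All parameter values are hypothetical and serve only to illustrate
    the logic of activation by Nodal and inhibition by Lefty.
    """
    nodal = math.exp(-x / nodal_range)                 # activator, decays quickly
    lefty = lefty_source * math.exp(-x / lefty_range)  # antagonist, spreads further
    activity = nodal - lefty                           # crude net Nodal signaling
    if activity >= endoderm_threshold:
        return "endoderm"
    if activity >= mesoderm_threshold:
        return "mesoderm"
    return "ectoderm"


# Three sample positions: vegetal pole, middle of the axis, near the animal pole.
layers = [germ_layer(x) for x in (0.0, 0.4, 0.9)]
print(layers)
```

Because the assumed Lefty gradient decays more slowly than the Nodal gradient, Lefty outweighs Nodal only far from the source, which is where the sketch assigns ectoderm.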

The frog's dorsal signaling system uses a different set of secreted signals from
that of the vegetal signaling system to subdivide the germ-layer territories according
to location along the D-V axis of the embryo. It exerts its influence by secreting
two inhibitory signal proteins, called Chordin and Noggin. These antagonize the
action of bone morphogenetic proteins (BMPs; members of yet another subclass
of the TGFβ family), which themselves are secreted throughout the embryo. In
this way, Chordin and Noggin form a dorsal-to-ventral gradient that blocks BMP
signaling on the dorsal side but allows it to remain high on the ventral side (Figure
21-30B). Ectodermal cells that experience high levels of BMP signaling are driven
to epidermal fates, whereas cells that experience little or no BMP signaling remain
neural.

Knowing the signals that specify the three germ layers and various tissue
types of the vertebrate body, one can reproduce this specification in a culture
dish. Frog cells taken from the animal-pole region of the embryo, for example,
will differentiate into blood (a ventral mesodermal tissue) when diverted from
their original fate by exposure to intermediate concentrations of Nodal and high
concentrations of BMP. Similarly, mouse or human embryonic stem cells can be
coaxed to generate specific cell types by exposing them in culture to appropriate
combinations of signal molecules. In this way, the insights gained through studies
of animal development can be used to generate the cell types needed for regenerative
medicine, as we discuss in the next chapter.

Figure 21-30 How Nodal and bone
morphogenetic protein (BMP) signaling
pattern the embryonic axes. Nodal and its
antagonist Lefty pattern the animal-vegetal
axis, while BMP and its antagonists Chordin
and Noggin pattern the dorsoventral axis.
(A) In the animal pole region, where Nodal
levels are low relative to Lefty, Lefty blocks
Nodal from binding to its receptors. In the
vegetal region, there is an excess of Nodal,
resulting in Nodal pathway activation.
(B) Along the dorsoventral axis, BMP is
widely present but Chordin and Noggin
are concentrated at the dorsal side: there,
they bind to BMP and block its binding to
receptors. The resulting patterns of Nodal
and BMP activity are illustrated at the
bottom of the figure.

The Insect Dorsoventral Axis Corresponds to the Vertebrate 
Ventral-Dorsal Axis 

The signaling systems that pattern the D-V axis in Drosophila and in vertebrates
are similar. In Drosophila, as we saw, Dpp and its inhibitor Sog are responsible,
whereas in vertebrates, BMP and its inhibitors Chordin and Noggin do the job.
Dpp is a member of the BMP family, while Sog is a homolog of Chordin. Both in
flies and frogs, high levels of the inhibitors define the region that is neurogenic,
and high levels of BMP/Dpp activity define the region that is not. These and other
molecular parallels strongly suggest that this aspect of body patterning has been
conserved in evolution from insects to vertebrates. Curiously, however, the axis is
inverted: dorsal in the fly corresponds to ventral in the vertebrate (Figure 21-31).
At some point in evolution, it seems that the ancestor of one of these classes of
animals took to living life upside-down.


Hox Genes Control the Vertebrate A-P Axis 


The conservation of developmental mechanisms between Drosophila and vertebrates
extends beyond the D-V signaling system. Hox genes are found in almost
every animal species studied, where they are often grouped in complexes similar
to the insect Hox complex. In mice and humans, for example, there are four such
complexes (called the HoxA, HoxB, HoxC, and HoxD complexes), each on a different
chromosome. Individual genes in each complex can be recognized by their
sequences as counterparts of specific members of the Drosophila set. Indeed,
mammalian Hox genes can function in Drosophila as partial replacements for the
corresponding Drosophila Hox genes. It appears that each of the four mammalian
Hox complexes is, roughly speaking, the equivalent of one complete insect
Hox complex (that is, an Antennapedia complex plus a Bithorax complex) (Figure
21-32).

The ordering of the genes within each vertebrate Hox complex is essentially the
same as in the insect Hox complex, suggesting that all four vertebrate complexes
originated by duplications of a single primordial complex and have preserved its
basic organization. Most tellingly, when the expression patterns of the Hox genes
are examined in the vertebrate embryo, it turns out that the members of each
complex are expressed in a head-to-tail series along the axis of the body, just as
they are in Drosophila. As in Drosophila, vertebrate Hox gene expression patterns
are often aligned with vertebrate segments. This alignment is especially clear in
the hindbrain (see Figure 21-32), where the segments are called rhombomeres.

The products of the vertebrate Hox genes, the Hox proteins, specify positional
values that control the A-P pattern of parts in the hindbrain, neck, and trunk (as
well as some other parts of the body). As in Drosophila, when a posterior Hox gene
is artificially expressed in an anterior region, it can convert the anterior tissue to
a posterior character. Conversely, loss of posterior Hox genes allows the posterior
tissue where they are normally expressed to adopt an anterior character (Figure
21-33). Because of a redundancy between genes in the four Hox gene clusters,
the transformations observed in mouse Hox mutants are not always so straightforward
as those in the fly, and they are often incomplete. Nonetheless, it seems
clear that the fly and the mouse use essentially the same molecular machinery to
impart individual characteristics to successive regions along at least a part of the
A-P axis.

Figure 21-31 The vertebrate body plan
as a dorsoventral inversion of the insect
body plan. Note the correspondence with
regard to the circulatory system as well as
the gut and nervous system. In insects,
the circulatory system is represented by
a tubular heart and a main dorsal blood
vessel, which pumps blood out into the
tissue spaces through one set of apertures
and receives blood back from the tissues
through another set. Unlike vertebrates,
insects have no system of capillary vessels
to contain the blood as it percolates
through the tissues. Nevertheless, heart
development depends on homologous
genes in vertebrates and insects,
reinforcing the relationship between the
two body plans. (After E.L. Ferguson, Curr.
Opin. Genet. Dev. 6:424-431, 1996. With
permission from Elsevier.)

Figure 21-32 The Hox complexes of an
insect and a mammal, compared and
related to body regions. The genes of the
Antennapedia and Bithorax complexes of
Drosophila are shown in their chromosomal
order in the top line. The corresponding
genes of the four mammalian Hox
complexes are shown below, also in
chromosomal order. The gene expression
domains in fly and mammal are indicated
in a simplified form by color in the cartoons
of animals above and below. There is a
remarkable parallelism. However, the details
of the patterns depend on developmental
stage and vary somewhat from one
mammalian Hox complex to another. Also,
in many cases, genes shown here as
expressed in an anterior domain are also
expressed more posteriorly, overlapping the
domains of more posterior Hox genes.
The complexes are thought to have
evolved as follows: first, in some common
ancestor of worms, flies, and vertebrates,
a single primordial homeotic selector gene
underwent repeated duplication to form
a series of such genes in tandem, the
ancestral Hox complex. In the Drosophila
sublineage, this single complex became
split into separate Antennapedia and
Bithorax complexes. Meanwhile, in the
lineage leading to the mammals, the whole
complex was repeatedly duplicated to give
four Hox complexes. The parallelism is not
perfect because apparently some individual
genes have been duplicated and others
lost. Still others have been co-opted for
different purposes (genes in parentheses in
the top line) over the time that has elapsed
since the complexes diverged. (Based on a
diagram courtesy of William McGinnis.)

Some Transcription Regulators Can Activate a Program That
Defines a Cell Type or Creates an Entire Organ

Just as there are genes that regulate pattern formation and segmental identity,
there are genes whose products act as triggers for the development of a specific
cell type or even a specific organ, initiating and coordinating the whole complex
program of gene expression that is required. An example is the MyoD/myogenin
family of transcription regulators that we encountered in Chapter 7. These proteins
drive cells to differentiate into muscle, expressing muscle-specific actins
and myosins and all the other specialized cytoskeletal, metabolic, and membrane
proteins that a muscle cell needs. Analogously, members of the Achaete/Scute
family of transcription regulators drive cells to become neural progenitors. In
both these examples, the proteins belong to the basic helix-loop-helix (bHLH)
class of transcription regulators (see p. 377), and the same is true for many of the
other proteins that induce the differentiation of particular cell types. These master
transcription regulators exert their powerful differentiation-inducing activity by
binding to many different regulatory sites in the genome and thereby controlling
the expression of large numbers of downstream target genes. In one well-studied
case, that of an Achaete/Scute family member called Atonal homolog 1 (Atoh1),
the number of direct target genes in the mouse genome is more than 600. It is
important to note, however, that even such powerful drivers of cell differentiation
can have radically different effects according to the context and history of the cells
in which they act: Atoh1, for example, drives differentiation of certain classes of
neurons in the brain, of sensory hair cells in the inner ear, and of secretory cells in
the lining of the gut.

Other genes encoding transcription regulators can drive the formation and
assembly of the multiple cell types that constitute an entire organ. A famous
example is the transcription regulator Eyeless. When it is artificially expressed in a
patch of cells in the leg precursors of Drosophila, a well-organized eye-like organ
develops on the leg, with the various eye cell types correctly arranged (see Figure
7-35B); conversely, loss of the Eyeless gene results in flies that lack eyes. Moreover,
loss of the Eyeless homolog Pax6 in vertebrates likewise leads to loss of eye
structures. Similar organ-selector proteins are known for foregut, heart, pancreas,
and other organs. They are all master transcription regulators that directly regulate
hundreds of target genes, the products of which then specify and construct
the different elements of the appropriate organ. However, as in the example of
Atoh1, they usually exert their specific effect only in combination with the right
partners, which are only expressed in cells that were appropriately primed during
their earlier development.


Figure 21-33 Control of anteroposterior
pattern by Hox genes in the mouse.
(A, B) A normal mouse (wild type) has
about 65 vertebrae, differing in structure
according to their position along the
body axis: 7 cervical (neck), 13 thoracic
(with ribs), 6 lumbar [bracketed by yellow
asterisks in (B)], 4 sacral [bracketed by red
asterisks in (B)], and about 35 caudal (tail).
(A) shows a side view and (B) shows a
dorsal view; for clarity, the limbs have been
removed in each picture.
(C) The HoxA10 gene is normally
expressed in the lumbar region (together
with its paralogs HoxC10 and HoxD10);
here it has been artificially expressed in
the developing vertebral tissue all along
the body axis. As a result, the cervical and
thoracic vertebrae are all converted to a
lumbar character. (D) Conversely, when
HoxA10 is removed along with HoxC10
and HoxD10, vertebrae that should
normally have a lumbar or sacral character
take on a thoracic character instead. (A and
C, from M. Carapuço et al., Genes Dev.
19:2116-2121, 2005. With permission
from Cold Spring Harbor Laboratory
Press; B and D, from D.M. Wellik and
M.R. Capecchi, Science 301:363-367,
2003.)


Notch-Mediated Lateral Inhibition Refines Cellular Spacing
Patterns

After the establishment of the basic body plan and the generation of organ precursors,
many further steps of pattern refinement are required to achieve the adult
pattern of terminally differentiated cells in tissues and organs. As we discussed
earlier, lateral inhibition mediated by Notch signaling is crucial for both cell
diversification and fine-grained patterning in an enormous variety of tissues in all
animals.

One example is the development of sensory bristles in Drosophila, most easily
seen on the fly's back, but also present on most of its other exposed surfaces. Each
of these is a miniature sense organ, consisting of a sensory neuron and a small set
of supporting cells. Some bristles respond to chemical stimuli, others to mechanical
stimuli, but they are all constructed in a similar way (Figure 21-34). The
proneural genes Achaete and Scute mentioned earlier mark the patches of epidermis
within which bristles will form. Mutations that eliminate the expression of these
genes at some of their usual sites block development of bristles at just those sites,
and mutations that cause expression in abnormal sites cause bristles to develop
there.

The initial cells expressing the proneural genes are called proneural cells, and
they are primed to take the neurosensory pathway of differentiation, but which
of the cells will actually do so depends on competitive interactions among them.
In the first round of these interactions, a single cell within each small group of
proneural cells is picked to serve as the progenitor of the bristle. This single cell is
called the sensory mother cell. It becomes distinct from the other cells of the cluster
through lateral inhibition mediated by the Notch signaling pathway. This operates
in the way we discussed earlier. The cells in the proneural cluster initially all
express both the transmembrane receptor Notch and its transmembrane ligand
Delta, along with proteins that regulate the signaling activity of Delta. Wherever
Delta activates Notch, an inhibitory signal is transmitted that diminishes the
tendency of the Notch-activated cell to specialize as a sensory mother cell. At first,
all the cells in the cluster inhibit one another. However, receipt of the signal in a
given cell diminishes that cell's ability to fight back by delivering the inhibitory
Delta signal in return. This creates a competitive situation, from which a single
cell in each cluster (the future sensory mother cell) eventually emerges as winner,
sending a strong inhibitory signal to its immediate neighbors but receiving
no such signal in return (Figure 21-35). If a cell that would normally become a
sensory mother cell is genetically disabled from doing so, a neighboring proneural
cell, freed from lateral inhibition, will become a sensory mother cell instead.
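
The way a slight initial advantage becomes amplified can be illustrated with a minimal two-cell simulation in Python. This is a sketch under stated assumptions, not a model taken from the text: each cell is reduced to a single Delta level between 0 and 1, Notch activity in a cell is set equal to its neighbor's Delta level, and the inhibitory response is a Hill-type function whose half-maximal point, steepness, update rate, and step count are all hypothetical.

```python
def delta_production(notch_activity, half_max=0.5, steepness=4):
    """Delta production falls as Notch activity rises (assumed Hill-type inhibition)."""
    return 1.0 / (1.0 + (notch_activity / half_max) ** steepness)


def lateral_inhibition(delta_a, delta_b, steps=200, rate=0.2):
    """Iterate two mutually inhibiting cells and return their final Delta levels."""
    for _ in range(steps):
        # Each cell's Notch is activated by the Delta displayed on its neighbor.
        notch_a, notch_b = delta_b, delta_a
        new_a = delta_a + rate * (delta_production(notch_a) - delta_a)
        new_b = delta_b + rate * (delta_production(notch_b) - delta_b)
        delta_a, delta_b = new_a, new_b
    return delta_a, delta_b


# Cell A starts with a tiny advantage in Delta; cell B is otherwise identical.
winner, loser = lateral_inhibition(0.51, 0.50)
print(round(winner, 2), round(loser, 2))

# Two exactly equivalent cells stay equivalent in this deterministic sketch.
tie_a, tie_b = lateral_inhibition(0.5, 0.5)
```

In this toy system the cell with the small head start ends with high Delta and its neighbor with low Delta, mirroring the winner-takes-all outcome described above. In the exactly symmetric case nothing breaks the tie, which suggests why small random differences between cells matter.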

The sensory mother cell goes through a short program of further divisions to
generate the set of cells that form the final bristle. Notch signaling acts repeatedly
at successive stages in this program to drive the descendants of the sensory
mother cell along different pathways and assign them to their various specialized
fates. However, it does so in conjunction with additional mechanisms that bias
the outcome of the competition mediated by lateral inhibition. Determinants that
are asymmetrically localized inside the dividing cells have this role in sensory
bristle development. They are also important in other contexts, as we now discuss.

Figure 21-34 The basic structure of a
mechanosensory bristle. The lineage of
the four cells of the bristle, all descendants
of a single sensory mother cell, is shown
on the left. The sensory mother cell, once
it is specified, generates this set of cells
through a short program of division cycles.
In each generation of the progeny, lateral
inhibition operates again to drive the
newborn cells toward different fates: one
of the ultimate progeny will become the
neuron; another, the shaft of the bristle;
others, supporting cells of various sorts.
As the sensory mother cell and its progeny
divide, certain proteins are allocated
preferentially to one of each pair of
newborn sister cells, biasing the outcome
of the lateral-inhibition competition
mediated by Notch signaling.


Asymmetric Cell Divisions Make Sister Cells Different 

Cell diversification does not always have to depend on extracellular signals: in
some cases, sister cells are born different as a result of an asymmetric cell division,
during which some significant set of molecules is divided unequally between
them. This asymmetrically segregated molecule (or set of molecules) then acts as
a determinant for one of the cell fates by directly or indirectly altering the pattern
of gene expression within the daughter cell that receives it (see Figure 21-12). We
have already encountered the asymmetric segregation of molecules in the context
of the early frog embryo: VegT RNA is localized in the vegetal region of the fertilized
egg. Following cell division, only vegetal daughter cells will inherit VegT RNA.

Asymmetric divisions often occur at the beginning of development, but they
are also encountered at some later stages. As mentioned for the sensory bristle,
they can set the scene for an exchange of Notch signals between the daughter
cells, with the signaling occurring after the cells have become separate and
reinforcing the differences between them. In the central nervous system, asymmetric
divisions have a key role in generating the very large numbers of neurons and
glial cells that are needed. A special class of cells becomes committed as neural
precursors, but instead of differentiating directly as neurons or glial cells, these
undergo a long series of asymmetric divisions through which a succession of
additional neurons and glial cells are added to the population. The process is best
understood in Drosophila, although there are many hints that something similar
occurs also in vertebrate neurogenesis.

In the embryonic central nervous system of Drosophila, the nerve-cell precursors,
or neuroblasts, are initially singled out from the neurogenic ectoderm by a typical
lateral-inhibition mechanism that depends on Notch. Each neuroblast then divides
repeatedly in an asymmetric fashion (Figure 21-36). At each division, one daughter
remains as a neuroblast, while the other, which is much smaller, becomes specialized
as a ganglion mother cell. Each ganglion mother cell will divide only once, giving a
pair of neurons, or a neuron plus a glial cell, or a pair of glial cells, with
Notch-mediated interactions helping to drive the daughters along different paths.
The neuroblast itself becomes smaller at each division, as it parcels out its
substance into one ganglion mother cell after another. Eventually, typically after
about 12 cycles, the process halts, presumably because the neuroblast becomes too
small to pass the cell-size checkpoint in the cell-division cycle. Later, in the
larva, neuroblast divisions resume, but now they are accompanied by cell growth,
permitting the process to continue indefinitely and to generate the much larger
numbers of neurons and glial cells required in the adult fly.

Figure 21-35 Lateral inhibition. (A) The basic mechanism of Notch-mediated
competitive lateral inhibition, illustrated for just two interacting cells. In this
diagram, the absence of color on proteins or effector lines indicates inactivity.
(B) The outcome of the same process operating in a larger patch of cells. At first,
all cells in the patch are equivalent, expressing both the transmembrane receptor
Notch and its transmembrane ligand Delta. Each cell has a tendency to specialize (as
a sensory mother cell), and each sends an inhibitory signal to its neighbors to
discourage them from also specializing in that way. This creates a competitive
situation. As soon as an individual cell gains any advantage in the competition,
that advantage becomes magnified. The winning cell, as it becomes more strongly
committed to differentiating as a sensory mother cell, also inhibits its neighbors
more strongly. Conversely, as these neighbors lose their capacity to differentiate
as sensory mothers, they also lose their capacity to inhibit other cells from doing
so. Lateral inhibition thus makes adjacent cells follow different fates. Although
the interaction is thought to be normally dependent on cell-cell contacts, the
future sensory mother cell may be able to deliver an inhibitory signal to cells that
are more than one cell diameter away, for example, by sending out long protrusions
to touch them.
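The competition described in the caption of Figure 21-35 can be illustrated with a
minimal two-cell simulation. This is only a sketch under stated assumptions: the
Hill-type functions `notch_activation` and `delta_production`, their parameter
values, the starting levels, and the simple Euler time-stepping are all illustrative
choices and are not taken from the text. Each cell's Notch activity is driven by its
neighbor's Delta, and Notch activity in turn suppresses the cell's own Delta, so a
small initial advantage is amplified.

```python
def notch_activation(delta_neighbor, a=0.01, k=2):
    """Assumed response of Notch activity to the neighbor's Delta (rises with Delta)."""
    return delta_neighbor**k / (a + delta_neighbor**k)


def delta_production(notch_self, b=100.0, h=2):
    """Assumed response of Delta to the cell's own Notch activity (falls with Notch)."""
    return 1.0 / (1.0 + b * notch_self**h)


def simulate_two_cells(delta_start=(0.08, 0.07), notch_start=(0.35, 0.35),
                       t_end=50.0, dt=0.01):
    """Euler integration of two mutually inhibiting cells.

    Levels are in arbitrary units between 0 and 1. Cell 0 starts with
    slightly more Delta than cell 1 (a hypothetical initial advantage).
    Returns (notch, delta), each a two-element list.
    """
    delta = list(delta_start)
    notch = list(notch_start)
    for _ in range(int(round(t_end / dt))):
        # Notch in each cell is activated by Delta on the *other* cell.
        d_notch = [notch_activation(delta[1]) - notch[0],
                   notch_activation(delta[0]) - notch[1]]
        # Delta in each cell is repressed by that cell's *own* Notch activity.
        d_delta = [delta_production(notch[0]) - delta[0],
                   delta_production(notch[1]) - delta[1]]
        notch = [notch[i] + dt * d_notch[i] for i in range(2)]
        delta = [delta[i] + dt * d_delta[i] for i in range(2)]
    return notch, delta


if __name__ == "__main__":
    notch, delta = simulate_two_cells()
    for i in range(2):
        print(f"cell {i}: Notch activity {notch[i]:.2f}, Delta {delta[i]:.2f}")
```

With these assumed parameters, the cell that starts with slightly more Delta ends
with high Delta and low Notch activity (the future sensory mother cell), while its
neighbor ends in the opposite state.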

Differences in Regulatory DNA Explain Morphological Differences

In the preceding sections, we have seen that animals contain the same essential
cell types, have a similar collection of genes, and share many of the molecular
mechanisms of pattern formation. But how can we square this with the radical
differences that we see in the body structures of animals as diverse as a worm, a
fly, a frog, and a mouse? We asserted earlier, in a general way, that these
differences usually seem to reflect differences in the regulatory DNA that calls
into play the components of the conserved basic kit of parts. We must now examine
the evidence a little more closely.

When we compare animal species with similar basic body plans (different
vertebrates, for example, such as fish, birds, and mammals), we find that
corresponding genes usually have similar sets of regulatory elements: the regulatory
DNA sequences have been well conserved and are recognizably homologous in the
different animals. The same is true if we compare different species of nematode
worms or insects. But, when we compare vertebrate regulatory regions with those of
worms or flies, it is hard to see any such resemblance. The protein-coding sequences
are unmistakably similar, but the corresponding regulatory DNA sequences appear
mostly very different, suggesting that the differences in body plans mainly reflect
differences in regulatory DNA. Although variations in the proteins themselves also
contribute, differences in regulatory DNA would be enough to generate radically
different tissues and body structures even if the proteins were the same.

It is not yet possible to trace the genetic steps that have led to all the
spectacular diversity of animals. Their lineages have diverged over hundreds of
millions of years, and in most cases too many changes have occurred for us to be
able to say that this or that feature results from this or that mutation. The
picture is clearer, however, for more recent evolutionary events. Studies of both
closely related animal populations and plant populations whose members have
different morphologies have revealed that dramatic developmental effects can result
from subtle changes in regulatory DNA.

A well-studied example is the morphological diversity found in stickleback fish.
After the last ice age ended about 10,000 years ago, marine sticklebacks colonized
many newly formed freshwater streams and lakes. Marine sticklebacks extend sharp
spines from their pelvic skeleton. These spines are thought to help protect the fish
from soft-mouthed fish predators. In contrast, several populations of freshwater
sticklebacks have lost these spines, usually in lakes that lack such predators. The
different morphologies reflect differences in control of the expression of a
transcription regulator called Pitx1. Whereas marine sticklebacks express the Pitx1
gene in the pelvic bone precursor cells that will form the spikes, freshwater
sticklebacks have lost this expression as a result of a change at the Pitx1 locus.
These changes do not lie in the coding sequence. Instead, each is a small deletion
of a block of adjacent regulatory DNA that controls Pitx1 expression specifically in
the pelvic cells (Figure 21-37).

Figure 21-36 Neuroblasts and asymmetric cell division in the central nervous system
of a fly embryo. The neuroblast originates as a specialized ectodermal cell. It is
singled out by lateral inhibition and emerges from the basal (internal) face of the
ectoderm. It then goes through repeated division cycles, dividing asymmetrically to
generate a series of ganglion mother cells. Each ganglion mother cell divides just
once to give a pair of differentiated daughters (typically a neuron plus a glial
cell).

Figure 21-37 Morphological diversity in stickleback fish is caused by changes in
regulatory elements. (A-D) Pelvic spines are present in marine (A) but not in
freshwater (C) populations. Correspondingly, Pitx1 is expressed in the pelvic area
in marine (B) but not in freshwater (D) fish. The lack of expression in the pelvic
area of freshwater populations is caused by mutations in an enhancer element. Other
enhancers and sites of expression for Pitx1 are the same in marine and freshwater
sticklebacks. (Courtesy of Michael D. Shapiro.)

The Pitx1 protein has important functions elsewhere in the body, so that the
DNA sequences that encode this protein must be retained. The regulatory DNA
responsible for Pitx1 expression at these other sites is also unchanged in the two
populations of sticklebacks. The evolution of pelvis development in sticklebacks
shows how the modular nature of regulatory DNA elements that we encountered
in Chapter 7 (see Figure 7-29) allows independent modification of the different
parts of the body, even when formation of those body parts depends on the same
proteins.

In the recent evolution of plants, changes of body structure can be traced in
a similar way to changes in regulatory DNA. For example, these account for a
large part of the dramatic difference between the wild teosinte plant and its
modern descendant, maize, through some 10,000 years of mutation and selection by
Native Americans.

Summary 

Drosophila has been the foremost model organism for the study of the genetics of
animal development. Its embryonic pattern is initiated by the products of
maternal-effect genes called egg-polarity genes, which operate by setting up graded
distributions of transcription regulators in the egg and early embryo. The gradient
of Bicoid protein along the A-P axis, for example, helps initiate the orderly
expression of gap genes, pair-rule genes, and segment-polarity genes. These three
classes of segmentation genes, through a hierarchy of interactions, become expressed
in some regions of the embryo and not others, progressively subdividing the embryo
along the A-P axis into a regular series of repeating modular units called segments.

Superimposed on the pattern of gene expression that repeats itself in every
segment, there is a serial pattern of expression of Hox genes that confer on each
segment a different identity. These genes are grouped in complexes and are arranged
in a sequence that matches their sequence of expression along the A-P axis of the
body.

Although Hox gene expression is initiated in the embryo, it is subsequently
maintained by the action of chromatin-binding proteins of the Polycomb and Trithorax
group, which stamp the chromatin of the Hox complex with a heritable record of
its embryonic state of repression or activation, respectively. Hox complexes
homologous to that of Drosophila are found in virtually every type of animal, where
they help pattern the A-P axis of the body.

Signaling gradients are also set up along the dorsoventral (D-V) axis. Initially,
Toll signaling generates a nuclear gradient of Dorsal protein, which induces an
extracellular signaling gradient of the TGFβ-family protein Dpp and its antagonist,
Sog. This creates a gradient of Dpp activity that helps refine the assignment of
different characters to cells at different positions along the D-V axis.

In Xenopus, the polarity of the egg and the site of sperm entry set up the
embryonic axes. A gradient generated by the TGFβ-family protein Nodal induces
different fates along the animal-vegetal axis, whereas BMP and Chordin (proteins
homologous to Drosophila Dpp and Sog, respectively) control the patterning of the
D-V axis. This axis is inverted, so that dorsal in the fly corresponds to ventral in
the frog.

Transcription regulators control the formation of specific cell types. Members
of the MyoD/myogenin family drive the process of muscle cell determination,
coordinating the many components required, whereas Achaete/Scute transcription
regulators control neural fate. Other genes encoding such master transcriptional
regulators can regulate the formation of entire organs. Eyeless, for example, is
both necessary and sufficient to generate eye structures in Drosophila.

To refine the anatomical pattern within such an organ, the cells interact locally,
both by diffusible inductive signals and by short-range mechanisms. Often, the cells
compete with one another by lateral inhibition. This process results in activation of
the Notch signaling pathway in one cell and inhibition in its neighbors, generating
two different cell types. Asymmetric cell divisions, in which daughter cells inherit
different molecular determinants from the mother cell, provide an additional way
to organize a fine-grained diversity of cell types.

Evidence from recent evolutionary events indicates that anatomical changes are
mostly driven by changes in regulatory DNA sequences that determine when and
where developmental genes are expressed. How the striking diversity in body
structures has evolved over longer times remains largely unknown, although it seems
likely that similar principles apply.

DEVELOPMENTAL TIMING 

Developmental events unfold over minutes, hours, days, weeks, months, or even
years, with each organism following its own strict timetable. The cascades of
inductive interactions and transcriptional regulatory events described earlier take
time, as signals are transmitted and transcription regulators are synthesized and
then bind to DNA to activate or repress their target genes. At the beginning of this
chapter, we compared development with an orchestral performance. There are
many players, and each must do the right thing at the right time; yet there is no
leader or conductor to set the tempo and coordinate the timing of all the different
events. Each developmental process must thus occur at an appropriate rate,
tuned by evolution to fit with the timing of other processes in the embryo or in
the environment. The control of timing is one of the most important problems in
developmental biology, but also one of the least understood.

Molecular Lifetimes Play a Critical Part in Developmental Timing 

Developmental processes are complex, but they are built up from simple steps. A
first challenge is to understand the timing of these steps. How long does it take,
for example, to switch the expression of a gene on or off? This is not like throwing
a light switch: it involves delays. First, it takes time to make an mRNA molecule:
the RNA polymerase must travel the length of the gene, the primary RNA transcript
must be spliced and otherwise processed, and the resulting mRNA must be exported
from the nucleus and delivered to the site where it will be translated. This adds up
to what one might call the gestation time of the individual molecule. Second, it
takes time for the individual mRNA molecules to accumulate to their fully effective
concentration; as explained in Chapter 15, this accumulation time is dictated by the
average lifetime of the molecules: the longer they last, the higher their ultimate
concentration, and the longer the time taken to attain it. Similar delays occur at
the next step, where the mRNA is translated into protein: synthesis of each
individual protein molecule involves a gestation delay, and attainment of an
effective concentration of protein molecules involves an accumulation delay that
depends on the protein's lifetime. The time for the whole gene switching process is
just the sum of the gestation delays and the accumulation delays (basically, the
molecular lifetimes) for both the mRNA and the protein molecules. Somewhat
counterintuitively, it is the combined length of these delays, rather than the rate
of molecular synthesis (the number of molecules synthesized per second), that
chiefly determines the switching time.
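The claim that lifetime, not synthesis rate, sets the accumulation time can be
checked with a small calculation. The sketch below assumes the simplest possible
model, which is not spelled out in the text: molecules are made at a constant rate
after switch-on and each is lost with a fixed average lifetime, so the level obeys
dc/dt = synthesis_rate - c / lifetime. The function names and the example numbers
are hypothetical.

```python
import math


def steady_state_level(synthesis_rate, lifetime):
    """Ultimate level (molecules per cell): synthesis rate times average lifetime."""
    return synthesis_rate * lifetime


def level_at(t, synthesis_rate, lifetime):
    """Level at time t after switch-on, starting from zero molecules.

    Closed-form solution of dc/dt = synthesis_rate - c / lifetime.
    """
    return synthesis_rate * lifetime * (1.0 - math.exp(-t / lifetime))


def accumulation_time(lifetime, fraction=0.5):
    """Time needed to reach `fraction` of the steady-state level.

    Solves 1 - exp(-t / lifetime) = fraction; the synthesis rate cancels out.
    """
    return -lifetime * math.log(1.0 - fraction)


if __name__ == "__main__":
    # Hypothetical (rate per minute, lifetime in minutes) combinations.
    for rate, lifetime in [(1.0, 10.0), (5.0, 10.0), (1.0, 40.0)]:
        print(f"rate={rate}, lifetime={lifetime}: "
              f"final level={steady_state_level(rate, lifetime):.0f}, "
              f"time to half-maximum={accumulation_time(lifetime):.1f} min")
```

Under this assumed model, raising the synthesis rate fivefold raises the final level
fivefold but leaves the time to half-maximum unchanged, whereas a fourfold longer
lifetime makes both the final level and the accumulation time four times larger.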

The same additive principle applies to long cascades of gene switching, where
gene A activates gene B, and gene B activates gene C, and so on. It also applies
in other circumstances, such as in signaling pathways where one protein directly
regulates the activation of the next. In all these cases, molecular lifetimes, along
with gestation delays, play a key part in determining the pace of development.
The lifetimes of mRNA and protein molecules are enormously variable, from a few
minutes or hours to days or more, explaining much of the variation we see in the
tempo of developmental events.

Gene switching delays, however, are not the be-all and end-all of developmental
timing. Development involves many other kinds of delay that contribute to timing.
Chromatin structure takes time to remodel. Inductive signals take time to diffuse
across a field of cells (see Figure 21-9). Cells take time to move and rearrange
themselves in space. Nevertheless, the timing of gene switching plays a fundamental
part in developmental timing, as illustrated in an especially clear and striking way
by a gene-expression oscillator that controls the segmentation of the vertebrate
body axis, as we now explain.

A Gene-Expression Oscillator Acts as a Clock to Control 
Vertebrate Segmentation 

The main body axis of all vertebrates has a repetitive, periodic structure, seen in
the series of vertebrae, ribs, and segmental muscles of the neck, trunk, and tail.
These segmental structures originate from the mesoderm that lies as a long slab
on either side of the embryonic midline. This slab becomes broken up into a regular
repetitive series of separate blocks, or somites: cohesive groups of cells,
separated by clefts (Figure 21-38A). The somites form (as bilateral pairs) one after
another, in a regular rhythm, starting in the region of the head and ending in the
tail. Depending on the species, the final number of somites ranges from less than
40 (in a frog or a zebrafish) to more than 300 (in a snake).

The posterior, most immature part of the mesodermal slab, called the presomitic
mesoderm, supplies the required cells: as the cells proliferate, this mesoderm
retreats tailward, extending the embryo (Figure 21-38B). In the process, it deposits
a trail of somites formed from cells that group together into blocks as they emerge
from the anterior end of the presomitic region. The special character of the
presomitic mesoderm is maintained by a combination of fibroblast growth factor (FGF)
and Wnt signals, produced by a signaling center at the tail end of the embryo, and
the range of these signals seems to define the length of the presomitic mesoderm.
The somites emerge with clocklike timing, but what determines the rhythm of the
process?

Figure 21-38 Somite formation in the chick embryo. (A) A chick embryo at 40 hours
of incubation. (B) How the temporal oscillation of gene expression in the presomitic
mesoderm becomes converted into a spatial alternating pattern of gene expression in
the formed somites. In the posterior part of the presomitic mesoderm, each cell
oscillates with a cycle time of 90 minutes. As cells mature and emerge from the
presomitic region, their oscillation is gradually slowed down and finally brought to
a halt, leaving them in a state that depends on the phase of the cycle they happen
to be in at the critical moment. In this way, a temporal oscillation of gene
expression traces out an alternating spatial pattern. (A, from Y.J. Jiang,
L. Smithers and J. Lewis, Curr. Biol. 8:R868-R871, 1998. With permission from
Elsevier.)

In the posterior part of the presomitic mesoderm, the expression of certain
genes oscillates in time. Snapshots of gene expression taken by fixing embryos
for analysis at different times in the oscillation cycle reveal what is happening,
and the oscillations can now also be observed in time-lapse movies of embryos
containing fluorescent reporters of individual oscillating genes. One new somite
pair is formed in each oscillation cycle, and, in mutants where the oscillations fail
to occur, somite segmentation is disrupted: the cells may still break up, belatedly,
into separate clusters, but they do so in a haphazard, irregular way. The
gene-expression oscillator controlling regular segmentation is called the
segmentation clock. The length of one complete oscillation cycle depends on the
species: it is 30 minutes in a zebrafish, 90 minutes in a chick, 120 minutes in a
mouse.

As cells emerge from the presomitic mesoderm to form somites (in other words, as
they escape from the influence of the FGF and Wnt signals), their oscillation stops.
Some become arrested in one state, some in another, according to the phase of the
oscillation cycle at the time they leave the presomitic region. In this way, the
temporal oscillation of gene expression in the presomitic mesoderm leaves its trace
in a spatially periodic pattern of gene expression in the maturing mesoderm; this in
turn dictates how the tissue will break up into physically separate blocks, through
effects on the pattern of cell-cell adhesion (see Figure 21-38B).
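How a temporal oscillation leaves a spatial trace can be sketched in a few lines.
The toy model below makes several simplifying assumptions that go beyond the text:
cells leave the presomitic mesoderm one at a time at a constant rate, in
anterior-to-posterior order; the oscillation stops abruptly rather than slowing
gradually; and the arrested state is reduced to two labels, "A" for the first half
of the cycle and "B" for the second. The function names and the 7.5-minute exit
interval are hypothetical.

```python
def arrested_state(exit_time_min, period_min):
    """State frozen in when a cell leaves the presomitic mesoderm."""
    phase = (exit_time_min % period_min) / period_min  # 0 <= phase < 1
    return "A" if phase < 0.5 else "B"


def spatial_pattern(n_cells, minutes_between_exits, period_min):
    """Anterior-to-posterior string of arrested states, one letter per cell."""
    return "".join(
        arrested_state(i * minutes_between_exits, period_min)
        for i in range(n_cells)
    )


def cells_per_somite(minutes_between_exits, period_min):
    """Cells laid down during one full oscillation cycle (one somite length)."""
    return period_min / minutes_between_exits


if __name__ == "__main__":
    # A 30-minute cycle (the zebrafish figure quoted in the text) and a slower clock.
    for period in (30.0, 60.0):
        print(period, spatial_pattern(16, 7.5, period),
              cells_per_somite(7.5, period))
```

In this sketch, doubling the clock period while keeping the exit rate fixed doubles
the length of each repeat in the pattern, which is one way to see why the period of
oscillation is linked to somite size.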

How does the segmentation clock work? The first somite oscillator genes to
be discovered were Hes genes, which are key components of the Notch signaling
pathway. They are directly regulated by the activated form of Notch, and they code
for inhibitory transcription regulators that inhibit the expression of other genes,
including Delta. As well as regulating other genes, the products of Hes genes can
directly regulate their own expression, creating a remarkably simple negative
feedback loop. Autoregulation of certain specific Hes genes (depending on species)
is thought to be the basic generator of the oscillations of the somite clock.
Although the machinery has been modified in various ways in different species,
the underlying principle seems to be conserved. When the key Hes gene is
transcribed, the amount of Hes protein product builds up until it is sufficient to
block Hes gene transcription; synthesis of the protein ceases; the protein then
decays, permitting transcription to begin again; and so on, cyclically (Figure
21-39). The period of oscillation, which determines the size of each somite, depends
on the delay in the feedback loop. This equals the sum of the gestation delays and
accumulation delays (that is, the molecular lifetimes) of the Hes mRNA and protein
molecules, according to the additive principle discussed earlier. Mathematical
modeling (see Chapter 8) allows us to relate these basic molecular parameters to
the cycle time of the segmentation clock: to a first approximation, the cycle period
is simply equal to twice the total delay in the negative feedback loop, and thus
twice the sum of the delays occurring at each step of the loop.
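The first-approximation rule stated above (period equals twice the total loop delay)
can be written out as a small worked example. The delay values used here are
hypothetical, chosen only so that the arithmetic lands on a 30-minute cycle like the
zebrafish figure quoted in the text; they are not measured parameters, and the
function names are illustrative.

```python
def total_loop_delay(gestation_delays_min, lifetimes_min):
    """Sum of gestation delays and lifetimes around the feedback loop (minutes)."""
    return sum(gestation_delays_min) + sum(lifetimes_min)


def estimated_period(gestation_delays_min, lifetimes_min):
    """First-approximation oscillation period: twice the total loop delay."""
    return 2.0 * total_loop_delay(gestation_delays_min, lifetimes_min)


if __name__ == "__main__":
    # Hypothetical values in minutes: (mRNA, protein).
    gestation = (7.0, 2.0)  # time to make and deliver one mRNA / one protein
    lifetimes = (3.0, 3.0)  # average mRNA / protein lifetime
    print("total delay:", total_loop_delay(gestation, lifetimes), "min")
    print("estimated period:", estimated_period(gestation, lifetimes), "min")
```

Because the estimate is a plain sum, lengthening any single step (for example, a
longer gene that takes more time to transcribe) lengthens the predicted period by
twice that amount.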

The feedback loop just described is intracellular, and each cell in the presomitic
mesoderm can generate oscillations on its own. But these oscillations at the
single-cell level are somewhat erratic and imprecise, reflecting the fundamentally
noisy, stochastic nature of the control of gene expression, as discussed in Chapter
7. A mechanism is needed to keep all the cells in the presomitic mesoderm that
will form a particular somite oscillating in synchrony. This is achieved through
cell-cell communication via the Notch signaling pathway, to which the Hes genes
are coupled. The gene regulatory circuitry is such that in this context Notch
signaling does not drive neighboring cells to be different, as in lateral
inhibition, but does just the opposite: it keeps them in unison. In mutants where
Notch signaling fails, including mutants defective in Delta or Notch itself, the
cells drift out of synchrony and somite segmentation is again disrupted. This leads
to gross deformity of the vertebral column: an extraordinary display of the
consequences of the noisy temporal control of gene expression at the single-cell
level, writ large in the structure of the vertebrate body as a whole.

Figure 21-39 Delayed negative feedback giving rise to oscillating gene expression.
(A) A single gene, coding for a transcription regulator that inhibits its own
expression, can behave as an oscillator. For oscillation to occur, there must be a
delay (or several delays) in the feedback circuit, and the lifetimes of the mRNA and
protein (which contribute to the delay) must be short compared with the total delay.
The total delay determines the period of oscillation. It is thought that a feedback
circuit like this, based on a pair of redundantly acting genes called Her1 and Her7
in the zebrafish (or their counterpart, Hes7, in the mouse), is the pacemaker of the
segmentation clock governing somite formation. (B) The predicted oscillation of Her1
and Her7 mRNA and protein, computed using rough estimates of the feedback circuit
parameters appropriate to this gene in the zebrafish. Concentrations are measured as
numbers of molecules per cell. The predicted period is close to the observed period,
which is 30 minutes per somite in the zebrafish (depending on temperature).

Intracellular Developmental Programs Can Help Determine the
Time-Course of a Cell’s Development

Although signaling between cells plays an essential part in driving the progress of
development, this does not mean that cells always need signals from other cells to
prod them into changing their character as development proceeds. Some of these
changes are intrinsic to the cell (like the ticking of the segmentation clock) and
depend on intracellular developmental programs that can operate even when the
cell is removed from its normal environment.

The best-understood example is in the development of neural precursor cells,
or neuroblasts, in the embryonic Drosophila central nervous system. These cells,
as we saw, are initially singled out from the neurogenic ectoderm of the embryo
by a typical lateral-inhibition mechanism that depends on Notch, and they then
proceed through an entirely predictable series of asymmetric cell divisions to
generate ganglion mother cells that divide to form neurons and glial cells (see
Figure 21-36). The neuroblast changes its internal state as it goes through its set
program of divisions, generating different cell types with a reproducible sequence
and timing. These successive changes in neuroblast specification occur through
the sequential expression of specific transcription regulators. For example, most
embryonic neuroblasts sequentially express the transcription regulators Hunchback,
Krüppel, Pdm, and Cas in a fixed order (Figure 21-40). When a neuroblast
divides, the set of transcription regulators expressed at that time is inherited by
the ganglion mother cell and its neural progeny; thus, the differentiated neural
cells are endowed with different characters according to their time of birth.

Remarkably, when neuroblasts are taken from an embryo and maintained in
culture, isolated from their normal surroundings, they step through much the
same stereotyped developmental program as if they had been left in the embryo.
Moreover, many of the neuroblast transitions occur even when cell division is
blocked. The neuroblasts seem to have a built-in timer that determines when each
of the transcription regulators is expressed, and this timer can continue to run in
the absence of cell division. The molecular basis of the timing is largely unknown;
in part, at least, it must depend on the time taken for gene switching, as described
above; but it may well also depend on slow progressive changes in chromatin
structure. These too can serve to measure the passage of time in the embryo.

Figure 21-40 Temporal patterning of neuroblast fate in Drosophila. Hunchback,
Krüppel, Pdm, and Cas are transcription regulators that are expressed consecutively
in the cell lineage of neuroblasts during development of the Drosophila nervous
system. At successive time steps, correlated with cell division, the neuroblast
switches its pattern of gene expression. Each neuroblast division produces one
daughter that remains a neuroblast and expresses the updated set of genes, and one
ganglion mother cell that maintains the expression of this gene set and
differentiates into specific cell types accordingly. (After B.J. Pearson and
C.Q. Doe, Nature 425:624-628, 2003. With permission from Macmillan Publishers.)

Cells Rarely Count Cell Divisions to Time Their Development

Many specialized cells in animals develop from proliferating progenitor cells
that stop dividing and terminally differentiate after a limited number of cell
divisions. In these cases, differentiation is coordinated with withdrawal from the
cell cycle, but it is usually not known how the coordination is achieved. It has
often been suggested that the cell-division cycle might serve as an intracellular
timer to control the timing of cell differentiation. The cell cycle would be the
ticking clock that sets the tempo of other developmental processes, with
maturational changes in gene expression being dependent on cell-cycle progression.
Most of the evidence, however, indicates that this tempting idea is wrong. Although
there are examples where cells change their maturation state with each division and
the change depends on cell division, this is not the general rule. As we just saw
for neuroblasts in the Drosophila embryo, cells in developing animals often carry on
with their normal timetable of maturation and differentiation even when cell
division is artificially blocked; necessarily, some abnormalities occur, if only
because a single undivided cell cannot differentiate in two ways at once. But it
seems that most developing cells can change their state without a requirement for
cell division. Developmental control genes can switch the cell-division-cycle
machinery on or off, and it is the dynamics of these genes, rather than the cell
cycle, that sets the tempo of development.

MicroRNAs Often Regulate Developmental Transitions 

Genetic screens are useful for tracking down the genes involved in almost any
biological process, and they have been used to search for mutations that alter
developmental timing. Such screens were performed in the nematode Caenorhabditis
elegans (Figure 21-41). This worm is small, relatively simple, and precisely
structured. The anatomy of its development is highly predictable and has been
described in extraordinary detail, so that one can map out the exact lineage of
every cell in the body and see exactly how the developmental program is altered
in a mutant. Genetic screens in C. elegans revealed mutations that disrupt
developmental timing in a particularly striking way: in these so-called
heterochronic mutants, certain cells in a larva at one stage of development behave
as though they were in a larva at a different stage of development, or cells in the
adult carry on dividing as though they belonged to a larva (Figure 21-42).

Genetic analyses showed that the products of the heterochronic genes act
in series, forming regulatory cascades. Unexpectedly, two genes at the top of
their respective cascades, called Lin4 and Let7, were found to code not for protein
but instead for microRNAs (miRNAs): short, untranslated, regulatory RNA
molecules, 21 or 22 nucleotides long. These act by binding to complementary
sequences in the noncoding regions of mRNA molecules transcribed from other
heterochronic genes, thereby repressing their translation and promoting their
degradation, as discussed in Chapter 7. Increasing levels of Lin4 miRNA govern the
progression from first-stage larva cell behaviors to third-stage larva cell
behaviors. Increasing levels of Let7 miRNA govern the progression from late larva to
adult. In fact, Lin4 and Let7 were the first miRNAs to be described in any animal:
it was through developmental genetic studies in C. elegans that the importance of
this whole class of molecules for gene regulation in animals was discovered.

Figure 21-41 Caenorhabditis elegans. A side view of an adult hermaphrodite is
shown. (From J.E. Sulston and H.R. Horvitz, Dev. Biol. 56:110-156, 1977. With
permission from Academic Press.)

Figure 21-42 Heterochronic mutations in the Lin14 gene of C. elegans. Only the
effects on one of the many altered lineages are shown. A loss-of-function
(recessive) mutation in Lin14 causes premature occurrence of the pattern of cell
division and differentiation characteristic of a late larva, so that the animal
reaches its final state prematurely and with an abnormally small number of cells.
The gain-of-function (dominant) mutation has the opposite effect, causing cells to
reiterate patterns of cell divisions characteristic of the first larval stage,
continuing through as many as five or six molt cycles. The cross denotes a
programmed cell death. Green lines represent cells that contain Lin14 protein (which
binds to DNA), red lines those that do not. (Adapted from V. Ambros and
H.R. Horvitz, Science 226:409-416, 1984. With permission from the authors; and
P. Arasu, B. Wightman and G. Ruvkun, Genes Dev. 5:1825-1833, 1991. With
permission from the authors.)

More generally, in many animals, miRNAs help regulate the transitions
between different stages of development. For example, in flies, fish, and frogs, the
maternal mRNAs that are loaded into the egg in the mother are removed during
early development when the genome of the embryo begins to be transcribed; at
this stage, the embryo begins to express specific miRNAs that target many maternal
mRNAs for translational repression and degradation.

Thus, miRNAs can sharpen developmental transitions by blocking and removing
mRNAs that define an earlier developmental stage. But how is the timing of
miRNA expression itself controlled? In the case of the miRNAs that disable maternal
mRNAs in frogs and fish, expression is activated at the end of the series of
rapid, synchronous divisions that cleave the fertilized egg into many smaller cells.
As the division rate of these blastomeres slows, widespread transcription of the
embryo's genome begins (Figure 21-43). This event, where the embryo's own
genome largely takes over control of development from maternal macromolecules,
is called the maternal-zygotic transition (MZT), and it occurs with roughly
similar timing in most animal species, with the exception of mammals.

Figure 21-43 The maternal-zygotic transition in a zebrafish embryo.
Maternal mRNAs are deposited by the mother into the egg and drive early
development. These mRNAs are degraded during different stages of embryogenesis,
including blastula and gastrula stages, but a relatively abrupt change occurs at the
maternal-zygotic transition (MZT). Before this, the embryonic (zygotic) genome is
transcriptionally inactive; afterward, zygotic genes start to be transcribed. In
zebrafish embryos, the zygotic genome begins to be activated at the 512-cell stage.

One trigger for the MZT appears to be the nuclear-to-cytoplasmic ratio.
During cleavage, the total amount of cytoplasm in the embryo remains constant,
but the number of cell nuclei increases exponentially. As a critical threshold is
reached in the ratio of cytoplasm to DNA, the cell cycles lengthen and transcription
is initiated. Thus, haploid embryos undergo the MZT one cell cycle later than
diploid embryos, which contain twice as much DNA per cell. According to one
model, the nuclear-to-cytoplasmic ratio might be measured through the titration
of a transcription repressor against the increasing amount of nuclear DNA. The
total amount of repressor would stay constant during cleavage divisions, but the
amount of repressor per genome would decrease, falling by a half with each round
of DNA synthesis, until loss of repression allowed the zygotic genome to become
transcriptionally active. The newly synthesized transcripts include the miRNAs
that recognize many of the transcripts deposited in the egg by the mother,
directing their translational repression and rapid degradation.
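
The titration model can be made concrete with a small numerical sketch. The code below is only an illustration under stated assumptions: the total amount of repressor (1024 arbitrary units), the activation threshold, and the function name mzt_cycle are all hypothetical and are not measured values. It shows why, in such a model, a haploid embryo would be expected to reach the threshold one cell cycle later than a diploid embryo.

```python
def mzt_cycle(ploidy, repressor_total=1024.0, threshold=1.0):
    """Return the cleavage cycle at which the amount of repressor per
    genome copy first falls to the threshold (hypothetical units).

    ploidy          -- genome copies in the fertilized egg (1 = haploid, 2 = diploid)
    repressor_total -- fixed maternal pool of repressor (assumed constant)
    threshold       -- repressor per genome below which transcription starts
    """
    genomes = float(ploidy)
    cycle = 0
    # Each round of DNA synthesis doubles the number of genome copies,
    # so the repressor available per genome halves.
    while repressor_total / genomes > threshold:
        genomes *= 2
        cycle += 1
    return cycle


if __name__ == "__main__":
    diploid = mzt_cycle(ploidy=2)
    haploid = mzt_cycle(ploidy=1)
    print("diploid embryo activates at cycle", diploid)
    print("haploid embryo activates at cycle", haploid)
    print("delay in cycles:", haploid - diploid)
```

Because the repressor pool is fixed and the DNA content doubles with each division, halving the starting DNA content shifts the threshold crossing by exactly one doubling, whatever values are chosen for the pool and the threshold.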

Hormonal Signals Coordinate the Timing of Developmental
Transitions 

We have so far emphasized timing mechanisms that operate locally and separately
in the different parts of the embryo, or in specific subsystems of the molecular
control machinery. Evolution has tuned each of these largely independent
processes to run at an appropriate rate, matched to the needs of the organism as
a whole. For some purposes, however, this is not enough: a global coordinating
signal is required. This is especially true where changes have to occur throughout
the body in response to a cue that depends on the environment. For example,
when an insect or amphibian undergoes metamorphosis (the transition from
larva to adult), almost every part of the body is transformed. The timing of
metamorphosis depends on external factors such as the supply of food, which
determines when the animal reaches an appropriate size. All the bodily changes have
to be triggered together at the right time, even though they are occurring in widely
separated sites. The coordination in such cases is provided by hormones, signal
molecules that spread throughout the body.

The metamorphosis of amphibians provides a spectacular example. During
this developmental transition, amphibians switch from an aquatic to a terrestrial
life. Larva-specific organs such as gills and tail disappear, and adult-specific
organs such as legs form. This dramatic transformation is triggered by thyroid
hormone, produced in the thyroid gland. If the gland is removed or if thyroid
hormone action is blocked, metamorphosis does not occur, although growth
continues, producing a giant tadpole. Conversely, a dose of thyroid hormone given to a
tadpole by an experimenter can trigger metamorphosis prematurely.

The thyroid hormone is distributed through the vascular system and induces
changes throughout the animal by binding to intracellular nuclear hormone
receptors, which regulate hundreds of genes. This does not mean, however, that
target tissues all respond in the same way to the hormone: organs differ not only
in their levels of thyroid hormone receptors and levels of extracellular proteins
that locally regulate the amount of active hormone, but also in the sets of genes
that respond. Thyroid hormone induces muscle in the limbs to grow and muscle
in the tail to die. The timing of the responses also differs: for example, the legs
form early in response to a very low concentration of circulating hormone, but it
requires a high level of the hormone to induce resorption of the tail.
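
One simple way to picture how a single circulating signal can produce responses at different times is to give each tissue its own response threshold. The sketch below is an illustration only: the threshold values, the units, and the names RESPONSE_THRESHOLDS and active_responses are hypothetical, chosen just to reproduce the ordering described above (legs respond at low hormone levels, tail resorption requires high levels).

```python
# Hypothetical response thresholds, in arbitrary units of circulating hormone.
RESPONSE_THRESHOLDS = {
    "leg growth": 1.0,        # responds to a very low concentration
    "tail resorption": 10.0,  # requires a high concentration
}


def active_responses(hormone_level, thresholds=RESPONSE_THRESHOLDS):
    """List the tissue responses triggered at a given hormone level."""
    return [name for name, needed in thresholds.items() if hormone_level >= needed]


if __name__ == "__main__":
    # As the hormone level rises, responses are recruited in threshold order.
    for level in (0.5, 2.0, 20.0):
        print(level, active_responses(level))
```

As the hormone level climbs during the surge, the low-threshold response switches on first and the high-threshold response last, so one global signal yields an ordered sequence of local events.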

A surge of thyroid hormone triggers metamorphosis, but how is the timing of
the surge controlled? One mechanism depends on coupling hormone synthesis
to the size of the thyroid gland, which reflects the size of the tadpole. Only when
the gland attains a certain size does it produce enough thyroid hormone to initiate
metamorphosis. However, environmental cues other than nutrition also play a
part: conditions such as temperature and light are sensed by the nervous system,
which regulates the secretion of another tier of hormones (neurohormones) that
stimulate the secretion of thyroid hormone. Thus, tadpole-intrinsic factors such
as size combine with environmental factors to determine when metamorphosis
begins.

Environmental Cues Determine the Time of Flowering

Another striking example of environmentally controlled developmental timing is
the flowering of plants. Flowering involves a transformation of the behavior of the
cells at the growing apex of the plant shoot, the apical meristem. During ordinary
vegetative growth, these cells behave as stem cells, generating a steady succession
of new leaves and new segments of stalk. In flowering, the meristem cells switch
to making the components of a flower, with its sepals and petals, its stamens
carrying pollen, and its ovary containing the female gametes.

To time the switch correctly, the plant has to take account of both past and
present conditions. One important cue, for many plants, is day length. To sense
this, the plant uses its circadian clock, an endogenous 24-hour rhythm of gene
expression, to generate a signal for flowering only when there is light for the
appropriate part of the day. The clock itself is influenced by light, and the plant
in effect uses the clock to compare past to present lighting conditions. Important
parts of the genetic circuitry underlying these phenomena have been identified,
including the phytochromes and cryptochromes that act as light receptors
(discussed in Chapter 15). The flowering signal that is carried from the leaves to the
stem cells via the vasculature depends on the product of Flowering locus T (Ft).

But this signal will trigger flowering only if the plant is in a receptive
condition from prior long-term cold exposure. Many plants need winter before they
will flower, a process called vernalization. Cold over a period of weeks or months
progressively reduces the level of expression of a remarkable gene called
Flowering locus C (Flc). Flc encodes a transcriptional repressor that suppresses
expression of the Ft flowering promoter.

How does vernalization shut down Flc so as to lift the block to flowering? The
effect involves a noncoding RNA called Coolair that overlaps with the Flc gene and
is produced when the temperature is low (Figure 21-44). Together with
cold-induced chromatin modifiers, including Polycomb-group proteins, Coolair
coordinates the switching of Flc chromatin to a silent state (discussed in Chapters 4 and
7). The degree of silencing depends on the length of cold exposure, enabling the
plants to distinguish the odd chilly night from the whole of winter.

The effect on the chromatin is long lasting, persisting through many rounds
of cell division even as the weather grows warmer. Thus vernalization creates a
persistent block in production of Flc, enabling the Ft signal to be generated when
day length is sufficiently long.
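
The logic of vernalization, in which silencing accumulates with the duration of cold and is then remembered, can be sketched as a simple counter with no reset. Everything numerical here is a hypothetical placeholder (the cold cutoff, the silencing per cold day, the Flc and day-length thresholds), and the function names flc_level_after and can_flower are invented for illustration; the sketch only captures the qualitative behavior described above.

```python
def flc_level_after(daily_temperatures, cold_below=5.0, silencing_per_cold_day=0.02):
    """Relative Flc expression (1.0 = fully active) after a temperature history.

    Each cold day removes a fixed, hypothetical increment of Flc expression.
    Warm days do not restore it: the chromatin change is assumed to persist.
    """
    flc = 1.0
    for temperature in daily_temperatures:
        if temperature < cold_below:
            flc = max(0.0, flc - silencing_per_cold_day)
    return flc


def can_flower(flc, day_length_hours, flc_max=0.2, min_day_length=12.0):
    """Flowering needs both low Flc (past winter) and long days (present)."""
    return flc <= flc_max and day_length_hours >= min_day_length


if __name__ == "__main__":
    chilly_night = [15.0] * 10 + [2.0] + [15.0] * 10   # one cold day
    full_winter = [2.0] * 60 + [15.0] * 30             # two cold months, then warmth
    for name, history in (("chilly night", chilly_night), ("winter", full_winter)):
        flc = flc_level_after(history)
        print(name, round(flc, 2), can_flower(flc, day_length_hours=14.0))
```

A single cold day barely changes the Flc level, whereas a long cold period silences it, and the silenced state is still in place after the warm days that follow. The day-length condition stands in for the Ft signal, which is not modeled in any detail.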
Figure 21-44 Temporal control of flowering in Arabidopsis. The Flc gene
is active and blocks flowering when plants have been grown without exposure to
winterlike temperatures. Exposure to a prolonged period of cold leads to the
production of the noncoding RNA Coolair, which overlaps with the Flc gene. Coolair
induces long-term chromatin changes that turn off Flc. These changes persist after
the end of the cold period and allow the plant to flower when other environmental
conditions are favorable for flowering.

Mutations affecting the regulation of Flc expression alter the time of flowering
and thus the ability of a plant to flourish in a given climate. The whole control
system governing the switch to flowering is thus of vital importance for agriculture,
especially in an era of rapid climate change.

The example of vernalization suggests a general point about the role of
chromatin modification in developmental timing. The plant uses changes in chromatin
to record its experience of prolonged cold. It may be that in other organisms
(animals as well as plants) slow, progressive changes in chromatin structure
provide long-term timers for those mysterious developmental processes that unfold
slowly, over a period of days, weeks, months, or years. Such chromatin timers may
be among the most important clocks in the embryo, but as yet we understand very
little about them.

Summary 

Developmental timing is controlled at many levels. It takes time to switch a gene on
or off, and this time delay depends on the lifetimes of the molecules involved, which
can vary widely. Cascades of gene regulation involve cascades of delays. Feedback
loops can give rise to temporal oscillations in gene expression, and these may serve
to generate spatially periodic structures. During vertebrate segmentation, for
example, expression of the Hes genes oscillates, and one new pair of somites is formed
during each oscillation cycle. Hes genes encode transcription repressor proteins that
can act back on expression of the Hes genes themselves. This negative feedback
generates oscillations with a period that reflects the delay in the autoregulatory gene
switching loop. The period of oscillation of this "segmentation clock" controls the
sizes of the somites. Notch signaling between neighboring cells synchronizes their
oscillations: when Notch signaling fails, the cells drift out of synchrony because of
genetic noise in their individual clocks, and the segmental organization of the
vertebral column is disrupted.

Timing does not always depend on cell-cell interactions; many developing animal
cells have intrinsic developmental programs that play out even in isolated cells
in culture. Neuroblasts in Drosophila embryos, for example, go through set
programs of asymmetric divisions, generating different neural cell types at each
division with a predictable sequence and timing, through a cascade of gene switching
events. Studies in both vertebrates and invertebrates show that such programs are
rarely governed by the timing of cell division and can unfold even when cell division
is blocked. MicroRNAs produced at critical moments sharpen developmental
transitions by blocking the translation and promoting the degradation of specific sets
of mRNAs. Global coordination of developmental timing is achieved by hormones:
as a tadpole grows, for example, thyroid hormone levels surge and trigger its
metamorphosis into a frog. Environmental control of developmental timing is especially
striking in plants and reveals the presence of molecular timers that act over the long
term. In vernalization, for example, prolonged cold induces changes in chromatin
that chart the passage through winter so as to allow flowering only in the spring.
Slow, progressive changes in chromatin structure are likely to be important timers
in the long-term programming of development in animals too.

MORPHOGENESIS 

The specialization of cells into distinct types at specific times is important, but it
is only one aspect of animal development. Equally important are the movements
and deformations that cells go through to assemble into tissues and organs with
specific shapes and sizes. Like developmental timing, this process of morphogenesis
(“form generation”) is less well understood than the processes of differential
gene expression and inductive signaling that lead to cell-type specialization. The
cell movements can be readily described, but the underlying molecular mechanisms
that coordinate the movements are much harder to decipher.

In Chapter 19, we saw how cells cohere to form epithelial sheets or surround
themselves with extracellular matrix to create connective tissues. We also
discussed how the basic features of tissues, such as the polarity of epithelia, arise
from the properties of individual cells. In this section, we consider how the
rearrangements of cells during animal development give shape to the embryo and
to all the individual organs and appendages of the body.

A small number of cell processes are basic to morphogenesis. Individual cells
can migrate through the embryo along defined tracks. They can crawl over one
another in a coordinated way to elongate, constrict, or thicken a tissue. They can
segregate from their neighbors and form physically separate groups. They can
change their shape so as to deform an epithelial sheet into a tube or a vesicle.
By stretching out while holding on to their companions, specialized sets of cells
can form growing tubular networks such as the system of blood or lymph vessels.
Mass migrations, as occur in gastrulation, can transform the entire topology of
the embryo. Underlying all these processes are changes in cell shape and changes
in cell contacts, either with other cells or with extracellular matrix. We begin by
considering the migration of individual cells.

Cell Migration Is Guided by Cues in the Cell’s Environment 

The birthplace of cells is often far from their ultimate location in the body. Our
skeletal muscles, for example, derive from muscle cell precursors, or myoblasts, in
somites, from which they migrate into the limbs and other regions. The routes that
the migrant cells follow and the selection of sites that they colonize determine the
eventual pattern of muscles in the body. The embryonic connective tissues form
the framework through which the myoblasts travel, and these tissues provide the
cues that guide myoblast distribution. No matter which somite they come from,
the myoblasts that migrate into a forelimb bud will form the pattern of muscles
appropriate to a forelimb, and those that migrate into a hindlimb bud will form
the pattern appropriate to a hindlimb. It is the connective tissue that provides the
patterning information.

As a migrant cell travels through the embryonic tissues, it repeatedly extends
surface projections that probe its immediate surroundings, testing for cues to
which it is particularly sensitive by virtue of its specific assortment of cell-surface
receptor proteins. Inside the cell, these receptors are connected to the cortical
actin and myosin cytoskeleton, which moves the cell along. Some extracellular
matrix molecules, such as the protein fibronectin, provide adhesive sites that help
the cell advance; others, such as chondroitin sulfate proteoglycan, inhibit
locomotion and repel immigration. The nonmigrant cells along the migration pathway
may likewise have inviting or repellent macromolecules on their surface; some
may even extend filopodia to make their presence known.

Among the many guiding influences, a few stand out as especially important.
In particular, many types of migrating cells are guided by chemotaxis that
depends on a G-protein-coupled receptor (called CXCR4), which is activated by
an extracellular ligand called CXCL12. Cells expressing this receptor can snuffle
their way along tracks marked out by CXCL12 (Figure 21-45). Chemotaxis toward
sources of CXCL12 plays a major part in guiding the migrations of lymphocytes
and various other white blood cells; of neurons in the developing brain; of
myoblasts entering limb buds; of primordial germ cells as they travel toward the
developing gonads; and of cancer cells when they metastasize.

Figure 21-45 CXCL12 guides migrating germ cells. Zebrafish germ cells migrate to
domains that express CXCL12. As the sites of CXCL12 expression change, cells follow
the CXCL12 track and are guided to the region where the gonad develops at a later
developmental stage. (A) At the 4-somite stage, germ cells move from a position
that is close to the midline to more lateral regions where CXCL12 is expressed. (B) As
the CXCL12 expression retracts, germ cells are guided to more posterior positions.

Figure 21-46 Directional migration by local blebbing. Germ cells migrate via
protrusions that define the leading edge of the cell. The persistence and site of the
protrusions are biased toward higher levels of CXCL12. Thus, germ cells migrate up
the CXCL12 gradient.

Detailed studies of primordial-germ-cell migration have shown that CXCL12
signaling does not induce cell migration per se but rather serves to control its
direction. In the absence of CXCL12 signaling, germ cells still display the
membrane blebbing associated with cell migration, but the position of the cell front
where blebs form is randomly chosen (Figure 21-46); if CXCL12 signaling is
intact, blebbing is more frequent on the side of the cell that faces the source of
CXCL12, resulting in directional migration.
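
The distinction drawn here, between a signal that induces movement and one that merely biases its direction, can be illustrated with a one-dimensional random walk. This is a toy sketch, not a model of real germ cells: the function name migrate, the step size, and the bias probabilities are all hypothetical. The cell takes the same number of steps in every case; only the probability that a bleb forms on the side facing the CXCL12 source changes.

```python
import random


def migrate(steps, p_toward_source, seed=0):
    """Net displacement toward the source after a number of bleb-driven steps.

    p_toward_source -- probability that a bleb forms on the side facing the
                       source (0.5 = no directional information).
    The cell always moves one unit per step; the signal only biases direction.
    """
    rng = random.Random(seed)  # seeded so that a run is reproducible
    position = 0
    for _ in range(steps):
        position += 1 if rng.random() < p_toward_source else -1
    return position


if __name__ == "__main__":
    steps = 10_000
    print("no CXCL12 signaling :", migrate(steps, p_toward_source=0.5))
    print("CXCL12 signaling    :", migrate(steps, p_toward_source=0.8))
```

With no bias the cell is just as motile but makes little net progress, whereas even a modest bias in where blebs form produces steady movement up the gradient.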

The Distribution of Migrant Cells Depends on Survival Factors

The final distribution of migrant cells depends not only on the routes they take,
but also on whether they survive the journey and thrive in the environment they
find at the journey's end. Specific sites provide survival factors needed for specific
types of migrant cells to survive.

Among the most important sets of migrant cells in the vertebrate embryo are
those of the neural crest. They arise from the border region between the part of the
ectoderm that will form epidermis and the part that will form the central nervous
system. As the neural ectoderm rolls up to form the neural tube, the neural crest
cells break loose from the epithelial sheet along this border region and set out on
their long migrations (see Figure 19-8 and Movie 21.5). They settle ultimately in
many sites and give rise to a surprising diversity of cell types. Some lodge in the
skin and specialize as pigment cells; others form skeletal tissue in the face.
Still others will differentiate into the neurons and glial cells of the peripheral
nervous system: not only in the sensory ganglia that lie close to the spinal cord, but
also, following a much longer migration, in the wall of the gut.

The neural crest cells that give rise to the pigment cells of the skin and those
that develop into the nerve cells of the gut depend on a secreted peptide called
endothelin-3, which is produced by tissues along the migration pathways and acts
as a survival factor for the migrating crest cells. In mutants with a defect in the
gene for endothelin-3 or its receptor, many of these migrating crest cells die. As
a result, the mutant individuals have nonpigmented (albino) patches of skin and
a deficit of nerve cells in the intestine, especially its lower end, the large bowel,
which becomes abnormally distended for lack of proper neural control, a
potentially lethal condition called megacolon.
Figure 21-47 Effect of mutations in the Kit gene. Both the baby and the mouse
are heterozygous for a loss-of-function mutation that leaves them with only half the
normal quantity of Kit gene product. In both cases, pigmentation is defective
because pigment cells depend on the gene product as a receptor for a survival factor.
(Courtesy of R.A. Fleischman, from R.A. Fleischman et al., Proc. Natl Acad.
Sci. USA 88:10885-10889, 1991.)


Another important survival signal for many types of migratory cells, including
primordial germ cells, blood cell precursors, and neural-crest-derived pigment
cells, depends on a receptor tyrosine kinase called Kit. This is expressed on the
surface of the migrant cells, and a protein ligand, called Steel factor, is produced
by the cells of the tissue through which the cells migrate and/or in which they
come to settle. Individuals with mutations in the genes for either of these proteins
have deficits in pigmentation, blood cells, and germ cells (Figure 21-47).


Changing Patterns of Cell Adhesion Molecules Force Cells Into
New Arrangements 

Patterns of gene expression govern embryonic cell movements in many ways.
They regulate cell motility, cell shape, and the production of proteins that guide
migration. Importantly, they also determine the sets of adhesion molecules that
the cells display on their surface. Through changes in its surface molecules, a cell
can break old attachments and make new ones. Cells in one region may develop
surface properties that make them cohere with one another and become
segregated from a neighboring group of cells with different surface chemistry.

Experiments done half a century ago on early amphibian embryos showed
that the effects of selective cell-cell adhesion can be so powerful that they can
bring about an approximate reconstruction of the normal structure of an early
postgastrulation embryo after the cells have been artificially dissociated and
mixed up. When these cells are reaggregated into a random mixture, the cells
spontaneously sort themselves out according to their original germ-layer origins
(Figure 21-48). As discussed in Chapter 19, cadherin proteins have a central role
in the sorting process (see Figure 19-9). Cadherins belong to a large and varied
family of Ca2+-dependent cell-cell adhesion proteins, and they and other cell-cell
adhesion proteins are differentially expressed in the various tissues of the early
embryo. Antibodies against these proteins interfere with the normal selective
adhesion between cells of a similar type.

Figure 21-48 Sorting out by adhesion. Cells from different parts of an early
amphibian embryo will sort out according to their origins. In the classical
experiment shown here, mesoderm cells (green), neural plate cells (blue), and
epidermal cells (red) have been disaggregated and then reaggregated in a
random mixture. They sort out into an arrangement reminiscent of a normal
embryo, with a “neural tube” internally, epidermis externally, and mesoderm in
between. (Modified from P.L. Townes and J. Holtfreter, J. Exp. Zool. 128:53-120,
1955. With permission from Wiley-Liss.)

Changes in the patterns of expression of the various cadherins correlate closely
with the changing patterns of association among cells during various
developmental processes, including gastrulation, neural tube formation, and somite
formation. These cell rearrangements are likely to be regulated and driven in part by
the cadherin pattern. In particular, cadherins appear to have a major role in
controlling the formation and dissolution of epithelial sheets and clusters of cells (see
Movie 19.1). They not only glue one cell to another but also provide anchorage
for intracellular actin filaments at the sites of cell-cell adhesion. In this way, the
pattern of stresses and movements in the developing tissue is regulated according
to the pattern of cell adhesions.

Repulsive Interactions Help Maintain Tissue Boundaries

The different types of cadherins enable different types of cells to cohere
selectively: cells expressing one type of cadherin will maximize their contact with cells
expressing the same cadherin and thereby segregate from other cells, creating
specific tissue boundaries. Cell mixing can be inhibited and boundaries created
and maintained in another way as well: cells of different types can sometimes
actively repel one another. The bidirectional activation of Eph receptors and
ephrins discussed in Chapter 15 often mediates such repulsion, acting at interfaces
between different groups of cells to keep the groups from mixing, and repelling
invasion by inappropriate visitors. Ephrin-Eph signaling operates, for example,
at the boundaries of the rhombomeres discussed earlier. Neighboring
rhombomeres express complementary combinations of ephrins and Eph receptors, and
this keeps the cells in adjacent rhombomeres strictly segregated, with a boundary
between them that is sharply defined (Figure 21-49).

Groups of Similar Cells Can Perform Dramatic Collective
Rearrangements 

Figure 21-49 Sorting out by repulsion. Ephrin-Eph signaling in hindbrain segmentation in a chick
embryo. Each pair of rhombomeres (segments in the hindbrain) is associated with a branchial
arch (a modified gill rudiment) to which it sends innervation. Rhombomeres are distinguished from
one another by expression of different Hox genes (see Figure 21-32). Mutual repulsion (red bars)
between cells that express EphrinB2 in rhombomere 4 and EphA4 in rhombomere 5 creates a
sharp boundary.

Figure 21-50 Convergent extension and its cellular basis. (A) Schematic
diagram of cell behaviors that underlie convergent extension. The cells form
lamellipodia, with which they attempt to crawl over one another. Alignment of the
lamellipodial movements along a common axis leads to convergent extension. The
process depends on the Wnt-Frizzled/planar-cell-polarity signaling pathway and is
cooperative, presumably because cells that are already aligned exert forces that tend to
align their neighbors in the same way. (B-G) The pattern of convergent
extension of dorsal mesoderm during zebrafish gastrulation at 8.8 (B, E), 9.3 (C,
F), and 11.3 (D, G) hours after fertilization. Cells that will give rise to the notochord
are labeled in green, and cells that will give rise to somites and muscle are labeled in
blue. The notochord and somite domains are spatially separate from the start of the
recording (B, E), but their boundaries are at first barely visible and only a little later
become obvious. Convergence narrows the notochord domain to a width of about
two cells at the last time point (D, G). (A, after J. Shih and R. Keller, Development
116:901-914, 1992; B-G, after N.S. Glickman et al., Development
130:873-887, 2003. With permission from The Company of Biologists.)

Cadherin-mediated cell sorting and ephrin-Eph-mediated repulsion exemplify
how differences in cell-surface properties can drive tissue arrangements, causing
cells that express different sets of genes to separate from one another. However,
groups of cells that are all similar can also undergo dramatic rearrangements.
During frog gastrulation, for example, cells in one region of the surface
epithelium invaginate and migrate as a sheet into the interior of the embryo and
converge toward the embryonic midline. The movement is driven mainly by an active
rearrangement of the migrating cells, called convergent extension. Here the cells
crawl over one another in a coordinated way, displacing their neighbors as they
migrate, causing the cell sheet to narrow along one axis (converge) and elongate
along another (extend). Strikingly, small, square fragments of tissue from the
appropriate region of the embryo, isolated in culture, will spontaneously narrow
and elongate, just as they would in the embryo (Figure 21-50). The alignment
of the cell movements depends on the same signaling pathway that is involved
in generating planar cell polarity within developing epithelia, as we discuss next.
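
The geometry of convergent extension follows from cell rearrangement alone, without any change in cell number or size. The sketch below is a deliberately idealized illustration: it represents the tissue as a rectangular grid with an even number of cells across its width, and the function name intercalate is hypothetical. In each round, neighboring cells across the width slot between one another, so the sheet becomes half as wide and twice as long.

```python
def intercalate(sheet):
    """One idealized round of intercalation across the width of a cell sheet.

    sheet -- list of rows ordered along the axis of extension; each row lists
             the cells lying side by side across the width (even count assumed).
    Alternate cells of every row are pulled into two successive rows, so the
    width halves, the length doubles, and no cell is gained or lost.
    """
    new_sheet = []
    for row in sheet:
        new_sheet.append(row[0::2])  # cells at even positions across the width
        new_sheet.append(row[1::2])  # their neighbors slot in behind them
    return new_sheet


if __name__ == "__main__":
    # A square fragment, 4 cells long and 4 cells wide; cells named by origin.
    sheet = [[(r, c) for c in range(4)] for r in range(4)]
    for round_number in range(3):
        print("round", round_number, "length", len(sheet), "width", len(sheet[0]))
        if len(sheet[0]) > 1:
            sheet = intercalate(sheet)
```

A square fragment of 16 cells thus converts to a strip 8 cells long and 2 wide, and then 16 long and 1 wide, which mirrors the narrowing and elongation seen in isolated explants. Real tissues achieve this through aligned cell crawling rather than in discrete rounds.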

Planar Cell Polarity Helps Orient Cell Structure and Movement in 
Developing Epithelia 

Cells within an epithelium always have an apical-basal polarity (discussed in
Chapter 19), but the cells of many epithelia show an additional polarity at right
angles to this axis: the cells are all arranged as if they had an arrow written on
them, pointing in a specific direction in the plane of the epithelium. This type of
polarity is called planar cell polarity. In the wing of a fly, for example, each
epithelial cell has a tiny asymmetrical projection, called a wing hair, on its surface,
and the hairs all point toward the tip of the wing. Similarly, in the inner ear of a
vertebrate, each mechanosensory hair cell has a precisely oriented asymmetric
bundle of actin-filled, rodlike protrusions called stereocilia sticking up from its
apical plasma membrane as a detector of sound and of forces such as gravity.
Tilting the bundle in one direction causes ion channels in the membrane to open,
electrically activating the cell; tilting in the opposite direction has the opposite
effect. For the ear to function properly, the hair cells must be oriented correctly.
Planar cell polarity is also important in the respiratory tract, where every ciliated
cell must orient the beating of its cilia so as to sweep mucus upward, away from
the lungs.

Screens for mutants with misoriented wing hairs in Drosophila have identified
a set of genes that is critical for planar cell polarity. Some of these genes code for
components of the Wnt signaling pathway, others code for specialized members
of the cadherin superfamily, while the functions of others are uncertain. These
components of planar-cell-polarity signaling are assembled at cell-cell junctions
in the epithelium in such a way as to exert a polarizing influence that can
propagate from cell to cell. Essentially the same system of proteins controls planar cell
polarity in vertebrates; mice deficient in homologs of the Drosophila planar
polarity genes have a variety of defects, including incorrectly oriented hair cells in the
inner ear, making them deaf (Figure 21-51).

Figure 21-51 Planar cell polarity. (A) Wing hairs on the wing of a fly. Each cell in the wing epithelium forms a small, spiky
protrusion or “hair” at its apex, and all the hairs point the same way, toward the tip of the wing. This reflects a planar polarity in
the structure of each cell. (B) Sensory hair cells in the inner ear of a mouse similarly have a well-defined planar polarity, manifest
in the oriented pattern of stereocilia (actin-filled protrusions) on their surface. The detection of sound depends on the correct,
coordinated orientation of the hair cells. (C) A mutation in the gene Flamingo in the fly, coding for a nonclassical cadherin,
disrupts the pattern of planar cell polarity in the wing. (D) A mutation in a homologous Flamingo gene in the mouse randomizes
the orientation of the planar cell polarity vector of the hair cells in the ear. The mutant mice are deaf. (A and C, from J. Chae et
al., Development 126:5421-5429, 1999. With permission from The Company of Biologists; B and D, from J.A. Curtin et al.,
Curr. Biol. 13:1129-1133, 2003. With permission from Elsevier.)


Interactions Between an Epithelium and Mesenchyme Generate 
Branching Tubular Structures 

Animals require specialized types of epithelial surfaces for many functions,
including excretion, absorption of nutrients, and gas exchange. Where large
surfaces are required, they are often organized as branching tubular structures. The
lung is an example. It originates from epithelial buds that grow out from the floor
of the foregut and invade neighboring mesenchyme to form the bronchial tree, a
system of tubes that branch repeatedly as they extend. Endothelial cells that form
the lining of blood vessels invade the same mesenchyme, thereby creating a
system of closely apposed airways and blood vessels, as required for gas exchange in
the lung (Figure 21-52). This whole process of branching morphogenesis depends
on signals that pass in both directions between the growing epithelial buds and
the mesenchyme. Genetic studies in mice indicate that FGF proteins and their
receptor tyrosine kinases play a central part in these signaling processes. FGF
signaling has various roles in development, but it is especially important in the many
interactions that occur between a developing epithelium and mesenchyme.

In the case of lung development, FGF10 is expressed in clusters of mesenchyme
cells that lie near the tips of the growing epithelial tubes, and its receptor is
expressed in the invading epithelial cells. In FGF10-deficient mutant mice, a primary
bud of lung epithelium is formed but fails to grow out into the mesenchyme
to create a branching bronchial tree. Conversely, a microscopic bead soaked in



Figure 21-52 The airways of the lung, 
shown in a cast of the adult human 
bronchial tree. Resins of different colors 
have been injected into different branches
of the tree of airways. (From R. Warwick 
and P.L. Williams, Gray’s Anatomy, 35th ed. 
Edinburgh: Longman, 1973.) 
FGF10 and placed near embryonic lung epithelium in culture will induce a bud
to form and grow out from the epithelium toward the bead. Evidently, the epithelium
invades the mesenchyme only by invitation, in response to FGF10.

But what makes the growing epithelial tubes of the lung branch repeatedly
as they invade the mesenchyme? This depends on a Sonic hedgehog signal that
is sent in the opposite direction, from the epithelial cells at the tips of the buds
back to the mesenchyme, as shown in Figure 21-53. In mice lacking Sonic hedgehog,
the lung epithelium grows and differentiates, but it forms a sac instead of a
branching tree of tubules.

FGF signaling acts in a remarkably similar way in the formation of the air-exchange
system of insects, which consists of a pattern of fine, air-filled channels
called tracheae and tracheoles. These originate from the epidermis covering the
surface of the body and extend inward to invade the underlying tissues, branching
and narrowing as they go (Figure 21-54). The FGF acts on cells at the tips of
the advancing tracheae, causing them to extend filopodia and migrate toward the
source of the FGF signal. Because the tip cells remain connected to the remainder
of the tracheal epithelium, the pulling force that they generate elongates the
tracheal tube.

Initially, the pattern of FGF production in fly embryos is defined by the D-V
and A-P patterning systems discussed earlier. In later stages of development, however,
FGF expression is induced by transcription regulators called hypoxia-inducible
factors (HIFs) that are activated by hypoxia (low oxygen levels). In this way,
hypoxia stimulates the formation of finer and finer and more extensively branched
trachea, until the oxygen supply is sufficient to stop the process. Hypoxia and HIFs
have similar roles in vertebrates, especially in the development of blood vessels,
as we shall see in the next chapter.


Figure 21-53 Branching morphogenesis
of the lung. How FGF10 and Sonic
hedgehog are thought to induce the growth
and branching of the buds of the bronchial
tree. Many other signal molecules, such as
BMP4, are also expressed in this system,
and the suggested branching mechanism
is only one of several possibilities.
As indicated, FGF10 protein is
expressed in clusters of mesenchyme
cells near the tips of the growing epithelial
tubes, and its receptor is expressed in
the epithelial cells themselves. The Sonic
hedgehog signal is sent in the opposite
direction, from the epithelial cells at the
tips of the buds back to the mesenchyme.
The patterns of gene expression and their
timing suggest that the Sonic hedgehog
signal may serve to shut off FGF10
expression in the mesenchyme cells closest
to the growing tip of a bud, splitting the
FGF10-secreting cluster into two separate
clusters, which in turn cause the bud to
branch into two.





Figure 21-54 Branching morphogenesis of airways in a fly. (A) Drosophila embryonic tracheal
system. (B) FGF (produced in Drosophila by the Branchless gene) signals from surrounding cells
to the tracheal epithelium and activates its FGF receptors, leading to filopodia formation and tube
elongation. [A, from G. Manning and M.A. Krasnow, in The Development of Drosophila
(A. Martinez-Arias and M. Bate, eds), Vol. 1, pp. 609-685. New York: Cold Spring Harbor
Laboratory Press, 1993.]

An Epithelium Can Bend During Development to Form a Tube or
Vesicle

The creation of systems of tubes such as blood vessels and airways is a complex
process, and it can involve various additional forms of cell behavior, as sketched
in Figure 21-55.

As explained in Chapter 19, the process that converts an epithelial sheet into a
tube depends on contraction of specific bundles of actin filaments. With the help
of myosin motor proteins, actin filament bundles can shorten, causing the epithelial
cells to narrow at their apex. These actin bundles are connected from cell to
cell by adherens junctions, and if their contraction is coordinated along a specific
axis, the result will be that the sheet bends and rolls up into a tube (Figure 21-56).
The vertebrate neural tube, which we discuss in the last section of this chapter,
originates in this way.


Figure 21-55 The forms of cell behavior 
involved in tube formation. Folding 
generates the neural tube, budding 
underlies the formation of lungs and 
trachea, cord hollowing occurs during 
the formation of mammalian salivary 
glands, cell hollowing is involved in the 
formation of tracheal terminal cell tubes,
and cell assembly generates the heart tube 
that forms at the earliest stage of heart 
development. 


[Figure 21-56 panel labels: sheet of epithelial cells; adhesion belt with associated actin filaments; invagination of epithelial sheet caused by an organized tightening along adhesion belts in selected regions of cell sheet; epithelial tube pinches off from overlying sheet of cells; epithelial tube or vesicle; forming neural tube; scale bar, 50 μm.]


Figure 21-56 Bending of an epithelial sheet to form a tube. Contraction of apical bundles of actin filaments linked from cell to cell via adherens
junctions causes the epithelial cells to narrow at their apex. Depending on whether the contraction is oriented along one axis of the sheet or is equal
in all directions, the epithelium will either roll up into a tube or invaginate to form a vesicle. (A) Diagram showing how an apical contraction along one
axis of an epithelial sheet can cause the sheet to form a tube. (B) Scanning electron micrograph of a cross section through the trunk of a two-day
chick embryo, showing the formation of the neural tube by the process diagrammed in (A). (B, courtesy of Jean-Paul Revel.)

Summary

Animal development involves dramatic cell movements, including the guided
migration of individual cells, the adhesion and repulsion of groups of cells, and the
complex extension, branching, or rolling up of epithelial tissues. Migrant cells, such
as those of the neural crest, break loose from their original neighbors and travel
through the embryo to colonize new sites. Many migrant cells, including primordial
germ cells, are guided by chemotaxis dependent on the receptor CXCR4 and
its ligand CXCL12. In general, cells that have similar adhesion molecules on their
surfaces cohere and tend to segregate from other cell groups with different surface
properties. Selective cell-cell adhesion is often mediated by cadherins; repulsion is
often driven by ephrin-Eph signaling. Within an epithelial sheet, cells can rearrange
themselves to drive epithelial convergence and extension, as in gastrulation. Many
movements are coordinated through a Wnt-dependent planar-polarity signaling
pathway that is also responsible for orienting cells correctly in various types of
epithelium. Elaborate branched tubular structures, such as the airways of the lung,
are generated through bidirectional signaling between an epithelial bud and the
mesenchyme that it invades, in a process called branching morphogenesis. Epithelial
tubes and vesicles can originate in various ways, most simply by the rolling up
and pinching off of a segment of epithelium, as in the formation of the neural tube.


GROWTH 


One of the most fundamental aspects of animal development is one we know 
surprisingly little about—how the size of an animal or an organ is determined. 
Why, for example, do we grow to be so much larger than a mouse? Even within a 
species, size can vary greatly; a Great Dane, for instance, can weigh over 40 times 
more than a Chihuahua (Figure 21-57). 

Three variables define the size of an organ or organism: the number of cells,
the size of the cells, and the quantity of extracellular material per cell. Size differences
can arise from changes in any of these factors (Figure 21-58). If we compare
a mouse with a human, for example, we find that the difference lies chiefly in the
number of cells, there being roughly 3000 times more cells in a human, corresponding
to a body that is roughly 3000 times more massive. Wild and cultivated
species of food plants, on the other hand, often differ in body size chiefly because
of differences of cell size.
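
As a rough illustration of the mouse-human comparison, a 3000-fold difference in cell number corresponds to only about 11 to 12 extra rounds of cell doubling. The sketch below assumes, purely for illustration, that every cell doubles in each round and that none are lost; the function name is hypothetical.

```python
import math

def extra_doublings(fold_difference: float) -> float:
    """Extra rounds of doubling needed for a given fold increase in cell number.

    Illustrative assumption: every cell divides in every round and none die.
    """
    return math.log2(fold_difference)

# Approximate human/mouse ratio of cell numbers quoted in the text.
human_vs_mouse = 3000
print(round(extra_doublings(human_vs_mouse), 1))  # 11.6
```

Real tissues do not divide synchronously, so the figure is only an order-of-magnitude guide to how small a change in proliferation could underlie a large change in size.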

The challenge, therefore, is to understand how cell numbers, cell size, and
extracellular matrix production are regulated. First of all, we need to identify the
signals that drive or inhibit growth. Then we need to discover how the signals
themselves are regulated. In many cases, the size of an organ or of the body as a
whole seems to be controlled homeostatically, so that the correct size is reached
and maintained even in the face of drastic disturbances. This suggests that the
developing structure somehow senses its own size and uses this information to
regulate the signals for its own growth or shrinkage. In most cases, the nature of
this feedback control remains a profound mystery.

In other cases, the duration of growth and the final size seem to be dictated
by intracellular programs that take no cognizance of the size the structure has
attained. These intracellular programs, too, present many mysteries, as we saw in
our discussion of developmental timing. Very often, it seems, the sizes and proportions
of body parts must depend on combinations of size-measuring feedback
controls and intracellular programs, as well as on environmental influences such
as nutrition.

The variation in control strategies is nicely illustrated by some classic transplantation
experiments. If several fetal thymus glands are transplanted into a
developing mouse, each grows to its characteristic adult size. In contrast, if multiple
fetal spleens are transplanted, each ends up smaller than normal, but collectively
they grow to the size of one adult spleen. Thus, thymus growth is regulated
by local mechanisms intrinsic to the individual organ, whereas spleen growth is
controlled by a feedback mechanism that senses the quantity of spleen tissue in
the body as a whole. In neither case is the mechanism known.
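
The contrast between the two transplantation outcomes can be expressed as a toy calculation. This is only a sketch of the observed results, not of the unknown mechanism; the function name, the "local"/"global" labels, and the assumption that transplanted spleens share the total equally are all illustrative.

```python
from typing import List

def final_organ_sizes(n_organs: int, control: str, adult_size: float = 1.0) -> List[float]:
    """Final size of each of n_organs transplanted organs, in units of one adult organ.

    "local":  each organ grows to its own adult size (thymus-like outcome).
    "global": total tissue is capped at one adult organ, assumed here to be
              shared equally among the transplants (spleen-like outcome).
    """
    if n_organs < 1:
        raise ValueError("need at least one organ")
    if control == "local":
        return [adult_size] * n_organs
    if control == "global":
        return [adult_size / n_organs] * n_organs
    raise ValueError("control must be 'local' or 'global'")

print(sum(final_organ_sizes(4, "local")))   # 4.0 adult-organ equivalents
print(sum(final_organ_sizes(4, "global")))  # 1.0 adult-organ equivalent
```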



Figure 21-57 Members of the same 
species can have dramatically different
sizes. The Chihuahua weighs 
2-5 kilograms, whereas a Great Dane 
weighs 45-90 kilograms. (Courtesy of 
Deanne Fitzmaurice.) 
Figure 21-58 Determinants of organ
size.


The Proliferation, Death, and Size of Cells Determine
Organismal Size

The nematode worm C. elegans illustrates the different ways in which size differences
can arise. This creature follows an astonishingly precise and predictable
developmental program. Each individual of a given sex is generated by almost
exactly the same sequences of cell divisions and cell deaths, and consequently
has precisely the same number of somatic cells (959 in the adult hermaphrodite,
the sex of the majority of these animals), although the number of germ cells is
more variable from worm to worm. The stereotyped development makes it possible
to trace somatic cell lineages in exhaustive detail. More than 1000 cell divisions
generate 1090 somatic cells during hermaphrodite development, but 131 of
these cells undergo apoptotic cell death. Thus, precise regulation of both cell division
and cell death determines the final numbers of somatic cells in the worm. In
fact, genetic screens in C. elegans identified the first genes responsible for apoptosis
and its regulation, thereby revolutionizing our molecular understanding of
this form of programmed cell death (discussed in Chapter 18).
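
The cell counts quoted above are consistent with one another, as a short check shows. The function name is hypothetical; the numbers are those given in the text for the hermaphrodite.

```python
def surviving_somatic_cells(generated: int, apoptotic: int) -> int:
    """Somatic cells remaining after programmed cell death removes some of those generated."""
    if apoptotic > generated:
        raise ValueError("cannot remove more cells than were generated")
    return generated - apoptotic

generated = 1090  # somatic cells produced during hermaphrodite development
apoptotic = 131   # cells eliminated by apoptosis
adult = surviving_somatic_cells(generated, apoptotic)
print(adult)  # 959
```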

The final number of somatic cells in the adult worm is already present at sexual
maturity (around three days after fertilization), after which no more somatic
cells are generated. Yet the worm continues to grow, doubling in size between
sexual maturity and death 2-3 weeks later. This doubling results from somatic cell
growth: although the cells no longer divide, they continue to go through rounds of
DNA synthesis; this endoreplication of the genome makes the cells polyploid. As in
all organisms, the size of a cell is proportional to its ploidy (that is, the number of
genome copies that it contains): a doubling of ploidy roughly doubles cell volume.
By artificial manipulation of somatic cell ploidy, and thereby somatic cell size, the
size of the worm as a whole can be increased or decreased. Thus the worm's final
size is set by a combination of programmed cell divisions and cell deaths, along
with regulation of the sizes of individual cells through changes in ploidy.

In plants, as in animals, cell size increases as ploidy increases (Figure 21-59).
This effect has been exploited in the agricultural breeding of plants for large size: 
most of the major fruits and vegetables that we consume are polyploid. 

Animals and Organs Can Assess and Regulate Total Cell Mass

The size of an animal or organ depends on both cell number and cell size: that
is, on total cell mass. Remarkably, many animals and organs can somehow assess
their total cell mass and regulate it, providing evidence for feedback controls of
the sort highlighted earlier in our introductory account of general principles of
growth control. In contrast with C. elegans, if cell size is artificially increased or
decreased in these cases, cell numbers adjust to maintain a normal total cell mass.
This has been beautifully illustrated by experiments done long ago in salamanders,
where cell size can be manipulated by altering the animal's ploidy. As shown in
Figure 21-59E, salamanders of different ploidies end up being the same size with
very different numbers of cells. The individual cells in a pentaploid salamander,
Figure 21-59 Effects of ploidy on cell size and organ size. In all organisms, from bacteria to humans, cell size is proportional
to ploidy, the number of copies of the genome per cell. This is illustrated for (A-D) Arabidopsis flowers and (E) for salamanders.
In each case, the upper panels show cells in a specific tissue [a petal for Arabidopsis, a pronephric (kidney) tubule for the
salamander]; the lower panels show the gross anatomy: flowers for Arabidopsis, the whole body for the salamander. In the
case of Arabidopsis flowers, increase in cell size increases organ size. By contrast, the salamander and its individual organs
attain their normal standard size regardless of ploidy, because large cell size is compensated for by fewer cells. This indicates
that the size of an organism or organ in this species is not controlled simply by counting cell divisions or cell numbers; size
must somehow be regulated at the level of total cell mass. [A-D, from C. Breuer et al., Plant Cell 19:3655-3668, 2007. With
permission from the American Society of Plant Biologists; E, adapted from G. Fankhauser, in Analysis of Development
(B.H. Willier, P.A. Weiss and V. Hamburger, eds), pp. 126-150. Philadelphia: Saunders, 1955.]


for example, are about five times the size of those in a haploid salamander, but
there are only one-fifth as many cells. This scaling operates not only in the body as
a whole, but in its individual organs.
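
The compensation seen in salamanders amounts to holding the product of cell size and cell number constant. The sketch below assumes that cell size is exactly proportional to ploidy and that compensation is perfect; the function names and the haploid cell number of one million are illustrative, not measured values.

```python
import math

def compensated_cell_number(ploidy: int, haploid_cell_number: float) -> float:
    """Cell number needed to keep total cell mass constant when cell size scales with ploidy."""
    return haploid_cell_number / ploidy

def total_cell_mass(ploidy: int, haploid_cell_number: float, haploid_cell_size: float = 1.0) -> float:
    """Total cell mass = cell size x cell number, with cell size proportional to ploidy."""
    cell_size = haploid_cell_size * ploidy
    return cell_size * compensated_cell_number(ploidy, haploid_cell_number)

HAPLOID_CELLS = 1_000_000  # hypothetical cell number in a haploid animal
for ploidy in range(1, 6):
    cells = compensated_cell_number(ploidy, HAPLOID_CELLS)
    mass = total_cell_mass(ploidy, HAPLOID_CELLS)
    print(ploidy, round(cells), round(mass))
```

In this idealized picture a pentaploid animal has cells five times larger but one-fifth as many of them, so the total is unchanged.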

The imaginal discs of Drosophila provide another striking example of homeostatic
size control. These are epithelial pouches that grow by cell proliferation
during the larval period and, during the pupal stage, form the organs and extremities
of the adult fly (Figure 21-60). Experiments have been chiefly done on the
wing imaginal disc. Mutations in components of the cell-cycle control machinery
can be used to speed up or slow down the rate of cell division in the disc. Remarkably,
such mutations can result in an excessive number of abnormally small cells



Figure 21-60 The imaginal discs in
the Drosophila larva (below) and the
structures in the adult (above) that they
give rise to. [After J.W. Fristrom et al., in
Problems in Biology: RNA in Development
(E.W. Hanley, ed.), p. 382. Salt Lake City:
University of Utah Press, 1969.]
Figure 21-61 Pituitary dwarf and pituitary giant. The “giant” on the
right is Robert Wadlow (1918-1940), the tallest recorded man at 8 feet
11 inches (2.72 m), together with his father, who was almost 6 feet tall
(1.82 m). The dwarf on the left is General Tom Thumb, which was the stage
name of Charles Sherwood Stratton (1838-1883). On his 18th birthday, he
was measured at 2 feet 8.5 inches (82.6 cm) tall, and at his death, he was
3 feet 4 inches (102 cm). (Images from http://en.wikipedia.org/wiki/File:Robert_Wadlow.jpg.
© Bettmann/CORBIS.)


or a reduced number of abnormally large cells, respectively, leaving the size (area)
and patterning of the adult wing practically unchanged. Thus, the size of the disc
is not regulated so as to contain a set number of cells. Instead, there must be a
regulatory mechanism that halts growth when the disc's total cell mass reaches
the appropriate value, so that the size and pattern of the adult wing that develops
from the disc are normal. Remarkably, developing discs, or even disc fragments,
taken out of their normal context and transplanted into the abdomen of an adult
female, will grow until they reach their normal size. Clearly, the mechanisms
that regulate disc size are intrinsic to the disc.

We still have very little idea how organisms or organs assess their total cell
mass or monitor their own growth. Nevertheless, we are beginning to understand
some of the signal molecules that drive or halt growth in response to the mysterious
cues that convey information about the size attained.



Extracellular Signals Stimulate or Inhibit Growth

We have already seen how some signals act systemically as hormones to regulate
the development of the animal as a whole. Some of these serve to regulate growth.
In mammals, for instance, growth hormone (GH) is secreted by the pituitary
gland into the bloodstream and stimulates growth throughout the body: excessive
production of growth hormone leads to gigantism, and too little leads to dwarfism
(Figure 21-61). Pituitary dwarfs have bodies and organs that are proportionately
small, unlike achondroplastic dwarfs, for example, whose limbs are disproportionately
short, usually because of a mutation in a gene encoding an FGF receptor
that disrupts normal cartilage development (Figure 21-62).

Growth hormone stimulates growth largely by inducing the liver and other
organs to produce insulin-like growth factor 1 (IGF1), which acts mainly as a local
signal within many tissues to increase cell survival, cell growth, cell proliferation,
or some combination of these, depending on the cell type. Large breeds of dogs
such as Great Danes owe their great size to high levels of IGF1, while miniature
breeds such as Chihuahuas have low levels (see Figure 21-57).

Not all growth-regulating extracellular signals stimulate growth; some inhibit
it, by promoting cell death or inhibiting cell growth, cell division, or both. Myostatin
is a TGFβ family member that specifically inhibits the growth and proliferation
of myoblasts, the precursor cells that fuse to form the huge, multinucleated cells
of skeletal muscle. When the Myostatin gene is deleted in mice, muscles grow to
be several times larger than normal. Remarkably, two breeds of cattle that were
bred for large muscles have both turned out to have mutations in the Myostatin
gene; whippet dogs mutant for Myostatin develop similarly (Figure 21-63).


Figure 21-62 Achondroplasia. This type of dwarfism occurs in one of 
10,000-100,000 births; in more than 99% of cases it results from a mutation 
at an identical site in the genome, corresponding to amino acid 380 in the 
FGF receptor FGFR3 (a glycine in the transmembrane domain). The mutation 
is dominant, and almost all cases are due to new, independently occurring
mutations, implying an extraordinarily high mutation rate at this particular site
in the genome. The defect in FGF signaling causes dwarfism by interfering
with the growth of cartilage in developing long bones. (From Velasquez’s
painting of Sebastian de Morra. © Museo del Prado, Madrid.)
Figure 21-63 Myostatin limits muscle growth. A wild-type whippet dog and a bully whippet
that lacks myostatin. (A, from http://www.merlinanimalrescue.co.uk/dogs/?m=201211 ; B, from 
http://animalslook.com/schwarzenegger-dog/.) 


Like TGFβ itself, myostatin acts through the Smad intracellular signaling pathway
(see Figure 15-57) to inhibit muscle growth specifically. Another intracellular
signaling pathway, called the Hippo pathway, inhibits organ and organism growth
more generally. It was discovered in Drosophila, but it operates in vertebrates as
well. It inhibits growth both by promoting cell death (by blocking an apoptosis
inhibitor) and by inhibiting cell-cycle progression (by inhibiting the expression
of the cell-cycle gene Cyclin E). Some components of the pathway in Drosophila
are shown in Figure 21-64. The organs of animals that are abnormally resistant to
Hippo repression can grow to a monstrous size (Figure 21-65).
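
The logic of the Hippo cascade, as described in Figure 21-64, can be written as a chain of on/off switches. This is a deliberately simplified Boolean sketch: real kinase activities are graded, and the function and parameter names (including `yorkie_escapes_hippo`, which stands in for the Yap-overactivity situation of Figure 21-65) are hypothetical.

```python
from typing import Dict

# Growth-promoting targets of Yorkie/Yap named in Figure 21-64.
YORKIE_TARGETS = ("Myc", "Cyclin E", "Diap", "Bantam")

def hippo_pathway(hippo_active: bool, yorkie_escapes_hippo: bool = False) -> Dict[str, bool]:
    """Return which Yorkie/Yap target genes are transcribed.

    Hippo phosphorylates and activates Warts; Warts phosphorylates and
    inactivates Yorkie/Yap; unphosphorylated Yorkie/Yap activates its targets.
    """
    warts_active = hippo_active
    yorkie_phosphorylated = warts_active and not yorkie_escapes_hippo
    yorkie_active = not yorkie_phosphorylated
    return {gene: yorkie_active for gene in YORKIE_TARGETS}

print(hippo_pathway(hippo_active=True))   # all targets off: growth restrained
print(hippo_pathway(hippo_active=False))  # all targets on: growth promoted
print(hippo_pathway(hippo_active=True, yorkie_escapes_hippo=True))  # targets on despite Hippo
```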

It is important to note that in all species nutritional conditions also play a fundamental
part in regulating the pace and extent of growth, and in animals they do
so through hormonal signal networks that are highly conserved between vertebrates
and invertebrates. Although we do not have space for details here, genetic
experiments, especially in Drosophila, have begun to unravel the logic of these
controls, and to indicate how they may operate alongside other machinery, such
as the Hippo pathway, to determine final size.


Figure 21-64 Hippo pathway. Hippo, 
a protein kinase, limits growth by 
phosphorylation and activation of the kinase 
Warts, which in turn phosphorylates and
inactivates the transcriptional coactivator
Yorkie (called Yap in vertebrates). When
unphosphorylated, Yorkie/Yap drives tissue
growth: it activates the transcription of
the growth-promoting gene Myc, the cell- 
cycle progression gene Cyclin E, the anti- 
apoptotic gene Diap, and the microRNA 
Bantam. Hippo-induced phosphorylation of 
Yorkie/Yap blocks this effect. 


Summary 

The sizes of animals and their organs vary widely and largely depend on total cell
mass. This in turn depends on the size and number of cells, which are increased
through cell growth and cell division, respectively. Cell numbers are reduced by
programmed cell death. Each of these processes depends on both intracellular and



Figure 21-65 Overcoming Hippo repression increases organ size. (A) Livers from control and
Yap-overexpressing mice. In these mice, Hippo signaling is insufficient to block Yap. (B) Adult heads
from control and Yap-overexpressing flies. In the mutant flies, Hippo signaling is unable to block
Yap. (From J. Dong et al., Cell 130:1120-1133, 2007. With permission from Elsevier.)
extracellular signals. The mystery is how these processes are regulated and coordinated
to produce and maintain the characteristic final size of the adult organ or
animal.

Some signals such as survival factors, growth factors, and mitogens stimulate
growth by promoting cell survival, cell growth, and cell division, respectively, while
other signal molecules do the opposite. Although most of these signals operate
locally to help sculpt the size and shape of the animal, its organs, and appendages,
others act as hormones to regulate the growth of the animal as a whole. Nutrients
can regulate growth through hormonal signals in the entire body.

Many animals and organs can, by unknown mechanisms, assess their total cell
mass and regulate it. If, for example, cell size is artificially increased or decreased in
these cases, cell numbers adjust to maintain a normal total cell mass. Conversely, if
cell numbers are artificially increased or decreased, cell size adjusts to compensate.

NEURAL DEVELOPMENT 

The development of the nervous system poses problems that have little parallel
in other tissues. A typical nerve cell, or neuron, has a structure unlike that of any
other class of cells, with a long axon and branching dendrites, both of which make
many synaptic connections to other cells (Figure 21-66). The central challenge of
neural development is to explain how the axons and dendrites grow out, find their
right partners, and synapse with them selectively to create a neural network (an
electrical signaling system) that functions correctly to guide behavior (Figure
21-67). The problem is formidable: the human brain contains more than 10¹¹
neurons, each of which, on average, has to make connections with a thousand
others, according to a regular and predictable wiring plan. The precision required
is not so great as in a man-made computer, because the brain performs its computations
in a different way and is more tolerant of vagaries in individual components.
But the human brain nevertheless outstrips all other biological structures
in its organized complexity.
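
To give a sense of the scale of the wiring problem, the figures quoted above imply on the order of 10^14 connections in total. The sketch below simply multiplies the two numbers from the text; the variable names are illustrative, and the result is an order-of-magnitude estimate, not a measured synapse count.

```python
neurons = 10**11               # lower bound on neuron number quoted in the text
connections_per_neuron = 1000  # average number of partners per neuron, from the text

total_connections = neurons * connections_per_neuron
print(f"{total_connections:.0e}")  # 1e+14
```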

The components of a typical nervous system (the various classes of neurons,
glial cells, sensory cells, and muscles) originate in a number of widely separate
locations in the embryo. Thus, in the first phase of neural development, the different
parts of the nervous system develop according to their own local programs:
neurons are born and assigned specific characters according to the place and
time of their birth, under the control of inductive signals and transcription regulators,
by mechanisms of the types we have already discussed. In the next phase,
newborn neurons extend axons and dendrites along specific routes toward their
target cells, guided by extracellular signals that attract or repel them. In the third
phase, neurons form synapses with other neurons or muscle cells, setting up a
provisional but orderly network of connections. In the final phase, which continues
into adult life, the synaptic connections are adjusted and refined through
mechanisms that usually depend on synaptic signaling between the cells involved



Figure 21-66 A typical neuron of a 
vertebrate. The arrows indicate the
direction in which signals are conveyed.
The neuron shown is a basket cell, a type
of neuron in the cerebellum. (Adapted from
S. Ramón y Cajal, Histologie du Système
Nerveux de l’Homme et des Vertébrés, 
1909-1911. Paris: Maloine; reprinted, 
Madrid: C.S.I.C., 1972.) 
Figure 21-67 The complex organization of nerve cell connections. This
drawing depicts a section through a small part of a mammalian brain (the
olfactory bulb of a dog) stained by the Golgi technique. The black objects
are neurons; the thin lines are axons and dendrites, through which the various
sets of neurons are interconnected according to precise rules. (From
C. Golgi, Riv. sper. freniat. Reggio-Emilia 1:405-425, 1875.)

(Figure 21-68). At all stages, neurons are in intimate contact with various types of
non-neuronal supporting cells, the glial cells.

Neurons Are Assigned Different Characters According to the Time
and Place of Their Birth

We start our account here with the first phase of neural development: the generation
of neural progenitors and their differentiation into hundreds of different
neuronal subtypes, along with a much smaller number of glial types. Although the
nervous system is exceptional in the extent of cell diversity, the process depends
on the same principles that generate different cell types in other organs. We have
already discussed some of the underlying machinery in the developing Drosophila
nervous system. We turn now to vertebrates.

The vertebrate spinal cord, the brain, and the retina of the eye together constitute
the central nervous system (CNS). They all originate as parts of the neural
tube, whose formation was described earlier (see Figure 21-56). The brain and
eyes develop from the anterior neural tube and the spinal cord from the posterior.

The developmental anatomy is seen at its simplest in the spinal cord. As it
develops, the epithelium forming the walls of the posterior neural tube becomes
enormously thickened as the cells proliferate and differentiate, creating a highly
organized structure of neurons and glial cells, surrounding a small central channel.
Bands of neurons with different future functions, and expressing different
genes, are laid out along the dorsoventral axis of the tube. Motor neurons (those
that control the muscles) are located ventrally, whereas neurons that process sensory
information are found dorsally. This pattern is established by opposing gradients
of morphogens. These are secreted by specialized groups of cells that run the
length of the ventral and dorsal midlines of the neural tube (Figure 21-69). The
two morphogen gradients (consisting of Sonic hedgehog protein from the ventral
source and BMP and Wnt from the dorsal source) help induce different groups
of proliferating neural progenitor cells and differentiating neurons to express different
combinations of transcription regulators. These regulators in turn drive the
production of different combinations of neurotransmitters, receptors, cell-cell
adhesion proteins, and other molecules, creating terminally differentiated neurons
that will form synaptic connections selectively with the right partners and
exchange appropriate signals with them.
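
The idea of two opposing morphogen gradients can be illustrated with a toy model. Everything quantitative here is an assumption made for illustration: the exponential gradient shape, the decay length, the normalization of the dorsoventral axis to the interval 0 to 1, and the rule that a cell simply adopts the character of whichever signal is stronger. Real progenitors read the signals through combinations of transcription regulators and form many more than two domains.

```python
import math

LENGTH_CONSTANT = 0.3  # hypothetical decay length, as a fraction of the dorsoventral axis

def shh_level(position: float) -> float:
    """Relative Sonic hedgehog level; position 0.0 is the ventral midline (floor plate)."""
    return math.exp(-position / LENGTH_CONSTANT)

def dorsal_signal_level(position: float) -> float:
    """Relative BMP/Wnt level; position 1.0 is the dorsal midline (roof plate)."""
    return math.exp(-(1.0 - position) / LENGTH_CONSTANT)

def regional_character(position: float) -> str:
    """Classify a position as 'ventral' or 'dorsal' by the stronger of the two signals."""
    if not 0.0 <= position <= 1.0:
        raise ValueError("position must lie between 0.0 (ventral) and 1.0 (dorsal)")
    if shh_level(position) > dorsal_signal_level(position):
        return "ventral"
    return "dorsal"

for position in (0.1, 0.4, 0.6, 0.9):
    print(position, regional_character(position))
```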



Figure 21-68 The four phases of neural 
development. 
Figure 21-69 A schematic cross section
of the spinal cord of a chick embryo,
showing how cells at different levels
along the dorsoventral axis acquire
different characters. (A) Signals that direct
the dorsoventral pattern. Sonic hedgehog 
protein from the notochord and the floor 
plate (the ventral midline of the neural 
tube) and BMP and Wnt proteins from 
the roof plate (the dorsal midline) act as 
morphogens to control gene expression. 

(B) The resulting patterns of cell fates 
in the developing spinal cord. Different 
groups of proliferating neural progenitor 
cells (in the ventricular zone, close to 
the lumen of the neural tube) and of 
differentiating neurons (in the mantle zone, 
further out) express different combinations 
of transcription regulators. Neurons 
expressing different transcription regulators 
will form connections with different partners 
and may make different combinations 
of neurotransmitters and receptors. 

Colors represent different cell types and 
combinations of regulatory proteins. 


Extracellular morphogen gradients, however, are not the only way to generate 
cell diversity. As we saw earlier in our discussion of Drosophila neuroblasts (see 
Figure 21-36), different cell types can also be generated by temporal patterning, 
in which an intracellular program changes the character of a progenitor cell over 
time, giving rise to different cell types as development progresses. This
mechanism also seems to operate in vertebrate neurogenesis. The most striking
illustration comes from study of another part of the CNS—the mammalian cerebral
cortex.

Although the cerebral cortex is the most complex structure in the human
body, it has a simple beginning—from the anterior neural tube. As in the spinal
cord, the cells that form the walls of the tube proliferate, and the neuroepithelium
thickens and expands as they divide. On a predictable schedule, the divisions
of the neuroepithelial cells begin to produce a succession of cells committed to
terminal differentiation as neurons. These future neurons are born close to the
lumen (the central cavity) of the tube. From here, they migrate outward, losing
attachment to the lumenal surface and crawling along neighboring cells
that continue to span the full thickness of the neuroepithelium. These latter
neuroepithelial cells do double duty, functioning as progenitors of neurons and glia,
and as supporters of the epithelial architecture. They become stretched out as
radial glial cells, forming a scaffold that continues to span the neuroepithelium
even as this grows to an enormous thickness (Figure 21-70). At the same time,
the radial glial cells continue to divide as neural precursors, giving rise to both
neurons and glial cells—new radial glial cells as well as glial cells of other types.
The newborn neurons, migrating along the radial glial cells, find their appropriate
resting places in the developing cortex, where they mature, and from these sites
they send out their axons and dendrites. The first-born neurons settle closest to
their birthplace near the lumen, while neurons born later crawl past them to
settle farther out (Figure 21-71). The successive generations of neurons thus build
up as a series of cortical layers, ordered by birthdate and endowed with different
intrinsic characters.

Strikingly, single cortical progenitor cells isolated in culture generate distinct
types of cortical neurons and glial cells, with the timing and characteristics
appropriate to specific cortical layers. These observations suggest that the neural
progenitors in the developing mammalian cortex, much like the Drosophila
neuroblasts, step through an intracellular developmental program that generates the
ordered succession of different nerve cell types.

The Growth Cone Pilots Axons Along Specific Routes Toward
Their Targets

According to the character assigned to it during its early development, a neuron
will proceed to make connections with specific partners. This phase of neural
development involves a type of morphogenesis unique to the nervous system, in
which axons and dendrites extend along specific routes toward their target cells.
A typical neuron sends out one long axon and many dendrites, which are usually
shorter. The axon projects to distant target cells to which the neuron will
eventually send signals. The dendrites will receive incoming signals from axon terminals
of other neurons. Axons and dendrites extend by growth at their tip, where one
sees an irregular, spiky enlargement called a growth cone (Figure 21-72 and
Movie 21.6). The growth cone is both the engine that produces the crawling
movement and the steering apparatus that directs the tip along the proper path.
Cytoskeletal machinery in the growth cone creates active protrusions, in the form of
filopodia and lamellipodia (see Chapter 16 for details): when such a protrusion








[Figure labels: layers of cortical neurons; last-born neurons; first-born neurons; dividing progenitor cell; radial glial cell]

Figure 21-70 Migration of immature 
neurons. Before sending out axons 
and dendrites, newborn neurons often 
migrate from their birthplace and settle in 
another location. The diagrams are based 
on reconstructions from sections of the 
cerebral cortex (part of the neural tube) of
a monkey and rely on a staining technique 
that picks out at random a small subset of 
the whole dense mass of neuroepithelial 
cells. The neurons go through their final 
cell division close to the inner, lumenal 
face of the neural tube (in the ventricular 
proliferative zone) and then migrate
outward by crawling along radial glial cells 
that form a scaffold. Each of these latter 
cells extends from the inner to the outer 
surface of the tube, a distance that may be 
as long as 2 cm in the cerebral cortex of
the developing brain of a primate. 

The radial glial cells can be considered 
as persisting cells of the original columnar 
epithelium of the neural tube that become 
extraordinarily stretched as the wall of 
the tube thickens. They also serve as 
neural stem cells: depending on stage 
and region, the newborn neurons can
be generated from radial glial cells that 
undergo mitosis while their nuclei are close 
to the inner surface of the tube, or they 
can be generated from a nearby class of 
specialized progenitors in the ventricular 
proliferative zone. (After P. Rakic, J. Comp.
Neurol. 145:61-84, 1972. With permission 
from John Wiley & Sons, Inc.) 


Figure 21-71 Programmed production 
of different types of neurons at different 
times from dividing progenitors in 
the cerebral cortex of the brain of a
mammal. Close to one face of the cortical 
neuroepithelium, progenitor cells divide, in 
stem-cell fashion, to produce successive 
generations of neurons (colored here
blue, green, red, orange, and black). The 
neurons migrate out toward the opposite 
face of the epithelium by crawling along the 
surfaces of radial glial cells, as shown in 
Figure 21-70. The first-born neurons settle 
closest to their birthplace, while neurons 
born later crawl past them to settle farther 
out. Successive generations of neurons
thus occupy different layers in the cortex
and have different intrinsic characters
according to their birth dates. 

contacts an unfavorable surface, it withdraws; when it contacts a more favorable
surface, it persists longer, steering the growth cone in that direction. In this way,
the growth cone is guided by subtle variations in the properties of the surfaces
over which it moves. At the same time, it is sensitive to specific signaling
molecules, which—as we discuss next—can either encourage or hinder its advance.

A Variety of Extracellular Cues Guide Axons to their Targets 

Growth cones generally travel toward their targets along predictable routes,
according to programs stored in the memory of the particular neuron to which
they belong (Movie 21.7). In the simplest case, a growth cone can take a route that
has been pioneered by other neurites, which it follows by contact guidance. As a
result, nerve fibers in a mature animal are usually found grouped together in tight
parallel bundles (called fascicles or fiber tracts). Such crawling of growth cones
along axons is partly mediated by homophilic cell-cell adhesion molecules—
membrane glycoproteins that help a cell displaying them to stick to any other cell
that displays the same molecules. As discussed in Chapter 19, many homophilic
adhesion molecules fall into one of two main classes: they are members of either
the immunoglobulin superfamily, such as N-CAM, or the Ca2+-dependent
cadherin family, such as N-cadherin. Members of both families are generally
present on the surfaces of growth cones, of axons, and of various other cell types that
growth cones crawl over, including glial cells in the central nervous system and
muscle cells in the periphery of the body. Growth cones also migrate over
components of the extracellular matrix. When tested with neurons growing in a culture
dish, some of the matrix molecules, such as laminin, favor axon outgrowth, while
others, such as chondroitin sulfate proteoglycans, discourage it. But exactly how
the matrix functions to guide axons in intact animals remains to be discovered.

Growth cones are generally guided by a succession of different cues at
different stages of their journey, as summarized in Figure 21-73. Many of these cues
involve specific signaling molecules. Some of these are encountered in the
extracellular matrix, while others are attached to the plasma membrane of cells that
the growth cones touch. Another important part is played by chemotactic factors;
these are proteins secreted from cells that act as beacons at strategic points along
the path—some attracting, others repelling. The trajectory of commissural axons—
axons that cross from one side of the body to the other—provides a well-studied
example.

Commissural axons are a general feature of bilaterally symmetrical animals,
such as us, because they are required to coordinate behavior of the two sides of
the body. In the developing spinal cord of a vertebrate, for example, a large
number of neurons send their axonal growth cones ventrally toward the floor plate
(the same structure that we encountered earlier as a source of the morphogen
Sonic hedgehog—see Figure 21-69). The growth cones cross the floor plate and
then turn abruptly through a right angle to follow a longitudinal path up toward
the brain, parallel to the floor plate but never again crossing it (Figure 21-74). The
first stage of the journey depends on a concentration gradient of the signal protein
Netrin, secreted by the cells of the floor plate: the commissural growth cones sniff
their way toward its source.


Figure 21-72 Internal architecture
of a neuronal growth cone, as seen
in culture on a flat substratum. The
growth cone forms as an expansion of
the tip of the growing axon. (A) Image by
interference-contrast microscopy.
(B) Immunostaining to show microtubules
(green). (C) Immunostaining to show
actin filaments (red). (D) Diagram of the
cytoskeletal machinery. Filopodia form
and push forward by assembly of actin
filaments at the leading edge of the
growth cone. Microtubules stabilize the
directional decisions made by the actin-rich
protrusions. Filopodia adhering to the flat
substratum contract and pull the growth
cone forward. (Images by Chi-Hung Lin,
Paul Forscher Laboratory, Yale University,
New Haven, CT.)

Figure 21-73 Mechanisms of growth-cone
guidance. Growth cones use a
variety of extracellular cues to navigate
to distant targets. They can adhere to
the extracellular matrix or to the surfaces
of other cells, or they can be repelled by
them; they can crawl, for example, by
homophilic adhesion along the axons of
pioneer neurons; and they can be attracted
or repelled by soluble guidance signals.
(After E. Kandel et al., Principles of Neural
Science, 5th ed., New York: McGraw Hill
Medical, 2012.)


If commissural growth cones are attracted to the floor plate, why do they cross
it and emerge on the other side, instead of staying in the attractive territory? And
having crossed it, why do they never cross back again? The answers lie in a change
in the responsiveness of the growth cones during their journey. As the growth
cones cross the midline, they lose sensitivity to Netrin and become sensitive
instead to a signal protein called Slit (see Figure 21-74). Slit is also produced by
the floor plate, but it has the opposite effect to that of Netrin: it repels the growth

Figure 21-74 The guidance of commissural axons. (A) The pathway taken by commissural axons in the embryonic spinal cord of a vertebrate.
(B) Attraction to the midline. The growth cone is first attracted to the floor plate by Netrin, which is secreted by the floor-plate cells and acts on the
receptor DCC in the axonal membrane. (C) Repulsion from the midline after crossing it. As the growth cone crosses the floor plate, Slit comes into
play: it binds to its receptors Robo1 and Robo2 and acts as a repellent to keep the growth cone from re-entering the floor plate. In addition, it blocks
responsiveness to the attractant Netrin. Before crossing the midline, the commissural neurons express Robo3.1, an alternative splice form of Robo3
that is related to Robo proteins but blocks Slit signaling. As neurites cross the midline, Robo3.1 is lost and growth cones become responsive to Slit
and are repelled from the midline.

cones, preventing them from re-entering the midline territory. The responses of
the growth cone depend on the receptors that it expresses: as commissural
neurons approach the floor plate, the Slit receptors are kept inactive by an inhibitory
protein (Robo3.1) in the same membrane, allowing the commissural axons to
grow to the midline without being repelled. Robo3.1 is lost as the growth cones
cross the midline; now the growth cones become sensitive to repulsion by Slit and
are thereby prevented from crossing back to the other side. At the same time,
signals from the Slit receptors interfere with those from the Netrin receptors, making
the growth cones deaf to the signal that attracted them to the floor plate initially.
A similar mechanism, using similar proteins, seems to govern midline crossing of
commissural axons in other animals, including flies and worms.
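
The switch in responsiveness at the midline can be summarized as a small piece of state logic. The Python sketch below is a deliberately simplified illustration: the function name, the Boolean treatment of sensitivity, and the assumption that the switch is instantaneous are all hypothetical simplifications of the description above.

```python
def growth_cone_response(has_crossed_midline):
    """Return the cue sensitivities of a commissural growth cone (toy model).

    Before crossing, Robo3.1 keeps the Slit receptors inactive, so only
    Netrin attraction acts. After crossing, Robo3.1 is lost: Slit repels the
    growth cone, and Slit-receptor signaling silences the Netrin response.
    """
    robo3_1_present = not has_crossed_midline
    slit_sensitive = not robo3_1_present
    netrin_sensitive = not slit_sensitive
    return {
        "netrin_attraction": netrin_sensitive,
        "slit_repulsion": slit_sensitive,
    }

print(growth_cone_response(has_crossed_midline=False))
print(growth_cone_response(has_crossed_midline=True))
```

The same floor plate therefore reads as attractive on the way in and repulsive on the way out, purely because the state of the growth cone has changed.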

The guidance of commissural axons illustrates how axons rarely navigate
directly to their targets. Instead, they use intermediate targets, or guideposts, and
switch their sensitivities as they move from one local guidepost to the next,
steering their way through a complex environment to a far-away destination.

The Formation of Orderly Neural Maps Depends on Neuronal
Specificity

In many cases, neurons of a similar type are laid out in a broad array of different
positions, but send out axons that come together for their journey and arrive at
the target region in a tight bundle. There the axons disperse again, to terminate
at different sites in the target territory. This they do in an orderly way, creating a
regular mapping from one territory to another—a neural map.

The axon projection from the eye to the brain provides an important
example. The neurons in the retina that convey visual information back to the brain
are called retinal ganglion cells (RGCs). There are more than a million of them in
humans, each one reporting on a different part of the visual field. Their axons
converge on the optic nerve head at the back of the eye and travel together along the
developing optic nerve toward the brain. Their main site of termination, in most
vertebrates other than mammals, is the optic tectum—a broad expanse of cells in
the midbrain. In connecting with tectal neurons, the RGC axons distribute
themselves in a predictable pattern according to the arrangement of their cell bodies in
the retina: RGCs that are neighbors in the retina connect with target cells that are
neighbors in the tectum. The orderly projection creates a retinotopic map of visual
space on the tectum (Figure 21-75).

Orderly maps of this sort are found in many brain regions. In the auditory
system, for example, the neurons that project from the ear to the brain form a
tonotopic map in which brain cells receiving information about sounds of different
pitch are ordered along a line, like the keys of a piano. And in the somatosensory


Figure 21-75 The neural map from
eye to brain in a young zebrafish.
(A) Diagrammatic view, looking down on
the top of the head. (B) Fluorescence
micrograph. Fluorescent tracer dyes have
been injected into each eye—red into the
anterior part, green into the posterior part.
The tracer molecules have been taken up
by the neurons in the retina and carried
along their axons, revealing the paths they
take to the optic tectum in the brain and
the map that they form there. (Courtesy
of Chi-Bin Chien, from D.H. Sanes,
T.A. Reh and W.A. Harris, Development
of the Nervous System. San Diego, CA:
Academic Press, 2000.)

Figure 21-76 A map of the body surface
in the human brain. The surface of the 
body is mapped onto the somatosensory 
region of the cerebral cortex by using an
orderly system of nerve cell connections
to pair body sites with the brain sites that 
receive their sensory information. This 
means that the map in the brain is largely 
faithful to the topology of the body surface, 
even though different body regions are
represented at different magnifications 
according to their density of innervation. 
The homunculus (the “little man” in the 
brain) has big lips, for example, because 
the lips are a particularly large and 
important source of sensory information. 
The map was determined by stimulating 
different points in the cortex of conscious 
patients during brain surgery and recording 
what they said they felt. (After W. Penfield 
and T. Rasmussen, The Cerebral Cortex of
Man. New York: Macmillan, 1950.) 


system, neurons conveying information about touch map onto the cerebral cortex
so as to mark out a "homunculus"—a small, distorted, two-dimensional image of
the body surface (Figure 21-76).

The retinotopic map of visual space in the optic tectum is the best characterized
of all these maps. How does it arise? A famous experiment in the 1940s on frogs
provided an important clue. If the optic nerve of a frog is cut, it will regenerate. The
retinal axons grow back to the optic tectum, restoring normal vision. If, however,
the eye is in addition rotated in its socket at the time of cutting of the nerve, so as
to put originally ventral retinal cells in the position of dorsal retinal cells, vision is
still restored, but with an awkward flaw: the animal behaves as though it sees the
world upside down and left-right inverted (Figure 21-77). If food is dangled in
front of it, for example, it will lunge perversely backward. This is because the
misplaced retinal cells make the connections appropriate to their original, not their
actual, positions. It seems that the retinal ganglion cells (RGCs) have positional
values—position-specific biochemical properties representing records of their
original location in the retina, assigned perhaps by earlier morphogen gradients,
and making RGCs on opposite sides of the retina intrinsically different.

Such nonequivalence among neurons is referred to as neuronal specificity. 
It is this intrinsic characteristic that guides the retinal axons to their appropriate 
target sites in the tectum. Those target sites themselves are distinguishable by the 



Figure 21-77 Neurons in different
regions of the retina project axons
to different regions in the tectum.
(A) Neurons (RGCs) in the anterior retina
project axons to the posterior tectum (as
shown in Figure 21-75 for zebrafish).
(B) Regeneration experiments show
that retinal neurons have an intrinsic
preference for the part of the tectum they
normally connect to. If the eye is surgically
rotated when the optic nerve is cut, the
regenerating retinal axons connect to their
original targets, creating an inverted map.
(After E. Kandel et al., Principles of Neural
Science, 5th ed., New York: McGraw Hill
Medical, 2012.)

Figure 21-78 Selectivity of retinal axons growing over tectal membranes. (A) Diagram of an experiment performed with
cells from a chick embryo. The culture substratum is coated with alternating stripes of membrane prepared either from posterior
tectum or from anterior tectum. Axons from posterior retina grow on anterior tectal membrane but are repelled by posterior
tectal membrane. Axons from anterior retina show different (less selective) behavior. (B) Photograph of results. The retinal axons,
growing out from the left, are made visible by staining them with a fluorescent marker. The selective pattern of outgrowth shows
that anterior tectum differs from posterior tectum, and anterior retina correspondingly differs from posterior retina. In the intact
organism, this serves to orient a retinotopic map; the map is refined by subsequent competitive interactions among the anterior
and posterior retinal axons, which push the anterior retinal cells off anterior tectal territory. (From J. Walter et al., Development
101:685-696, 1987. With permission from the Company of Biologists.)


retinal axons because the tectal cells also carry positional labels. Thus, the neural
map depends on a correspondence between two systems of positional markers,
one in the retina and the other in the tectum.

How are these markers used to make the map? When posterior axons are
allowed to grow out over a carpet of anterior or posterior tectal membranes in
a culture dish, they show selectivity. Posterior axons strongly prefer the anterior
tectal membranes, as in vivo, whereas anterior axons show no preference or
prefer posterior tectal membranes (Figure 21-78). The key difference between
anterior and posterior tectum is not an attractive factor on the anterior tectum but a
repulsive factor on the posterior tectum, to which posterior retinal axons are
sensitive but anterior retinal axons are not. If a posterior retinal growth cone touches
posterior tectal membrane, it collapses its filopodia and withdraws.

In this system, as in others that we have mentioned, the repulsive interactions
are mediated by ephrin-Eph signaling—specifically, EphrinA-EphA signaling
for the anteroposterior axis (Figure 21-79). An analogous mechanism based on
EphB-EphrinB signaling orients the dorsoventral axis of the retinotopic map.

These mechanisms serve to orient the map along both axes, but they are not
enough by themselves to ensure accurate point-to-point detail. This is brought
about through a long process of adjustment that fills in and refines the map
through interactions among the RGC axon terminals as they compete for territory
on the tectum. This refinement of the pattern of connections involves electrical
signaling in the system of developing synapses—a topic that we return to shortly.


Both Dendrites and Axonal Branches From the Same Neuron 
Avoid One Another 


Axons and dendrites from different neurons can repel one another, or they can
cohere; they can collaborate to form synapses, or they can compete. Remarkably,

Figure 21-79 Ephrin signaling orients
the retinotopic map. (A) Neurons in the 
posterior retina express EphA. As their 
axons reach the tectum, they are repelled 
by high levels of EphrinA protein in the 
posterior tectum and project preferentially 
to the anterior tectum. (B) In EphA-mutant 
mice, posterior retinal axons feel no 
such repulsion and project more widely 
within the tectum. (After E. Kandel et al., 
Principles of Neural Science, 5th ed., New 
York: McGraw Hill Medical, 2012.) 


axons or dendrites can also repel each other when they arise from a single
neuron. Such self-avoidance prevents the neuron from making purposeless synapses
with itself; it also helps the cell spread out its processes widely so as to innervate
a broad territory.

Self-avoidance poses a problem. If the same self-recognition molecule were
used in every neuron, all neurons in the brain would repel each other. Some
classes of neurons do show this sort of mutual repulsion, creating solitary
territories—a phenomenon called tiling; but in most cases, axons and dendrites from
different neurons can overlap with one another. How then can the processes put
out by a single neuron distinguish between self and non-self? This conundrum has
been partially resolved by the discovery of a remarkable set of proteins that endow
each neuron with a label unlike that of its neighbors. These are the DSCAM
proteins in Drosophila and the protocadherins in vertebrates. As described in
Chapter 7, DSCAM proteins are extraordinary for the number of isoforms that can be
generated by alternative RNA splicing—more than 30,000 variants for DSCAM1
(see Figure 7-57). Diversity arises from alternative exons that code for three highly
variable extracellular immunoglobulin domains. Each DSCAM1 isoform engages
in homophilic binding (see Figure 19-5), but remarkably, all the variable domains
need to be identical for this to occur. Thus, one cell surface will bind to another via
DSCAM only when the two cell surfaces express identical isoforms. The result of
binding is repulsion, although the detailed mechanisms are poorly understood.

If alternative splicing occurs in a random fashion in each cell, neighboring
processes from different neurons are unlikely to express the same DSCAM1 variant,
so only the processes of the same cell will repel one another. Neurons that lack
all DSCAM1 variants have severe defects in neuronal self-avoidance. Engineering
Drosophila so that all of its neurons produce a single isoform restores
self-avoidance; but now the processes of neighboring neurons express the same isoform
and repel each other, resulting in the phenomenon of tiling (Figure 21-80).
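
A back-of-the-envelope calculation shows why random splicing makes accidental matches between neighbors rare. The Python sketch below rests on simplifying assumptions that are not stated in the text: each neuron is taken to display a single isoform chosen uniformly at random, and the figure of 30,000 isoforms is used as a round lower bound.

```python
def prob_shared_isoform(n_isoforms, n_neighbors):
    """Probability that at least one of n_neighbors independently picks the
    same isoform as a given neuron, assuming one uniformly chosen isoform
    per neuron (a simplifying assumption)."""
    return 1.0 - (1.0 - 1.0 / n_isoforms) ** n_neighbors

# Random choice among 30,000 isoforms, 10 overlapping neighbors (hypothetical number).
print(prob_shared_isoform(30_000, 10))

# Engineered flies in which every neuron makes the same single isoform.
print(prob_shared_isoform(1, 10))
```

With a large isoform pool the chance of a spurious match is a small fraction of a percent, so only sister processes of the same cell repel one another. With a single isoform the probability is 1, which corresponds to the tiling seen in the engineered flies.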

Vertebrate neurons use a similar self-avoidance strategy to pattern their axons
and dendrites, but instead of DSCAMs, they use protocadherins for self/non-self
discrimination. The Protocadherin locus encodes 58 related cadherin-like
transmembrane proteins that are expressed in different combinations in single
neurons. Homophilic recognition results in self-avoidance of dendrites emanating
from the same neuron; neighboring dendrites of different neurons express
different protocadherins and thus evade repulsion. Thus, although insect DSCAM and
vertebrate protocadherin proteins share no sequence homology, they mediate
similar self-avoidance strategies.

Figure 21-80 DSCAM mediates self-avoidance
of dendrites. (A) Sensory
neurons in the Drosophila peripheral
nervous system extend dendrites along
the larval body wall. The image shows the 
dendrites of a regular array of photosensing 
neurons (red), which allow the larva 
to detect and avoid harmful light. The 
posterior epidermal cells of each segment 
are labeled in blue. There are many 
neurons, and those shown here spread 
out their dendrites into overlapping fields. 
(B) Mutations at the Dscam locus upset 
the way the various dendrites interact, 
changing the rules of self-avoidance and 
the distribution of innervation. (A, courtesy 
of Chun Han; B, after D. Hattori et al., 

Annu. Rev. Cell Dev. Biol. 24:597-620, 
2008. With permission from Annual 
Reviews.) 


[Figure 21-80 panel labels: red dendrites repel each other and blue dendrites repel each other, but blue and red do not repel each other. (B) Red dendrites repel each other, but orange dendrites repel neither themselves nor red dendrites. Green dendrites repel each other.]


Target Tissues Release Neurotrophic Factors That Control Nerve 
Cell Growth and Survival 

Eventually, axonal growth cones reach the target region where they must halt and
make synapses. These synapses, as a rule, are destined to transmit neural signals
in one direction, from axon to target cell. The development of synapses, however,
depends on signaling in both directions: signals from the target tissue not only
help control which growth cones synapse where (as we discuss shortly), but can
also regulate how many of the innervating neurons survive.

Many types of vertebrate neurons are produced in excess; up to 50% or more of
some of them die soon after they reach their target, even though they appear
perfectly normal and healthy up to the time of their death. About half of all the motor
neurons that send axons to skeletal muscle, for example, die within a few days
after making contact with their target muscle cells. A similar proportion of the
sensory neurons that innervate the skin die after their growth cones have arrived
there.

This large-scale normal neuronal death often seems to reflect the outcome of
a competition, in which the target tissue releases a limited amount of a specific
neurotrophic factor that the neurons innervating the tissue require to survive;
those that do not get enough die by programmed cell death. If the amount of
target tissue is increased—for example, by grafting an extra limb bud onto the side
of the embryo—more limb-innervating neurons survive; conversely, if the limb
bud is cut off, the same neurons all die (Figure 21-81). In this way, although
individuals may vary in their bodily proportions, they always retain the right number
of motor neurons to innervate all their muscles and the right number of sensory
neurons to innervate their body surface. The strategy of overproduction followed
by death of surplus cells may seem wasteful, but it provides a simple and effective
means to adjust the number of innervating neurons according to the amount of
tissue requiring innervation.
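
The matching of neuron number to target size can be captured in a toy calculation. The Python sketch below is a minimal illustration under stated assumptions: the function name, the idea that each neuron needs a fixed quota of factor, and all the numbers are hypothetical, and real competition is graded rather than all-or-none.

```python
def surviving_neurons(n_neurons, factor_supply, factor_needed_per_neuron=1.0):
    """Number of neurons a target can support in a toy competition model.

    The target releases a limited factor_supply; each neuron is assumed to
    need factor_needed_per_neuron to survive, and neurons that fall short
    die by programmed cell death.
    """
    max_supported = int(factor_supply // factor_needed_per_neuron)
    return min(n_neurons, max_supported)

print(surviving_neurons(100, 50))   # normal target: about half survive
print(surviving_neurons(100, 100))  # extra limb bud grafted: more survive
print(surviving_neurons(100, 0))    # limb bud removed: none survive
```

Because survival is capped by supply rather than fixed in advance, the same overproduced pool of neurons automatically adjusts to whatever amount of target tissue is present.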

The first neurotrophic factor to be identified, and still the best characterized,
is called nerve growth factor (NGF)—the founding member of the neurotrophin
family of signal proteins. It promotes the survival and growth of specific classes of
sensory neurons and of sympathetic neurons (a subclass of peripheral neurons
that control contractions of smooth muscle and secretion from exocrine glands).

Figure 21-81 The survival of motor
neurons depends on signals provided
by the target muscles. (A) Removal of
the limb bud shortly after arrival of motor
axons results in the death of motor neurons 
in the spinal cord on the amputated side. 

(B) Transplantation of an extra limb bud 
increases the survival of motor neurons. 
(After E. Kandel et al., Principles of Neural 
Science, 5th ed., New York: McGraw Hill 
Medical, 2012.) 

NGF is produced by the tissues that these neurons innervate. When extra NGF is
provided, extra sensory and sympathetic neurons survive, just as if extra target
tissue were present. Conversely, in a mouse with a mutation that inactivates the
gene for NGF or for its receptor (a receptor tyrosine kinase called TrkA), almost
all sympathetic neurons and the NGF-dependent sensory neurons are lost. There
are many neurotrophic factors, only a few of which belong to the neurotrophin
family, and they act in different combinations to promote the survival and growth
of different classes of neurons.

Formation of Synapses Depends on Two-Way Communication
Between Neurons and Their Target Cells

At journey's end, the task of a growth cone is to halt its travels and make synapses
with specific target cells. Synapses were introduced in Chapter 11, where we
discussed channels and the electrical properties of membranes. Two main classes
of synapses are found in vertebrates: those made with muscle cells and those
made with other neurons. Synapse formation is best understood in the case of
the highly specialized connections between motor neurons and skeletal muscle
cells—so-called neuromuscular junctions (see Figure 11-38). During synapse
formation, the axonal growth cone differentiates into a nerve terminal that
contains synaptic vesicles filled with the neurotransmitter acetylcholine, while
acetylcholine receptors become clustered in the muscle cell plasma membrane at
the site of synapse formation. A synaptic cleft separates the pre- and postsynaptic
plasma membranes, and a thin sheet of basal lamina lies in this space between
them (Figure 21-82).
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Formation of the synapse involves two-way communication between the 
muscle cell and axonal growth cône: each of them, under the influence of the 
other, must reorganize the molécules on its side of the junction. The growth cône 
releases the signal protein Agrin, while the muscle expresses the Agrin receptor 
LRP4. Agrin binding to LRP4 stimulâtes association of LRP4 with MuSK, a recep¬ 
tor tyrosine kinase. LRP4 also serves as a signal in the reverse direction, from the 
muscle to the axon (Figure 21-83). During synapse formation, MuSK and LRP4 
cluster in the muscle cell plasma membrane in the general neighborhood of the 
future synapse. As the growth cône approaches, it recognizes LRP4, which stim¬ 
ulâtes the différentiation of presynaptic structures in the nerve cell. At the same 
time, Agrin released from the growth cône binds to LRP4 in the muscle cell; this 
activâtes MuSK, and promûtes a more focused clustering of acétylcholine recep- 
tors in the muscle cell membrane. Through these mechanisms, the reciprocal sig- 
naling of LRP4 from muscle to growth cône—and of Agrin from growth cône to 
muscle—induces the coordinated, localized différentiation of pre- and postsyn- 
aptic structures. 

Synapse formation between neurons in the CNS is far more challenging, both 
for the neurons and for the scientists trying to understand the molecular basis of 
its specificity, and it remains poorly understood. 


Figure 21-82 Formation of the neuromuscular junction. (A) The growth
cone of a motor axon approaches the muscle fiber. (B) Initial synapse formation
is characterized by the accumulation of synaptic vesicles at the axon terminal
and the formation of a specialized basal lamina in the synaptic cleft. (C) As the
neuromuscular junction matures, the synaptic cleft accumulates basal lamina
and extracellular matrix proteins, synaptic vesicles cluster at presynaptic release sites,
and neurotransmitter receptors cluster at postsynaptic sites. Schwann (glial) cells
accompany the motor axon and wrap around its terminus outside the region of
synaptic contact. (D) Transmission electron micrograph of the region of synaptic contact.
[D, courtesy of John Heuser, from J. Electron Microsc. 60 (Suppl 1), 2011. With permission
from Oxford University Press.]

Figure 21-83 Reciprocal signaling during neuromuscular synapse differentiation. (A) The Agrin receptor LRP4 and its co-receptor MuSK
cluster in the muscle cell membrane in the general neighborhood of the future synapse. (B) As the growth cone approaches, it recognizes LRP4,
which stimulates differentiation of presynaptic structures. Reciprocally, Agrin is released from the nerve terminal, binds to a complex of LRP4 and
MuSK in the muscle, and (C) promotes the further and more focused clustering of the LRP4 and acetylcholine receptors in the muscle cell. Although
the Agrin/MuSK/LRP4 machinery organizes the synapse, the process also depends on electrical signaling via the acetylcholine receptors. It is not
yet known how LRP4 signals to the motor axon.

Synaptic Pruning Depends on Electrical Activity and Synaptic
Signaling

The two-way exchange of signals between axon growth cones and muscle cells
controls the initial formation of neuromuscular junctions, but it is only the first
step in the establishment of the final pattern of the synaptic connections. Each
muscle cell at first receives synapses from several motor neurons, but in the end
it is left innervated by only one. This process of synapse elimination depends
on active synaptic communication and electrical activity. If synaptic
transmission is blocked by a toxin that binds to the acetylcholine receptors in the muscle
cell membrane, or if axonal electrical activity is blocked by a toxin that binds to
sodium channels in the axon plasma membrane, the muscle cell retains its
multiple innervation beyond the normal time of elimination.

The phenomenon of activity-dependent synapse elimination is encountered in
almost every part of the developing vertebrate nervous system (Figure 21-84). It
has a key role, for example, in the refinement of the retinotopic map discussed
earlier. Synapses are first formed in abundance and distributed over a broad target
field; then the system of connections is pruned back and remodeled by
competitive processes that depend on electrical activity and synaptic signaling. The
elimination of synapses in this way is distinct from the elimination of surplus neurons
by cell death, and it occurs after the period of normal neuronal death is over.
Synapse remodeling during neural development, however, involves more than just
synapse elimination; it also involves synapse reinforcement, as we discuss next.

Neurons That Fire Together Wire Together 

Throughout the nervous system, and throughout life, activity-dependent
elimination and reinforcement of synapses play a fundamental part in adjusting the
detailed anatomy of the neural network according to functional requirements. The
importance of these processes, and their underlying rules, emerged half a century
ago from a groundbreaking series of experiments on the developing visual system
of young mammals.

In the brain of most mammals, axons relaying visual inputs from the two eyes
are brought together in a specific neuronal layer in the visual region of the cerebral
cortex. Here, they form two overlapping maps of the external visual field, one as
perceived through the right eye, the other as perceived through the left. Although
there may be a tendency for right- and left-eye inputs to be segregated even
before synaptic communication begins, a large proportion of the axons carrying
information from the two eyes at early stages form synapses together on shared
target neurons in the visual cortex. A period of early electrical signaling activity,
however, occurring spontaneously and independently in each retina before birth,
leads to a remarkable pattern of ocular dominance columns in the visual cortex:
stripes of cells driven by inputs from the right eye alternating with stripes driven
by inputs from the left eye (Figure 21-85).

Figure 21-84 Synapse modification and its dependence on electrical
activity. [Diagram panels: (left) stimulate cell A while cell B is quiet: cell C gets
excited; (right) stimulate cells A and B simultaneously: cell C gets excited.]
Experiments in several systems indicate that synapses are strengthened or
weakened by electrical activity according to the rule shown in the diagram. The
underlying principle appears to be that each excitation of a target cell tends
to weaken any synapse where the presynaptic axon terminal has been quiet,
but to strengthen any synapse where the presynaptic axon terminal has just
been active. As a result, any synapse that is repeatedly weakened and rarely
strengthened is eventually eliminated altogether.

Figure 21-85 Ocular dominance columns in the visual cortex of a monkey's brain, and their
sensitivity to visual experience. (A) Normally, stripes of cortical cells driven by the right eye
alternate with stripes, of equal width, driven by the left eye. The stripes, set up before birth, are
revealed here by injecting a radioactive tracer molecule into one eye, allowing time for this tracer to
be transported to the visual cortex, and detecting radioactivity there by autoradiography, in sections
cut parallel to the cortical surface. (B) If one eye is kept covered after birth, during the sensitive
period of development, and thus deprived of visual experience, its stripes shrink and those of the
active eye expand. In this way, the deprived eye may lose the power of vision almost entirely. (From
D.H. Hubel, T.N. Wiesel and S. LeVay, Philos. Trans. R. Soc. Lond. B Biol. Sci. 278:377-409, 1977.
With permission from The Royal Society.)

The basis for these phenomena became clear from ingenious experiments
interfering artificially with visual experience and altering the coordination of
electrical signaling in the two eyes. These studies, and many others subsequently,
have highlighted a simple but profoundly important principle that seems to
govern synapse reinforcement and elimination throughout the nervous system.
When two (or more) neurons synapsing on the same target cell fire at the same
time, they reinforce their connections to that cell; when they fire at different times,
they compete, so that all but one of them tend to be eliminated. This firing rule is
expressed in the catchphrase "neurons that fire together wire together."

The firing rule provides a simple interpretation of the developmental
phenomenon we have just described in the mammalian visual system. A pair of axons
bringing information from neighboring sites in the left eye will frequently fire
together, and therefore wire together, as will a pair of axons from neighboring sites
in the right eye; but a right-eye axon and a left-eye axon will rarely fire together,
and will instead compete. Indeed, if activity from both eyes is silenced using
toxins that block axonal electrical activity or synaptic signaling, as described above,
the inputs fail to segregate correctly.
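
The firing rule can be caricatured in a few lines of code. The sketch below is only an illustration of the rule as stated in the text and in Figure 21-84, not a model taken from the experiments: the integer synapse strengths, the firing threshold, the step size, and the names `apply_firing_rule`, `simulate`, `left_1`, `left_2`, and `right_1` are all assumptions made for the example. Two left-eye axons fire together on even time steps and one right-eye axon fires alone on odd time steps; whenever the target cell is excited, active synapses are strengthened and quiet ones are weakened.

```python
def apply_firing_rule(weights, active, threshold=10, step=1, max_weight=20):
    """Apply one time step of the firing rule to a single target cell.

    weights: dict mapping axon name -> integer synapse strength (assumed units)
    active:  set of axon names that fire during this time step
    The target cell is excited only if the summed strength of the active
    synapses reaches `threshold` (an assumed, illustrative value).
    """
    drive = sum(w for name, w in weights.items() if name in active)
    if drive < threshold:
        return dict(weights)  # target cell stays quiet: no synapse changes
    updated = {}
    for name, w in weights.items():
        if w == 0:
            updated[name] = 0  # an eliminated synapse does not return
        elif name in active:
            updated[name] = min(max_weight, w + step)  # just active: strengthen
        else:
            updated[name] = max(0, w - step)  # quiet: weaken
    return updated


def simulate(rounds=40):
    """Two left-eye axons fire together; one right-eye axon fires out of step."""
    weights = {"left_1": 8, "left_2": 8, "right_1": 8}
    for t in range(rounds):
        active = {"left_1", "left_2"} if t % 2 == 0 else {"right_1"}
        weights = apply_firing_rule(weights, active)
    return weights


if __name__ == "__main__":
    print(simulate())
```

With these assumed starting values, the two correlated inputs together excite the target and reinforce each other, whereas the lone out-of-step input never excites the target by itself, is weakened each time the others do, and is eventually eliminated. Real synapses are of course governed by far richer dynamics than this toy bookkeeping.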

The segregation of inputs from the two eyes is only the first of a series of
activity-dependent adjustments of visual connections, whose maintenance is
extraordinarily sensitive to experience early in life. If, during a certain sensitive period
(ending at about 5 years of age in humans), one eye is kept covered for a time so
as to deprive it of visual stimulation, while the other eye is allowed normal
stimulation, the deprived eye loses its synaptic connections to the cortex and becomes
almost entirely, and irreversibly, blind. In accordance with what the firing rule
would predict, a competition has occurred in which synapses in the visual cortex
made by inactive axons are eliminated while synapses made by active axons are
consolidated. In this way, cortical territory is allocated to axons that carry
information and is not wasted on those that are silent.

Activity-dependent synaptic changes are not confined to early life. They
also occur in the adult brain, where many synapses show both functional and
morphological alterations with use. This synaptic plasticity is thought to have a
fundamental role in learning and memory. Clearly, for the nervous system as for
other parts of the body, developmental processes do not end at birth, as we
discuss in the next chapter.


Summary

The development of the nervous system proceeds in four phases. First, neurons and
glial cells are generated from dividing neural progenitor cells. Then, the newborn
neurons send out axons and dendrites toward their targets. Next, they make
synaptic connections with appropriate target cells so that communication can begin.
Finally, excessive neurons are eliminated by normal neuronal cell death, after
which the system of synaptic connections is refined and remodeled according to the
pattern of electrical and synaptic activity in the neural network.

Neurons born at different times and places are specialized to express different
sets of genes, and they have a cell memory that plays a major role in
determining the connections they will form. Their specialization depends not only on
spatial patterning by morphogens but also on intrinsic developmental programs that
unfold as the neural progenitors proliferate. Axons and dendrites grow out from
the neurons by means of growth cones, which follow specific pathways delineated
by attractive and repellent signals along the way, including cell-surface and
extracellular matrix molecules and soluble signal proteins to which growth cones from
different classes of neurons respond differently. In many parts of the nervous system,
neural maps are set up: orderly projections of one array of neurons onto another.
In the retinotopic system, the map is based on the matching of complementary
systems of position-specific cell-surface markers (ephrins and Eph receptors)
possessed by the two sets of cells. Other cell-surface molecules such as DSCAM proteins
in Drosophila and protocadherins in vertebrates mediate self-avoidance between
the branches arising from a single neuron, helping the cell spread out its processes.

The formation of synapses involves back-and-forth signaling between target
cells and the growth cone. After the growth cones have reached their targets and
initial connections have formed, individual synapses are eliminated in some places
and reinforced in others by mechanisms that depend on synaptic and electrical
activity. These mechanisms adjust the architecture of the neural network according
to the way in which it is used.


WHAT WE DON'T KNOW

• What regulates the pace of
development? Why does a mouse
embryo develop faster than a human
embryo, for example?

• What are the mechanisms that
allow cell memory to be stored during
development, explaining how each
cell's history determines its future
behavior?

• How do signals move through
tissues? What are the roles of the
extracellular matrix and of elongated
cell projections?

• How does a cell know exactly where
it is in a multicellular organism? How
does it know that its neighbors are the
correct ones and that, if not, it should
move or kill itself?

• How do cells respond to tiny
gradients of molecules in their
environment, as required for knowing
their positions? How are morphogen
gradients reliably interpreted?

• What are the genetic changes that
allow the repurposing of existing body
parts during evolution? For example,
how did bat wings evolve from arms?

• How do cells use genetic
instructions to form the shape of
something as complex as the human
nose?


PROBLEMS 

Which statements are true? Explain why or why not. 

21-1 In the early cleavage stages, when the embryo 
cannot yet feed, the developmental program is driven and 
controlled entirely by the material deposited in the egg by 
the mother. 

21-2 Because of the many later developmental
transformations that produce the elaborately structured organs,
the body plan set up during gastrulation bears little
resemblance to the body plan in the adult.

21-3 As development progresses, individual cells
become more and more restricted in the range of cell types
they can give rise to.

21-4 At different stages of embryonic development,
the same signals are used over and over again by different
cells, but with different biological outcomes.

21-5 Changes in the coding regions of genes involved in
development are primarily responsible for the differences
between species.

21-6 The cell cycle is the ticking clock that sets the
tempo of developmental processes, with maturational
changes in gene expression being dependent on cell-cycle
progression.

Discuss the following problems. 

21-7 Name the four processes that are fundamental to
animal development, and describe each of them in a single
sentence.

21-8 What are the three germ layers formed during
gastrulation, and what are the principal structures each gives
rise to in the adult?

21-9 In the early Drosophila embryo, there seems to be
no requirement for the usual forms of cell-cell signaling;
instead, transcriptional regulators and mRNA molecules
move freely between nuclei. How can that be?

21-10 Morphogens play a key role in development,
creating concentration gradients that inform cells of where
they are and how to behave. Examine the simple patterns
represented by the flags in Figure Q21-1. Which do you
suppose could be created by a gradient of a single
morphogen? Which would require gradients of two
morphogens? Assuming that such patterns were present in a sheet
of cells, explain how they could be created by morphogens.

Figure Q21-1 National flags from three countries (Problem 21-10).
[Flag images not reproduced; the one legible label reads "Norway."]


21-11 Two adjacent cells in the nematode worm
normally differentiate into an anchor cell (AC) and a ventral
uterine precursor (VU) cell, but which of the two becomes
the AC and which becomes the VU cell is completely
random: the cells have an equal chance of adopting either
fate, but they always adopt different fates. Mutations of
Lin12 alter these fates. In hyperactive Lin12 mutants, both
cells become VU cells, while in inactive Lin12 mutants,
both cells become ACs. Thus, Lin12 is central to the
decision-making process. In genetic mosaics in which one
precursor cell has the hyperactive Lin12 and the other
precursor has the inactive Lin12, the cell with the hyperactive
Lin12 always becomes the VU cell and the cell with
inactive Lin12 always becomes the AC. Assuming that one cell
sends a signal and the other cell receives it, explain how
these results suggest that Lin12 encodes a protein required
to receive the signal. Offer a suggestion for how the fates
of these two precursor cells are normally decided in
wild-type worms.

21-12 It was clear from the early days of studying
development that certain "morphogenetic" substances were
present in the egg and segregated asymmetrically into
cells of the developing embryo. One such investigation
in ascidian (sea squirt) embryos examined endodermal
alkaline phosphatase, which could be visualized by a
histochemical stain. Treatment of embryos with cytochalasin
B stopped cell division, but did not block expression of
alkaline phosphatase at the appropriate time. Treatment
with actinomycin D, which blocks transcription, did not
interfere with expression of alkaline phosphatase.
Treatment with puromycin, which blocks translation,
eliminated expression of alkaline phosphatase. What is the
likely nature of the morphogenetic substance that gives
rise to alkaline phosphatase?


21-13 The mouse HoxA3 and HoxD3 genes are paralogs
that occupy equivalent positions in their respective Hox
gene clusters and share roughly 50% identity in their
protein-coding sequences. Mice with defects in HoxA3 have
deficiencies in pharyngeal tissues, whereas mice with
defects in HoxD3 have deficiencies in the axial skeleton,
suggesting quite different functions for the paralogs. Thus,
it came as a surprise when it was found that replacing a
defective HoxD3 gene with the normal HoxA3 gene
corrected the deficiency, as did the reciprocal experiment
of replacing a mutant HoxA3 gene with a normal HoxD3
gene. Neither transplaced gene, however, could supply
its normal function; that is, a normal HoxA3 gene at the
HoxD3 locus could not correct the deficiency caused by
a mutant HoxA3 gene at the HoxA3 locus. The same was
true for the HoxD3 gene. If the HoxA3 and HoxD3 genes
are equivalent, how do you suppose they can play such
distinct roles in development? Why do you suppose they
cannot perform their normal function in a new location?

21-14 The segmentation of somites in vertebrate embryos
is thought to depend on oscillations in the expression of the
Hes7 gene. Mathematical modeling explains these
oscillations in terms of the delays in production of the unstable
Hes7 protein, which acts as a transcription regulator to
shut off its own expression. Once Hes7 decays, with a
half-life of about 20 minutes, its transcription resumes. To test
this model, you decide to reduce the total delay by
removing one, two, or all three of the introns from the Hes7 gene
in mice. Why do you expect that intron removal would
reduce the delay? What would you predict would happen
to the oscillation time, and somite formation, if the model
were correct?

21-15 The oscillatory clock that drives somite
formation in vertebrates involves three essential components:
Her7 (an unstable repressor of its own synthesis), Delta (a
transmembrane signaling molecule), and Notch (a
transmembrane receptor for Delta). Notch is bound by Delta on
neighboring cells, activating the Notch signaling pathway,
which then activates Her7 transcription. Normally, this
system works flawlessly to create sharply defined somites
(Figure Q21-2A). In the absence of Delta, however, only
the first five somites form normally, and the rest are poorly
defined (Figure Q21-2B). If a pulse of Delta is supplied
later, somite formation returns to normal in the regions
where Delta was present (Figure Q21-2C). A diagram of
the connections between the components of the clock
and how they interact in adjacent cells is shown in Figure
Q21-2D. In the absence of Delta, why do the cells become
unsynchronized? What is it about the presence of Delta
that keeps adjacent cells oscillating in synchrony?

Figure Q21-2 Somite formation in zebrafish embryos (Problem 21-15).
(A) Wild-type embryos with normal somites. (B) Somite formation in
embryos lacking Delta. The bracket indicates normal-looking somites
where they initially form. (C) Somite formation in embryos lacking Delta,
but receiving a pulse of Delta expression at the time indicated by the
right-hand bracket. (D) Interactions among components of the oscillatory
clock in adjacent cells. (Adapted from C. Soza-Ried et al., Development
141:1780-1788, 2014. With permission from The Company of
Biologists.)

21-16 The extracellular protein factor Decapentaplegic
(Dpp) is critical for proper wing development in
Drosophila (Figure Q21-3A). It is normally expressed in a narrow
stripe in the middle of the wing, along the
anterior-posterior boundary. Flies that are defective for Dpp form
stunted "wings" (Figure Q21-3B). If an additional copy
of the gene is placed under control of a promoter that is
active in the anterior part of the wing, or in the posterior
part of the wing, a large mass of wing tissue composed of
normal-looking cells is produced at the site of Dpp
expression (Figure Q21-3C and D). Does Dpp stimulate cell
division, cell growth, or both? How can you tell?

Figure Q21-3 Effects of Dpp expression on wing development in
Drosophila (Problem 21-16). (A) Normal Dpp expression. (B) Absence
of Dpp expression. (C) Additional anterior Dpp expression.
(D) Additional posterior Dpp expression. (From M. Zecca, K. Basler
and G. Struhl, Development 121:2265-2278, 1995. With permission
from The Company of Biologists.)

21-17 The highly branched structures of neurons would
seem to make it almost inevitable that they should make
unproductive synapses with themselves, yet they manage
to avoid this outcome very effectively. How is this
accomplished in vertebrates?


22


Cells evolved originally as free-living individuals, and such cells still dominate the
Earth and its oceans. But the cells that matter most to us, as human beings, are
specialized members of a multicellular community. These cells have lost features
needed for independent survival and acquired peculiarities that serve the needs
of the body as a whole. Although they share the same genome, they are
spectacularly diverse in structure, chemistry, and behavior. There are more than 200
different named cell types in the human body that collaborate with one another to form
many different tissues, arranged into organs performing widely varied functions.
To understand them, it is not enough to analyze cells in a culture dish: we need
also to know how they live, work, and die in their natural habitat, the intact body.

In Chapters 7 and 21, we saw how the various cell types become different in
the embryo and how cell memory and signals from their neighbors enable them
to remain different thereafter. In Chapter 19, we discussed the technology used
to build multicellular tissues: the devices that bind cells together and the
extracellular materials that give them support. But the adult body is not static: it is a
structure in dynamic equilibrium, where new cells are continually being born,
differentiating, and dying. Homeostatic mechanisms maintain a proper balance,
so that the tissue architecture is preserved despite the constant replacement of
old cells by new. In this chapter, we focus on these developmental processes that
continue throughout life. In doing so, we shall illustrate some of the diversity of
specialized cell types and see how they work together to perform their tasks.

We shall examine in particular the role played in many tissues by stem cells:
cells that are specialized to provide a fresh supply of differentiated cells where
these need to be continually replaced, or when they are required in great
number for purposes of repair and regeneration. We shall see that while many tissues
renew and repair themselves, some others do not; there, lost cells are lost forever,
causing deafness, blindness, dementia, and other ills.

In the final section of the chapter, we discuss how stem cells can be
generated and manipulated artificially, and we confront the practical question that
underlies the current storm of interest in stem-cell technology: How can we use
our understanding of the processes of cell differentiation and tissue renewal to
improve upon nature, and make good those injuries and failings of the human
body that have hitherto seemed to be beyond repair?

STEM CELLS AND RENEWAL IN EPITHELIAL TISSUES 

Among all the self-renewing tissues in a mammal, the champion, for speed at
least, is the lining of the small intestine: the long, convoluted portion of the gut
tube that is chiefly responsible for absorption of nutrients from the gut lumen. To
introduce stem cells, we take the small intestine as our starting point, not only
because it renews itself at a greater rate than any other tissue in the body, but also
because the molecular mechanisms that control its organization are particularly
well understood. It thereby provides a beautiful illustration of the principles of
stem-cell systems that have broad applicability.

The Lining of the Small Intestine Is Continually Renewed Through
Cell Proliferation in the Crypts

The lining of the small intestine (and of most other regions of the gut) is a
single-layered epithelium, only one cell thick. This epithelium covers the surfaces
of the villi that project into the lumen, and it lines the crypts that descend into
the underlying connective tissue (Figure 22-1). Dividing cells are restricted to
the crypts, and differentiated cells, no longer dividing, pour out of the crypts in a
steady stream onto the villi. There are four main types of nondividing
differentiated cells, one absorptive and three secretory (Figure 22-2):

1. Absorptive cells (also called brush-border cells or enterocytes) have densely
packed microvilli on their exposed surfaces. Their job is to take up
nutrients from the gut lumen. To this end, they also produce hydrolytic enzymes
that perform some of the final steps of extracellular digestion. They are the
majority cell type in the epithelium.

2. Goblet cells secrete mucus into the gut lumen that covers the epithelium
with a protective coat.

3. Paneth cells form part of the innate immune defense system (discussed in
Chapter 24) and secrete proteins that kill bacteria.

4. Enteroendocrine cells, of more than 15 different subtypes, secrete serotonin
and peptide hormones that act on neurons and other cell types in the gut
wall and regulate the growth, proliferation, and digestive activities of cells
of the gut and other tissues.

As if on a conveyor belt, the absorptive, goblet, and enteroendocrine cells
travel mainly upward from their site of birth in the crypt, by a sliding movement in
the plane of the epithelial sheet, to cover the surfaces of the villi. Within 3-5 days
(in the mouse) after emerging from the crypts, the cells reach the tips of the villi,
where they undergo apoptosis and are finally discarded into the gut lumen (see
Movie 20.6). The Paneth cells in the crypts are produced in much smaller
numbers and have a different migration pattern. They live at the bottom of the crypts,
where they too are continually replaced, although not so rapidly, persisting for
several weeks (in the mouse) before undergoing apoptosis and being
phagocytosed by their neighbors.

Figure 22-1 Renewal of the gut lining.
(A) The pattern of cell turnover and proliferation in the epithelium that forms the
lining of the small intestine. Stem cells (red) lie at the crypt base, interspersed among
nondividing differentiated cells (Paneth cells). Progeny of the stem cells move
mainly upward from the crypts onto the villi; after a few quick divisions, they cease
dividing and differentiate, some of them while still in the crypt, most of them as they
emerge from the crypt. The Paneth cells, like the other nondividing differentiated
cells, are continually replaced by progeny of the stem cells, but they migrate
downward to the crypt base and survive there for many weeks. (B) Photograph
of a section of part of the lining of the small intestine, showing the crypts and
villi. Note the mixture of differentiated cell types, all generated from the stem cells;
these are primarily absorptive cells, with mucus-secreting goblet cells (stained red)
interspersed among them. Enteroendocrine cells (not labeled) are less numerous and
less easy to identify without special stains.

The central problem is to understand the processes in the crypt that generate
a continual supply of all these nondividing, terminally differentiated cell types.

Stem Cells of the Small Intestine Lie at or Near the Base of Each 
Crypt 

The general pattern of cell proliferation and migration in the gut lining is revealed
by a simple labeling method that uses injected pulses of tritiated (radioactive)
thymidine or of a thymidine analog that can be detected in tissue sections. Cells
that are in S phase of the division cycle incorporate the marker molecule into their
DNA, and their fate can then be followed over subsequent hours and days. If a
cell divides after incorporation of the label, the label becomes diluted, halving
with each cell cycle. This can be quantified. Experiments based on this labeling
method confirm, first of all, that dividing cells are confined to the crypts and that
the differentiated cell types listed above do not divide. Second, the most rapidly
dividing cells, with a cycle time of about 12 hours in the mouse, are shown to lie in
the middle and upper parts of the crypt, and these cells are all fated to differentiate
and stop dividing (see Figure 22-1A). Just above the base of the crypt, interspersed
among the Paneth cells, lie cells that divide more slowly. These are the stem cells,
which feed some of their progeny into the higher levels of the crypt destined for
differentiation, while other progeny remain at the crypt base to continue the
whole process. The rapidly dividing cells above these stem cells are derived from
them, but already committed to differentiation. These cells are called committed
precursors or transit amplifying cells, since their divisions serve to amplify the
number of differentiated cells that ultimately result from each stem-cell division.
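
Two pieces of arithmetic in this passage are easy to make explicit: the halving of the label with each division, and the amplification produced by transit divisions. The following is a minimal sketch under idealized assumptions (every division splits the label exactly in half, and every transit amplifying cell divides the same number of times); the function names are hypothetical and chosen only for this illustration.

```python
import math


def label_per_cell(initial_label, divisions):
    """Label remaining per cell after a number of divisions.

    Assumes the label is split evenly between the two daughters at each
    division, so the amount per cell halves with each cell cycle.
    """
    return initial_label / 2 ** divisions


def divisions_since_pulse(initial_label, observed_label):
    """Infer how many divisions have occurred from the observed dilution."""
    return math.log2(initial_label / observed_label)


def cells_per_committed_daughter(transit_divisions):
    """Differentiated cells produced from one committed daughter cell.

    Assumes every descendant divides exactly `transit_divisions` times
    before it stops dividing (an idealization for illustration).
    """
    return 2 ** transit_divisions


if __name__ == "__main__":
    # A cell cycling every 12 hours divides 4 times in 48 hours.
    print(label_per_cell(1.0, 48 // 12))
    print(divisions_since_pulse(1.0, 0.0625))
    print(cells_per_committed_daughter(5))
```

For example, a cell with the 12-hour cycle time quoted for the mouse would divide four times in 48 hours, leaving each descendant with one-sixteenth of the original label, whereas a slowly dividing cell would retain much more of it over the same period.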

The Two Daughters of a Stem Cell Face a Choice 

Stem cells have a critical role in a variety of tissues, and it is useful to list their
defining properties:

1. A stem cell is not itself terminally differentiated: that is, it is not at the end
of a pathway of differentiation.

2. It can divide without limit (or at least for the lifetime of the animal).

3. When it divides, each daughter has a choice: it can either remain a stem
cell, or it can embark on a course that commits it to terminal differentiation
(Figure 22-3).


Figure 22-2 The four main differentiated
cell types found in the epithelial lining of
the small intestine. All cells are oriented
with the gut lumen at top. Broad orange
arrows indicate direction of secretion
or uptake of materials for each type of
cell. All of these cells are generated from
undifferentiated multipotent stem cells
living near the bottoms of the crypts (see
Figure 22-1). Absorptive (brush-border)
cells outnumber the other cell types in the
epithelium by about 10:1 or more. The
microvilli on their apical surface provide
a 30-fold increase of surface area, not
only for the import of nutrients but also for
the anchorage of enzymes that perform
the final stages of extracellular digestion,
breaking down small peptides and
disaccharides into monomers that can be
transported across the cell membrane.
Goblet cells secrete mucus; these are
the commonest of the secretory cell
types. Paneth cells secrete (along with
some growth factors) cryptdins, proteins
of the defensin family that kill bacteria.
Different subtypes of enteroendocrine cells
secrete serotonin and peptide hormones
into the gut wall (and thence the blood).
Cholecystokinin is a hormone released
from enteroendocrine cells in response
to the presence of nutrients in the gut. It
binds to receptors on nearby sensory nerve
endings, which relay a signal to the brain
to stop the feeling of hunger once one has
eaten enough. (After T.L. Lentz, Cell Fine
Structure. Philadelphia: Saunders, 1971;
R. Krstic, Illustrated Encyclopedia
of Human Histology. Berlin:
Springer-Verlag, 1984.)
Stem cells are required wherever there is a recurring need to replace
differentiated cells that cannot themselves divide. Although a stem cell must be able
to divide, it does not necessarily have to divide rapidly; in fact, many stem cells
divide at a relatively slow rate.

Stem cells are of many types, specialized for the genesis of different classes
of terminally differentiated cells: intestinal stem cells for intestinal epithelium,
epidermal stem cells for epidermis, hematopoietic stem cells for blood, and so on.
Each stem-cell system nevertheless raises similar fundamental questions. What
are the distinguishing features of the stem cell in molecular terms? What
conditions serve to keep the stem cell in its proper place and to maintain its stem-cell
character? What decides whether a given daughter cell commits to differentiation
or remains a stem cell? In a tissue where several distinct types of differentiated
cells must be produced, are they all derived from a single type of stem cell, or is
there a distinct type of stem cell for each one?

Wnt Signaling Maintains the Gut Stem-Cell Compartment

For the gut, the beginnings of an answer to these questions came from studies of
cancer of the colon and rectum (the lower end of the gut, also known as the large
intestine). Some people have a hereditary predisposition to colorectal cancer and,
in advance of the invasive disease, develop large numbers of small precancerous
tumors (adenomas) in the lining of this part of the gut (Figure 22-4). The
appearance of these tumors suggests that they have arisen from intestinal crypt cells that
have failed to halt their proliferation in the normal way. As discussed in Chapter
20, the cause has been traced to mutations in the Apc (adenomatous polyposis
coli) gene: the tumors arise from cells that have lost both gene copies. Because
Apc codes for a protein that prevents inappropriate activation of the Wnt signaling
pathway (see Figure 15-60), this loss of Apc is presumed to mimic the effect of
continual exposure to a Wnt signal. The suggestion, therefore, is that Wnt
signaling normally keeps crypt cells in a proliferative state, and that a cessation of
exposure to Wnt signaling normally makes them stop dividing as they leave the crypt.

Stem Cells at the Crypt Base Are Multipotent, Giving Rise to the 
Full Range of Differentiated Intestinal Cell Types 

It has long been suspected that all the differentiated cell types in the lining of the
intestine derive from a single type of stem cell. But firm proof was lacking, and the
precise nature and location of the stem cells were disputed.

To solve the problem, and indeed to understand the organization of any
stem-cell system, we need to discover how its cells are related to one another: who
Figure 22-3 The definition of a stem cell.
Each daughter produced when a stem cell
divides can either remain a stem cell or
go on to become terminally differentiated.
In many cases, the daughter that opts for
terminal differentiation undergoes additional
cell divisions before terminal differentiation
is completed; such cells are called transit
amplifying cells.


Figure 22-4 An adenoma in the human
colon, compared with normal tissue
from an adjacent region of the same
person's colon. The specimen is from a
patient with an inherited mutation in one of
his two copies of the Apc gene. A mutation
in the other Apc gene copy, occurring in
a colon epithelial cell during adult life, has
given rise to a clone of cells that behave
as though the Wnt signaling pathway is
permanently activated. As a result, the
cells of this clone form an adenoma, an
enormous, steadily expanding mass of
giant cryptlike structures.
Figure 22-5 Clonal analysis using a genetic marker. A modern method for tracking cell lineage
uses transgenic animals containing two transgenes, which together drive expression of a readily
detected and heritable marker protein in a small subset of stem cells. The first transgene (top)
carries two adjacent protein-coding sequences, GFP and CreERT2, both expressed under the
control of the Lgr5 promoter that is active only in stem cells and not in their differentiated progeny.
GFP encodes green fluorescent protein (see Chapter 9), which is used here simply to confirm
expression in the entire stem-cell population. The CreERT2 gene encodes a chimeric form of the
Cre recombinase called CreERT, which consists of Cre recombinase linked to the estrogen receptor
protein; this enzyme becomes active as a recombinase only when it binds the artificial estrogen
analog tamoxifen.

The second transgene (bottom) carries a marker gene, LacZ, under the control of a promoter
that is active in all cells. The LacZ gene encodes β-galactosidase, an enzyme that can be detected
histochemically in tissues (see Figure 7-28). However, LacZ expression in the transgene shown
here is prevented by a blocking sequence (red) that is flanked by LoxP sites (pink; see Figure
5-66). When tamoxifen is provided, CreERT becomes active, leading to a recombination event
that removes the blocking DNA sequence (and leaves one LoxP site behind). As a result, the LacZ
marker is expressed. Because this change is heritable, the marker continues to be expressed in
all cells descended from those in which a recombination event has occurred. With a low dose of
the inducer molecule tamoxifen, it is possible to activate the marker at random in just a few widely
spaced cells, which, in the course of time, give rise to widely separated and easily distinguished
clones of progeny (see Figure 22-6).


is descended from whom, or, equivalently, what progeny will be produced from
any given cell. This can best be done using a heritable marker that can be
activated in an individual cell, thus allowing the identification of the clone of progeny
descended from that cell. A modern method uses transgenic animals to create
a visible genetic mark in just a few widely spaced cells, which, in the course of
time, give rise to widely separated and easily distinguished clones of progeny, as
explained in Figure 22-5.

A search among genes that are strongly upregulated in response to Wnt
signaling revealed one, called Lgr5, that is expressed in gut stem cells specifically.
The technique described in Figure 22-5 can be used to create a genetic mark in a
random subset of Lgr5-expressing cells, a mark that is inherited by the progeny
of each cell. These Lgr5 cells divide with a cycle time of about 24 hours, and within
a few days marked clones are seen extending from the crypt bases up along the
sides of the villi. After as long as 60 days or more, many of these clones still persist,
retaining one or more members at the crypt base and extending all the way up to
the tips of the villi (Figure 22-6). Moreover, each single clone typically contains
all the major differentiated gut cell types (absorptive, goblet, Paneth, and
enteroendocrine) in their normal proportions. The Lgr5-expressing cells, therefore, are
true stem cells that are multipotent, that is, able to generate a diverse set of
differentiated cell types.
The Two Daughters of a Stem Cell Do Not Always Have to
Become Different

If the number of stem cells in a crypt is to remain stable, each stem-cell division
must on average generate one daughter that remains a stem cell and one that
becomes committed to differentiation. In principle, this could be achieved in at
least two ways (Figure 22-7).

One mechanism, the simplest at first sight, would be through asymmetric
division: processes internal to the dividing stem cell could distribute regulatory
factors asymmetrically to its two daughters, as occurs in Drosophila neuroblast
divisions (see Figure 21-36). The factors inherited by one daughter would cause it
to remain a stem cell, while those inherited by the other would drive it toward
differentiation. This strategy would guarantee that the original stem cell would give
rise to precisely one stem cell in every subsequent cell generation.

An alternative strategy would be based on a choice that each daughter makes
independently of its sister: in normal circumstances, each would have a 50%
probability of remaining as a stem cell and a 50% probability of commitment to
differentiation. Sometimes the two daughters of a stem cell would thus have opposite
fates, sometimes the same. The choice that each cell makes might either be
stochastic, like the flip of a coin, or governed by the environment in which the cell
finds itself. A strategy of independent choices is more flexible than that of strict
asymmetric division. In particular, environmental factors can control the balance
of probabilities, adjusting them in favor of the stem-cell option where more stem
cells are needed, as they often are, either for growth or for damage repair.

Clonal analysis gives a way to distinguish between the two strategies, since
they give quite different predictions as to the expected number of clones of
different sizes produced from individual stem cells (see Figure 22-7). For the gut, the
findings seem clear: the independent-choice theory fits the observations, and the
asymmetric-division theory does not.
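
The difference between the two predictions can be made concrete with a small
simulation. The Python sketch below is a toy model, not the analysis used in the
experiments: it counts only the stem cells remaining in a clone, treats divisions
as synchronous generations, fixes the probability of remaining a stem cell at 50%
for every daughter, and ignores any limit imposed by the niche. All function and
parameter names are hypothetical.

```python
import random

def asymmetric_stem_cells_after(generations):
    """Strict asymmetric division: every division yields exactly one stem cell,
    so a clone founded by one stem cell always contains exactly one."""
    return 1

def independent_stem_cells_after(generations, p_stem=0.5, rng=None):
    """Independent choice: each daughter of each stem cell remains a stem cell
    with probability p_stem, regardless of what its sister does."""
    if rng is None:
        rng = random.Random()
    stem = 1
    for _ in range(generations):
        daughters = 2 * stem
        stem = sum(1 for _ in range(daughters) if rng.random() < p_stem)
        if stem == 0:  # the clone has lost all its stem cells
            break
    return stem

def extinction_fraction(generations, trials=10000, seed=0):
    """Fraction of simulated clones left with no stem cells."""
    rng = random.Random(seed)
    lost = sum(
        independent_stem_cells_after(generations, rng=rng) == 0
        for _ in range(trials)
    )
    return lost / trials
```

In this model roughly a quarter of clones lose their stem cells at the first
division, and the fraction grows with each further generation, while surviving
clones vary widely in stem-cell number. Under strict asymmetric division no clone
is ever lost and none ever gains a second stem cell. It is this difference in the
spread of clone outcomes that clonal analysis can detect.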


Figure 22-6 Lgr5-expressing stem cells
and their progeny in the small intestine.
The method shown in Figure 22-5 was
used here to mark single intestinal stem
cells and trace the fates of their progeny.
The Lgr5 gene encodes a member of the
family of G-protein-linked transmembrane
receptors, and it is expressed specifically
in stem cells near the crypt base. Because
the Lgr5 promoter was used to drive
expression of CreERT2, treatment with
a low dose of tamoxifen resulted in
occasional stem cells expressing LacZ.
These cells and all of their progeny could
subsequently be detected with a blue
histochemical stain. All of the blue cells in
these images derive from a single
Lgr5-expressing stem cell. After 60 days, the
blue progeny of this cell are seen to extend
all the way up a villus. These progeny
can be shown to include all types of
differentiated cells, as well as persistent
Lgr5-expressing cells at the crypt base.
This proves that Lgr5-expressing cells are
multipotent stem cells. (From N. Barker et
al., Nature 449:1003-1007, 2007. With
permission from Macmillan Publishers Ltd.)


Paneth Cells Create the Stem-Cell Niche 

There are about 15 Lgr5-expressing stem cells in each crypt. They are slim and
columnar, and they sit at the crypt base interspersed among the Paneth cells (see
Figure 22-6). This is the intestinal stem-cell niche: the Paneth cells generate
signals, including a strong Wnt signal, that act over a short range to maintain the
stem-cell state. Signal proteins from the connective tissue surrounding the crypt
base help to reinforce the localizing signal from the Paneth cells; Lgr5 itself is a
receptor for one of these proteins, called R-spondin.

In the intestine, it seems that the niche created by the Paneth cells has space 
for only a limited number of stem cells, and when these divide, it is a random 
Figure 22-7 Two ways for a stem cell to produce daughters with different fates: asymmetric division and independent
choice. (A) The asymmetric-division strategy gives a clone consisting of precisely one stem cell plus a steadily increasing
number of differentiating cells, in proportion to the number of cell divisions. (B) The independent-choice strategy is more variable
in its outcome. With a choice made at random by each daughter and with a 50% probability for each one to remain a stem cell
or differentiate, there is, for example, a 25% chance at the first division that both daughters will differentiate, so that the clone
eventually goes extinct. Or, at this division or later, a preponderance of daughters may chance to retain stem-cell character,
creating a clone that persists and increases in size. With the help of some mathematics, the probability distribution of clone
sizes generated from a single stem cell at any given time can be predicted on this stochastic assumption. The observations in
the gut and elsewhere fit the stochastic independent-choice strategy, but not the asymmetric-division strategy.


matter which of them are pushed out of the nest and condemned to
differentiation and which stay in place as stem cells for the future. In most other stem-cell
systems where the question has been examined, it appears that the fates of the
daughters of a stem cell are assigned in a similar way, independently and subject
to influence from the cells' environment.

A Single Lgr5-expressing Cell in Culture Can Generate an Entire 
Organized Crypt-Villus System 

The Paneth cells themselves are progeny of the stem cells, suggesting that the
intestinal stem-cell system is in some way self-maintaining and self-organizing.
This is demonstrated in a striking way by taking single dissociated Lgr5-expressing
cells and allowing them to proliferate in culture, embedded in a cell-free matrix
rich in the basal-lamina component laminin (mimicking basal lamina). The cells
proliferate, forming at first small, round epithelial vesicles. Within a few days,
however, one or another of the cells in the vesicle, at random, begins to
differentiate as a Paneth cell. This induces its neighbors to behave as stem cells and
initiates transformation of the simple vesicle into an organized structure, or organoid
(Figure 22-8A,B). Protrusions resembling crypts grow out into the surrounding
matrix and contain Paneth cells, Lgr5-expressing stem cells, and the transit
amplifying cells derived from them; these cell types are confined to the cryptlike
structures. Terminally differentiated, nondividing absorptive cells line the other parts
of the organoid epithelium, with their microvilli facing the lumen. Goblet and
enteroendocrine cells are also present, scattered through the epithelium, and the
whole "minigut" structure, with all its cell types, grows and renews itself in much
the same way as the lining of the normal intestine.
Figure 22-8 Genesis of a minigut from a single Lgr5-expressing cell cultured in a cell-free
matrix. (A,B) The founder cell first divides to form a small vesicle. At random, one or more of the
cells in this vesicle differentiates as a Paneth cell (blue). This cell maintains Lgr5 expression (yellow)
in its immediate neighbors, which persist as stem cells that generate the full range of intestinal cell
types. (C) Schematic diagram of the key organizing signals. The Paneth cells organize crypts by
producing a Wnt signal that acts on neighboring cells and keeps them proliferating in the
stem-cell state. A repulsive interaction based on ephrin-Eph binding causes the crypt cell types (which
express EphB, induced by Wnt) to segregate from the nondividing differentiated villus cell types
(which express EphrinB). Both ephrin and Eph are cell-surface proteins attached to the plasma
membrane; in many tissues, two cells that contain a different member of this pair repel each other
when they touch (see Figure 21-49). (Adapted from T. Sato and H. Clevers, Science 340:1190-
1194, 2013. With permission from AAAS.)


Ephrin-Eph Signaling Drives Segregation of the Different Gut Cell
Types

The remarkable self-organizing behavior of the cultured organoids suggests that
some interaction among the different epithelial cells drives them to segregate
from one another. The ephrin-Eph signaling pathway (discussed in Chapter 15)
appears to be responsible. The cells that live in the crypts express EphB receptor
proteins, while absorptive, goblet, and enteroendocrine cells, as they begin to
differentiate, switch off expression of this receptor and instead switch on expression
of its ligands, cell-surface proteins of the EphrinB family (Figure 22-8C). In
various other tissues, cells expressing Eph proteins are repelled by contacts with cells
expressing ephrins on their surface (see Figures 21-49 and 21-79). It seems that
the same is true in the gut lining, and that this mechanism serves to keep the cells
segregated and in their proper places. In EphB knockout mutants, the populations
become mixed, so that, for example, Paneth cells wander out onto the villi.

Notch Signaling Controls Gut Cell Diversification and Helps 
Maintain the Stem-Cell State 

If a single type of stem cell generates all the differentiated cell types in the gut
lining, what causes the progeny of this stem cell to diversify? Notch signaling has
this role in many other systems, where it mediates lateral inhibition, a
competitive interaction that drives neighboring cells toward different fates (see Figure
15-58 and Figure 21-35). All the essential components of the Notch pathway
are expressed in the crypts; it seems that Wnt signaling maintains them there. If
Notch signaling is abruptly blocked, within a few days all the cells in the crypts
differentiate as goblet cells, and absorptive cells cease to be produced; conversely,
if Notch signaling is artificially activated in all the cells, absorptive cells continue
Figure 22-9 How Notch signaling, in
combination with Wnt, maintains stem
cells and drives cell diversification
in the intestine. Wnt signaling leads to
expression of Notch and Delta in the cells
of the crypt, and Delta-Notch signaling
in the crypt mediates lateral inhibition
between adjacent cells. Cells expressing
higher levels of Delta eventually activate
Notch in their neighbors, adopt a secretory
fate, and stop dividing; their neighbors,
with activated Notch, are prevented
from differentiating and keep on dividing.
Essentially the same process operates
at the crypt base, where the Paneth cells
express higher levels of Delta to prevent
stem cells from differentiating, and in
the transit amplifying population, where
nascent secretory cells express higher
levels of Delta. Division continues in the
Notch-activated cells as they move up the
crypt, until they escape from the influence
of Wnt and emerge onto the villi to become
absorptive cells.


to be generated but no goblet cells are produced. This reflects the lateral
inhibition mechanism operating in normal animals: the nascent goblet (and other
secretory) cells express the Notch ligand Delta and thereby activate Notch in their
neighbors, inhibiting them from differentiating as secretory cells (Figure 22-9).

Delta-Notch signaling is crucial not only in the transit amplifying population,
but also at the crypt base: the Paneth cells express Delta and this activates Notch
in the stem cells, inhibiting differentiation. Without this influence, the stem cells
lose their special character and differentiate as secretory cells. Thus maintenance
of the intestinal stem-cell state requires a combination of signals, with both Wnt
and Notch acting as central players.

The Epidermal Stem-Cell System Maintains a Self-Renewing 
Waterproof Barrier 

Stem-cell systems are organized in many different ways, but they share some
underlying principles. Consider, for example, the epidermis, the outer epithelial
covering of the body. The epidermis undergoes continual renewal, but, unlike the
lining of the gut, it is multilayered or stratified. Stem cells are located in the basal
layer, and their progeny move outward toward the exposed surface, differentiating
as they go. They end up as lifeless scales or squames, which are eventually shed
from the surface of the skin (Figure 22-10). Even though the architecture of this
tissue is very different from that of the intestine, many of the same basic principles
apply. The stem cells depend for their existence on signals from a specific niche,
in this case the basal lamina and underlying connective tissue. The daughters of
stem cells that are committed to differentiation undergo several divisions as
transit amplifying cells (while still in the basal layer) before differentiating. Finally, a
stochastic independent-choice mechanism dictates the fates of the daughters of a
stem-cell division, allowing for increase in the number of stem cells when needed
for growth or wound healing. Most of the same signaling pathways that organize
the intestinal stem-cell system are also involved in regulating the epidermal
stem-cell system, although with different individual roles.
Figure 22-10 The multilayered structure of the epidermis, as seen in thin skin of a mouse. (A) The epidermis forms
the outer covering of the skin, creating a waterproof barrier that is self-repairing and continually renewed. Beneath this lies
a relatively thick layer of connective tissue, which includes the tough, collagen-rich dermis (from which leather is made) and
the underlying fatty subcutaneous layer or hypodermis. The cells of the epidermis are called keratinocytes, because their
characteristic differentiated activity is the synthesis of keratin intermediate filament proteins, which give the epidermis its
toughness. These cells change their appearance and properties from one layer to the next, progressing through a regular
program of differentiation. Those in the innermost layer, attached to an underlying basal lamina, are termed basal cells, and it is
usually only these that divide: the basal cell population includes relatively small numbers of stem cells along with larger numbers
of transit amplifying cells derived from them. Above the basal cells are several layers of larger prickle cells, shown in top view
in (B), whose numerous desmosomes (each a site of anchorage for thick tufts of keratin filaments) are just visible in the light
microscope as tiny prickles around the cell surface. Beyond the prickle cells lies the thin, darkly staining granular cell layer,
where the cells are sealed together to form a waterproof barrier; this marks the boundary between the inner, metabolically active
strata and the outermost layer of the epidermis, consisting of dead cells whose intracellular organelles have disappeared. These
outermost cells are reduced to flattened scales, or squames, filled with densely packed keratin, which are eventually shed from
the surface of the skin. The time from exit of a cell from the basal layer to its loss by shedding at the surface is a week or two,
depending on body region and species.

In addition to the cells destined for keratinization, the deep layers of the epidermis include small numbers of cells (not shown)
that invade this tissue and have quite different origins and functions. These immigrants include dendritic cells, called Langerhans
cells, derived from bone marrow and belonging to the immune system; melanocytes (pigment cells) derived from the neural
crest; and Merkel cells, which are associated with nerve endings in the epidermis. (B, from R.V. Krstic, Ultrastructure of the
Mammalian Cell: an Atlas. Berlin: Springer-Verlag, 1979.)


Tissue Renewal That Does Not Depend on Stem Cells:
Insulin-Secreting Cells in the Pancreas and Hepatocytes in the Liver

Some types of cells can divide even though fully differentiated, allowing for
renewal and regeneration without the use of stem cells. The insulin-secreting cells
(β cells) of the pancreas are one example. Their mode of renewal has a special
importance, because it is the loss of these cells (through autoimmune attack) that
is responsible for type 1 (juvenile-onset) diabetes; they are also a significant factor
in the type 2 (adult-onset) form of the disease. The β cells are normally
sequestered in cell clusters called islets of Langerhans. These islets contain no obvious
subset of cells specialized to act as stem cells, yet fresh β cells are continually
generated within them. Lineage tracing studies, similar to those described above for
the gut, show that the renewal of this population normally occurs by simple
duplication of the existing insulin-expressing cells, and not by means of stem cells.

Another tissue that can renew by simple duplication of fully differentiated cells
is the liver. The main cell type in the liver is the hepatocyte, a large cell that
performs the liver's metabolic functions. Hepatocytes normally live for a year or more
and renew themselves through cell division at a very slow rate. Powerful
homeostatic mechanisms operate to adjust the rate of cell proliferation or the rate of cell
death, or both, so as to keep the organ at its normal size or restore it to that size
in case of damage. A dramatic effect is seen if large numbers of hepatocytes are
removed surgically or are killed by poisoning with carbon tetrachloride. Within
a day or so after either sort of damage, a surge of cell division occurs among the
surviving hepatocytes, quickly replacing the lost tissue. If two-thirds of a rat's liver
is removed, for example, a liver of nearly normal size can regenerate from the
remainder by hepatocyte proliferation within about two weeks.
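
A rough calculation shows how little division this requires. The Python sketch
below is a back-of-the-envelope estimate with a hypothetical function name; it
assumes that liver mass scales with hepatocyte number, that all surviving
hepatocytes divide equally, and that no cells die during regrowth.

```python
import math

def mean_doublings_to_restore(fraction_removed):
    """Average number of doublings per surviving cell needed to bring the
    population back to its original size (simplified: no cell death, cell
    size unchanged, all survivors contribute equally)."""
    fraction_remaining = 1.0 - fraction_removed
    return math.log2(1.0 / fraction_remaining)
```

For removal of two-thirds of the liver, the remaining third must triple, which
corresponds to about 1.6 doublings per surviving hepatocyte on average.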

Both the pancreas and the liver contain small populations of stem cells that
can be called into play as a backup mechanism for production of the
differentiated cell types in more extreme circumstances. This imparts resilience to the
mechanisms of renewal and repair.

Some Tissues Lack Stem Cells and Are Not Renewable 

The variety among tissues in the capacity for self-renewal is illustrated in a striking
way by comparing the olfactory epithelium in the nose, the auditory epithelium
of the inner ear, and the photoreceptive epithelium of the retina. These three
sensory structures, which like the epidermis develop from the ectodermal layer of
the early embryo, differ radically in their self-renewal capabilities. The olfactory
epithelium contains a population of stem cells that give rise to differentiated cells
that have a limited life-span and are continually replaced. But unlike the
epidermis, these differentiated cells (the olfactory receptor cells) are neurons, with cell
bodies lying in the olfactory epithelium and axons that extend back to the
olfactory lobes in the brain. The continual renewal of this epithelium therefore involves
continual production of fresh axons, which have to navigate back to the
appropriate sites in the brain.

In contrast, in mammals at least, the auditory epithelium and the retinal
epithelium lack stem cells, and their sensory receptor cells (the sensory hair cells in
the ear, the photoreceptors in the retina) are irreplaceable. If they are destroyed,
whether by too much exposure to loud noise, by looking into the beam of a laser,
or through degenerative processes in old age, the loss is permanent.

Summary 

Many tissues in the adult mammalian body are continually renewed by stem cells.
Stem cells, by definition, are not terminally differentiated and have the ability to
divide throughout the organism's lifetime, yielding some progeny that differentiate
and others that remain stem cells. The lining of the gut renews itself more rapidly
than any other tissue in the mammalian body and provides a paradigm for the
workings of stem-cell systems. In the small intestine, there is a continual upward
flow from crypts, where new cells are generated by cell division, onto villi that are
composed of nondividing differentiated cells. Wnt signaling maintains cell
proliferation in the crypts, and overactivation of the Wnt pathway gives rise to tumors.
Stem cells lie at each crypt base and are distinguished by expression of Lgr5 and
certain other genes. The Lgr5+ stem cells are multipotent, each capable of generating
several different types of differentiated cells as well as new stem cells. The balance
of fate choices is adjusted according to need, allowing increase in the number of
stem cells where more are needed for growth or repair. In a suitable cell-free culture
medium, a single Lgr5+ stem cell can generate a self-organizing "minigut,"
containing all the standard intestinal epithelial cell types.

Other self-renewing epithelia, such as the epidermis with its multilayered
(stratified) architecture, have stem cells and their differentiating progeny arranged in
different ways but are governed by similar basic principles. However, tissue renewal
and repair does not always have to depend on stem cells. Thus, the population of
insulin-producing cells in the pancreas is enlarged and renewed by simple
duplication of existing insulin-producing cells. Similarly, in the liver, differentiated
hepatocytes remain able to divide throughout life and can dramatically increase their
division rate when the need arises. At an opposite extreme, some tissues, such as the
sensory epithelia of the ear and the eye, do not undergo any turnover and are not
renewable: their cells, once lost, are lost forever.
FIBROBLASTS AND THEIR TRANSFORMATIONS:
THE CONNECTIVE-TISSUE CELL FAMILY

From epithelia, with their varied patterns of renewal and their enormous variety
of protective, absorptive, secretory, sensory, and biosynthetic functions, we turn
now to connective tissues. Connective tissues typically consist of cells dispersed
in extracellular matrix that they themselves secrete, as discussed in Chapter 19.
They originate from the mesodermal (middle) layer of the early embryo,
sandwiched between ectoderm and endoderm (see Chapter 21, Figure 21-3).

In the adult body, virtually all epithelia are supported by a connective-tissue
bed, or stroma; and specialized types of connective tissue, such as bone, cartilage,
and tendon, form the supporting framework of the body as a whole. No less
important than its mechanical role, connective tissue also contains the blood vessels
that bring the oxygen and nourishment on which all cells depend. Cells of
the immune system roam through connective tissue, passing in and out of blood
vessels and lymphatics, and providing defence against infection; and through the
meshes of connective tissue run peripheral nerves. Also embedded in connective
tissue are the muscles that enable us to move. In these many ways, the cells
that form connective tissue and synthesize its various types of extracellular matrix
contribute to the support and repair of almost every tissue and organ.

Connective-tissue cells belong to a family of cell types that are related by origin,
and they are often remarkably interconvertible. The family includes fibroblasts,
cartilage cells, and bone cells, all of which are specialized for the secretion
of collagenous extracellular matrix and are jointly responsible for the architectural
framework of the body. The connective-tissue family also includes fat cells
(adipocytes) and smooth muscle cells. Figure 22-11 illustrates these cell types and
the interconversions that are thought to occur between them. The adaptability of
the differentiated character of connective-tissue cells is an important feature of
responses to many types of damage.

Fibroblasts Change Their Character in Response to Chemical and Physical Signals

Fibroblasts seem to be the least specialized cells in the connective-tissue family. 
They are dispersed in connective tissue throughout the body, where they secrete 
a nonrigid extracellular matrix that is rich in type I or type III collagen, or both, as 
discussed in Chapter 19. When a tissue is injured, the fibroblasts nearby proliferate,
migrate into the wound (Movie 22.1), and produce large amounts of collagenous
matrix that helps to isolate and repair the damaged tissue. Their ability to
thrive in the face of injury, together with their solitary lifestyle, may explain why
fibroblasts are the easiest of cells to grow in culture—a feature that has made them
a favorite subject for cell biological studies.

Figure 22-11 The family of connective-tissue cells. Arrows show the
interconversions that are thought to occur within the family. For simplicity,
the fibroblast is shown as a single cell type, but it is uncertain how many types
of fibroblasts exist and whether the differentiation potential of different types is
restricted in different ways.

A class of connective-tissue cells in the bone marrow, called bone marrow stromal
cells, provides an example of radical connective-tissue versatility. These cells,
which can be regarded as a kind of fibroblast, can be isolated from the bone marrow
and propagated in culture. Large clones of progeny can be generated in this
way from single ancestral stromal cells. Depending on the culture conditions, the
members of such a clone either can continue proliferating to produce more cells
of the same type, or can differentiate as fat cells, cartilage cells, or bone cells. The
fate of the cells depends on physical as well as chemical signals: embedded in a
stiff, unyielding matrix, they tend to turn into bone cells, whereas in a softer, more
elastic matrix, they tend to turn into fat cells. This effect is mediated by an intracellular
pathway that responds to tension in actin-myosin bundles and relays a
signal to specific transcription regulators in the nucleus (Figure 22-12). Because
of their self-renewing, multipotent character, the bone marrow stromal cells, and
other cells with similar properties, are referred to as mesenchymal stem cells.


Figure 22-12 Control of fibroblast
differentiation by the physical properties
of the extracellular matrix. On a stiff
matrix, the cells form strong adhesions,
spread out, and tend to turn into bone
cells. On a soft matrix, where the cells are
unable to form strong anchorages, they fail
to spread and tend to differentiate as fat
cells. These effects depend on transcription
regulators (YAP and TAZ proteins) that
move into the cell nucleus in response to
tension developed in actin-myosin bundles
in the cytoplasm. (Based on S. Dupont et
al., Nature 474:179-183, 2011.)


Osteoblasts Make Bone Matrix 

Cartilage and bone are tissues of very different character; but they are closely
related in origin, and the formation of the skeleton depends on an intimate
partnership between them.

Cartilage tissue is structurally simple, consisting of cells of a single
type—chondrocytes—embedded in a more or less uniform, highly hydrated matrix
consisting of proteoglycans and type II collagen (discussed in Chapter 19). The
cartilage matrix is deformable, and the tissue grows by expanding as the chondrocytes
divide and secrete more matrix (Figure 22-13). Bone, by contrast, is dense and
rigid; it grows by apposition—that is, by deposition of additional matrix on free
surfaces. Like reinforced concrete, the bone matrix is predominantly a mixture of
tough fibers (type I collagen fibrils), which resist pulling forces, and solid particles
(calcium phosphate as hydroxylapatite crystals), which resist compression. The
bone matrix is secreted by osteoblasts that lie at the surface of the existing matrix
and deposit fresh layers of bone onto it. Some of the osteoblasts remain free at the
surface, while others gradually become embedded in their own secretion. This
freshly formed material (consisting chiefly of type I collagen) is rapidly converted
into hard bone matrix by the deposition of calcium phosphate crystals in it.

Once imprisoned in hard matrix, the original bone-forming cell, now called
an osteocyte, has no opportunity to divide, although it continues to secrete additional
matrix in small quantities around itself. The osteocyte, like the chondrocyte,
occupies a small cavity, or lacuna, in the matrix, but unlike the chondrocyte
it is not isolated from its fellows. Tiny channels, or canaliculi, radiate from each
lacuna and contain cell processes from the resident osteocyte, enabling it to form
gap junctions with adjacent osteocytes (Figure 22-14). Blood vessels and nerves
run through the tissue, keeping the bone cells alive and reacting when the bone
is damaged.

Figure 22-13 The growth of cartilage. The tissue expands as the chondrocytes
divide and make more matrix. The freshly synthesized matrix with which each cell
surrounds itself is shaded dark green. Cartilage may also grow by recruiting
fibroblasts from the surrounding tissue and converting them into chondrocytes.

Figure 22-14 Deposition of bone matrix by osteoblasts. Osteoblasts lining the
surface of bone secrete the organic matrix of bone (osteoid) and are converted into
osteocytes as they become embedded in this matrix. The matrix calcifies soon after
it has been deposited. The osteoblasts themselves are thought to derive from
osteogenic stem cells that are closely related to fibroblasts.

A mature bone has a complex and beautiful architecture, in which dense plates
of compact bone tissue enclose spaces spanned by light frameworks of trabecular
bone—a filigree of delicate shafts and flying buttresses of bone tissue, with soft
marrow in the interstices (Figure 22-15). The creation, maintenance, and repair
of this structure depend not only on the cells of the connective-tissue family
that synthesize matrix, but also on a separate class of cells called osteoclasts that
degrade it, as we explain below.

Bone Is Continually Remodeled by the Cells Within It 

For all its rigidity, bone is by no means a permanent and immutable tissue.
Running through the hard extracellular matrix are channels and cavities occupied
by living cells, which account for about 15% of the weight of compact bone.
These cells are engaged in an unceasing process of remodeling: while osteoblasts 
deposit new bone matrix, osteoclasts demolish old bone matrix. This mechanism 
provides for continuous turnover and replacement of the matrix in the interior of 
the bone. 

Osteoclasts (Figure 22-16) are large, multinucleated cells that originate, like
macrophages, from hematopoietic stem cells in the bone marrow (discussed later
in this chapter). The precursor cells are released into the bloodstream and collect
at sites of bone resorption, where they fuse to form the multinucleated osteoclasts,
which cling to surfaces of the bone matrix and eat it away. Osteoclasts are
capable of tunneling deep into the substance of compact bone, forming cavities
that are then invaded by other cells. A blood capillary grows down the center of
such a tunnel, and the walls of the tunnel become lined with a layer of osteoblasts
(Figure 22-17). These osteoblasts lay down concentric layers of new matrix,
which gradually fill the cavity, leaving only a narrow canal surrounding the new
blood vessel. At the same time as some tunnels are filling up with bone, others are
being bored by osteoclasts, cutting through older concentric systems.

Figure 22-15 Trabecular and compact bone. (A) Low-magnification scanning
electron micrograph of trabecular bone in a vertebra of an adult man. The soft marrow
tissue has been dissolved away. (B) A slice through the head of the femur, with
bone marrow and other soft tissue likewise dissolved away, reveals the compact bone
of the shaft and the trabecular bone in the interior. Because of the way in which
bone tissue remodels itself in response to mechanical load, the trabeculae become
oriented along the principal axes of stress within the bone. (A, courtesy of Alan Boyde;
B, from J.B. Kerr, Atlas of Functional Histology. Mosby, 1999.)

Figure 22-16 Osteoclasts. (A) Drawing of an osteoclast in cross section. This giant, multinucleated cell erodes bone matrix. The
“ruffled border” is a site of secretion of acids (to dissolve the bone minerals) and hydrolases (to digest the organic components of
the matrix). Osteoclasts vary in shape, are motile, and often send out processes to resorb bone at multiple sites. They develop
from monocytes and can be viewed as specialized macrophages. (B) An osteoclast on bone matrix, seen by scanning electron
microscopy. The osteoclast has been crawling over the matrix, eating it away, and leaving a trail of pits where it has done so.
(A, from R.V. Krstic, Ultrastructure of the Mammalian Cell: An Atlas. Berlin: Springer-Verlag, 1979; B, courtesy of Alan Boyde.)

Figure 22-17 The remodeling of compact bone. Osteoclasts acting
together in a small group excavate a tunnel through the old bone, advancing at a rate
of about 50 µm per day. Osteoblasts enter the tunnel behind them, line its walls, and
begin to form new bone, depositing layers of matrix at a rate of 1-2 µm per day. At
the same time, a capillary sprouts down the center of the tunnel. The tunnel eventually
becomes filled with concentric layers of new bone, with only a narrow central
canal remaining. Each such canal, besides providing a route of access for osteoclasts
and osteoblasts, contains one or more blood vessels that transport the nutrients
the bone cells require for survival. Typically, about 5-10% of the bone in a healthy adult
mammal is replaced in this way each year. (After Z.F.G. Jaworski, B. Duck, and
G. Sekaly, J. Anat. 133:397-405, 1981. With permission from Blackwell Publishing.)
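
The rates quoted in the legend of Figure 22-17 lend themselves to a quick back-of-the-envelope calculation. The sketch below is illustrative only. The excavation rate, the deposition rate, and the annual replacement fraction are taken from the legend; the tunnel length, tunnel radius, and canal radius are hypothetical values chosen for the example and are not given in the text.

```python
# Back-of-the-envelope arithmetic for the remodeling of compact bone.
# Values marked "from legend" come from Figure 22-17; values marked
# "assumed" are hypothetical and chosen only for illustration.

EXCAVATION_UM_PER_DAY = 50.0         # from legend: advance of the osteoclast tunnel
DEPOSITION_UM_PER_DAY = (1.0, 2.0)   # from legend: matrix deposition by osteoblasts
ANNUAL_REPLACEMENT = (0.05, 0.10)    # from legend: fraction of bone replaced per year

TUNNEL_LENGTH_UM = 2000.0            # assumed tunnel length (2 mm)
TUNNEL_RADIUS_UM = 100.0             # assumed radius of the excavated tunnel
CANAL_RADIUS_UM = 20.0               # assumed radius of the remaining central canal


def days_to_excavate(length_um, rate_um_per_day=EXCAVATION_UM_PER_DAY):
    """Days needed for osteoclasts to bore a tunnel of the given length."""
    return length_um / rate_um_per_day


def days_to_fill(tunnel_radius_um, canal_radius_um, rate_um_per_day):
    """Days needed for osteoblasts to thicken the wall inward to the canal."""
    return (tunnel_radius_um - canal_radius_um) / rate_um_per_day


def mean_turnover_years(annual_fraction):
    """Years needed to replace an amount of bone equal to the whole."""
    return 1.0 / annual_fraction


if __name__ == "__main__":
    print("excavation:", days_to_excavate(TUNNEL_LENGTH_UM), "days")
    for rate in DEPOSITION_UM_PER_DAY:
        days = days_to_fill(TUNNEL_RADIUS_UM, CANAL_RADIUS_UM, rate)
        print("infilling at", rate, "um/day:", days, "days")
    for fraction in ANNUAL_REPLACEMENT:
        print("fraction", fraction, "per year:", mean_turnover_years(fraction), "years")
```

With these assumed dimensions, the wall at any one point takes weeks to months to fill in, and a replacement rate of 5-10% per year corresponds to an average turnover time of roughly 10 to 20 years. Different assumed dimensions would change the first figure but not the second.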

Osteoclasts Are Controlled by Signals From Osteoblasts

The osteoblasts that make the matrix also produce the signals that recruit and
activate the osteoclasts to degrade it. Disturbance of the balance can lead to
osteoporosis, where there is excessive erosion of the bone matrix and weakening of the
bone, or to the opposite condition, osteopetrosis, where the bone becomes excessively
thick and dense. Hormonal signals have powerful effects on this balance.
Chronic use of corticosteroid drugs, for example, can cause osteoporosis as a side
effect; but this can be treated by other drugs that redress the balance, including
agents that block the factors that osteoblasts secrete to recruit osteoclasts.

Local controls allow bone to be deposited in one place while it is resorbed
in another. Through such controls over the process of remodeling, bones are
endowed with a remarkable ability to adjust their structure in response to long-term
variations in the load imposed on them. It is this that makes orthodontics
possible, for example: a steady force applied to a tooth with a brace will cause it to
move gradually, over many months, through the bone of the jaw, by remodeling of
the bone tissue ahead of it and behind it.

Bone can also undergo much more rapid and dramatic reconstruction when
the need arises. Some cells capable of forming new cartilage persist in the connective
tissue that surrounds a bone. If the bone is broken, the cells in the neighborhood
of the fracture repair it by a process that resembles the way bones develop
in the embryo: cartilage is first laid down to bridge the gap and is then replaced
by bone. The capacity for self-repair, so strikingly illustrated by the tissues of the
skeleton, is a property of living structures that has no parallel among present-day
man-made objects.

Summary 

The family of connective-tissue cells includes fibroblasts, cartilage cells, bone cells,
fat cells, and smooth muscle cells. Some classes of fibroblasts, such as the mesenchymal
stem cells of bone marrow, seem to be able to transform into any of the other
members of the family. These transformations of connective-tissue cell type are regulated
by the composition of the surrounding extracellular matrix, by cell shape,
and by hormones and growth factors. Cartilage and bone both consist of cells and
solid matrix that the cells secrete around themselves—chondrocytes in cartilage,
osteoblasts in bone (osteocytes being osteoblasts that have become trapped within
the bone matrix). The matrix of cartilage is deformable so that the tissue can grow
by swelling, whereas bone is rigid and can grow only by apposition. While osteoblasts
secrete bone matrix, they also produce signals that recruit monocytes from
the circulation to become osteoclasts, which degrade bone matrix. Through the
activities of these antagonistic classes of cells, bone undergoes a perpetual remodeling
through which it can adapt to the load it bears and alter its density in response
to hormonal signals. Moreover, adult bone retains an ability to repair itself if fractured,
by reactivation of the mechanisms that governed its embryonic development:
cells in the neighborhood of the break convert into cartilage, which is later replaced
by bone.

GENESIS AND REGENERATION OF SKELETAL MUSCLE

The term "muscle" includes many cell types, all specialized for contraction but
in other respects dissimilar. As noted in Chapter 16, all eukaryotic cells possess a
contractile system involving actin and myosin, but muscle cells have developed
this apparatus to a high degree. Mammals possess four main categories of cells
specialized for contraction: skeletal muscle cells, heart (cardiac) muscle cells,
smooth muscle cells, and myoepithelial cells (Figure 22-18). These differ in function,
structure, and development. Although all of them generate contractile forces
by using organized filament systems based on actin and myosin II, the actin and
myosin molecules employed have somewhat different amino acid sequences, are
differently arranged in the cell, and are associated with different sets of proteins
that control contraction.

We focus in this section on skeletal muscle cells, which are responsible for
practically all movements that are under voluntary control. These cells can be
very large (2-3 cm long and 100 µm in diameter in an adult human) and are often
called muscle fibers because of their highly elongated shape. Each one is a syncytium,
containing many nuclei within a common cytoplasm. In an intact muscle,
they are bundled tightly together, with fibroblasts (and some fat cells) in the
interstices between them and blood vessels and nerve fibers running through the
tissue. The mechanisms of muscle contraction were discussed in Chapter 16. Here
we consider the unusual strategy by which the multinucleate skeletal muscle cells
are generated and maintained.

Myoblasts Fuse to Form New Skeletal Muscle Fibers 

During development, certain cells, originating from the somites of a vertebrate 
embryo at a very early stage, become determined as myoblasts, the precursors 
of skeletal muscle fibers. After a period of proliferation, the myoblasts undergo a
dramatic change of state: they stop dividing, switch on the expression of a whole
battery of muscle-specific genes required for terminal differentiation, and fuse
with one another to form multinucleate skeletal muscle fibers (Figure 22-19).
Once differentiation and cell fusion have occurred, the cells do not divide and the
nuclei never again replicate their DNA.


Figure 22-18 The four classes of muscle cells of a mammal. (A) Schematic
drawings (to scale). (B-E) Scanning electron micrographs. Skeletal muscle
fibers (B, from a hamster) are giant cells with many nuclei and are formed by cell
fusion. The other types of muscle cells are more conventional, generally having
only a single nucleus. Heart muscle cells (C, from a rat) resemble skeletal muscle
fibers in that their actin and myosin filaments are aligned in very orderly arrays
to form a series of contractile units called sarcomeres, so that the cells have a
striated (striped) appearance. The arrows in (C) point to intercalated
discs—end-to-end junctions between the heart muscle cells; skeletal muscle cells
in long muscles are joined end-to-end in a similar way. Smooth muscle cells
(D, from the urinary bladder of a guinea-pig) are so named because they do not
appear striated; they belong to the connective-tissue family and are closely
related to fibroblasts. Note that the smooth muscle is shown here at a lower
magnification than the other muscle types. The functions of smooth muscle vary
greatly, from propelling food along the digestive tract to erecting hairs
in response to cold or fear. Myoepithelial cells (E, from a secretory alveolus of a
lactating rat mammary gland) also have no striations, but unlike all other muscle
cells they lie in epithelia and are derived from the ectoderm. They form the dilator
muscle of the eye’s iris and serve to expel saliva, sweat, and milk from the
corresponding glands. (B, courtesy of Junzo Desaki; C, from T. Fujiwara, in Cardiac
Muscle in Handbook of Microscopic Anatomy [E.D. Canal, ed.]. Berlin: Springer-Verlag,
1986; D, courtesy of Satoshi Nakasiro; E, from T. Nagato et al., Cell Tissue Res.
209:1-10, 1980. With permission from Springer-Verlag.)

Figure 22-19 Myoblast fusion in culture. The culture is stained with a fluorescent
antibody (green) against skeletal muscle myosin, which marks differentiated
muscle cells, and with a DNA-specific dye (blue) to show cell nuclei. (A) A short
time after a change to a culture medium that favors differentiation, just two of the
many myoblasts in the field of view have switched on myosin production and have
fused to form a muscle cell with two nuclei (upper right). (B) Somewhat later, almost
all the cells have differentiated and fused. (C) High-magnification view, showing
characteristic striations (fine transverse stripes) in two of the multinucleate muscle
cells. (Courtesy of Jacqueline Gross and Terence Partridge.)


Some Myoblasts Persist as Quiescent Stem Cells in the Adult 

Even though humans do not normally generate new skeletal muscle fibers in adult
life, they still have the capacity to do so, and existing muscle fibers can resume
growth when the need arises. Cells capable of serving as myoblasts are retained as
small, flattened, and inactive cells lying in close contact with the mature muscle
cell and contained within its sheath of basal lamina (Figure 22-20). If the muscle
is damaged or stimulated to grow, these satellite cells are activated to proliferate,
and their progeny can fuse to repair the damaged muscle or to allow muscle
growth. Satellite cells, or some subset of the satellite cells, are thus the stem cells
of adult skeletal muscle, normally held in reserve in a quiescent state but available
when needed as a self-renewing source of terminally differentiated cells.

The process of muscle repair by means of satellite cells is, however, limited
in what it can achieve. In one form of muscular dystrophy, for example, a genetic
defect in the cytoskeletal protein dystrophin damages differentiated skeletal muscle
cells. As a result, satellite cells proliferate to repair the damaged muscle fibers.
This regenerative response is, however, unable to keep pace with the damage, and
connective tissue eventually replaces the muscle cells, blocking any further possibility
of regeneration. A decline of capacity for repair likewise contributes to the
weakening of muscle in the elderly.

Figure 22-20 Satellite cells repair skeletal muscle fibers. (A) The specimen is stained with an antibody (red) against a muscle
cadherin, M-cadherin, which is present on both the satellite cell and the muscle fiber and is concentrated at the site where their
membranes are in contact. The nuclei of the muscle fiber are stained green, and the nucleus of the satellite cell is stained blue.
(B) Schematic of the repair of a damaged muscle fiber by proliferation and fusion of satellite cells. (A, courtesy of Terence Partridge.)

Summary

Skeletal muscle fibers are one of four main categories of vertebrate cells specialized
for contraction, and they are responsible for all voluntary movement. Each skeletal
muscle fiber is a syncytium and develops by the fusion of many myoblasts. Myoblasts
proliferate extensively, but once they have fused, they can no longer divide. Fusion
generally follows the onset of myoblast differentiation, in which many genes encoding
muscle-specific proteins are switched on coordinately. Some myoblasts persist
in a quiescent state as satellite cells in adult muscle; when a muscle is damaged,
these cells are reactivated to proliferate and to fuse in order to replace the muscle
cells that have been lost. They are the stem cells of skeletal muscle, and exhaustion of
their regenerative capacity is responsible for some forms of muscular dystrophy as
well as for the decline of muscle mass in old age.

BLOOD VESSELS, LYMPHATICS, AND ENDOTHELIAL CELLS

Almost all tissues depend on a blood supply, and the blood supply depends on
endothelial cells, which form the linings of the blood vessels. Endothelial cells
have a remarkable capacity to adjust their number and arrangement to suit local
requirements. They create an adaptable life-support system, extending by cell
migration into almost every region of the body. If it were not for endothelial cells
extending and remodeling the network of blood vessels, tissue growth and repair
would be impossible. Cancerous tissue is as dependent on a blood supply as is
normal tissue, and this has led to a surge of interest in endothelial cell biology,
in the hope that it may be possible to block the growth of tumors by attacking the
endothelial cells that bring them nourishment.

Endothelial Cells Line All Blood Vessels and Lymphatics

The largest blood vessels are arteries and veins, which have a thick, tough wall
of connective tissue and many layers of smooth muscle cells (Figure 22-21). The
inner wall is lined by an exceedingly thin single sheet of endothelial cells, the
endothelium, separated from the surrounding outer layers by a basal lamina. The
amounts of connective tissue and smooth muscle in the vessel wall vary according
to the vessel's diameter and function, but the endothelial lining is always present.
In the finest branches of the vascular tree—the capillaries and sinusoids—the
walls consist of nothing but endothelial cells and a basal lamina (Figure 22-22),
together with a few scattered pericytes. Related to vascular smooth muscle cells,
pericytes wrap themselves around the small vessels and strengthen them (Figure
22-23).

Figure 22-21 Diagram of a small artery in cross section. The endothelial cells
form the endothelial lining, which although inconspicuous, is the fundamental
component. Compare with the capillary in Figure 22-22.

Figure 22-22 Capillaries. Electron micrograph (left) of a cross section of a
small capillary in the pancreas. The wall is formed by a single endothelial cell
surrounded by a basal lamina, as seen most clearly in the drawing to the right.
(From R.P. Bolender, J. Cell Biol. 61:269-287, 1974. With permission from The
Rockefeller University Press.)

Less obvious than the blood vessels are the lymphatic vessels. These carry no
blood and have much thinner and more permeable walls than the blood vessels.
They provide a drainage system for the fluid (lymph) that seeps out of the blood
vessels, as well as an exit route for white blood cells that have migrated from blood
vessels into the tissues. Less happily, they can also provide the path by which
cancer cells escape from a primary tumor to invade other tissues. The lymphatics
form a branching system of tributaries, all ultimately discharging into a single
large lymphatic vessel, the thoracic duct, which opens into a large vein close to the
heart. Like blood vessels, lymphatics are lined with endothelial cells.

Thus, endothelial cells line the entire blood and lymphatic vascular system,
from the heart to the smallest capillary, and they control the passage of
materials—and the transit of white blood cells—into and out of the bloodstream.
Arteries, veins, capillaries, and lymphatics all develop from small vessels constructed
primarily of endothelial cells and a basal lamina: connective tissue and smooth
muscle are added later where required, under the influence of signals from the
endothelial cells.

Endothelial Tip Cells Pioneer Angiogenesis

To understand how the vascular system comes into being and how it adapts to the
changing needs of tissues, we have to understand endothelial cells. How do they
become so widely distributed, and how do they form channels that connect in just
the right way for blood to circulate through the tissues and for lymph to drain back
to the bloodstream?

Endothelial cells originate at specific sites in the early embryo from precursors
that also give rise to blood cells. From these sites, the early embryonic endothelial
cells migrate, proliferate, and differentiate to form the first rudiments of blood
vessels—a process called vasculogenesis. Subsequent growth and branching of
the vessels throughout the body occurs mainly by proliferation and movement of
the endothelial cells of these first vessels, in a process called angiogenesis.

Angiogenesis occurs in a broadly similar way in the young organism as it grows
and in the adult during tissue repair and remodeling. We can watch the behavior
of the cells in naturally transparent structures, such as the cornea of the eye or the
fin of a tadpole, or in tissue culture, or in the embryo. The embryonic retina, which
blood vessels invade according to a predictable timetable, provides a convenient
example for experimental study. Each new vessel originates as a capillary sprout
from the side of an existing capillary or small venule (Figure 22-24). At the tip
of the sprout, leading the way, is an endothelial cell with a distinctive character.
This tip cell has a pattern of gene expression somewhat different from that of the
endothelial stalk cells following behind it, and while they divide, it does not. The
tip cell's most striking feature is that it puts out many long filopodia, resembling
those of a neuronal growth cone. The column of stalk cells behind it, meanwhile,
becomes hollowed out to form a lumen.

Figure 22-23 Pericytes. The scanning electron micrograph shows pericytes
wrapping their processes around a small blood vessel (a post-capillary venule) in the
mammary gland of a cat. Pericytes are also present around capillaries, but are much
more sparsely distributed there. (From T. Fujiwara and Y. Uehara, Am. J. Anat.
170:39-54, 1984. With permission from Wiley-Liss.)

Figure 22-24 Angiogenesis. (A) A new blood capillary forms by the sprouting of an endothelial
cell from the wall of an existing small vessel. An endothelial tip cell, with many filopodia, leads the
advance of each capillary sprout. The endothelial stalk cells trailing behind the tip cell become
hollowed out to form a lumen. (B) Blood capillaries sprouting in the retina of an embryonic mouse
that had a red dye injected into the bloodstream, revealing the capillary lumen opening up behind
the tip cell (Movie 22.2). (B, from H. Gerhardt et al., J. Cell Biol. 161:1163-1177, 2003. With
permission from the author.)

The endothelial tip cells that pioneer the growth of normal capillaries not only
look like neuronal growth cones, but also respond similarly to signals in the environment.
In fact, many of the same guidance molecules are involved, including
the netrins, slits, and ephrins mentioned in our account of neural development
in the previous chapter. The corresponding receptors are expressed in the tip cells
and guide the vascular sprouts along specific pathways in the embryo, often in
parallel with nerves. Perhaps the most important of the guidance molecules for
endothelial cells, however, is one that is chiefly dedicated to the control of vascular
development: vascular endothelial growth factor, or VEGF.

Tissues Requiring a Blood Supply Release VEGF 

Almost every cell, in almost every tissue of a vertebrate, is located within
50-100 µm of a blood capillary. What mechanism ensures that the system of blood
vessels branches into every nook and cranny? How is it adjusted so perfectly to
the local needs of the tissues, not only during normal development but also in
pathological circumstances? Wounding, for example, induces a burst of capillary
growth in the neighborhood of the damage, to satisfy the high metabolic
requirements of the repair process (Figure 22-25). Local irritants and infections
also cause a proliferation of new capillaries, most of which regress and disappear
when the inflammation subsides. Less benignly, a small sample of tumor tissue
implanted in the cornea, which normally lacks blood vessels, causes blood vessels
to grow quickly toward the implant from the vascular margin of the cornea; the
growth rate of the tumor increases abruptly as soon as the vessels reach it.

In all these cases, the invading endothelial cells respond to signals produced
by the tissue that they invade. The signals are complex, but a key part is played
by vascular endothelial growth factor (VEGF). The regulation of blood vessel
growth to match the needs of the tissue depends on the control of VEGF production,
through changes in the stability of its mRNA and in its rate of transcription.
The latter control is relatively well understood. A shortage of oxygen, in practically
any type of cell, causes an increase in the intracellular level of a transcription
factor called hypoxia-inducible factor 1α (HIF1α). HIF1α stimulates transcription
of Vegf (and of other genes whose products are needed when oxygen is in short
supply). The VEGF protein is secreted, diffuses through the tissue, and acts on
nearby endothelial cells, stimulating them to proliferate, to produce proteases
to help them digest their way through the basal lamina of the parent capillary or
venule, and to form sprouts. The tip cells of the sprouts detect the VEGF gradient
and move toward its source. As the new vessels form, bringing blood to the tissue,
the oxygen concentration rises. The HIF1α activity then declines, VEGF production
is shut off, and angiogenesis comes to a halt (Figure 22-26).

Figure 22-25 New capillary formation in response to wounding. Scanning electron
micrographs of casts of the system of blood vessels surrounding the margin of
the cornea show the reaction to wounding. The casts are made by injecting a resin
into the vessels and letting the resin set; this reveals the shape of the lumen, as
opposed to the shape of the cells. Sixty hours after wounding, many new capillaries
have begun to sprout toward the site of injury, which is just above the top of the
picture. Their oriented outgrowth reflects a chemotactic response of the endothelial
cells to an angiogenic factor released at the wound. (Courtesy of Peter C. Burger.)

Figure 22-26 The regulatory mechanism controlling blood vessel growth
according to a tissue's need for oxygen. Lack of oxygen triggers the secretion of
VEGF, which stimulates angiogenesis.
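The homeostatic loop between oxygen, HIF1α, and VEGF can be illustrated with a toy simulation. The sketch below is a caricature rather than a quantitative model: the function name, the linear relations between hypoxia, HIF1α, and VEGF, and the parameters `demand`, `gain`, and `dt` are all assumptions chosen only for illustration. It reproduces the qualitative behavior described in the text: vessel density grows while the tissue is short of oxygen and stops growing once supply matches demand.

```python
def simulate_vessel_growth(demand, steps=500, dt=0.05, gain=1.0):
    """Toy negative-feedback loop: low oxygen -> HIF1a -> VEGF -> more vessels.

    All quantities are in arbitrary units, and every relation is assumed
    to be linear purely for illustration.
    """
    vessels = 0.1 * demand          # start with a sparse capillary supply
    trace = []
    for _ in range(steps):
        oxygen = min(1.0, vessels / demand)   # fraction of oxygen demand met
        hif1a = 1.0 - oxygen                  # HIF1a level rises with hypoxia
        vegf = hif1a                          # VEGF production follows HIF1a
        vessels += gain * vegf * dt           # VEGF drives sprouting
        trace.append((oxygen, vegf, vessels))
    return trace


for demand in (1.0, 2.0):
    oxygen, vegf, vessels = simulate_vessel_growth(demand)[-1]
    print(f"demand={demand}: vessels={vessels:.3f}, VEGF={vegf:.4f}")
```

In this sketch a tissue with twice the oxygen demand ends up with roughly twice the vessel density, and VEGF output falls toward zero in both cases, which is the sense in which the mechanism is self-limiting.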


Signals from Endothelial Cells Control Recruitment of Pericytes
and Smooth Muscle Cells to Form the Vessel Wall

The vascular network is continually remodeled as it grows and adapts. A newly
formed vessel may enlarge; or it may sprout side branches; or it may regress.
Smooth muscle and other connective-tissue cells that pack themselves around
the endothelium (see Figure 22-23) help to stabilize vessels as they enlarge. This
process of vessel wall formation begins with recruitment of pericytes. Small numbers
of these cells travel outward in company with the stalk cells of each endothelial
sprout. The recruitment and proliferation of pericytes and smooth muscle
cells to form a vessel wall depend on platelet-derived growth factor-B (PDGF-B)
secreted by the endothelial cells and on PDGF receptors in the pericytes and
smooth muscle cells. In mutants lacking this signal protein or its receptor, these
vessel wall cells are missing in many regions. As a result, the embryonic blood
vessels develop microaneurysms—microscopic pathological dilatations—that
eventually rupture, as well as other abnormalities, reflecting the importance of
signals exchanged in both directions between the exterior cells of the wall and the
endothelial cells.

Summary 

Endothelial cells are the fundamental elements of the vascular system. They form a
single cell layer that lines all blood vessels and lymphatics and regulates exchanges
between the bloodstream and the surrounding tissues. New vessels originate as
endothelial sprouts from the walls of existing small vessels. A specialized motile
endothelial tip cell at the leading edge of each sprout puts out filopodia that respond
to gradients of guidance molecules in the environment, leading the growth of the
sprout in much the same way as the growth cone of a neuron is led. The endothelial
stalk cells following behind become hollowed out to form a capillary tube. Signals
from endothelial cells organize the growth and development of the connective-tissue
cells that form the surrounding layers of the vessel wall.

A homeostatic mechanism ensures that blood vessels permeate every region of
the body. Cells that are short of oxygen increase their concentration of
hypoxia-inducible factor 1α (HIF1α), which stimulates the production of vascular
endothelial growth factor (VEGF). VEGF acts on endothelial cells, causing them to
proliferate and invade the hypoxic tissue to supply it with new blood vessels. As new
vessels enlarge, they recruit increasing numbers of pericytes—cells that cling to the
outside of the endothelial tube and mature into the smooth muscle coat that is needed
to give the vessel strength.

A HIERARCHICAL STEM-CELL SYSTEM: BLOOD CELL
FORMATION

The function of blood vessels is to carry blood, and it is to blood itself that we now
turn. Blood contains many types of cells, with functions that range from the transport
of oxygen to the production of antibodies. Some of these cells stay within the
vascular system, while others use the vascular system only as a means of transport
and perform their function elsewhere. All blood cells, however, have certain
similarities in their life history. They all have limited life-spans and are produced
throughout the life of the animal. Most remarkably, they are all generated ultimately
from a common stem cell, located (in adult humans) in the bone marrow.
This hematopoietic (blood-making) stem cell is thus multipotent, giving rise to all
the types of terminally differentiated blood cells as well as some other types of
cells, such as the osteoclasts in bone, as mentioned earlier. The hematopoietic
system is the most complex of the stem-cell systems in the mammalian body, and
it is exceptionally important in medical practice.

Red Blood Cells Are All Alike; White Blood Cells Can Be Grouped
in Three Main Classes

Blood cells can be classified as red or white. The red blood cells, or erythrocytes,
remain within the blood vessels and transport O₂ and CO₂ bound to hemoglobin.
The white blood cells, or leukocytes, combat infection and in some cases phagocytose
and digest debris. Leukocytes, unlike erythrocytes, must make their way
across the walls of small blood vessels and migrate into tissues to perform their
tasks. In addition, the blood contains large numbers of platelets, which are not
entire cells but small, detached cell fragments or "minicells" derived from the
cortical cytoplasm of large cells called megakaryocytes. Platelets adhere specifically
to the endothelial cell lining of damaged blood vessels, where they help to repair
breaches and aid in blood clotting.

All red blood cells belong in a single class, following the same developmental
trajectory as they mature, and the same is true of platelets; but there are many
distinct types of white blood cells. White blood cells are traditionally grouped into
three major categories—granulocytes, monocytes, and lymphocytes—based on
their appearance in the light microscope.

Granulocytes contain numerous lysosomes and secretory vesicles (or granules)
and are subdivided into three classes according to the morphology and
staining properties of these organelles (Figure 22-27). The differences in staining
reflect major differences of chemistry and function. Neutrophils (also called
polymorphonuclear leukocytes because of their multilobed nucleus) are the most
common type of granulocyte; they phagocytose and destroy microorganisms,
especially bacteria, and thus have a key role in innate immunity to bacterial
infection, as discussed in Chapter 24 (see Movie 16.1). Basophils secrete histamine
(and, in some species, serotonin) to help mediate inflammatory reactions; they
are closely related to mast cells, which reside in connective tissues but are also
generated from the hematopoietic stem cells. Eosinophils help to destroy parasites
and modulate allergic inflammatory responses.

Once they leave the bloodstream, monocytes (see Figure 22-27D) mature
into macrophages, which, together with neutrophils, are the main "professional
phagocytes" in the body. As discussed in Chapter 13, both types of phagocytic cells
contain specialized lysosomes that fuse with newly formed phagocytic vesicles
(phagosomes), exposing phagocytosed microorganisms to a barrage of enzymatically
produced, highly reactive molecules of superoxide (O₂⁻) and hypochlorite
(ClO⁻, the active ingredient in bleach), as well as to attack by a concentrated
mixture of lysosomal hydrolase enzymes that become activated in the phagosome.
Macrophages, however, are much larger and longer-lived than neutrophils. They
recognize and remove senescent, dead, and damaged cells in many tissues, and
they are unique in being able to ingest large microorganisms such as protozoa.

Figure 22-27 White blood cells. (A-D) These electron micrographs show
(A) a neutrophil, (B) a basophil, (C) an eosinophil, and (D) a monocyte.
Electron micrographs of lymphocytes are shown in Figure 24-14. Each
of the cell types shown here has a different function, which is reflected
in the distinctive types of secretory granules and lysosomes it contains.
There is only one nucleus per cell, but it has an irregular lobed shape,
and in (A), (B), and (C) the connections between the lobes are out of the
plane of section. (E) A light micrograph of a blood smear stained with the
Romanowsky stain, which colors the white blood cells strongly. (Courtesy
of Dorothy Bainton.)


Monocytes also give rise to dendritic cells. Like macrophages, dendritic cells
are migratory cells that can ingest foreign substances and organisms, but they do
not have as active an appetite for phagocytosis and instead have a crucial role
as presenters of foreign antigens to lymphocytes to trigger an immune response.
Dendritic cells in the epidermis (called Langerhans cells), for example, ingest
foreign antigens and carry these trophies back from the skin to present to
lymphocytes in lymph nodes.

There are two main classes of lymphocytes, both involved in immune
responses: B lymphocytes make antibodies, while T lymphocytes kill virus-infected
cells and regulate the activities of other white blood cells. In addition,
there are lymphocyte-like cells called natural killer (NK) cells, which kill some
types of tumor cells and virus-infected cells. The production of lymphocytes is a
specialized topic discussed in detail in Chapter 24. Here we concentrate mainly
on the development of the other blood cells, often referred to collectively as
myeloid cells.

Table 22-1 summarizes the various types of blood cells and their functions.

Table 22-1 Blood Cells

Type of cell                          Main functions                                          Typical concentration in human blood (cells/liter)

Red blood cells (erythrocytes)        Transport O₂ and CO₂                                    5 × 10¹²

White blood cells (leukocytes)
  Granulocytes
    Neutrophils (polymorphonuclear    Phagocytose and destroy invading bacteria               5 × 10⁹
    leukocytes)
    Eosinophils                       Destroy larger parasites and modulate allergic          2 × 10⁸
                                      inflammatory responses
    Basophils                         Release histamine (and in some species serotonin)       4 × 10⁷
                                      in certain immune reactions
  Monocytes                           Become tissue macrophages, which phagocytose and        4 × 10⁸
                                      digest invading microorganisms and foreign bodies
                                      as well as damaged senescent cells
  Lymphocytes
    B cells                           Make antibodies                                         2 × 10⁹
    T cells                           Kill virus-infected cells and regulate activities       1 × 10⁹
                                      of other leukocytes
  Natural killer (NK) cells           Kill virus-infected cells and some tumor cells          1 × 10⁸

Platelets (cell fragments arising     Initiate blood clotting                                 3 × 10¹¹
from megakaryocytes in bone marrow)

Humans contain about 5 liters of blood, accounting for 7% of body weight. Red blood cells
constitute about 45% of this volume and white blood cells about 1%, the rest being the
liquid blood plasma.

The Production of Each Type of Blood Cell in the Bone Marrow Is
Individually Controlled

Most white blood cells function in tissues other than the blood; blood simply
transports them to where they are needed. A local infection or injury in any tissue
rapidly attracts white blood cells into the affected region as part of the inflammatory
response, which helps fight the infection or heal the wound (Movie 22.3).

The inflammatory response is complex and is governed by many different signal
molecules produced locally by mast cells, nerve endings, platelets, and white
blood cells, as well as by the activation of complement (discussed in Chapter 24).
Some of these signal molecules act on the endothelial lining of nearby capillaries,
helping white blood cells to first stick and then make an exit from the bloodstream
into the tissue where they are needed, as described in Chapter 19 (see Figure
19-28 and Movie 19.2). Damaged or inflamed tissues and local endothelial cells
secrete other molecules called chemokines, which act as chemoattractants for
specific types of white blood cells, causing them to become polarized and crawl
toward the source of the attractant. As a result, large numbers of white blood cells
enter the affected tissue (Figure 22-28).

Other signal molecules produced during an inflammatory response escape
into the blood and stimulate the bone marrow to produce more leukocytes and
release them into the bloodstream. The regulation tends to be cell-type specific:
some bacterial infections, for example, cause a selective increase in neutrophils,
while infections with some protozoa and other parasites cause a selective increase
in eosinophils. (For this reason, physicians routinely use differential white blood
cell counts to aid in the diagnosis of infectious and other inflammatory diseases.)

In other circumstances, erythrocyte production is selectively increased—for
example, in response to anemia (lack of hemoglobin) due to blood loss, and in the
process of acclimatization when one goes to live at high altitude, where oxygen is
scarce. Thus, blood cell formation, or hematopoiesis, necessarily involves complex
controls, which regulate the production of each type of blood cell individually to
meet changing needs.

Figure 22-28 Chemotaxis of white blood cells to damaged tissue.
A chemoattractive signal released from a site of damage, which is toward
the bottom of the page, causes white blood cells to exit from the capillary by
crawling between adjacent endothelial cells, as shown.

Bone Marrow Contains Multipotent Hematopoietic Stem Cells,
Able to Give Rise to All Classes of Blood Cells

In the bone marrow, the developing blood cells and their precursors, including
the stem cells, are intermingled with one another, as well as with fat cells and
other stromal cells (connective-tissue cells), which produce a delicate supporting
meshwork of collagen fibers and other extracellular matrix components. In
addition, the whole tissue is richly supplied with thin-walled blood vessels, called
blood sinuses, into which the new blood cells are discharged. Megakaryocytes are
also present; these, unlike other blood cells, remain in the bone marrow when
mature and are one of its most striking features, being extraordinarily large
(diameter up to 60 µm) with a highly polyploid nucleus. They normally lie close beside
blood sinuses, and they extend processes through holes in the endothelial lining
of these vessels; platelets pinch off from the processes and are swept away into the
blood (Figure 22-29 and Movie 22.4).

Because of the complex arrangement of the cells in bone marrow, it is difficult
to identify in ordinary tissue sections any but the immediate precursors of the
mature blood cells. There is no obvious visible characteristic by which we can
recognize the ultimate stem cells. In the case of hematopoiesis, the stem cells were
first identified by a functional assay that exploited the wandering lifestyle of blood
cells and their precursors.

When an animal is exposed to a large dose of x-rays, most of the hematopoietic
cells are destroyed and the animal dies within a few days as a result of its inability
to manufacture new blood cells. The animal can be saved, however, by a transfusion
of cells taken from the bone marrow of a healthy, immunologically compatible
donor. Among these cells there are some that can colonize the irradiated
host and permanently reequip it with hematopoietic tissue (Figure 22-30). Such
experiments prove that the marrow contains hematopoietic stem cells. They also
show how we can assay for the presence of hematopoietic stem cells and hence
discover the molecular features that distinguish them from other cells.

For this purpose, cells taken from bone marrow are sorted (using a
fluorescence-activated cell sorter) according to the surface antigens that they display,
and the different fractions are transfused back into irradiated mice. If a fraction
rescues an irradiated host mouse, it must contain hematopoietic stem cells. In this
way, it has been possible to show that the hematopoietic stem cells are characterized
by a specific combination of cell-surface proteins, and by appropriate sorting
we can obtain virtually pure stem-cell preparations. The stem cells turn out to be a
tiny fraction of the bone marrow population—about 1 cell in 50,000-100,000; but
this is enough. A single such cell injected into a host mouse with defective
hematopoiesis is sufficient to reconstitute its entire hematopoietic system, generating a
complete set of blood cell types, as well as fresh stem cells. This and other
experiments (using artificial lineage markers) show that the individual hematopoietic
stem cell is multipotent and can give rise to the complete range of blood cell types,
both myeloid and lymphoid, as well as to new stem cells like itself (Figure 22-31).


Figure 22-29 A megakaryocyte among other developing blood cells in the bone
marrow. The megakaryocyte's enormous size results from its having a highly
polyploid nucleus. One megakaryocyte produces about 10,000 platelets, which
split off from long processes that extend through holes in the walls of an adjacent
blood sinus.


[Figure 22-30 diagram: (1) x-irradiation halts blood cell production; the mouse would
die if no further treatment were given. (2) Bone marrow cells from a healthy donor are
injected. (3) The mouse survives; the injected stem cells colonize its hematopoietic
tissues and generate a steady supply of new blood cells.]



Figure 22-30 Rescue of an irradiated 
mouse by a transfusion of bone marrow 
cells. An essentially similar procedure is 
used in the treatment of leukemia in human 
patients by bone marrow transplantation. 











[Figure 22-31 diagram columns: stem cell; committed progenitors; differentiated cells]

Figure 22-31 A tentative scheme of hematopoiesis. The multipotent
stem cell normally divides infrequently to generate either more multipotent
stem cells, which are self-renewing, or committed progenitor cells, which are
limited in the number of times that they can divide before differentiating to form
mature blood cells. As they go through their divisions, the progenitors become
progressively more specialized in the range of cell types that they can give rise
to, as indicated by the branching of this cell-lineage diagram. In adult mammals,
all of the cells shown develop mainly in the bone marrow—except for T lymphocytes,
which as indicated develop in the thymus, and macrophages and osteoclasts,
which develop from blood monocytes. Some dendritic cells may also derive from
monocytes.


Commitment Is a Stepwise Process 

Hematopoietic stem cells do not jump directly from a multipotent state into a
commitment to just one pathway of differentiation; instead, they go through a
series of progressive restrictions. The first step, usually, is commitment to either a
myeloid or a lymphoid fate. This is thought to give rise to two kinds of progenitor
cells, one capable of generating large numbers of all the different types of myeloid
cells, and the other giving rise to large numbers of all the different types of
lymphoid cells. Further steps give rise to progenitors committed to the production of
just one cell type. The steps of commitment correlate with changes in the expression
of specific transcription regulators, needed for the production of different
subsets of blood cells.

Divisions of Committed Progenitor Cells Amplify the Number of 
Specialized Blood Cells 

Hematopoietic progenitor cells generally become committed to a particular pathway
of differentiation long before they cease proliferating and terminally differentiate.
The committed progenitors go through many rounds of cell division to
amplify the ultimate number of cells of the given specialized type. In this way, a
single stem-cell division can lead to the production of thousands of differentiated
progeny, which explains why the number of stem cells is such a small fraction
of the total population of hematopoietic cells. For the same reason, a high rate
of blood cell production can be maintained even though the stem-cell division
rate is low. The smaller the number of division cycles that the stem cells themselves
have to undergo in the course of a lifetime, the lower the risk of generating
stem-cell mutations, which would give rise to persistent mutant clones of cells in
the body—a particular danger in the hematopoietic system where, as discussed
in Chapter 20, a relatively small accumulation of mutations can be sufficient to
cause cancer. A low rate of stem-cell division also slows the process of replicative
cell senescence (discussed in Chapter 17).

The stepwise nature of commitment means that the hematopoietic system can
be viewed as a hierarchical family tree of cells. Multipotent stem cells give rise
to committed progenitor cells, which are specified to give rise to only one or a
few blood cell types. The committed progenitors divide rapidly, but only a limited
number of times, before they terminally differentiate into cells that divide no further
and die after several days or weeks. Figure 22-31 depicts the hematopoietic
family tree. It should be noted, however, that variations are thought to occur: not
all stem cells generate the identical patterns of progeny via precisely the same
sequence of steps.
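The amplifying effect of progenitor divisions is simple doubling arithmetic, and a short calculation makes the scale concrete. The sketch below assumes an idealized case in which every cell divides in every round and none die. The function names are hypothetical, and real lineages will deviate from this because of cell death and unequal division numbers.

```python
def progeny_per_progenitor(divisions):
    """Cells produced by one committed progenitor after `divisions` rounds,
    assuming every cell divides each round and none die."""
    return 2 ** divisions


def divisions_needed(target_cells):
    """Smallest number of doubling rounds that yields at least `target_cells`.

    Uses integer arithmetic (bit_length) to avoid floating-point rounding.
    """
    return (target_cells - 1).bit_length()


for n in (5, 10, 12):
    print(n, "divisions ->", progeny_per_progenitor(n), "cells")
print("divisions needed for at least 1000 cells:", divisions_needed(1000))
```

Under these assumptions about ten rounds of division are enough for one committed progenitor to give rise to a thousand differentiated cells, which is why a rarely dividing stem-cell pool can sustain a high output.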

Stem Cells Depend on Contact Signals From Stromal Cells

Like the stem cells of other tissues, hematopoietic stem cells depend on signals
from their niche, in this case created by the specialized connective tissue of the
bone marrow. (This is the site in adult humans; during development, and in nonhuman
mammals such as the mouse, hematopoietic stem cells can also make
their home in other tissues—notably liver and spleen.) When they lose contact
with their niche, the hematopoietic stem cells tend to lose their stem-cell potential
(Figure 22-32). Evidently the loss of potency is not absolute or instantaneous,
however, since the stem cells can still survive journeys via the bloodstream to
colonize other sites in the body.

Factors That Regulate Hematopoiesis Can Be Analyzed in Culture 

While the stem cells depend on contact with bone marrow stromal cells for long-term
maintenance, their committed progeny do not, or at least not to the same
degree. These cells can thus be dispersed and cultured in a semisolid matrix of
dilute agar or methylcellulose, and factors derived from other cells can be added
artificially to the medium. The semisolid matrix inhibits migration, so that the
progeny of each isolated precursor cell remain together as an easily distinguishable
colony. A single committed neutrophil progenitor, for example, may give rise
to a clone of thousands of neutrophils. Such culture systems have provided a way
to assay for the factors that support hematopoiesis and hence to purify them and
explore their actions. These substances are glycoproteins and are usually called
colony-stimulating factors (CSFs). Some of these factors circulate in the blood
and act as hormones, while others act in the bone marrow as secreted local mediators;
still others take the form of membrane-bound signals that act through cell-cell
contact.

An important example of the latter is a protein called Steel or Stem Cell Factor
(SCF). This is expressed both in the bone marrow stroma (where it helps to
define the stem-cell niche) and along pathways of migration, and it occurs both
in a membrane-bound and a soluble form. It binds to a receptor tyrosine kinase
called Kit, and it is required during development for guidance and survival not
only of hematopoietic cells but also of other migratory cell types—specifically,
germ cells and pigment cells.

Erythropoiesis Depends on the Hormone Erythropoietin

The best understood of the CSFs that act as hormones is the glycoprotein erythropoietin,
which is produced in the kidneys and regulates erythropoiesis, the formation
of red blood cells, to which we now turn.

Figure 22-32 Dependence of hematopoietic stem cells on contact
with stromal cells. The contact-dependent interaction between the Kit receptor
and its ligand is one of several signaling mechanisms thought to be involved in
hematopoietic stem-cell maintenance. The real system is certainly more
complex. Moreover, the dependence of hematopoietic cells on contact with stromal
cells cannot be absolute, since small numbers of the functional stem cells can be
found free in the circulation. SCF, stem-cell factor.

Figure 22-33 A developing red blood cell (erythroblast). The cell is shown
extruding its nucleus to become an immature erythrocyte (a reticulocyte),
which then leaves the bone marrow and passes into the bloodstream. The
reticulocyte will lose its mitochondria and ribosomes within a day or two
to become a mature erythrocyte. Erythrocyte clones develop in the bone
marrow on the surface of a macrophage, which phagocytoses and digests
the nuclei discarded by the erythroblasts.

The erythrocyte is by far the most common type of cell in the blood (see Table
22-1). When mature, it is packed full of hemoglobin and contains hardly any of
the usual cell organelles. In an erythrocyte of an adult mammal, even the nucleus,
endoplasmic reticulum, mitochondria, and ribosomes are absent, having been
extruded from the cell in the course of its development (Figure 22-33). The erythrocyte
therefore cannot grow or divide, and it has a limited life-span—about 120
days in humans or 55 days in mice. Worn-out erythrocytes are phagocytosed and
digested by macrophages in the liver and spleen, which remove more than 10¹¹
senescent erythrocytes in each of us each day. Young erythrocytes actively protect
themselves from this fate: they have a protein on their surface that binds to an
inhibitory receptor on macrophages and thereby prevents their phagocytosis.
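The daily turnover of more than 10¹¹ erythrocytes can be checked against the numbers in Table 22-1. The calculation below is a back-of-envelope sketch: it assumes a steady state in which every erythrocyte lives exactly 120 days, and the function and constant names are illustrative only.

```python
RBC_PER_LITER = 5e12       # erythrocyte concentration, from Table 22-1
BLOOD_VOLUME_LITERS = 5    # approximate human blood volume, Table 22-1 footnote
LIFESPAN_DAYS = 120        # human erythrocyte life-span quoted in the text


def daily_turnover(cells_per_liter, volume_liters, lifespan_days):
    """Cells replaced per day at steady state, assuming a fixed life-span."""
    total_cells = cells_per_liter * volume_liters
    return total_cells / lifespan_days


per_day = daily_turnover(RBC_PER_LITER, BLOOD_VOLUME_LITERS, LIFESPAN_DAYS)
per_second = per_day / (24 * 60 * 60)
print(f"total erythrocytes:  {RBC_PER_LITER * BLOOD_VOLUME_LITERS:.1e}")
print(f"replaced per day:    {per_day:.1e}")
print(f"replaced per second: {per_second:.1e}")
```

With these inputs the estimate comes to roughly 2 × 10¹¹ cells per day, or a little over two million per second, consistent with the figure given in the text.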

A lack of oxygen or a shortage of erythrocytes stimulates specialized cells in the
kidney to synthesize and secrete increased amounts of erythropoietin into the
bloodstream. The erythropoietin, in turn, boosts the production of erythrocytes.
The effect is rapid: the rate of release of new erythrocytes into the bloodstream
rises steeply 1-2 days after an increase in erythropoietin levels in the bloodstream.
Clearly, the hormone must act on cells that are close precursors of the mature
erythrocytes.

The cells that respond to erythropoietin can be identified by culturing bone
marrow cells in a semisolid matrix in the presence of erythropoietin. In a few
days, colonies of about 60 erythrocytes appear, each founded by a single committed
erythroid progenitor cell. This progenitor depends on erythropoietin for
its survival as well as its proliferation. It does not yet contain hemoglobin, and it
is derived from an earlier type of committed erythroid progenitor whose survival
and proliferation are governed by other factors.

Multiple CSFs Influence Neutrophil and Macrophage Production 

The two classes of cells dedicated to phagocytosis, neutrophils and macrophages,
develop from a common progenitor cell called a granulocyte/macrophage (GM)
progenitor cell. Like the other granulocytes (eosinophils and basophils), neutrophils
circulate in the blood for only a few hours before migrating out of capillaries
into the connective tissues or other specific sites, where they survive for only a few
days. They then die by apoptosis and are phagocytosed by macrophages. Macrophages,
in contrast, can persist for months or perhaps even years outside the
bloodstream, where they can be activated by local signals to resume proliferation.

At least seven distinct CSFs that stimulate neutrophil and macrophage colony
formation in culture have been defined, and some or all of these are thought to
act in different combinations to regulate the selective production of these cells
in vivo. These CSFs are synthesized by various cell types—including endothelial
cells, fibroblasts, macrophages, and lymphocytes—and their concentration in
the blood typically increases rapidly in response to bacterial infection in a tissue,
thereby increasing the number of phagocytic cells released from the bone marrow
into the bloodstream.

The CSFs not only operate on the precursor cells to promote the production
of differentiated progeny, they also activate the specialized functions (such as
phagocytosis and target-cell killing) of the terminally differentiated cells. CSFs can
be synthesized artificially and are now widely used in human patients to stimulate
the regeneration of hematopoietic tissue and to boost resistance to infection.




The Behavior of a Hematopoietic Cell Dépends Partly on Chance 

CSFs are defined as factors that promote the production of colonies of differentiated 
blood cells. But precisely what effect does a CSF have on an individual hematopoietic 
cell? The factor might control the rate of cell division or the number of 
division cycles that the progenitor cell undergoes before differentiating; it might 
act late in the hematopoietic lineage to facilitate differentiation; it might act early 
to influence commitment; or it might simply increase the probability of cell survival 
(Figure 22-34). By monitoring the fate of isolated individual hematopoietic 
cells in culture, it has been possible to show that a single CSF, such as granulocyte/ 
macrophage CSF, can exert all these effects, although it is still not clear which are 
most important in vivo. 

Figure 22-34 Some of the parameters 
through which the production of 
blood cells of a specific type might 
be regulated. Studies in culture suggest 
that various colony-stimulating factors 
(CSFs) can affect all of these aspects of 
hematopoiesis. 

1. Frequency of stem-cell division 
2. Probability of stem-cell death 
3. Probability that stem-cell daughter will become a committed progenitor cell of the given type 
4. Division cycle time of committed progenitor cell 
5. Probability of progenitor-cell death 
6. Number of committed progenitor-cell divisions before terminal differentiation 
7. Lifetime of differentiated cells 
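
As a rough illustration of how several of the parameters in Figure 22-34 could combine, the Python sketch below computes a steady-state pool of mature cells. The function name, the simple multiplicative form of the model, and every numerical value are assumptions chosen for illustration; none of them comes from the text or from measurements. Stem-cell death and progenitor cycle time (parameters 2 and 4) are left out: in a model this simple, cycle time delays the output rather than changing its steady-state level.

```python
def steady_state_mature_cells(stem_divisions_per_day,
                              p_commit,
                              p_progenitor_survival,
                              n_progenitor_divisions,
                              lifetime_days):
    """Toy estimate of the steady-state number of mature cells of one type.

    All arguments are hypothetical illustration values, not measured ones.
    """
    # Each stem-cell division yields two daughters; each commits to this
    # lineage with probability p_commit (parameters 1 and 3).
    committed_per_day = stem_divisions_per_day * 2 * p_commit
    # Every progenitor division doubles the clone, and each daughter
    # survives with probability p_progenitor_survival (parameters 5 and 6).
    cells_per_progenitor = (2 * p_progenitor_survival) ** n_progenitor_divisions
    produced_per_day = committed_per_day * cells_per_progenitor
    # At steady state, pool size = production rate x lifetime (parameter 7).
    return produced_per_day * lifetime_days


baseline = steady_state_mature_cells(1.0, 0.5, 1.0, 10, 2.0)
one_extra_division = steady_state_mature_cells(1.0, 0.5, 1.0, 11, 2.0)
print(baseline, one_extra_division)
```

In this sketch, a single extra round of progenitor division doubles the mature-cell pool, which shows why a factor acting on any one of these parameters could have a large effect on blood-cell numbers.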

Studies in vitro indicate, moreover, that there is a large element of chance in 
the way a hematopoietic cell behaves—a reflection, presumably, of "noise" in the 
genetic control system, as discussed in Chapters 7 and 8. If two sister cells are 
taken immediately after a cell division and cultured apart under identical conditions, 
they frequently give rise to colonies that contain different types of blood 
cells or the same types of blood cells in different numbers. Thus, both the programming 
of cell division and the process of commitment to a particular path of 
differentiation seem to involve random events at the level of the individual cell, 
even though the behavior of the multicellular system as a whole is regulated in a 
reliable way. The sequence of cell fate restrictions shown earlier, in Figure 22-31, 
conveys the impression of a program executed with computer-like logic and precision. 
Individual cells may be more varied, quirky, and erratic, and may sometimes 
progress by other decision pathways from the stem-cell state toward terminal 
differentiation. 
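
The sister-cell observation can be mimicked with a small stochastic simulation. The sketch below is only a toy model under stated assumptions: the two cell-type names, the division probability, and the generation limit are hypothetical, and the rule that each cell independently either divides or differentiates is a simplification, not a mechanism described in the text.

```python
import random
from collections import Counter

# Hypothetical fates available to a committed progenitor in this toy model.
CELL_TYPES = ("neutrophil", "macrophage")


def grow_colony(rng, p_divide=0.6, max_generations=8):
    """Grow one colony from a single progenitor using random choices.

    In each generation every progenitor either divides (probability
    p_divide) or differentiates into a randomly chosen cell type.
    """
    progenitors = 1
    colony = Counter()
    for _ in range(max_generations):
        next_progenitors = 0
        for _ in range(progenitors):
            if rng.random() < p_divide:
                next_progenitors += 2                 # two new progenitors
            else:
                colony[rng.choice(CELL_TYPES)] += 1   # terminal differentiation
        progenitors = next_progenitors
        if progenitors == 0:
            break
    # Any progenitors left at the generation limit differentiate.
    for _ in range(progenitors):
        colony[rng.choice(CELL_TYPES)] += 1
    return colony


# Two "sister cells" follow identical rules but draw independent random numbers.
sister_a = grow_colony(random.Random(1))
sister_b = grow_colony(random.Random(2))
print(dict(sister_a), dict(sister_b))
```

Because the two colonies share every parameter and differ only in their random draws, any difference in their size or composition arises from chance alone, while the average over many colonies stays predictable.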

Regulation of Cell Survival Is as Important as Regulation of Cell 
Proliferation 

The default behavior of hematopoietic cells in the absence of CSFs is death by 
apoptosis (discussed in Chapter 18), and the control of cell survival plays a central 
part in regulating the numbers of blood cells. The amount of apoptosis in the 
vertebrate hematopoietic system is enormous: billions of neutrophils die in this 
way each day in an adult human, for example. In fact, most neutrophils produced 
in the bone marrow die there without ever functioning. This futile cycle of production 
and destruction presumably serves to maintain a reserve supply of cells 
that can be promptly mobilized to fight infection whenever it flares up, or phagocytosed 
and digested for recycling when all is quiet. Compared with the life of the 
organism, the lives of cells are cheap. 

Too little cell death can be as dangerous to the health of a multicellular organism 
as too much proliferation. As noted in Chapter 18, mutations that inhibit cell 
death by causing excessive production of the intracellular apoptosis inhibitor 
Bcl2 promote the development of cancer in B lymphocytes. Indeed, the capacity 
for unlimited self-renewal is a dangerous property for any cell to possess. Many 
cases of leukemia arise through mutations that confer this capacity on committed 
hematopoietic precursor cells that would normally be fated to differentiate and 
die after a limited number of division cycles. 

Summary 

The many types of blood cells, including erythrocytes, lymphocytes, granulocytes, 
and macrophages, all derive from a common multipotent stem cell. In the adult, 
hematopoietic stem cells are found mainly in bone marrow, and they depend on signals 
from the marrow stromal (connective-tissue) cells to maintain their stem-cell 
character. The stem cells are few and far between, and they normally divide infrequently 
to produce more stem cells (self-renewal) and various committed progenitor 
cells (transit amplifying cells), each able to give rise to only one or a few types 
of blood cells. The committed progenitor cells divide extensively under the influence 
of various protein signal molecules (colony-stimulating factors, or CSFs) and then 
terminally differentiate into mature blood cells, which usually die after several days 
or weeks. 

Studies of hematopoiesis have been greatly aided by in vitro assays in which 
stem cells or committed progenitor cells form clonal colonies when cultured in a 
semisolid matrix. The progeny of stem cells seem to make their choices between 
alternative developmental pathways in a partly random manner. Cell death by 
apoptosis, controlled by the availability of CSFs, also plays a central part in regulating 
the numbers of mature differentiated blood cells. 

REGENERATION AND REPAIR 

As we have seen, many of the tissues of the body are not only self-renewing but 
also self-repairing, and this is largely thanks to stem cells and the feedback controls 
that regulate their behavior and maintain homeostasis. There are, however, 
limits to what these natural repair mechanisms can achieve. In most parts of the 
human brain, for example, nerve cells that die, as in Alzheimer's disease, are not 
replaced. Likewise, when heart muscle dies for lack of oxygen, as in a heart attack, 
it is replaced by scar tissue rather than new heart muscle. 

Some animals do far better than humans and can regenerate entire organs, 
such as whole limbs, after amputation. Among the invertebrates, there are some 
species that can even regenerate all the tissues of the body from a single somatic 
cell. These phenomena encourage the hope that human cells might be coaxed by 
artificial measures into similar feats of repair and regeneration, so as to replace 
the skeletal muscle fibers that degenerate in victims of muscular dystrophy, the 
nerve cells that die in patients with Parkinson's disease, the insulin-secreting cells 
that are lacking in type 1 diabetics, the heart muscle cells that die in a heart attack, 
and so on. As we learn more about the basic cell biology, these goals, once only a 
dream, are beginning to seem attainable. 

In this section, we start with some examples of the remarkable regenerative 
abilities of some animal species, as an indication of what is possible in principle. 
We shall then discuss how we can improve upon the natural repair processes of 
the human body and treat disease by exploiting the properties of the various types 
of stem cells found in human tissues. In the final section of the chapter, we shall 
see how a deeper understanding of the molecular biology of cell differentiation 
and of stem cells has revealed ways to convert one type of cell into another, opening 
up radically new possibilities. 

Planarian Worms Contain Stem Cells That Can Regenerate a 
Whole New Body 

Schmidtea mediterranea is a small freshwater flatworm, or planarian, just under a 
centimeter long when grown to full size (Figure 22-35). It has an epidermis, a gut, 
a brain, a pair of primitive eyes, a peripheral nervous system, musculature, and 
excretory and reproductive organs—most of the basic body parts familiar in other 
animals, although all relatively simple by vertebrate standards and built from 
about 20-25 distinct differentiated cell types. For more than a century, planarians 
such as Schmidtea have intrigued biologists because of their extraordinary capacity 
for regeneration: a small tissue fragment taken from almost any part of the 
body will reorganize itself and grow to form a complete new animal. This property 
goes with another: when the animal is starved, it gets smaller and smaller, by 
reducing its cell numbers while maintaining essentially normal body proportions. 
This behavior is called degrowth, and it can continue until the animal is as little 
as one-twentieth or even a smaller fraction of its full size. Supplied with food, it 
will grow back to full size again. Cycles of degrowth and growth can be repeated 
indefinitely, without impairing survival or fertility. 

Figure 22-35 The planarian worm, 
Schmidtea mediterranea. (A) External 
view. (B) Immunostaining with three 
different antibodies, revealing the internal 
anatomy. (A, courtesy of A. Sanchez 
Alvarado; B, from A. Sanchez Alvarado, 
BMC Biol. 10:88, 2012.) 

Underlying this behavior is a process of continual cell turnover. Along with the 
differentiated cells, which do not divide, there is a population of small, apparently 
undifferentiated dividing cells called neoblasts. The neoblasts constitute about 
20% of the cells in the body and are widely distributed within it; by cell division, 
they serve as stem cells for the production of new differentiated cells. Differentiated 
cells, meanwhile, are continually dying by apoptosis, allowing their corpses 
to be phagocytosed and digested by neighboring cells. Through this cell cannibalism, 
the constituents of the dying cells can be efficiently recycled. Cell birth 
continues in a dynamic balance with cell death and cell cannibalism, no matter 
whether the animal is fed or starved. In conditions of starvation, the balance is 
evidently tilted toward cell cannibalism, and in conditions of plenty, toward cell 
birth. 

A high dose of x-rays halts all cell division, puts a stop to cell turnover, and 
destroys the capacity for regeneration. The result is death after a delay of several 
weeks. The animal can be rescued, however, by injecting into it a single neoblast 
isolated from an unirradiated donor (Figure 22-36). In a certain proportion of 
cases, the injected cell divides to form a clone of progeny that eventually repopulate 
the entire body, creating a healthy regenerative individual with an apparently 
complete set of differentiated cell types as well as dividing neoblasts. Genetic 
markers prove that these are all derived from the single neoblast that was injected. 
It follows that at least some neoblasts are totipotent (or at least highly pluripotent) 
stem cells; that is, cells able to give rise to all (or at least almost all) of the cell types 
that make up the body of a flatworm, including more neoblasts like themselves. 

Some Vertebrates Can Regenerate Entire Organs 

One might think that such powers of regeneration would be a prerogative of small, 
simple, primitive animals. But some vertebrates, too, especially fish and amphibians, 
show remarkable regenerative abilities. A newt, for example, can regenerate 
a whole amputated limb. In this process, differentiated cells seem to revert to an 
embryonic character by first forming on the amputation stump a blastema—a 
small bud resembling an embryonic limb bud. The blastema then grows and its 
cells differentiate to form a correctly patterned replacement for the limb that has 
been lost, in what looks like a recapitulation of embryonic limb development 
(Figure 22-37). A large contribution to the blastema comes from the skeletal muscle 
cells in the limb stump. These multinucleate cells re-enter the cell cycle, dedifferentiate, 
and break up into mononucleated cells, which then proliferate within the 
blastema, before eventually redifferentiating. But do they redifferentiate only into 
muscle, or do they behave like neoblasts in the planarian and give rise to the full 
range of cell types needed to reconstruct the missing part of the limb? Careful lineage 
tracing, using genetic markers, shows (contrary to previous belief) that the 
cells are restricted according to their origins: muscle-derived cells give rise only to 
muscle, connective-tissue cells only to connective tissues, epidermal cells only to 
epidermal cells. The cells in the adult vertebrate body are, after all, less adaptable 
than the cells of the flatworm: by working in concert, they can replace the lost 
structure, but each cell type is far from totipotent. 

Why a newt can regenerate a whole limb, as well as many other body parts, 
but a mammal cannot remains a profound mystery. 

Stem Cells Can Be Used Artificially to Replace Cells That Are 
Diseased or Lost: Therapy for Blood and Epidermis 

Earlier in this chapter, we saw how mice can be irradiated to kill off their hematopoietic 
cells, and then rescued by a transfusion of new stem cells, which repopulate 
the bone marrow and restore blood cell production (see Figure 22-30). In the 
same way, patients with some forms of leukemia or lymphoma can be irradiated 
or chemically treated to destroy their cancerous cells along with the rest of their 
hematopoietic tissue, and then can be rescued by a transfusion of healthy, noncancerous 
hematopoietic stem cells. In favorable cases, these can be sorted out 
from samples of the patient's own hematopoietic tissue before it is ablated. They 
are then transfused back afterward, avoiding problems of immune rejection. 





Figure 22-36 Regeneration of a 
planarian from a single somatic cell. 
(A) The distribution of dividing cells 
(neoblasts, blue) in the adult body. 
Irradiation blocks all cell division and 
prevents regeneration, but (B) a single 
unirradiated neoblast cell injected into the 
irradiated animal is able to reconstitute 
all tissues. This eventually produces a 
complete animal that consists entirely 
of the progeny of this one cell and can 
regenerate. (Adapted from E.M. Tanaka 
and P.W. Reddien, Dev. Cell 21:172-185, 
2011.) 


Figure 22-37 Newt limb regeneration. 
The time-lapse sequence shows the stages 
of regeneration after amputation at the level 
of the humerus. The sequence spans the 
events of wound healing, dedifferentiation 
of stump tissues, blastema formation, and 
redifferentiation. (Courtesy of Susan Bryant 
and David Gardiner.) 

Figure 22-38 The continuing production of neurons in an adult mouse 
brain. The brain is viewed from above, in a cut-away section, to show the 
region lining the ventricles of the forebrain where neural stem cells are found. 
These cells continually produce progeny that migrate to the olfactory bulb, 
where they differentiate as neurons. The constant turnover of neurons in 
the olfactory bulb is presumably linked in some way to the turnover of the 
olfactory receptor neurons that project to it from the olfactory epithelium, as 
mentioned earlier. In adult humans, there is a continuing turnover of neurons 
in the hippocampus, a region specially concerned with learning and memory. 
(Adapted from B. Barres, Cell 97:667-670, 1999. With permission from 
Elsevier.) 


Another example of the use of stem cells is in the repair of the skin after extensive 
burns. By culturing cells from undamaged regions of the burned patient's 
skin, it is possible to obtain epidermal stem cells quite rapidly in large numbers. 
These can then be used (through rather long and complicated procedures) to 
repopulate the damaged body surface. 


[Figure 22-38 labels: immature neurons; cerebral hemispheres] 


Neural Stem Cells Can Be Manipulated in Culture and Used to 
Repopulate the Central Nervous System 

The central nervous system (the CNS) is the most complex tissue in the body, at an 
opposite extreme from the epidermis. And yet fish and amphibians can regenerate 
large parts of the brain, spinal cord, and eyes after they have been cut away. In 
adult mammals, however, these tissues have very little capacity for self-repair, and 
stem cells capable of generating new neurons are hard to find—so hard to find, 
indeed, that for many years they were thought to be absent. 

We now know, however, that neural stem cells that generate both neurons and 
glial cells do persist in certain parts of the adult mammalian brain (Figure 22-38). 
Neuronal turnover occurs on a dramatic scale in certain songbirds' brains, where 
large numbers of neurons die each year and are replaced by newborn neurons 
as part of a process by which the birds refine their song for each new breeding 
season. In the adult human brain, there is a continuing turnover of neurons in 
the hippocampus, a region specially concerned with learning and memory. Here, 
plasticity of adult function is associated with turnover of a specific subset of neurons. 
About 1400 fresh neurons in this class are generated every day, giving a turnover 
of 1.75% of the population per year. 
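
The two figures quoted for the hippocampus can be cross-checked with simple arithmetic. The short Python sketch below assumes that the annual turnover fraction is the number of new neurons per year divided by the size of the renewing population, and that a year has 365 days; the function and variable names are illustrative only.

```python
NEW_NEURONS_PER_DAY = 1400          # figure quoted in the text
ANNUAL_TURNOVER_FRACTION = 0.0175   # 1.75% per year, from the text
DAYS_PER_YEAR = 365                 # assumption: non-leap year


def implied_population(new_per_day, annual_turnover, days_per_year=DAYS_PER_YEAR):
    """Population size implied by a daily birth rate and an annual turnover."""
    new_per_year = new_per_day * days_per_year
    return new_per_year / annual_turnover


neurons_per_year = NEW_NEURONS_PER_DAY * DAYS_PER_YEAR
population = implied_population(NEW_NEURONS_PER_DAY, ANNUAL_TURNOVER_FRACTION)
print(f"{neurons_per_year:,} new neurons per year")
print(f"implied renewing population: about {population / 1e6:.1f} million neurons")
```

Under these assumptions, 1400 neurons per day amounts to 511,000 per year, which corresponds to a renewing population of roughly 29 million neurons.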

Fragments taken from self-renewing regions of the adult brain, or from the 
brain of a fetus, can be dissociated and used to establish cell cultures, where they 
give rise to floating "neurospheres"—clusters consisting of a mixture of neural 
stem cells with neurons and glial cells derived from the stem cells. These neurospheres 
can be propagated through many cell generations, or their cells can be 
taken at any time and implanted back into the brain of an intact animal. Here they 
will produce differentiated progeny, in the form of neurons and glial cells. 

Using slightly different culture conditions, with the right combination of 
growth factors in the medium, the neural stem cells can be grown as a monolayer 
and induced to proliferate as an almost pure stem-cell population without attendant 
differentiated progeny. By a further change in the culture conditions, these 
cells can be induced at any time to differentiate to give either a mixture of neurons 
and glial cells (Figure 22-39), or just one of these two cell types, according to the 
composition of the culture medium. 

Neural stem cells, whether derived as above or from pluripotent stem cells as 
described in the next section, can be grafted into an adult brain. Once there, they 
show a remarkable ability to adjust their behavior to match their new location. 
Stem cells from the mouse hippocampus, for example, when implanted in the 
mouse olfactory-bulb-precursor pathway (see Figure 22-38), give rise to neurons 
that become correctly incorporated into the olfactory bulb. This capacity of neural 
stem cells and their progeny to adapt to a new environment in animals suggests 
applications in the treatment for diseases where neurons degenerate, and 
for injuries of the central nervous system. For example, might it be possible to use 
injected neural stem cells to replace the neurons that die in Parkinson's disease or 
to repair accidents that sever the spinal cord? 

[Figure 22-39 diagram: fetal brain or ES cells -> (dissociate cells and culture in 
suspension in medium 1) -> neurospheres (A) -> (dissociate and culture as monolayer 
in medium 2) -> pure culture of neural stem cells (B) -> (switch to medium 3) -> 
mixture (C) of differentiated neurons (red) and glial cells (green); cell nuclei 
are blue] 

Summary 

Animals vary in their capacity for regeneration. At one extreme, planarian worms 
contain stem cells (neoblasts) that support continual turnover of all cell types, and 
an entire worm can be regenerated from practically any small body fragment or 
even from a single neoblast cell. Newts can regenerate limbs and other large body 
parts after amputation, but the cells remain restricted according to their origins: 
muscle cells in the regenerate derive from muscle, epidermis from epidermis, and so 
on. In mammals, regeneration is more limited. Nevertheless, it is becoming possible 
to go beyond the natural limits of wound healing by exploiting stem-cell biology. 
Thus, certain regions of the nervous system contain stem cells that support production 
of neurons in these sites throughout life. Neural stem cells can be obtained from 
these sites or from fetal brains, grown in culture, and then grafted back into other 
sites in the brain, where they are able to generate neurons appropriate to the new 
location. 


Figure 22-39 Neural stem cells. Shown 
are the steps leading from fetal brain tissue, 
via neurospheres (A), to a pure culture of 
neural stem cells (B). These stem cells can 
be kept proliferating as such indefinitely, 
or, through a change of medium, can be 
caused to differentiate (C) into neurons 
(red) and glial cells (green). Neural stem 
cells with the same properties can also be 
derived, via a similar series of steps, from 
embryonic stem (ES) or induced pluripotent 
stem (iPS) cells (discussed later in this 
chapter). (Micrographs from L. Conti et al., 
PLoS Biol. 3:1594-1606, 2005.) 


CELL REPROGRAMMING AND PLURIPOTENT 
STEM CELLS 

When cells are transplanted from one site in the mammalian body to another or 
are removed from the body and maintained in culture, they remain largely faithful 
to their origins. Each type of specialized cell has a memory of its developmental 
history and seems fixed in its specialized fate. Some limited transformations can 
certainly occur, as we saw in our account of the connective-tissue cell family, and 
some stem cells can generate a variety of differentiated cell types, but the possibilities 
are restricted. Each type of stem cell serves for the renewal of one particular 
type of tissue, and the whole pattern of self-renewing and differentiated cells in 
the adult body is amazingly stable. What, at a fundamental molecular level, is the 
nature of these stable differences between cell types? Is there any way to override 
the cell memory mechanisms and force a switch from one state to another that is 
radically different? 

We have already discussed these fundamental questions from a general standpoint 
in Chapter 7. Here we consider them more closely in the context of stem-cell 
biology, where there has been a recent revolution in our understanding and in 
our ability to manipulate states of cell differentiation. With further research, these 
advances would seem to have important practical consequences. 

Nuclei Can Be Reprogrammed by Transplantation into Foreign 
Cytoplasm 

If we cannot switch the basic character of a specialized cell by changing its environment, 
can we do so by interfering with its inner workings in a more direct and 
drastic way? An extreme treatment of this sort is to take the nucleus of the cell and 
transplant it into the cytoplasm of a large cell of a different type. If the specialized 
character is defined and maintained by cytoplasmic factors, the transplanted 
nucleus should switch its pattern of gene expression to conform with that of the 
host cell. In Chapter 7, we described a famous experiment of this sort, using the 
frog Xenopus. In this experiment, the nucleus of a differentiated cell (a cell from 
the lining of a tadpole's gut) was used to replace the nucleus of an oocyte (an egg-cell 
precursor arrested in prophase of the first meiotic division, in readiness for 
fertilization). The resulting hybrid cell went on, in a certain fraction of cases, to 
develop into a complete normal frog (see Figure 7-2A). This was crucial evidence 
for what is now a central principle of developmental biology: the cell nucleus, even 
that of a differentiated cell, contains a complete genome, capable of supporting 
development of all normal cell types. At the same time, the experiment showed 
that cytoplasmic factors can indeed reprogram a nucleus: the oocyte cytoplasm 
can drive the gut cell nucleus back to an early embryonic state, from which it can 
then step through the changing patterns of gene expression that lead all the way 
to a complete adult organism. 

The full story, however, is not quite so simple. First, the reprogramming in such 
experiments is not perfect. When the transplanted nucleus is taken from a gut cell, 
for example, a gene that is normally specific to the gut is found to be expressed persistently, 
even in the muscle cells of the final animal. Second, the experiment succeeds 
in only a limited proportion of cases, and this success rate becomes lower 
and lower, the more mature the animal from which the transplanted nucleus is 
taken: very large numbers of transplantations must be done to score a single success 
if the nucleus comes from a differentiated cell of an adult frog. 

Nuclear transplantation can be done in mammals too, with basically similar 
results. Thus, a nucleus taken from a differentiated cell in the mammary gland 
of an adult sheep and transplanted into an enucleated sheep's egg was able to 
support development of an apparently normal sheep—the famous Dolly. Again, 
the success rate is low: many transplantations have to be done to obtain one such 
individual. 

Reprogramming of a Transplanted Nucleus Involves Drastic 
Epigenetic Changes 

In a typical fully differentiated cell, there seem to be mechanisms maintaining 
the pattern of gene expression that cytoplasmic factors cannot easily override. An 
obvious possibility is that the stability of the pattern of gene expression in an adult 
cell may depend, in part at least, on self-perpetuating modifications of chromatin, 
as discussed in Chapter 4. As explained in Chapter 7, the phenomenon of X-inactivation 
in mammals provides a clear example of such epigenetic control. Two X 
chromosomes exist side by side in each female cell, exposed to the same chemical 
environment, but while one remains active, the other persists from one cell 
generation to the next in a condensed inactive state; cytoplasmic factors cannot 
be responsible for the difference, which must instead reflect mechanisms intrinsic 
to the individual chromosome. Elsewhere in the genome also, controls at the 
level of chromatin act in combination with other forms of regulation to govern 
the expression of each gene. Genes can be shut down completely, or switched on 
constitutively, or maintained in a labile state where they can be readily switched 
on or off according to changing circumstances. 

The reprogramming of a nucleus transplanted into an oocyte involves dramatic 
changes in chromatin. The nucleus swells, increasing its volume 50-fold 
as the chromosomes decondense; there is a wholesale alteration in patterns of 
methylation of DNA and histones; the standard histone H1 (the histone that links 
adjacent nucleosomes) is replaced by a variant form that is peculiar to the oocyte 
and early embryo; and the preexisting type of histone H3 is also replaced at many 
sites by a distinct isoform. Evidently, the egg contains factors that reset the state of 
the chromatin in the nucleus, wiping out old histone modifications on chromatin 
and imposing new ones. Reprogrammed in this way, the genome becomes competent 
once again to initiate embryonic development and to give rise to the full 
range of differentiated cell types. 

Embryonic Stem (ES) Cells Can Generate Any Part of the Body 

A fertilized egg, or an equivalent cell produced by nuclear transplantation, is a 
remarkable thing: it can generate a whole new multicellular individual, and that 
means that it can give rise to every normal type of specialized cell, including even 
egg or sperm cells for production of the next generation. A cell in such a state is 
said to be totipotent, and it is said to be pluripotent if it can give rise to most cell 
types but not absolutely all. Nevertheless, such a progenitor is not a stem cell: it 
is not self-renewing, but is instead dedicated to a program of progressive differentiation. 
If it were the only available starting point for study and exploitation of 
pluripotent cells, the enterprise would require a continual supply of fresh fertilized 
eggs or fresh nuclear transplantation procedures—an awkward requirement 
for studies in experimental animals, and unacceptable for practical applications 
in humans. 

Here, however, nature has been unexpectedly kind to scientists. It is possible 
to take an early mouse embryo, at the blastocyst stage, and through cell culture to 
derive from it a class of stem cells called embryonic stem cells, or ES cells. These 
originate from the inner cell mass of the early embryo (the cluster of cells that give 
rise to the body of the embryo proper, as opposed to extraembryonic structures), 
and they have an extraordinary property: given suitable culture conditions, they 
will continue proliferating indefinitely and yet retain an unrestricted developmental 
potential. Their only limitation is that they do not give rise to extraembryonic 
tissues such as those of the placenta. Thus they are classified as pluripotent, 
rather than totipotent. But this is a minor restriction. If ES cells are put back into 
a blastocyst, they become incorporated into the embryo and can give rise to all 
the tissues and cell types in the body, integrating perfectly into whatever site they 
may come to occupy, and adopting the character and behavior that normal cells 
would show at that site. They can even give rise to germ cells, from which a new 
generation of animals can be derived (Figure 22-40). 

ES cells let us move between cell culture, where we can use powerful techniques 
for genetic transformation and selection, and the intact organism, where 
we can discover how such genetic manipulations affect development and physiology. 
Thus, ES cells have opened the way to efficient genetic engineering in 
mammals, leading to a revolution in our understanding of mammalian molecular 
biology. 

Cells with properties similar to those of mouse ES cells can now be derived 
from early human embryos and from human fetal germ cells, and even, as we 
shall explain below, from differentiated cells taken from adult mammalian tissues. 
In this way, one can obtain a potentially inexhaustible supply of pluripotent 
cells. Grown in culture, these can be manipulated, by suitable choice of culture 
conditions, to give rise to large quantities of almost any type of differentiated cell, 
opening the way to important practical applications. Before discussing them, 
however, we consider the underlying biology. 

Figure 22-40 Production and 
pluripotency of ES cells. ES cells are 
derived from the inner cell mass (ICM) 
of the early embryo. The ICM cells are 
transferred to a culture dish containing 
an appropriate medium, where they 
become converted to ES cells and can 
be kept proliferating indefinitely without 
differentiating. The ES cells can be taken 
at any time—after genetic manipulation, if 
desired—and injected back into the inner 
cell mass of another early embryo. There 
they take part in formation of a well-formed 
chimeric animal that is a mixture of ordinary 
and ES-derived cells. The ES-derived cells 
can differentiate into any of the cell types in 
the body, including germ cells from which a 
new generation of mice can be produced. 
These next-generation progeny are no 
longer chimeric, but consist of cells that 
all inherit half their genes from the cultured 
ES cell line. 

[Figure 22-40 labels: cells of inner cell mass; clump of ES cells injected into; 
blastocyst develops in foster mother into a healthy chimeric mouse; the ES cells may 
contribute to any tissue] 

A Core Set of Transcription Regulators Defines and Maintains the 
ES Cell State 

What is it that gives ES cells and related types of pluripotent stem cells their 
extraordinary capabilities? What can they tell us about the fundamental mechanisms 
underlying stemness, cell differentiation, and the stability of the differentiated 
state? 

For some attributes, the answer is simple. For example, an essential feature 
of ES cells is that they must avoid senescence. As discussed in Chapter 17, this is 
the fate of fibroblasts and many other types of somatic cells: they are limited in 
the number of times they will divide, in part at least because they lack telomerase 
activity, with the result that their telomeres become progressively eroded in each 
division cycle, leading eventually to cell-cycle arrest. ES cells, by contrast, express 
high levels of active telomerase, allowing them to escape senescence and continue 
dividing indefinitely. This is a property shared with other, more specialized 
types of stem cells, such as those of the adult intestine, which similarly can carry 
on dividing for hundreds or thousands of cycles. 
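
The contrast between cells that lack telomerase and cells that express it can be caricatured with a simple counting loop. The sketch below is a toy model only: the telomere length, the loss per division, the arrest threshold, and the idea that telomerase restores a fixed number of base pairs per cycle are all hypothetical values and simplifications chosen for illustration.

```python
def divisions_until_arrest(telomere_bp,
                           loss_per_division_bp,
                           arrest_threshold_bp,
                           telomerase_gain_bp=0,
                           max_divisions=1000):
    """Count divisions before telomeres erode to the arrest threshold.

    Returns max_divisions if the cell is still dividing when the
    simulation stops. All numbers passed in are hypothetical.
    """
    divisions = 0
    while telomere_bp > arrest_threshold_bp and divisions < max_divisions:
        # Each cycle erodes the telomere; telomerase (if any) adds some back.
        telomere_bp += telomerase_gain_bp - loss_per_division_bp
        divisions += 1
    return divisions


# Hypothetical somatic cell without telomerase: a finite number of divisions.
somatic = divisions_until_arrest(10_000, 100, 5_000)
# Hypothetical stem cell whose telomerase fully offsets the loss: no arrest
# within the simulated window.
stem_like = divisions_until_arrest(10_000, 100, 5_000, telomerase_gain_bp=100)
print(somatic, stem_like)
```

With these illustrative numbers the telomerase-free cell arrests after 50 divisions, whereas the cell with compensating telomerase activity is still dividing when the simulation ends at its 1000-division cap.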

The deeper problem is to explain how the whole complex pattern of gene 
expression in an ES cell is organized and maintained. As a first step, one can look 
for genes expressed specifically in ES cells or in the corresponding pluripotent 
cells of the early embryo. This approach identifies a relatively small number of 
candidate ES-critical genes; that is, genes that seem to be essential in one way 
or another for the peculiar character of ES cells. A gene called Oct4, for example,
is exclusively expressed in ES cells and in related classes of cells in the intact
organism—specifically, in the germ-cell lineage and in the inner cell mass and its
precursors. Oct4 codes for a transcription regulator. When it is lost from ES cells,
they lose their ES cell character; and when it is missing in an embryo, the cells that
should specialize as inner cell mass are diverted into an extraembryonic pathway
of differentiation and their development is aborted.

Fibroblasts Can Be Reprogrammed to Create Induced Pluripotent
Stem Cells (iPS Cells)

In Chapter 7, we saw that fibroblasts and some other cell types can be driven to
switch their character and differentiate as muscle cells if the master muscle-specific
transcription regulator MyoD is artificially expressed in them. Could the same
technique be used to convert adult cell types into ES cells, through forced expression
of factors such as Oct4? This question was tackled by transfecting fibroblasts
with retroviral vectors carrying genes that one might hope to have such an effect.
A total of 24 candidate ES-critical genes were tested in this way. None of them was
able by itself to cause the conversion; but in certain combinations they could do
so. In 2006, the first breakthrough experiments whittled down the requirement to
a core set of four factors, all of them transcription regulators: Oct4, Sox2, Klf4, and
Myc, known as the OSKM factors for short. When coexpressed, these could reprogram
mouse fibroblasts, permanently converting them into cells closely similar
to ES cells (Figure 22-41). ES-like cells created in this way are called induced
pluripotent stem cells, or iPS cells. Like ES cells, iPS cells can continue dividing
indefinitely in culture, and when incorporated into a mouse blastocyst they can
participate in creation of a perfectly formed chimeric animal. In this animal, they
can contribute to the development of any tissue and can turn into any differentiated
cell type, including functional germ cells from which a new generation of
mice can be raised (see Figure 22-40).

iPS cells can now be derived from adult human cells and from various other
differentiated cell types besides fibroblasts. Numerous methods can be used to drive
expression of the transforming OSKM factors, including methods that leave no
trace of foreign DNA in the reprogrammed cell. Variations of the original cocktail
of transcription regulators can drive the conversion, with different specialized cell
types having somewhat different requirements. Myc overexpression, for example,
turns out not to be absolutely necessary, although it enhances the efficiency of the
process. And differentiated cell types may express some of the required factors as
part of their normal phenotype. For example, cells of the dermal papilla of hair
follicles already express Sox2, Klf4, and Myc; to convert them into iPS cells, it is
enough to force them artificially to express Oct4. Oct4, indeed, seems to have a
central role and to be generally indispensable for the creation of iPS cells.

Figure 22-41 Reprogramming fibroblasts
to iPS cells with the OSKM factors. As
indicated, the master gene regulator proteins
Oct4, Sox2, and Klf4 (OSK) induce both
their own and each other's synthesis (gray
shading). This generates a self-sustaining
feedback loop that helps to maintain
cells in an embryonic stem cell-like state,
even after all of the experimentally added
OSKM initiators have been removed. Myc
overexpression speeds up early stages of
the reprogramming process through the
mechanisms shown (see Figure 17-61).
Stable reprogramming also involves the
permanently induced expression of the
Nanog gene, which produces an additional
master transcription regulator. (Adapted from
J. Kim et al., Cell 132:1049-1061, 2008.)


Reprogramming Involves a Massive Upheaval of the Gene Control 
System 

Converting a differentiated cell into an iPS cell is not like flicking a switch on some
predictable, precisely engineered piece of machinery. Only a few of the cells that
receive the OSKM factors will actually become iPS cells—one in several thousand
in the original experiments, and still only a small minority with more recent,
improved techniques. In fact, the success of the original experiments depended
on clever selection to pick out those few cells where the conversion had occurred
(Figure 22-42).
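
Simple arithmetic shows why selection was essential. The sketch below assumes a hypothetical culture of one million fibroblasts receiving the OSKM factors and reads "one in several thousand" as one in 5000; both numbers are assumptions made purely for illustration.

```python
def expected_ips_cells(cells_treated, conversion_frequency):
    """Return (expected converted cells, remaining unconverted cells)."""
    converted = cells_treated * conversion_frequency
    unconverted = cells_treated - converted
    return converted, unconverted


# Hypothetical culture size and conversion frequency (assumptions, see text above).
converted, unconverted = expected_ips_cells(1_000_000, 1 / 5000)
print(f"expected iPS cells: {converted:.0f}")
print(f"unconverted cells per iPS cell: {unconverted / converted:.0f}")
```

Under these assumptions each converted cell is buried among thousands of unconverted neighbors, which is why a scheme that kills every unconverted cell makes the rare converts recoverable.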

Conversion to an iPS character by the OSKM factors is not only inefficient but
also slow: fibroblasts take ten days or more from introduction of the conversion
factors before they begin to express markers of the iPS state. This suggests that
the transformation involves a long cascade of changes. These changes are being
extensively studied, and they affect both the expression of individual genes and
the state of the chromatin. The results of one such study are outlined in Figure
22-43. The process begins with a Myc-induced cell proliferation and loosening
of chromatin structure that promotes the binding of the other three master regulators
to many hundreds of different sites in the genome. At a large proportion
of these sites, Oct4, Sox2, and Klf4 all bind in concert. The binding sites include
the endogenous Oct4, Sox2, and Klf4 genes themselves, which eventually creates
the types of positive feedback loops just described that make expression of
these genes self-sustaining (see Figure 22-41). But self-induction of Oct4, Sox2,
and Klf4 is only a small part of the transformation that occurs. The three core factors
activate some target genes and repress others, producing a cascade of effects
that reorganize the gene control system globally and at every level, changing the
patterns of histone modification, DNA methylation, and chromatin compaction,
as well as the expression of innumerable proteins and noncoding RNAs. By the
end of this complex process, the resulting iPS cell is no longer dependent on the
artificially generated factors that triggered the change: it has settled into a stable,
self-sustaining state of coordinated gene expression, making its own Oct4, Sox2,
Klf4, and Myc (and all the other essential ingredients of a pluripotent stem cell)
from its own endogenous copies of the genes.
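
How a positive feedback loop can make expression independent of the factors that triggered it can be seen in a deliberately simplified model. The sketch below is a toy, not a description of the real circuit: a single variable x stands in for the mutually activating Oct4, Sox2, and Klf4 genes, self-activation is modeled with a Hill function, and every parameter value (pulse strength, Hill coefficient, threshold, decay rate) is a hypothetical choice made for illustration.

```python
def simulate_feedback(pulse_duration, total_time=60.0, dt=0.01,
                      pulse=0.5, beta=1.0, K=0.5, n=4, gamma=1.0):
    """Euler integration of dx/dt = input + beta*x^n/(K^n + x^n) - gamma*x.

    x is the level of a self-activating regulator (arbitrary units).
    `pulse` is the exogenous supply, switched off after `pulse_duration`.
    All parameter values are hypothetical.
    """
    x = 0.0
    steps = int(round(total_time / dt))
    for i in range(steps):
        t = i * dt
        exogenous = pulse if t < pulse_duration else 0.0
        self_activation = beta * x**n / (K**n + x**n)
        x += dt * (exogenous + self_activation - gamma * x)
    return x


sustained = simulate_feedback(pulse_duration=20.0)  # long exposure to the exogenous factor
brief = simulate_feedback(pulse_duration=0.5)       # exposure too short to cross the threshold
print(f"after long pulse:  {sustained:.2f}")
print(f"after brief pulse: {brief:.2f}")
```

In this toy model a sufficiently long pulse pushes x past a threshold, after which the loop holds x at a high level even though the exogenous supply has been withdrawn; a brief pulse leaves x below the threshold and it decays back toward zero. The real reprogramming process involves many more components and a large element of chance, as described in the text.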



Figure 22-42 A strategy used to select
cells that have converted to an iPS
character. The experiment makes use
of a gene (Fbx15) that is present in all
cells but is normally expressed only in
ES and early embryonic cells (although not
required for their survival). A fibroblast cell
line is genetically engineered to contain
a gene that produces an enzyme that
degrades G418 under the control of the
Fbx15 regulatory sequence. G418 is an
aminoglycoside antibiotic that blocks
protein synthesis in both bacteria and
eukaryotic cells. When the OSKM factors
are artificially expressed in this cell line,
a small proportion of the cells undergo a
change of state and activate the Fbx15
regulatory sequence, driving expression
of the G418-resistance gene. When G418
is added to the culture medium, these are
the only cells that survive and proliferate.
When tested, they turn out to have an
iPS character.

An Experimental Manipulation of Factors that Modify Chromatin
Can Increase Reprogramming Efficiencies

The low efficiency and slow rate of conversion suggest that there is some barrier
blocking the switch from the differentiated state to the iPS state in these experiments,
and that overcoming this barrier is a difficult process that involves a large
element of chance. Likewise, the outcome is variable, with significant differences
between the individual lines of transformed cells that are generated, even when
the initial differentiated cells are genetically and phenotypically identical. Only
some of the candidate iPS lines pass all the tests of pluripotency. At a molecular
level, there are differences even among the fully validated iPS cells: although they
share many features, they vary in details of their gene expression patterns and, for
example, in their patterns of DNA methylation.

Overcoming these difficulties will be critical for improving our understanding 
of how cell specialization is controlled and organized in multicellular organisms; 
it should also facilitate many medical advances. Thus, intensive research is being 
carried out on the reprogramming process. One approach aims at obtaining a 
much clearer picture of the role that chromatin structures play in gene regulation
in eukaryotes. 

From our discussion of nuclear transplantation, one might expect that any 
reprogramming of a differentiated cell would require a radical and widespread 
change in the chromatin structure of selected genes. Not only are such changes 
observed, but a large number of different experiments reveal that the efficiency of 
the reprogramming process can be substantially increased by altering the activity
of proteins that affect chromatin structure. Figure 22-44 categorizes some of
the factors that have been shown to enhance the transformation of fibroblasts to
iPS cells; those in the top three rows—chromatin remodelers, histone modifying
enzymes, and histone variants—are especially well known to have profound
effects on the organization of nucleosomes in chromatin (discussed in Chapter 4).

We can only touch briefly here on the massive amounts of data that have been
accumulating in this exciting research area. The major challenge that remains is to
obtain a systems-level model for the complex set of biochemical changes that are
involved in reprogramming. For example, which chromatin changes come first,
and which then follow? How can these be triggered by the master transcription
regulators through their binding to specific DNA sequences, and why do many
cells in a population appear resistant to these effects?


Figure 22-43 A summary of some of 
the major events that accompany the 
reprogramming of mouse fibroblasts to 
iPS cells. By sorting cells at various times 
after the OSKM induction shown, one can 
carry out detailed biochemical analyses on 
the different cell populations shown. This 
led to the discovery that two major waves 
of new gene transcription are induced, but 
that the second wave occurs only in the 
subset of cells expressing an embryonic 
marker protein. Some 1500 genes are 
found to be differentially expressed in 
these cells, compared to the large majority 
of cells that fail to progress toward iPS 
cells. As indicated, major DNA methylation
changes are observed only after the
alteration of chromatin structures.

In the first transcription wave, among
the genes prominently induced are those
for cell proliferation, metabolism, and
cytoskeletal organization; in contrast, genes
associated with fibroblast development 
are repressed. In the second transcription 
wave, genes required for embryonic 
development and for stem cell maintenance 
are induced. (Adapted from J.M. Polo et 
al., Cell 151:1617-1632, 2012.) 


ES and iPS Cells Can Be Guided to Generate Specific Adult Cell
Types and Even Whole Organs

We can think of embryonic development in terms of a series of choices presented
to cells as they follow a road that leads from the fertilized egg to terminal
differentiation. After their long sojourn in culture, the ES cells or iPS cells and
their progeny can still read the signs at each branch in the highway and respond
as normal embryonic cells would. If ES or iPS cells are implanted directly into an
embryo at a later stage or into an adult tissue, however, they fail to receive the
appropriate sequence of cues; their differentiation then is not properly controlled,
and they will often give rise to a tumor of the type known as a teratoma, containing
a mixture of cell types inappropriate to the site in the body.

Figure 22-44 Factors that have been
observed to enhance reprogramming
efficiency. Emphasized here are those
factors that can alter chromatin states,
with those in the top three rows having
the most direct effects. An up arrow
indicates that reprogramming is increased
when the activity of the indicated factor
is increased; a down arrow indicates that
reprogramming is increased when the
activity of the indicated factor is decreased.
Thus, for example, increased activity of
histone acetyl transferases and increased
activity of histone deacetylases have
opposite effects, as expected from their
biochemical activities (see p. 196).

In culture, by exposing the ES or iPS cell to an appropriate sequence of signal 
proteins and growth factors, delivered with the right timing, it is possible to guide 
the cell along a pathway that approximates a normal developmental pathway, so 
as to convert it into one of the standard specialized adult cell types (Figure 22-45 
and Movie 22.5). Success requires trial and error, but has now been achieved for 
many different final specialized states, including neuronal, muscular, and intestinal
cell types. In a few cases, it has even been possible, by careful manipulation of
the culture conditions, to get ES or iPS cells to interact with one another so as to 
construct an entire organ, albeit on a small scale (Figure 22-46). 


Figure 22-45 Production of differentiated 
cells from ES or iPS cells in culture. 

These cells can be cultured indefinitely 
as pluripotent cells when attached as a 
monolayer to a dish. Alternatively, they
can be detached and allowed to form 
aggregates called embryoid bodies, which 
causes the cells to begin to specialize. 

Cells from embryoid bodies, cultured in 
media with different factors added, can 
then be driven to differentiate in various 
ways. (Based on E. Fuchs and J.A. Segre, 
Cell 100:143-155, 2000. With permission 
from Elsevier.)

Figure 22-46 Cultured ES cells can give rise to a three-dimensional organ. (A) Remarkably, under appropriate conditions,
mouse ES cells in culture can proliferate, differentiate, and interact to form a three-dimensional, eye-like structure, which 
includes a multilayered retina similar in organization to the one that forms in vivo. (B) Fluorescent micrograph of an optic cup 
formed by ES cells in culture. The structure includes a developing retina, containing multiple layers of neural cells, which 
produce a protein (pink) that serves as a marker for retinal tissue. (B, from M. Eiraku, N. Takata, H. Ishibashi et al., Nature 
472:51-56, 2011. With permission from Macmillan Publishers Ltd.) 


Cells of One Specialized Type Can Be Forced to Transdifferentiate 
Directly Into Another 

The route we have just described, from one mode of differentiation to another via
conversion to an iPS cell, seems needlessly roundabout. Could we not convert
cell type A into cell type B directly, without backtracking to the embryonic-like
iPS state? For many years, it has been known that such transdifferentiation can be
achieved in a few special cases, such as the conversion of fibroblasts into skeletal
muscle cells by forced expression of MyoD (see p. 396). But now, with the insights
that have come from the study of ES and iPS cells, ways are being found to bring
about such interconversions in a much wider range of cases.

An elegant example comes from studies of the heart. By forcing expression of
an appropriate combination of factors—not Oct4, Sox2, Klf4, and Myc, but Gata4,
Mef2c, and Tbx5—it is possible to convert heart fibroblasts directly into heart
muscle cells. This has been done in the living mouse, using retroviral vectors,
and the transformation occurs with high efficiency when the vectors carrying the
transgenes are injected directly into the heart muscle tissue itself. Although they
occupy only a small fraction of the tissue volume, the fibroblasts in the heart outnumber
the heart muscle cells, and they survive in large numbers even where the
heart muscle cells have died. Thus, in a typical nonfatal heart attack, where heart
muscle cells have died for lack of oxygen, the fibroblasts proliferate and make
collagenous matrix so as to replace the lost muscle with a fibrous scar. This is a
poor sort of repair. By forcing expression of the appropriate factors in the heart,
as described above, it has proved possible, in the mouse at least, to do better than
nature and regenerate lost heart muscle by transdifferentiation of heart fibroblasts.

We are still a long way from putting this technique into practice as a treatment 
for heart attacks in humans, but it shows what the future may hold—not only for 
this medical problem, but for many others. 

ES and iPS Cells Are Useful for Drug Discovery and Analysis of 
Disease 

A large part of the excitement surrounding ES and iPS cells and the technology 
of transdifferentiation comes from the prospect of using the artificially generated
cells for tissue repair. It begins to seem that virtually any type of tissue might be
replaceable, allowing treatment of degenerative diseases that have previously had
no cure. Research in this area is moving rapidly, but there are many difficulties to 
be overcome. 






[Figure 22-47 diagram labels: studying disease mechanism; screening for therapeutic compounds; disease-specific drugs; treatment with drugs; cMYC, OCT4, KLF4, SOX2; in vitro differentiation; healthy cells; transplantation of genetically matched healthy cells]


Figure 22-47 Use of iPS cells for 
drug discovery and for analysis and 
treatment of genetic disease. The 

left side of the diagram shows how 
differentiated cells that are generated from 
iPS cells derived from a patient with a 
genetic disease can be used for analysis of 
the disease mechanism and for discovery 
of therapeutic drugs. The right side of the 
diagram shows how the genetic defect 
might be repaired in the iPS cells, which 
could then be induced to differentiate in an 
appropriate way and grafted back into the 
patient without danger of immune rejection. 
(Based on D.A. Robinton and G.Q. Daley, 
Nature 481:295-305, 2012).

With the advent of iPS cells and direct transdifferentiation, at least one major
hurdle has been surmounted, in principle at least: the problem of immune rejection.
ES cells, because they are created from early embryos that generally come
from unrelated donors, will never be genetically identical to the cells of the patient
receiving the transplant. The transplanted cells and their progeny are therefore
liable to rejection by the immune system. Both iPS and transdifferentiated cells, in
contrast, can be generated from a small sample of the patient's own tissue and so
should escape immune attack when transplanted back into the same individual.

Tissue repair by transplantation, however, is not the only application for which 
ES, iPS, and transdifferentiated cells can be used: there are other ways in which 
they promise to be more immediately valuable. In particular, they can be used to 
generate large, homogeneous populations of specialized cells of any chosen type 
in culture; and these can serve for investigation of disease mechanisms and in the 
search for new drugs acting on a specific cell type (Figure 22-47).

Where a disease has a genetic cause, we can derive iPS cells from sufferers and
use these cells to produce the specific cell types that malfunction, to investigate
how the malfunction occurs, and to screen for drugs that might help to put it right.
Timothy syndrome provides an example. In this rare genetic condition, there is a
severe, life-threatening disorder in the rhythm of the heart beat (as well as several
other abnormalities), as a result of a mutation in a specific type of Ca²⁺ channel.
To study the underlying pathology, researchers took skin fibroblasts from patients
with the disorder, generated iPS cells from the fibroblasts, and drove the iPS cells
to differentiate into heart muscle cells. These cells, when compared with heart
muscle cells prepared similarly from normal control individuals, showed irregular
contractions and abnormal patterns of Ca²⁺ influx and electrical activity that
could be characterized in detail. From this finding, it is a small step to development
of an in vitro assay for drugs that might correct the misbehavior of the heart
muscle cells.

This approach to drug discovery—where iPS cells are prepared from the individual
patient, differentiated into the relevant cell type, and used to test candidate
drugs in vitro—would seem to represent a huge advance on the slow, costly traditional
methods that involve administration of test compounds to large numbers
of people.

Summary

In the adult mammalian body, the various types of stem cells are highly specialized,
each giving rise to a limited range of differentiated cell types. Cells become restricted
to specific pathways of differentiation during embryonic development. One way
to force a return to a pluripotent or totipotent state is by nuclear transplantation:
the nucleus of a differentiated cell can be injected into an enucleated oocyte, whose
cytoplasm reprograms the genome back to an approximation of an early embryonic
state. This allows production of an entire new individual. The reversion of the
genome to this state involves radical, genome-wide changes in chromatin structure
and DNA methylation.

Remarkably, cells taken from the inner cell mass of an early mammalian embryo
can be propagated in culture indefinitely in a pluripotent state. When transplanted
back into a host early embryo, these embryonic stem (ES) cells can contribute cells
to any tissue, including the germ line. ES cells have been invaluable for genetic
engineering in mice. Cells with similar properties, called induced pluripotent stem cells
(iPS cells), can be generated from adult differentiated cells such as fibroblasts by
forced expression of a cocktail of key transcription regulators. A similar method
can be used to reprogram adult cells directly from one specialized state to another.
In principle, iPS cells generated from cells biopsied from an adult human patient
could be used for tissue repair in that same individual, avoiding the problem of
immune rejection. More immediately, they provide a source of specialized cells that
can be used to analyze in vitro the effects of mutations affecting human cells and for
screening for drugs for treatment of genetic diseases.


WHAT WE DON'T KNOW


• What determines tissue and organ
size? How do the cells in each tissue
know when to terminate their growth
and division, so as to limit the size of
an organ or tissue appropriately?

• What is the fundamental molecular
difference that distinguishes a stem
cell?

• How is the correct balance between
stem cells, progenitor cells, and
differentiated cells maintained in a
tissue or organ?

• What role does chromatin structure
play in cell memory and in cell
reprogramming?

• How are molecules inherited
asymmetrically during cell division?

• How do germ cells avoid aging? 


PROBLEMS 

Which statements are true? Explain why or why not. 

22-1 In the small intestine, stem cells in the crypts 
divide asymmetrically to maintain the population of cells 
that make up the villi; after each division, one daughter 
remains a stem cell and the other begins to divide rapidly 
to produce differentiated progeny. 

22-2 Stem cells, being stem cells, are by definition the
same in all tissues.

22-3 Every tissue that can be renewed is renewed from 
a tissue-specific population of stem cells. 

22-4 Disturbance of the balance in the activities of 
osteoblasts and osteoclasts in favor of osteoclasts can 
give rise to the condition known as osteoporosis, the
brittle-bone syndrome of the elderly.

Discuss the following problems. 

22-5 In the 1950s, scientists fed ³H-thymidine to rats to
label cells that were synthesizing DNA, and then followed
the fates of labeled cells for periods of up to a year. They
found three patterns of cell labeling in different tissues.
Cells in some tissues such as neurons in the central nervous
system and the retina did not get labeled. Muscle,
kidney, and liver, by contrast, each showed a small number
of labeled cells that retained their label, apparently without
further division or loss. Finally, cells such as those in
the squamous epithelia of the tongue and esophagus were
labeled in fairly large numbers, with radioactive pairs of
nuclei visible in 12 hours; however, the labeled cells
disappeared over time. Which of these three patterns of labeling
would you expect to see if the labeled cells were generated
by stem cells? Explain your answer.

22-6 At any given time, intestinal crypts of mice comprise
about 15 stem cells and 10 Paneth cells. After cell
division, which occurs about once a day, the daughter
cells remain stem cells only if they maintain contact with a
Paneth cell. This constant competition for Paneth-cell contact
raises the possibility that crypts might become monoclonal
over time; that is, the crypt cells at one point in time
might derive from only 1 of the 15 stem cells that existed
at some earlier time. To test this possibility, you use the
so-called confetti marker that upon activation expresses
any one of three fluorescent proteins in the stem cells of
the crypt. You then examine crypts at various times to
determine whether they contain cells with multiple colors
or only one color (Figure Q22-1). Do the crypts become
monoclonal over time or not? How can you tell?

Figure Q22-1 Fluorescent cells
in crypts in mouse intestines at
various times after activation of
expression of fluorescent proteins
(Problem 22-6). The images
are taken in the xz plane, which
cuts through multiple crypts,
as indicated in the schematic
drawing. Roughly 50 crypts are
visible in each section. Dotted
white circles identify some
individual crypts. Scale bars are
100 μm. (Adapted from
H.J. Snippert et al., Cell
143:134-144, 2010. With
permission from Elsevier.)
[Figure Q22-1 image labels: microvilli; 30 weeks]

22-7 The origin of new β cells of the pancreas—from
stem cells or from preexisting β cells—was not resolved
until a decade ago, when the technique of lineage tracing
was used to decide the issue. Using transgenic mice that
expressed a tamoxifen-activated form of Cre recombinase
under the control of the insulin promoter, which is active
only in β cells, investigators could remove an inhibitory
segment of DNA and thereby allow expression of human
placental alkaline phosphatase (HPAP), which can be
detected by histochemical staining. After a pulse of tamoxifen
that converted about 30% of β cells in young mice to
cells that express HPAP, the investigators followed the
percentage of labeled β cells for a year, during which time the
total number of β cells in the pancreas increased by 6.5-fold.
How do you suppose the percentage of β cells would
change over time if new β cells were derived from stem
cells? What if new β cells were derived from preexisting
β cells? Which hypothesis do the results in Figure Q22-2
support?


[Figure Q22-2 plot labels: x-axis, months after tamoxifen injection; axis tick value 100]


Figure Q22-2 Percentage of labeled β cells in pancreatic islets of mice
at different ages (Problem 22-7). All mice were injected with a pulse of
tamoxifen at 6 to 8 weeks of age and then stained for human placental
alkaline phosphatase (HPAP) at various times afterward. Error bars
represent standard deviations.


22-8 One of the earliest assays for hematopoietic stem
cells made use of their ability to form colonies in the
spleens of heavily irradiated mice. By varying the amounts
of transplanted bone marrow cells, investigators showed
that the number of spleen colonies varied linearly with
dose and that the curve passed through the origin, suggesting
that single cells were capable of forming individual
colonies. However, because colony formation was rare
relative to the numbers of transplanted cells, it was possible
that undispersed clumps of two or more cells were the
actual initiators.

A classic paper resolved this issue by exploiting
rare, cytologically visible genome rearrangements generated
by irradiation. Recipient mice were first irradiated
to deplete bone marrow cells, and then they were irradiated
a second time after transplantation to generate rare
genome rearrangements in the transplanted cell population.
Spleen colonies were then screened to find ones that
carried genome rearrangements. How do you suppose this
experiment distinguishes between colonization by single
cells versus cellular aggregates?

22-9 It is possible to purify hematopoietic stem cells
using a combination of antibodies directed against
cell-surface targets. By removing cells that expressed surface
markers characteristic of specific lineages such as B cells,
granulocytes, myelomonocytic cells, and T cells, investigators
generated a population of cells enriched for stem
cells. They further enriched this population for putative
stem cells by positively selecting for cells that expressed
suspected stem-cell surface markers. Spleen colony formation
in irradiated mice by these putative stem cells and
the unfractionated bone marrow cells is shown in Figure
Q22-3. Given that only about 1 in 10 cells lodges in the
spleen, do these results support the idea that the enriched
population consists mostly of hematopoietic stem cells?
What additional information would you need to have to
feel confident that the enriched cells are true stem cells?
What proportion of bone marrow cells are hematopoietic
stem cells?

22-10 Generation of induced pluripotent stem (iPS)
cells was first accomplished using retroviral vectors to
carry the OSKM (Oct4, Sox2, Klf4, and Myc) set of transcription
regulators into cells. The efficiency of fibroblast
reprogramming was typically low (0.01%), in part because
large numbers of retroviruses must integrate to bring
about reprogramming and each integration event carries
with it the risk of inappropriately disrupting or activating
a critical gene. In what other ways, or other forms, do you
suppose you might deliver the OSKM transcription regulators
so as to avoid these problems?

Figure Q22-3 Spleen 
colony formation by cells 
enriched for stem cells and by 
unfractionated bone marrow 
cells (Problem 22-9).

Pathogens and Infection

CHAPTER 23


Infectious diseases currently cause about one-quarter of all human deaths worldwide,
more than all forms of cancer combined and second only to cardiovascular
diseases. In addition to the continuing heavy burden of ancient diseases such as
tuberculosis and malaria, newer infectious diseases continually emerge. The current
pandemic (worldwide epidemic) of AIDS (acquired immune deficiency syndrome)
was first clinically observed in 1981 and has since caused more than 35
million deaths worldwide. Moreover, some diseases long thought to result from
other causes are now recognized to be associated with infections. Most gastric
ulcers, for example, are caused not by stress or spicy food, but by infection of the
stomach lining by the bacterium Helicobacter pylori.

The burden of infectious diseases is not spread equally across the planet.
Poorer countries and communities suffer disproportionately, often due to poor
public sanitation and health systems. Some infectious diseases, however, occur
primarily or exclusively in industrialized communities: Legionnaire's disease, for
example, a bacterial infection of the lungs, commonly spreads through
air-conditioning systems.

Since the mid-1800s, physicians and scientists have struggled to identify the
agents—collectively called pathogens—that are capable of causing infectious
diseases. More recently, the advent of microbial genetics and molecular cell biology
has greatly enhanced our understanding of the causes and mechanisms of
infectious diseases. We now know that pathogens frequently exploit the attributes
of their host's cells in order to infect them. This understanding can give us new
insights into normal cell biology, as well as strategies for treating and preventing
infectious diseases.

Although pathogens are understandably a focus of attention, only a relatively
small fraction of the microbial species we encounter are pathogens. Much of the
biomass of the Earth is made up of microbes, and they produce everything from
the oxygen we breathe to the soil nutrients we use to grow food. Even those species
of microbes that colonize the human body do not generally cause disease.
The collective of microorganisms that reside in or on an organism is called the
microbiota. Many of these microbes have a beneficial effect on the health of the
organism, assisting its normal development and physiology.

In this chapter, we give an overview of the different kinds of pathogens, as well
as those microorganisms that colonize our body without causing trouble. We then
discuss the cell biology of infection—the molecular interactions between pathogens
and their host. In Chapter 24, we consider how our innate and adaptive
immune systems collaborate to defend us against pathogens.


IN THIS CHAPTER 

INTRODUCTION TO PATHOGENS 
AND THE HUMAN MICROBIOTA 

CELL BIOLOGY OF INFECTION 


INTRODUCTION TO PATHOGENS AND THE HUMAN 
MICROBIOTA 

We normally think of pathogens as hostile invaders, but a pathogen, like any other
organism, is simply exploiting an available niche in which to live and procreate.
Living on or in a host organism is a very effective strategy, and it is possible that
every organism on Earth is subject to some type of infection (Figure 23-1). A
human host is a nutrient-rich, warm, and moist environment, which remains at a
uniform temperature and constantly renews itself. It is not surprising that many
microorganisms have evolved the ability to survive and reproduce in this desirable
niche. In this section, we discuss some of the common features that microorganisms
must have in order to colonize the human body or cause disease, and we
explore the wide variety of organisms that are known to cause disease.

The Human Microbiota Is a Complex Ecological System That Is 
Important for Our Development and Health 

The human body contains about 10^13 human cells, as well as a microbiota consisting
of approximately 10^14 bacterial, fungal, and protozoan cells, which represent
thousands of microbial species—the so-called normal flora. The combined
genomes of the various species of the human microbiota, called the microbiome,
contain more than 5 × 10^6 genes—more than 100 times greater than the number
of genes in the human genome itself. A consequence of this genomic diversity
is that the microbiota expands the range of biochemical and metabolic activities
available to humans.
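
The figures in this paragraph can be checked with a few lines of arithmetic. The sketch below uses only the order-of-magnitude values quoted above; the variable names are illustrative, and the ceiling on the human gene count is simply what "more than 100 times" entails, not a measured value.

```python
# Order-of-magnitude values quoted in the text (not precise measurements).
human_cells = 10**13
microbial_cells = 10**14
microbiome_genes = 5 * 10**6

# Microbial cells per human cell.
cell_ratio = microbial_cells // human_cells

# If the microbiome holds more than 100 times as many genes as the human
# genome, the human genome must hold fewer than this many genes.
implied_human_gene_ceiling = microbiome_genes // 100

print(cell_ratio)
print(implied_human_gene_ceiling)
```

On these numbers, microbial cells outnumber human cells by about ten to one, and the stated ratio implies a human genome of fewer than 50,000 genes.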

The microbiota is usually confined to the skin, mouth, digestive tract, and
vagina. With the exception of microbes colonizing the skin, it consists primarily
of anaerobic bacteria, with distinct communities of species inhabiting each body
part. These communities vary considerably between individual humans, even
between close relatives or identical twins. Although the microbiota of an individual
is generally consistent over time, it is influenced by a variety of factors, including
age, diet, health status, and antibiotic use.

There are various ecological relationships that these microbes have with their
host. In mutualism, both the microbe and host benefit. The anaerobic bacteria
that inhabit our intestines, for example, gain shelter and a nutrient supply but also
contribute to the digestion of our food, produce important nutrients for us, and
are essential for the normal development of our gastrointestinal tract and innate
and adaptive immune systems. In commensalism, the microbe benefits but offers
no benefit and causes no harm: for example, we are infected with many viruses
that have no noticeable effect on our health. In parasitism, the microbe benefits
to the detriment of the host, as is often the case for pathogens.

Many infectious diseases are caused by a single pathogen. There is increasing
evidence, however, that an imbalance in the community of microbes that constitute
the microbiota can contribute to some diseases, including autoimmune and
allergic diseases, obesity, inflammatory bowel disease, and diabetes. Remarkably,
in such cases of microbiota imbalance (referred to as dysbiosis), the transfer of the
microbiota from a healthy individual to someone suffering from the disease can
be beneficial and sometimes curative, as in the case of Clostridium difficile colitis
caused by overgrowth of the bacterium.


Figure 23-1 Parasitism at many levels. 

(A) Most animals harbor parasites, an
example being the blacklegged tick or deer
tick (Ixodes scapularis), shown here on a
human finger. Although ticks of this species
thrive on white-tailed deer and other wild
mammals, they can also live on humans.
(B) Ticks themselves harbor their own
parasites including the bacterium Borrelia
burgdorferi, stained here with a vital dye
that labels living bacteria green and dead
bacteria red. These spiral-shaped bacteria
live in deer ticks and can be transmitted to
humans during a tick’s blood meal. Borrelia
burgdorferi causes Lyme disease, which
is characterized by a bull’s-eye-shaped
skin rash and fever; if the infection is left
untreated, various complications can
result, including arthritis and neurological
abnormalities. The idea that parasites have
their own parasites was noted by Jonathan
Swift in 1733:

“So, naturalists observe, a flea
Has smaller fleas that on him prey;
And these have smaller still to bite ’em;
And so proceed ad infinitum.”

(A, from Acorn, White-Footed Mice and 
Tick Cycle Augment Risks of Lyme Disease 
in 2012. March 14, 2012. Reprinted with 
permission of Anita Sil; B, courtesy of 
M. Embers.) 


Pathogens Interact with Their Hosts in Different Ways 

If it is normal for us to live with a community of microbes, why are some of
them capable of causing us illness or death? Although the ability of a particular
microorganism to cause disease depends on many factors, it requires that the
pathogen possess specialized pathogenic characteristics that allow it to live in
humans.

Primary pathogens can cause overt disease in most healthy people. Some
primary pathogens cause acute, life-threatening epidemic infections and spread
rapidly from one sick or dying host to another; historically important examples
include the bacterium Vibrio cholerae, which causes cholera, and the variola
and influenza viruses, which cause smallpox and flu, respectively. Others may
persistently infect a single individual for years without causing overt disease;
examples include the bacterium Mycobacterium tuberculosis (which can cause
the life-threatening lung infection tuberculosis) and the intestinal worm Ascaris.
Although these potential primary pathogens can make some people critically ill,
billions of people carry these foreign organisms in an asymptomatic way, often
unaware that they are infected. It is sometimes difficult to draw a line between
the asymptomatic presence of such pathogens and the normal microbiota. Some
microbes of the normal flora can act as opportunistic pathogens, in that they
cause disease only if our immune systems are weakened or if they gain access to a
normally sterile part of the body.

In order to survive and multiply, a successful pathogen must be able to:
(1) enter the host (usually by breaking an epithelial barrier); (2) find a nutritionally
compatible niche in the host's body; (3) avoid, subvert, or circumvent the host's
innate and adaptive immune responses; (4) replicate, using host resources; and (5)
exit one host and spread to another. Pathogens have evolved various mechanisms
that maximally exploit the biology of their host organisms to help accomplish these
tasks. For some pathogens, these mechanisms are adapted to a unique host species,
whereas for others the mechanisms are sufficiently general to permit invasion,
survival, and replication in a wide variety of hosts. Because pathogens have
evolved the ability to interface directly with the molecular machinery of host cells,
we have learned a great deal about cell biological principles by studying them.

Our constant exposure to pathogens has strongly influenced human evolution.
In modern times, humans have learned how to limit the ability of pathogens to
infect us through improvements in public health measures and childhood nutrition,
vaccines, antimicrobial drugs, and routine testing of blood used for transfusions.
As we learn more about the mechanisms by which pathogens cause disease
(called pathogenesis), our creativity and resourcefulness will continue to serve
as an important addition to our immune systems in fighting infectious diseases.

Pathogens Can Contribute to Cancer, Cardiovascular Disease, 
and Other Chronic Illnesses

Some viral and bacterial pathogens can cause or contribute to chronic, life-threatening
illnesses that are not normally classified as infectious diseases. An important
example is cancer. As discussed in Chapter 20, the oncogene concept—that
certain altered genes can trigger cell transformation and tumor development—
came initially from studies of the Rous sarcoma virus, which causes a form of cancer
(sarcomas) in chickens. One of the viral genes encodes an overactive homolog
of the host tyrosine kinase Src (see Figure 3-63), which has been implicated in
many kinds of cancer. Several human cancers are also known to have a viral origin.
Human papillomavirus, for example, which causes genital warts, is responsible
for more than 90% of cervical cancers (see Figure 20-40). The recent development
of a vaccine against the most abundant cancer-associated strains of human
papillomavirus promises to prevent many of these cancers in the future. In other
cases, chronic tissue damage caused by infection can increase the likelihood of
cancer. Inflammation caused by the stomach-dwelling bacterium H. pylori can be
a major contributor to stomach cancer, as well as to gastric ulcers.

The major causes of death in wealthy industrialized nations are cardiovascular
diseases. They frequently result from atherosclerosis, the accumulation in blood
vessel walls of fatty deposits that can block blood flow and cause heart attacks
and strokes. A hallmark of early atherosclerosis is the appearance in blood vessel
walls of clumps of macrophages called foam cells, which recruit other white blood
cells into the forming atherosclerotic plaque. Foam cells in atherosclerotic plaques
often contain the bacterial pathogen Chlamydia pneumoniae, which commonly
causes pneumonia in humans and is a significant risk factor for atherosclerosis
in humans and animal models. Other bacterial species are also implicated in
atherosclerosis, including bacteria usually associated with teeth and gums, such
as Porphyromonas gingivalis. As we learn more about the interactions between
pathogens and the human body, it seems likely that more chronic conditions will
be found to have a link to an infectious agent.


Figure 23-2 Pathogens in many forms.

(A) The structure of the protein coat, or
capsid, of poliovirus. This virus was once
a common cause of paralysis, but the
disease (poliomyelitis) has been greatly
reduced by widespread vaccination.
(B) The bacterium Vibrio cholerae, the
causative agent of the epidemic, diarrheal
disease cholera. (C) The protozoan parasite
Trypanosoma brucei (purple) in a field
of erythrocytes (red blood cells; pink).
This parasite causes African sleeping
sickness, a potentially fatal disease of the
central nervous system. (D) This clump of
Ascaris nematodes was removed from the
obstructed intestine of a two-year-old boy.
(A, courtesy of Robert Grant,
Stephan Crainic, and James M. Hogle;
B, photograph courtesy of John
Mekalanos; C, CDC, Department of Health
and Human Services; D, from J.K. Baird et
al., Am. J. Trop. Med. Hyg. 35:314-318,
1986. Photograph by Daniel H. Connor.)

Pathogens Can Be Viruses, Bacteria, or Eukaryotes 

Many types of pathogens cause disease in humans. The most familiar are viruses
and bacteria. Viruses cause diseases ranging from AIDS and smallpox to the common
cold. Viruses are essentially fragments of nucleic acid (DNA or RNA) that
generally encode a relatively small number of gene products, wrapped in a protective
shell of proteins (Figure 23-2A) and (in some cases) an outer membrane
envelope (see Figure 5-62). Much larger and more complex than viruses, bacteria
are prokaryotic cells, which perform most of their basic metabolic functions
themselves, relying on the host primarily for nutrition (Figure 23-2B).

Some other infectious agents are eukaryotic organisms. These range from
single-celled fungi and protozoa (Figure 23-2C) to large, complex metazoa such
as parasitic worms. One of the most common human parasites, shared by about
a billion people at present, is the nematode worm Ascaris lumbricoides, which
infects the gut (Figure 23-2D). It closely resembles its harmless nematode cousin
Caenorhabditis elegans, which is used as a model organism for genetic and developmental
biological research (see Figure 1-39). C. elegans, however, is only about
1 mm in length, whereas Ascaris can reach 30 cm.

We now introduce the basic features of each of the major types of pathogens,
before we examine the mechanisms that pathogens use to infect their hosts.

Figure 23-3 Bacterial shapes and cell-surface structures. (A) Bacteria are traditionally classified by shape. (B and C) They
are also classified as Gram positive or Gram negative. (B) Gram-positive bacteria such as Streptococcus and Staphylococcus
have a single membrane and a thick cell wall made of cross-linked peptidoglycan. They are called Gram positive because
they retain the violet dye used in the Gram-staining procedure. (C) Gram-negative bacteria such as Escherichia coli (E. coli)
and Salmonella have two membranes, separated by the periplasm (see Figure 11-17). The peptidoglycan cell wall of these
organisms is located in the periplasm and is thinner than in Gram-positive bacteria; they therefore fail to retain the dye in the
Gram-staining procedure. The inner membrane of both Gram-positive and Gram-negative bacteria is a phospholipid bilayer. The
inner leaflet of the outer membrane of Gram-negative bacteria is also made primarily of phospholipids, whereas the outer leaflet
of the outer membrane is composed of a unique glycosylated lipid called lipopolysaccharide (LPS). (D) Cell-surface appendages
are important for bacterial behavior. Many bacteria swim using the rotation of helical flagella. The bacterium illustrated has only a
single flagellum at one pole; however, many have multiple flagella. Straight pili (also called fimbriae) are used to adhere to various
surfaces in the host, as well as to facilitate genetic exchange between bacteria. Some kinds of pili can retract to generate force
and thereby help bacteria move along surfaces.


Bacteria Are Diverse and Occupy a Remarkable Variety of 
Ecological Niches 

Although bacteria generally lack internal membranes, they are highly sophisticated
cells whose organization and behaviors have attracted the attention of many
scientists. Bacteria are classified broadly by their shape—as rods, spheres (cocci),
or spirals (Figure 23-3A)—as well as by their so-called Gram-staining properties,
which reflect differences in the structure of the bacterial cell wall. Gram-positive
bacteria have a thick layer of peptidoglycan cell wall outside their inner (plasma)
membrane (Figure 23-3B), whereas Gram-negative bacteria have a thinner
peptidoglycan cell wall. In both cases, the cell wall protects against lysis by osmotic
swelling, and it is a target of host antibacterial proteins such as lysozyme and
antibiotics such as penicillin. Gram-negative bacteria are also covered outside
the cell wall by an outer membrane containing lipopolysaccharide (LPS) (Figure
23-3C). Both peptidoglycan and LPS are unique to bacteria and are recognized
as pathogen-associated molecular patterns (PAMPs) by the host innate immune
system, as discussed in Chapter 24. The surface of bacterial cells can also display
an array of appendages, including flagella and pili, which enable bacteria to swim
or adhere to desirable surfaces, respectively (Figure 23-3D). Apart from cell shape
and structure, differences in ribosomal RNA and genomic DNA sequence are also
used for phylogenetic classification. Because bacterial genomes are small—typically
between 1,000,000 and 5,000,000 nucleotide pairs (compared to more than
3,000,000,000 for humans)—they are now simple to sequence, making this an
important new classification tool.
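
The size comparison at the end of this paragraph can be made concrete with a short calculation. The sketch below uses only the nucleotide-pair figures quoted above; the variable names are illustrative, and because the human figure is given as "more than" 3,000,000,000, the results are lower bounds.

```python
# Genome sizes in nucleotide pairs, as quoted in the text.
bacterial_min_bp = 1_000_000
bacterial_max_bp = 5_000_000
human_bp = 3_000_000_000  # the text gives this as a lower bound

# How many times larger the human genome is than a large or a small
# bacterial genome within the typical range.
fold_vs_largest = human_bp // bacterial_max_bp
fold_vs_smallest = human_bp // bacterial_min_bp

print(fold_vs_largest, fold_vs_smallest)
```

On these figures the human genome is at least 600 to 3,000 times larger than a typical bacterial genome.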

Bacteria also exhibit extraordinary molecular, metabolic, and ecological diversity.
At the molecular level, bacteria are far more diverse than eukaryotes, and
they can occupy ecological niches having extremes of temperature, salt concentrations,
and nutrient limitation. Some bacteria replicate in an environmental reservoir
such as water or soil and only cause disease if they happen to encounter a
susceptible host; these are called facultative pathogens. Others can only replicate
inside the body of their host and are therefore called obligate pathogens. Bacteria
also differ in the range of hosts they will infect. Shigella flexneri, for example,
which causes epidemic dysentery (bloody diarrhea), will infect only humans and
other primates. By contrast, the closely related bacterium Salmonella enterica,
which is a common cause of food poisoning in humans, can also infect other vertebrates,
including chickens and turtles. A champion generalist is the opportunistic
pathogen Pseudomonas aeruginosa, which can cause disease in a wide variety
of plants and animals.


Bacterial Pathogens Carry Specialized Virulence Genes 

Pathogenic bacteria and their closest nonpathogenic relatives often differ in a relatively
small number of genes. Genes that contribute to the ability of an organism
to cause disease are called virulence genes, and the proteins they encode are
called virulence factors. Such virulence genes are often clustered together on the
bacterial chromosome; large clusters are called pathogenicity islands. Virulence
genes can also be carried on bacteriophages (bacterial viruses) or transposons
(see Table 5-4), both of which integrate into the bacterial chromosome, or on
extrachromosomal virulence plasmids (Figure 23-4A).

Figure 23-4 Genetic differences
between pathogenic and nonpathogenic
bacteria. (A) Genetic differences between
nonpathogenic E. coli and two closely
related food-borne pathogens—Shigella
flexneri, which causes dysentery, and
Salmonella enterica, a common cause
of food poisoning. Nonpathogenic E. coli
has a single circular chromosome. The
chromosome of S. flexneri differs from
that of E. coli in a limited number of
locations; most of the genes required for
pathogenesis (virulence genes) are carried
on an extrachromosomal virulence plasmid.
The chromosome of S. enterica carries
two large inserts (pathogenicity islands) not
found in the E. coli chromosome; these
inserts each contain many virulence genes.
(B) Bacterial pathogens evolve by horizontal
gene transfer. This can occur by three
mechanisms: natural transformation, in
which naked DNA is taken in by competent
bacteria; transduction, in which bacterial
viruses (bacteriophages) transfer DNA
from one bacterium into another; and
conjugation, during which plasmid DNA,
and even chromosomal DNA, is transferred
from a donor to a recipient bacterium.

Figure 23-5 Comparative-genomics-based
model for the evolution of
pathogenic Vibrio cholerae strains.
Progenitor strains in the wild first
acquired the biosynthetic pathway
necessary to make the O1 antigen type of
carbohydrate chain on the outer-membrane
lipopolysaccharide (see Figure 23-3C).
Incorporation of the CTXφ bacteriophage
created the classical pathogenic strains
responsible for the first six worldwide
epidemics of cholera between 1817 and
1923. Sometime in the twentieth century,
an O1 strain in the environment picked
up the CTXφ bacteriophage again, along
with an associated bacteriophage RS1φ
and two pathogenicity islands (VSP1
and VSP2), creating the El Tor strain
that emerged as the seventh worldwide
pandemic in 1961. In 1992, an El Tor strain
was isolated that had picked up a new
DNA cassette, enabling it to produce the
O139 antigen type of carbohydrate chain
rather than the O1 type. This altered the
bacterium’s interaction with the human
immune system, without diminishing its
virulence; this bacterium also picked up a
new pathogenicity island (SXT). An electron
micrograph of Vibrio cholerae (V. cholerae)
is shown in Figure 23-2B.


Pathogenic bacteria are thought to emerge when groups of virulence genes
are transferred together into a previously avirulent bacterium by a process called
horizontal gene transfer (to distinguish it from vertical gene transfer from parent
to offspring). Horizontal transfer can occur by one of three mechanisms: natural
transformation by released naked DNA, transduction by bacteriophages, or sexual
exchange by conjugation (Figure 23-4B and Movie 23.1). Sequencing the genomes
of large numbers of pathogenic and nonpathogenic bacteria has indicated that
horizontal gene transfer has made important contributions to bacterial evolution,
enabling species to inhabit new ecological and nutritional niches, as well as to
cause disease. Even within a single bacterial species, the amount of chromosomal
variation is astonishing; the genomes of different strains of Escherichia coli can
differ by as much as 25%. Such variation has led to the concept that a bacterial
species has both a core genome common to all isolates within the species and a
larger pan-genome consisting of all genes present in the full spectrum of isolates.
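
The distinction between a core genome and a pan-genome maps directly onto set intersection and set union. The following is a minimal sketch under stated assumptions: the isolate names and gene labels are hypothetical placeholders, not real E. coli data, and each isolate is reduced to the set of genes it carries.

```python
# Hypothetical gene content of three isolates of one bacterial species.
# The isolate and gene names are placeholders for illustration only.
isolates = {
    "isolate_1": {"geneA", "geneB", "geneC"},
    "isolate_2": {"geneA", "geneB", "geneD"},
    "isolate_3": {"geneA", "geneB", "geneC", "geneE"},
}

gene_sets = list(isolates.values())

# Core genome: genes common to every isolate (intersection).
core_genome = set.intersection(*gene_sets)

# Pan-genome: every gene found in at least one isolate (union).
pan_genome = set.union(*gene_sets)

print(sorted(core_genome))
print(sorted(pan_genome))
```

By construction the core genome is always a subset of the pan-genome, and adding a new isolate can only shrink the former or enlarge the latter.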

Acquisition of genes and gene clusters can drive the rapid evolution of pathogens
and turn nonpathogens into pathogens. Consider, for example, Vibrio cholerae—the
Gram-negative bacterium that causes the epidemic diarrheal disease
cholera. Of the hundreds of strains of Vibrio cholerae, the only ones that cause
pandemic human disease are those infected with a mobile bacteriophage (CTXφ)
containing genes encoding the two subunits of the toxin that causes the diarrhea.
As summarized in Figure 23-5, seven pandemics of V. cholerae have arisen since
1817. The first six were caused by the periodic reemergence of so-called classical
strains. In addition to the toxin-encoding bacteriophage, these classical strains
shared a similar O1 surface antigen, part of the LPS in the outer membrane (see
Figure 23-3C). In 1961, the seventh pandemic began, caused by a new strain
named “El Tor,” which arose when an O1-expressing strain acquired two bacteriophages
and at least two new pathogenicity islands. El Tor eventually displaced
the classical strains. In 1992, a new strain emerged in which O1 was replaced with
another O-antigen variant called O139, which was not recognized by antibodies
present in the blood of survivors of previous cholera epidemics. The O139 strain
also contains a transposon-like element that encodes antibiotic resistance. As this
example makes clear, the rapid evolution of bacterial pathogens can be likened
to an arms race that pits the survival of a bacterium against our immune systems
and the tools of modern medicine. Similar struggles for survival take place
between all pathogens and humans, and understanding these conflicts provides
key insights into the evolution of pathogens and greatly informs how we treat
new outbreaks of infectious diseases.

Bacterial Virulence Genes Encode Effector Proteins and Secretion
Systems to Deliver Effector Proteins to Host Cells 

What are the gene products that enable a bacterium to cause disease in a healthy
host? For pathogenic bacteria that live outside of host cells, called extracellular
bacterial pathogens, virulence genes often encode secreted toxic proteins (toxins)
that interact with host cell structural or signaling proteins to elicit a response
that is beneficial to the pathogen. Several of these bacterial toxins are among the
most potent of known human poisons. Bacterial toxins are often composed of two
protein components—an A subunit with enzymatic activity, and a B subunit that
binds to specific receptors on the host cell surface and directs the trafficking of
the A subunit to the cytosol by various routes (Figure 23-6). The Vibrio cholerae
phage, for example, encodes the two subunits of cholera toxin (Movie 23.2). The
A subunit catalyzes the transfer of an ADP-ribose moiety from NAD+ to the trimeric
G protein Gs (see Figure 15-23), which activates adenylyl cyclase to make
cyclic AMP (see Figure 15-25). ADP-ribosylation prevents inactivation of the G
protein and results in the overaccumulation of intracellular cyclic AMP and the
release of ions and water into the intestinal lumen, leading to the watery diarrhea
associated with cholera. The infection then spreads to new hosts via released bacteria,
which can contaminate food and water.

Some pathogenic bacteria secrete multiple toxins, each of which targets a different
signaling pathway in host cells. Anthrax, for example, is an acute infectious
disease of sheep, cattle, and occasionally humans. It is caused by contact with
spores of the Gram-positive bacterium Bacillus anthracis. Dormant spores can
survive in soil for long periods. If inhaled, ingested, or rubbed into breaks in the
skin, spores can germinate and the bacteria replicate. The bacteria secrete two
toxins with identical B subunits but different A subunits. The B subunits bind to
a host cell-surface receptor protein to transfer the two different A subunits into
host cells (see Figure 23-6). The A subunits are called lethal factor and edema
factor. The A subunit of edema toxin is an adenylyl cyclase that catalyzes the production
of cyclic AMP (see Figure 15-25), leading to an ion imbalance that can
cause an accumulation of extracellular fluid (edema) in the skin or lung. The A
subunit of lethal toxin is a protease that cleaves several activated members of the
mitogen-activated protein kinase kinase (MAP kinase kinase) family (see Figure
15-49), disrupting intracellular signaling and leading to immune cell dysfunction
and cell death. Injection of lethal toxin into the bloodstream of an animal causes
shock (a large fall in blood pressure) and death.


Figure 23-6 Bacterial toxin entry into
host cells. Bacterial toxins are often
composed of A and B protein subunits.
The B (binding) subunit of the toxin
interacts with host-cell toxin receptors,
enabling endocytosis and intracellular
trafficking of B subunit as well as its
associated and enzymatically active
A subunit(s). In the case of Bacillus
anthracis, the B subunit changes
conformation in the low pH environment
of the endosome to form a pore through
which two different A subunits, lethal
factor and edema factor, are transported
across the membrane of the endosome
in an unfolded conformation. In the
cases of Vibrio cholerae toxin and
Bordetella pertussis toxin, the B and A
subunits are transported to the Golgi
apparatus and then to the endoplasmic
reticulum (ER), where the A subunits are
then translocated into the cytosol in an
unfolded conformation through a
protein-translocation channel.

Figure 23-7 Type III secretion systems
that can deliver effector proteins into
the cytosol of a host cell. (A) Electron
micrograph of purified type III secretion
systems, each of which consists of
over two dozen proteins. (B) The large
lower ring is embedded in the bacterial
inner membrane, and the smaller upper
ring is embedded in the bacterial outer
membrane. During infection, docking of
the tip of the hollow needle at a host-cell
plasma membrane results in the secretion
of bacterial translocator proteins (green),
which form a pore in the host membrane,
through which bacterial effector proteins
are then secreted into the host cell.
(A, from O. Schraidt et al., PLoS Pathog.
6(4):e1000824, 2010.)

Apart from toxins, bacteria use specialized secretion systems to secrete
many other effector proteins that interact with host cells. Gram-negative bacteria
have a general secretion system and several classes of accessory secretion systems
(types I-VI). A subset of these accessory secretion systems, called contact-dependent
secretion systems, is present in many bacteria that contact or live inside host
cells. The type III secretion system (Figure 23-7), for example, injects into the
host-cell cytoplasm effector proteins that can elicit a variety of host cell responses
that enable the bacterium to invade or survive. There is a remarkable degree of
structural similarity between the type III syringe and the base of a bacterial flagellum.
Because flagella are found in a wider range of bacteria than are type III
secretion systems, and the secretion systems appear to be adaptations specific for
pathogenesis, it seems likely that the type III secretion systems evolved from flagella.
Other types of delivery systems used by bacterial pathogens appear to have
evolved independently. For example, type IV secretion systems are closely related
to the conjugation apparatus that many bacteria use to exchange genetic material.

Fungal and Protozoan Parasites Have Complex Life Cycles
Involving Multiple Forms 

Pathogenic fungi and protozoan parasites are eukaryotes, as are their hosts. Consequently,
antifungal and antiparasitic drugs are often less effective and more
toxic to the host than are antibiotics that target bacteria. A second characteristic
of fungal and parasitic infections that makes them difficult to treat is the tendency
of the pathogens to switch among several different forms during their life cycles. A
drug that is effective at killing one form can be ineffective at killing another form;
therefore the population can survive the treatment.

Fungi include both unicellular yeasts (such as Saccharomyces cerevisiae and
Schizosaccharomyces pombe, which are used to bake bread and brew beer, and as
model organisms for cell biology research) and filamentous, multicellular molds
(like those found on moldy fruit or bread). Most of the important pathogenic
fungi exhibit dimorphism—the ability to grow in either yeast or mold form. The
yeast-to-mold or mold-to-yeast transition is frequently associated with infection.
Histoplasma capsulatum, for example, grows as a mold at low temperature in the
soil, but it switches to a yeast form when inhaled into the lung, where it can cause
the disease histoplasmosis (Figure 23-8).

Protozoan parasites are single-celled eukaryotes with more elaborate life 
cycles than fungi, and they frequently require more than one host. Malaria is the 
most devastating protozoal disease, infecting more than 200 million people every 
year and killing upward of 500,000. It is caused by four species of Plasmodium,
which are transmitted to humans by the bite of the female Anopheles mosquito. 

Figure 23-8 Dimorphism in the
pathogenic fungus Histoplasma
capsulatum. (A) At low temperature in the
soil, H. capsulatum grows as a multicellular
filamentous mold consisting of many
individual cells connected together. (B) After
it is inhaled into the lung of a mammal, the
increase in temperature causes a switch
to a yeast form consisting of small clumps
of round cells. (C) A stained histologic
section of a mouse lung infected with
H. capsulatum, showing a macrophage
containing yeast forms of the pathogen.

(A and B, courtesy of Sinem Beyhan and 
Anita Sil; C, courtesy of Davina Hocking 
Murray and Anita Sil.) 


Figure 23-9 The complex life cycle of 
malaria parasites. (A) The sexual cycle of 
Plasmodium falciparum requires passage 
between a human host and an insect host 
(Movie 23.3). (B)-(D) Blood smears from 
people with malaria, showing three different 
forms of the parasite that appear in red 
blood cells: (B) ring stage; (C) schizont; and 
(D) gametocyte. (B-D, courtesy of the
Centers for Disease Control, Division of 
Parasitic Diseases, DPDx.) 

Plasmodium falciparum causes the most serious form of malaria and is the most
intensively studied of the malaria-causing parasites. It exists in many distinct
forms, and it requires both the human and mosquito hosts to complete its sexual
cycle (Figure 23-9). Several of these forms are highly specialized to invade and
replicate in specific tissues: the lining of the insect gut, the human liver, and the
human red blood cell. Even within a single host cell type, the red blood cell, the
Plasmodium parasite undergoes a complex sequence of developmental events,
reflected in striking morphological changes (Figure 23-9B-D).

All Aspects of Viral Propagation Depend on Host Cell Machinery

Bacterial, fungal, and protozoan pathogens are living cells themselves. They use
their own machinery for DNA replication, transcription, and translation, and, for
the most part, they provide their own sources of metabolic energy. Viruses, by
contrast, are the ultimate hitchhikers, carrying little more than information in the
form of nucleic acid. Most clinically important human viruses have small genomes
consisting of double-stranded DNA or single-stranded RNA (Table 23-1), and we
now have complete genome sequences of almost all of them.

Viral genomes typically encode three types of protein: proteins for replicating
the genome, proteins for packaging the genome and delivering it to more
host cells, and proteins for modifying the structure or function of the host cell to
enhance the replication of the virus (see Figure 7-62). In general, viral replication
involves (1) entry into the host cell, (2) disassembly of the infectious virus particle,
(3) replication of the viral genome, (4) transcription of viral genes and synthesis of
viral proteins, (5) assembly of these viral components into progeny virus particles,


Table 23-1 Viruses That Cause Human Disease

Virus                                 Genome type                             Disease
Herpes simplex virus 1                Double-stranded DNA                     Recurrent cold sores
Epstein-Barr virus (EBV)              Double-stranded DNA                     Infectious mononucleosis
Varicella-zoster virus                Double-stranded DNA                     Chickenpox and shingles
Smallpox virus (Variola)              Double-stranded DNA                     Smallpox
Human papillomavirus                  Double-stranded DNA                     Warts, cancer
Adenovirus                            Double-stranded DNA                     Respiratory disease
Hepatitis-B virus                     Part single-, part double-stranded DNA  Hepatitis B
Human immunodeficiency virus (HIV-1)  Single-stranded RNA [+] strand          Acquired immune deficiency syndrome (AIDS)
Poliovirus                            Single-stranded RNA [+] strand          Poliomyelitis
Rhinovirus                            Single-stranded RNA [+] strand          Common cold
Hepatitis-A virus                     Single-stranded RNA [+] strand          Hepatitis A
Hepatitis-C virus                     Single-stranded RNA [+] strand          Hepatitis C
Yellow fever virus                    Single-stranded RNA [+] strand          Yellow fever
Coronavirus                           Single-stranded RNA [+] strand          Common cold, respiratory disease
Rabies virus                          Single-stranded RNA [-] strand          Rabies
Mumps virus                           Single-stranded RNA [-] strand          Mumps
Measles virus                         Single-stranded RNA [-] strand          Measles
Influenza virus type A                Single-stranded RNA [-] strand          Respiratory disease (flu)

Figure 23-10 A simple viral life cycle. The hypothetical simple virus shown
here consists of a small double-stranded DNA molecule that codes for
only a single viral capsid protein. To reproduce, the viral genome must first
enter a host cell, where it is replicated to produce multiple copies, which are
transcribed and translated to produce the viral coat protein. The viral genomes
can then assemble spontaneously with the coat protein to form a new virus
particle, which escapes from the host cell. No known virus is this simple.


and (6) release of progeny virions (Figure 23-10). A single virus particle (a virion)
that infects a single host cell can produce thousands of progeny.

Virions come in a wide variety of shapes and sizes (Figure 23-11), and
although most have relatively small genomes, genome size can vary considerably.
The recently discovered giant viruses of amoebae, called pandoraviruses, are the
largest known viruses, with 700 nm particles and double-stranded DNA genomes
of over 2,000,000 nucleotide pairs. The virions of poxvirus are also large: they are
250-350 nm long and enclose a genome of double-stranded DNA of about 270,000
nucleotide pairs. At the other end of the size scale are the virions of parvovirus,
which are less than 30 nm in diameter and have a single-stranded DNA genome
of fewer than 5000 nucleotides.
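
As a rough illustration of the size range just quoted, the following Python sketch compares the three genome lengths. The dictionary and function names are illustrative only, and the numbers are the rounded bounds given in the text ("over", "about", "fewer than"), so the ratios should be read as order-of-magnitude estimates rather than measurements.

```python
# Approximate genome lengths from the text (nucleotide pairs or nucleotides).
# These are rounded bounds, not values for any particular viral isolate.
GENOME_LENGTH = {
    "pandoravirus": 2_000_000,  # "over 2,000,000" nucleotide pairs, double-stranded DNA
    "poxvirus": 270_000,        # "about 270,000" nucleotide pairs, double-stranded DNA
    "parvovirus": 5_000,        # "fewer than 5000" nucleotides, single-stranded DNA
}


def fold_difference(larger: str, smaller: str) -> float:
    """Return how many times longer one genome is than another."""
    return GENOME_LENGTH[larger] / GENOME_LENGTH[smaller]


if __name__ == "__main__":
    for virus in ("pandoravirus", "poxvirus"):
        ratio = fold_difference(virus, "parvovirus")
        print(f"{virus} genome is roughly {ratio:.0f}x the parvovirus genome")
```

On these figures the largest known viral genome is at least about 400 times the length of the parvovirus genome.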

Viral genomes are packaged in a protein coat, called a capsid, which in some
viruses is further enclosed by a lipid bilayer membrane, or envelope. The capsid is
made of one or several proteins, arranged in regular arrays that generally produce
structures with either helical symmetry, which results in a cylindrical structure
(for example, influenza, measles, and bunyavirus), or icosahedral symmetry (for
example, poliovirus and herpesvirus; see Figure 23-11). Some viruses instead
produce capsids with more complicated structures (for example, poxviruses). When
the capsid is packaged with the viral genome, the structure is called a
nucleocapsid. The nucleocapsids of nonenveloped viruses usually leave an infected cell by
lysing it. For enveloped viruses, by contrast, the nucleocapsid is enclosed within a
lipid bilayer membrane that the virus acquires in the process of budding from the
host-cell plasma membrane, which it does without disrupting the membrane or
killing the cell (Figure 23-12). Enveloped viruses can cause persistent infections
that may last for years, often without noticeable deleterious effects on the host.

Because the host cell performs most of the critical steps in viral replication, the
identification of effective antiviral drugs that do not harm the host can be difficult.
Probably the most effective strategy for containing viral diseases is
vaccination of potential hosts. Highly successful vaccination programs have effectively

Figure 23-11 Examples of viral
morphology. As shown, both DNA and 
RNA viruses vary greatly in both size and 
shape. 

Figure 23-12 Acquisition of a viral
envelope. (A) Electron micrograph of 
an animal cell from which six copies of 
an enveloped virus (Semliki forest virus) 
are budding. (B) Schematic drawing of 
the envelope assembly and budding 
processes. The lipid bilayer that surrounds 
the viral capsid is derived directly from 
the plasma membrane of the host cell. In 
contrast, the proteins in this lipid bilayer 
(shown in green) are encoded by the viral 
genome. (A, courtesy of M. Olsen and 
G. Griffith.) 


eliminated smallpox infection from the planet, and the eradication of
poliomyelitis is approaching completion (Figure 23-13).

Summary 

Infectious diseases are caused by pathogens, which include viruses, bacteria, and
fungi, as well as protozoan and metazoan parasites. All pathogens must have
mechanisms for entering their host and for evading immediate destruction by the
host. The great majority of bacteria are not pathogenic to humans. Those that are
pathogenic produce specific virulence factors that mediate the bacteria's
interactions with the host; these proteins change the behavior of host cells in ways that
promote the replication and spread of the bacteria. Eukaryotic pathogens such as
fungi and protozoan parasites typically pass through several different forms during
the course of infection; the ability to switch among these forms is usually required
for these pathogens to survive in a host and cause disease. In some cases, such as
malaria, parasites must pass sequentially through several host species to complete
their life cycles. Unlike bacteria and eukaryotic parasites, viruses have no
metabolism of their own and no intrinsic ability to produce the proteins encoded by their
DNA or RNA genomes; they rely on subverting the machinery of the host cell.



Figure 23-13 Effective control of a viral disease through vaccination. The graph shows the 
number of cases of poliomyelitis reported per year in the United States. The arrows indicate the 
timing of the introduction of the Salk vaccine (inactivated virus given by injection) and the Sabin 
vaccine (live attenuated virus given orally). 

CELL BIOLOGY OF INFECTION

The mechanisms through which pathogens cause disease are as diverse as the
pathogens themselves. Nonetheless, all pathogens must carry out certain
common tasks: they must gain access to the host, reach an appropriate niche, avoid
host defenses, replicate, and exit from the infected host to spread to an uninfected
one. In this section, we examine the common strategies that many pathogens use
to accomplish these tasks.

Pathogens Overcome Epithelial Barriers to Infect the Host 

The first step in infection is for the pathogen to gain access to the host. A thick
covering of skin protects most parts of the human body from the environment. The
protective boundaries of some other human tissues (eyes, nasal passages,
respiratory tract, mouth, digestive tract, urinary tract, and female genital tract) are less
robust. In the lungs and small intestine, for example, the barrier is just a single
monolayer of epithelial cells. Nonetheless, all these epithelia serve as barriers to
infection.

Wounds in barrier epithelia allow pathogens direct access to unoccupied
niches within otherwise sterile host tissues. This avenue of entry requires little
in the way of pathogen specialization, and many members of the normal flora
can cause serious illness if they enter through such wounds. Staphylococci from
the skin and nose, or Streptococci from the throat and mouth, are two examples
of opportunistic bacterial pathogens that are responsible for many serious
infections resulting from breaches in epithelial barriers. The recent emergence of
bacterial strains of Staphylococcus that are resistant to the antibiotics commonly
used for treatment (for example, methicillin-resistant Staphylococcus aureus, or
MRSA, which infects up to 50,000,000 people worldwide) is of particular concern.
Papillomaviruses, which cause warts and cervical cancer, also take advantage of
breaches in epithelial barriers.

Primary pathogens, however, need not wait for a wound to gain access to their 
host. One efficient way for such a pathogen to cross the skin is to catch a ride in the 
saliva of a biting arthropod. A diverse group of bacteria, viruses, and protozoa has 
developed the ability to survive in insects and then use them as vectors to spread 
from one mammalian host to another. As discussed earlier, the Plasmodium pro- 
tozoan that causes malaria develops through several forms in its life cycle, includ- 
ing some that are specialized for survival in a human and others that are special- 
ized for survival in a mosquito (see Figure 23-9). Viruses that are spread by insect 
bites cause yellow fever and Dengue fever, as well as many kinds of viral enceph- 
alitis (inflammation of the brain). These viruses replicate in both insect cells and 
mammalian cells, as required for their transmission by an insect vector. 

The efficient spread of a pathogen via an insect vector requires that an
individual insect consumes a blood meal from an infected host and transfers the
pathogen to a naive host. In a few striking cases, the pathogen alters the behavior of
the insect so that its transmission to a new host is more likely. An example is the
bacterium Yersinia pestis, which causes bubonic plague. It multiplies in the flea's
foregut to form aggregated masses that physically block the digestive tract; during
each repeated, but futile, attempt at feeding, some of the bacteria in the foregut are
flushed into the bite site, thus transmitting plague to a new host (Figure 23-14).

Pathogens That Colonize an Epithelium Must Overcome Its 
Protective Mechanisms 

Whereas many epithelial barriers such as the skin and the lining of the mouth
and large intestine are densely populated by normal flora, others, including the
lining of the lower lung and the bladder, are normally kept nearly sterile. How
do these epithelia avoid bacterial colonization? A layer of protective mucus
covers the respiratory epithelium, and the coordinated beating of cilia sweeps the
mucus and trapped bacteria up and out of the lung. The epithelial lining of the
bladder and the upper gastrointestinal tract also has a thick layer of mucus, and

Figure 23-14 Plague bacteria within
a flea. This light micrograph shows the 
digestive tract dissected from a flea that 
had dined about two weeks previously 
on the blood of an animal infected with 
the plague bacterium, Yersinia pestis. 
The bacteria multiplied in the flea gut to 
produce large cohesive aggregates (red 
arrows); the bacterial mass on the left 
is occluding the passage between the 
esophagus and the midgut. This type of 
blockage prevents a flea from digesting 
its blood meals, so that hunger causes 
it to bite repeatedly, disseminating the 
infection. (From B.J. Hinnebusch, E.R. Fischer and T.G. Schwan,
J. Infect. Dis. 178:1406-1415, 1998.)

these organs are periodically flushed by urination and by peristalsis, respectively,
which washes away most microbes. 

Pathogenic bacteria and eukaryotic parasites that infect these epithelial
surfaces have evolved specific mechanisms for overcoming these protective
mechanisms. Those that infect the urinary tract, for example, adhere tightly to the
epithelial lining via specific adhesins, which are proteins or protein complexes
that recognize and bind to cell-surface molecules on the epithelium. An
important group of adhesins in E. coli strains that infect the kidney are components of
the pili, surface projections that can be several micrometers long and thus able
to span the thickness of the protective mucus layer; at the tip of each pilus is an
adhesin protein that binds tightly to the D-galactose-D-galactose disaccharide
on glycolipids on the surface of kidney cells (Figure 23-15). Strains of E. coli that
infect the bladder rather than the kidney express a second kind of pilus with a
different adhesin protein that binds to bladder epithelial cells. It is the specificity of
the adhesin proteins on the tips of the two types of pili that is responsible for the
bacteria's colonizing of the different parts of the urinary tract.

The stomach is an especially hostile environment for pathogens. Besides the
thick layer of mucus and peristaltic washing, it is filled with acid (average pH ~2),
which is lethal to almost all bacteria ingested in food. Yet, it is home to a
microbiota of hundreds of resident species, including the bacterium H. pylori, which,
as we discussed earlier, is the major cause of stomach ulcers and some stomach
cancers. The hypothesis that a persistent bacterial infection could cause stomach
ulcers was initially met with skepticism. The young Australian doctor who made
the initial discovery finally proved the point: he drank a pure culture of H. pylori
and developed inflammation of the stomach, which often precedes the
development of ulcers. A short course of antibiotics can now effectively cure a patient
of recurrent stomach ulcers. Remarkably, H. pylori is able to persist for life as a
commensal in most humans. One way in which it survives in the stomach is by
producing the enzyme urease, which converts urea to ammonia that neutralizes
the acid in its immediate vicinity. The bacterium also uses its flagellum for
chemotactic motility, allowing it to seek out the more neutral pH near the surface
of gastric epithelial cells. H. pylori virulence proteins that target both epithelial
and immune cells help H. pylori persist in the stomach, but they can also induce
chronic inflammation, alteration in host gene expression, changes in cell
proliferation and apoptosis, and disruption of cell-cell junctions, all of which are
predisposing factors for stomach cancer.

Extracellular Pathogens Disturb Host Cells Without Entering Them 

Extracellular pathogens can cause serious disease without entering host cells. 
Bordetella pertussis, the bacterium that causes whooping cough, for example, col- 
onizes the respiratory epithelium and circumvents the normal mechanism that 


Figure 23-15 Pathogenic E. coli in
the infected bladder of a mouse.
(A) Scanning electron micrograph of
uropathogenic E. coli, a common cause of
bladder and kidney infections. The bacteria
are attached to the surface of epithelial
cells lining the infected bladder. (B) A
close-up view of one of the bacteria showing
the pili on its surface. (C) An E. coli pilus
has adaptor proteins on its tip that bind to
glycolipids on the surface of kidney cells.
(A, from G.E. Soto and S.J. Hultgren,
J. Bacteriol. 181:1059-1071, 1999. With
permission from the American Society for
Microbiology; B, courtesy of D.G. Thanassi
and S.J. Hultgren, Methods 20:111-126,
2000. With permission from Academic
Press.)

clears the respiratory tract by expressing adhesins that bind ciliated epithelial
cells. The adherent bacteria produce toxins that eventually kill the ciliated cells,
compromising the host's ability to clear the infection. The most familiar of these
is pertussis toxin, which, like the cholera toxin discussed above, has an A subunit
that ADP-ribosylates the α subunit of the G protein Gi, inhibiting the G protein
from suppressing the activity of the host cell's adenylyl cyclase, thereby increasing
the production of cyclic AMP (see Figure 23-6). This toxin also interferes with the
chemotactic pathway that neutrophils use to seek out and destroy invading
bacteria (see Figures 16-3 and 16-86). B. pertussis colonization of the respiratory tract
causes severe coughing, which helps spread the infection.

Not all extracellular pathogens that colonize an epithelium exert their effect
through toxins. Enteropathogenic E. coli (EPEC), which causes diarrhea in young
children, uses a type III secretion system (see Figure 23-7) to deliver its own
special receptor protein (called Tir) into the plasma membrane of a host intestinal
epithelial cell (Figure 23-16). The extracellular domain of Tir binds to the
bacterial surface protein intimin, triggering actin polymerization in the host cell that
results in the formation of a unique cell-surface protrusion called a pedestal; this
pushes the tightly adherent bacteria up about 10 μm from the host-cell
membrane, thereby promoting bacterial movement along the cell surface. A similar
strategy is used by vaccinia virus (the virus that was used as a vaccine to eradicate
smallpox) to form mobile pedestals, which promote spread of the virus from cell
to cell. The study of how EPEC and vaccinia virus promote actin polymerization
has been of major importance in understanding how intracellular signaling
pathways regulate the cytoskeleton in normal, uninfected cells (discussed in Chapter
16). Although pedestal formation promotes the spread of these pathogens, the
symptoms of EPEC infection (severe diarrhea) are caused by the loss of absorptive
microvilli and disruption of signaling pathways in epithelial cells, which are
triggered by Tir and other effector proteins.

Intracellular Pathogens Have Mechanisms for Both Entering and
Leaving Host Cells 

Many pathogens have to enter host cells to cause disease. These intracellular
pathogens include all viruses and many bacteria and protozoa. Each of these has
a preferred niche for replication and survival within host cells. Bacteria and
protozoa replicate either in the cytosol or within a membrane-enclosed compartment.
While most RNA viruses replicate within the cytosol, most DNA viruses replicate
in the nucleus. Life inside a host cell has several advantages. The pathogens are not
accessible to antibodies, nor are they easy targets for phagocytic cells (discussed
in Chapter 24); furthermore, intracellular bacteria and protozoa are bathed in a
rich source of nutrients, and viruses have access to the host cell's biosynthetic


Figure 23-16 Interaction of
enteropathogenic E. coli (EPEC) with
host intestinal epithelial cells. (A) When
EPEC contacts an epithelial cell in the
lining of the human gut, it delivers a
bacterial protein called Tir into the host
cell through a type III secretion system.
Tir then inserts into the plasma membrane
of the host cell, where it functions as a
receptor for the bacterial adhesin protein
intimin. Next, a host-cell protein tyrosine
kinase phosphorylates the intracellular
domain of Tir on tyrosines. Phosphorylated
Tir recruits host-cell proteins (including
an adaptor protein, a WASp protein,
and the Arp 2/3 complex) that trigger
actin polymerization (see Figure 16-16).
Consequently, a branched network of
actin filaments assembles underneath
the bacterium, forming an actin pedestal
(Movie 23.4). (B) EPEC on a pedestal. In
this fluorescence micrograph, the DNA of
the EPEC and host cell is labeled in blue,
Tir protein is labeled in green, and host-cell
actin filaments are labeled in red. The
inset shows a close-up view of the two
upper bacteria on pedestals. (B, from
D. Goosney et al., Annu. Rev. Cell Dev.
Biol. 16:173-189, 2000. With permission
from Annual Reviews.)

machinery for their reproduction. This lifestyle, however, requires that the
pathogen have mechanisms for entering host cells, for finding a suitable subcellular
niche where it can replicate, and for exiting from the infected cell to spread the
infection. Below we consider some of the myriad ways that individual
intracellular pathogens exploit and modify host cell biology to satisfy these requirements.

Viruses Bind to Virus Receptors at the Host Cell Surface 

The first step for any intracellular pathogen is to bind to the surface of the host
target cell. Viruses accomplish this by the binding of viral surface proteins to virus
receptors displayed on the host cell. The first virus receptor identified was an
E. coli surface protein that is recognized by the bacteriophage lambda; the protein
normally functions to transport the sugar maltose from outside the bacterium to
the inside where it is used as an energy source. Receptors need not be proteins,
however: an envelope protein of herpes simplex virus, for example, binds to
heparan sulfate proteoglycans (discussed in Chapter 19) on the surface of certain
vertebrate host cells, and simian virus 40 (SV40) binds to a glycolipid. The specificity
of virus-receptor interactions often serves as a barrier preventing the spread of a
virus from one species to another. Acquiring the ability to bind to a new receptor
often requires multiple changes in a virus, but it can be crucial in allowing the
cross-species transmission that can result in new disease outbreaks.

Viruses that infect animal cells generally exploit cell-surface receptor
molecules that are either ubiquitous (such as the sialic-acid-containing
oligosaccharides used by the influenza virus) or found uniquely on those cell types in which
the virus replicates (such as the neuron-specific proteins used by rabies virus).
Although a virus usually uses a single type of host-cell receptor, some viruses use
more than one type. An important example is HIV-1, which requires two types of
receptors to enter a host cell. Its primary receptor is CD4, a cell-surface protein on
helper T cells and macrophages that is involved in immune recognition (discussed
in Chapter 24). It also requires a co-receptor, which is either CCR5 (a receptor
for β-chemokines) or CXCR4 (a receptor for α-chemokines), depending on the
particular variant of the virus; macrophages are susceptible only to HIV variants
that use CCR5 for entry, whereas helper T cells are most efficiently infected by
variants that use CXCR4 (Figure 23-17). The viruses that are found within the first
few months after HIV infection almost invariably use CCR5, which explains why
individuals that carry a defective CCR5 gene are less susceptible to HIV infection.
In the later stages of infection, viruses often either switch to use CXCR4 or adapt
to use both co-receptors through the accumulation of mutations; in this way, the
virus can change the cell types it infects as the disease progresses. It may seem
paradoxical that viruses would infect immune cells, as we might expect that virus
binding would trigger an immune response; but invasion of an immune cell can
be a useful way for a virus to weaken the immune response and travel around the
body to infect other immune cells.
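
The receptor logic described above can be written as a small rule: entry requires CD4 plus a co-receptor that the viral variant is able to use. The Python sketch below is a deliberately simplified illustration of that rule, not a quantitative model. The class names, the variant labels, and the receptor sets assigned to each cell type are assumptions chosen to mirror the text, and infection efficiency is reduced to a yes/no answer.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    name: str
    receptors: frozenset  # surface proteins displayed by the cell


@dataclass(frozen=True)
class HivVariant:
    name: str
    co_receptors: frozenset  # co-receptors this variant can use


def can_enter(variant: HivVariant, cell: Cell) -> bool:
    """Entry needs CD4 and at least one co-receptor usable by the variant."""
    has_cd4 = "CD4" in cell.receptors
    has_usable_co_receptor = bool(variant.co_receptors & cell.receptors)
    return has_cd4 and has_usable_co_receptor


# Simplified receptor sets (assumption): each cell type carries only the
# co-receptor that the text associates with its infection.
macrophage = Cell("macrophage", frozenset({"CD4", "CCR5"}))
helper_t_cell = Cell("helper T cell", frozenset({"CD4", "CXCR4"}))
ccr5_defective_macrophage = Cell("CCR5-defective macrophage", frozenset({"CD4"}))

early_variant = HivVariant("CCR5-using", frozenset({"CCR5"}))
late_variant = HivVariant("CXCR4-using", frozenset({"CXCR4"}))
dual_variant = HivVariant("dual-tropic", frozenset({"CCR5", "CXCR4"}))

if __name__ == "__main__":
    cells = (macrophage, helper_t_cell, ccr5_defective_macrophage)
    for variant in (early_variant, late_variant, dual_variant):
        for cell in cells:
            print(f"{variant.name} -> {cell.name}: {can_enter(variant, cell)}")
```

Under these assumptions the CCR5-using variant typical of early infection cannot enter a cell that lacks functional CCR5, which is the behavior the text gives as the reason for reduced susceptibility in carriers of a defective CCR5 gene.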



Figure 23-17 Receptor and co-receptors
for HIV. All strains of HIV require the
CD4 protein as a primary receptor. Early
in an infection, most of the viruses use
CCR5 as a co-receptor, allowing them to
infect macrophages and their precursors,
monocytes. As the infection progresses,
mutant variants arise that now use CXCR4
as a co-receptor, enabling them to infect
helper T cells efficiently. The natural ligand
for the chemokine receptors (Sdf1 for
CXCR4; Rantes, Mip1α, or Mip1β for
CCR5) blocks co-receptor function and
prevents viral invasion.

Viruses Enter Host Cells by Membrane Fusion, Pore Formation, or
Membrane Disruption 

After recognition and attachment to the host cell surface, the virus must enter the
cell to replicate. Some enveloped viruses enter the host cell by fusing their
envelope membrane with the plasma membrane. Most viruses, whether enveloped or
nonenveloped, activate signaling pathways in the cell that induce endocytosis,
commonly via clathrin-coated pits (see Figure 13-7), leading to internalization
into endosomes. Large viruses that do not fit into clathrin-coated vesicles, such as
poxviruses, often enter cells by macropinocytosis, a process by which membrane
ruffles fold over and entrap fluid into macropinosomes (see Figure 13-50). Once
inside endosomes, fusion of the viral envelope occurs from the lumenal side of
the endosome membrane. The mechanism of membrane fusion mediated by
viral spike glycoproteins has similarities with SNARE-mediated membrane fusion
during normal vesicular trafficking (discussed in Chapter 13).

Enveloped viruses regulate fusion both to ensure that they fuse only with the
appropriate host cell membrane and to prevent fusion with one another. For
viruses such as HIV-1 that fuse at neutral pH with the plasma membrane (Figure
23-18A), binding to receptors or co-receptors usually triggers a conformational
change in a viral envelope protein that exposes a normally buried fusion peptide
(see Figure 13-21). Other enveloped viruses, such as influenza A virus, only fuse
with a host cell membrane after endocytosis (Figure 23-18B); in this case, it is
frequently the acid environment in the late endosome that triggers the
conformational change in a viral surface protein that exposes the fusion peptide. The H+

Figure 23-18 Four virus entry strategies.
(A) Some enveloped viruses, such as HIV, 
fuse directly with the host-cell plasma 
membrane to release their RNA genome 
(blue) and capsid proteins (brown) into the 
cytosol. (B) Other enveloped viruses, such 
as influenza virus, first bind to cell-surface 
receptors, triggering receptor-mediated 
endocytosis; when the endosome acidifies, 
the virus envelope fuses with the endosomal 
membrane, releasing the viral RNA genome 
(blue) and capsid proteins (brown) into the 
cytosol. (C) Poliovirus, a nonenveloped virus, 
induces receptor-mediated endocytosis, 
and then forms a pore in the endosomal 
membrane to extrude its RNA genome 
(blue) into the cytosol. (D) Adenovirus, 
another nonenveloped virus, uses a more 
complicated strategy: it induces receptor- 
mediated endocytosis and then disrupts the 
endosomal membrane, releasing the capsid 
and its DNA genome into the cytosol; the 
trimmed-down virus eventually docks onto 
a nuclear pore and releases its DNA (red) 
directly into the nucleus (Movie 23.5).

pumped into the early endosome also has another effect; it enters the influenza
virion through an ion channel in the viral envelope and triggers changes in the 
viral capsid. These priming steps allow the capsids to disassemble once released 
into the cytosol after virus fusion with the late endosomal membrane. 

Nonenveloped viruses use different strategies to enter host cells, strategies
that do not rely on membrane fusion. Poliovirus, which causes poliomyelitis, binds
to a cell-surface receptor, triggering both receptor-mediated endocytosis (see
Figure 13-52) and a conformational change in the viral particle. The conformational
change exposes a hydrophobic projection on one of the capsid proteins, which
inserts into the endosomal membrane to form a pore. The viral RNA genome then
enters the cytosol through the pore, leaving the capsid in the endosome (Figure
23-18C). Other nonenveloped viruses such as adenovirus disrupt the endosomal
membrane after they are taken up by receptor-mediated endocytosis. One of the
proteins released from the capsid lyses the endosomal membrane, releasing the
remainder of the virus into the cytosol. During endosomal trafficking and
subsequent transport within the cytosol, adenoviruses undergo multiple uncoating
steps, which sequentially remove structural proteins and ready the virus particles
to release their DNA into the nucleus through nuclear pore complexes (Figure
23-18D).

Bacteria Enter Host Cells by Phagocytosis 

Bacteria are much larger than viruses—too large to be taken up either through
pores or by receptor-mediated endocytosis. Instead, they enter host cells by
phagocytosis, which is a normal function of phagocytes such as neutrophils, macrophages,
and dendritic cells (discussed in Chapter 24). These phagocytes patrol
the tissues of the body and ingest and destroy microbes; however, some intracellular
bacterial pathogens such as M. tuberculosis use this to their advantage and
have evolved to survive and multiply inside macrophages.

Some bacterial pathogens can invade host cells that are normally nonphagocytic.
One way they do so is by expressing an invasion protein that binds with high
affinity to a host-cell receptor, which is often a cell-cell or cell-matrix adhesion
protein (discussed in Chapter 19). For example, Yersinia pseudotuberculosis (a
bacterium that causes diarrhea and is a close relative of the plague bacterium
Y. pestis) expresses a protein called invasin that has an RGD motif that is similar
to fibronectin's and likewise is recognized by host-cell β1 integrins (see Figure
19-55). Listeria monocytogenes, which causes a rare but serious form of food
poisoning, invades host cells by expressing a protein that binds to the cell-cell
adhesion protein E-cadherin (see Figure 19-6). For both these bacterial species,
binding of the bacterial invasion proteins to the host cell adhesion proteins stimulates
signaling through members of the Rho family of small GTPases (discussed
in Chapter 16). This in turn activates proteins in the WASp family and the Arp 2/3
complex, leading to actin polymerization at the site of bacterial attachment. Actin
polymerization, together with the assembly of a clathrin coat (see Figure 13-6),
drives the advancement of the host cell's plasma membrane over the adhesive
surface of the microbe, resulting in the phagocytosis of the bacterium—a process
known as the zipper mechanism of invasion (Figure 23-19A).

A second pathway by which bacteria can invade nonphagocytic cells is known 
as the trigger mechanism (Figure 23-19B). It is used by various pathogens that 
cause food poisoning, including Salmonella enterica, and it is initiated when the 
bacterium injects a set of effector molecules into the host-cell cytosol through
a type III secretion system (see Figure 23-7). Some of these effector molecules
activate Rho family proteins, which in turn stimulate actin polymerization, as just
discussed. Other bacterial effector proteins interact with host-cell cytoskeletal
elements more directly, nucleating and stabilizing actin filaments and causing
the rearrangement of actin cross-linking proteins. The overall effect is to cause the
formation of localized ruffles on the surface of the host cell (Figure 23-19C and
D), which fold over and engulf the bacteria by a process that resembles macropinocytosis.
The appearance of cells being invaded by use of the trigger mechanism
[Figure 23-19 panel labels: (A) zipper mechanism, showing invasin and its receptors; (B) trigger mechanism, showing type III secretion]

Figure 23-19 Mechanisms used by
bacteria to induce phagocytosis by host
cells that are normally nonphagocytic.
(A) In the zipper mechanism, bacteria
express an invasion protein that binds with
high affinity to a host-cell receptor, which
is often a cell-cell or cell-matrix adhesion
protein. (B) In the trigger mechanism,
bacteria inject a set of effector molecules
into the host-cell cytosol through a type III
secretion system called SPI1 (Salmonella
pathogenicity island 1), inducing membrane
ruffling. Both the zipper and trigger
mechanisms cause the polymerization of
actin at the site of bacterial attachment
by activating Rho family small GTPases
and the Arp 2/3 complex. (C) A scanning
electron micrograph showing a very early
stage of Salmonella enterica invasion
by the trigger mechanism. Bacteria
(pseudocolored yellow) are shown
surrounded by a small membrane ruffle.
(D) Fluorescence micrograph showing that
the large ruffles that engulf the Salmonella
bacteria are actin-rich. The bacteria are
labeled in green and actin filaments in red;
because of the color overlap, the bacteria
appear yellow. (C, from Rocky Mountain
Laboratories, NIAID, NIH; D, from
J.E. Galán, Annu. Rev. Cell Dev. Biol.
17:53-86, 2001. With permission from
Annual Reviews.)


is similar to the ruffling induced by some extracellular growth factors, suggesting 
that the bacteria exploit normal intracellular signaling pathways. 

Intracellular Eukaryotic Parasites Actively Invade Host Cells 

The uptake of viruses and bacteria into host cells is carried out largely by the host,
with the pathogen being a relatively passive participant. In contrast, intracellular
eukaryotic parasites, which are typically much larger than other types of intracellular
pathogens, invade host cells through a variety of complex pathways that
usually require energy expenditure by the parasite.

Toxoplasma gondii, a cat parasite that also causes occasional serious human
infections, is an example. When this protozoan contacts a host cell, it protrudes
an unusual microtubule-based structure called a conoid, which facilitates entry
into the host cell (Figure 23-20). The energy for invasion seems to come from
actin polymerization in the parasite rather than from the host cytoskeleton, and invasion
also requires at least one unusual parasite myosin motor protein (Class XIV; see
Figure 16-40). At the point of contact, the parasite discharges effector proteins
from secretory organelles into the host cell, and these proteins target various host
pathways to enable invasion, to block an innate immune response, and to promote
survival. As the parasite moves into the host cell, a membrane derived from the
host-cell plasma membrane surrounds it. Remarkably, the parasite removes host
transmembrane proteins from the surrounding membrane as it forms, so that
the parasite is protected in a membrane-enclosed compartment that does not
fuse with lysosomes and does not participate in host-cell membrane trafficking
processes (see Figure 23-20). The specialized membrane is selectively porous: it
allows the parasite to take up small metabolic intermediates and nutrients from
the host cell's cytosol but excludes macromolecules. Malaria parasites invade
human red blood cells using a similar mechanism.
The protozoan Trypanosoma cruzi, which causes Chagas disease in Mexico
and Central and South America, uses two alternative invasion strategies. In a
lysosome-dependent pathway, the parasite attaches to host cell-surface receptors,
inducing a local increase in Ca²⁺ in the host cell's cytosol. The Ca²⁺ signal recruits
lysosomes to the site of parasite attachment, and the lysosomes fuse with the host
cell's plasma membrane, allowing the parasites rapid access to the lysosomal
compartment (Figure 23-21). In a lysosome-independent pathway, the parasite
penetrates the host-cell plasma membrane by inducing the membrane to invaginate,
without the recruitment of lysosomes.


[Figure 23-20 labels: parasite in vacuole; host-cell nucleus]


Figure 23-20 The life cycle of the
intracellular parasite Toxoplasma gondii.
(A) After attachment to a host cell, T. gondii
uses its conoid to inject effector proteins
that facilitate invasion. As the host cell's
plasma membrane invaginates to surround
the parasite, it somehow removes the
normal host-cell membrane proteins, so that
the compartment (shown in red) does not
fuse with lysosomes. After several rounds
of replication, the parasite causes the
compartment to break down and the host
cell to lyse, releasing the progeny parasites
to infect other host cells (Movie 23.6).
(B) Light micrograph of T. gondii replicating
within a membrane-enclosed compartment
(a vacuole) in a cultured cell. (B, courtesy of
Manuel Camps and John Boothroyd.)
Figure 23-21 The two alternative strategies that Trypanosoma cruzi uses to invade host cells. In the lysosome-dependent pathway (left),
T. cruzi recruits host-cell lysosomes to its site of attachment to the host cell. The lysosomes fuse with the invaginating plasma membrane to create
an intracellular compartment constructed almost entirely of lysosomal membrane. After a brief stay in the compartment, the parasite secretes a
pore-forming protein that disrupts the surrounding membrane, thereby allowing the parasite to escape into the host-cell cytosol and proliferate.
In the lysosome-independent pathway (right), the parasite induces the host plasma membrane to invaginate and pinch off without recruiting
lysosomes; then lysosomes fuse with the endosome prior to the parasite's escape into the cytosol.
Figure 23-22 Choices that an
intracellular pathogen faces. After
entry into a host cell, generally through
phagocytosis into a membrane-enclosed
compartment, intracellular pathogens can
use one of three strategies to survive and
replicate. Pathogens that follow strategy
(1) include all viruses, Trypanosoma cruzi,
Listeria monocytogenes, and Shigella
flexneri. Those that follow strategy (2)
include Mycobacterium tuberculosis and
Legionella pneumophila. Those that follow
strategy (3) include Salmonella enterica,
Coxiella burnetii, and Leishmania.


Some Intracellular Pathogens Escape from the Phagosome into 
the Cytosol 

The intracellular parasites just discussed raise a general problem that faces all
intracellular pathogens, including viruses, bacteria, and eukaryotic parasites:
they must find a cell compartment in which they can replicate. After their endocytosis
by a host cell, they usually find themselves in an endosomal compartment,
which normally would fuse with lysosomes to form a phagolysosome—a dangerous
place for pathogens. To survive, pathogens use a variety of strategies. Some
escape from the endosomal compartment before such fusion. Others remain in
the endosomal compartment but modify it so that it no longer fuses with lysosomes.
Still others have evolved to weather the harsh conditions in the phagolysosome
(Figure 23-22).

Trypanosoma cruzi uses the escape route by secreting a pore-forming toxin 
that lyses the lysosome membrane, releasing the parasite into the host cell's cytosol
(see Figure 23-21). The bacterium Listeria monocytogenes uses a similar strategy.
Following phagocytosis by the zipper mechanism, it secretes a protein called
listeriolysin O, which disrupts the phagosomal membrane, releasing the bacteria 
into the cytosol (Figure 23-23). 

Many Pathogens Alter Membrane Traffic in the Host Cell to Survive 
and Replicate 

The survival and reproduction of many intracellular pathogens require that they
modify membrane (vesicular) traffic in the host cell. They may, for example, prevent
the normal fusing of endosomes with lysosomes, or adapt themselves to



Figure 23-23 Escape of Listeria monocytogenes by selective destruction of the phagosomal membrane. The bacterium
attaches to E-cadherin on the surface of host epithelial cells and induces its own uptake by the zipper mechanism (see
Figure 23-19A). Within the phagosome, the bacterium secretes the protein listeriolysin O, which is activated at pH <6 and
forms oligomers in the phagosome membrane, thereby creating large pores and eventually disrupting the membrane. Once
in the host-cell cytosol, the bacteria begin to replicate and continue to secrete listeriolysin O; because the pH in the cytosol
is >6, however, the listeriolysin O there is inactive and is also rapidly degraded by proteasomes. Thus, the host cell's plasma
membrane remains intact.
Figure 23-24 Modifications of
membrane traffic in host cells by
bacterial pathogens. Intracellular bacterial
pathogens, including Mycobacterium
tuberculosis, Salmonella enterica, and
Legionella pneumophila, all replicate in
membrane-enclosed compartments, but
the compartments differ. M. tuberculosis
remains in a compartment that has early
endosomal markers and continues to
communicate with the plasma membrane
via transport vesicles. S. enterica replicates
in a compartment that has late endosomal
markers and does not communicate with
the plasma membrane. L. pneumophila
replicates in an unusual compartment that
is wrapped in rough endoplasmic reticulum
(ER) membrane and communicates with
the ER via transport vesicles. TGN, trans
Golgi network.


resist the lysosome's antimicrobial armaments. Intracellular pathogens must also 
provide a pathway for importing nutrients from the host cytosol into their compartment
of choice.

Different pathogens have distinct strategies for altering membrane traffic in
the host cell (Figure 23-24). M. tuberculosis prevents the early endosome that
contains the bacteria from maturing, so the endosome never acidifies or acquires
the other characteristics of a late endosome or lysosome. This strategy requires
the activity of its type VII secretion system, as well as mycobacterial lipid products
that mimic host lipids and influence vesicular traffic. Phagosomes containing Salmonella
enterica, in contrast, acidify and acquire markers of late endosomes and
lysosomes, but the bacteria slow the process of phagosomal maturation. They do
so by injecting effector proteins through a second type III secretion system. These
effectors activate host kinesin motor proteins to pull membrane tubules outward
from the phagosome along cytoplasmic microtubules, forming a specialized compartment
called the Salmonella-containing vacuole (Figure 23-25).

Other bacteria seem to find shelter in intracellular compartments that are distinct
from those of the usual endocytic system. One example is Legionella pneumophila,
which was first recognized as a human pathogen in 1976, when it was
found to be the cause of a type of pneumonia known as Legionnaires' disease.
L. pneumophila is normally a parasite of freshwater amoebae, but it is commonly



Figure 23-25 Salmonella enterica
residing in a modified phagosomal
compartment called the Salmonella-containing
vacuole. These bacteria invade
the host cell using an SPI1 type III secretion
system to inject effector proteins that
induce the trigger mechanism of microbe
entry illustrated in Figure 23-19B.
(A) Following its engulfment into a
phagosome, the bacterium inactivates its
SPI1 type III secretion system and activates
its SPI2 type III secretion system to inject
different effector proteins, which remodel
the phagosome into the specialized
Salmonella-containing vacuole. One of the
injected effector proteins activates host
kinesin motor proteins to pull membrane
tubules outward toward the plus ends
of the microtubules (see Figure 16-42).
(B) Fluorescence micrograph showing
S. enterica in a Salmonella-containing
vacuole. The bacteria are stained green,
the microtubules red, and the nucleus blue.
(B, courtesy of Stéphane Meresse.)
Figure 23-26 Legionella pneumophila residing in a compartment with characteristics similar to those of the rough
endoplasmic reticulum (ER). (A) Electron micrograph showing the unusual coiled structure that the Legionella pneumophila
bacterium induces on the surface of a phagocyte during the invasion process. Some other pathogens, including the bacterium
Borrelia burgdorferi, which causes Lyme disease, the eukaryotic pathogen Leishmania, and the yeast Candida albicans, can
also invade cells using this type of coiling phagocytosis. (B) Following invasion, L. pneumophila uses its type IV secretion system
to secrete effector proteins that block phagosome-endosome fusion and phagosome maturation. It also secretes effector
proteins that promote the fusion of the phagosome with ER-derived vesicles, thereby creating a Legionella-containing vacuole
with characteristics similar to the rough ER. (A, from M.A. Horwitz, Cell 36:27-33, 1984. With permission from Elsevier.)


spread to humans by central air-conditioning systems, which harbor infected
amoebae and produce microdroplets of water that are easily inhaled. Once in
the lung, the bacteria are engulfed by macrophages by an unusual process called
coiling phagocytosis (Figure 23-26A). L. pneumophila uses a type IV secretion
system to inject effector proteins into the phagocyte that modulate the activity
of proteins that regulate vesicular traffic, including SNARE proteins and Rab and
Arf family small GTPases (discussed in Chapter 13). The effector proteins thereby
prevent the phagosome from fusing with endosomes and promote its fusion with
vesicles derived from the endoplasmic reticulum, converting the phagosome into
a compartment that resembles the rough endoplasmic reticulum (Figure 23-26B).

Viruses can also alter membrane traffic in the host cell. Enveloped viruses 
make use of host cell membranes to acquire their own envelope membrane. In
the simplest cases, virally encoded glycoproteins are inserted into the endoplasmic
reticulum membrane and follow the secretory pathway through the Golgi
apparatus to the plasma membrane; the viral capsid proteins and genome assemble
into nucleocapsids, which acquire their envelope as they bud off from the
plasma membrane (see Figure 23-12). This mechanism is used by many enveloped
viruses including HIV-1. Other enveloped viruses such as herpesviruses and
vaccinia virus acquire their lipid envelopes in more complex ways (Figure 23-27).

Viruses and Bacteria Use the Host-Cell Cytoskeleton for 
Intracellular Movement 

As mentioned earlier, many pathogens escape into the cytosol rather than 
remaining in a membrane-enclosed compartment. The cytosol of mammalian 
cells is extremely viscous, as it is crowded with protein complexes, organelles, and 
cytoskeletal filaments, all of which inhibit the diffusion of particles the size of a
bacterium or a viral nucleocapsid. Thus, to reach a particular region of the host
cell a pathogen must be actively moved there. As with transport of intracellular 
organelles, pathogens generally use the host cell's cytoskeleton for their active 
movement. 
Figure 23-27 Complex strategies for
viral envelope acquisition. (A) Herpesvirus
nucleocapsids assemble in the nucleus
and then bud through the inner nuclear
membrane into the space between the
inner and outer nuclear membranes,
acquiring a lipid bilayer membrane coat.
The virus particles then apparently lose this
coat when they fuse with the endoplasmic
reticulum membrane to escape into the
cytosol. Subsequently, the nucleocapsids 
bud into the Golgi apparatus and bud 
out again on the other side, thereby 
acquiring two new membrane coats in 
the process. The virus then buds from 
the cell surface with a single membrane 
when its outer membrane fuses with the 
plasma membrane. (B) Vaccinia virus 
(which is closely related to the virus that 
causes smallpox and is used to vaccinate 
against smallpox) assembles in “replication
factories” in the cytosol, far away from the 
plasma membrane. The immature virion, 
with one membrane, is then surrounded 
by two additional membranes, both 
acquired from the Golgi apparatus by a 
poorly understood wrapping mechanism, 
to form the intracellular enveloped virion. 
After fusion of the outermost membrane 
with the host-cell plasma membrane, the 
extracellular enveloped virion is released 
from the host cell. 


Several bacteria that replicate in the host cell's cytosol have adopted a remarkable
mechanism that depends on actin polymerization for movement. These
bacteria include the human pathogens Listeria monocytogenes, Shigella flexneri,
Rickettsia rickettsii (which causes Rocky Mountain spotted fever), and Burkholderia
pseudomallei (which causes melioidosis, a disease characterized by severe
respiratory symptoms). Baculovirus, an insect virus, also uses this mechanism for
intracellular movement. All of these pathogens induce the nucleation and assembly
of host-cell actin filaments at one pole of the bacterium or virus. The growing
filaments generate force and push the pathogens through the cytosol at rates of
up to 1 μm/sec (Figure 23-28). New filaments form at the rear of each pathogen
and are left behind like a rocket trail as the microbe advances; the filaments
depolymerize within a minute or so as they encounter depolymerizing factors in
the cytosol. For L. monocytogenes and S. flexneri, the moving bacteria collide with
the plasma membrane and move outward, inducing the formation of long, thin,
host-cell protrusions with the bacteria at their tip. As shown in Figure 23-28, a
neighboring cell often engulfs these projections, allowing the bacteria to enter the
neighbor's cytoplasm without exposure to the extracellular environment, thereby
avoiding antibodies produced by the host's adaptive immune system. For B. pseudomallei,
movement and collision of the bacteria with the plasma membrane promotes
cell-cell fusion, which serves a similar purpose of immune avoidance while
allowing continued bacterial replication.
Figure 23-28 The actin-based movement of bacterial pathogens
within and between host cells. (A) Following invasion, bacterial pathogens
such as Listeria monocytogenes, Shigella flexneri, Rickettsia rickettsii, and
Burkholderia pseudomallei induce the assembly of actin-rich tails in the
host-cell cytoplasm, which drives rapid bacterial movement. For most of
these pathogens, the moving bacteria collide with the host-cell plasma
membrane to form membrane-covered protrusions, which are engulfed by
neighboring cells—spreading the infection from cell to cell. In contrast, for
B. pseudomallei, collision with the plasma membrane promotes cell-cell
fusion, creating a conduit through which bacteria can invade neighboring
cells (Movie 23.7).


The molecular mechanisms of pathogen-induced actin assembly differ for the 
different pathogens, suggesting that they evolved independently (Figure 23-29). 
L. monocytogenes and baculovirus produce proteins that directly bind to and activate
the Arp 2/3 complex to initiate the formation of an actin tail and movement
(see Figure 16-16). S. flexneri produces an unrelated surface protein that binds to
and activates N-WASp, which then activates the Arp 2/3 complex. Rickettsia species
produce a protein that directly polymerizes actin by mimicking the function
of host formin proteins (see Figure 16-17).

Many viral pathogens rely primarily on microtubule-dependent motor proteins
rather than actin polymerization to move within the host-cell cytosol.
Viruses that infect neurons, such as the neurotropic alpha herpesviruses, which
include the virus that causes chickenpox, provide important examples. The virus
enters sensory neurons at the tips of their axons, and microtubule-based retrograde
“backward” axonal transport carries the nucleocapsids down the axon to
the nucleus. The transport is mediated by attachment of viral capsid proteins to
the motor protein dynein (see Figure 16-58). After replication and assembly in the
nucleus, the enveloped virions are then carried by antegrade “forward” axonal
transport along microtubules to the axon tips, with the transport being mediated
by the attachment of a different viral capsid protein to a kinesin motor protein (see
Figure 16-56). A large number of viruses associate with either dynein or kinesin
motor proteins to move along microtubules at some stage in their replication. As
microtubules serve as oriented tracks for vesicular transport in eukaryotic cells,
it is not surprising that many viruses have independently evolved the ability to
exploit them for their own transport.


Viruses Can Take Over the Metabolism of the Host Cell

Viruses use basic host cell machinery for most aspects of their reproduction: they
depend on host-cell ribosomes to produce their proteins, and most use host-cell
DNA and RNA polymerases for their own replication and transcription. Many
viruses encode proteins that modify the host transcription or translation apparatus
to favor the synthesis of viral RNAs and proteins over those of the host cell,
shifting the synthetic capacity of the cell toward the production of new virus particles.
Poliovirus, for example, encodes a protease that specifically cleaves the
TATA-binding component of TFIID (see Figure 6-17), shutting off transcription
of most of the host cell's protein-coding genes. Influenza virus produces a protein
that blocks both the splicing and the polyadenylation of host-cell RNA transcripts,
preventing their export into the cytosol (see Figure 6-38).

Viruses also alter translation by the host. Translation initiation for most host-cell
mRNAs depends on recognition of their 5' cap by translation initiation factors
(see Figure 6-70). This initiation process is often inhibited during viral infection,
so that the host-cell ribosomes can be used more efficiently for the synthesis of
viral proteins. Some viral genomes encode endonucleases that cleave off the 5'
cap from host-cell mRNAs; some go even further by using the liberated 5' caps as
primers to synthesize viral mRNAs, a process called cap snatching. Several other
viral RNA genomes encode proteases that cleave certain translation initiation factors;
these viruses rely on 5' cap-independent translation of their own RNA, using
internal ribosome entry sites (IRESs) (see Figure 7-68).
Figure 23-29 Molecular mechanisms
for actin nucleation by various bacterial
pathogens. Listeria monocytogenes and
Shigella flexneri induce actin nucleation
by recruiting and activating the host Arp
2/3 complex (see Figure 16-16), although
each uses a different recruitment strategy:
L. monocytogenes expresses a surface
protein, ActA, that directly binds
to and activates the Arp 2/3 complex;
S. flexneri expresses a surface protein,
IcsA (unrelated to ActA), that recruits the
host protein N-WASp, which in turn recruits
the Arp 2/3 complex, along with other host
proteins, including WIP (WASp-interacting
protein). Rickettsia rickettsii uses an entirely
different strategy; it expresses a surface
protein, Sca2, that directly nucleates actin
polymerization by mimicking the activity of
host formin proteins.


A few DNA viruses use host-cell DNA polymerase to replicate their genome.
Unfortunately for these viruses, DNA polymerase is expressed at high levels only
during S phase of the cell cycle, and most cells that these viruses infect spend
most of their time in G1 phase. Adenovirus has evolved a mechanism to drive the
host cell into S phase, so that the cell produces large amounts of active DNA polymerase,
which then replicates the viral genome; to accomplish this, the adenovirus
genome also encodes proteins that inactivate both Rb (see Figure 17-61) and
p53 (see Figure 17-62), two key suppressors of cell-cycle progression. As might be
expected for any mechanism that encourages unregulated DNA replication, these
viruses can promote, under some circumstances, the development of cancer.
Other DNA viruses, including poxviruses and mimivirus, encode their own DNA
and RNA polymerases, as well as some transcription regulators, allowing them to
bypass usual host pathways and replicate outside the nucleus.

RNA viruses must always encode their own replication proteins because host
cells lack polymerase enzymes that use RNA as a template. For RNA viruses with
a single-stranded genome, the replication strategy depends on whether the RNA
is a positive [+] strand, which contains translatable information like mRNA, or a
complementary negative [-] strand. When the RNA is a positive [+] strand, the
incoming viral genome is used to produce the viral RNA polymerase and viral proteins;
the viral polymerase is then used to replicate the viral RNA and to generate
mRNAs for the production of more viral proteins. For viruses with a negative [-]
strand RNA genome (such as influenza and measles virus), an RNA polymerase
enzyme is packaged as a structural protein of the incoming viral capsids.

Retroviruses such as HIV-1, which have a positive [+] strand RNA genome, are
a special class of RNA virus because they carry with them a viral reverse transcriptase
enzyme. After entry into the host cell, the reverse transcriptase uses the viral
RNA genome as a template to synthesize a double-stranded DNA copy of the viral
genome, which enters into the nucleus and integrates into the host cell's chromosomes
(see Figure 5-62). It is later transcribed by the cell's DNA-dependent RNA
polymerase to produce viral genomes and proteins.
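
The three RNA-virus strategies just described can be condensed into a small lookup table. The following Python sketch is only a summary aid: the dictionary keys, field names, and the helper function are illustrative labels chosen here, not standard nomenclature, and the table covers only the cases stated in the text.

```python
# Summary of the RNA-virus replication strategies described in the text.
# All keys and field names are illustrative labels (assumptions), not a standard scheme.
RNA_VIRUS_STRATEGIES = {
    "positive-strand RNA": {
        "first_step": "translate incoming genome to make the viral RNA polymerase",
        "enzyme_packaged_in_virion": False,
    },
    "negative-strand RNA": {
        "first_step": "packaged viral RNA polymerase copies the incoming genome",
        "enzyme_packaged_in_virion": True,
    },
    "retrovirus": {
        "first_step": "packaged reverse transcriptase makes a double-stranded DNA copy",
        "enzyme_packaged_in_virion": True,
    },
}


def needs_packaged_enzyme(genome_type):
    """Return True if the virion must carry its own polymerase into the host cell."""
    return RNA_VIRUS_STRATEGIES[genome_type]["enzyme_packaged_in_virion"]


for genome_type, info in RNA_VIRUS_STRATEGIES.items():
    print(f"{genome_type}: {info['first_step']}")
```

The design point the table makes explicit is that only a positive-strand genome can serve directly as mRNA, so only that class can afford to enter the cell without a pre-made polymerase.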

Pathogens Can Evolve Rapidly by Antigenic Variation 

The complexity and specificity of the interplay between pathogens and their host
cells might suggest that virulence would be difficult to acquire by random mutation.
Yet, new pathogens are constantly emerging, and old pathogens are constantly
changing in ways that make familiar infections more difficult to prevent
or treat. Pathogens have two advantages that enable them to evolve rapidly. First,
they replicate very quickly, providing a great deal of material for natural selection
to work with. Whereas humans and chimpanzees have acquired a 2% difference
in genome sequences over about 8 million years of divergent evolution, poliovirus
manages a 2% change in its genome in 5 days—about the time it takes the virus
to pass from the human mouth to the gut. Second, selective pressures act rapidly
on this genetic variation. The host's adaptive immune system and modern microbicidal
drugs, both of which destroy pathogens that fail to change, are the main
sources of these selective pressures.
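
The scale of the rate difference quoted above is easy to check with simple arithmetic. The Python sketch below assumes a 365.25-day year and treats both 2% figures as averages over the stated intervals; the function and variable names are illustrative.

```python
DAYS_PER_YEAR = 365.25  # assumption: average length of a calendar year


def divergence_per_day(fraction_changed, days):
    """Average fraction of the genome that changes per day over the interval."""
    return fraction_changed / days


# Human-chimpanzee divergence: 2% over about 8 million years.
primate_rate = divergence_per_day(0.02, 8e6 * DAYS_PER_YEAR)
# Poliovirus: 2% in about 5 days.
poliovirus_rate = divergence_per_day(0.02, 5)

fold_faster = poliovirus_rate / primate_rate
print(f"poliovirus changes about {fold_faster:.1e}-fold faster")
```

Under these assumptions the ratio is simply the number of days in 8 million years divided by 5, or roughly 6 × 10^8.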

An example of an adaptation to the selective pressure imposed by the adaptive
immune system is the phenomenon of antigenic variation. An important
adaptive immune response against many pathogens is the host's production of
antibodies that recognize specific molecules (antigens) on the pathogen's surface
(discussed in Chapter 24). Many pathogens have evolved mechanisms that deliberately
change these antigens during the course of an infection, enabling them to
evade antibodies. Some eukaryotic parasites, for example, undergo programmed
rearrangements of the genes encoding their surface antigens. A striking example
occurs in Trypanosoma brucei, a protozoan parasite that causes African sleeping
sickness and is spread by tsetse flies. (T. brucei is a relative of T. cruzi, shown in
Figure 23-21, but it replicates extracellularly rather than intracellularly.) T. brucei
is covered with a single type of glycoprotein, called variant-specific glycoprotein
(VSG), which elicits in the host a protective antibody response that rapidly clears
most of the parasites. The trypanosome genome, however, contains about 1000
different Vsg genes or pseudogenes, each encoding a VSG with a distinct amino
acid sequence. Only one of these genes is expressed at any one time, from one of
approximately 20 possible expression sites in the genome. Gene rearrangements
that copy different Vsg genes into expression sites repeatedly change the VSG
protein displayed on the surface of the pathogen. In this way, a few trypanosomes
with an altered VSG escape the initial antibody-mediated clearance, replicate,
and cause the disease to recur, leading to a chronic cyclic infection (Figure 23-30).
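
The cyclic course of infection described above can be illustrated with a deliberately simple toy model. The Python sketch below is not a quantitative model of trypanosome infection: the growth rate, switching rate, detection threshold, antibody delay, and killing efficiency are all hypothetical values chosen only to make the waves visible, and the function name simulate_infection is invented for this example.

```python
def simulate_infection(n_variants=5, steps=40, growth=2.0, switch=1e-3,
                       threshold=1e4, delay=3, kill=0.99):
    """Toy model: return a list of (step, dominant_variant, total_parasites)."""
    pop = [0.0] * n_variants
    pop[0] = 1.0                        # infection starts with a single VSG type
    detected_at = [None] * n_variants   # step at which each VSG first became abundant
    history = []
    for t in range(steps):
        # Antibodies act on a variant once `delay` steps have passed since detection.
        for v in range(n_variants):
            if detected_at[v] is not None and t >= detected_at[v] + delay:
                pop[v] *= (1.0 - kill)
        # All surviving parasites replicate.
        pop = [p * growth for p in pop]
        # A small fraction of each variant switches to the next VSG gene.
        for v in range(n_variants - 1):
            moved = pop[v] * switch
            pop[v] -= moved
            pop[v + 1] += moved
        # The host "detects" a variant once it becomes abundant.
        for v in range(n_variants):
            if detected_at[v] is None and pop[v] >= threshold:
                detected_at[v] = t
        dominant = max(range(n_variants), key=lambda v: pop[v])
        history.append((t, dominant, sum(pop)))
    return history


history = simulate_infection()
for t, dominant, total in history[::4]:
    print(f"step {t:2d}: dominant VSG #{dominant}, parasites ~ {total:.0f}")
```

In this sketch the parasite load rises, collapses when antibodies against the dominant VSG appear, and then rises again as a rare switched variant takes over, which mirrors the qualitative pattern described in the text.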

Figure 23-30 Antigenic variation in trypanosomes. (A) There are about 1000
distinct Vsg genes in Trypanosoma brucei, and they are expressed one at a time from
approximately 20 expression sites in the genome. To be expressed, an inactive
gene is copied and the copy is moved into an expression site through DNA
recombination. Each Vsg gene encodes a different surface protein (antigen). These
switching events allow the trypanosome to repeatedly change the surface antigen
it expresses. (B) A person infected with trypanosomes expressing VSGa mounts
a protective antibody response, which clears most of the parasites expressing
this antigen. However, a few of the trypanosomes will have switched to
expression of VSGb, and these can now proliferate until anti-VSGb antibodies
clear them. By that time, however, some parasites will have switched to VSGc, and
so the cycle continues.

Bacterial pathogens can also rapidly change their surface antigens. As discussed
in Chapter 5, Salmonella enterica bacteria switch between expressing
either of two versions of the protein flagellin, the structural component of the
bacterial flagellum (see Figure 23-3D), in a process called phase variation (see Figure
5-65). Species of the genus Neisseria are also champions at this. These Gram-negative
cocci can cause meningitis and sexually transmitted diseases. They undergo
genetic recombination very similar to that just described for eukaryotic pathogens,
which enables them to vary the pilin protein they use to attach to host cells.
By inserting one of the multiple silent copies of variant pilin genes into a single
expression locus, they can express many slightly different versions of the protein
and repeatedly change the amino acid sequence over time. Neisseria bacteria are
also extremely adept at taking up DNA from their environment by natural
transformation and incorporating it into their genomes, further contributing to their
extraordinary variability. The end result of this considerable variation is a plethora
of different surface compositions with which to bewilder the host adaptive
immune system. It is therefore not surprising that it has been difficult to develop
an effective vaccine against Neisseria infections, although there are now several
that protect against Neisseria meningitidis, a common cause of fatal meningitis.


Error-Prone Replication Dominates Viral Evolution

In contrast to the DNA rearrangements in bacteria and parasites, viruses rely on an
error-prone replication mechanism for antigenic variation. Retroviral genomes,
for example, acquire on average one point mutation every replication cycle,
because the viral reverse transcriptase (see Figure 5-62) needed to produce DNA
from the viral RNA genome lacks the proofreading activity of DNA polymerases. A
typical, untreated HIV infection may eventually produce HIV genomes with every
possible point mutation. By a process of mutation and selection within each host,
most viruses change over time, from a form that is most efficient at infecting
macrophages to one more efficient at infecting T cells, as described earlier (see Figure
23-17). Similarly, once a patient is treated with an antiviral drug, the viral genome
can quickly mutate and be selected for its resistance to the drug. Remarkably, only
about one-third of the nucleotide positions in the coding sequence of the viral
genome are invariant, and nucleotide sequences in some parts of the genome,
such as the Env gene (see Figure 7-62), can differ by as much as 30% from one
HIV isolate to another. This extraordinary genomic plasticity greatly complicates
attempts to develop vaccines against HIV. It has also led to the rapid emergence
of new HIV strains. Nucleotide sequence comparisons between various strains
of HIV and the very similar simian immunodeficiency virus (SIV) isolated from a
variety of monkey species suggest that the most virulent type of HIV, HIV-1, may
have jumped from primates to humans multiple independent times, starting as
long ago as 1908 (Figure 23-31).

Influenza viruses are an important exception to the rule that error-prone
replication dominates viral evolution. They are unusual in that their genome consists
of several (usually eight) strands of RNA. When two strains of influenza infect the
same host, the RNA strands of the two strains can reassort to form a new type
of influenza virus. In normal years, influenza is a mild disease in healthy adults,
although it can be life-threatening in the very young and very old. Different
influenza strains infect fowl such as ducks and chickens, but only a subset of these
strains can infect humans, and transmission from fowl to humans is rare. In 1918,
however, a particularly virulent variant of avian influenza crossed the species
barrier to infect humans, triggering the catastrophic pandemic of 1918 called
the Spanish flu, which killed 20-50 million people worldwide. Subsequent
influenza pandemics have been triggered by genome reassortment, in which a new
RNA segment from an avian form of the virus replaced one or more of the viral
RNA segments from the human form (Figure 23-32). In 2009, a new H1N1 swine
virus emerged that derived genes from pig, avian, and human influenza viruses.
Such recombination events allowed the new virus to replicate rapidly and spread
through an immunologically naive human population. Generally, within two or
three years, the human population develops immunity to a new recombinant
strain of virus, and the infection rate drops to a steady-state level. Because the
recombination events are unpredictable, it is not possible to know when the next
influenza pandemic will occur or how severe it might be.
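
The scope for reassortment can be made concrete by counting segment combinations. The Python sketch below assumes, purely for illustration, that a cell is co-infected by exactly two strains and that each of the eight segments is drawn independently from either parent; real reassortment is not necessarily this free, and many combinations may not yield viable virus. The labels "human" and "avian" are placeholders.

```python
from itertools import product

N_SEGMENTS = 8                 # influenza genome segments, as stated in the text
PARENTS = ("human", "avian")   # two hypothetical co-infecting strains

# Each progeny genome takes every segment from one parent or the other.
genotypes = list(product(PARENTS, repeat=N_SEGMENTS))

# Genomes that mix segments from both parents are the reassortants.
reassortants = [g for g in genotypes if len(set(g)) > 1]

print(len(genotypes))      # 2**8 possible segment combinations
print(len(reassortants))   # every combination except the two parental genomes
```

Under these assumptions a single co-infection offers 256 possible genomes, 254 of them new, which helps explain why reassortment can produce abrupt antigenic change where point mutation produces gradual drift.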

Drug-Resistant Pathogens Are a Growing Problem 

The development of drugs that cure rather than prevent infections has had a
major impact on human health. Antibiotics, which are either bactericidal (they
kill bacteria) or bacteriostatic (they inhibit bacterial growth without killing), are
the most successful class of such drugs. Penicillin was one of the first antibiotics
used to treat infections in humans, just in time to prevent tens of thousands of
deaths from infected battlefield wounds in World War II. Because bacteria (see
Figure 1-17) are not closely related evolutionarily to the eukaryotes they infect,
much of their basic machinery for DNA replication and transcription, RNA
translation, and metabolism differs from that of their host. These differences enable us
to develop antibacterial drugs that exhibit selective toxicity, in that they specifically
inhibit these processes in bacteria without disrupting them in the host. Most
of the antibiotics that we use to treat bacterial infections are small molecules that
inhibit macromolecular synthesis in bacteria by targeting bacterial enzymes that
either are distinct from their eukaryotic counterparts or are involved in pathways
such as cell wall biosynthesis that are absent in animals (Figure 23-33 and see
Table 6-4).

Figure 23-31 Diversification of HIV-1, HIV-2, and related strains of SIV. HIV
comprises different viral families, all descended from SIV (simian
immunodeficiency virus). On three separate occasions, SIV was passed from a
chimpanzee to a human, resulting in three HIV-1 groups: major (M), outlier (O), and
non-M non-O (N). The HIV-1 M group is the most common and is primarily responsible
for the global AIDS epidemic. On two separate occasions, SIV was passed from
a sooty mangabey monkey to a human, resulting in the two HIV-2 groups. In 2009,
a new strain of HIV was discovered that appears to have resulted from SIV passage
from a gorilla to a human.

However, bacteria continuously evolve and strains resistant to antibiotics rapidly
develop, often within a few years of the introduction of a new drug. Similar
drug resistance also arises rapidly when treating viral infections with antiviral
drugs. The virus population in an HIV-infected person treated with the reverse
transcriptase inhibitor AZT, for example, will acquire complete resistance to the
drug within a few months. The current protocol for treatment of HIV infections
involves the simultaneous use of three drugs, which helps to minimize the
acquisition of resistance for any one of them.

There are three general strategies by which a pathogen can develop drug
resistance: (1) it can alter the molecular target of the drug so that it is no longer
sensitive to the drug; (2) it can produce an enzyme that modifies or destroys the drug;
or (3) it can prevent the drug's access to the drug target by, for example, actively
pumping the drug out of the pathogen (Figure 23-34).

Once a pathogen has chanced upon an effective drug-resistance strategy, the
newly acquired or mutated genes that confer the resistance are frequently spread
throughout the pathogen population by horizontal gene transfer. They may even
spread between pathogens of different species. The highly effective but expensive
antibiotic vancomycin, for example, is used as a treatment of last resort for many
severe, hospital-acquired, Gram-positive bacterial infections that are resistant
to most other known antibiotics. Vancomycin prevents one step in bacterial cell
wall synthesis: the cross-linking of peptidoglycan chains in the bacterial cell wall
(see Figure 23-3B). Resistance can arise if the bacterium synthesizes a cell wall
using different subunits that do not bind vancomycin. The most effective form of
vancomycin resistance depends on the acquisition of a transposon (see Figure
5-60) containing seven genes, the products of which work together to sense the
presence of vancomycin, shut down the normal pathway for bacterial cell wall
synthesis, and produce a different type of cell wall.

Figure 23-32 Model for the evolution of pandemic strains of influenza virus
by recombination. Influenza A virus is a natural pathogen of birds, particularly
waterfowl, and it is always present in wild bird populations. In 1918, a particularly
virulent form of the virus crossed the species barrier from birds to humans and
caused a devastating worldwide epidemic. This strain was designated H1N1, referring
to the specific forms of its main antigens, hemagglutinin (H) and neuraminidase (N).
Changes in the virus, rendering it less virulent, and the rise of adaptive immunity
in the human population, prevented the pandemic from continuing in subsequent
seasons, although H1N1 influenza strains continued to cause serious disease every
year in very young and very old people. In 1957, a new pandemic arose when three
genes were replaced by equivalent genes from an avian virus (green bars); the new
strain (designated H2N2) was not effectively cleared by antibodies in people who had
previously contracted only H1N1 forms of influenza. In 1968, another pandemic was
triggered when two genes were replaced from another avian virus; the new virus
was designated H3N2. In 1977, there was a resurgence of H1N1 influenza, which
had previously been almost completely replaced by the N2 strains. Molecular
sequence information suggests that this minor pandemic may have been caused by
an accidental release of an influenza strain that had been held in a laboratory since
about 1950. In 2009, a new H1N1 swine virus emerged that had derived five genes
from pig influenza viruses, two from avian influenza viruses, and one from a human
influenza virus. As indicated, most human influenza today is caused by H1N1 and
H3N2 strains.

Figure 23-33 Antibiotic targets. Although there are many antibiotics in clinical use,
they have a narrow range of targets, which are highlighted in yellow. A few
representative antibiotics in each class are listed. Nearly all antibiotics used to treat
human infections fall into one of these categories. The vast majority inhibit either
bacterial protein synthesis or bacterial cell wall synthesis.

Drug-resistance genes acquired by horizontal transfer frequently come from
environmental microbial reservoirs. Nearly all antibiotics used to treat bacterial
infections today are based on natural products produced by fungi or bacteria.
Penicillin, for example, is made by the mold Penicillium, and more than 50% of
the antibiotics currently used in the clinic are made by Gram-positive bacteria of
the genus Streptomyces, which reside in the soil. It is believed that microorganisms
produce antimicrobial compounds, many of which have probably existed
on Earth for hundreds of millions of years, as weapons in their competition with
other microorganisms in the environment. Surveys of bacteria taken from soil
samples that have never been exposed to antibiotic drugs used in modern medicine
reveal that the bacteria are typically already resistant to about seven or eight
of the antibiotics widely used in clinical practice. When pathogenic microorganisms
are faced with the selective pressure provided by antibiotic treatments, they
can apparently draw upon this immense source of genetic material to acquire
resistance.

As with most other aspects of infectious disease, human behavior has exacerbated
the problem of drug resistance. Many patients take antibiotics for symptoms that
are typically caused by viruses (flu-like illnesses, colds, sore throats, and earaches),
and these drugs have no effect on such infections. Persistent and chronic misuse of
antibiotics can eventually result in antibiotic-resistant normal flora, which can then
transfer the resistance to pathogens. Antibiotics are also misused in agriculture,
where they are commonly employed as food additives to promote the growth and
health of farm animals. An antibiotic closely related to vancomycin was commonly
added to cattle feed in Europe; the resulting resistance in the normal flora of these
animals is widely believed to be one of the original sources for vancomycin-resistant
bacteria that now threaten the lives of hospitalized patients.

Figure 23-34 Three general mechanisms of antibiotic resistance. (A) A nonresistant
wild-type bacterial cell bathed in a drug (red triangles) that binds to and inhibits an
essential enzyme (light green) will be killed due to enzyme inhibition. (B) A bacterium
that has altered the drug's target enzyme so that the drug no longer binds to the
enzyme will survive and proliferate. In many cases, a single point mutation in the gene
encoding the target protein can generate resistance. (C) A bacterium that expresses
an enzyme (dark green) that either degrades or covalently modifies the drug
will survive and proliferate. Some resistant bacteria, for example, make β-lactamase
enzymes, which cleave penicillin and similar molecules. (D) A bacterium that expresses
or up-regulates an efflux pump that ejects the drug from the bacterial cytoplasm
(using energy derived from either ATP hydrolysis or the electrochemical gradient
across the bacterial plasma membrane) will survive and proliferate. Some efflux pumps,
such as the TetR efflux pump, are specific for a single drug (in this case, tetracycline),
whereas others, called multidrug resistance (MDR) efflux pumps, are capable of
exporting a wide variety of structurally dissimilar drugs. Up-regulation of an MDR
pump can render a bacterium resistant to a very large number of different antibiotics in
a single step.

Because the acquisition of drug resistance is almost inevitable, it is crucial that
we continue to develop innovative treatments for infectious diseases. We must
also take additional measures to delay the onset of drug resistance.

Summary 

All pathogens share the ability to interact with host cells in diverse ways that
promote the replication and spread of the pathogen. Pathogens often colonize the host
by adhering to or invading the epithelial surfaces that line the respiratory,
gastrointestinal, and urinary tracts, as well as the other body surfaces in direct contact with
the environment. Intracellular pathogens, including all viruses and many bacteria
and protozoa, invade host cells by one of several mechanisms. Viruses rely largely on
receptor-mediated endocytosis, whereas bacteria exploit cell adhesion and phagocytic
pathways; in both cases, the host cell provides the machinery and energy for
the invasion. Protozoa, by contrast, employ unique invasion strategies that usually
require significant metabolic expense on the part of the invader. Once inside,
intracellular pathogens seek out a cell compartment that is favorable for their survival
and replication, frequently altering host membrane traffic and exploiting the host-cell
cytoskeleton for intracellular movement. Pathogens evolve rapidly, so that new
infectious diseases frequently emerge, and old pathogens acquire new ways to evade
our attempts at treatment, prevention, and eradication.


WHAT WE DON'T KNOW


• What are the genetic and molecular
features that differ between pathogens
and members of the normal human
microbiota? How can our immune
system distinguish between the two?

• To what extent are common host-cell
biological pathways and molecules
hijacked by diverse microbes?

• Can host-cell defense molecules be
mobilized by drugs to fight infection?


PROBLEMS 

Which statements are true? Explain why or why not. 

23-1 Our adult bodies harbor about 10 times more 
microbial cells than human cells. 

23-2 The microbiomes from healthy humans are all
very similar.

23-3 Pathogens must enter host cells to cause disease.

23-4 Viruses replicate their genomes in the nucleus of
the host cell.

23-5 You should not take antibiotics for diseases caused
by viruses.

Discuss the following problems. 


23-6 In order to survive and multiply, a successful
pathogen must accomplish five tasks. Name them.

23-7 Clostridium difficile infection is the leading cause
of hospital-associated gastrointestinal illness. It is typically
treated with a course of antibiotics, but the infection recurs
in about 20% of cases. C. difficile infections are difficult to
eradicate because the bacteria exist in two forms: a replicating,
toxin-producing form and a spore form that is resistant
to antibiotics. Fecal microbiota transplantation (the
transfer of normal gut microbiota from a healthy individual)
can resolve >90% of recurrent infections, a much better
cure rate than further antibiotic treatment alone. Why
do you suppose microbiota transplantation is so effective?

23-8 What are the three general mechanisms for horizontal
gene transfer?

23-9 The Gram-negative bacterium Yersinia pestis, the
causative agent of the plague, is extremely virulent. Upon
infection, Y. pestis injects a set of effector proteins into
macrophages that suppresses their phagocytic behavior
and also interferes with their innate immune responses.
One of the effector proteins, YopJ, acetylates serines and
threonines on various MAP kinases, including the MAP
kinase kinase kinase TAK1, which controls a key signaling
step in the innate immune response pathway. To determine
how YopJ interferes with TAK1, you transfect human
cells with active YopJ (YopJ WT) or inactive YopJ (YopJ CA)
and with FLAG-tagged active TAK1 (TAK1 WT) or inactive
TAK1 (TAK1 K63W), and assay for total TAK1 and for
phosphorylated TAK1, using antibodies against the FLAG tag or
against phosphorylated TAK1 (Figure Q23-1). How does
YopJ block the TAK1 signaling pathway? How do you suppose
the serine/threonine acetylase activity of YopJ might
interfere with TAK1 activation?

[Figure Q23-1: immunoblot. Lanes are labeled by the TAK1 construct (WT or K63W)
and the YopJ construct (CA, WT, or none). Upper panel: IP: α-FLAG-TAK1, IB: α-pTAK1.
Lower panel: IP: α-FLAG-TAK1, IB: α-FLAG-TAK1.]

Figure Q23-1 Effects of YopJ on TAK1 phosphorylation (Problem
23-9). TAK1 was immunoprecipitated (IP) using antibodies against
the FLAG tag (α-FLAG-TAK1). Total TAK1 in the immunoprecipitation
was assayed by immunoblot (IB) using the same antibody.
Phosphorylated TAK1 was assayed by IB using antibodies specific
for phospho-TAK1 (α-pTAK1). A scale of protein molecular mass is
shown at right in kilodaltons. (From N. Paquette et al., Proc. Natl
Acad. Sci. USA 109:12710-12715, 2012. With permission from
National Academy of Sciences.)

23-10 The intracellular bacterial pathogen Salmonella
typhimurium, which causes gastroenteritis, injects effector
proteins to promote its invasion into nonphagocytic host
cells by the trigger mechanism. S. typhimurium first stimulates
membrane ruffling to promote invasion, and then
suppresses membrane ruffling once invasion is complete.
This behavior is mediated in part by injection of two effector
proteins: SopE, which promotes membrane ruffling
and invasion, and SptP, which blocks the effects of SopE.
Both effector proteins target the monomeric GTPase, Rac,
which in its active form promotes membrane ruffling. How
do you suppose SopE and SptP affect Rac activity? How do
you suppose the effects of SopE and SptP are staggered in
time if they are injected simultaneously?

23-11 John Snow is widely regarded as the father of modern
epidemiology. Most famously, he investigated an outbreak
of cholera in London in 1854 that killed more than
600 victims before it was finished. Snow recorded where
the victims lived, and plotted the data on a map, along with
the locations of the water pumps that served as the source
of water for the public (Figure Q23-2). He concluded that
the disease was most likely spread in the water, although
he could find nothing suspicious-looking in it. His conclusion
ran counter to the then-current belief that cholera was
from "miasmas" in bad air. Very few believed his theory
during the next 50 years, with the "bad air" theory persisting
until at least 1901. What do you suppose Snow saw in
the data that led him to his conclusion? Why do you think
most scientists remained skeptical for so long?





Figure Q23-2 A map of where the victims of the 1854 cholera
outbreak lived, superimposed on a modern Google map of the area
(Problem 23-11). The locations of the victims' houses are indicated
in a gradient of colors from blue (indicating few cases) to orange
(indicating many cases). Public water pumps are shown as red
squares.

23-12 Influenza epidemics account for 250,000 to
500,000 deaths globally each year. These epidemics are
markedly seasonal, occurring in temperate climates in the
northern and southern hemispheres during their respective
winters. By contrast, in the tropics, there is significant
influenza activity year round, with a peak in the rainy season
(Figure Q23-3). Can you suggest some possible explanations
for the patterns of influenza epidemics in temperate
zones and the tropics?

Figure Q23-3 Seasonal patterns of influenza epidemics (Problem
23-12). Cases of influenza at different times of the year (by month) are shown for
the northern hemisphere (blue), the southern hemisphere (orange),
and the tropics (red).

23-13 Several negative-strand viruses carry their genome
as a set of discrete RNA segments. Examples include influenza
virus (eight segments), Rift Valley fever virus (three
segments), Hantavirus (three segments), and Lassa virus
(two segments), to name a few. Why does segmentation of
the genome provide a strong evolutionary advantage for
these viruses?

23-14 Avian influenza viruses readily infect birds, but
are transmitted to humans very rarely. Similarly, human
influenza viruses spread readily to other humans, but
have never been detected in birds. The key to this specificity
lies in the viral capsid protein, hemagglutinin, which
binds to sialic acid residues on cell-surface glycoproteins,
triggering virus entry into the cell (Movie 23.8). Hemagglutinin
on human viruses recognizes sialic acid in a 2-6
linkage with galactose, whereas avian hemagglutinin recognizes
sialic acid in a 2-3 linkage with galactose. Humans
make carbohydrate chains that have only the 2-6 linkage
between sialic acid and galactose; birds make only the
2-3 linkage; but pigs make carbohydrate chains with both
linkages. How does this situation make pigs ideal hosts for
generating new strains of human influenza viruses?

23-15 The majority of antibiotics used in the clinic are
made as natural products by bacteria. Why do you suppose
bacteria make the very agents we use to kill them?

23-16 In the early days of penicillin research, it was discovered
that bacteria in the air could destroy the penicillin,
a big problem for large-scale production of the drug. How
do you suppose this occurs?

23-17 When the Oxford team of Ernst Chain and Norman
Heatley had laboriously collected their first two grams
of penicillin (probably no more than 2% pure!), Chain
injected two normal mice with 1 g each of this preparation,
and waited to see what would happen. The mice survived
with no apparent ill effects. Their boss, Howard Florey, was
furious at what he saw as a waste of good antibiotic. Why
was this experiment important?

As we discussed in Chapter 23, all living organisms serve as hosts for other species,
usually in relationships that are benign or even mutually helpful. But all
organisms, and all cells in a multicellular organism, need to defend themselves
against infection by harmful invaders, collectively called pathogens, which can
be microbes (bacteria, viruses, or fungi), or larger parasites. Even bacteria defend
themselves against viruses, using intracellular proteins called restriction factors,
which block viral propagation. Invertebrates use a variety of defense strategies,
including protective barriers, toxic molecules, restriction factors, and phagocytic
cells that ingest and destroy invading pathogens. Vertebrates, too, depend on such
innate immune responses, but they can also harness more sophisticated and specific
mechanisms, called adaptive immune responses. The innate responses occur
first, calling the adaptive immune responses into play if required, in which case
both types of responses work together to eliminate the pathogen (Figure 24-1).

Whereas innate immune responses are general defense reactions that can
involve almost any cell type in an organism, the adaptive immune responses are
highly specific to the particular pathogen that induced them and depend on a class
of white blood cells (leukocytes) called lymphocytes. There are two major classes
of lymphocytes that mount adaptive immune responses: B lymphocytes (B cells),
which secrete antibodies that bind specifically to the pathogen, and T lymphocytes
(T cells), which can either directly kill cells infected with the pathogen or produce
secreted or cell-surface signal proteins that stimulate other host cells to help
eliminate the pathogen (Figure 24-2). Unlike innate immune responses, which are
generally short-lasting, the adaptive responses provide long-lasting protection:
a person who recovers from measles or is vaccinated against it, for example, is
protected for life against measles by the adaptive immune system, although not
against other common viruses, such as those that cause mumps or chickenpox.

Both the innate and adaptive immune systems have evolved sensing mechanisms
that enable them to recognize harmful invaders (pathogens) and distinguish
them from both the host's own cells and molecules and harmless or beneficial
foreign organisms and their molecules. The innate system relies on sensor
proteins that recognize particular types or patterns of molecules that are common
to pathogens but are absent or sequestered in the host. The adaptive system, by
contrast, uses unique genetic mechanisms to produce a virtually limitless diversity
of related proteins (receptors on T and B cells and secreted antibodies) that,
between them, can bind almost any foreign molecule. This remarkable strategy
enables the adaptive immune system to react specifically against any pathogen,
even if the animal never encountered it before. But it also requires that the system
learn not to react against self molecules or harmless foreign ones; if these learning
mechanisms fail, harmful autoimmune or allergic responses result.

In this chapter, we focus on vertebrate immune responses and the features
that distinguish them from other kinds of cell responses. We begin with innate
immune defenses and then discuss the highly specialized properties of the adaptive
immune system.

Figure 24-1 Innate and adaptive immune responses. Innate immune responses are
activated directly by pathogens and defend all multicellular organisms against infection.
In vertebrates, pathogens, together with the innate immune responses they
activate, also stimulate adaptive immune responses, which then work together with
innate immune responses to help fight the infection.

THE INNATE IMMUNE SYSTEM

Adaptive immune responses are slow to develop when a vertebrate first encounters
a new pathogen. This is because the specific B cells and T cells that can respond
to a particular pathogen are initially few in number and must be stimulated to
proliferate and differentiate before they can mount effective adaptive immune
responses, which can take days. By contrast, a single bacterium that divides every
hour can generate almost twenty million progeny in a single day, producing a
full-blown infection. Vertebrates, therefore, rely on their innate immune system to
defend them against infection during the first critical hours and days of exposure
to a new pathogen. Plants and invertebrates lack adaptive immune systems and
therefore rely entirely on innate immunity for protection against pathogens.
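
The figure for bacterial growth quoted above follows from simple doubling arithmetic. The short Python sketch below assumes idealized, synchronous division with no cell death or nutrient limitation; the function name progeny_after is illustrative.

```python
DIVISIONS_PER_DAY = 24          # one division per hour, as in the text

def progeny_after(divisions, founders=1):
    """Cells present after a number of synchronous doublings."""
    return founders * 2 ** divisions

cells = progeny_after(DIVISIONS_PER_DAY)
print(f"{cells:,} cells after one day")   # 2**24 = 16,777,216
```

Twenty-four doublings give about 17 million cells, in line with the "almost twenty million" quoted in the text and far outpacing the days needed for an adaptive response to develop.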

In this section, we consider some of the strategies the innate immune system
uses to recognize pathogens and to provide a first line of defense against them.

Epithelial Surfaces Serve as Barriers to Infection 

In vertebrates, the first encounters with infectious organisms are typically at the
epithelial surfaces that form the skin and line the respiratory, digestive, urinary,
and reproductive tracts. These epithelia provide both physical and chemical
barriers to invasion by pathogens: tight junctions between epithelial cells bar entry
between the cells, and a variety of substances secreted by the cells discourage the
attachment and entry of pathogens. The keratinized epithelial cells of the skin,
for example, form a thick physical barrier, and the sebaceous glands in the skin
secrete fatty acids and lactic acid, which inhibit bacterial growth. In addition,
epithelial cells in all tissues, including those in plants and invertebrates, secrete
antimicrobial molecules called defensins. Defensins are positively charged,
amphipathic peptides that bind to and disrupt the membranes of many pathogens,
including enveloped viruses, bacteria, fungi, and parasites.

The epithelial cells that line internal organs such as the respiratory and digestive
tracts also secrete slimy mucus, which sticks to the epithelial surface and
makes it difficult for pathogens to adhere. The beating of cilia on the surface of the
epithelial cells lining the respiratory tract and the peristaltic action of the intestine
also discourage the adherence of pathogens. Moreover, as we discuss in Chapter
23, healthy skin and gut are populated by enormous numbers of harmless (and
often helpful) commensal microbes, collectively called the normal flora, which
compete for nutrients with pathogens; some also produce antimicrobial peptides
that actively inhibit pathogen proliferation.

Pattern Recognition Receptors (PRRs) Recognize Conserved
Features of Pathogens

Pathogens do occasionally breach the epithelial barricades, in which case
underlying, nonepithelial cells of the innate immune system provide the next line of
defense. These cells sense the presence of pathogens largely through the use of
receptor proteins that recognize microbe-associated molecules that either are not
present or are sequestered in the host organism. Because these microbial
molecules often occur in repeating patterns, they are called pathogen-associated
molecular patterns (PAMPs), even though they are not unique to microbes that
can cause disease. PAMPs are present in various microbial molecules, including
nucleic acids, lipids, polysaccharides, and proteins.

The special receptor proteins that recognize PAMPs are called pattern recognition
receptors (PRRs). Some PRRs are transmembrane proteins on the surface
of many types of host cells, where they recognize extracellular pathogens; on
professional phagocytic cells (phagocytes) such as macrophages and neutrophils
(discussed in Chapter 22), they can mediate the uptake of the pathogens into
phagosomes, which then fuse with lysosomes, where the pathogens are destroyed.
Other PRRs are located intracellularly, where they can detect intracellular
pathogens such as viruses; these PRRs are either free in the cytosol or associated with
the membranes of the endolysosomal system (discussed in Chapter 13). Still other
PRRs are secreted and bind to the surface of extracellular pathogens, marking
them for destruction by either phagocytes or blood proteins that are part of the
complement system (discussed later).

Figure 24-2 The two main classes of adaptive immune responses.
Lymphocytes carry out both classes of adaptive responses, shown here as
responses to a viral infection. In one class, B cells secrete antibodies that
specifically bind to and neutralize extracellular viruses, by preventing the viruses
from infecting host cells. In the other, T cells mediate the response; in this
example, they kill the virus-infected host cells. In both cases, innate immune
responses help activate the adaptive immune responses through pathways that
are not shown.

Figure 24-3 A scanning electron micrograph of a mutant fruit fly that died
from a fungal infection. The fly is covered with fungal hyphae, as it lacked a Toll
receptor, which helps protect Drosophila from fungal infections. Scale bar: 0.2 mm.
(From B. Lemaitre et al., Cell 86:973-983, 1996.)

There Are Multiple Classes of PRRs 

The first PRR identified was the Toll receptor in Drosophila, which was well known
for its role in fly development (see Figure 21-17). It was later discovered also to be
required for the production of antimicrobial peptides that protect the fly against
fungal infections (Figure 24-3). Toll is a transmembrane glycoprotein with a
large extracellular domain that contains a series of leucine-rich repeats. Soon it
was discovered that both plants and animals have a variety of Toll-like receptors
(TLRs) that function as PRRs in innate immune responses against various
pathogens. Mammals make at least 10 different TLRs, each recognizing distinct
ligands: TLR3, for example, recognizes double-stranded viral RNA in the endosomal
lumen (Figure 24-4); TLR4 recognizes lipopolysaccharide (LPS) on the outer
membrane of Gram-negative bacteria; TLR5 recognizes the protein that forms
the bacterial flagellum; and TLR9 recognizes short, unmethylated sequences of
bacterial, viral, or protozoan DNA, called CpG motifs, which are uncommon in
vertebrate DNA.

Figure 24-4 A Toll-like receptor. (A) The structure of human TLR3 is shown
(green), with a double-stranded RNA molecule (dsRNA, blue) bound to it. The
receptor is a homodimer in the membrane of endosomes. The binding of dsRNA to
the two horseshoe-shaped domains on the lumenal side of the endosome brings the
two cytosolic domains together, allowing adaptor proteins in the cytosol to assemble
into a large signaling complex (not shown). (B) The crystal structure of a lumenal
domain of the receptor, which contains 23 conventional leucine-rich repeats, each of
which contributes a β strand to the continuous β sheet (red) that lines the concave
surface of the structure. (A, adapted from L. Liu et al., Science 320:379-381, 2008.
With permission from AAAS; B, adapted from J. Choe, M.S. Kelker and I.A. Wilson,
Science 309:581-585, 2006; PDB: 1ZIW.)

In addition to TLRs, vertebrates use several other families of PRRs to detect
pathogens. One is the large family of NOD-like receptors (NLRs). Like TLRs,
NLRs have leucine-rich repeat motifs, but they are exclusively cytoplasmic and
recognize a distinct set of bacterial molecules. Individuals who are homozygous
for a particular mutant allele of the NLR gene NOD2 have a greatly increased
risk of developing Crohn's disease, a chronic inflammatory disease of the small
intestine, possibly triggered by a bacterial infection. Another class of PRRs
consists of RIG-like receptors (RLRs), which are members of the RNA helicase family
of proteins. They are also exclusively cytoplasmic and detect viral pathogens. A
fourth class of PRRs consists of C-type lectin receptors (CLRs), which are
transmembrane cell-surface proteins that recognize carbohydrates (which is why they
are called lectins) on various microbes. Table 24-1 summarizes some PRRs and
their ligands and locations in cells. Collectively, these and other PRRs act as an
alarm system to alert the innate and adaptive immune systems that an infection
is brewing (Movie 24.1).

When a cell-surface or intracellular PRR binds a PAMP, it stimulates the cell
to secrete a variety of cytokines and other extracellular signal molecules. Some of
these inhibit viral replication, but most induce a local inflammatory response that
helps eliminate the pathogen, as we now discuss.

Activated PRRs Trigger an Inflammatory Response at Sites of 
Infection 

When a pathogen invades a tissue, it activates PRRs on or in various cells of the
innate immune system, resulting in an inflammatory response at the site of
infection. The inflammatory response depends on changes in local blood vessels and is
characterized clinically by local pain, redness, heat, and swelling. The blood
vessels dilate and become permeable to fluid and proteins, leading to local swelling
and an accumulation of blood proteins that aid in defense. At the same time, the
endothelial cells lining the local blood vessels are stimulated to express cell
adhesion proteins, which promote the attachment and escape of white blood cells, or
leukocytes (see Figure 19-29B), adding to the local swelling; initially neutrophils
escape, followed later by lymphocytes and monocytes (the blood-borne
precursors of macrophages).


TABLE 24-1 Some Pattern Recognition Receptors (PRRs)

Receptor   Location               Ligand                                                    Origin of ligand

Toll-like receptors (TLRs)
TLR3       Endolysosomal system   Double-stranded RNA                                       Viruses
TLR4       Plasma membrane        Bacterial lipopolysaccharide (LPS); viral coat proteins   Bacteria; viruses
TLR5       Plasma membrane        Flagellin                                                 Bacteria
TLR9       Endolysosomal system   Unmethylated CpG DNA                                      Bacteria, viruses, protozoa

NOD-like receptors (NLRs)
NOD2       Cytoplasm              Degradation products of peptidoglycans                    Bacteria

Retinoic acid-inducible gene 1-like receptors (RLRs)
RIG1       Cytoplasm              Double-stranded RNA                                       Viruses

C-type lectin receptors (CLRs)
Dectin1    Plasma membrane        β-Glucan                                                  Fungi

The activation of PRRs results in the production of a large variety of extracellular
signal molecules that mediate the inflammatory response at the site of an
infection. These include both lipid signal molecules, such as prostaglandins, and
protein (or peptide) signal molecules called cytokines. Some of the most important
pro-inflammatory cytokines are tumor necrosis factor-α (TNFα), interferon-γ
(IFNγ), a variety of chemokines (which recruit leukocytes), and various
interleukins (ILs) that we discuss later, including IL1, IL6, IL12, and IL17. In addition, a
secreted PRR (mannose-binding lectin) activates the complement system when
the PRR binds to a pathogen; fragments of complement proteins released during
complement activation stimulate an inflammatory response (discussed shortly;
see Figure 24-7).

When activated by PAMPs, most cell-surface and intracellular PRRs stimulate
the production of multiple pro-inflammatory cytokines by activating intracellular
signaling pathways that switch on transcription regulators, including NFκB,
to induce the transcription of the relevant cytokine genes (see Figure 15-62).
Some PRRs, however, can also stimulate pro-inflammatory cytokine production
by a different mechanism: when activated, several cytoplasmic NLRs assemble
with adaptor proteins and specific protease precursors of the caspase family
(discussed in Chapter 18) to form inflammasomes, in which pro-inflammatory
cytokines such as IL1 are cleaved from their inactive precursor proteins by activated
caspases. These cytokines are then released from the cell by a poorly understood,
unconventional secretion pathway. Inflammasomes closely resemble
apoptosomes in their assembly and structure, but, in apoptosomes, procaspases are
activated to initiate a proteolytic caspase cascade that leads to apoptosis (see
Figure 18-7).

NLR-dependent inflammasome assembly can also be triggered in the absence
of infection if cells are damaged or stressed. Such cells produce "danger signals,"
such as altered or misplaced self molecules, which can activate the relevant NLRs:
the arthritis caused by uric acid crystals formed in the joints of individuals with
gout, who have abnormally high uric acid levels in their blood, is a painful example.

Phagocytic Cells Seek, Engulf, and Destroy Pathogens 

In all animals, the recognition of a microbial invader is usually quickly followed by
its engulfment by a phagocytic cell. Macrophages are long-lived phagocytes that
reside in most vertebrate tissues; they are among the first cells to encounter
invading microbes, whose PAMPs activate the macrophages to secrete pro-inflammatory
signal molecules. Neutrophils are short-lived phagocytes that are abundant
in blood but are not present in healthy tissues; they are rapidly recruited to sites of
infection by various attractive molecules, including formylmethionine-containing
peptides (which are released by microbes but are not made by mammalian
cells), chemokines secreted by activated macrophages, and peptide fragments
produced from cleaved, activated complement proteins. The recruited
neutrophils contribute their own pro-inflammatory cytokines.

In addition to their PRRs, macrophages and neutrophils display a variety of
cell-surface receptors that recognize fragments of complement proteins or
antibodies bound to the surface of a pathogen. The binding of such a pathogen to these
receptors leads to its phagocytosis (Figure 24-5) and an attack on the ingested
pathogen once inside a phagolysosome. The phagocytes possess an impressive
armory of weapons to kill the invader, including enzymes such as lysozyme
and acid hydrolases that can degrade the pathogen's cell wall. The cells assemble
NADPH oxidase complexes on the phagolysosomal membrane, where the
complexes catalyze the production of highly toxic oxygen-derived compounds,
including superoxide (O₂⁻), hydrogen peroxide, and hydroxyl radicals. A transient
increase in oxygen consumption by the phagocytic cells, called the respiratory
burst, accompanies the production of these toxic compounds. Whereas
macrophages generally survive this killing frenzy and live to kill again, neutrophils do
not. Dead and dying neutrophils are a major component of the pus that forms
in acute bacterially infected wounds; their half-life in the human bloodstream is
only a few hours.

Figure 24-5 Antibody-activated phagocytosis. Electron micrograph of
a neutrophil phagocytosing an antibody-coated bacterium, which is in the process
of dividing. The process in which antibody (or complement) coating of a pathogen
increases the efficiency with which the pathogen is phagocytosed is called
opsonization. (Courtesy of Dorothy F. Bainton, from R.C. Williams, Jr. and
H.H. Fudenberg, Phagocytic Mechanisms in Health and Disease. New York:
Intercontinental Medical Book Corporation, 1971.)

If a pathogen is too large to be successfully phagocytosed (if it is a large
parasite such as a worm, for example), a group of macrophages, neutrophils, or
eosinophils (another type of leukocyte) will gather around the invader. They secrete
defensins and other damaging agents and release the toxic products of the
respiratory burst. This barrage is often sufficient to destroy the pathogen (Figure 24-6).

Complement Activation Targets Pathogens for Phagocytosis or
Lysis

The blood and other extracellular fluids contain numerous proteins with
antimicrobial activity, some of which are produced in response to an infection, while
others are produced constitutively. The most important of these are components
of the complement system, which consists of about thirty interacting soluble
proteins that are mainly made continuously by the liver and are inactive until an
infection or another trigger activates them. They were originally identified by their
ability to amplify and "complement" the action of antibodies made by B cells, but
some are also secreted PRRs, which directly recognize PAMPs on microbes.

The early complement components consist of three sets of proteins, belonging
to three distinct pathways of complement activation: the classical pathway,
the lectin pathway, and the alternative pathway. The early components of all
three pathways act locally to cleave and activate C3, which is the pivotal
complement component (Figure 24-7); individuals with a C3 deficiency are subject to
repeated bacterial infections. The early components are proenzymes, which are
activated sequentially by proteolytic cleavage. The cleavage of each proenzyme
in the series activates the next component to generate a serine protease, which
cleaves the next proenzyme in the series, and so on. Since each activated enzyme
cleaves many molecules of the next proenzyme in the chain, the activation of the
early components consists of an amplifying proteolytic cascade.
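
The amplification described in the preceding paragraph is multiplicative: if each activated protease cleaves many copies of the next proenzyme, the number of activated molecules grows geometrically with the number of steps. The sketch below illustrates only that arithmetic. The fixed number of cleavages per enzyme and the number of steps are hypothetical values chosen for illustration, not measured properties of the complement system, and the model ignores the inhibition and instability that limit the real cascade.

```python
def cascade_output(steps: int, cleavages_per_enzyme: int, initial_active: int = 1) -> list[int]:
    """Activated molecules at each step of an idealized proteolytic cascade.

    Assumes (hypothetically) that every activated enzyme cleaves the same
    number of molecules of the next proenzyme and that nothing is inactivated.
    """
    counts = [initial_active]
    for _ in range(steps):
        # Each active enzyme at the current step activates many molecules of the next.
        counts.append(counts[-1] * cleavages_per_enzyme)
    return counts


print(cascade_output(steps=3, cleavages_per_enzyme=100))
# [1, 100, 10000, 1000000]
```

Even a short chain therefore turns a single triggering event into a very large local response.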

Many of these protein cleavages liberate a biologically active small fragment
that can attract neutrophils, plus a membrane-binding larger fragment. The
binding of the large fragment to a cell membrane, usually the surface of a pathogen,
helps stimulate the next reaction in the sequence. In this way, complement
activation is largely kept confined to the cell surface where it began. In particular, the
large fragment of C3, called C3b, binds covalently to the surface of the pathogen.
Here, it recruits protein fragments produced by cleavage of other early complement
components to form proteolytic complexes that catalyze the subsequent steps in
the complement cascade. The early events in complement activation have diverse
functions: C3b-binding receptors on phagocytic cells enhance the ability of these
cells to phagocytose the pathogen, and similar receptors on B cells enhance the
ability of these cells to make antibodies against various microbial molecules on
C3b-coated pathogens; the smaller fragment of C3 (called C3a), as well as small
fragments of C4 and C5, act independently as diffusible signals to promote an
inflammatory response by recruiting leukocytes to the site of infection.

Figure 24-6 Eosinophils attacking a parasite. Phagocytes cannot ingest large
parasites such as the schistosome larva shown here. When the larva is coated with
antibody or complement components, however, eosinophils (and other leukocytes)
can recognize it and collectively kill it by secreting a large variety of toxic molecules.
(Courtesy of Anthony Butterworth.)

Figure 24-7 The principal stages in complement activation by the classical,
lectin, and alternative pathways. In all three pathways, the reactions of
complement activation usually take place on the surface of an invading microbe, such
as a bacterium, and lead to the cleavage of C3 and the various consequences shown.
As indicated, the complement proteins C1 to C9, mannose-binding lectin (MBL),
MBL-associated serine protease (MASP), and factors B and D are the central
components of the complement system. The early components are shown within
gray arrows, while the late components are shown within a brown arrow. The functions
of the protein fragments produced during complement activation are indicated by
the black arrows. The various complement proteins that regulate the system are
omitted.

As indicated in Figure 24-7, antibodies bound to the surface of a pathogen
activate the classical pathway. Mannose-binding lectin, mentioned earlier, is a
secreted PRR that initiates the lectin pathway of complement activation when
it recognizes bacterial or fungal glycolipids and glycoproteins bearing terminal
mannose and fucose sugars in a particular spatial conformation. These initial
binding events in the classical and lectin pathways cause the recruitment and
activation of the early complement components. Finally, molecules on the
surface of pathogens will often directly activate the alternative pathway.

Host cells produce various plasma membrane molecules that prevent complement
reactions from proceeding on their cell surface. The most important of these
is the carbohydrate moiety sialic acid, a common constituent of cell-surface
glycoproteins and glycolipids (see Figure 10-16). Because pathogens generally lack
sialic acid, they are singled out for complement-mediated destruction, while host
cells are spared. Some pathogens, including the bacterium Neisseria gonorrhoeae
that causes the sexually transmitted disease gonorrhea, coat themselves with a
layer of sialic acid to effectively hide from the complement system.

Membrane-immobilized C3b, produced by any of the three pathways, triggers
a further cascade of reactions that leads to the assembly of the late complement
components to form membrane attack complexes. These protein complexes
assemble in the pathogen membrane near the site of C3 activation, forming
aqueous pores through the membrane (Figure 24-8). For this reason, and because they
perturb the structure of the lipid bilayer in their vicinity, they make the membrane
leaky and can, in some cases, cause the microbe to lyse.

The self-amplifying, inflammatory, and destructive properties of the complement
cascade make it essential that key activated components be rapidly inactivated
after they are generated, ensuring that the attack does not spread to nearby
host cells. Inactivation is achieved in at least two ways. First, specific inhibitor
proteins in the blood or on the surface of host cells terminate the cascade, by either
binding or cleaving certain complement components once the components have
been activated by proteolytic cleavage. Second, many of the activated components
in the cascade are unstable; unless they bind immediately to either the next
component in the complement cascade or to a nearby membrane, they rapidly
inactivate.


Virus-Infected Cells Take Drastic Measures to Prevent Viral
Replication

Because host-cell ribosomes make a virus's proteins and host-cell lipids form the
membranes of enveloped viruses, PAMPs are generally not present on the surface
of viruses. Therefore, the only general way that a host cell PRR can recognize the
presence of a virus is to detect unusual elements of the viral genome, such as the
double-stranded RNA (dsRNA) that is an intermediate in the life cycle of many
viruses and is recognized by several PRRs including the Toll-like receptor TLR3;
in addition, DNA virus genomes frequently contain significant amounts of the
CpG motifs discussed earlier, which can be recognized by TLR9 (see Table 24-1).

Figure 24-8 Assembly of the late complement components to form a
membrane attack complex. The cleavage of the early complement components
(shown within gray arrows in Figure 24-7) results in the formation of C3b-containing
proteolytic complexes called C5 convertases (not shown). These then
cleave the first of the late components, C5, to produce C5a and C5b. As illustrated,
C5b rapidly assembles with C6 and C7 to form C567, which then binds firmly via
C7 to the membrane. One molecule of C8 binds to the complex to form C5678.
The binding of a molecule of C9 to C5678 induces a conformational change in C9
that exposes a hydrophobic region and causes C9 to insert into the lipid bilayer of
the target membrane. This starts a chain reaction in which the altered C9 binds a
second molecule of C9, which can then bind another molecule of C9, and so on.
In this way, a ring of C9 molecules forms a large transmembrane channel in the
membrane.

Mammalian cells are particularly adept at recognizing the presence of dsRNA,
which activates intracellular PRRs that induce the host cell to produce and secrete
two antiviral cytokines, interferon-α (IFNα) and interferon-β (IFNβ). These
interferons are referred to as type I interferons to distinguish them from IFNγ,
which is a type II interferon and has different functions, as we discuss later. Type
I interferons act in both an autocrine fashion on the infected cells that produced
them and a paracrine fashion on uninfected neighbors. They bind to a common
cell-surface receptor, which activates the JAK-STAT intracellular signaling pathway
(see Figure 15-56) to stimulate specific gene transcription and thereby the
production of more than 300 proteins, including many cytokines, reflecting the
complexity of the cell's acute response to a viral infection.

The production of type I interferons appears to be a general response of
mammalian cells to a viral infection, and viral components other than dsRNA can
trigger it. The type I interferons help block viral replication in multiple ways. They
activate a latent ribonuclease that nonspecifically degrades single-stranded RNA.
They also indirectly activate a protein kinase that phosphorylates and inactivates
the protein synthesis initiation factor eIF2 (discussed in Chapter 6), thereby
shutting down most protein synthesis in the infected host cell. Apparently, by
destroying most of its own RNA and transiently halting most of its protein synthesis, the
host cell inhibits viral replication without killing itself. If these measures fail, the
cell takes an even more extreme step to prevent the virus from replicating: it kills
itself by undergoing apoptosis, often with the help of immune killer cells, as we
discuss next.

Natural Killer Cells Induce Virus-Infected Cells to Kill Themselves

Type I interferons also have less direct ways of blocking viral replication. One of
these is to enhance the activity of natural killer cells (NK cells), which are
leukocytes related to T and B cells but are part of the innate immune system and
are recruited early to sites of inflammation. Like cytotoxic T cells of the adaptive
immune system (discussed later), NK cells destroy virus-infected cells by inducing
the infected cells to kill themselves by undergoing apoptosis (discussed in
Chapter 18). We consider how killer cells induce apoptosis later, when we discuss
how cytotoxic T cells do it (see Figure 24-43). Although they kill in the same
way, the means by which cytotoxic T cells and NK cells distinguish the surface of
virus-infected cells from that of uninfected cells are different (Movie 24.2).

Both cytotoxic T cells and NK cells recognize the same special class of cell-surface
proteins on a host cell to help determine if the cell is virus-infected, but they
use distinct receptors to do so. The special cell-surface proteins recognized are
called class I MHC proteins, because they are encoded by genes in the major
histocompatibility complex; almost all nucleated cells in vertebrates express these
genes, and we discuss them in detail later. Cytotoxic T cells use both T cell
receptors (TCRs) and co-receptors to recognize peptide fragments of viral proteins
bound to class I MHC proteins on the surface of virus-infected host cells and then
induce the infected cells to kill themselves. By contrast, NK cells have cell-surface
inhibitory receptors that monitor the level of class I MHC proteins on the surface
of other host cells: the high levels of these MHC proteins normally present on
healthy cells engage these receptors and thereby inhibit the killing activity of the
NK cells. The NK cells thus focus primarily on host cells expressing abnormally
low levels of class I MHC proteins and induce them to kill themselves; these are
mainly virus-infected cells and some cancer cells (Figure 24-9). NK cell killing
activity is stimulated when various activating receptors on the NK cell surface
recognize specific proteins that are greatly increased on the surface of virus-infected
cells and some cancer cells.


Figure 24-9 A natural killer (NK) cell attacking a cancer cell. This scanning
electron micrograph was taken shortly after the NK cell attached to the cancer
cell, but before it induced the cell to die by apoptosis. Scale bar: 5 μm.
(Courtesy of J.C. Hiserodt, in Mechanisms of Cytotoxicity by Natural
Killer Cells [R.B. Herberman and D. Callewaert, eds.]. New York: Academic
Press, 1995.)

Figure 24-10 How an NK cell recognizes its target. An NK cell preferentially
attacks infected host cells and cancer cells because these cells have on their surface
both activating proteins and, in some cases, abnormally low levels of class I
MHC proteins. (A) The high levels of class I MHC proteins found on normal host
cells activate inhibitory receptors on the NK cell that suppress the killing activity of
the NK cell. (B) In contrast, the activating proteins on infected cells and cancer cells
bind to activating receptors on the NK cell and stimulate the killing activity of the
cell, so that the virus-infected host cell is killed by apoptosis.


The reason that class I MHC protein levels are often low on virus-infected
cells is that many viruses have developed a variety of mechanisms to inhibit the
expression of these proteins on the surface of the host cells they infect, in order
to avoid detection by cytotoxic T cells: some viruses encode proteins that block
class I MHC gene transcription; others block the intracellular assembly of
peptide-MHC complexes; still others block the transport of these complexes to the
cell surface. By evading recognition by cytotoxic T cells in these ways, however,
a virus incurs the wrath of NK cells, which recognize the infected cells as being
different, both because the infected cells express little class I MHC protein and
because they express large amounts of other surface proteins that are recognized
by the activating receptors on the NK cells (Figure 24-10).
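
The balance between inhibitory and activating signals described above can be caricatured as a simple comparison. The sketch below is a toy model only: the numeric scales, the weights, and the linear comparison rule are all hypothetical, chosen to illustrate the logic of the paragraph rather than to describe how real NK cells integrate receptor signals.

```python
from dataclasses import dataclass


@dataclass
class TargetCell:
    # Relative surface levels on an arbitrary, hypothetical scale
    # (1.0 = level on a typical healthy cell for class I MHC).
    class_i_mhc: float
    activating_ligands: float


def nk_cell_kills(cell: TargetCell,
                  inhibition_weight: float = 1.0,
                  activation_weight: float = 1.0) -> bool:
    """Toy decision rule: kill when activating input outweighs inhibitory input."""
    inhibitory_signal = inhibition_weight * cell.class_i_mhc
    activating_signal = activation_weight * cell.activating_ligands
    return activating_signal > inhibitory_signal


healthy = TargetCell(class_i_mhc=1.0, activating_ligands=0.1)
infected = TargetCell(class_i_mhc=0.2, activating_ligands=0.9)

print(nk_cell_kills(healthy))   # False
print(nk_cell_kills(infected))  # True
```

In this caricature, a virus that lowers class I MHC levels to escape cytotoxic T cells reduces the inhibitory term and so makes the infected cell a more likely NK cell target.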

Dendritic Cells Provide the Link Between the Innate and 
Adaptive Immune Systems 

Dendritic cells are crucially important components of the innate immune system.
They are a heterogeneous class of cells that are widely distributed in the tissues
and organs of vertebrates. They express a large variety of PRRs, which enable
dendritic cells to recognize and phagocytose invading pathogens and their products
and to become activated in the process. The activated dendritic cells cleave the
proteins of the pathogen into peptide fragments, which bind to newly synthesized
MHC proteins, which then carry the fragments to the dendritic cell surface. The
activated cells then migrate to a nearby lymphoid organ such as a lymph node
(also called a lymph gland), where they present the peptide-MHC complexes to
T cells of the adaptive immune system, activating the T cells to join in the battle
against the specific pathogen (Figure 24-11).

In addition to the complexes of MHC proteins and microbial peptides displayed
on their cell surface, activated dendritic cells also display cell-surface
co-stimulatory proteins that help activate T cells (see Figure 24-11). As we discuss
later, the activated dendritic cells also secrete a variety of cytokines that influence
the type of response that the T cells make, ensuring that it is appropriate to
fight the particular pathogen. In these ways, dendritic cells serve as crucial links
between the innate immune system, which provides a rapid first line of defense
against invading pathogens, and the adaptive immune system, which mounts
slower but more powerful and highly specific responses to attack an invader, as
we now discuss.

Figure 24-11 Dendritic cells as functional links between the innate and adaptive immune systems. Dendritic cells pick
up invading microbes or their products at the site of an infection. The microbial PAMPs activate the dendritic cells to express
co-stimulatory proteins and increased amounts of MHC proteins on their surface and to migrate via lymphatic vessels to a
nearby lymph node. In the lymph node, the activated dendritic cells activate T cells that express appropriate receptors for the
co-stimulatory proteins and the microbial peptides bound to MHC proteins on the dendritic cell surface. The activated T cells
proliferate, and some of their progeny migrate to the original site of infection, where they help eliminate the microbes, either by
activating local macrophages or by killing infected host cells (not shown). In addition, some of the activated T cells help stimulate
specific B cells in the lymph node to secrete antibodies against the microbe (not shown).

A crucial feature of dendritic cell activation is that the pathogen provides an individual dendritic cell with both the peptides
for presentation to T cells and the PAMP signals that activate the dendritic cell to express co-stimulatory proteins. In this way,
the individual dendritic cell has all it needs to activate specific T cells that recognize the peptide-MHC complexes on its surface
(Movie 24.3).

[Figure labels. Innate immune response: microbes enter through a break in the skin and are phagocytosed by a dendritic cell;
the activated dendritic cell carries microbial peptides to a local lymph node. Adaptive immune response: the activated dendritic
cell activates specific T cells to respond to microbial peptides bound to MHC proteins on the dendritic cell surface; activated
T cells migrate to the site of infection via the blood.]

Summary

All multicellular organisms possess innate immune defenses against invading
pathogens; these defenses include physical and chemical barriers and various
defensive cell responses. In vertebrates, these innate defense responses can also
recruit specific and more powerful adaptive immune responses to help fight the
infection. Innate immune responses rely on the ability of host cells to recognize
characteristic features of microbial molecules called pathogen-associated molecular
patterns, or PAMPs, which can be associated with a pathogen's proteins, lipids,
sugars, or nucleic acids. PAMPs are mainly recognized by pattern recognition
receptors (PRRs), including the Toll-like receptors (TLRs) found on or in both plant
and animal cells. In vertebrates, some PRRs are secreted and can activate complement
when they bind microbial PAMPs. The complement system, which can also
be activated by antimicrobial antibodies bound to pathogens, consists of a group
of blood proteins that are activated in sequence to help fight infections, by disrupting
the pathogen's membrane, stimulating an inflammatory response, or targeting
the microbe for phagocytosis, mainly by macrophages and neutrophils. The
phagocytes use a combination of degradative enzymes, antimicrobial peptides, and
oxygen-derived toxic molecules to kill invading pathogens; in addition, they secrete
various signal molecules that help trigger an inflammatory response.

Cells infected by a virus produce and secrete type I interferons (IFNα and IFNβ),
which induce a complex set of host-cell responses that inhibit viral replication. The
interferons also enhance the killing activity of natural killer (NK) cells. An NK cell
kills infected host cells because they express large amounts of surface proteins that
activate the NK cell; the killing is especially efficient when infected cells express
reduced amounts of class I MHC proteins, which, when present in normal amounts
on a host cell surface, inhibit the killing activity of NK cells.

Dendritic cells of the innate immune system functionally link innate immune
responses to adaptive immune responses. The cells become activated when their
PRRs pick up microbes and their products at sites of infection and phagocytose them.
The activated cells cleave the microbial proteins into peptide fragments, which bind
to newly made MHC proteins, which transport the fragments to the cell surface. The
activated dendritic cells then carry the peptide-MHC complexes to a lymph organ,
where they activate appropriate T cells to make specific adaptive immune responses
against the microbes.

OVERVIEW OF THE ADAPTIVE IMMUNE SYSTEM 

A dramatic “big bang” in immune defense mechanisms occurred when jawed vertebrates
evolved and acquired an adaptive immune system. This sophisticated
defense system depends on B and T lymphocytes (B and T cells), which, during
their development, rearrange particular DNA sequences in various combinations
so that, together, the cells can produce an almost limitless variety of B and T cell
receptors and antibodies. Collectively, these proteins can bind to essentially any
molecule, including small chemicals, carbohydrates, lipids, and proteins; individually,
they can distinguish between molecules that are very similar—such as
between two proteins that differ in only a single amino acid, or between two optical
isomers of the same small molecule. By this strategy, the adaptive immune
system can recognize and respond specifically to any pathogen, including new
mutant forms. However, because the genetic rearrangement process produces
both receptors that can bind to self molecules as well as receptors that can bind to
foreign molecules, vertebrates have had to evolve special mechanisms to ensure
that B and T cells do not react against the host's own molecules and cells—a process
called immunological self-tolerance.

Moreover, many harmless foreign substances enter the body, for example, as
food or inhaled material, and it would be pointless and potentially dangerous to
mount adaptive immune responses against them. Such inappropriate responses
are normally avoided because innate immune responses are required to call
adaptive immune responses into play and do so only when the innate cells' PRRs
recognize microbial PAMPs, as we discussed earlier. One can trick the adaptive
immune system into responding to a harmless foreign molecule, such as a foreign
protein, by co-injecting a molecule (often of microbial origin) called an adjuvant,
which activates PRRs. This trick is called immunization and is the basis of vaccination.
Any substance capable of stimulating B or T cells to make a specific adaptive
immune response against it is referred to as an antigen (antibody generator).

There are two broad classes of adaptive immune responses—antibody
responses and T-cell-mediated immune responses, and most pathogens induce
both classes of responses. In antibody responses, B cells are activated to secrete
antibodies, which are proteins that circulate in the bloodstream and permeate
the other body fluids, where they can bind specifically to the foreign antigen that
stimulated their production (see Figure 24-2). Binding of antibody neutralizes
extracellular viruses and microbial toxins (such as tetanus toxin or cholera toxin)
by blocking their ability to bind to receptors on host cells. Antibody binding also
marks invading pathogens for destruction, both by making it easier for phagocytes
of the innate immune system to ingest and destroy them and by activating
the complement system.

In T-cell-mediated immune responses, T cells recognize foreign antigens that
are bound to MHC proteins on the surface of host cells such as dendritic cells,
which are specialized for presenting antigen to T cells and are therefore referred
to as professional antigen-presenting cells (APCs). Because MHC proteins carry
fragments of pathogen proteins from inside a host cell to the cell surface, T cells
can detect pathogens hiding inside a host cell and either kill the infected cell (see
Figure 24-2) or stimulate phagocytes or B cells to help eliminate the pathogens.

In this section, we discuss the origins and general properties of B and T cells.
In later sections, we consider the specific properties and functions of these cells.





Figure 24-12 Human lymphoid organs.
Lymphocytes develop from lymphoid
progenitor cells in the thymus and bone
marrow (yellow), which are therefore called
central (or primary) lymphoid organs.
The newly formed lymphocytes migrate
from these primary organs to peripheral
(or secondary) lymphoid organs, where
they can react with foreign antigen. Only
some of the peripheral lymphoid organs
(blue) and lymphatic vessels (green) are
shown; many lymphocytes, for example,
are found in the skin and respiratory tract.
As we discuss later, the lymphatic vessels
ultimately empty into the bloodstream (not
shown).


B Cells Develop in the Bone Marrow, T Cells in the Thymus 

There are about 2 × 10^12 lymphocytes in the human body, making the immune
system comparable in cell mass to the liver or the brain. They occur in large numbers
in the blood and lymph (the colorless fluid in the lymphatic vessels, which
connect the lymph nodes in the body to each other and to the bloodstream). They
are also concentrated in lymphoid organs, such as the thymus, lymph nodes, and
spleen (Figure 24-12), and many are also found in other organs, including skin,
lung, and gut.

T cells and B cells derive their names from the organs in which they develop:
T cells develop in the thymus, and B cells, in adult mammals, develop in the bone
marrow. Both types of cells develop from lymphoid progenitor cells that are produced
from multipotent hematopoietic stem cells, which are found mainly in the
bone marrow (Figure 24-13). The hematopoietic stem cells give rise to more than
just lymphocytes: as discussed in Chapter 22, they produce all of the cells of the
hematopoietic system, including erythrocytes, leukocytes, and platelets (see Figure
22-32).

[Figure 24-13 diagram labels: bone marrow; thymus; peripheral lymphoid organs]

Figure 24-13 The development of
B and T cells. The central lymphoid
organs, where lymphocytes develop from
lymphoid progenitor cells, are labeled in
yellow boxes. The lymphoid progenitor cells
develop from multipotent hematopoietic
stem cells in the bone marrow. Some
lymphoid progenitor cells develop locally
in the bone marrow into immature B cells,
while others migrate to the thymus (via
the bloodstream) where they develop into
thymocytes (developing T cells). Foreign
antigens activate B cells and T cells mainly
in peripheral lymphoid organs, such as
lymph nodes or the spleen.

Because they are sites where lymphocytes develop from lymphoid progenitor
cells, the thymus and bone marrow are referred to as central (primary) lymphoid
organs (see Figure 24-12). As we discuss later, most B and T cells die in
the central lymphoid organs soon after they develop, without ever functioning.
Others, however, mature and migrate via the blood to the peripheral (secondary)
lymphoid organs—mainly the lymph nodes, spleen, and epithelium-associated
lymphoid tissues in the gastrointestinal tract, respiratory tract, and skin. It is in
these peripheral lymphoid organs that foreign antigens activate B and T cells (see
Figure 24-13).

B and T cells become morphologically distinguishable from each other only
after antigen has activated them: resting B and T cells look very similar, even in
an electron microscope (Figure 24-14A). After activation by an antigen, both proliferate
and mature into effector cells. Effector B cells secrete antibodies; in their
most mature form, called plasma cells, they are filled with an extensive rough
endoplasmic reticulum that is busily making antibodies (Figure 24-14B). In contrast,
effector T cells (Figure 24-14C) contain very little endoplasmic reticulum
and secrete a variety of cytokines rather than antibodies. Whereas B-cell-derived
antibodies are widely distributed by the bloodstream, T-cell-derived cytokines
mainly act locally on neighboring cells, although some are carried via the blood
and act on distant host cells.


Figure 24-14 Electron micrographs of
resting and effector lymphocytes.
(A) This resting lymphocyte could be
either a B cell or a T cell, as these cells
are difficult to distinguish morphologically
until antigen activates them to become
effector cells. (B) An effector B cell (a
plasma cell). It is filled with an extensive
rough endoplasmic reticulum (ER), which is
distended with antibody molecules that are
secreted in large amounts. (C) An effector
T cell, which has relatively little rough ER
but is filled with free ribosomes; it secretes
cytokines, but in relatively small amounts.
The three cells are shown at the same
magnification. (A, courtesy of Dorothy
Zucker-Franklin; B, courtesy of Carlo
Grossi; A and B, from D. Zucker-Franklin
et al., Atlas of Blood Cells: Function and
Pathology, 2nd ed. Milan, Italy: Edi. Ermes,
1988; C, courtesy of Stefanello de Pétris.)


Immunological Memory Depends On Both Clonal Expansion and
Lymphocyte Differentiation

The most remarkable feature of the adaptive immune system is that it can respond
to millions of different foreign antigens in a highly specific way. Human B cells, for
example, collectively, can make more than 10^12 different antibody molecules that
react specifically with the antigen that induced their production. How can B cells
and T cells respond specifically to such an enormous diversity of foreign antigens?
The answer for both B and T cells is the same. As each lymphocyte develops in
a central lymphoid organ, it becomes committed to react with a particular antigen
before ever being exposed to the antigen. It expresses this commitment in
the form of cell-surface receptors that specifically bind the antigen. When a lymphocyte
encounters its antigen in a peripheral lymphoid organ, the binding of the
antigen to the receptors (with help from co-stimulatory signals, discussed later)
activates the lymphocyte; this causes the lymphocyte to proliferate, thereby producing
many more cells with the same receptor—a process called clonal expansion.
The encounter with antigen also causes some of the cells to differentiate into
effector cells. An antigen therefore selectively stimulates those cells that express
complementary antigen-specific receptors and are thus already committed to
respond to it (Figure 24-15). This arrangement, called clonal selection, provides
an explanation for immunological memory, whereby we develop lifelong immunity
to many common infectious diseases after our initial exposure to the pathogen—either
through natural infection or vaccination.
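
The logic of clonal selection and clonal expansion described above can be sketched in a few lines of Python. This is only a toy illustration under stated assumptions: the receptor labels, the starting clone size of one cell, the matching rule, and the number of cell divisions are all hypothetical and are not taken from the text. The sketch shows only that antigen leaves uncommitted clones untouched while each selected clone doubles at every division.

```python
def clonal_selection(repertoire, binds_antigen, divisions):
    """Return a new repertoire after antigen-driven clonal expansion.

    repertoire:    dict mapping a receptor label to the number of cells in that clone
    binds_antigen: function(receptor_label) -> bool; a stand-in for receptor binding
    divisions:     assumed number of rounds of division for each selected clone
    """
    return {
        receptor: count * 2 ** divisions if binds_antigen(receptor) else count
        for receptor, count in repertoire.items()
    }


# Hypothetical repertoire: five clones, one cell each, before any antigen exposure.
repertoire = {f"receptor_{i}": 1 for i in range(5)}

# Assume (for illustration only) that antigen A is bound by receptor_2 alone.
after = clonal_selection(repertoire, lambda r: r == "receptor_2", divisions=10)

print(after["receptor_2"])  # 1024: the selected clone has expanded
print(after["receptor_0"])  # 1: clones that do not bind the antigen are unchanged
```

Real responses also involve differentiation into effector and memory cells and the death of most effector cells, none of which this sketch models.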

It is easy to demonstrate such immunological memory in experimental animals.
If an animal is immunized once with antigen A, an immune response (antibody,
T-cell-mediated, or both) can be detected after several days; the response
rises rapidly and exponentially, and then, more gradually, declines. This is the
characteristic course of a primary immune response, occurring on an animal's
first exposure to an antigen. If, after some weeks, months, or even years have
elapsed, the animal is immunized again with antigen A, it will usually produce
a secondary immune response that differs from the primary response: the lag
period is shorter, because there are now many more preexisting B or T cells (or
both) with specificity for antigen A, and the response is greater and more efficient.
These differences indicate that the animal has “remembered” its first exposure
to antigen A. If the animal is given a different antigen (for example, antigen B)
instead of a second immunization with antigen A, the response is typical of a primary,
and not a secondary, immune response. The secondary response therefore
reflects antigen-specific immunological memory for antigen A (Figure 24-16).

Immunological memory depends on both lymphocyte proliferation and
differentiation. In an adult animal, the peripheral lymphoid organs contain a
mixture of lymphocytes in at least three stages of maturation: naïve cells, effector
cells, and memory cells. When naïve cells encounter their specific foreign antigen
for the first time, the antigen stimulates some of them to proliferate and differentiate
into effector cells, which then carry out an immune response (effector B cells
secrete antibody, whereas effector T cells either kill infected cells or influence
the response of other immune cells—by secreting cytokines, for example). Some
of the antigen-stimulated naïve cells multiply and differentiate into memory
cells, which are more easily and more quickly induced to become effector cells
by a later encounter with the same antigen: like naïve cells, when memory cells
encounter their antigen, they give rise to either effector cells or more memory
cells (Figure 24-17).


Figure 24-15 Clonal selection. An antigen
activates only those lymphocytes that are
already committed to respond to it. The
committed cell expresses cell-surface
receptors that specifically recognize the
antigen. The human adaptive immune
system consists of many millions of
different T and B lymphocyte clones,
with cells within a clone expressing the
same unique antigen receptor. Before
its first encounter with antigen, a clone
would usually contain only one or a small
number of cells. A particular antigen may
activate hundreds of different clones, each
expressing a different antigen receptor that
binds either a different part of the antigen
or the same part with a different binding
affinity. Although only B cells are shown
here, T cells are selected in a similar way.
Note that the antigen receptors on the
B cells labeled β in this diagram have the
same antigen-binding site as the antibodies
secreted by the effector Bβ cells. As we
discuss later, B cells require co-stimulatory
signals from T cells to become activated by
antigen to proliferate and differentiate into
antibody-secreting cells (not shown).

Figure 24-16 Immunological memory:
primary and secondary antibody
responses. The secondary response
induced by a second exposure to antigen
A is faster and greater than the primary
response and is specific for A, indicating
that the adaptive immune system has
specifically remembered its previous
encounter with antigen A. The same type
of immunological memory is observed in
T-cell-mediated responses (not shown). As
we discuss later, the types of antibodies
produced in the secondary response
are different from those produced in the
primary response, and these antibodies
bind the antigen more tightly.


Thus, during the primary response, clonal expansion and differentiation of
antigen-stimulated naïve cells creates many memory cells, which are able to
respond to the same antigen more sensitively, rapidly, and effectively. And, unlike
most effector cells, which die within days or weeks, memory cells can persist for
the lifetime of the animal, even in the absence of their specific antigen, thereby
providing lifelong immunological memory. Although most effector B and T cells
die after an immune response is over, some survive as effector cells and help provide
long-term protection against the pathogen. A small proportion of the plasma
cells produced in a primary B cell response, for example, can survive for many
months or years in the bone marrow, where they continue to secrete their specific
antibodies into the bloodstream.


Lymphocytes Continuously Recirculate Through Peripheral 
Lymphoid Organs 

Pathogens generally enter the body through an epithelial surface, usually through
the skin, gut, or respiratory tract. To induce an adaptive immune response,
microbes or their products must travel from these entry points to a peripheral
lymphoid organ, such as a lymph node or the spleen, which are the sites where
lymphocytes are activated (see Figure 24-11). The route and destination depend
on the site of entry. Lymphatic vessels carry antigens that enter through the skin
or respiratory tract to local lymph nodes; antigens that enter through the gut end
up in gut-associated peripheral lymphoid organs such as Peyer's patches; and the
spleen filters out antigens that enter the blood (see Figure 24-12). As discussed
earlier (see Figure 24-11), in many cases, activated dendritic cells will carry the
antigen from the site of infection to the peripheral lymphoid organ, where they
play a crucial part in activating T cells, as we discuss later.

Figure 24-17 A model for the cellular basis of immunological memory.
When stimulated by their specific antigen and co-stimulatory signals, naïve
lymphocytes proliferate and differentiate. Most become effector cells, which
function and then usually die, while others become memory cells. During a
subsequent exposure to the same antigen, the memory cells respond more
readily, rapidly, and efficiently than did the naïve cells: they proliferate and give
rise to effector cells and to more memory cells. Some memory T cells also
develop from a minority of effector T cells (not shown). It is not known how
the decision to become an effector cell versus a memory cell is made.

But only a tiny fraction of naïve B and T cells can recognize a particular microbial
antigen in a peripheral lymphoid organ, a reasonable estimate being between
1/10,000 and 1/1,000,000 of each class of lymphocyte, depending on the antigen.
How do these rare cells find an antigen-presenting cell displaying their specific
antigen? The answer is that the lymphocytes continuously recirculate between
one peripheral lymphoid organ and another via the lymph and blood. In a lymph
node, for example, lymphocytes continually leave the bloodstream by squeezing
out between specialized endothelial cells lining small veins called postcapillary
venules. After percolating through the node, they accumulate in small lymphatic
vessels that leave the node and connect with other lymphatic vessels that pass
through other lymph nodes downstream (see Figure 24-12). Passing into larger
and larger vessels, the lymphocytes eventually enter the main lymphatic vessel
(the thoracic duct), which carries them back into the blood (Figure 24-18).
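
To get a feel for how rare antigen-specific naïve cells are, the frequency range quoted above (between 1/10,000 and 1/1,000,000) can be turned into expected cell counts. The following is a minimal arithmetic sketch; the pool size of 10^8 naïve cells is a hypothetical round number chosen only for illustration and does not come from the text.

```python
from fractions import Fraction


def expected_specific_cells(pool_size, frequency):
    """Expected number of cells in a pool that recognize one particular antigen."""
    return pool_size * frequency


# Frequency range quoted in the text for naive cells specific for a given antigen.
HIGH_FREQUENCY = Fraction(1, 10_000)
LOW_FREQUENCY = Fraction(1, 1_000_000)

# Hypothetical pool of naive cells of one class (an assumed value, for illustration).
pool_size = 10**8

print(expected_specific_cells(pool_size, HIGH_FREQUENCY))  # 10000
print(expected_specific_cells(pool_size, LOW_FREQUENCY))   # 100
```

Exact fractions are used instead of floating-point numbers so that the counts come out as whole numbers without rounding error.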

The continuous recirculation of a lymphocyte between the blood and lymph
ends only if its specific antigen activates it in a peripheral lymphoid organ. In that
case, the lymphocyte remains in the peripheral lymphoid organ, where it proliferates
and differentiates into either effector cells or memory cells. Many of the effector
T cells leave the lymphoid organ via the lymph and migrate through the blood
to the site of infection (see Figure 24-11), whereas others stay in the lymphoid
organ and help activate (or suppress) other immune cells there. Some effector
B cells (plasma cells) remain in the peripheral lymphoid organ and secrete antibodies
into the blood for days until they die; others migrate to the bone marrow,
where they secrete antibodies into the blood for months or years. The memory T
and B cells produced join the recirculating pool of lymphocytes.

Lymphocyte recirculation depends on specific interactions between the lymphocyte
cell surface and the surface of the endothelial cells lining the blood vessels
in the peripheral lymphoid organs. Lymphocytes that enter a lymph node
via the blood, for example, adhere weakly to specialized endothelial cells lining
the postcapillary venules via homing receptors that belong to the selectin family
of cell-surface lectins that bind to specific sugar groups on the endothelial cell
surface (see Figure 19-28). The lymphocytes roll slowly along the surface of the
endothelial cells until another, much stronger adhesion system, dependent on
an integrin protein, is called into play by chemokines secreted by the endothelial
cells. Now, the lymphocytes stop rolling, and they crawl out of the blood vessel
into the lymph node by using yet another cell adhesion protein called CD31
(Figure 24-19). Although B and T cells initially enter the same region of a lymph
node, different chemokines guide them to separate regions of the node—B cells to
lymphoid follicles and T cells to the paracortex (Figure 24-20).



[Figure 24-18 diagram labels: postcapillary venule; infected tissue; afferent]

Figure 24-18 The path followed by
lymphocytes as they continuously
recirculate between the lymph and
blood. The circulation through a lymph
node (yellow) is shown here. Microbial
antigens are usually carried into the
lymph node by activated dendritic cells
(not shown), which enter the node via
afferent lymphatic vessels draining an
infected tissue (green). B and T cells, by
contrast, enter via the blood, migrating out
of the bloodstream into the lymph node
through postcapillary venules. Unless
they encounter their antigen, the B and
T cells leave the lymph node via efferent
lymphatic vessels, which eventually join the
thoracic duct. The thoracic duct empties
into a large vein carrying blood to the heart,
completing the circulation cycle for T and
B cells. A typical circulation cycle for these
lymphocytes takes about 12-24 hours.


Figure 24-19 Migration of a lymphocyte out of the bloodstream into a lymph node.
A circulating lymphocyte adheres weakly to the surface of the specialized endothelial cells lining a
postcapillary venule in a lymph node. This initial adhesion is mediated by L-selectin (discussed in
Chapter 19) on the lymphocyte surface. The adhesion is sufficiently weak to enable the lymphocyte,
pushed by the flow of blood, to roll along the surface of the endothelial cells. Stimulated by
chemokines secreted by specialized endothelial cells in the node (curved red arrow), the lymphocyte
rapidly activates a stronger adhesion system, mediated by an integrin. This strong adhesion
enables the cell to stop rolling. The lymphocyte then uses an immunoglobulin-like cell adhesion
protein (CD31) to bind to the junctions between adjacent endothelial cells and migrate out of the
venule. The subsequent migration of the lymphocyte in the lymph node depends on chemokines
produced within the node (straight red arrow). The migration of other types of leukocytes out of the
bloodstream into sites of infection occurs in a similar way (see Figure 19-28 and Movie 19.2).

Figure 24-20 A simplified drawing of a
human lymph node. B cells are primarily
clustered in structures called lymphoid
follicles, whereas T cells are found mainly
in the paracortex. Chemokines attract both
types of lymphocytes into the lymph node
from the blood via postcapillary venules
(see Figure 24-19). B and T cells then
migrate to their respective areas, attracted
by different chemokines. If they do not
encounter their specific antigen, both B
cells and T cells then enter the medullary
sinuses and leave the node via the efferent
lymphatic vessel. This vessel ultimately
empties into the bloodstream, allowing
the lymphocytes to begin another cycle of
circulation through a peripheral lymphoid
organ (see Figure 24-18). During an
infection, proliferation of pathogen-specific
B cells produces a germinal center in some
lymphoid follicles.


Unless they encounter their antigen, both B and T cells soon leave the lymph
node via efferent lymphatic vessels. If they encounter their antigen, however, they
are stimulated to display adhesion receptors that trap the cells in the node; the
cells accumulate at the junction between the B cell and T cell areas, where the
rare antigen-specific B and T cells can interact, leading to their proliferation and
differentiation into either effector cells or memory cells. Many of the effector cells
leave the node, expressing different chemokine receptors that help guide them to
their new destinations—effector plasma B cells to the bone marrow and effector
T cells to sites of infection.

Immunological Self-Tolerance Ensures That B and T Cells Do Not
Attack Normal Host Cells and Molecules

As discussed earlier, cells of the innate immune system use PRRs to distinguish
microbial molecules from self molecules made by the host. The adaptive immune
system has the far more difficult recognition task of responding specifically to an
almost unlimited number of foreign molecules while not responding to the large
number of self molecules. How does it accomplish this feat? It helps that self molecules
normally do not induce the innate immune reactions required to activate
adaptive immune responses. But even when an infection or tissue injury triggers
innate reactions, the vast majority of self molecules normally still fail to induce an
adaptive immune response. Why?

One important reason is that the adaptive immune system “learns” not to
respond to self molecules. Normal mice, for example, cannot mount an immune
response against one of their own protein components of the complement system
called C5 (see Figure 24-7). However, mutant mice that lack the gene encoding C5
but are otherwise genetically identical to normal mice of the same strain can make
a strong immune response to this blood protein when immunized with it. The
immunological self-tolerance exhibited by normal mice persists only for as long
as the self molecule remains in the body: if a self molecule such as C5 is experimentally
removed from an adult mouse, the animal gains the ability to respond
to it after a few weeks or months, as new B and T cells develop in the absence of
C5. Thus, the adaptive immune system is genetically capable of responding to self
molecules, but it learns not to do so.

Self-tolerance depends on a number of distinct mechanisms, including the
following (Figure 24-21):

1. In receptor editing, developing B cells that recognize self molecules change
their antigen receptors so that the cells no longer do so.

2. In clonal deletion, potentially self-reactive B and T cells die by apoptosis
when they encounter their particular self molecule.

3. In clonal inactivation (also called clonal anergy), self-reactive B and T cells
become functionally inactivated when they encounter their self molecule.

4. In clonal suppression, self-reactive regulatory T cells (discussed later) suppress
the activity of other types of potentially self-reactive lymphocytes.

Some of these mechanisms—especially the first two, receptor editing in B cells
and clonal deletion of B and T cells—operate in central lymphoid organs when
newly formed self-reactive B and T cells first encounter their self molecules, and
they are largely responsible for the process called central tolerance. Clonal inactivation
and clonal suppression, by contrast, operate mainly when mature B and
T cells encounter their self molecules in peripheral lymphoid organs, and they
are largely responsible for the process called peripheral tolerance. Clonal deletion,
however, can also operate peripherally, and clonal inactivation can also operate
centrally.
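
The four mechanisms and their main sites of action, as described above, can be collected into a small lookup structure. This is only an organizational sketch: the class name, field names, and wording of each entry are illustrative choices that paraphrase the text rather than established terminology.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToleranceMechanism:
    name: str
    acts_on: str    # which lymphocytes the text says are affected
    main_site: str  # "central" or "peripheral": where it mainly operates
    outcome: str


# Entries paraphrase the four mechanisms listed in the text.
MECHANISMS = (
    ToleranceMechanism("receptor editing", "developing B cells", "central",
                       "antigen receptor changed so it no longer binds self"),
    ToleranceMechanism("clonal deletion", "B and T cells", "central",
                       "cell dies by apoptosis"),
    ToleranceMechanism("clonal inactivation", "B and T cells", "peripheral",
                       "cell becomes functionally inactivated"),
    ToleranceMechanism("clonal suppression", "other self-reactive lymphocytes",
                       "peripheral", "activity suppressed by regulatory T cells"),
)


def mechanisms_at(site):
    """Return the names of mechanisms that mainly operate at the given site."""
    return [m.name for m in MECHANISMS if m.main_site == site]


print(mechanisms_at("central"))     # ['receptor editing', 'clonal deletion']
print(mechanisms_at("peripheral"))  # ['clonal inactivation', 'clonal suppression']
```

The main_site field records only where each mechanism mainly operates; as noted above, clonal deletion can also occur peripherally and clonal inactivation centrally, so a fuller model would allow more than one site per mechanism.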

Why does the binding of a self molecule lead to tolerance rather than activation?
The answer is still not completely known. As we discuss later, the activation
of a B or T cell by its antigen in a peripheral lymphoid organ requires more than
just antigen binding: it requires co-stimulatory signals, which are provided by a
helper T cell (discussed later) in the case of a B cell and by an activated dendritic
cell in the case of a naïve T cell. The production of such signals is usually triggered
by exposure to a pathogen, but a self-reactive lymphocyte normally encounters
its self antigen in the absence of such signals. Under these conditions, the lymphocyte
will not only fail to be activated, it will often be rendered tolerant—being
either killed or inactivated, or actively suppressed by a regulatory T cell (see Figure
24-21). In peripheral lymphoid organs, both T cell tolerance and activation
usually occur on the surface of a dendritic cell.

For reasons that are usually unknown, self-tolerance mechanisms sometimes
fail, causing T or B cells (or both) to react against the animal's own molecules.
Myasthenia gravis is an example of such an autoimmune disease. Most of the
affected individuals make antibodies against the acetylcholine receptors on their
own skeletal muscle cells; these receptors are required for the muscle to contract
normally in response to nerve stimulation, which releases acetylcholine (see Figure
11-39). The antibodies interfere with the normal functioning of the receptors
so that the patients become weak and may die because they cannot breathe. Similarly,
in juvenile (type 1) diabetes, adaptive immune reactions against insulin-secreting
β cells in the pancreas kill these cells, leading to severe insulin deficiency.

Figure 24-21 Mechanisms of immunological self-tolerance. When a self-reactive immature B cell binds its self molecule in the central lymphoid
organ where the cell is produced, it may alter its antigen receptor so that it is no longer self-reactive (cell 1); this process is called receptor editing.
Alternatively, when either an immature B or T cell binds its self molecule in a central lymphoid organ, it may die by apoptosis, a process called clonal
deletion (cell 2). Because these two forms of tolerance (shown on the left) occur in central lymphoid organs, they are called central tolerance.

When a self-reactive naïve B or T cell escapes tolerance in the central lymphoid organ and binds its self molecule in a peripheral lymphoid organ
(cell 4), or in another peripheral tissue, it will generally not be activated, because the binding usually occurs in the absence of sufficient co-stimulatory
signals; instead, the cell may die by apoptosis (often after a period of proliferation), be inactivated, or be suppressed by a regulatory T cell. These
forms of tolerance (shown on the right) are called peripheral tolerance. As discussed later, the cells providing the co-stimulatory signals are
T lymphocytes for B cells and usually dendritic cells for T cells (not shown). For T cells at least, both activation and tolerance in a peripheral lymphoid
organ usually occur on the surface of a dendritic cell, although the dendritic cells are different in the two cases.

Summary 

Innate immune responses triggered by pathogens at sites of infection help activate 
adaptive immune responses in peripheral lymphoid organs. The adaptive immune 
system is composed of many millions of B and T cell clones, with the cells in each 
clone sharing a unique cell-surface receptor that enables them to bind a particular 
pathogen antigen. The binding of antigen to these receptors, with the help of 
co-stimulatory signals, stimulates the lymphocyte to proliferate and differentiate 
into an effector cell that can help eliminate the pathogen. Effector B cells secrete 
antibodies, which can act over long distances to help eliminate extracellular 
pathogens and their toxins. Effector T cells, by contrast, produce cell-surface and 
secreted co-stimulatory molecules, which mainly act locally to help other immune 
cells eliminate the pathogen; in addition, some T cells can induce infected host cells 
to kill themselves. 

During a primary adaptive immune response to an antigen, lymphocytes that 
recognize the antigen proliferate so that there are more of them to respond the next 
time, during a secondary response to the same antigen; moreover, during a primary 
response, some lymphocytes differentiate into memory cells, which can respond 
faster and more efficiently the next time the same pathogen invades. These two 
mechanisms are largely responsible for immunological memory. Both B and T cells 
circulate continuously between one peripheral lymphoid organ and another via the 
blood and lymph; only if they encounter their specific foreign antigen in a peripheral 
lymphoid organ do they stop migrating, proliferate, and differentiate into effector 
cells or memory cells. Lymphocytes that would react against self molecules either 
alter their receptors (in the case of B cells) or are eliminated or inactivated; they can 
also be suppressed by regulatory T cells. These mechanisms collectively are 
responsible for immunological self-tolerance, which ensures that the adaptive immune 
system normally avoids attacking the molecules and cells of the host. 

B CELLS AND IMMUNOGLOBULINS 

Vertebrates inevitably die of infection if they are unable to make antibodies. 
Antibodies are secreted proteins that defend us against extracellular pathogens 
in several ways. They bind to viruses and microbial toxins, thereby preventing 
them from binding to and damaging host cells (see Figure 24-2). When bound 
to an extracellular pathogen or its products, antibodies also recruit some of the 
components of the innate immune system, including various types of leukocytes 
and components of the complement system, which work together to inactivate or 
eliminate the invaders. 

Synthesized exclusively by B cells, antibodies are produced in billions of forms, 
each with a different amino acid sequence. They belong to the class of proteins 
called immunoglobulins (abbreviated as Igs) and are among the most abundant 
protein components in the blood. In this section, we discuss the structure and 
function of immunoglobulins and how they are made in so many different forms. 

B Cells Make Immunoglobulins (Igs) as Both Cell-Surface Antigen 
Receptors and Secreted Antibodies 

The first Igs made by a newly formed B cell are not secreted but are instead 
inserted into the plasma membrane, where they serve as receptors for antigen. 
They are called B cell receptors (BCRs), and each B cell has approximately 10⁵ of 
them in its plasma membrane. Each BCR is stably associated with invariant 
transmembrane proteins that activate intracellular signaling pathways when antigen 
binds to the BCR; we discuss these invariant proteins later, when we consider how 
B cells are activated with the assistance of helper T cells. 

Each B cell clone produces a single species of BCR, with a unique antigen-binding 
site. When an antigen and a helper T cell activate a naïve or a memory 
B cell, the B cell proliferates and differentiates into an effector cell, which then 
produces and secretes large amounts of soluble (rather than membrane-bound) 
Ig. The secreted Ig is now called an antibody, and it has the same unique 
antigen-binding site as the BCR (Figure 24-22). 

A typical Ig molecule is bivalent, with two identical antigen-binding sites. It 
consists of four polypeptide chains: two identical light chains and two identical 
heavy chains. The N-terminal parts of both light and heavy chains usually cooperate 
to form the antigen-binding surface, while the more C-terminal parts of the 
heavy chains form the tail of the Y-shaped protein (Figure 24-23). The tail mediates 
many of the activities of antibodies, and antibodies with the same antigen-binding 
sites can have any one of a number of different tail regions, each of which 
gives the antibody different functional properties, such as the ability to activate 
complement or to bind to receptor proteins on phagocytic cells that bind a specific 
type of antibody tail. 

Mammals Make Five Classes of Igs 

In mammals, there are five major classes of Igs, each of which mediates a 
characteristic biological response following antigen binding to an antibody: IgA, IgD, 
IgE, IgG, and IgM, each with its own class of heavy chain (α, δ, ε, γ, and μ, 
respectively). IgA molecules have α chains, IgG molecules have γ chains, and so on. 
Moreover, there are four human IgG subclasses (IgG1, IgG2, IgG3, and IgG4), with 
γ1, γ2, γ3, and γ4 heavy chains, respectively. There are also two IgA subclasses in 
humans. In addition to the various classes and subclasses of heavy chains, higher 
vertebrates have two types of light chains, κ and λ, which seem to be functionally 
indistinguishable. Either type of light chain may be associated with any of the 
heavy chains, but an individual Ig molecule always contains identical light chains 
and identical heavy chains: an IgG molecule, for instance, may have either κ or 
λ light chains, but not one of each. As a result, an Ig's antigen-binding sites are 
always identical (see Figure 24-22). 

The various heavy chains give a distinctive conformation to the tail region of 
antibodies, so that each class (and subclass) has characteristic properties of its 
own. IgM is always the first class of Ig that a developing B cell in the bone marrow 
makes. It forms the BCRs on the surface of immature naïve B cells. After these cells 
leave the bone marrow, they start to produce IgD BCRs as well, with the same 
antigen-binding site as the IgM BCRs. These cells are now called mature naïve B 
cells, as they can now respond to their specific foreign antigen in peripheral 
lymphoid organs (Figure 24-24). IgM is also the major class of antibody secreted into 
the blood in the early stages of a primary antibody response on first exposure to 
an antigen. In its secreted form, IgM is a wheel-like pentamer composed of five 
four-chain units, giving it a total of 10 antigen-binding sites that allow it to bind 
strongly to pathogens; in its antigen-bound form, IgM is highly efficient at 
activating complement, which is important in early antibody responses to pathogens. 

Figure 24-22 The B cell receptors (BCRs) and secreted antibodies made 
by a B cell clone. The binding of an antigen to BCRs on either a naïve or 
memory B cell (together with co-stimulatory signals provided by helper T cells, 
not shown) activates the cell to proliferate and differentiate into effector B cells. 
The effector cells produce and secrete antibodies with a unique antigen-binding 
site, which is the same as that of the cell-surface BCRs. Because antibodies have 
two identical antigen-binding sites, they can cross-link antigens, as shown for an 
antigen with multiple identical antigenic determinants. 

Figure 24-23 A schematic drawing of a bivalent antibody molecule. The two 
heavy chains each have a hinge region, which, because of its flexibility, improves 
the efficiency with which the antibody can cross-link antigens (see Figure 24-22). The 
two heavy chains also form the tail of the antibody, which determines its functional 
properties. The heavy and light chains are held together by both covalent S-S bonds 
(red) and noncovalent bonds (not shown). 

The major antibody class in the blood is IgG. These antibodies are four-chain 
monomers (see Figure 24-23), and they are produced in especially large 
quantities during secondary antibody responses. The tail region of some subclasses 
of IgG antibodies that are bound to antigen can activate complement and also 
bind to specific receptors on macrophages and neutrophils. Largely by means of 
such Fc receptors (so-named because antibody tails are called Fc regions), these 
phagocytic cells bind, ingest, and destroy infecting microorganisms that have 
become coated with the IgG antibodies produced in response to the infection; 
the activated Fc receptors also signal the phagocyte to secrete pro-inflammatory 
cytokines (Movie 24.4). 

The tail region of IgE antibodies binds to another class of Fc receptors on the 
surface of mast cells in tissues and of basophils in the blood. Because antigen-free 
IgE antibodies bind with high affinity to such Fc receptors, the antibodies act as 
antigen receptors on these cells. Antigen binding to the bound antibodies 
activates the Fc receptors and stimulates the cells to secrete a variety of cytokines 
and biologically active amines, especially histamine, which causes blood vessels 
to dilate and become leaky; this helps leukocytes, antibodies, and complement 
components to enter sites where mast cells have been activated. The release of 
amines from mast cells and basophils is largely responsible for the symptoms 
of such allergic reactions as hay fever, asthma, and hives. In addition, mast cells 
secrete factors that attract and activate leukocytes called eosinophils, which also 
have Fc receptors that bind IgE molecules and can kill extracellular parasitic 
worms, especially if the worms are coated with IgE antibodies (see Figure 24-6). 

IgA is the principal antibody class in secretions, including saliva, tears, milk, 
and respiratory and intestinal secretions. Yet another class of Fc receptors, located 
on the relevant epithelial cells, guides the secretion by binding antigen-free IgA 
dimers and transporting them across the epithelium. The properties of the various 
classes of antibodies in humans are summarized in Table 24-2. 

All classes of Ig can be made in a membrane-bound form, as well as in a 
soluble, secreted form. The two forms differ only in the C-terminus of their heavy 
chain. The heavy chains of membrane-bound Ig molecules (BCRs) have a 
transmembrane hydrophobic C-terminus, which anchors them in the lipid bilayer of 
the B cell's plasma membrane. The heavy chains of secreted antibody molecules, 
by contrast, have instead a hydrophilic C-terminus, which allows them to escape 
from the cell. The switch in the character of the Ig molecules made occurs because 
the activation of B cells by antigen and helper T cells induces a change in the way 
in which the heavy-chain RNA transcripts are made and processed in the nucleus 
(see Figure 7-59). 


Figure 24-24 Stages of B cell development. All of the stages shown 
occur before the cells bind their specific antigen. The first cells in the B cell lineage 
that make Ig are called pro-B cells; they make μ heavy chains, which remain in the 
endoplasmic reticulum until a special type of light chain is made called a surrogate 
light chain. The surrogate light chains substitute for genuine light chains and 
assemble with μ chains to form a receptor molecule that inserts into the plasma 
membrane. The cells are now called pre-B cells. Signaling from this pre-B cell receptor 
allows the cells to make bona fide light chains, which combine with μ chains to 
form four-chain IgM molecules that serve as cell-surface BCRs on immature naïve 
B cells. After these cells leave the bone marrow, they start to express IgD BCRs as 
well, which have the same antigen-binding sites as the IgM BCRs; it is this mature 
naïve B cell that reacts with its specific foreign antigen in peripheral lymphoid 
organs. 

Table 24-2 Properties of the Major Classes of Antibodies in Humans 

                                         Class of antibody
Properties                               IgM                   IgD     IgG                  IgA      IgE
Heavy chains                             μ                     δ       γ                    α        ε
Light chains                             κ or λ                κ or λ  κ or λ               κ or λ   κ or λ
Number of four-chain units               5                     1       1                    1 or 2   1
Percentage of total Ig in blood          10                    <1      75                   15       <1
Activates classical complement pathway   +                     -       + (some subclasses)  -        -
Crosses from mother to fetus             -                     -       + (some subclasses)  -        -
Binds to macrophages and neutrophils     + (macrophages only)  -       + (some subclasses)  +        -
Binds to mast cells and basophils        -                     -       + (some subclasses)  -        +
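
As a rough cross-check of the valency figures in Table 24-2, the following Python sketch counts the antigen-binding sites per secreted molecule, assuming two binding sites per four-chain unit as described in the text. The dictionary and function names are hypothetical and chosen only for illustration.

```python
# Number of four-chain units per secreted molecule, taken from Table 24-2.
# IgA is listed as "1 or 2", so each entry is a tuple of possibilities.
FOUR_CHAIN_UNITS = {
    "IgM": (5,),
    "IgD": (1,),
    "IgG": (1,),
    "IgA": (1, 2),
    "IgE": (1,),
}

SITES_PER_UNIT = 2  # each four-chain unit is bivalent


def binding_sites(ig_class):
    """Return the possible numbers of antigen-binding sites for one Ig class."""
    return tuple(units * SITES_PER_UNIT for units in FOUR_CHAIN_UNITS[ig_class])


if __name__ == "__main__":
    for ig_class in FOUR_CHAIN_UNITS:
        print(ig_class, binding_sites(ig_class))
```

For the pentameric IgM this reproduces the 10 binding sites mentioned earlier; for IgA it gives 2 or 4, depending on whether the molecule is a monomer or a dimer.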


Ig Light and Heavy Chains Consist of Constant and Variable 
Regions 

Both light and heavy chains have a variable amino acid sequence at their N-terminal 
ends but a constant sequence at their C-terminal ends. Whereas the constant 
region and variable region of a light chain are the same size, the constant region 
of a heavy chain is about three or four times longer, depending on the class 
(Figure 24-25). 

The variable regions of the light and heavy chains come together to form the 
antigen-binding sites, and the variability of their amino acid sequences provides 
the structural basis for the diversity of these binding sites. The greatest diversity 
occurs in three small hypervariable regions in the variable regions of both light 
and heavy chains. Only about 5-10 amino acids in each hypervariable region form 
the actual antigen-binding site (Figure 24-26). As a result, the size of the 
antigenic determinant that an Ig molecule recognizes is generally comparably small: 
it can consist of fewer than 10 amino acids on the surface of a globular protein, for 
example (see Figure 24-22). 

Both light and heavy chains are made up of repeating segments, each about 
110 amino acids long and each containing one intrachain disulfide bond. Each 
repeating segment folds independently to form a compact functional unit called 
an immunoglobulin (Ig) domain. As shown in Figure 24-27A, a light chain 
consists of one variable (VL) and one constant (CL) domain, whereas a heavy chain 
has one variable and three or four constant domains: the variable domains of the 
light and heavy chains pair to form the antigen-binding region. Each Ig domain 
has a very similar three-dimensional structure, consisting of a sandwich of two β 
sheets held together by a disulfide bond; the variable domains are unique in that 
each has its particular set of hypervariable regions, which are arranged in three 
hypervariable loops that cluster together at the ends of the variable domains to 
form the antigen-binding site (Figure 24-27B). 

[Figure 24-25 diagram: a light chain, drawn from its H2N end to its COOH end, with a variable region followed by a constant region (κ type or λ type); a heavy chain, drawn from H2N to COOH, with a variable region followed by a longer constant region (α, δ, ε, γ, or μ type).] 

Figure 24-25 Constant and variable regions of Ig chains. The variable regions of the light 
and heavy chains form the antigen-binding sites, while the constant regions of the heavy chains 
determine the other biological properties of an Ig protein. The different subclasses of IgG antibodies 
have different γ-chain constant regions. 

Figure 24-26 Ig hypervariable regions. Highly schematized drawing of how the 
three hypervariable regions in each light and heavy chain together form each 
antigen-binding site of an Ig protein. 

Ig Genes Are Assembled From Separate Gene Segments During 
B Cell Development 

Even in the absence of antigen stimulation, a human can probably make more 
than 10¹² different Ig molecules: its preimmune, primary Ig repertoire. The primary 
repertoire consists of IgM and IgD proteins and is apparently large enough 
to ensure that there will be an antigen-binding site to fit almost any potential 
antigenic determinant, albeit with low affinity (Ka ≈ 10⁵ to 10⁷ liters/mole). After 
stimulation by antigen and helper T cells, B cells can switch from making IgM and 
IgD to making other classes of Ig, a process called class switching. In addition, the 
binding affinity of these Igs for their antigen progressively increases over time, a 
process called affinity maturation. Thus, antigen stimulation generates a secondary 
Ig repertoire, with a greatly increased affinity (Ka up to 10¹¹ liters/mole) and 
diversity of both Ig classes and antigen-binding sites. 

Figure 24-27 Ig domains. (A) The light and heavy chains in an Ig protein are each folded into similar repeating domains. The variable domains 
(shaded in blue) of the light and heavy chains (VL and VH) make up the antigen-binding sites, while the constant domains (shaded in gray) of the 
heavy chains (mainly CH2 and CH3) determine the other biological properties of the protein. The heavy chains of IgM and IgE do not have a hinge 
region and have an extra constant domain (CH4). Hydrophobic interactions between domains on adjacent chains help hold the chains together in the 
Ig molecule: VL binds to VH, CL binds to CH1, and so on. (B) X-ray crystallography-based structures of the Ig domains of a light chain (Movie 24.5). 
Both the variable and constant domains have a similar overall structure, consisting of two β sheets joined by a disulfide bond (red). Note that all the 
hypervariable regions (black) form loops at the far end of the variable domain, where they come together to form part of the antigen-binding site. All 
Igs are glycosylated on their CH2 domains (not shown); the attached oligosaccharide chains vary from Ig to Ig and can greatly influence the biological 
properties of the protein, largely by affecting its binding to Fc receptors on immune cells. 

How can each of us make so many different Igs? The problem is not quite as 
formidable as it might first appear. Recall that the variable regions of the Ig light 
and heavy chains usually combine to form the antigen-binding site. Thus, if we 
had 1000 genes encoding light chains and 1000 genes encoding heavy chains, we 
could, in principle, combine their products in 1000 × 1000 different ways to make 
10⁶ different antigen-binding sites. Nonetheless, we have evolved special genetic 
mechanisms to enable our B cells to generate an almost unlimited number of 
different light and heavy chains in a remarkably economical way. We do so in two 
steps. First, before antigen stimulation, developing B cells join together separate 
gene segments in DNA to create the genes that encode the primary repertoire of 
low-affinity IgM and IgD proteins. Second, after antigen stimulation, the assembled 
Ig genes can undergo two further changes: mutations that can increase the 
affinity of their antigen-binding site and DNA rearrangements that switch the 
class of Ig made. Together, these changes produce the secondary repertoire of 
high-affinity IgG, IgE, and IgA proteins. 

We produce our primary Ig repertoire by joining separate Ig gene segments 
together during B cell development. Each type of Ig chain (κ light chains, λ light 
chains, and heavy chains) is encoded by a separate locus on a separate 
chromosome. Each locus contains a large number of gene segments encoding the V region 
of an Ig chain, and one or more gene segments encoding the C region. During the 
development of a B cell in the bone marrow, a complete coding sequence for each 
of the two Ig chains to be synthesized is assembled by site-specific genetic 
recombination (discussed in Chapter 5). Once a V-region coding sequence is assembled 
next to a C-region sequence, it can then be co-transcribed and the resulting RNA 
transcript processed to produce an mRNA molecule that codes for the complete 
Ig polypeptide chain. 

Each light-chain V region, for example, is encoded by a DNA sequence 
assembled from two gene segments: a long V gene segment and a short joining or 
J gene segment (Figure 24-28). Each heavy-chain V region is similarly 
constructed by combining gene segments, but here an additional diversity segment, or 
D gene segment, is also required (Figure 24-29). In addition to bringing together 
the separate gene segments of the Ig gene, these rearrangements also activate 
transcription from the gene promoter through changes in the relative positions of 
the cis-regulatory DNA sequences acting on the gene. Thus, a complete Ig chain 
can be synthesized only after the DNA has been rearranged. 
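
The sequence of events for a κ light chain (DNA rearrangement, then transcription, then RNA splicing) can be sketched as a toy model in Python. This is only an illustration under simplifying assumptions: segments are represented by labels rather than sequences, the V and J segments are picked uniformly at random, and the function name and the "intron" and "C" labels are hypothetical.

```python
import random


def rearrange_light_chain(v_segments, j_segments, rng):
    """Join one randomly chosen V segment to one randomly chosen J segment.

    Returns (rearranged_dna, mrna), each as a list of segment labels.
    """
    v = rng.choice(v_segments)
    j_index = rng.randrange(len(j_segments))
    j = j_segments[j_index]

    # In the rearranged DNA the chosen V lies directly next to the chosen J.
    # J segments downstream of the chosen one, the intron, and the C-region
    # coding sequence are still present.
    rearranged_dna = [v, j] + j_segments[j_index + 1:] + ["intron", "C"]

    # RNA splicing removes the extra J segments and the intron, leaving
    # contiguous V, J, and C sequences in the mRNA.
    mrna = [v, j, "C"]
    return rearranged_dna, mrna


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is repeatable
    v_segs = [f"V{i}" for i in range(1, 36)]  # about 35 functional V segments
    j_segs = [f"J{i}" for i in range(1, 6)]   # 5 J segments
    dna, mrna = rearrange_light_chain(v_segs, j_segs, rng)
    print("rearranged DNA:", dna)
    print("mRNA:", mrna)
```

A heavy chain would need one more step in such a model, because a D segment must first be joined to a J segment before a V segment is joined to the rearranged DJ unit.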

The large number of inherited V, J, and D gene segments available for 
encoding Ig chains contributes substantially to Ig diversity, and the combinatorial 
joining of these segments (called combinatorial diversification) greatly increases this 
contribution. Any of the 35 or so functional V segments in our κ light-chain locus, 
for example, can be joined to any of the 5 J segments (see Figure 24-28), so that 
this locus can encode at least 175 (35 × 5) different κ-chain V regions. Similarly, 
any of the 40 V segments in the human heavy-chain locus can be joined to any of 
the 23 or so D segments and to any of the 6 J segments to encode at least 5520 
(40 × 23 × 6) different heavy-chain V regions. By this mechanism alone, called V(D)J 
recombination, a human can produce 295 different VL regions (175 κ and 120 λ) 
and 5520 different VH regions. In principle, these could then be combined to make 
over 1.5 × 10⁶ (295 × 5520) different antigen-binding sites. 
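
The arithmetic behind these estimates is easy to reproduce. The short Python sketch below simply multiplies the approximate segment counts quoted in the text; the variable names are illustrative, and the λ-locus figure is taken as the stated total of 120 because the text does not break it down into V and J segments.

```python
# Approximate numbers of functional gene segments quoted in the text.
KAPPA_V, KAPPA_J = 35, 5
HEAVY_V, HEAVY_D, HEAVY_J = 40, 23, 6
LAMBDA_V_REGIONS = 120  # only the total is given for the lambda locus

kappa_v_regions = KAPPA_V * KAPPA_J                   # 35 x 5
heavy_v_regions = HEAVY_V * HEAVY_D * HEAVY_J         # 40 x 23 x 6
light_v_regions = kappa_v_regions + LAMBDA_V_REGIONS  # kappa plus lambda
binding_sites = light_v_regions * heavy_v_regions     # any light with any heavy

print("kappa V regions:", kappa_v_regions)
print("heavy-chain V regions:", heavy_v_regions)
print("light-chain V regions:", light_v_regions)
print("possible antigen-binding sites:", binding_sites)
```

The product comes to 1,628,400, consistent with the figure of "over 1.5 × 10⁶" given above.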

Figure 24-28 The V-J joining process involved in making a human 
κ light chain. In the "germ-line" DNA (where the Ig gene segments are not 
rearranged and are therefore not being expressed), the cluster of five J gene 
segments is separated from the C-region coding sequence by a short intron and from 
the 35 or so functional V gene segments by thousands of nucleotide pairs. During 
the development of a B cell, a randomly chosen V gene segment (V3 in this case) 
is moved to lie precisely next to one of the J gene segments (J3 in this case). The 
"extra" J gene segments (J4 and J5) and the intron sequence are transcribed (along 
with the joined V3 and J3 gene segments and the C-region coding sequence) and 
then removed by RNA splicing to generate mRNA molecules with contiguous V3, 
J3, and C sequences, as shown. These mRNAs are then translated into κ light 
chains. A J gene segment encodes the 15 or so C-terminal amino acids of the 
V region, and a short sequence containing the V-J segment junction encodes the third 
hypervariable region, which is the most variable part of the light-chain V region. 

V(D)J recombination is mediated by an enzyme complex called V(D)J 
recombinase, which recognizes recombination signal sequences in the DNA that flanks 
each gene segment to be joined. Although the process ensures that only 
appropriate gene segments recombine, a variable number of nucleotides are often lost 
from the ends of the recombining gene segments, and one or more randomly 
chosen nucleotides are also inserted. This random loss and gain of nucleotides 
at joining sites is called junctional diversification, and it enormously increases 
the diversity of V-region coding sequences created by V(D)J recombination (up 
to about 10⁸-fold), specifically in the third hypervariable region. This increased 
diversification comes at a price, however. In many cases, it shifts the reading frame 
to produce a nonfunctional gene, in which case the developing B cell fails to make 
a functional Ig molecule and consequently dies in the bone marrow. Once a B cell 
makes a functional heavy chain and light chain that form an antigen-binding site, it 
turns off the V(D)J recombination process, thereby ensuring that the cell makes Ig of 
only one antigen-binding specificity. 
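
Why junctional diversification so often produces a nonfunctional gene can be seen with a little counting. The Python sketch below is a deliberately crude model, not a description of the real enzymology: it assumes that every combination of deleted and inserted nucleotides up to a chosen maximum is equally likely, it ignores stop codons, and it treats a junction as in frame only when the net change in length is a multiple of three. The function name and parameters are hypothetical.

```python
from fractions import Fraction


def in_frame_fraction(max_deleted, max_inserted):
    """Fraction of (deleted, inserted) combinations that keep the reading frame.

    Every combination from 0..max_deleted lost nucleotides and
    0..max_inserted added nucleotides is counted once (an assumption).
    """
    total = 0
    in_frame = 0
    for deleted in range(max_deleted + 1):
        for inserted in range(max_inserted + 1):
            total += 1
            # The frame is preserved only if the net length change is a
            # multiple of three nucleotides.
            if (inserted - deleted) % 3 == 0:
                in_frame += 1
    return Fraction(in_frame, total)


if __name__ == "__main__":
    print(in_frame_fraction(5, 5))
```

Under these assumptions only about one junction in three remains in frame, which fits the statement that in many cases the rearrangement yields a nonfunctional gene.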

B cells making BCRs that bind strongly to self antigens in the bone marrow 
would be dangerous. Such B cells maintain expression of an active V(D)J 
recombinase and are activated by such self-binding to undergo a second round of 
V(D)J recombination in a light-chain locus, thereby changing the specificity of their 
BCRs (the process of receptor editing discussed earlier); self-reactive B cells that 
fail to change their specificity die by apoptosis, in the process of clonal deletion 
(see Figure 24-21). 

Antigen-Driven Somatic Hypermutation Fine-Tunes Antibody 
Responses 

As mentioned earlier, with the passage of time following an infection or 
vaccination, there is usually a progressive increase in the affinity of the antibodies 
produced against the pathogen. This phenomenon of affinity maturation is due to 
the accumulation of point mutations in both heavy-chain and light-chain 
V-region coding sequences. The mutations occur long after the coding regions have 
been assembled. After B cells have been stimulated by antigen and helper T cells 
in a peripheral lymphoid organ, some of the activated B cells proliferate rapidly 
in the lymphoid follicles and form germinal centers (see Figure 24-20). Here, the 
B cells mutate at the rate of about one mutation per V-region coding sequence 
per cell generation. Because this is about a million times greater than the 
spontaneous mutation rate in other genes and occurs in somatic cells rather than germ 
cells, the process is called somatic hypermutation. 

Figure 24-29 The human heavy-chain locus. There are 40 V segments, about 23 
D segments, 6 J segments, and an ordered cluster of C-region coding sequences, 
each cluster encoding a different class of heavy chain. The D segment (and part 
of the J segment) encodes amino acids in the third hypervariable region, which 
is the most variable part of the heavy-chain V region. The genetic mechanisms 
involved in producing a heavy chain are the same as those shown in Figure 
24-28 for light chains, except that two DNA rearrangement steps are required 
instead of one: first a D segment joins to a J segment, and then a V segment 
joins to the rearranged DJ segment. The rearrangements lead to the production of 
a VDJC mRNA that encodes a complete Ig heavy chain. The figure is not drawn to 
scale: the total length of the heavy-chain locus is over two megabases. Moreover, a 
number of details are omitted: for example, the exons encoding each C-region Ig 
domain and the hinge region (see Figure 24-27) and the different subclasses of 
Cγ-coding segments are not shown. 

Very few of the altered Igs generated by hypermutation will have an increased 
affinity for the antigen. But, because the same Ig genes produce both BCRs and 
secreted antibodies, the antigen will stimulate preferentially those few B cells that 
do make BCRs with increased affinity for the antigen. Clones of these altered B 
cells will preferentially survive and proliferate, especially as the amount of antigen 
decreases to very low levels late in the response. Most other B cells in the germinal 
center will die by apoptosis. Thus, as a result of repeated cycles of somatic 
hypermutation followed by antigen-driven proliferation of selected clones of effector 
and memory B cells, antibodies of increasingly higher affinity become abundant 
during an adaptive immune response, providing progressively better protection 
against the pathogen (Movie 24.6). 
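
The logic of repeated mutation and selection can be illustrated with a small simulation. The Python sketch below is a toy model only: it represents affinity as a single number, assumes that each mutation changes affinity by a random amount that is usually harmful, and lets a fixed number of the highest-affinity cells survive each cycle. None of these parameters comes from the text, and the function name is hypothetical.

```python
import random


def affinity_maturation(initial, cycles, rng):
    """Return the mean affinity of the B cell population after each cycle."""
    population = list(initial)
    pool_size = len(population)  # the surviving pool stays the same size
    history = [sum(population) / pool_size]
    for _ in range(cycles):
        # Hypermutation: every cell produces two mutant daughters. Most
        # mutations lower affinity (mean change below zero), a few raise it.
        mutants = [a + rng.gauss(-0.5, 1.0) for a in population for _ in range(2)]
        # Selection: unmutated and mutant cells compete for limiting antigen;
        # only the highest-affinity cells survive, the rest die.
        population = sorted(population + mutants, reverse=True)[:pool_size]
        history.append(sum(population) / pool_size)
    return history


if __name__ == "__main__":
    rng = random.Random(1)  # fixed seed so the run is repeatable
    for cycle, mean_affinity in enumerate(affinity_maturation([0.0] * 20, 10, rng)):
        print(cycle, round(mean_affinity, 2))
```

Even though most simulated mutations are harmful, the mean affinity of the surviving pool never falls from one cycle to the next, because selection discards the low-affinity variants.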

A breakthrough in understanding the molecular mechanism of somatic 
hypermutation came with the identification of an enzyme that is required for the process. 
It is called activation-induced deaminase (AID) because it is expressed 
specifically in activated B cells and deaminates cytosine (C) to uracil (U) during 
transcription of V-region coding DNA. The deamination produces U:G mismatches in 
the DNA double helix, and the repair of these mismatches produces various types 
of mutations, depending on the repair pathway used. Somatic hypermutation 
affects only actively transcribed DNA, because AID works only on single-stranded 
DNA (which is transiently exposed during transcription) and because proteins 
involved in the transcription of V-region coding sequences are required to recruit 
the AID enzyme. AID is also required for activated B cells to switch from IgM and 
IgD production to the production of the other classes of Ig, as we now discuss. 
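
The chemical change that AID introduces can be pictured as a simple string operation. The Python sketch below is a minimal illustration, assuming a short made-up sequence and hypothetical function and variable names: it converts selected cytosines on one DNA strand to uracil and reports the resulting U:G mismatches with the unchanged complementary strand.

```python
# Watson-Crick partners for the four DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def deaminate(strand, positions):
    """Convert the cytosines at the given positions of one strand to uracil.

    Returns the modified strand and a list of (position, mismatch) pairs.
    """
    bases = list(strand)
    mismatches = []
    for pos in positions:
        if bases[pos] != "C":
            raise ValueError(f"position {pos} is {bases[pos]}, not C")
        bases[pos] = "U"
        # The partner base on the other strand is still the G that paired
        # with the original C, so the result is a U:G mismatch.
        mismatches.append((pos, "U:" + COMPLEMENT["C"]))
    return "".join(bases), mismatches


if __name__ == "__main__":
    print(deaminate("ATCGC", [2]))
```

How each mismatch is then resolved depends on the repair pathway, which this sketch does not attempt to model.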

B Cells Can Switch the Class of Ig They Make 

After a developing B cell leaves the bone marrow, before it interacts with antigen, it 
expresses both IgM and IgD BCRs on its surface, both with the same 
antigen-binding sites (see Figure 24-24). Stimulation by antigen and helper T cells activates 
many of these mature naïve B cells to become IgM-secreting effector cells, so that 
IgM antibodies dominate the primary antibody response. Later in the immune 
response, however, when activated B cells are undergoing somatic 
hypermutation, the combination of antigen and helper-T-cell-derived cytokines (discussed 
later) stimulates many of the B cells to switch from making membrane-bound 
IgM and IgD to making IgG, IgE, or IgA, in the process of class switching. Some 
of these cells become memory cells that express the corresponding class of Ig 
as BCRs on their surface, while others become effector cells that secrete the Ig 
molecules as antibodies. The IgG, IgE, and IgA molecules retain their original 
antigen-binding site and are collectively referred to as secondary classes of Igs, 
because they are produced only after antigen stimulation, dominate secondary 
antibody responses, and make up the secondary Ig repertoire. 

As discussed earlier, the constant region of an Ig heavy chain determines 
the class of the Ig. Thus, the ability of B cells to switch the class of antibody they 
make without changing the antigen-binding site implies that the same assembled 
VH-region coding sequence (which specifies the antigen-binding part of the 
heavy chain) can sequentially associate with different CH-coding sequences. This 
has important functional implications. It means that, in an individual animal, a 
particular antigen-binding site that has been selected by environmental antigens 
can be distributed among the various classes of antibodies, thereby acquiring the 
different biological properties of each class. 

When a B cell switches from making IgM and IgD to one of the secondary classes 
of Ig, an irreversible change occurs in the DNA, a process called class switch 
recombination. It entails the deletion of all the CH-coding sequences between 
the assembled VDJ-coding sequence and the particular CH-coding sequence 
that the cell is destined to express. Class switch recombination differs from V(D)J 
recombination in several ways. (1) It happens after antigen stimulation, mainly in 
germinal centers, and depends on helper T cells. (2) It uses different recombination 
signal sequences, called switch sequences, which flank the different CH-coding 
segments. (3) It involves cutting and joining the switch sequences, which are 
noncoding sequences, and leaves the assembled VH-region coding sequence 
unchanged (Figure 24-30). (4) Most importantly, the molecular mechanism is 
different. It depends on AID, which is also involved in somatic hypermutation, 
rather than on the V(D)J recombinase. The cytokines that activate class switching 
induce the production of transcription regulators that activate transcription from 
the relevant switch sequences, allowing the recruitment of AID to these sites. 

Once bound, AID initiates switch recombination by deaminating some cytosines 
to uracil in the vicinity of these switch sequences. Excision of these uracils is 
thought to lead to double-strand breaks in the participating switch regions, which 
are then joined by a form of nonhomologous end joining (discussed in Chapter 5). 
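
The deletion step of class switch recombination can be pictured as removing list elements. The following is a minimal sketch, not a model of the enzymology: the segment names, their simplified order, and the function class_switch are all assumptions made for illustration (real heavy-chain loci contain several Cγ and Cα subclasses).

```python
# Simplified, illustrative order of heavy-chain constant-region coding
# sequences downstream of the assembled VDJ sequence (an assumption of
# this sketch, not the full gene map).
LOCUS = ["VDJ", "Cmu", "Cdelta", "Cgamma", "Cepsilon", "Calpha"]


def class_switch(locus, target):
    """Return the locus after deleting every C segment between VDJ and target."""
    if target == "Cdelta":
        # In this model no switch sequence lies upstream of Cdelta.
        raise ValueError("no switch sequence precedes Cdelta in this model")
    if target not in locus[1:]:
        # A segment that was already deleted cannot be selected again.
        raise ValueError(f"{target} is not available in this locus")
    i = locus.index(target)
    # The intervening DNA is removed; the VDJ sequence itself is unchanged.
    return [locus[0]] + locus[i:]


switched = class_switch(LOCUS, "Cgamma")
print(switched)  # ['VDJ', 'Cgamma', 'Cepsilon', 'Calpha']
```

Because the intervening segments are removed from the list, a later call such as class_switch(switched, "Cmu") raises an error, which mirrors the irreversibility of the DNA change described above.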

Thus, whereas the primary Ig repertoire in humans (and mice) is generated by 
V(D)J joining mediated by V(D)J recombinase, the secondary antibody repertoire 
is generated by somatic hypermutation and class switch recombination, both of 
which are mediated by AID. Figure 24-31 lists the main mechanisms that we have 
discussed in this chapter that diversify Igs. 

Summary 

Each B cell clone makes Ig molecules with a unique antigen-binding site. Initially, 
the Ig molecules are inserted into the plasma membrane and serve as B cell receptors 
(BCRs) for antigen. Antigen binding to the BCRs, together with co-stimulatory 
signals from helper T cells, activates the B cells to proliferate and differentiate into 
either memory cells or antibody-secreting effector cells. The effector cells secrete large 
amounts of antibodies with the same antigen-binding site as the BCRs. 

A typical Ig molecule is composed of four polypeptide chains: two identical 
heavy chains and two identical light chains. Parts of both the heavy and light chains 
form the two identical antigen-binding sites. There are multiple classes of Ig (IgA, 
IgD, IgE, IgG, and IgM), each with a distinctive heavy chain, which determines the 


Figure 24-30 An example of the DNA 
rearrangement that occurs in class 
switch recombination. A B cell making 
IgM molecules with a V region encoded 
by a particular assembled VDJ DNA 
sequence is stimulated to switch to 
making IgA molecules with the same 
V region. In the process, it deletes the 
DNA between the VDJ sequence and 
the Cα-coding sequence. Specific DNA 
sequences (switch sequences) located 
upstream of each CH-coding sequence 
(except Cδ, as B cells don't switch from 
Cμ to Cδ) can recombine with one another, 
with the deletion of the intervening DNA, 
as shown here. As discussed in the text, 
the recombination process depends on 
AID, the same enzyme that is involved in 
somatic hypermutation. When switching 
from IgM to IgG or IgE, the C-region coding 
sequences downstream of Cγ or Cε, which 
remain after the DNA deletion, are removed 
during RNA splicing. 

Figure 24-31 The main mechanisms of 
Ig diversification in mice and humans. 

Those shaded in green occur during B cell 
development in the bone marrow, whereas 
the two mechanisms shaded in red occur 
when B cells are stimulated by foreign 
antigen and helper T cells in germinal 
centers in peripheral lymphoid organs, 
either late in a primary response or in a 
secondary response. 

biological properties of the Ig class. Each light and heavy chain is composed of a 
number of Ig domains. The amino acid sequence variation in the variable domains 
of both light and heavy chains is concentrated in several small hypervariable 
regions, which form loops at one end of these domains to produce the 
antigen-binding site. 

Igs are encoded by loci on three different chromosomes, each of which is responsible 
for producing a different polypeptide chain: a κ light chain, a λ light chain, 
or a heavy chain. Each locus contains separate gene segments that encode different 
parts of the variable region of the particular Ig chain. Each light-chain locus contains 
one or more constant- (C-) region coding sequences and sets of variable (V) 
and joining (J) gene segments. The heavy-chain locus contains sets of C-region coding 
sequences and sets of V, diversity (D), and J gene segments. 

During B cell development in the bone marrow, before antigen stimulation, separate 
gene segments are brought together by site-specific recombination that depends 
on a V(D)J recombinase. A VL gene segment recombines with a JL gene segment to 
produce a DNA sequence coding for the V region of a light chain, and a VH gene segment 
recombines with a D and a JH gene segment to produce a DNA sequence coding 
for the V region of a heavy chain. Each of the newly assembled V-region coding 
sequences is then co-transcribed with the appropriate C-region sequence to produce 
an RNA molecule that codes for the complete Ig polypeptide chain. 

By randomly combining inherited gene segments that code for the variable 
regions during B cell development, humans can make hundreds of different light 
chains and thousands of different heavy chains. Because the antigen-binding site 
is formed where the hypervariable loops of the VL and VH domains come together 
in the final Ig molecule, the heavy and light chains can potentially pair to form 
Igs with millions of different antigen-binding sites. A loss or gain of nucleotides at 
the site of gene-segment joining increases this number enormously. The Igs made 
by such V(D)J recombination before antigen stimulation are IgMs and IgDs with low 
affinity for binding antigen, and they constitute the primary Ig repertoire. 
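
The arithmetic behind "hundreds", "thousands", and "millions" can be sketched as follows. The segment counts passed in below are round, illustrative placeholders and are not taken from this text; the function name combinatorial_diversity is likewise hypothetical. The sketch counts segment combinations only and ignores the additional diversity from nucleotide loss or gain at the joints.

```python
def combinatorial_diversity(v_h, d_h, j_h, v_kappa, j_kappa, v_lambda, j_lambda):
    """Count V-region combinations from gene-segment numbers alone."""
    heavy = v_h * d_h * j_h                              # one V, one D, one J per heavy chain
    light = v_kappa * j_kappa + v_lambda * j_lambda      # kappa or lambda light chain
    return heavy, light, heavy * light                   # any heavy can pair with any light


# Illustrative placeholder counts (assumptions, not measured values).
heavy, light, sites = combinatorial_diversity(40, 25, 6, 40, 5, 30, 4)
print(heavy, light, sites)  # 6000 320 1920000
```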

Igs are further diversified following antigen stimulation in peripheral lymphoid 
organs by the AID- and helper-T-cell-dependent processes of somatic hypermutation 
and class switch recombination, which together produce the high-affinity IgG, 
IgE, and IgA Igs that constitute the secondary Ig repertoire. The process of class 
switching allows the same antigen-binding site to be incorporated into antibodies 
that have different tails and therefore different biological properties. 

T CELLS AND MHC PROTEINS 

Like antibody responses, T-cell-mediated immune responses are exquisitely 
antigen-specific, and they are at least as important as antibodies in defending 
vertebrates against infection. Indeed, most adaptive immune responses, including 
most antibody responses, require helper T cells for their initiation. Most 
importantly, unlike B cells, T cells can help eliminate pathogens that have entered the 
interior of host cells, where they are invisible to B cells and antibodies. Much of 
the rest of this chapter is concerned with how T cells accomplish this feat. 

T cell responses differ from B cell responses in at least two crucial ways. First, 
a T cell is activated by foreign antigen to proliferate and differentiate into effector 
cells only when the antigen is displayed on the surface of an antigen-presenting 
cell (APC), usually a dendritic cell in a peripheral lymphoid organ. One reason 
T cells require APCs for activation is that the form of antigen they recognize is 
different from that recognized by the Igs produced by B cells. Whereas Igs can 
recognize antigenic determinants on the surface of pathogens and soluble folded 
proteins, for example, T cells can only recognize fragments of protein antigens 
that have been produced by partial proteolysis inside a host cell. As mentioned 
earlier, newly formed MHC proteins capture these peptide fragments and carry 
them to the surface of the host cell, where T cells can recognize them. 

The second difference is that, once activated, effector T cells act mainly at short 
range, either within a secondary lymphoid organ or after they have migrated to a 
site of infection. Effector B cells, by contrast, secrete antibodies that can act far 
away. Effector T cells interact directly with another host cell in the body, which 
they either kill (if it is an infected host cell, for example) or signal in some way (if 
it is a B cell or macrophage, for example). We will refer to such host cells as target 
cells. As is the case with APCs, target cells must display an antigen bound to an 
MHC protein on their surface for a T cell to recognize them. 

There are three main classes of T cells—cytotoxic T cells, helper T cells, and 
regulatory T cells. When activated, they function as effector cells (see Figure 
24-17), each with their own distinct activities. Effector cytotoxic T cells directly kill 
cells that are infected with a virus or some other intracellular pathogen. Effector 
helper T cells help stimulate the responses of other immune cells, mainly macrophages, 
dendritic cells, B cells, and cytotoxic T cells; as we will see, there are a 
variety of functionally distinct subtypes of helper T cells. Effector regulatory T cells 
suppress the activity of other immune cells. 

In this section, we describe these classes and subclasses of T cells and their 
respective functions. We discuss how they recognize foreign antigens on the surface 
of APCs or target cells and the crucial part played by MHC proteins in the recognition 
process. We begin by considering the cell-surface receptors that T cells 
use to recognize antigen. 

T Cell Receptors (TCRs) Are Ig-like Heterodimers 

T cell receptors (TCRs), unlike Igs made by B cells, exist only in membrane-bound 
form. They are composed of two transmembrane, disulfide-linked polypeptide 
chains, each of which contains two Ig-like domains, one variable and one constant. 
On most T cells, the TCRs have one α chain and one β chain (Figure 24-32). 

The genetic loci that encode the α and β chains are located on different chromosomes. 
Like an Ig heavy-chain locus (see Figure 24-29), the TCR loci contain 
separate V, D, and J gene segments (or just V and J gene segments in the case 
of the α-chain locus), which are brought together by site-specific recombination 
during T cell development in the thymus. With one exception, T cells use the same 
mechanisms to generate antigen-binding site diversity of their TCRs as B cells 
use to generate antigen-binding site diversity of their Igs, and they use the same 
V(D)J recombinase; thus, humans or mice deficient in this recombinase cannot 
make functional B or T cells. The mechanism that does not operate in TCR 
diversification is antigen-driven somatic hypermutation. Thus, the affinities of 
TCRs tend to be low (Ka ~ 10^5 to 10^7 liters/mole). Various co-receptors and cell-cell 
adhesion proteins, however, greatly strengthen the binding of a T cell to an APC 
or target cell. 
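
To get a feel for these affinities, it can help to convert the association constant into a dissociation constant, which is simply its reciprocal. The snippet below is a small worked conversion under that standard definition; the function name dissociation_constant is a hypothetical helper introduced only for this example.

```python
def dissociation_constant(k_a_liters_per_mole):
    """K_d (moles/liter) is the reciprocal of K_a (liters/mole)."""
    return 1.0 / k_a_liters_per_mole


# The two ends of the TCR affinity range quoted in the text.
for k_a in (1e5, 1e7):
    k_d_micromolar = dissociation_constant(k_a) * 1e6  # moles/liter -> micromolar
    print(f"K_a = {k_a:.0e} L/mol -> K_d = {k_d_micromolar:g} uM")
```

On this scale, the quoted range corresponds to dissociation constants of roughly 10 micromolar down to 0.1 micromolar.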



Figure 24-32 A T cell receptor (TCR) 
heterodimer. (A) Schematic drawing 
showing that the receptor is composed of 
an α and a β polypeptide chain. Each chain 
has a large extracellular part that is folded 
into two Ig-like domains, one variable 
(V) and one constant (C). A Vα and a 
Vβ domain (shaded in blue) form the 
antigen-binding site. Unlike Igs, which 
have two binding sites for antigen, TCRs 
have only one. The αβ-heterodimer is 
noncovalently associated with a large set 
of invariant membrane-bound proteins (not 
shown), which help activate the T cell when 
the TCRs bind their specific antigen (see 
Figure 24-45B). A typical T cell has about 
30,000 TCRs on its surface. (B) The three-dimensional 
structure of the extracellular 
part of a TCR. The antigen-binding site 
is formed by the hypervariable loops of 
both the Vα and Vβ domains (black), and 
it is similar in its overall dimensions and 
geometry to the antigen-binding site of an 
Ig molecule. (B, based on K.C. Garcia et 
al., Science 274:209-219, 1996.) 

Instead of making α and β chains, a minority of T cells makes a different but 
related type of TCR heterodimer, composed of γ chains and δ chains. Although 
these γ/δ T cells normally make up only 5-10% of the T cells in human blood, 
they can be the dominant T cell population in epithelia (in the skin and gut, for 
example). They have some properties in common with natural killer (NK) cells 
and with an enlarging category of T-like cells that have features of both innate 
and adaptive immune cells, which are sometimes collectively referred to as innate 
lymphoid cells. The cells in all these categories tend to be enriched in mucosal 
tissues, respond early to infection, display little immunological memory, and, 
compared with B and T cells, have surface receptors of restricted diversity. We will not 
discuss them further. 

As with BCRs, TCRs are tightly associated in the plasma membrane with a 
number of invariant membrane-bound proteins that are involved in passing the 
signal from an antigen-activated receptor to the cell interior. We will discuss these 
proteins in more detail later, when we consider some of the molecular events 
involved in T and B cell activation. First, we consider the special ways in which T 
cells recognize foreign antigen on the surface of an APC or target cell. 

Activated Dendritic Cells Activate Naïve T Cells 

Generally, naïve T cells, including naïve helper and cytotoxic T cells, proliferate 
and differentiate into effector cells and memory cells only when they see their 
specific antigen on the surface of an activated dendritic cell in a peripheral lymphoid 
organ (Figure 24-33). The activated dendritic cell displays the antigen in 
a complex with MHC proteins on its surface, along with co-stimulatory proteins 
(see Figure 24-11). The memory T cells that develop, however, can be activated 
by the same antigen-MHC complex on the surface of other types of APCs (target 
cells), including macrophages and B cells—as well as by dendritic cells. 

Immature dendritic cells are located in most tissues (underlying epithelial 
layers of the skin and gut, for example), where they are constantly sampling and 
processing proteins in their environment. They become activated to mature when 
their pattern recognition receptors (PRRs) encounter pathogen-associated molecular 
patterns (PAMPs) on an invading pathogen or its products. The pathogen or 
products are ingested, and the microbial proteins are cleaved into peptide fragments, 
which are loaded onto MHC proteins, as we discuss later. The activated 
dendritic cells then migrate via the lymph from the site of infection to local lymph 
nodes or gut-associated lymphoid organs, where they present the foreign antigens, 
displayed as peptide-MHC complexes on the dendritic cell surface, for recognition 
by the relevant T cells (see Figure 24-11). 

Activated dendritic cells display three types of protein molecules on their surface 
that have a role in activating a T cell to become an effector cell or memory 
cell (Figure 24-34): (1) MHC proteins, which present foreign peptides to the TCRs; 
(2) co-stimulatory proteins, which bind to complementary receptors on the T cell 
surface; and (3) cell-cell adhesion molecules, which enable a T cell to bind to the 
dendritic cell for long enough to become activated, typically several hours. In 
addition, activated dendritic cells secrete a variety of cytokines that influence the 
type of effector helper T cell that develops, and different types of dendritic cells 
promote different outcomes (discussed later). 

T Cells Recognize Foreign Peptides Bound to MHC Proteins 

MHC proteins capture and display peptide fragments of foreign proteins for 
presentation to T cells. There are two main classes of MHC proteins, which are 
structurally and functionally distinct. Class I MHC proteins mainly present foreign 
peptides to cytotoxic T cells, whereas class II MHC proteins mainly present 
foreign peptides to helper and regulatory T cells (Figure 24-35). Some class-I-like 
MHC proteins present microbial lipid and glycolipid antigens to T cells, but they 
are not encoded within the MHC region of the genome, and we will not consider 
them further. 

Figure 24-33 Immunofluorescence 
micrograph of a dendritic cell in culture. 
These APCs derive their name from their 
long processes, or “dendrites.” The cell has 
been labeled with a monoclonal antibody 
that recognizes a surface antigen on these 
cells. (Courtesy of David Katz.) 

Figure 24-34 The three general types of 
proteins on the surface of an activated 
dendritic cell involved in activating a 
T cell. Although only membrane-bound 
co-stimulatory molecules are shown, 
activated dendritic cells also secrete 
soluble co-stimulatory molecules. The 
invariant polypeptide chains that are 
always stably associated with the 
TCR are not shown; they are illustrated 
in Figure 24-45B and Movie 24.7. 


Both class I and class II MHC proteins are heterodimers, in which two extracellular 
domains form a peptide-binding groove, which always has a variable small 
peptide bound in it. In class I MHC proteins, the two domains that form the 
peptide-binding groove are provided by the transmembrane α chain, which is 
noncovalently associated with a small subunit called β2-microglobulin; in class II MHC 
proteins, a different α chain and a large noncovalently associated β chain each 
contribute an extracellular domain to form the peptide-binding groove (Figure 
24-36). A TCR binds to both the peptide and the ridges of the binding groove. 
Humans have three major class I proteins, called HLA-A, HLA-B, and HLA-C, 
and three class II proteins, called HLA-DR, HLA-DP, and HLA-DQ (HLA stands 
for human-leukocyte-associated, as these proteins were first demonstrated on 
human leukocytes). Figure 24-37 shows how the genes that encode these proteins 
are arranged on human chromosome 6. 

There are important differences between the class I and class II MHC proteins 
with regard to the cell types that express them and the origin of the peptides 
in their peptide-binding grooves. Almost all of our nucleated cells express 
class I proteins. Their peptide-binding groove displays one of a diverse collection 
of peptides (typically 8-10 amino acids in length). In a healthy cell, the peptides 
originate from the cell's own cytosolic and nuclear proteins that have undergone 
partial degradation in proteasomes in the processes of normal protein turnover 
and quality control mechanisms. Some of the peptide fragments produced in this 
way are actively transported into the lumen of the endoplasmic reticulum (ER), 
through a specialized transporter in the ER membrane, where they are loaded 
onto newly synthesized class I MHC α chains; once a peptide binds, the α chain 
can assemble with its partner chain. The resulting self-peptide-MHC complex is 
then transported through the Golgi apparatus to the cell surface. Such complexes 
are not dangerous, however, because the cytotoxic T cells that could recognize 

Figure 24-35 Recognition by T cells 
of foreign peptides bound to MHC 
proteins. Cytotoxic T cells recognize 
foreign peptides in association with class 
I MHC proteins, whereas helper T cells 
and regulatory T cells recognize foreign 
peptides in association with class II 
MHC proteins. In both cases, the T cell 
recognizes the peptide-MHC complexes 
on the surface of an APC, either a 
dendritic cell or a target cell. Some 
regulatory T cells recognize self peptides 
in association with class II MHC proteins 
(not shown). 

Figure 24-36 Class I and class II MHC proteins. (A) The α chain of the class I molecule has three extracellular domains, α1, α2, and 
α3, each encoded by a separate exon. The α chain is noncovalently associated with a smaller polypeptide chain, β2-microglobulin, 
which is not encoded within the MHC region of the genome. The α3 domain and β2-microglobulin are Ig-like. While β2-microglobulin is 
invariant, the α chain is extremely polymorphic, mainly in the α1 and α2 domains. (B) In class II MHC proteins, both the α chain and the 
β chain are encoded within the MHC and are polymorphic, mainly in the α1 and β1 domains; the α2 and β2 domains are Ig-like. Thus, 
there are striking similarities between class I and class II MHC proteins. In both, the two outermost domains (shaded in blue) are 
polymorphic and interact to form a groove that binds peptide fragments. (C) The three-dimensional structure of the peptide-binding 
groove of a human class I MHC protein is viewed from above, with bound peptide shown schematically; a peptide must be bound in the 
groove for the MHC protein to assemble and be transported to the cell surface. The sides of the groove are formed by two α helices, 
and the floor is formed by a β pleated sheet. The S-S disulfide bond is shown in red (Movie 24.8 and Movie 24.9). (C, adapted from 
P.J. Bjorkman et al., Nature 329:506-512, 1987. With permission from Macmillan Publishers Ltd.) 


them have been either eliminated or inactivated, or suppressed by regulatory T 
cells in the process of self-tolerance. By contrast, in a cell infected by a pathogen 
such as a virus, the pathogen proteins will be processed in the same way, and 
peptides derived from them will be displayed on the infected cell surface bound 
to class I MHC proteins; there, they are recognized by cytotoxic T cells expressing 
the appropriate TCRs, thereby targeting the infected cell for destruction (Figure 
24-38). 

In general, only antigen-presenting cells (APCs) express class II MHC proteins. 
Dendritic cells are referred to as professional APCs, as they are specialized 
for this function and only they can activate naïve T cells. Other immune cells that 
are targets of effector T cell regulation, including B cells and macrophages, are 
nonprofessional APCs. All APCs load their newly synthesized class II MHC proteins 
with peptides derived mainly from extracellular proteins that are endocytosed 
and delivered to endosomes. The newly synthesized class II MHC proteins 
initially contain an invariant chain, which occupies the peptide-binding groove 

Figure 24-37 Human MHC genes. This 
simplified schematic drawing shows the 
location on human chromosome 6 of the 
genes that encode the 
transmembrane subunits of class I (light 
green) and class II (dark green) MHC 
proteins. The genes shown encode three 
types of class I MHC proteins (HLA-A, 
HLA-B, and HLA-C) and three types of 
class II MHC proteins (HLA-DP, HLA-DQ, 
and HLA-DR). An individual can therefore 
make six types of class I MHC proteins 
(three encoded by maternal genes and 
three by paternal genes) and more than six 
types of class II MHC proteins. Because 
of the extreme polymorphism of the MHC 
genes, the chances are very low that 
the maternal and paternal alleles will be 
the same. The number of class II MHC 
proteins that can be made is greater than 
six because there are two DR β genes 
and because maternally encoded and 
paternally encoded polypeptide chains can 
sometimes pair. The entire region shown 
spans about seven million base pairs and 
contains other genes that are not shown. 

and prevents it from prematurely binding a peptide until the class II MHC protein 
reaches specialized vesicles, which fuse with endosomes. Here, the invariant 
chain is removed and peptide fragments (typically 12-20 amino acids long) 
produced from endocytosed proteins can bind to the groove of the class II MHC 
proteins, which are then transported to the plasma membrane for display on the 
surface of the APC. In a healthy host cell, class II MHC protein grooves are loaded 
with self-peptides derived from normal proteins and will be ignored by T cells 
because of self-tolerance mechanisms. During an infection, however, pathogen 
proteins are also endocytosed and processed in the same way, enabling APCs to 
present pathogen peptides bound to class II MHC proteins to T cells expressing an 
appropriate TCR (Figure 24-39). 

The distinction just discussed between the antigen-processing pathways for 
loading peptides onto class I and class II MHC proteins is not absolute. Dendritic 
cells, for example, need to be able to activate cytotoxic T cells to kill virus-infected 
cells even when the virus does not infect dendritic cells themselves. To 
do so, specialized subsets of dendritic cells use a process called cross-presentation, 
which begins when these noninfected dendritic cells phagocytose virus-infected 
host cells or their fragments. The ingested viral proteins are then released 
by an unknown mechanism from phagolysosomes into the cytosol, where they 
are degraded in proteasomes; the resulting protein fragments are then transported 
into the ER lumen, where they load onto assembling class I MHC proteins. 
Cross-presentation in dendritic cells is not confined to endocytosed pathogens 
and their products: it also operates to activate cytotoxic T cells against tumor antigens 
of cancer cells and the MHC proteins of foreign organ grafts. 

Figure 24-38 The processing of an extracellular foreign protein for presentation to cytotoxic T cells. An effector cytotoxic T cell kills a virus-infected 
cell when it recognizes fragments of an internal viral protein bound to class I MHC proteins on the surface of the infected cell. Not all viruses 
enter the cell in the way that this enveloped RNA virus does, but fragments of internal viral proteins always follow the pathway shown. Only a small 
proportion of the viral proteins synthesized in the cytosol are degraded and transported to the cell surface, but this is sufficient to attract an attack 
by a cytotoxic T cell. Several chaperone proteins in the ER lumen aid the folding and assembly of class I MHC proteins (not shown). The assembly 
of class I MHC proteins and their transport to the cell surface require the binding of either a self or foreign peptide (Movie 24.10). 

Figure 24-39 The processing of an extracellular protein antigen for presentation to 
a helper T cell. This simplified depiction shows how peptide-class-II-MHC complexes 
are formed in endosomes and delivered via vesicles to the cell surface. Viral envelope 
glycoproteins can also be processed by this pathway for presentation to helper T cells 
(not shown): these glycoproteins are normally made in the ER and transported via the 
Golgi for insertion into the plasma membrane; although most of these glycoproteins will 
be incorporated into the envelope of budding viral particles, some will be endocytosed 
and enter endosomes, from where they can enter the class II MHC processing pathway. 


During an infection, only a small fraction of the many thousands of MHC proteins 
on the surface of an APC or target cell will have pathogen peptides bound 
to them. This is sufficient, however: fewer than 50 copies of such a peptide-MHC 
complex on a dendritic cell, for example, can activate a helper T cell that has a 
TCR that binds the complex with a high-enough affinity. Table 24-3 compares the 
properties of class I and class II MHC proteins. 

MHC Proteins Are the Most Polymorphic Human Proteins Known 

Although any individual can make only a small number of different class I and class 
II MHC proteins, together, these proteins must be able to present peptide fragments 
from almost any foreign protein to T cells. Thus, unlike the antigen-binding 
site of an Ig protein, the peptide-binding groove of each MHC protein must be 
able to bind a very large number of different peptides. The genes encoding class 
I and class II MHC proteins (see Figure 24-37) are the most polymorphic known 
in higher vertebrates: in the human population, for example, there are more than 
2000 allelic variants of these genes. The corresponding variations in the MHC proteins 
are concentrated in the floor and walls of the peptide-binding grooves and 
allow MHC molecules in different individuals to bind different arrays of peptides. 

It is thought that infectious diseases have been an important driving force for 
generating this remarkable MHC polymorphism. In the evolutionary war between 
pathogens and the adaptive immune system, pathogens will tend to change their 
proteins through mutation so that the peptides derived from them will not fit 
in the MHC peptide-binding grooves. When a pathogen succeeds, it can sweep 
through a population as an epidemic. In such circumstances, the few individuals 
who produce a new allelic form of MHC protein that can bind peptides derived 
from the altered pathogen will have a large selective advantage. This type of 
selection will tend to promote and maintain a large diversity of MHC proteins 
in the population. In West Africa, for example, individuals with a specific MHC 
allele (HLA-B53) have a reduced susceptibility to a severe form of malaria that is 
endemic there; although this allele is rare elsewhere, it is found in 25% of the West 
African population. 

The extensive diversity of human MHC proteins is the main reason that individuals 
who receive a foreign organ transplant must be treated with strong immunosuppressive 
drugs to prevent the immunological rejection of the grafted organ. 
Of all the foreign proteins that the graft expresses, the MHC proteins are by far the 
most powerful stimulators of the recipient's T cells, which would rapidly destroy 
the graft if they were not prevented from doing so by such drugs. Foreign MHC 
proteins are powerful T cell stimulants because T cells respond to them in the 
same way they respond to self MHC proteins that have foreign peptides bound 
to them; for this reason, the proportion of a person's T cells that can specifically 
recognize any foreign MHC protein is relatively high. 

CD4 and CD8 Co-receptors on T Cells Bind to Invariant Parts of 
MHC Proteins 

The affinity of TCRs for peptide-MHC complexes on an APC is usually too low by 
itself to mediate a functional interaction between the two cells. T cells normally 
require accessory receptors to help stabilize the interaction by increasing the overall 
strength of the cell-cell adhesion. Unlike TCRs or MHC proteins, the accessory 
receptors are invariant and do not bind to foreign peptides. Once bound to the 
surface of a dendritic cell, for example, a T cell increases the strength of the binding 
by activating an integrin adhesion protein (discussed in Chapter 19), which 
then binds more strongly to an Ig-like protein on the surface of the dendritic 
cell. This increased adhesion enables the T cell to remain bound long enough to 
become activated. 

When an accessory receptor has a direct role in activating the T cell by generating
its own intracellular signals, it is called a co-receptor. The most important
and best understood of the co-receptors on T cells are the CD4 and CD8 proteins,
both of which are single-pass transmembrane proteins with extracellular Ig-like
domains. Like TCRs, they recognize MHC proteins, but, unlike TCRs, they bind
to invariant parts of the MHC protein, far away from the peptide-binding groove.
CD4 is expressed on both helper T cells and regulatory T cells and binds to class II
MHC proteins, whereas CD8 is expressed on cytotoxic T cells and binds to class I
MHC proteins (Figure 24-40).

CD4 and CD8 contribute to T cell recognition by helping the T cell to focus on
particular MHC proteins, and thereby on particular types of target cells. Thus, the
recognition of class I MHC proteins by CD8 allows cytotoxic T cells to focus on
any type of infected host cell, while the recognition of class II MHC proteins by
CD4 allows helper and regulatory T cells to focus on the target immune cells that
they help or suppress, respectively. The cytoplasmic tail of the CD4 and CD8 proteins
is associated with a member of the Src family of cytoplasmic tyrosine kinases
(discussed in Chapter 15) called Lck, which phosphorylates various intracellular
proteins on tyrosines and thereby participates in the activation of the T cell
(discussed later).

Figure 24-40 CD4 and CD8 co-receptors on the surface of T cells. Cytotoxic T cells
(Tc) express CD8, which recognizes class I MHC proteins, whereas helper T cells
(Th) and regulatory T cells (not shown) express CD4, which recognizes class II
MHC proteins. Note that the co-receptors bind to the same MHC protein that the
TCR has engaged, so that they are brought together with TCRs during the
antigen-recognition process. Whereas the TCR binds to the variable (polymorphic) parts
of the MHC protein that form the peptide-binding groove, the co-receptor binds
to the invariant part, well away from the binding groove.

The AIDS virus (HIV) uses CD4 molecules (as well as chemokine receptors) to
enter helper T cells (see Figure 23-17). AIDS patients are susceptible to infection
by microbes that are not normally dangerous because HIV depletes helper T cells.
As a result, most AIDS patients die of infection within several years of the onset of
symptoms, unless they are treated with a combination of anti-HIV drugs. HIV also
uses CD4 and chemokine receptors to enter macrophages, which have both
types of receptors on their surface.

Developing Thymocytes Undergo Negative and Positive Selection

T cell development begins when bone-marrow-derived lymphoid progenitor
cells enter the thymus from the bloodstream. There, the cells receive a variety of
signals from thymus stromal cells, epithelial cells, macrophages, and dendritic
cells, which promote their stepwise development into mature thymocytes. At one
step, the progenitor cells are induced to express V(D)J recombinase and begin to
rearrange their TCR gene segments. Soon thereafter, the cells express both CD4
and CD8 co-receptors, and these so-called double-positive thymocytes migrate
inward and interact with thymus dendritic cells or epithelial cells expressing self
peptides bound to class I and class II MHC proteins. If the TCR on the thymocyte
binds with high affinity to these complexes, a strong signal will be transmitted,
causing the cell to undergo apoptosis. This process, called negative selection, is
an example of clonal deletion (see Figure 24-21), and it eliminates thymocytes
that could potentially attack normal host cells and tissues and thereby cause an
autoimmune disease if the cells were to continue to mature and leave the thymus.

If its TCR is unable to bind at all to a self-peptide-MHC complex in the thymus,
the thymocyte will fail to receive the signals it needs to survive and will die
of "neglect." Without the ability to recognize self-MHC proteins, a T cell would
generally be of no use, as T cells can only see pathogen-derived peptides in the
context of self-MHC proteins. Thymocytes that express a TCR that binds with an
appropriate affinity to a self peptide bound to either a class I MHC protein (using
CD8 as a co-receptor) or a class II MHC protein (using CD4 as a co-receptor) will
receive an optimal signal to survive and continue to mature, a process called
positive selection (Figure 24-41). As part of this maturation process, and depending
on the TCR's preference for class I or class II MHC proteins, the CD4 or CD8
co-receptor that is not needed is silenced by DNA methylation of the respective gene;
this results in the development of CD4 or CD8 single-positive thymocytes, which
exit the thymus as naïve T cells and enter the recirculating pool of T cells: the CD4
cells as either helper or regulatory T cells and the CD8 cells as cytotoxic T cells.
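
The selection rules just described can be caricatured as a pair of affinity thresholds. The following is only a minimal sketch, not a quantitative model: the names `thymocyte_fate` and `retained_coreceptor`, the arbitrary 0-to-1 affinity scale, and both threshold values are hypothetical choices made for illustration. Treating "unable to bind at all" as "below a low threshold" is likewise a simplifying assumption.

```python
from enum import Enum


class Fate(Enum):
    DEATH_BY_NEGLECT = "no survival signal; dies of neglect"
    POSITIVE_SELECTION = "optimal signal; survives and matures"
    NEGATIVE_SELECTION = "strong signal; undergoes apoptosis"


# Hypothetical thresholds on an arbitrary 0-to-1 affinity scale.
LOW_THRESHOLD = 0.2
HIGH_THRESHOLD = 0.8


def thymocyte_fate(affinity: float) -> Fate:
    """Fate of a double-positive thymocyte, given the affinity of its TCR
    for self-peptide-MHC complexes displayed in the thymus."""
    if not 0.0 <= affinity <= 1.0:
        raise ValueError("affinity must lie between 0 and 1")
    if affinity < LOW_THRESHOLD:
        return Fate.DEATH_BY_NEGLECT
    if affinity >= HIGH_THRESHOLD:
        return Fate.NEGATIVE_SELECTION
    return Fate.POSITIVE_SELECTION


def retained_coreceptor(mhc_class: str) -> str:
    """Co-receptor kept by a positively selected thymocyte; the gene for
    the other co-receptor is silenced."""
    return {"I": "CD8", "II": "CD4"}[mhc_class]


if __name__ == "__main__":
    for affinity in (0.05, 0.5, 0.95):
        print(affinity, thymocyte_fate(affinity).value)
    print("class I preference keeps", retained_coreceptor("I"))
```

Only the middle band of affinities yields a mature T cell, which is the point of combining the two selection steps.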

Although naïve helper and cytotoxic T cells constantly receive survival signals
in the form of self peptides bound to MHC proteins that the T cells bind weakly,
a T cell is only activated to proliferate and mount an immune response if its TCR
binds with high affinity to a peptide-MHC complex and receives co-stimulatory
signals at the same time. Generally, this happens only when the T cell encounters
an activated dendritic cell (in a peripheral lymphoid organ) that expresses
an MHC protein with a foreign peptide derived from a pathogen in its binding
groove. Only then will the naïve T cell proliferate and differentiate into an effector
or memory T cell.

Negative selection in the thymus is a major mechanism for ensuring that
peripheral T cells do not react with host cells expressing MHC proteins with peptides
derived from self proteins in their peptide-binding grooves. This mechanism,
however, requires that the APCs in the thymus display an array of peptides
on their MHC molecules that will reflect the self proteins in peripheral tissues, as
well as in the thymus. The thymus, however, would not be expected to produce
many of the proteins that are specifically expressed in other organs. As an example,
it would not be expected to produce insulin, and yet it is crucial to delete thymocytes
with TCRs that could recognize insulin-derived peptides bound to MHC
proteins on the surface of insulin-secreting β cells in the pancreas. Any failure to
do so would result in the T-cell-dependent destruction of the β cells and, as a
consequence, cause type 1 (or juvenile) diabetes.

Figure 24-41 Positive and negative selection in the thymus. Developing
thymocytes with TCRs that would potentially enable them to respond to
peptides in association with self MHC proteins after they leave the thymus are
positively selected: the binding of their TCRs to self peptides bound to self
MHC proteins in the thymus signals such cells to survive, mature, and migrate to
peripheral lymphoid organs. All of the other thymocytes undergo apoptosis, either
because they do not express TCRs that recognize self MHC proteins with self
peptides bound or because they recognize such complexes too well and undergo
negative selection.
The regulatory T cells (Treg cells) that are positively selected in the thymus are
called natural Treg cells to distinguish them from induced Treg cells, which develop
in peripheral lymphoid organs from naïve helper T cells (Th cells), as we discuss
shortly.

The mechanism that enables the deletion of all such cells in the thymus
depends on a subpopulation of epithelial cells in the thymus that express a
transcriptional regulator called AIRE (autoimmune regulator). By a poorly understood
mechanism, the AIRE protein promotes the production of small amounts
of mRNA from many genes that encode such "organ-specific" proteins, including
the insulin gene. When the peptides derived from the proteins encoded by these
genes are bound by MHC proteins and displayed on the surface of the epithelial
cells in the thymus medulla, this is sufficient to provoke the deletion of the potentially
self-reactive thymocytes. Mutations that inactivate the AIRE gene cause a
severe multiorgan autoimmune disease in both mice and humans, indicating the
importance of AIRE in self-tolerance.

Cytotoxic T Cells Induce Infected Target Cells to Kill Themselves 

Cytotoxic T cells (Tc cells), like the NK cells discussed earlier, protect us against
intracellular pathogens, including viruses, bacteria, and parasites, that multiply
in the cytoplasm of a host cell. Tc cells kill infected host cells before the pathogen
can escape to infect neighboring host cells. Before it can kill, however, a naïve
Tc cell has to become an effector cell by activation on an APC, usually an activated
dendritic cell that has pathogen-derived peptides bound to class I MHC
proteins, a process that depends on helper T cells. The effector Tc cell can then
recognize any target cell harboring the same pathogen and expressing some of
the same peptide-MHC complexes on its surface: its TCRs cluster, along with CD8
co-receptors, adhesion molecules, and intracellular signaling proteins (discussed
later), at the interface between the two cells, forming an immunological synapse.
In this process, the effector Tc cell reorganizes its cytoskeleton to focus its killing
apparatus on the target cell, secreting its toxic proteins into a confined space
(Figure 24-42); in this way, it avoids killing neighboring cells. A similar synapse
forms when an effector helper T cell interacts with its target cell, except that the
co-receptor is CD4 (Movie 24.11).

Figure 24-42 Effector cytotoxic T cells killing target cells in culture. (A) Electron micrograph showing an effector
cytotoxic T cell (Tc cell) binding to a target cell. The Tc cells were obtained from mice immunized with the target cells, which
are foreign tumor cells. (B) Electron micrograph showing a Tc cell and a tumor cell that the Tc cell has killed. In an animal,
as opposed to a culture dish, the killed target cell would be phagocytosed by neighboring cells (especially macrophages)
long before it disintegrated in the way that it has here. (C) Immunofluorescence micrograph of a Tc cell and tumor cell after
immunofluorescence staining with anti-tubulin antibodies. Note that the centrosome in the Tc cell is located at the point of
cell-cell contact with the target cell, an immunological synapse. The secretory granules (not visible) in the Tc cell are initially
transported along microtubules to the centrosome, which then moves to the synapse, delivering the granules to where they can
release their contents. (A and B, from D. Zagury et al., Eur. J. Immunol. 5:818-822, 1975. With permission from John Wiley &
Sons, Inc; C, from B. Geiger, D. Rosen and G. Berke, J. Cell Biol. 95:137-143, 1982. With permission from the authors.)


An effector Tc cell (or an NK cell) can employ one of two strategies to kill the
target, both of which operate by inducing the target cell to activate caspases and
kill itself by undergoing apoptosis. One mechanism uses a protein called Fas
ligand on the killer-cell surface, which binds to a transmembrane receptor protein
called Fas on the target cell; this mechanism is discussed in Chapter 18 (see
Figure 18-5). The other mechanism is the main one used by both NK cells and
Tc cells to kill an infected target cell. The killer cell stores various toxic proteins
within secretory vesicles in its cytoplasm that it releases into the synaptic space
by exocytosis. The toxic proteins include perforin and proteases called granzymes.
The perforin is homologous to complement component C9 and polymerizes in
the target-cell plasma membrane (see Figure 24-8), forming a transmembrane
pore that disrupts the membrane and allows the granzymes to enter the target
cell. Once in the cytosol, the granzymes help activate caspases, thereby inducing
apoptosis (Figure 24-43).





Figure 24-43 The main way that an effector Tc cell (or NK cell) kills an
infected target cell. This simplified drawing shows how the killer cell releases
perforin and granzymes onto the surface of an infected target cell by localized
exocytosis at an immunological synapse. The high concentration of Ca2+ in the
extracellular fluid causes the perforin to assemble into transmembrane channels in
the target-cell plasma membrane, allowing the granzymes to enter the target-cell
cytosol. The granzymes cleave and activate procaspases to initiate a caspase cascade,
leading to apoptosis (see Figure 18-3). A single cytotoxic cell can kill multiple target
cells in sequence. It remains a mystery why the released perforins do not form pores in
the plasma membrane of the killer cell itself (Movie 24.12 and Movie 24.13).

Effector Helper T Cells Help Activate Other Cells of the Innate and
Adaptive Immune Systems

In contrast to Tc cells, helper T cells (Th cells) are crucial for defense against both
extracellular and intracellular pathogens, and they express CD4 rather than CD8
co-receptors and recognize foreign peptides bound to class II rather than class
I MHC proteins. Once naïve Th cells are induced on activated dendritic cells to
become effector cells, they can help activate other cells: they help activate B cells
to become antibody-secreting cells and later to undergo Ig class switching and
somatic hypermutation; they help activate macrophages to destroy any intracellular
pathogens multiplying within the macrophage's phagosomes; they help
induce naïve Tc cells to become effector cells that can kill infected target cells;
and they stimulate the activated dendritic cell that activated them to maintain the
dendritic cell in an activated state. In each case, the effector Th cell recognizes the
same complex of foreign peptide and class II MHC protein on the target-cell surface
that it initially recognized on the activated dendritic cell. As discussed later,
the Th cell stimulates the target cell both by secreting a variety of cytokines and by
displaying co-stimulatory proteins on its surface.

Naïve Helper T Cells Can Differentiate Into Different Types of
Effector T Cells

When activated by binding to a foreign peptide bound to a class II MHC protein
on an activated dendritic cell, a naïve Th cell can differentiate into several distinct
types of effector T cells, depending on the nature of the pathogen and the cytokines
they encounter. These cells include four subtypes of helper cells (Th1, Th2,
Tfh, and Th17 cells) and regulatory (suppressor) T cells. Figure 24-44 summarizes
both the cytokines that induce these effector T cells and some of the cytokines
the effector cells secrete, as well as the master transcription regulators that
control the effector cell's development.

Naïve Th cells activated by dendritic cells secreting the cytokine interleukin-12
(IL12) develop into Th1 cells. These effector cells produce interferon-γ (IFNγ),
which is critical for the activation of macrophages to destroy pathogens that either
invaded the macrophage or were ingested by it; the IFNγ can also induce B cells
to switch the class of Ig they are making. Naïve Th cells activated in the presence
of IL4 develop into Th2 cells. These effector cells are important for the control
of extracellular pathogens, including parasites. They stimulate B cells to undergo
somatic hypermutation and to switch the class of Ig they produce: for example,
the Th2 cells themselves produce IL4, which can induce B cells to switch from
making IgM and IgD to making IgE antibodies, which can bind to mast cells, as
discussed earlier. Naïve Th cells activated in the presence of IL6 and IL21 develop
into follicular helper T cells (Tfh), which are located in lymphoid follicles and
secrete a variety of cytokines, including IL4 and IL21; these cells are especially
important for stimulating B cells to undergo Ig class switching and somatic
hypermutation. Naïve Th cells activated in the presence of IL6 and TGFβ develop into
Th17 cells. These effector cells secrete IL17, which recruits neutrophils and stimulates
epithelial cells and fibroblasts in the skin and gut to produce pro-inflammatory
cytokines. Th17 cells are important in controlling extracellular bacterial
and fungal infections and in wound healing, but they can also have a major role in
autoimmune diseases and allergy.

Figure 24-44 Differentiation of naïve helper T cells into different types of
effector helper cells or regulatory T cells in a peripheral lymphoid organ.
The cytokines produced by the activating dendritic cell (and by other cells in the
environment) mainly determine which type of effector T cell develops, as indicated.
Some of the main cytokines produced by each type of effector cell are also shown,
and the master transcription regulator for each subset is indicated in the nucleus.
There is increasing evidence that some of the effector cells are plastic and can
change the cytokines they produce in response to changes in their environment
(not shown).

In some cases, naïve Th cells that encounter their antigen in a peripheral lymphoid
organ in the presence of TGFβ and the absence of IL6 develop into induced
regulatory T cells (Treg cells), which suppress rather than help immune cells; as
mentioned earlier, natural Treg cells develop in the thymus during thymocyte
development (see Figure 24-41). In either case, the Treg cells suppress the development,
activation, or function of most other types of immune cells, by means of
both secreted suppressive cytokines such as IL10 and TGFβ and inhibitory proteins
on the Treg cell surface. Induced Treg cells seem mainly to suppress immune
responses to foreign antigens, preventing responses to harmless ingested or
inhaled antigens and limiting responses against pathogens to avoid excessive
responses that cause unwanted pathology; natural Treg cells are needed to prevent
immune responses to self molecules (see Figure 24-21). Treg cells express the
transcription regulator FoxP3, which serves as both a marker of these cells and a
master controller of their development: if the gene encoding this protein is inactivated
in mice or humans, the individuals fail to produce Treg cells and develop a
fatal autoimmune disease involving multiple organs. These findings establish the
crucial importance of Treg cells in self-tolerance.
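
The cytokine conditions described in this section can be collected into a simple lookup. The sketch below only restates the text in tabular form; the identifiers (`SUBSET_BY_INDUCING_CYTOKINES`, `SECRETED_CYTOKINES`, `differentiate`) and the ASCII cytokine spellings (`TGFbeta`, `IFNgamma`) are hypothetical, and real differentiation depends on more than the presence or absence of a few cytokines.

```python
from typing import Iterable, Optional

# Inducing cytokines -> effector subset, as stated in the text.
# frozenset keys make the lookup independent of the order of cytokines.
SUBSET_BY_INDUCING_CYTOKINES = {
    frozenset({"IL12"}): "Th1",
    frozenset({"IL4"}): "Th2",
    frozenset({"IL6", "IL21"}): "Tfh",
    frozenset({"IL6", "TGFbeta"}): "Th17",
    frozenset({"TGFbeta"}): "induced Treg",  # TGFbeta in the absence of IL6
}

# Some of the cytokines each subset secretes, as stated in the text.
SECRETED_CYTOKINES = {
    "Th1": ["IFNgamma"],
    "Th2": ["IL4"],
    "Tfh": ["IL4", "IL21"],
    "Th17": ["IL17"],
    "induced Treg": ["IL10", "TGFbeta"],
}


def differentiate(cytokines: Iterable[str]) -> Optional[str]:
    """Return the subset induced by exactly this set of cytokines,
    or None if the combination is not one described in the text."""
    return SUBSET_BY_INDUCING_CYTOKINES.get(frozenset(cytokines))


if __name__ == "__main__":
    for signals in (["IL12"], ["TGFbeta", "IL6"], ["TGFbeta"]):
        subset = differentiate(signals)
        print(signals, "->", subset, "secreting", SECRETED_CYTOKINES[subset])
```

The last two rows show how the presence or absence of IL6 alongside TGFβ separates Th17 cells from induced Treg cells.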

Both T and B Cells Require Multiple Extracellular Signals
For Activation

Foreign antigen binding to BCRs or TCRs initiates the process whereby the T and
B cells are stimulated to proliferate and differentiate into effector or memory
cells. As mentioned earlier, these antigen receptors do not act on their own: they
are stably associated with invariant transmembrane polypeptide chains that are
required to relay the signal into the cell. In B cells, these are called Igα and Igβ
(Figure 24-45A), while in T cells they exist in a complex called CD3, composed
of four types of polypeptide chains (Figure 24-45B). In both cases, the associated
proteins help convert extracellular antigen binding to the TCR or BCR into
intracellular signals, and they do so in similar ways.

Antigen binding to BCRs or TCRs clusters these receptors and their associated
invariant chains (and CD4 or CD8 co-receptors in the case of TCRs). This clustering
activates a Src family cytoplasmic tyrosine kinase to phosphorylate tyrosines
on the cytoplasmic tails of some of the invariant chains. The phosphotyrosines
then serve as docking sites for a second cytoplasmic tyrosine kinase, which
becomes phosphorylated and activated by the first kinase; the second kinase then
relays the signal downstream by phosphorylating other intracellular signaling
proteins on tyrosines. Some of these early events in the signaling pathway activated
by BCRs are shown in Figure 24-46.

T CELLS AND MHC PROTEINS

Signaling through BCRs or TCRs and their associated proteins alone is not
sufficient to activate a lymphocyte to proliferate and differentiate. Extracellular
co-stimulatory signals produced by another cell are also required, and they are
provided by membrane-bound proteins (see Figure 24-34) and secreted cytokines.
Indeed, signaling through the BCR or TCR with insufficient co-stimulation
can either eliminate the lymphocyte (clonal deletion) or inactivate it, with both of
these mechanisms contributing to self-tolerance (see Figure 24-21). For a naïve T
cell, an activated dendritic cell provides the co-stimulatory signals; these include
the transmembrane B7 proteins, which are recognized by the co-receptor protein
CD28 on the surface of the T cell (Figure 24-47A). For a B cell, an effector Th
cell provides the co-stimulatory signals; these include the transmembrane CD40
ligand, which binds to CD40 receptors on the B cell (Figure 24-47B). The CD40
ligand on effector Th cells acts in two other situations: (1) it acts back on CD40
receptors on the dendritic cell surface to increase and sustain the activation of the
dendritic cell, creating a positive feedback loop; and (2) it acts as a co-stimulatory
signal on the surface of an effector Th1 cell, allowing the T cell to help activate an
infected macrophage to destroy the pathogens it harbors.
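
The requirement for more than one signal can be written as a small decision rule. This is a deliberately crude sketch: the function `lymphocyte_outcome`, the Boolean treatment of each signal, and the "no response" outcome for a cell that sees no antigen are illustrative assumptions, not statements from the text. The `COSTIMULATION` table simply restates which cell supplies which co-stimulatory protein.

```python
def lymphocyte_outcome(antigen_signal: bool, costimulation: bool) -> str:
    """Outcome for a naive lymphocyte, treating each signal as present or absent."""
    if antigen_signal and costimulation:
        return "activation: proliferate and differentiate"
    if antigen_signal:
        # Antigen receptor signaling with insufficient co-stimulation
        # contributes to self-tolerance.
        return "tolerance: clonal deletion or inactivation"
    # Assumption for this sketch: without antigen nothing is triggered.
    return "no response"


# Responding cell -> (cell providing co-stimulation, its protein, receptor on responder)
COSTIMULATION = {
    "naive T cell": ("activated dendritic cell", "B7", "CD28"),
    "B cell": ("effector Th cell", "CD40 ligand", "CD40"),
}


if __name__ == "__main__":
    for antigen in (True, False):
        for costim in (True, False):
            print(antigen, costim, "->", lymphocyte_outcome(antigen, costim))
    for responder, (provider, protein, receptor) in COSTIMULATION.items():
        print(f"{responder}: {protein} on {provider} binds {receptor}")
```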

In addition to receptors for co-stimulatory proteins, both B and T cells have
inhibitory proteins on their surface that help regulate the cell's activity, preventing
excessive or inappropriate responses. Two such proteins expressed by T cells have
attracted great attention because of their roles in suppressing the ability of T cells
to inhibit cancer progression: CTLA4 and PD1 proteins inhibit T cell activity in
different ways, and monoclonal antibodies against either or especially both can
relieve the inhibition and allow T cells to dramatically destroy the tumors in some
patients with metastatic cancer (see Figure 20-45).


Figure 24-45 The invariant chains associated with BCRs and TCRs.
(A) Each BCR is associated with two invariant heterodimers, each composed of
an Igα and an Igβ polypeptide chain linked by a disulfide bond (red). (B) Each TCR is
associated with an invariant CD3 complex composed of two disulfide-bonded ζ
chains, two ε chains, and one δ and one γ chain; these chains form homodimers or
heterodimers, as shown.


Figure 24-46 Early signaling events in a B cell activated by the binding of
specific foreign antigen to its BCRs. If the antigen is on the surface of a pathogen
or is a soluble macromolecule with two or more identical antigenic determinants
(as shown), it cross-links adjacent BCRs, causing them and their associated invariant
chains to cluster, as shown. A Src-like cytoplasmic tyrosine kinase (which can
be Fyn or Lyn) is associated with the cytosolic tail of Igβ; it joins the cluster
and phosphorylates both the Igα and Igβ invariant chains (for simplicity, only
the phosphorylation on Igβ is shown).
A transmembrane protein tyrosine phosphatase called CD45 is also required
to remove inactivating phosphates from these Src-like kinases (not shown). The
resulting phosphotyrosines on Igα and Igβ serve as docking sites for another Src-like
tyrosine kinase called Syk, which becomes phosphorylated and thereby activated to
relay the signal downstream.
The pathway from TCRs is similar (including a requirement for CD45), except
that the first Src-like kinase is Lck, which is associated with a CD4 or CD8 co-receptor
and phosphorylates tyrosines on all the CD3 polypeptide chains shown in Figure
24-45B; the second Src-like kinase is ZAP70, which is homologous to the Syk
kinase in B cells (Movie 24.14).

Figure 24-47 panel titles: (A) dendritic cell activates T cell; (B) helper T cell activates B cell.

Many Cell-Surface Proteins Belong to the Ig Superfamily

Most of the proteins that mediate antigen recognition and cell-cell recognition
in the immune system contain one or more Ig or Ig-like domains, suggesting that
the proteins have a common evolutionary history. Included in this very large Ig
superfamily are antibodies, TCRs, MHC proteins, the CD4, CD8, and CD28
co-receptors, the B7 co-stimulatory proteins, and most of the invariant polypeptide
chains associated with TCRs and BCRs, as well as the various Fc receptors on
lymphocytes and other leukocytes. Many of these proteins are dimers or higher
oligomers, in which Ig or Ig-like domains of one chain interact with those in another
(Figure 24-48).



Figure 24-48 labels: Fc receptor; class I MHC protein; class II MHC protein; TCR; BCR; generated by gene-segment rearrangement; CD3 complex.


Figure 24-47 Comparison of the co-stimulatory proteins required to activate
a helper T cell and a B cell in response to the same foreign protein. (A) A naïve
helper T cell is activated by a peptide fragment of a foreign protein bound to a
class II MHC protein on the surface of an activated dendritic cell. The co-stimulatory
protein on the dendritic cell (a B7 protein, either CD80 or CD86) binds to the CD28
co-receptor on the T cell, providing a necessary co-stimulatory signal to the
T cell; in addition, cytokines secreted by the dendritic cell (or other nearby cells)
influence what subtype of effector helper cell the T cell becomes (see Figure 24-44).
(B) Once activated to become an effector cell, the helper T cell can help activate
B cells that have the same peptide-MHC protein complexes on their surface as
the dendritic cell that activated the T cell. These B cells have BCRs that bind an
antigenic determinant on the surface of a folded foreign protein and endocytose
the protein (red arrow); the protein is then cleaved into peptides, which are carried to
the B cell surface by class II MHC proteins, where some of them can be recognized by
the TCRs on the helper T cell (see Figure 24-39). Note that the BCRs and TCRs
recognize different antigenic determinants of the protein. As indicated, the co-stimulatory
protein used by the effector helper T cell is CD40 ligand, which binds to the CD40
co-receptor on the B cell; the T cell also secretes cytokines such as IL4 to help
stimulate the B cell to undergo somatic hypermutation and class switching (not
shown). The CD4 co-receptor on Th cells is omitted in both (A) and (B) for simplicity.


Figure 24-48 Some of the cell-surface proteins discussed in this chapter that
belong to the Ig superfamily. The Ig and Ig-like domains are shaded in gray,
except for the antigen-binding domains, which are shaded in blue (not all of the
antigen-binding domains are Ig domains; the class I and class II MHC proteins are the
exception).
The Ig superfamily also includes many cell-surface proteins involved in cell-cell
interactions outside the immune system, such as the neural cell adhesion molecule
(N-CAM) discussed in Chapter 19 and the receptors for various protein growth factors
discussed in Chapter 15 (not shown). There are more than 750 members of the Ig
superfamily in humans.

























In both vertebrates and invertebrates, many proteins in the Ig superfamily are
also found outside immune systems, where they often function in cell-cell
recognition and adhesion processes, both during development and in adult tissues.
It seems likely that the entire gene superfamily evolved from a primordial gene
coding for a single Ig-like domain, similar to that encoding β2-microglobulin (see
Figure 24-36). In present-day family members, a separate exon usually encodes
the amino acids in each Ig-like domain, consistent with the likelihood that new
family members arose during evolution by exon and gene duplications.

Summary 

There are three main functionally distinct classes of T cells. Cytotoxic T cells (Tc
cells) directly kill infected cells by secreting perforins and granzymes that induce the
infected cells to undergo apoptosis. Helper T cells (Th cells) help activate cytotoxic
T cells to kill their target cells, B cells to make antibody responses, macrophages
to destroy the microorganisms they harbor, and dendritic cells to activate T cells.
Regulatory T cells (Treg cells) produce suppressive proteins (such as the cytokines
IL10 and TGFβ) to inhibit other immune cells.

All T cells express cell-surface antigen receptors (TCRs), which are encoded by
genes that are assembled from multiple gene segments during T cell development
in the thymus. TCRs recognize peptide fragments of foreign proteins that are
displayed in association with MHC proteins on the surface of antigen-presenting cells
(APCs) and target cells. Naïve T cells are activated in peripheral lymphoid organs
by activated dendritic cells, which secrete cytokines and express peptide-MHC
complexes, co-stimulatory proteins, and various cell-cell adhesion molecules on their
cell surface.

Class I MHC proteins present foreign peptides to Tc cells, whereas class II MHC
proteins present foreign peptides to Th cells and Treg cells. Whereas class I MHC
proteins are expressed on almost all nucleated vertebrate cells, class II MHC proteins
are normally restricted to APCs, including dendritic cells, macrophages, and
B lymphocytes. Both classes of MHC proteins have a single peptide-binding groove,
which binds a large set of small peptide fragments produced intracellularly by normal
protein-degradation processes: class I MHC proteins mainly bind fragments
produced in the cytosol, whereas class II MHC proteins mainly bind fragments
produced in endocytic compartments. The peptide-MHC complexes are transported
to the cell surface, where complexes that contain a peptide derived from a foreign
protein are recognized by TCRs, which interact with both the peptide and the walls
of the peptide-binding groove. T cells also express CD4 or CD8 co-receptors, which
recognize invariant regions of MHC proteins: Th cells and Treg cells express CD4,
which recognizes class II MHC proteins; Tc cells express CD8, which recognizes
class I MHC proteins.

A combination of positive and negative selection operates during T cell development
in the thymus to help ensure that only T cells with potentially useful TCRs
survive, mature, and emigrate, while all of the others die by apoptosis. The naïve
Th and Tc cells that leave the thymus constantly receive survival signals when their
TCRs recognize self-peptide-MHC complexes, but they can only be activated when
their TCRs encounter foreign peptides in the grooves of MHC proteins on an activated
dendritic cell. The natural Treg cells that leave the thymus suppress self-reactive
lymphocytes to help maintain self-tolerance.

The production of an effector T cell from a naïve T cell requires multiple signals
from an activated dendritic cell. MHC-peptide complexes on the dendritic cell surface
provide one signal, by binding to both TCRs and a CD4 co-receptor on a Th or
Treg cell. Co-stimulatory proteins on the dendritic cell surface and secreted cytokines
are the other signals. When naïve Th cells are initially activated on a dendritic cell,
they differentiate into Th1, Th2, Tfh, or Th17 effector helper cells or into induced
Treg cells, depending mainly on the cytokines in their environment. Th1 cells secrete
interferon-γ (IFNγ) to activate macrophages and to induce B cells to switch the class
of Ig they make; Th2 and Tfh cells secrete other cytokines that also induce B cells
to switch Ig class; and Th17 cells secrete IL17 to promote inflammatory responses


WHAT WE DON'T KNOW


• What initiates an autoimmune
disease such as type 1 diabetes or
multiple sclerosis?

• When a naïve or memory T or 
B cell is activated by antigen and 
co-stimulatory signals, how does it
decide whether to become an effector
cell or memory cell? Are there cells 
that are pre-committed to becoming 
either effector or memory cells, for 
example, or is the decision determined 
solely by extracellular signais? 

• Why do some of us make IgE 
antibodies against harmless antigens 
and thereby develop hay fever and 
allergic asthma, while most of us do
not, and why is the proportion of such
allergic individuals increasing?

• How does a cytotoxic T cell (or NK 
cell) avoid being killed by the perforin 
and granzymes that it secretes to kill a
target cell? 
and wound healing. The effector helper Th cells recognize the same complex of foreign
peptide and class II MHC protein on the target-cell surface as they initially
recognized on the dendritic cell that activated them. They activate their target cells
by producing a combination of membrane-bound and secreted co-stimulatory proteins.
Treg cells suppress immune cells using cell-surface and secreted inhibitory proteins.

Both T cells and B cells require multiple signals for activation. Antigen binding
to the TCRs or BCRs provides one signal, while co-stimulatory proteins binding to
co-receptors and cytokines binding to their complementary receptors provide the
others. Effector Th cells provide the co-stimulatory signals for B cells, whereas APCs
provide them for T cells.


PROBLEMS 

Which statements are true? Explain why or why not. 

24-1 T cells whose receptors strongly bind a self-peptide-MHC
complex are killed off in peripheral lymphoid
organs when they encounter the self peptide on an
antigen-presenting dendritic cell.

24-2 To guarantee that the antigen-presenting cells in
the thymus will display a complete repertoire of self peptides
to allow elimination of self-reactive T cells, the thymus
recruits dendritic cells from all over the body.

24-3 The antibody diversity created by the combinatorial
joining of V, D, and J segments by V(D)J recombination
pales in comparison to the enormous diversity created by
the random gain and loss of nucleotides at V, D, and J joining
sites.

Discuss the following problems. 

24-4 Why do living trees not rot? Redwood trees, for 
example, can live for centuries, but once they die they 
decay fairly quickly. What might this suggest?

24-5 It would be disastrous if a complement attack were
not confined to the surface of the pathogen that is the target
of the attack. Yet, the proteolytic cascade involved in
the attack liberates biologically active molecules at several
steps: one that diffuses away and one that remains bound
to the target surface. How does the complement reaction
remain localized when active products leave the surface?

24-6 Based on its sequence similarity to Apobec1,
which deaminates Cs to Us in RNA, activation-induced
deaminase (AID) was originally proposed to work on RNA.
But definitive experiments in E. coli demonstrated that
AID deaminates Cs to Us in DNA. The authors of the paper
expressed AID in bacteria and followed mutations in a
selectable gene. They found that AID expression increased
mutations about fivefold above the background level in
the absence of AID expression. More importantly, they
found that 80% of the induced mutations were G→A or
C→T. Does this fit with your expectation if AID-induced
mutations arose by deamination of C to U in the DNA?


[Hint: imagine what would happen if the G:U mismatch 
created by AID was replicated several times; how would 
the sequences of the final mutations relate to the original 
G-C base pair?] 
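
The thought experiment in the hint can be traced with a few lines of code. The sketch below follows a single base-pair position through several rounds of semiconservative replication, assuming (as a simplification not stated in the problem) that the U is never repaired and that the polymerase reads U like T, pairing it with A. The names `PARTNER`, `replicate`, and `descendants` are illustrative only.

```python
# Base-pairing partners; U (a deaminated C) is read like T and pairs with A.
PARTNER = {"A": "T", "T": "A", "G": "C", "C": "G", "U": "A"}


def replicate(pair):
    """Replicate one base-pair position, given as (top_base, bottom_base).

    Each parental strand is kept and serves as template for a new partner,
    so one pair gives rise to two daughter pairs.
    """
    top, bottom = pair
    return [(top, PARTNER[top]), (PARTNER[bottom], bottom)]


def descendants(pair, rounds):
    """Return all pairs present after the given number of replication rounds."""
    pairs = [pair]
    for _ in range(rounds):
        pairs = [child for p in pairs for child in replicate(p)]
    return pairs


if __name__ == "__main__":
    # C on the bottom strand of a G-C pair has been deaminated to U.
    print(sorted(set(descendants(("G", "U"), 3))))
    # C on the top strand of a C-G pair has been deaminated to U.
    print(sorted(set(descendants(("U", "G"), 3))))
```

Read along the top strand, the first case turns a G into an A and the second turns a C into a T, depending only on which strand carried the deaminated C; the undamaged parental strand continues to give the original pair.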

24-7 For many years it was a complete mystery how
cytotoxic T cells could see a viral protein that seemed to be
present only in the nucleus of the virus-infected cell. The
answer was revealed in a classic paper that took advantage
of a clone of T cells whose T cell receptor was directed
against an antigen associated with the nuclear protein of
the 1968 strain of influenza virus. The authors of the paper
found that when they incubated target cells with high concentrations of
certain peptides derived from the viral nuclear protein, the
cells became sensitive to lysis by subsequent incubation
with the cytotoxic T cells. Using various peptides from the
1968 strain and the 1934 strain (with which the cytotoxic T
cells did not react), the authors defined the particular peptide
responsible for the T cell response (Figure Q24-1).

A. Which part of the viral protein gives rise to the 
peptide that is recognized by the clone of cytotoxic T cells? 

(A) 345-360 365-380 

1968 DLRVLSFIRGTKVSPRGKLSTRGVQIASNENMDAMESSTLELRS 
1934 DLRVLSFIKGTKVVPRGKLSTRGVQIASNENMETMESSTLELRS 




Figure Q24-1 Viral nuclear protein recognition by cytotoxic T cells
(Problem 24-7). (A) Sequences of a segment of the nuclear protein
from the 1968 and 1934 strains of influenza virus. Peptides used
in the experiments in (B) are highlighted by pink bars. The amino
acid differences between the viral proteins are highlighted in blue.
(B) Cytotoxic T-cell-mediated lysis of target cells. The target cells
were untreated (none), infected with virus (1968 or 1934 strain), or
preincubated with high concentrations of the indicated viral peptide.
Why do not all viral peptides sensitize the target cells for
lysis by the cytotoxic T cells?

B. It is thought that MHC molecules come to the cell
surface with peptides already bound. If that is so, how do
you imagine that these experiments worked?


24-8 Working out the rules by which T cells interact
with their target cells was complicated. Some of the key
observations came from studying the way cytotoxic T cells
killed cells infected with lymphocytic choriomeningitis virus (LCMV).
Cytotoxic T cells derived from mice expressing "k-type"
class I MHC proteins lysed LCMV-infected cells expressing
the same k-type MHC protein, but they did not lyse
infected cells from mice expressing "d-type" class I MHC
proteins (Figure Q24-2). Similarly, cytotoxic T cells from
d-type mice lysed infected d-type cells, but not infected
k-type cells. LCMV can kill both k-type and d-type mice.


LCMV-infected target cells      T cells kill target cells?
k-type mouse fibroblasts        YES
d-type mouse fibroblasts        NO


Figure Q24-2 Pattern of killing of LCMV-infected fibroblasts by 
cytotoxic T cells from an LCMV-infected k-type mouse (Problem 24-8). 


A. If homozygous d-type mice were bred to homozygous
k-type mice to generate d-type/k-type heterozygous
progeny, would you expect cytotoxic T cells from these
heterozygotes, when infected with LCMV, to be able to lyse
infected d-type cells? How about infected k-type cells?
Explain your answers.

B. Oddly enough, LCMV infection does not kill mice
that lack a thymus, such as "nude" mice, so called because
they also lack hair. If a thymus is transplanted back into a
nude mouse, it will die when infected with LCMV. Suppose
that a d-type/k-type heterozygous nude mouse was given
a thymus from a d-type donor. Would you expect its cytotoxic
T cells to be able to lyse infected d-type cells? How
about infected k-type cells? Explain your answers.

24-9 Before exposure to a foreign antigen, T cells with
receptors specific for the antigen are a tiny fraction of the T
cells, on the order of 1 in 10⁵ or 1 in 10⁶ T cells. After exposure
to the antigen, only a small number of dendritic cells
typically display the antigen on their surface. How long
does it take for such antigen-presenting dendritic cells to
interact with the antigen-specific T cells, which is the key
first step in T cell activation and clonal expansion? The
dynamics of the search process were examined by labeling
dendritic cells red and T cells green, so that contacts in an
intact lymph node could be scored visually using two-photon
fluorescence microscopy (Figure Q24-3A). The frequency
of contacts between dendritic cells and T cells
from such experiments is given in Figure Q24-3B. Assuming
that 100 dendritic cells present the specific antigen, how
long would it take them to scan 10⁵ T cells? How long for
10⁶ T cells?
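
The arithmetic behind this estimate can be laid out as a short calculation. The sketch below assumes that every contact is with a different T cell and that each dendritic cell scans at a constant rate. The rate used in the example (500 contacts per dendritic cell per hour) is a placeholder chosen for illustration, not a value read from Figure Q24-3B; substitute the rate estimated from the plot.

```python
def scan_time_hours(n_t_cells, n_dendritic_cells, contacts_per_dc_per_hour):
    """Hours needed for a set of dendritic cells to contact n_t_cells T cells.

    Assumes each contact is with a previously unscanned T cell and that all
    dendritic cells scan independently at the same constant rate.
    """
    total_rate = n_dendritic_cells * contacts_per_dc_per_hour
    return n_t_cells / total_rate


if __name__ == "__main__":
    HYPOTHETICAL_RATE = 500  # contacts per dendritic cell per hour (placeholder)
    for n in (10**5, 10**6):
        hours = scan_time_hours(n, 100, HYPOTHETICAL_RATE)
        print(f"{n} T cells: {hours} hours")
```

Because the time scales inversely with both the number of presenting dendritic cells and their contact rate, a tenfold larger T cell pool takes ten times longer to scan under these assumptions.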
Figure Q24-3 Scanning of the T cell repertoire by dendritic cells (Problem 24-9).
(A) Contacts between different T cells and one dendritic cell. T cells are green and
dendritic cells are red. The dendritic cell labeled with an asterisk contacts a total
of three T cells (numbered) over time in this sequence of images. Times are shown
as hours:minutes. (B) Plot of T cell contacts for individual dendritic cells over time.
(A, from P. Bousso and E. Robey, Nat. Immunol. 4:579–581, 2003. With
permission from Macmillan Publishers Ltd.)


24-10 At first glance, it would seem a dangerous strategy
for the thymus to actively promote the survival, maturation,
and emigration of developing T cells that bind weakly
to self peptides bound to self MHC molecules. Would it
not be safer to get rid of these T cells, along with those that
bind strongly to such self-peptide-MHC complexes, as this
would seem a more secure way to avoid autoimmune reactions?


24-11 CD4 proteins on helper and regulatory T cells
serve as co-receptors that bind to invariant parts of class
II MHC proteins. CD4 is thought to increase the adhesion
between T cells and antigen-presenting cells (APCs) that
are initially connected only weakly by the T cell receptor
bound to its specific peptide-MHC complex. To test this
possibility, you label cell-surface MHC molecules with a
fluorescently labeled peptide so that you can detect individual
peptide-MHC complexes at the interface between
the APCs and the T cells in a culture dish. To detect T cell
responses (the sign of a productive contact), you load
them with a Ca²⁺ indicator dye, as cytosolic Ca²⁺ increases
when lymphocytes are active. You now count the peptide-MHC
complexes at a large number of interfaces (immunological
synapses) and measure the resulting uptake of Ca²⁺
in the adherent T cells (Figure Q24-4, red circles). When
you repeat the experiment in the presence of blocking antibodies
against CD4, you get a different result (blue circles).
Do these results support or refute the notion that CD4 augments
T cell receptor binding? Explain your answer.

Figure Q24-4 Role of CD4 in the T cell response (Problem 24-11). The uptake
of Ca²⁺ in cells with different numbers of fluorescently labeled peptide-MHC
complexes at the interface between the T cells and the antigen-presenting cells.
The results in the absence of CD4-blocking antibodies are shown by the red curve;
results in the presence of CD4 antibodies are shown by the blue curve.
Glossary


ABC transporters A large family of membrane transport
proteins that use the energy of ATP hydrolysis to transfer
peptides or small molecules across membranes. (Figure 11-16)

acetyl CoA Small water-soluble activated carrier molecule.
Consists of an acetyl group linked to coenzyme A (CoA) by an
easily hydrolyzable thioester bond. (Figure 2-38)

acetylcholine receptor (AChR) Membrane protein that
responds to binding of acetylcholine (ACh). The nicotinic AChR
is a transmitter-gated ion channel that opens in response
to ACh. The muscarinic AChR is not an ion channel, but a
G-protein-coupled cell-surface receptor.

acid A proton donor. Substance that releases protons (H⁺)
when dissolved in water, forming hydronium ions (H₃O⁺) and
lowering the pH. (Panel 2-2, pp. 92-93)

acid hydrolases Hydrolytic enzymes—including proteases, 
nucleases, glycosidases, lipases, phospholipases, 
phosphatases, and sulfatases—that work best at acidic pH; 
these enzymes are found within the lysosome. 

action potential Rapid, transient, self-propagating electrical
excitation in the plasma membrane of a cell such as a neuron
or muscle cell. Action potentials, or nerve impulses, make
possible long-distance signaling in the nervous system.
(Figure 11-31)

activated carrier Small diffusible molecule that stores easily
exchangeable energy in the form of one or more energy-rich
covalent bonds. Examples are ATP, acetyl CoA, FADH₂,
NADH, and NADPH. (Figure 2-31)

activation energy The extra energy that must be acquired by
atoms or molecules in addition to their ground-state energy in
order to reach the transition state required for them to undergo
a particular chemical reaction. (Figure 2-21)

activation-induced deaminase (AID) The enzyme catalyzing
the processes of somatic hypermutation and immunoglobulin
class switching in activated B cells.

active site Region of an enzyme surface to which a substrate
molecule binds in order to undergo a catalyzed reaction.
(Figure 1-7)

active transport Movement of a molecule across a
membrane or other barrier driven by energy other than that
stored in the electrochemical or concentration gradient of the
transported molecule.

adaptation (1) Adaptation (desensitization): adjustment of 
sensitivity following repeated stimulation. The mechanism 
that allows a cell to react to small changes in stimuli even 
against a high background level of stimulation. (2) Evolutionary 
adaptation: an evolved trait. 


adaptive immune system System of lymphocytes providing
highly specific and long-lasting defense against pathogens in
vertebrates. It consists of two major classes of lymphocytes:
B lymphocytes (B cells), which secrete antibodies that bind
specifically to the pathogen or its products, and T lymphocytes
(T cells), which can either directly kill cells infected with the
pathogen or produce secreted or cell-surface signal proteins
that stimulate other host cells to help eliminate the pathogen.
(Figure 24-2)

adaptor protein, adaptor General term for a protein that
functions solely to link two or more different proteins together in
an intracellular signaling pathway or protein complex.
(Figure 15-11)

adenylyl cyclase (adenylate cyclase) Membrane-bound 
enzyme that catalyzes the formation of cyclic AMP from ATP. An 
important component of some intracellular signaling pathways. 

adherens junction Cell junction in which the cytoplasmic
face of the plasma membrane is attached to actin filaments.
Examples include adhesion belts linking adjacent epithelial cells
and focal contacts on the lower surface of cultured fibroblasts.

adhesins Specific proteins or protein complexes of
pathogenic bacteria that recognize and bind cell-surface
molecules on the host cells to enable tight adhesion and
colonization of tissues.

adhesion belt Adherens junctions in epithelia that form a 
continuous belt (zonula adherens) just beneath the apical face 
of the epithelium, encircling each of the interacting cells in the 
sheet. 

ADP (adenosine 5'-diphosphate) Nucleotide produced by
hydrolysis of the terminal phosphate of ATP. Regenerates ATP
when phosphorylated by an energy-generating process such as
oxidative phosphorylation. (Figure 2-33)

aerobic respiration Process by which a cell obtains energy
from sugars or other organic molecules by allowing their carbon
and hydrogen atoms to combine with the oxygen in air to
produce CO₂ and H₂O, respectively.

affinity maturation Progressive increase in the affinity of 
antibodies for the immunizing antigen with the passage of time 
after immunization. 

Agrin Signal protein released by an axonal growth cone
during formation of the synapse between it and a muscle cell.

AIRE (autoimmune regulator) A protein expressed by a
subpopulation of epithelial cells in the thymus that stimulates
the production of small amounts of self proteins characteristic
of other organs, exposing developing thymocytes to these
proteins for the purpose of self-tolerance.

Akt Serine/threonine protein kinase that acts in the PI-3-kinase/Akt
intracellular signaling pathway involved especially in
signaling cells to grow and survive. Also called protein kinase B
(PKB).

allele One of several alternative forms of a gene. In a diploid
cell, each gene will typically have two alleles, occupying the
corresponding position (locus) on homologous chromosomes.

allosteric protein A protein that can adopt at least two
distinct conformations, and for which the binding of a ligand at
one site causes a conformational change that alters the activity
of the protein at a second site; this allows one type of molecule
in a cell to alter the fate of a molecule of another type, a feature
widely exploited in enzyme regulation.

allostery (adjective allosteric) Change in a protein’s 
conformation brought about by the binding of a regulatory 
ligand (at a site other than the protein’s catalytic site), or by 
covalent modification. The change in conformation alters 
the activity of the protein and can form the basis of directed 
movement. (Figures 3-57 and 16-29) 

alpha helix (α helix) Common folding pattern in proteins,
in which a linear sequence of amino acids folds into a right-handed
helix stabilized by internal hydrogen-bonding between
backbone atoms. (Figure 3-7)

alternative RNA splicing Production of different RNAs from 
the same gene by splicing the transcript in different ways. 
(Figure 7-57) 

amino acid Organic molecule containing both an amino
group and a carboxyl group. Those that serve as building
blocks of proteins are alpha amino acids, having both the
amino and carboxyl groups linked to the same carbon atom.
(NH₂CHRCOOH, Panel 3-1, pp. 112-113)

aminoacyl-tRNA synthetase Enzyme that attaches the
correct amino acid to a tRNA molecule to form an aminoacyl-tRNA.
(Figure 6-54)

AMPA receptor Glutamate-gated ion channel in the
mammalian central nervous system that carries most of the
depolarizing current responsible for excitatory postsynaptic
potentials.

amphiphilic Having both hydrophobic and hydrophilic
regions, as in a phospholipid or a detergent molecule.

amyloid fibrils Self-propagating, stable β-sheet aggregates
built from hundreds of identical polypeptide chains that become
layered one over the other to create a continuous stack of
β sheets. The unbranched fibrous structure can contribute to
human diseases when not controlled.

anaphase (1) Stage of mitosis during which sister chromatids 
separate and move away from each other. (2) Anaphase I and 
II: stages of meiosis during which chromosome homolog pairs 
separate (I), and then sister chromatids separate (II). (Panel 
17-1, pp. 980-981) 

anaphase A Stage of mitosis during which chromosome
segregation occurs as chromosomes move toward the two
spindle poles.

anaphase B Stage of mitosis during which chromosome
segregation occurs as spindle poles separate and move apart.

anaphase-promoting complex (APC/C; cyclosome)
Ubiquitin ligase that catalyzes the ubiquitylation and destruction
of securin and M- and S-cyclins, initiating the separation of
sister chromatids in the metaphase-to-anaphase transition
during mitosis.

anchorage dependence Dependence of cell growth,
proliferation, and survival on attachment to a substratum.

anchoring junction Cell junction that attaches cells to
neighboring cells or to the extracellular matrix.
(Table 19-1, p. 1037)

angiogenesis Growth of new blood vessels by sprouting 
from existing ones. 

antenna complex Part of a photosystem that captures light
energy and channels it into the photochemical reaction center.
It consists of protein complexes that bind large numbers of
chlorophyll molecules and other pigments.

Antennapedia complex One of two gene clusters in
Drosophila that contain Hox genes; genes in the Antennapedia
complex control the differences among the thoracic and head
segments of the body.

anti-apoptotic Bcl2 family proteins Proteins (e.g., Bcl2,
BclXL) on the cytosolic surface of the outer mitochondrial
membrane that bind and inhibit pro-apoptotic Bcl2 family
proteins and thereby help prevent inappropriate activation of the
intrinsic pathway of apoptosis.

anti-IAP Produced in response to various apoptotic stimuli
and, by binding to IAPs and preventing their binding to a
caspase, neutralizes the inhibition of apoptosis provided by IAPs.

antibiotic Substance such as penicillin or streptomycin that is 
toxic to microorganisms. Often a natural product of a particular 
microorganism or plant. 

antibody Protein secreted by activated B cells in response to
a pathogen or foreign molecule. Binds tightly to the pathogen
or foreign molecule, inactivating it or marking it for destruction
by phagocytosis or complement-induced lysis. (Figure 24-23)

antibody response Adaptive immune response in which 
B cells are activated to secrete antibodies that circulate in 
the bloodstream or enter other body fluids, where they can 
bind specifically to the foreign antigen that stimulated their 
production. 

anticodon Sequence of three nucleotides in a transfer RNA
(tRNA) molecule that is complementary to a three-nucleotide
codon in a messenger RNA (mRNA) molecule.

antigen A molecule that can induce an adaptive immune
response or that can bind to an antibody or T cell receptor.

antigen-presenting cell Cell that displays foreign antigen
complexed with an MHC protein on its surface for presentation
to T lymphocytes.

antigenic determinant Specific region of an antigen that
binds to an antibody or a complementary receptor on the
surface of a B cell (BCR) or T cell (TCR).

antigenic variation Ability to change the antigens
displayed on the cell surface; a property of some pathogenic
microorganisms that enables them to evade attack by the
adaptive immune system.

antiparallel Describes the relative orientation of the two
strands in a DNA double helix or two paired regions of a
polypeptide chain; the polarity of one strand is opposite to that
of the other.

antiporter Carrier protein that transports two different ions
or small molecules across a membrane in opposite directions,
either simultaneously or in sequence. (Figure 11-8)

Apaf1 Adaptor protein of the intrinsic apoptotic pathway; on
binding cytochrome c, oligomerizes to form an apoptosome.

apical Referring to the tip of a cell, a structure, or an organ.
The apical surface of an epithelial cell is the exposed free
surface, opposite to the basal surface. The basal surface rests
on the basal lamina that separates the epithelium from other
tissue.

apoptosis Form of programmed cell death, in which a
“suicide” program is activated within an animal cell, leading to
rapid cell death mediated by intracellular proteolytic enzymes
called caspases.

apoptosome Heptamer of Apaf1 proteins that forms
on activation of the intrinsic apoptotic pathway; it recruits
and activates initiator caspases that subsequently activate
downstream executioner caspases to induce apoptosis.

aquaporin (water channel) Channel protein embedded 
in the plasma membrane that greatly increases the cell’s 
permeability to water, allowing transport of water, but not ions, 
at a high rate across the membrane. 

archaeon (plural arch[a]ea) (archaebacterium) Single-celled
organism without a nucleus, superficially similar
to bacteria. At a molecular level, more closely related to
eukaryotes in genetic machinery than are bacteria. Archaea and
bacteria together make up the prokaryotes. (Figure 1-17)

ARF proteins Monomeric GTPase in the Ras superfamily 
responsible for regulating both COPI coat assembly and clathrin 
coat assembly. (Table 15-5, p. 854) 

ARP (actin-related protein) complex (Arp 2/3 complex)
Complex of proteins that nucleates actin filament growth from
the minus end.

arrestin Member of a family of proteins that contributes to 
GPCR desensitization by preventing the activated receptor from 
interacting with G proteins and serving as an adaptor to couple 
the receptor to clathrin-dependent endocytosis. (Figure 15-42) 

astral microtubule In the mitotic spindle, any of the 
microtubules radiating from the aster which are not attached to 
a kinetochore of a chromosome. 

asymmetric cell division Cell division in which some
important molecule or molecules are distributed unequally
between the two daughter cells, causing these cells to become
different from each other.

ATM (ataxia telangiectasia mutated protein) Protein
kinase activated by double-strand DNA breaks. If breaks are
not repaired, ATM initiates a signal cascade that culminates in
cell cycle arrest. Related to ATR.

ATP (adenosine 5'-triphosphate) Nucleoside triphosphate
composed of adenine, ribose, and three phosphate groups.
The principal carrier of chemical energy in cells. The terminal
phosphate groups are highly reactive in the sense that their
hydrolysis, or transfer to another molecule, takes place with the
release of a large amount of free energy. (Figure 2-33)

ATP synthase (F₁F₀ ATPase) Transmembrane enzyme
complex in the inner membrane of mitochondria and the
thylakoid membrane of chloroplasts. Catalyzes the formation
of ATP from ADP and inorganic phosphate during oxidative
phosphorylation and photosynthesis, respectively. Also present
in the plasma membrane of bacteria.

ATR (ataxia telangiectasia and Rad3 related protein)
Protein kinase activated by DNA damage. If damage remains
unrepaired, ATR helps initiate a signal cascade that culminates
in cell cycle arrest. Related to ATM.

autoimmune disease Pathological state in which the body
mounts a disabling adaptive immune response against one or
more of its own molecules.

autophagosome Organelle surrounded by a double
membrane that contains engulfed cytoplasmic cargo in the initial
stages of autophagy.


autophagy Digestion of cytoplasm and worn-out organelles 
by the cell’s own lysosomes. 

auxin Plant hormone, commonly indole-3-acetic acid, with
numerous roles in plant growth and development.

axon Long nerve cell projection that can rapidly conduct
nerve impulses over long distances so as to deliver signals to
other cells.

axoneme Bundle of microtubules and associated proteins
that forms the core of a cilium or a flagellum in eukaryotic cells
and is responsible for their movements.

bacterial artificial chromosome (BAC) Cloning vector that
can accommodate large pieces of DNA, typically up to 1 million
base pairs.

bacteriorhodopsin Pigmented protein found in the plasma
membrane of a salt-loving archaeon, Halobacterium salinarium
(Halobacterium halobium). Pumps protons out of the cell in
response to light.

bacterium (plural bacteria) (eubacterium) Member of
the domain bacteria, one of the three main branches of the
tree of life (archaea, bacteria, and eukaryotes). Bacteria and
archaea both lack a distinct nuclear compartment, and together
comprise the prokaryotes. (Figure 1-17)

Bak A main effector Bcl2 family protein of the intrinsic 
pathway of apoptosis in mammalian cells that is bound to 
the mitochondrial outer membrane even in the absence of an 
apoptotic signal; activation is usually by activated pro-apoptotic 
BH3-only proteins. 

basal Situated near the base. Opposite the apical surface. 

basal lamina (plural basal laminae) Thin mat of extracellular
matrix that separates epithelial sheets, and many other types
of cells such as muscle or fat cells, from connective tissue.
Sometimes called basement membrane. (Figure 19-51)

base (1) A substance that can reduce the number of protons
in solution, either by accepting H⁺ ions directly, or by releasing
OH⁻ ions, which then combine with H⁺ to form H₂O.
(2) The purines and pyrimidines in DNA and RNA are organic
nitrogenous bases and are often referred to simply as bases.
(Panel 2-2, pp. 92-93)

base excision repair DNA repair pathway in which single
faulty bases are removed from the DNA helix and replaced.
Compare nucleotide excision repair. (Figure 5-41)

base pair Two nucleotides in an RNA or DNA molecule that
are held together by hydrogen bonds: for example, G paired
with C, and A paired with T or U.

basement membrane Thin mat of extracellular matrix that
separates epithelial sheets, and many other types of cells such
as muscle or fat cells, from connective tissue. Also called basal
lamina. (Figure 19-51)

Bax A main effector Bcl2 family protein of the intrinsic 
pathway of apoptosis in mammalian cells; located mainly 
in the cytosol and translocates to the mitochondria only 
after activation, usually by activated pro-apoptotic BH3-only 
proteins. 

B cell receptor (BCR) The transmembrane immunoglobulin 
protein on the surface of a B cell that serves as its receptor for 
antigen. 

Bcl2 Anti-apoptotic Bcl2 family protein of the outer 
mitochondrial membrane that binds and inhibits pro-apoptotic 
Bcl2 family proteins and prevents inappropriate activation of the 
intrinsic pathway of apoptosis. 
Bcl2 family Family of intracellular proteins that either promote
or inhibit apoptosis by regulating the release of cytochrome 
c and other mitochondrial proteins from the intermembrane 
space into the cytosol. 

BclXL Anti-apoptotic Bcl2 family protein of the outer
mitochondrial membrane that binds and inhibits pro-apoptotic 
Bcl2 family proteins and prevents inappropriate activation of the 
intrinsic pathway of apoptosis. 

benign Of tumors: self-limiting in growth, and noninvasive. 

beta sheet (β sheet) Common structural motif in proteins in
which different sections of the polypeptide chain run alongside
each other, joined together by hydrogen-bonding between
atoms of the polypeptide backbone. Also known as a β pleated
sheet. (Figure 3-7)

beta-catenin (β-catenin) Multifunctional cytoplasmic protein
involved in cadherin-mediated cell-cell adhesion, linking
cadherins to the actin cytoskeleton. Can also act independently
as a transcription regulatory protein. Has an important role in
animal development as part of a Wnt signaling pathway.

BH3-only proteins The largest subclass of Bcl2 family 
proteins. Produced or activated in response to an apoptotic 
stimulus and promote apoptosis mainly by inhibiting anti- 
apoptotic Bcl2 family proteins. 

bi-orientation The attachment of sister chromatids to
opposite poles of the mitotic spindle, so that they move to
opposite ends of the cell when they separate in anaphase.

binding site Region on the surface of one molecule (usually a
protein or nucleic acid) that can interact with another molecule
through noncovalent bonding.

BiP Endoplasmic reticulum (ER)-resident chaperone protein
member of the family of hsp70-type chaperone proteins.

Bithorax complex One of two gene clusters in Drosophila
that contain Hox genes; genes in the Bithorax complex control
the differences among the abdominal and thoracic segments of
the body.

bivalent A four-chromatid structure formed during meiosis, 
consisting of a duplicated chromosome tightly paired with its 
homologous duplicated chromosome. 

blastomere One of the many cells formed by the cleavage of 
a fertilized egg. 

blastula Early stage of an animal embryo, usually consisting 
of a hollow ball of epithelial cells surrounding a fluid-filled cavity,
before gastrulation begins. 

blebbing Membrane protrusion formed when the plasma 
membrane detaches locally from the underlying actin cortex,
allowing cytoplasmic flow and hydrostatic pressure within the 
cell to push the membrane outward. 

bone Dense and rigid connective tissue comprising a mixture 
of tough fibers (type I collagen fibrils), which resist pulling forces, 
and solid particles (calcium phosphate as hydroxylapatite 
crystals), which resist compression. 

brassinosteroids Class of steroid signal molecules in plants
that regulate the growth and differentiation of plants throughout
their life cycle via binding to a cell-surface receptor kinase to 
initiate a signaling cascade. 

bright-field microscope Normal light microscope in which 
the image is obtained by simple transmission of light through 
the object being viewed. 

buffer Solution of weak acid or weak base that resists the pH
change that would otherwise occur when small quantities of
acid or base are added.
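
As a rough numerical illustration of this resistance to pH change, the sketch below applies the Henderson-Hasselbalch relation, pH = pKa + log10([A-]/[HA]), to a hypothetical acetate buffer. The pKa value, the amounts, and the function names are illustrative assumptions and are not taken from the glossary; volume changes and activity effects are ignored.

```python
import math

def buffer_ph(pka, base_mol, acid_mol):
    """pH of a weak acid/conjugate base mixture (Henderson-Hasselbalch)."""
    return pka + math.log10(base_mol / acid_mol)

def ph_after_strong_acid(pka, base_mol, acid_mol, added_h_mol):
    """Added H+ converts conjugate base (A-) into weak acid (HA)."""
    if added_h_mol >= base_mol:
        raise ValueError("buffer capacity exceeded in this simple model")
    return buffer_ph(pka, base_mol - added_h_mol, acid_mol + added_h_mol)

# Assumed example: 0.1 mol acetate + 0.1 mol acetic acid in 1 liter,
# with pKa taken as 4.76 (an illustrative textbook value).
PKA_ACETIC = 4.76
before = buffer_ph(PKA_ACETIC, 0.1, 0.1)
after = ph_after_strong_acid(PKA_ACETIC, 0.1, 0.1, 0.01)

# The same 0.01 mol of strong acid in 1 liter of unbuffered water.
unbuffered = -math.log10(0.01)

print(f"buffered:   {before:.2f} -> {after:.2f}")
print(f"unbuffered: 7.00 -> {unbuffered:.2f}")
```

In this simplified model the buffered solution shifts by less than a tenth of a pH unit, whereas the same addition to pure water shifts the pH by about five units.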

C3 The pivotal complement protein that is activated by the
early components of all three complement pathways (the
classical pathway, the lectin pathway, and the alternative 
pathway). (Figure 24-7) 

Ca²⁺ pump (calcium pump, Ca²⁺ ATPase) Transport protein
in the membrane of sarcoplasmic reticulum of muscle cells
(and elsewhere). Pumps Ca²⁺ out of the cytoplasm into the
sarcoplasmic reticulum using the energy of ATP hydrolysis.

Ca²⁺-activated K⁺ channel Opens in response to the raised
concentration of Ca²⁺ in nerve cells that occurs in response
to an action potential. Increased K⁺ permeability makes the
membrane harder to depolarize, increasing the delay between 
action potentials and decreasing the response of the cell to 
constant, prolonged stimulation (adaptation). 

Ca²⁺/calmodulin-dependent kinase (CaM-kinase) Serine/
threonine protein kinase that is activated by Ca²⁺/calmodulin.
Indirectly mediates the effects of an increase in cytosolic Ca²⁺
by phosphorylating specific target proteins. (Figure 15-33)

cadherin Member of the large cadherin superfamily of 
transmembrane adhesion proteins. Mediates homophilic
Ca²⁺-dependent cell-cell adhesion in animal tissues. (Figure
19-3 and Table 19-1, p. 1037) 

cadherin superfamily Family of classical and nonclassical 
cadherin proteins with more than 180 members in humans. 

calmodulin Ubiquitous intracellular Ca²⁺-binding protein that
undergoes a large conformation change when it binds Ca²⁺,
allowing it to regulate the activity of many target proteins. In
its activated (Ca²⁺-bound) form, it is called Ca²⁺/calmodulin.
(Figure 15-33) 

calnexin Carbohydrate-binding chaperone protein in 
the endoplasmic reticulum (ER) membrane that binds to
oligosaccharides on incompletely folded proteins and retains 
them in the ER. 

calreticulin Carbohydrate-binding chaperone protein 
in the endoplasmic reticulum (ER) lumen that binds to
oligosaccharides on incompletely folded proteins and retains 
them in the ER. 

CaM-kinase II Multifunctional Ca²⁺/calmodulin-dependent
protein kinase that phosphorylates itself and various target
proteins when activated. Found in most animal cells but is 
especially abundant at synapses in the brain, and is involved in 
some forms of synaptic plasticity in vertebrates. (Figure 15-34) 

cancer stem cells Rare cancer cells capable of dividing 
indefinitely. 

cancer-critical genes Genes whose alteration contributes to 
the causation or evolution of cancer by driving tumorigenesis.

capsid Protein coat of a virus, formed by the self-assembly of 
one or more types of protein subunit into a geometrically regular 
structure. (Figure 3-27) 

carbohydrate layer The carbohydrate-rich zone on the 
eukaryotic cell surface attributable to glycoproteins, glycolipids, 
and proteoglycans of the plasma membrane. 

carbon-fixation reaction Process by which inorganic carbon 
(as atmospheric CO₂) is incorporated into organic molecules.
The second stage of photosynthesis. (Figure 14-40) 

carcinogenesis The generation of cancer.

carcinoma Cancer of epithelial cells. The most common form
of human cancer. 

cargo The membrane components and soluble molecules
carried by transport vesicles. 

cartilage Form of connective tissue composed of cells 
(chondrocytes) embedded in a matrix rich in type II collagen 
and chondroitin sulfate proteoglycan. 

caspase Intracellular protease that is involved in mediating 
the intracellular events of apoptosis. 

catalyst Substance that can lower the activation energy of a 
reaction (thus increasing its rate), without itself being consumed 
by the reaction. 

caveola (plural caveolae) Invaginations at the cell surface 
that bud off internally to form pinocytic vesicles. Thought to 
form from lipid rafts, regions of membrane rich in certain lipids.

caveolins Family of unusual integral membrane proteins that
are the major structural proteins in caveolae.

CD4 Co-receptor protein on helper T cells and regulatory 
T cells that binds to a nonvariable part of class II MHC proteins 
(on antigen-presenting cells) outside the peptide-binding 
groove. (Figure 24-40) 

CD8 Co-receptor protein on cytotoxic T cells that binds to a 
nonvariable part of class I MHC proteins (on antigen-presenting 
cells and infected target cells) outside the peptide-binding 
groove. (Figure 24-40) 

Cdc20 Activating subunit of the anaphase-promoting 
complex (APC/C). 

Cdc25 Protein phosphatase that dephosphorylates Cdks and 
increases their activity. 

Cdc42 Member of the Rho family of monomeric GTPases 
that regulate the actin and microtubule cytoskeletons, cell-cycle 
progression, gene transcription, and membrane transport. 

Cdc6 Protein essential in the preparation of DNA for
replication. With Cdt1 it binds to an origin recognition complex
on chromosomal DNA and helps load the Mcm proteins onto 
the complex to form the prereplicative complex. 

Cdh1 Activating subunit of the anaphase-promoting complex
(APC/C). 

Cdk inhibitor protein (CKI) Protein that binds to and inhibits 
cyclin-Cdk complexes, primarily involved in the control of G1
and S phases. 

Cdk-activating kinase (CAK) Protein kinase that 
phosphorylates Cdks in cyclin-Cdk complexes, activating the
Cdk. 

cDNA clone Clone containing double-stranded cDNA 
molecules derived from the protein-coding mRNA molecules
present in a cell.

cDNA library Collection of cloned DNA molecules
representing complementary DNA copies of the mRNA 
produced by a cell. 

Cdt1 Protein essential in the preparation of DNA for
replication. With Cdc6 it binds to origin recognition complexes
on chromosomes and helps load the Mcm proteins on to the 
complex, forming the prereplicative complex. 

cell cortex Specialized layer of cytoplasm on the inner face 
of the plasma membrane. In animal cells it is an actin-rich layer 
responsible for movements of the cell surface. 

cell cycle (cell-division cycle) Reproductive cycle of a cell:
the orderly sequence of events by which a cell duplicates its
chromosomes and, usually, the other cell contents, and divides
into two. (Figure 17-4)

cell determination Process whereby a cell progressively
loses the potential to form other cell types, as development 
proceeds. 

cell doctrine The proposal that all living organisms are
composed of one or more cells and that all cells arise from the
division of other living cells. 

cell memory Retention by cells and their descendants of
persistently altered patterns of gene expression, without any 
change in DNA sequence. See also epigenetic inheritance. 

cell plate Flattened membrane-bounded structure that forms 
by fusing vesicles in the cytoplasm of a dividing plant cell and is 
the precursor of the new cell wall. 

cell-cycle control system Network of regulatory proteins
that governs progression of a eukaryotic cell through the cell 
cycle. 

cellulose Long, unbranched chains of glucose; major 
constituent of plant cell walls. 

cellulose microfibril Highly ordered crystalline aggregate 
formed from bundles of about 40 cellulose chains, arranged 
with the same polarity and stuck together in overlapping 
parallel arrays by hydrogen bonds between adjacent cellulose 
molecules.

central (primary) lymphoid organ Organ in which T or 
B lymphocytes are produced from precursor cells. In adult 
mammals, these are the thymus and bone marrow, respectively. 
(Figure 24-12) 

centriole Short cylindrical array of microtubules, closely 
similar in structure to a basal body. A pair of centrioles is usually 
found at the center of a centrosome in animal cells. (Figure 
16-48) 

centromere Constricted region of a mitotic chromosome
that holds sister chromatids together. This is also the site 
on the DNA where the kinetochore forms so as to capture 
microtubules from the mitotic spindle. (Figure 4-43) 

centrosome Centrally located organelle of animal cells that is 
the primary microtubule-organizing center (MTOC) and acts as 
the spindle pole during mitosis. In most animal cells it contains
a pair of centrioles. (Figures 16-47 and 17-24) 

cerebral cortex Outermost layer of the hemispheres of the
brain; the most complex structure in the human body. 

CG island Region of DNA in vertebrate genomes with a
greater than average density of CG sequences; these regions
generally remain unmethylated. 

channel (membrane channel) Transmembrane protein 
complex that allows inorganic ions or other small molecules to
diffuse passively across the lipid bilayer. (Figure 11-3) 

channelrhodopsin Photosensitive protein forming a cation 
channel across the membrane that opens in response to light. 

charge separation In photosynthesis, the light-induced
transfer of a high-energy electron from chlorophyll to an
acceptor molecule resulting in the formation of a positive
charge on the chlorophyll and a negative charge on a mobile
electron carrier.

chemical biology Name given to a strategy that uses large-
scale screening of hundreds of thousands of small molecules
in biological assays to identify chemicals that affect a particular
biological process and that can then be used to study it. 

chemical carcinogens Disparate chemicals that are
carcinogenic (due to the ability to cause mutations) when fed
to experimental animals or painted repeatedly on their skin.

chemical group Certain combinations of atoms, such as
methyl (-CH₃), hydroxyl (-OH), carboxyl (-COOH), carbonyl
(-C=O), phosphate (-PO₃²⁻), sulfhydryl (-SH), and amino (-NH₂)
groups, that have distinct chemical and physical properties
and influence the behavior of the molecule in which the group
occurs. 

chemiosmotic coupling (chemiosmosis) Mechanism in 
which an electrochemical proton gradient across a membrane 
(composed of a pH gradient plus a membrane potential) is used 
to drive an energy-requiring process, such as ATP production 
or the rotation of bacterial flagella. 

chemotaxis Movement of a cell toward or away from some 
diffusible chemical.

chiasma (plural chiasmata) X-shaped connection visible 
between paired homologous chromosomes during meiosis. 
Represents a site of chromosomal crossing-over, a form of 
genetic recombination. 

chlorophyll Light-absorbing green pigment that plays a 
central part in photosynthesis in bacteria, plants, and algae. 

chloroplast Organelle in green algae and plants that contains 
chlorophyll and carries out photosynthesis. 

cholera toxin Secreted toxic protein of Vibrio cholerae
responsible for causing the watery diarrhea associated with
cholera. Comprises an A subunit with enzymatic activity and a
B subunit that binds to host-cell receptors to direct subunit A to 
the host-cell cytosol. 

cholesterol An abundant lipid molecule with a characteristic
four-ring steroid structure. An important component of the 
plasma membranes of animal cells. (Figure 10-4) 

chromatin Complex of DNA, histones, and non-histone 
proteins found in the nucleus of a eukaryotic cell. The material
of which chromosomes are made. 

chromatin immunoprecipitation Technique by which
chromosomal DNA bound by a particular protein can be 
isolated and identified by precipitating it by means of an 
antibody against the protein. (Figures 8-66 and 8-67) 

chromosome Structure composed of a very long DNA 
molecule and associated proteins that carries part (or all) of
the hereditary information of an organism. Especially evident
in plant and animal cells undergoing mitosis or meiosis, during 
which each chromosome becomes condensed into a compact 
rodlike structure visible in the light microscope. 

cilium (plural cilia) Hairlike extension of a eukaryotic cell 
containing a core bundle of microtubules. Many cells contain a 
single nonmotile cilium, while others contain large numbers that 
perform repeated beating movements. Compare flagellum. 

circadian clock Internal cyclical process that produces a
particular change in a cell or organism with a period of around 
24 hours, for example the sleep-wakefulness cycle in humans. 

cis face Face on the same or near side.

cis Golgi network (CGN) Network of fused vesicular tubular
clusters that is closely associated with the cis face of the Golgi
apparatus and is the compartment at which proteins and lipids
enter the Golgi. 

cis-regulatory sequences DNA sequences to which
transcription regulators bind to control the rate of gene
transcription. In nearly all cases, these sequences must be on
the same chromosome (that is, in cis) as the genes they control.
(Figure 7-18)

cisternal maturation model One hypothesis for how the
Golgi apparatus achieves and maintains its polarized structure
and how molecules move from one cisterna to another. This
model views the cisternae as dynamic structures that mature
from early to late by acquiring and then losing specific Golgi-
resident proteins as they move through the Golgi stack with
cargo.

citric acid cycle [tricarboxylic acid (TCA) cycle, Krebs
cycle] Central metabolic pathway found in aerobic organisms.
Oxidizes acetyl groups derived from food molecules, generating
the activated carriers NADH and FADH₂, some GTP, and waste
CO₂. In eukaryotic cells, it occurs in the mitochondria.
(Panel 2-9, pp. 106-107)

clamp loader Protein complex that utilizes ATP hydrolysis to 
load the sliding clamp on to a primer-template junction in the 
process of DNA replication.

class I MHC protein One of two classes of major 
histocompatibility complex (MHC) protein. Found on the surface 
of almost all vertebrate cell types, where it can present foreign
peptides derived from a pathogen such as a virus to cytotoxic 
T cells. (Figures 24-35 and 24-36A) 

class II MHC protein One of two classes of major 
histocompatibility complex (MHC) protein. Found on the surface 
of various antigen-presenting cells, where it presents peptides
to helper and regulatory T cells. (Figures 24-35 and 24-36B) 

class switching Change from making one class of 
immunoglobulin (for example, IgM) to making another class (for 
example, IgG) that many B cells undergo during the course of 
an adaptive immune response. Involves DNA rearrangements 
called class-switch recombination. (Figure 24-30) 

class-switch recombination An irreversible change at the
DNA level when a B cell switches from making IgM and IgD to 
making one of the secondary classes of immunoglobulin. 

classical cadherins Family of cadherin proteins, including 
E-cadherin, N-cadherin, and P-cadherin, that are closely related 
in sequence throughout their extracellular and intracellular 
domains. 

clathrin Protein that assembles into a polyhedral cage on the 
cytosolic side of a membrane so as to form a clathrin-coated 
pit, which buds off by endocytosis to form an intracellular 
clathrin-coated vesicle. (Figure 13-6) 

clathrin-coated pits Specialized regions typically occupying
about 2% of the total plasma membrane area at which the
endocytic pathway often begins. 

clathrin-coated vesicles Coated vesicles that transport 
material from the plasma membrane and between endosomal 
and Golgi compartments.

cleavage (1) Physical splitting of a cell into two. (2) Specialized
type of cell division seen in many early embryos whereby a large
cell becomes subdivided into many smaller cells without growth.

clonal selection From a population of T and B lymphocytes
with a vast repertoire of randomly generated antigen-specific
receptors, a given foreign antigen activates (selects) only those
lymphocyte clones that display a receptor that fits the antigen.
Explains how the adaptive immune system can respond to
millions of different antigens in a highly specific way.
(Figure 24-15)

co-receptor In immunology: an accessory receptor on
B cells or T cells that does not bind antigen but binds to a 
co-stimulatory signal and helps activate the lymphocyte, by 
helping to activate an intracellular signaling pathway. 

co-stimulatory signal In immunology: a secreted or 
membrane-bound signal protein that helps activate an antigen- 
responding B cell or T cell. 

co-translational Occurring as translation proceeds.
Examples include the import of a protein into the endoplasmic
reticulum before the polypeptide chain is completely
synthesized (co-translational translocation, Figure 12-32), and
the folding of a nascent protein into its secondary and tertiary
structure as it emerges from a ribosome. (Figure 6-79)

coat-recruitment GTPases Members of a family of 
monomeric GTPases that have important roles in vesicle
transport, being responsible for coat assembly at the 
membrane. 

coated vesicle Small membrane-enclosed organelle with a 
cage of proteins (the coat) on its cytosolic surface. Formed by 
the pinching off of a coated region of membrane (coated pit).
Some coats are made of clathrin, others are made from other 
proteins. 

codon Sequence of three nucleotides in a DNA or mRNA
molecule that represents the instruction for incorporation of a
specific amino acid into a growing polypeptide chain.
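
The idea of reading a sequence three nucleotides at a time can be sketched as follows. The table is a deliberately small, partial excerpt of the standard genetic code (DNA alphabet), and the function names and example sequence are illustrative assumptions; codons missing from the excerpt are reported as "X".

```python
# Partial excerpt of the standard genetic code; "*" marks a stop codon.
CODON_TABLE = {
    "ATG": "M", "GCT": "A", "TGG": "W", "AAA": "K",
    "TTT": "F", "GGC": "G", "TAA": "*", "TAG": "*", "TGA": "*",
}

def split_codons(dna):
    """Split a sequence into consecutive, non-overlapping triplets."""
    usable = len(dna) - len(dna) % 3  # drop an incomplete trailing codon
    return [dna[i:i + 3] for i in range(0, usable, 3)]

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for codon in split_codons(dna.upper()):
        amino_acid = CODON_TABLE.get(codon, "X")  # "X" = not in this excerpt
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# Hypothetical coding sequence: ATG GCT TGG TAA
print(translate("ATGGCTTGGTAA"))
```

A complete translator would need all 64 codons and a choice of reading frame; this sketch only shows the triplet-by-triplet lookup.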

coenzyme Small molecule tightly associated with an enzyme
that participates in the reaction that the enzyme catalyzes, often
by forming a covalent bond to the substrate. Examples include
biotin, NAD⁺, and coenzyme A.

cohesin, cohesin complex Complex of proteins that holds 
sister chromatids together along their length before their 
separation. (Figure 17-19)

coiled-coil Especially stable rodlike protein structure formed 
by two or more α helices coiled around each other. (Figure 3-9)

collagen Fibrous protein rich in glycine and proline that 
is a major component of the extracellular matrix in animals,
conferring tensile strength. Exists in many forms: type I, the
most common, is found in skin, tendon, and bone; type II is
found in cartilage; type IV is present in basal laminae.
(Figures 3-23 and 19-40)

collagen fibril A higher-order collagen polymer of fibrillar 
collagens that assemble into thin structures (10-300 nm in 
diameter) many hundreds of micrometers long in mature 
tissues. 

colony-stimulating factor (CSF) General name for 
numerous signal molecules that control differentiation of blood
cells. 

colorectal cancer Cancer arising from the epithelium lining 
the colon (the large intestine) and rectum (the terminal segment 
of the gut). 

column chromatography Technique for separation of a
mixture of substances in solution by passage through a column 
containing a porous solid matrix. Substances are retarded to 
different extents by their interaction with the matrix and can be 
collected separately from the column. Depending on the matrix, 
separation can be according to charge, hydrophobicity, size, or
the ability to bind to other molecules.

commensalism Ecological relationship between microbes 
and their host in which the microbe benefits but offers no 
benefit and causes no harm. 


committed precursor Cell derived from a stem cell 
that divides for a limited number of times before terminally 
differentiating; also known as a transit amplifying cell. 

complement system System of blood proteins that can
be activated by antibody-antigen complexes or pathogens to 
help eliminate the pathogens, by directly causing their lysis, by 
promoting their phagocytosis, or activating an inflammatory 
response. (Figure 24-7) 

complementary (1) Of nucleic acid sequences: capable of
forming a perfect base-paired duplex with each other.
(Figure 4-4) (2) Of other interacting molecules, such as an
enzyme and its substrate: having biochemical or structural
features that match up, so that noncovalent bonding is
facilitated. (Figure 2-3)
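
For sense (1), a minimal check of whether two DNA strands can form a perfect duplex is to compare one strand with the reverse complement of the other. The sketch assumes both strands are written 5' to 3' and contain only A, C, G, and T; the function names are illustrative.

```python
# Watson-Crick pairing for DNA: A with T, G with C.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the strand that pairs perfectly with seq, written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def are_complementary(strand_a, strand_b):
    """True if the two strands can form a perfect base-paired duplex."""
    return reverse_complement(strand_a) == strand_b.upper()

print(reverse_complement("ATGC"))
print(are_complementary("ATGC", "GCAT"))
print(are_complementary("ATGC", "GCAA"))
```

The reversal is needed because the two strands of a duplex run antiparallel.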

complementation test Test to determine whether two
mutations that produce similar phenotypes are in the same or
different genes. (Panel 8-2, p. 487)

complex oligosaccharides Broad class of N-linked
oligosaccharides, attached to mammalian glycoproteins in the
endoplasmic reticulum and modified in the Golgi apparatus,
containing N-acetylglucosamine, galactose, sialic acid, and
fucose residues. 

condensin, condensin complex Complex of proteins 
involved in chromosome condensation prior to mitosis. Target 
for M-Cdk. (Figure 17-22) 

conditional mutation Mutation that changes a protein or 
RNA molecule so that its function is altered only under some
conditions, such as at an unusually high or unusually low
temperature.

cone photoreceptor (cone) Photoreceptor cell in the
vertebrate retina that is responsible for color vision in bright 
light. 

confocal microscope Type of light microscope that
produces a clear image of a given plane within a solid object.
It uses a laser beam as a pinpoint source of illumination and
scans across the plane to produce a two-dimensional “optical
section.” (Figure 9-19)

conformation The folded, three-dimensional structure of a 
polypeptide chain. 

connective tissue Any supporting tissue that lies between 
other tissues and consists of cells embedded in a relatively 
large amount of extracellular matrix. Includes bone, cartilage, 
and loose connective tissue. 

connexin Protein component of gap junctions, a four-pass 
transmembrane protein. Six connexins assemble in the plasma 
membrane to form a connexon, or “hemichannel.” (Figure 
19-25) 

connexon Water-filled pore in the plasma membrane formed 
by a ring of six connexin protein subunits. Half of a gap 
junction: connexons from two adjoining cells join to form a 
continuous channel through which ions and small molecules
can pass. (Figure 19-25) 

consensus nucleotide sequence A summary or “average”
of a large number of individual nucleotide sequences derived
by comparing many sequences with the same basic function
and tallying up the most common nucleotides found at each
position. (Figure 6-12) 
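
The tallying step described here can be sketched in a few lines. The input sites below are made-up, pre-aligned sequences of equal length (not data from the text), and ties are broken alphabetically, which is an arbitrary choice for this illustration.

```python
from collections import Counter

def consensus(sequences):
    """Most common character at each position of equal-length sequences."""
    if not sequences:
        return ""
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("all sequences must have the same length")
    result = []
    for column in zip(*sequences):  # one column per aligned position
        counts = Counter(column)
        # Highest count wins; ties go to the alphabetically first base.
        best = min(counts, key=lambda base: (-counts[base], base))
        result.append(best)
    return "".join(result)

# Hypothetical aligned sites used only for illustration.
sites = ["TATAAT", "TATGAT", "TACAAT", "TATAAT", "GATAAT"]
print(consensus(sites))
```

A plain consensus string discards how strongly each position is conserved; keeping the per-position counts preserves that information.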

consensus sequence Average or most typical form of a 
sequence that is reproduced with minor variations in a group 
of related DNA, RNA, or protein sequences. Indicates the 
nucleotide or amino acid most often found at each position.
Preservation of a sequence implies that it is functionally
important. 

conservative site-specific recombination A type of
DNA recombination that takes place between short, specific
sequences of DNA and occurs without the gain or loss of
nucleotides. It does not require extensive homology between
the recombining DNA molecules.

constant region In immunology: region of an immunoglobulin
or T cell receptor chain that has a constant amino acid 
sequence. 

constitutive secretory pathway Pathway present in all cells
by which molecules such as plasma membrane proteins are
continually delivered to the plasma membrane from the Golgi
apparatus in vesicles that fuse with the plasma membrane.
The default route to the plasma membrane if no other sorting
signals are present. (Figure 13-63)

contact-dependent signaling Form of intercellular signaling 
in which signal molecules remain bound to the surface of the
signaling cell and influence only cells that contact it. 

contractile ring Ring containing actin and myosin that forms 
under the surface of animal cells undergoing cell division. 
Contracts to pinch the two daughter cells apart. (Figure 17-42) 

convergent extension Rearrangement of cells within a tissue 
that causes it to extend in one dimension and shrink in another. 
(Figure 21-50) 

COPI-coated vesicles Coated vesicles that transport 
material early in the secretory pathway, budding from Golgi 
compartments.

COPII-coated vesicles Coated vesicles that transport 
material early in the secretory pathway, budding from the 
endoplasmic reticulum.

copy number variations (CNVs) A difference between two
individuals in the same population in the number of copies of
a particular block of DNA sequence. This variation arises from
occasional duplications and deletions of these sequences.

cortex The cytoskeletal network in the cortical region of the
cytosol just beneath the plasma membrane. 

coupled reaction Linked pair of chemical reactions in which
the free energy released by one serves to drive the other.
(Figure 2-29)

covalent bond Stable chemical link between two atoms
produced by sharing one or more pairs of electrons.
(Panel 2-1, pp. 90-91)

CRE-binding (CREB) protein Transcription regulator that 
recognizes the cyclic AMP response element (CRE) in the 
regulatory region of genes activated by cAMP. On activation
by PKA, phosphorylated CREB recruits a transcriptional 
coactivator (CREB-binding protein; CBP) to stimulate 
transcription of target genes. 

CRISPR A defense mechanism in bacteria using small 
noncoding RNA molecules (crRNAs) to seek out and destroy
invading viral genomes through complementary base-pairing 
and targeted nuclease digestion. 

crista (plural cristae) A specialized invagination of the inner 
mitochondrial membrane. 

cross-linking glycan One of a heterogeneous group of
branched polysaccharides that help to cross-link cellulose
microfibrils into a complex network. Has a long linear backbone
of one sugar type (glucose, xylose, or mannose) with short side
chains of other sugars.

cross-presentation A process in which extracellular proteins 
taken up by specialized dendritic cells can give rise to peptides 
that can be presented by class I MHC proteins to cytotoxic 
T cells. 

crRNAs Small noncoding RNAs (~30 nucleotides) that are the
effectors of CRISPR-mediated immunity in bacteria. 

cryoelectron microscopy Technique for examining a thin 
film of an aqueous suspension of biological material that 
has been frozen rapidly enough to create vitreous ice. The 
specimen is then kept frozen and transferred to the electron
microscope. Image contrast is low, but is generated solely by
the macromolecular structures present.

cryptochrome Plant flavoprotein sensitive to blue light.
Structurally related to blue-light-sensitive enzymes called
photolyases (involved in the repair of ultraviolet-induced DNA
damage) but does not have a role in DNA repair. Cryptochromes
are also found in animals, where they have an important role in
circadian clocks.

Cubitus interruptus (Ci) Latent transcription regulator that 
mediates the effects of Hedgehog.

cyclic AMP (cAMP) Nucleotide that is generated from
ATP by adenylyl cyclase in response to various extracellular
signals. It acts as a small intracellular signaling molecule, mainly
by activating cAMP-dependent protein kinase (PKA). It is 
hydrolyzed to AMP by a phosphodiesterase. (Figure 15-25) 

cyclic AMP phosphodiesterase Specific enzyme that
rapidly and continuously destroys cyclic AMP, forming 5'-AMP.
(Figure 15-25)

cyclic AMP-dependent protein kinase (protein kinase A, 
PKA) Enzyme that phosphorylates target proteins in response
to a rise in intracellular cyclic AMP. (Figure 15-26) 

cyclic GMP (cGMP) Nucleotide that is generated from GTP
by guanylyl cyclase in response to various extracellular signals.

cyclic GMP phosphodiesterase Specific enzyme that
rapidly hydrolyzes and degrades cyclic GMP.

cyclin Protein that periodically rises and falls in concentration 
in step with the eukaryotic cell cycle. Cyclins activate crucial 
protein kinases (called cyclin-dependent protein kinases, or 
Cdks) and thereby help control progression from one stage of 
the cell cycle to the next. 

cyclin-dependent kinase (Cdk) Protein kinase that has to 
be complexed with a cyclin protein in order to act. Different 
cyclin-Cdk complexes trigger different steps in the cell-division 
cycle by phosphorylating specific target proteins. (Figure 17-10)

cyclin-Cdk complex Protein complex formed periodically 
during the eukaryotic cell cycle as the level of a particular cyclin 
increases. A cyclin-dependent kinase (Cdk) then becomes 
partially activated. (Figures 17-10 and 17-11, and 
Table 17-1, p. 969) 

cyclosome see anaphase-promoting complex 

cytochrome Colored heme-containing protein that transfers 
electrons during respiration and photosynthesis.

cytochrome c Soluble component of the mitochondrial 
electron-transport chain. Its release into the cytosol from the 
mitochondrial intermembrane space also initiates apoptosis.
(Figure 14-26) 

cytochrome c oxidase complex Third of the three electron- 
driven proton pumps in the respiratory chain. It accepts 
electrons from cytochrome c and generates water using
molecular oxygen as an electron acceptor. (Figure 14-18)

cytochrome c reductase Second of the three electron- 
driven proton pumps in the respiratory chain. Accepts electrons
from ubiquinone and passes them to cytochrome c.
(Figure 14-18)

cytokine Extracellular signal protein or peptide that acts as a 
local mediator in cell-cell communication. 

cytokine receptor Cell-surface receptor that binds a specific
cytokine or hormone and acts through the JAK-STAT signaling 
pathway. (Figure 15-56) 

cytokinesis Division of the cytoplasm of a plant or animal cell 
into two, as distinct from the associated division of its nucleus
(which is mitosis). Part of M phase. (Panel 17-1, pp. 980-981) 

cytoplasm Contents of a cell that are contained within its 
plasma membrane but, in the case of eukaryotic cells, outside 
the nucleus.

cytoplasmic tyrosine kinase Enzyme activated by certain 
cell-surface receptors (tyrosine-kinase-associated receptors) 
that transmits the receptor signal onward by phosphorylating 
target cytoplasmic proteins on tyrosine side chains. 

cytoskeleton System of protein filaments in the cytoplasm of 
a eukaryotic cell that gives the cell shape and the capacity for 
directed movement. Its most abundant components are actin 
filaments, microtubules, and intermediate filaments. 

cytosol Contents of the main compartment of the cytoplasm,
excluding membrane-bounded organelles such as endoplasmic
reticulum and mitochondria.

cytotoxic T cell (TC cell) Type of T cell responsible for killing
host cells infected with a virus or another type of intracellular 
pathogen. (Figure 24-42) 

dark-field microscopy Type of light microscopy in which 
oblique rays of light focused on the specimen do not enter the 
objective lens, but light that is scattered by components in the 
living cell can be collected to produce a bright image on a dark 
background. (Figure 9-7) 

death receptor Transmembrane receptor protein that 
can signal the cell to undergo apoptosis when it binds its 
extracellular ligand. (Figure 18-5) 

death-inducing signaling complex (DISC) Activation 
complex in which initiator caspases interact and are activated 
following binding of extracellular ligands to cell-surface death 
receptors in the extrinsic pathway of apoptosis. 

deep RNA sequencing see RNA-seq 

default pathway The transport pathway of proteins directly 
to the cell surface via the nonselective constitutive secretory 
pathway, entry into which does not require a particular signal. 

defensin Positively charged, amphipathic, antimicrobial 
peptide, secreted by epithelial cells, that binds to and 
disrupts the membranes of many pathogens. 

delayed K+ channel Neuronal voltage-gated K+ channel 
that opens following membrane depolarization but during the 
falling phase of an action potential due to slower activation 
kinetics than Na+ channels; opening permits K+ efflux, driving 
the membrane potential back toward its original negative value, 
ready to transmit a second impulse. 

Delta Single-pass transmembrane signal protein displayed 
on the surface of cells that binds to the Notch receptor protein 
on a neighboring cell, activating a contact-dependent signaling 
mechanism. 


dendrite Extension of a nerve cell, often elaborately 
branched, that receives stimuli from other nerve cells. 

dendritic cell The most potent type of antigen-presenting 
cell, which takes up antigen and processes it for presentation 
to T cells. It is required for activating naïve T cells. 
(Figure 24-11) 

deoxyribonucleic acid (DNA) Polynucleotide formed 
from covalently linked deoxyribonucleotide units. The store 
of hereditary information within a cell and the carrier of this 
information from generation to generation. (Figure 4-3 and 
Panel 2-6, pp. 100-101) 

depolarization Deviation in the electric potential across the 
plasma membrane towards a positive value. A depolarized cell 
has a potential that is positive outside and negative inside. 

desensitization see adaptation 

desmosome Anchoring cell-cell junction, usually formed 
between two epithelial cells. Characterized by dense plaques 
of protein into which intermediate filaments in the two adjoining 
cells insert. (Figure 19-2) 

detergent Small amphiphilic molecule, more soluble in water 
than lipids, that disrupts hydrophobic associations and destroys 
the lipid bilayer, thereby solubilizing membrane proteins. 

D gene segment A short DNA sequence that encodes a part 
of the variable region of an immunoglobulin heavy chain or the 
β chain of a T cell receptor (TCR). 

diacylglycerol (DAG) Lipid produced by the cleavage of 
inositol phospholipids in response to extracellular signals. 
Composed of two fatty acid chains linked to glycerol, it serves 
as a small signaling molecule to help activate protein kinase C 
(PKC). (Figure 15-28) 

dideoxy sequencing The standard enzymatic method of 
DNA sequencing. (Panel 8-1, p. 478) 

differential-interference-contrast microscope Type of 
light microscope that exploits the interference effects that occur 
when light passes through parts of a cell of different refractive 
indices. Used to view unstained living cells. 

differentiation Process by which a cell undergoes a change 
to an overtly specialized cell type. 

diffusion The net drift of molecules through space due to 
random thermal movements. 

Dishevelled Scaffold protein recruited to the Frizzled family of 
cell-surface receptors upon their activation by Wnt binding that 
helps relay the signal to other signaling molecules. 

DNA cloning (1) The act of making many identical copies 
(typically billions) of a DNA molecule: the amplification of a 
particular DNA sequence. (2) Also, the isolation of a particular 
stretch of DNA (often a particular gene) from the rest of the 
cell’s genome. 

DNA helicase Enzyme that is involved in opening the DNA 
helix into its single strands for DNA replication. 

DNA library Collection of cloned DNA molecules, 
representing either an entire genome (genomic library) or 
complementary DNA copies of the mRNA produced by a cell 
(cDNA library). 

DNA ligase Enzyme that joins the ends of two strands of 
DNA together with a covalent bond to make a continuous DNA 
strand. 

DNA methylation Addition of methyl groups to DNA. 
Extensive methylation of the cytosine base in CG sequences 
is used in plants and animals to help keep genes in an inactive 
state. 

DNA microarray A large array of short DNA molecules 
(each of known sequence) bound to a glass microscope 
slide or other suitable support. Used to monitor expression of 
thousands of genes simultaneously: mRNA isolated from test 
cells is converted to cDNA, which in turn is hybridized to the 
microarray. (Figure 8-64) 

DNA polymerase Enzyme that synthesizes DNA by joining 
nucleotides together using a DNA template as a guide. 

DNA primase Enzyme that synthesizes a short strand of RNA 
on a DNA template, producing a primer for DNA synthesis. 
(Figure 5-10) 

DNA repair A set of processes for repairing the many 
accidental lesions that occur continually in DNA. 

DNA replication Process by which a copy of a DNA molecule 
is made. 

DNA supercoiling A conformation with loops or coils that 
DNA adopts in response to superhelical tension; conversely, 
creating various loops or coils in the helix can create such 
tension. 

DNA topoisomerase (topoisomerase) Enzyme that binds 
to DNA and reversibly breaks a phosphodiester bond in one 
or both strands. Topoisomerase I creates transient 
single-strand breaks, allowing the double helix to swivel and 
relieving superhelical tension. Topoisomerase II creates transient 
double-strand breaks, allowing one double helix to pass through 
another and thus resolving tangles. (Figures 5-21 and 5-22) 

DNA tumor virus General term for a variety of different DNA 
viruses that can cause tumors. 

DNA-only transposon Transposable element that exists as 
DNA throughout its life cycle. Many move by cut-and-paste 
transposition. See also transposon. 

dolichol Isoprenoid lipid molecule that anchors the precursor 
oligosaccharide in the endoplasmic reticulum membrane during 
protein glycosylation. 

domain (protein domain) Portion of a protein that has 
a tertiary structure of its own. Larger proteins are generally 
composed of several domains, each connected to the next 
by short flexible regions of polypeptide chain. Homologous 
domains are recognized in many different proteins. 

Dorsal protein Transcription regulator of the NFκB family 
regulating gene expression and involved in establishing the 
dorsoventral axis in the embryo. 

double helix The three-dimensional structure of DNA, in 
which two antiparallel DNA chains, held together by hydrogen 
bonding between the bases, are wound into a helix. 
(Figure 4-5) 

drivers Mutations that are causal factors in the development 
of cancer. 

dynamic instability Sudden conversion from growth to 
shrinkage, and vice versa, in a protein filament such as a 
microtubule or actin filament. (Panel 16-2, pp. 902-903) 

dynamin Cytosolic GTPase that binds to the neck of a 
clathrin-coated vesicle in the process of budding from the 
membrane, and which is involved in completing vesicle 
formation. 


dynein Large motor protein that undergoes ATP-dependent 
movement along microtubules. 

E2F protein Transcription regulatory protein that switches on 
many genes that encode proteins required for entry into the 
S phase of the cell cycle. 

early endosome Common receiving compartment with 
which most endocytic vesicles fuse and where internalized 
cargo is sorted either for return to the plasma membrane or for 
degradation by inclusion in a late endosome. 

ectoderm Embryonic epithelial tissue that is the precursor of 
the epidermis and nervous system. 

edema factor One of the two A subunits of anthrax toxin; an 
adenylyl cyclase that catalyzes production of cAMP, leading to 
ion imbalance and consequent edema in the skin or lung. 

effector Bcl2 family proteins Pro-apoptotic proteins of the 
intrinsic pathway of apoptosis that in response to an apoptotic 
stimulus become activated and aggregate to form oligomers 
in the mitochondrial outer membrane, inducing the release of 
cytochrome c and other intermembrane proteins. Bax and Bak 
are the main effector Bcl2 family proteins in mammalian cells. 

effector cell Cell that carries out the final response or 
function in a particular process. The main effector cells of the 
immune system, for example, are activated lymphocytes and 
phagocytes that help eliminate pathogens. 

egg-polarity genes Genes in the Drosophila egg that define 
the anteroposterior and dorsoventral axes of the future embryo 
through the creation of landmarks (mRNA or protein) in the egg 
that provide signals organizing the developmental process. 

elastic fiber Extensible fiber formed by the protein elastin in 
many animal connective tissues, such as in skin, blood vessels, 
and lungs, which gives them their stretchability and resilience. 

elastin Extracellular protein that forms extensible fibers 
(elastic fibers) in connective tissues. 

electrochemical gradient Combined influence of a 
difference in the concentration of an ion on two sides of a 
membrane and the electrical charge difference across the 
membrane (membrane potential). Ions or charged molecules 
can move passively only down their electrochemical gradient. 
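
As a hedged illustration of how the two contributions combine, the standard textbook expression for the free-energy change of moving an ion into the cell is sketched below. The symbols are assumptions introduced here, not taken from this entry: C_in and C_out are the ion concentrations inside and outside, z is the ion's charge, F is Faraday's constant, and V is the membrane potential measured inside relative to outside.

```latex
% Free-energy change for moving one mole of an ion of charge z into the cell:
% a concentration term plus an electrical term.
\Delta G_{\text{in}} = RT \ln\frac{C_{\text{in}}}{C_{\text{out}}} + zFV

% Passive inward movement requires \Delta G_{\text{in}} < 0. Setting
% \Delta G_{\text{in}} = 0 gives the equilibrium (Nernst) potential:
V_{\text{eq}} = \frac{RT}{zF} \ln\frac{C_{\text{out}}}{C_{\text{in}}}
```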

electron microscope Microscope that uses a beam of 
electrons to create the image. 

electron microscope (EM) tomography Technique 
for viewing three-dimensional specimens in the electron 
microscope in which multiple views are taken from different 
directions by tilting the specimen holder. The views are 
combined computationally to give a three-dimensional image. 

electron-transport chain Series of reactions in which 
electron carrier molecules pass electrons “down the chain” 
from higher to successively lower energy levels. The energy 
released during such electron movement can be used to power 
various processes. Electron-transport chains present in the 
inner mitochondrial membrane (called the respiratory chain) 
and in the thylakoid membrane of chloroplasts generate a 
proton gradient across the membrane that is used to drive ATP 
synthesis. See especially Figures 14-18 and 14-52. 

electrostatic attraction A noncovalent, ionic bond between 
two molecules carrying groups of opposite charge. 
(Panel 2-3, pp. 94-95) 

embryonic stem cells (ES cells) Cells derived from the 
inner cell mass of the early mammalian embryo. Capable of 
giving rise to all the cells in the body. Can be grown in culture, 
genetically modified, and inserted into a blastocyst to develop a 
transgenic animal. 

endocrine cell Specialized animal cell that secretes a 
hormone into the blood. Usually part of a gland, such as the 
thyroid or pituitary gland. 

endocytic vesicle Vesicle formed as material ingested by 
the cell during endocytosis is progressively enclosed by a small 
portion of the plasma membrane, which first invaginates and 
then pinches off to form the vesicle. 

endocytosis Uptake of material into a cell by an invagination 
of the plasma membrane and its internalization in a membrane- 
enclosed vesicle. See also pinocytosis and phagocytosis. 

endoderm Embryonic tissue that is the precursor of the gut 
and associated organs. 

endoplasmic reticulum (ER) Labyrinthine 
membrane-bounded compartment in the cytoplasm of eukaryotic cells, 
where lipids are synthesized and membrane-bound proteins 
and secretory proteins are made. (Figure 12-33) 

endosome maturation Process by which early endosomes 
mature to late endosomes and endolysosomes; in the 
conversion process, the endosome membrane protein 
composition changes, the endosome moves from the cell 
periphery to close to the nucleus, and the endosome ceases 
to recycle material to the plasma membrane and irreversibly 
commits its remaining contents to degradation. 

endothelial cell Flattened cell type that forms a sheet (the 
endothelium) lining all blood and lymphatic vessels. 

entropy (S) Thermodynamic quantity that measures the 
degree of disorder or randomness in a system; the higher the 
entropy, the greater the disorder. (Panel 2-7, pp. 102-103) 

enveloped virus Virus with a capsid surrounded by a lipid 
bilayer membrane (the envelope), which is often derived from 
the host-cell plasma membrane when the virus buds from the 
cell. (Figure 23-12) 

enzyme Protein that catalyzes a specific chemical reaction. 

enzyme-coupled receptor A major type of cell-surface 
receptor that has a cytoplasmic domain that either has 
enzymatic activity or is associated with an intracellular enzyme. 
In either case, the enzymatic activity is stimulated by an 
extracellular ligand binding to the receptor. (Figure 15-6) 

ephrin One of a family of membrane-bound protein ligands 
for the Eph receptor tyrosine kinases (RTKs) that, among many 
other functions, stimulate repulsion or attraction responses that 
guide the migration of cells and nerve cell axons during animal 
development. 

epidermis Epithelial layer covering the outer surface of the 
body. Has different structures in different animal groups. The 
outer layer of plant tissue is also called the epidermis. 

epigenetic inheritance Inheritance of phenotypic changes 
in a cell or organism that do not result from changes in the 
nucleotide sequence of DNA. Can be due to positive feedback 
loops of transcription regulators or to heritable modifications in 
chromatin such as DNA methylation or histone modifications. 
(Figure 7-53) 

epistasis analysis Analysis to discover the order in which the 
genes act, by investigating if a mutation in one gene can mask 
the effect of a mutation in another gene when both mutations 
are present in the same organism or cell. 

epithelial tissues Tissues, such as the lining of the gut or the 
epidermal covering of the skin, in which cells are closely bound 
together into sheets called epithelia. 

epithelium (plural epithelia) Sheet of cells covering the outer 
surface of a structure or lining a cavity. 

equilibrium State in a chemical reaction where there is no net 
change in free energy to drive the reaction in either direction. 
The ratio of product to substrate reaches a constant value at 
chemical equilibrium. (Figure 2-30) 

equilibrium constant (K) The ratio of forward and reverse 
rate constants for a reaction. Equal to the association or affinity 
constant (Ka) for a simple binding reaction (A + B ⇌ AB). See 
also affinity constant, dissociation constant. (See page 62) 
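
The relationships named in this entry can be written out for the simple binding reaction A + B ⇌ AB. This is a hedged sketch using standard notation; the rate-constant symbols k_on and k_off and the dissociation-constant symbol K_d are assumptions introduced here for illustration and do not appear in the entry itself.

```latex
% At equilibrium the forward (association) and reverse (dissociation)
% rates are equal:
k_{\text{on}}[\mathrm{A}][\mathrm{B}] = k_{\text{off}}[\mathrm{AB}]

% so the equilibrium (association) constant is the ratio of rate constants:
K = K_a = \frac{[\mathrm{AB}]}{[\mathrm{A}][\mathrm{B}]}
        = \frac{k_{\text{on}}}{k_{\text{off}}}

% and the dissociation constant is its reciprocal:
K_d = \frac{1}{K_a} = \frac{[\mathrm{A}][\mathrm{B}]}{[\mathrm{AB}]}
```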

ER lumen Space enclosed by the membrane of the 
endoplasmic reticulum (ER). 

ER resident protein Protein that remains in the endoplasmic 
reticulum (ER) or its membranes and carries out its function 
there, as opposed to proteins that are present in the ER only in 
transit. 

ER retention signal Short amino acid sequence on a protein 
that prevents it from moving out of the endoplasmic reticulum 
(ER). Found on proteins that are resident in the ER and function 
there. 

ER signal sequence N-terminal signal sequence that directs 
proteins to enter the endoplasmic reticulum (ER). Cleaved off by 
signal peptidase after entry. 

ER tail-anchored proteins Membrane proteins anchored 
in the endoplasmic reticulum (ER) membrane by a single 
transmembrane α helix contained at their C-terminus. 

erythrocyte Small hemoglobin-containing blood cell of 
vertebrates that transports oxygen to, and carbon dioxide from, 
tissues. Also called a red blood cell. 

erythropoietin A hormone produced by the kidney that 
stimulates the production of red blood cells in bone marrow. 

ESCRT protein complexes Four protein complexes 
(ESCRT-0, ESCRT-1, ESCRT-2, and ESCRT-3) that act 
sequentially to shepherd mono-ubiquitylated membrane 
proteins on endosomal membranes into intralumenal vesicles. 
ESCRT-3 complex catalyzes the pinching-off reaction. 

ethylene Small gas molecule that is a plant growth regulator 
influencing plant development in various ways including 
promoting fruit ripening, leaf abscission, and plant senescence 
and functioning as a stress signal in response to wounding, 
infection, and flooding. 

euchromatin Region of an interphase chromosome that 
stains diffusely; “normal” chromatin, as opposed to the more 
condensed heterochromatin. 

eukaryote Organism composed of one or more cells that 
have a distinct nucleus. Member of one of the three main 
divisions of the living world, the other two being bacteria and 
archaea. (Figure 1-17) 

eukaryotic initiation factor (eIF) Protein that helps load 
initiator tRNA on to the ribosome, thus initiating translation. 

excitatory neurotransmitter Neurotransmitter that opens 
cation channels in the postsynaptic membrane, causing an 
influx of Na+, and in many cases Ca2+, that depolarizes the 
postsynaptic membrane toward the threshold potential for firing 
an action potential. 

executioner caspases Apoptotic caspases that catalyze the 
widespread cleavage events during apoptosis that kill the cell. 

exocytosis Excretion of material from the cell by vesicle 
fusion with the plasma membrane; can occur constitutively or 
be regulated. 

exon Segment of a eukaryotic gene that consists of a 
sequence of nucleotides that will be represented in mRNA or 
in a final transfer, ribosomal, or other mature RNA molecule. 
In protein-coding genes, exons encode the amino acids in 
the protein. An exon is usually adjacent to a noncoding DNA 
segment called an intron. (Figure 4-15) 

exosome Large protein complex with an interior rich in 
3'-to-5' RNA exonucleases; degrades RNA molecules to 
produce ribonucleotides. 

extracellular pathogens Pathogens that disturb host cells 
and can cause serious disease without replicating in host cells. 

extracellular signal molecule Any secreted or cell-surface 
chemical signal that binds to receptors and regulates the 
activity of the cell expressing the receptor. 

extrinsic pathway Pathway of apoptosis triggered by 
extracellular signal proteins binding to cell-surface death 
receptors. 

facultative pathogens Bacteria that replicate in an 
environmental reservoir such as water or soil and only cause 
disease if they happen to encounter a susceptible host. 

FAD/FADH2 (flavin adenine dinucleotide/reduced flavin 
adenine dinucleotide) Electron carrier system that functions 
in the citric acid cycle and fatty acid oxidation. One molecule 
of FAD gains two electrons plus two protons in becoming the 
activated carrier FADH2. (Figure 2-39) 

Fas (Fas protein, Fas death receptor) Transmembrane 
death receptor that initiates apoptosis when it binds its 
extracellular ligand (Fas ligand). (Figure 18-5) 

Fas ligand Ligand that activates the cell-surface death 
receptor, Fas, triggering the extrinsic pathway of apoptosis. 

fat Energy-storage lipid in cells. Composed of triglycerides: 
fatty acids esterified with glycerol. 

fate map Representation showing which cell types will later 
derive from which regions of a tissue, e.g., from the blastula. 
(Figure 21-28) 

Fc receptor One of a family of cell-surface receptors that 
bind the tail region (Fc region) of an antibody molecule. Different 
Fc receptors are specific for different classes of antibodies, 
such as IgG, IgA, or IgE. 

feedback inhibition The process in which a product of a 
reaction feeds back to inhibit a previous reaction in the same 
pathway. (Figures 3-55 and 3-56) 

fermentation Anaerobic energy-yielding metabolic pathway 
involving the oxidation of organic molecules. Anaerobic 
glycolysis refers to the process whereby pyruvate is converted 
into lactate or ethanol, with the conversion of NADH to NAD+. 
(Figure 2-47) 

fibril-associated collagen Mediates the interactions 
of collagen fibrils with one another and with other matrix 
macromolecules to help determine the organization of the fibrils 
in the matrix. This collagen (including types IX and XII) has a 
flexible triple-stranded helical structure and binds to the surface 
of the fibrils rather than forming aggregates. 

fibrillar collagen Class of fibril-forming collagens (including 
type I collagen, the most common type and the principal 
collagen of skin and bone) that have long ropelike structures 
with few or no interruptions and which assemble into collagen 
fibrils. 

fibroblast Common cell type found in connective tissue. 
Secretes an extracellular matrix rich in collagen and other 
extracellular matrix macromolecules. Migrates and proliferates 
readily in wounded tissue and in tissue culture. 

fibronectin Extracellular matrix protein involved in adhesion 
of cells to the matrix and guidance of migrating cells during 
embryogenesis. Integrins on the cell surface are receptors for 
fibronectin. 

filopodium (plural filopodia) (microspike) Thin, spike-like 
protrusion with an actin filament core, generated on the leading 
edge of a crawling animal cell. (Figure 16-21) 

firing rule Important principle governing synapse 
reinforcement and elimination during development of the 
nervous system: when two (or more) neurons synapsing on 
the same target cell fire at the same time, they reinforce their 
connections to that cell; when they fire at different times, they 
compete, so that all but one of them tend to be eliminated. 

flagellum (plural flagella) Long, whiplike protrusion whose 
undulations drive a cell through a fluid medium. Eukaryotic 
flagella are longer versions of cilia. Bacterial flagella are smaller 
and completely different in construction and mechanism of 
action. Compare cilium. 

fluorescence microscope Microscope designed to view 
material stained with fluorescent dyes or proteins. Similar to a 
light microscope but the illuminating light is passed through one 
set of filters before the specimen, to select those wavelengths 
that excite the dye, and through another set of filters before 
it reaches the eye, to select only those wavelengths emitted 
when the dye fluoresces. (Figure 9-12) 

fluorescence recovery after photobleaching (FRAP) 
Technique for monitoring the kinetic parameters of a protein by 
analyzing how fluorescent protein molecules move into an area 
of the cell bleached by a beam of laser light. (Figure 9-29) 

fluorescence resonance energy transfer (FRET) 
Technique for monitoring the closeness of two fluorescently 
labeled molecules (and thus their interaction) in cells. Also 
known as Förster resonance energy transfer. (Figure 9-26) 

focal adhesion kinase (FAK) Cytoplasmic tyrosine kinase 
present at cell-matrix junctions (focal adhesions) in association 
with the cytoplasmic tails of integrins. 

follicular helper T cell (TFH) Type of T cell located in 
lymphoid follicles that secretes various cytokines to stimulate 
B cells to undergo antibody class switching and somatic 
hypermutation. 

formin Dimeric protein that nucleates the growth of straight, 
unbranched actin filaments that can be cross-linked by other 
proteins to form parallel bundles. 

Förster resonance energy transfer see fluorescence 
resonance energy transfer (FRET) 

FRAP see fluorescence recovery after photobleaching 

free energy (G) (Gibbs free energy) The energy that can be 
extracted from a system to drive reactions. Takes into account 
changes in both energy and entropy. (Panel 2-7, pp. 102-103) 

free ribosome Ribosome that is free in the cytosol, 
unattached to any membrane. 

free-energy change (ΔG) see ΔG. 

FRET see fluorescence resonance energy transfer 

Frizzled Family of cell-surface receptors that are seven-pass 
transmembrane proteins that resemble GPCRs in structure 
but do not generally work through the activation of G proteins. 
Activated by Wnt binding to recruit the scaffold protein 
Dishevelled, which helps relay the signal to other signaling 
molecules. 

fungus (plural fungi) Kingdom of eukaryotic organisms that 
includes the yeasts, molds, and mushrooms. Many plant 
diseases and a relatively small number of animal diseases are 
caused by fungi. 

fusion protein Engineered protein that combines two or more 
normally separate polypeptides. Produced from a recombinant 
gene. 

ΔG Change in the free energy during a reaction: the free 
energy of the product molecules minus the free energy of the 
starting molecules. A large negative value of ΔG indicates that 
the reaction has a strong tendency to occur. (Panel 2-7, 
pp. 102-103) 
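
The definition above can be stated compactly, together with its standard link to the equilibrium constant. This is a hedged sketch in conventional notation; the symbols ΔG° (standard free-energy change), R (gas constant), T (absolute temperature), and K (equilibrium constant) are assumptions introduced here, illustrated for a simple reaction A → B.

```latex
% Definition: products minus starting molecules.
\Delta G = G_{\text{products}} - G_{\text{reactants}}

% Dependence on concentrations for the reaction A -> B:
\Delta G = \Delta G^{\circ} + RT \ln\frac{[\mathrm{B}]}{[\mathrm{A}]}

% At equilibrium \Delta G = 0 and [B]/[A] = K, which gives:
\Delta G^{\circ} = -RT \ln K
```

A large negative ΔG therefore corresponds to a reaction that lies far from equilibrium in the direction of product formation.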

G0 State of withdrawal from the eukaryotic cell-division cycle 
by entry into a quiescent digression from the G1 phase. A 
common, sometimes permanent, state for differentiated cells. 

G1 phase Gap 1 phase of the eukaryotic cell-division cycle, 
between the end of mitosis and the start of DNA synthesis. 
(Figure 17-4) 

G1-Cdk Cyclin-Cdk complex formed in vertebrate cells by 
a G1-cyclin and the corresponding cyclin-dependent kinase 
(Cdk). (Table 17-1, p. 969) 

G1-cyclin Cyclin present in the G1 phase of the eukaryotic cell 
cycle. Forms complexes with Cdks that help govern the activity 
of the G1/S-cyclins, which control progression to S phase. 

G1/S-Cdk Cyclin-Cdk complex formed in vertebrate cells by 
a G1/S-cyclin and the corresponding cyclin-dependent kinase 
(Cdk). (Figure 17-11 and Table 17-1, p. 969) 

G1/S-cyclin Cyclin that activates Cdks in late G1 of the 
eukaryotic cell cycle and thereby helps trigger progression 
through Start, resulting in a commitment to cell-cycle entry. Its 
level falls at the start of S phase. (Figure 17-11) 

G2 phase Gap 2 phase of the eukaryotic cell-division cycle, 
between the end of DNA synthesis and the beginning of 
mitosis. (Figure 17-4) 

G2/M transition Point in the eukaryotic cell cycle at which the 
cell checks for completion of DNA replication before triggering 
the early mitotic events that lead to chromosome alignment on 
the spindle. (Figure 17-9) 

ganglioside Any glycolipid having one or more sialic acid 
residues in its structure. Found in the plasma membrane of 
eukaryotic cells and especially abundant in nerve cells. 
(Figure 10-16) 

gap gene In Drosophila development, a gene that is 
expressed in specific broad regions along the anteroposterior 
axis of the early embryo, and which helps designate the main 
divisions of the insect body. (Figure 21-20) 

gap junction Communicating channel-forming cell-cell 
junction present in most animal tissues that allows ions and 
small molecules to pass from the cytoplasm of one cell to the 
cytoplasm of the next. 


gastrulation Important stage in animal embryogenesis 
during which the embryo is transformed from a ball of cells to a 
structure with a gut (a gastrula). 

gated transport Movement of proteins between the cytosol 
and the nucleus through nuclear pore complexes in the nuclear 
envelope that function as selective gates. 

geminin Protein that prevents the formation of new 
prereplicative complexes during S phase and mitosis, thus 
ensuring that the chromosomes are replicated only once in 
each cell cycle. 

gene Region of DNA that is transcribed as a single unit and 
carries information for a discrete hereditary characteristic, 
usually corresponding to (1) a single protein (or set of related 
proteins generated by variant post-transcriptional processing), 
or (2) a single RNA (or set of closely related RNAs). 

gene control region The set of linked DNA sequences 
regulating expression of a particular gene. Includes promoter 
and cis-regulatory sequences required to initiate transcription of 
the gene and control the rate of transcription. (Figure 7-17) 

gene conversion Process by which DNA sequence 
information can be transferred from one DNA helix (which 
remains unchanged) to another DNA helix whose sequence is 
altered. It often accompanies general recombination events. 
(Figure 5-59) 

gene family The set of genes in an organism related in DNA 
sequence due to their derivation from the same ancestor. 

gene segments In immunology: short DNA sequences that 
are joined together during B cell and T cell development to 
produce the coding sequences for immunoglobulins and T cell 
receptors, respectively. (Figure 24-28) 

general transcription factor Any of the proteins whose 
assembly at all promoters of a given type is required for the 
binding and activation of RNA polymerase and the initiation of 
transcription. (Table 6-3, p. 311) 

genetic code The set of rules specifying the correspondence 
between nucleotide triplets (codons) in DNA or RNA and amino 
acids in proteins. (Figure 6-48) 

genetic instability Abnormally increased spontaneous 
mutation rate, such as occurs in cancer cells. 

genetic screen Procedure for discovery of genes affecting a 
specific phenotype by surveying large numbers of mutagenized 
individuals. 

genetics The study of the genes of an organism on the basis 
of heredity and variation. 

genome The totality of genetic information belonging to a 
cell or an organism; in particular, the DNA that carries this 
information. 

genome annotation Process attempting to mark out all 
the genes (protein-coding and noncoding) in a genome and 
ascribing functions to each. 

genomic imprinting Phenomenon in which a gene is either 
expressed or not expressed in the offspring depending on 
which parent it is inherited from. (Figure 7-49) 

genomic library Collection of cloned DNA molecules 
representing an entire genome. 

genotype Genetic constitution of an individual cell or 
organism. The particular combination of alleles found in a 
specific individual. (Panel 8-2, p. 486) 





germ cell A cell in the germ line of an organism, which 
includes the haploid gametes and their specified diploid 
precursor cells. Germ cells contribute to the formation of a new 
generation of organisms and are distinct from somatic cells, 
which form the body and leave no descendants. 

germ layer One of the three primary tissue layers (endoderm, 
mesoderm, and ectoderm) of an animal embryo. (Figure 21-3) 

glial cell Supporting non-neural cell of the nervous system. 
Includes oligodendrocytes and astrocytes in the vertebrate 
central nervous system and Schwann cells in the peripheral 
nervous system. 

glycogen Polysaccharide composed exclusively of glucose 
units. Used to store energy in animal cells. Large granules of 
glycogen are especially abundant in liver and muscle cells. 
(Figure 2-51 and Panel 2-4, pp. 96-97) 

glycolipid Lipid molecule with a sugar residue or 
oligosaccharide attached. (Panel 2-5, pp. 98-99) 

glycolysis Ubiquitous metabolic pathway in the cytosol in 
which sugars are incompletely degraded with production of 
ATP. Literally, “sugar splitting.” (Figure 2-46 and Panel 2-8, 
pp. 104-105) 

glycoprotein Any protein with one or more saccharide or 
oligosaccharide chains covalently linked to amino acid side 
chains. Most secreted proteins and most proteins exposed on 
the outer surface of the plasma membrane are glycoproteins. 

glycosaminoglycan (GAG) Long, linear, highly charged 
polysaccharide composed of a repeating pair of sugars, one of 
which is always an amino sugar. Mainly found covalently linked 
to a protein core in extracellular matrix proteoglycans. Examples 
include chondroitin sulfate, hyaluronan, and heparin. (Figure 
19-32) 

glycosylphosphatidylinositol anchor (GPI anchor) Lipid 
linkage by which some membrane proteins are bound to the 
membrane. The protein is joined, via an oligosaccharide linker, 
to a phosphatidylinositol anchor during its travel through the 
endoplasmic reticulum. (Figure 12-52) 

Golgi apparatus (Golgi complex) Complex organelle in 
eukaryotic cells, centered on a stack of flattened, membrane- 
enclosed spaces, in which proteins and lipids transferred from 
the endoplasmic reticulum are modified and sorted. It is the 
site of synthesis of many cell wall polysaccharides in plants and 
extracellular matrix glycosaminoglycans in animal cells. (Figure 
13-26) 

GPCR kinase (GRK) Member of a family of enzymes that 
phosphorylates multiple serines and threonines on a GPCR to
produce receptor desensitization. (Figure 15-42) 

G protein (trimeric GTP-binding protein) A trimeric GTP- 
binding protein with intrinsic GTPase activity that couples 
GPCRs to enzymes or ion channels in the plasma membrane. 
(Table 15-3, p. 846) 

G-protein-coupled receptor (GPCR) A seven-pass cell- 
surface receptor that, when activated by its extracellular ligand, 
activates a G protein, which in turn activates either an enzyme
or ion channel in the plasma membrane. (Figures 15-6 and 
15-21) 

Gq Class of G protein that couples GPCRs to phospholipase
C-β to activate the inositol phospholipid signaling pathway.

Gram negative Description for bacteria that do not stain
with Gram stain as a result of having a thinner peptidoglycan
cell wall outside their inner (plasma) membrane, and an
additional outer membrane.

Gram positive Description for bacteria that stain positive 
with Gram stain due to a thick layer of peptidoglycan cell wall 
outside their inner (plasma) membrane. 

Gram staining A technique for classifying bacteria based on 
differences in the structure of the bacterial cell wall and outer
surface. 

granulocyte Category of white blood cell distinguished 
by conspicuous cytoplasmic granules. Includes neutrophils, 
basophils, and eosinophils. Arises from a granulocyte/ 
macrophage (GM) progenitor cell. (Figure 22-27) 

granulocyte/macrophage (GM) progenitor cell Committed 
progenitor cell in the bone marrow that gives rise to neutrophils 
and macrophages. (Figure 22-31) 

green fluorescent protein (GFP) Fluorescent protein 
isolated from a jellyfish. Widely used as a marker in cell biology. 
(Figure 9-24) 

growth cone Migrating motile tip of a growing nerve cell axon
or dendrite. (Figure 21-72) 

growth factor Extracellular signal protein that can stimulate a
cell to grow. Growth factors often have other functions as well, including
stimulating cells to survive or proliferate. Examples include 
epidermal growth factor (EGF) and platelet-derived growth 
factor (PDGF). 

growth hormone (GH) Mammalian hormone secreted by 
the pituitary gland into the bloodstream that stimulates growth
throughout the body. 

GTP (guanosine 5′-triphosphate) Nucleoside triphosphate
produced by the phosphorylation of GDP (guanosine 
diphosphate). Like ATP, it releases a large amount of free 
energy on hydrolysis of its terminal phosphate group. Has a 
special role in microtubule assembly, protein synthesis, and cell
signaling. (Figure 2-58) 

GTP-binding protein Also called GTPase; an enzyme that 
converts GTP to GDP. 

GTPase An enzyme that converts GTP to GDP. GTPases 
fall into two large families. Large trimeric G proteins are
composed of three different subunits and mainly couple GPCRs 
to enzymes or ion channels in the plasma membrane. Small 
monomeric GTP-binding proteins (also called monomeric 
GTPases) consist of a single subunit and help relay signals
from many types of cell-surface receptors and have roles in
intracellular signaling pathways, regulating intracellular vesicle 
trafficking, and signaling to the cytoskeleton. Both trimeric 
G proteins and monomeric GTPases cycle between an active 
GTP-bound form and an inactive GDP-bound form and 
frequently act as molecular switches in intracellular signaling 
pathways. See page 820. 

GTPase-activating protein (GAP) Protein that binds to 
a GTPase and inhibits it by stimulating its GTPase activity, 
causing the enzyme to hydrolyze its bound GTP to GDP.
(Figure 15-8)

guanine nucleotide exchange factor (GEF) Protein that
binds to a GTPase and activates it by stimulating it to release its
tightly bound GDP, thereby allowing it to bind GTP in its place. 
(Figure 15-8) 

haplotype block Combination of alleles and DNA markers 
that has been inherited in a large, linked block on one 
chromosome of a homologous pair, undisturbed by genetic
recombination, across many generations.

Hedgehog protein Secreted extracellular signal molecule
that has many different roles controlling cell differentiation
and gene expression in animal embryos and adult tissues.
Excessive Hedgehog signaling can lead to cancer. 

helper T cell (Th cell) Type of T cell that helps activate B cells 
to make antibodies, cytotoxic T cells to become effector cells, 
and macrophages to kill ingested pathogens. They can also 
help activate dendritic cells. 

heterochromatin Chromatin that is highly condensed even 
in interphase; generally transcriptionally inactive. (Compare with
euchromatin.)

heterochronic Describes genes involved in developmental 
timing; mutation results in cells of a specific fate behaving as
cells at a different stage of development. 

high-mannose oligosaccharides Broad class of N-linked
oligosaccharides, attached to mammalian glycoproteins in the
endoplasmic reticulum, containing two N-acetylglucosamine
residues and many mannose residues. 

high-performance liquid chromatography (HPLC) Type of 
chromatography that uses columns packed with tiny beads of 
matrix; the solution to be separated is pushed through under 
high pressure. 

histone One of a group of small abundant proteins, rich in
arginine and lysine, that combine to form the nucleosome cores
around which DNA is wrapped in eukaryotic chromosomes. 
(Figure 4-24) 

histone chaperone (chromatin assembly factor) Protein
that binds free histones, releasing them once they have been
incorporated into newly replicated chromatin. (Figure 4-27) 

histone H1 “Linker” (as opposed to “core”) histone protein 
that binds to DNA where it exits from a nucleosome and helps
package nucleosomes into the 30-nm chromatin fiber.
(Figure 4-30)

Holliday junction (cross-strand exchange) X-shaped 
structure observed in DNA undergoing recombination, in 
which the two DNA molecules are held together at the site of
crossing-over, also called a cross-strand exchange.
(Figure 5-55)

homeotic selector gene In Drosophila development, a 
gene that defines and preserves the differences between body
segments. 

homolog One of two or more genes that are similar in 
sequence as a result of derivation from the same ancestral
gene. The term covers both orthologs and paralogs.
(Figure 1-21) See homologous chromosomes.

homologous Genes, proteins, or body structures that are 
similar as a result of a shared evolutionary origin.

homologous chromosomes (homologs) The maternal and
paternal copies of a particular chromosome in a diploid cell. 

homologous recombination (general recombination)
Genetic exchange between a pair of identical or very similar
DNA sequences, typically those located on two copies of the 
same chromosome. Also a DNA repair mechanism for double- 
strand breaks. (Figures 5-48, 5-50, and 5-54) 

homophilic Binding between molecules of the same kind,
especially those involved in cell-cell adhesion. (Figure 19-5)

horizontal gene transfer Gene transfer between bacteria via 
natural transformation by released naked DNA, transduction by 
bacteriophages, or sexual exchange by conjugation.


hormone Signal molecule secreted by an endocrine cell into
the bloodstream, which can then carry the signal to distant 
target cells. 

Hox complex A gene complex consisting of a series of Hox
genes.

Hox genes Genes coding for transcription regulators, 
each gene containing a homeodomain, and specifying body- 
region differences. Hox mutations typically cause homeotic
transformations. 

Hox proteins Transcription regulator proteins encoded by 
Hox genes; possess a highly conserved, 60-amino-acid-long 
DNA-binding homeodomain. 

HPV Human papillomavirus; infects the cervical epithelium 
and is important as a cause of carcinoma of the uterine cervix. 

hyaluronan (hyaluronic acid) Type of nonsulfated 
glycosaminoglycan with a regular repeating sequence of up to 
25,000 identical disaccharide units, not linked to a core protein. 
Found in the fluid lubricating joints and in many other tissues. 
(Figures 19-33 and 19-34) 

hybridization In molecular biology, the process whereby two 
complementary nucleic acid strands form a base-paired duplex 
DNA-DNA, DNA-RNA, or RNA-RNA molecule. Forms the
basis of a powerful technique for detecting specific nucleotide
sequences. (Figures 5-47 and 8-33) 

hybridoma Hybrid cell line generated by fusion of a tumor cell 
and another cell type. Monoclonal antibodies are produced by 
hybridoma lines obtained by fusing antibody-secreting B cells 
with cells of a B lymphocyte tumor. (Figure 8-4) 

hydrogen bond Noncovalent bond in which an 
electropositive hydrogen atom is partially shared by two 
electronegative atoms. (Panel 2-3, pp. 94-95) 

hydronium ion (H₃O⁺) Water molecule associated with
an additional proton. The form generally taken by protons in 
aqueous solution. 

hydrophilic Dissolving readily in water. Literally, “water loving.” 

hydrophobic (lipophilic) Not dissolving readily in water.
Literally, “water-fearing.” 

hydrophobic force Force exerted by the hydrogen-bonded
network of water molecules that brings two nonpolar
surfaces together by excluding water between them.
(Panel 2-3, pp. 94-95)

hypervariable region In immunology: any of the three small
parts of the variable region of an immunoglobulin or T cell
receptor chain that show the highest variability from molecule
to molecule and contribute to the antigen-binding site.
(Figure 24-26)

hypoxia-inducible factor 1α (HIF1α) Transcription
regulator, the intracellular levels of which increase in response
to a shortage of oxygen, that stimulates transcription of the
VEGF gene to promote angiogenesis. 

IκB Inhibitory proteins that bind tightly to NFκB dimers
and hold them in an inactive state within the cytoplasm of
unstimulated cells. 

Ig superfamily Large and diverse family of proteins that 
contain immunoglobulin or immunoglobulin-like domains. Most 
are involved in cell-cell interactions or antigen recognition.
(Figure 24-48) 

IgA Immunoglobulin A; the principal class of antibody in 
secretions, including saliva, tears, milk, and respiratory and
intestinal secretions.

IgD Immunoglobulin D; produced by immature naïve B cells
after leaving the bone marrow. Transmembrane IgD and IgM 
proteins, with the same antigen-binding site, form the B cell 
receptors (BCRs) on these cells. 

IgE Immunoglobulin E; binds with high affinity via its tail region
to a class of Fc receptors on the surface of mast cells (tissues)
or basophils (blood), where it acts as an antigen receptor;
antigen binding stimulates the secretion of cytokines and
biologically active amines, which help attract white blood cells,
antibodies, and complement proteins to the site of activation.

IgG Immunoglobulin G; the major antibody class in the blood, 
produced in especially large quantities during secondary 
antibody responses. The tail region of some IgG subclasses
can bind to specific Fc receptors on macrophages and
neutrophils. Antigen-IgG complexes can activate complement.

IgM Immunoglobulin M; the first class of immunoglobulin that 
a developing B cell in the bone marrow makes, forming B-cell 
receptors on its surface. IgM antibodies are the major class of 
antibody secreted into the blood in the early stages of a primary 
antibody response on first exposure to an antigen, where their 
pentameric structure (with 10 antigen-binding sites) allows 
strong binding to pathogens. When bound to antigen, it is
highly efficient at activating complement.

iHog Protein with four or five immunoglobulin-like domains 
and two or three fibronectin-type-III-like domains; located
on the cell surface and thought to serve as co-receptors for 
Hedgehog proteins. 

image processing Computer-based techniques in
microscopy that process digital images in order to extract latent
information. Enables compensation for some optical faults in
microscopes, enhanced contrast to improve detection of small
differences in light intensity, and subtraction of background
irregularities in the optical system.

imaginal disc Group of cells that are set aside, apparently
undifferentiated, in the Drosophila embryo and which will
develop into an adult structure, e.g., eye, leg, wing. Overt
differentiation occurs at metamorphosis. (Figure 21-60)

immunization Method of inducing adaptive immune 
responses to pathogens or foreign molecules, usually involving
the co-injection of an adjuvant, a molecule (often of microbial
origin) that helps activate innate immune responses required for 
the adaptive responses. 

immunoblotting See Western blotting.

immunoglobulin (Ig) superfamily Large and diverse 
family of proteins that contain immunoglobulin domains or 
immunoglobulin-like domains. Most are involved in cell-cell
interactions or antigen recognition. (Figure 24-48)

immunoglobulin domain (Ig domain) Characteristic 
protein domain of about 100 amino acids that is found in 
immunoglobulin light and heavy chains. Similar domains, 
known as immunoglobulin-like (Ig-like) domains, are present in
many other proteins, which, together with Igs, constitute the 
Ig superfamily. (Figure 24-27) 

immunogold electron microscopy Method to localize
specific macromolecules using a primary antibody that binds to
the molecule of interest and is then detected with a secondary
antibody to which a colloidal gold particle has been attached.
The gold particle is electron-dense and can be seen as a black
dot in the electron microscope. (Figure 9-45)

immunological memory Long-lived property of the adaptive 
immune system that follows a primary immune response to
many antigens, such that a subsequent encounter with the
same antigen will provoke a more rapid and stronger secondary
immune response. (Figure 24-16)

immunological self-tolerance The lack of response of 
the adaptive immune system to an antigen. Tolerance to self
molecules is crucial to avoid autoimmune diseases. (Figure
24-21) 

immunological synapse The highly organized interface that 
develops between a T cell and an antigen-presenting cell (APC) 
or target cell it is in contact with, formed by T-cell receptors 
binding to antigen-MHC complexes on the APC and cell- 
adhesion proteins binding to their counterparts on the APCs. 

induced fit A principle for increasing the specificity of 
substrate recognition by proteins and RNAs. In protein
synthesis, a ribosome or enzyme folds around a
codon-anticodon interaction and only when the match is correct is the
subsequent reaction allowed to proceed.

induced pluripotent stem cells (iPS cells) Cells that 
are induced by artificial expression of specific transcription
regulators to look and behave like the pluripotent embryonic 
stem cells that are derived from embryos. 

induced regulatory T cell A regulatory T cell (Treg cell) that
develops from naive helper T cells when they are activated in
the presence of TGFβ in the absence of IL6.

inflammasome Intracellular protein complex formed after 
activation of cytoplasmic NOD-like receptors with adaptor 
proteins. It contains a caspase enzyme that cleaves pro- 
inflammatory cytokines from their precursor proteins. 

inflammatory response Local response of a tissue to injury 
or infection—characterized clinically by redness, swelling, heat, 
and pain. Caused by invasion of white blood cells, which are 
attracted by and secrete various cytokines. 

inhibitors of apoptosis (IAPs) Intracellular protein inhibitors
of apoptosis. 

inhibitory G protein (Gi) Trimeric G protein that can regulate
ion channels and inhibit the enzyme adenylyl cyclase in the
plasma membrane. See also G protein. (Table 15-3, p. 846)

inhibitory neurotransmitter Neurotransmitter that opens 
transmitter-gated Cl⁻ or K⁺ channels in the postsynaptic
membrane of a nerve or muscle cell and thus tends to inhibit
the generation of an action potential.

initial segment Specialized membrane region at the base of
a nerve axon (adjacent to the cell body) that is rich in
voltage-gated Na⁺ channels plus other classes of ion channels that all
contribute to the encoding of membrane depolarization into 
action potential frequency. 

initiator caspases Apoptotic caspases that begin the 
apoptotic process, activating the executioner caspases. 

initiator tRNA Special tRNA that initiates translation. It always
carries the amino acid methionine, forming the complex
Met-tRNAi. (Figure 6-70)

innate immune response An early immune response in all
organisms to a pathogen, which includes the production of
antimicrobial molecules and the activation of phagocytic cells.
Such a response is not specific for the pathogen, in contrast to
an adaptive immune response. 

inner membrane Mitochondrial membrane that encloses the 
matrix space and forms extensive invaginations called cristae. 

inner mitochondrial membrane Mitochondrial membrane 
that encloses the matrix space and forms extensive 
invaginations called cristae.

inner nuclear membrane One of two concentric membranes
comprising the nuclear envelope; continuous with the outer
nuclear membrane; contains specific proteins as anchoring
sites for chromatin and the nuclear lamina. 

inositol 1,4,5-trisphosphate (IP₃) Small intracellular signaling
molecule produced during activation of the inositol phospholipid
signaling pathway. Acts to release Ca²⁺ from the endoplasmic
reticulum. (Figures 15-28 and 15-29)

inositol phospholipid signaling pathway Intracellular 
signaling pathway that starts with the activation of 
phospholipase C and the generation of IP₃ and diacylglycerol
(DAG) from inositol phospholipids in the plasma membrane.
The DAG helps to activate protein kinase C. (Figures 15-28 and
15-29)

integrin Transmembrane adhesion protein that is involved in 
the attachment of cells to the extracellular matrix and to each 
other. (Figure 19-3 and Table 19-1, p. 1037) 

interaction domain Compact protein module, found in 
many intracellular signaling proteins, that binds to a particular 
structural motif (e.g., a short peptide sequence, a covalent 
modification, or another protein domain) in another protein or 
lipid. 

interferon-α (IFNα) and interferon-β (IFNβ) Cytokines
(type I interferons) produced by mammalian cells as a general 
response to a viral infection. 

intermembrane space Compartment of the mitochondrion
between the outer and inner mitochondrial membranes.

internal ribosome entry site (IRES) Specific site in a
eukaryotic mRNA, other than at the 5' end, at which translation 
can be initiated. (Figure 7-68) 

interphase Long period of the cell cycle between one mitosis 
and the next. Includes G₁ phase, S phase, and G₂ phase.
(Figure 17-4) 

interpolar microtubule In the mitotic or meiotic spindle, a 
microtubule interdigitating at the equator with the microtubules 
emanating from the other pole. (Figure 17-23)

intracellular pathogens Pathogens, including all viruses and
many bacteria and protozoa, that enter and replicate inside 
host cells to cause disease. 

intrinsic pathway (mitochondrial pathway) Pathway of 
apoptosis activated from inside the cell in response to stress or 
developmental signals; depends on the release into the cytosol
of mitochondrial proteins normally resident in the mitochondrial
intermembrane space. 

intron Noncoding region of a eukaryotic gene that is
transcribed into an RNA molecule but is then excised by RNA
splicing during production of the mRNA or other functional 
RNA. (Figure 4-15) 

invadopodia Actin-rich protrusions extending in three
dimensions that are important for cells to cross tissue barriers
by degrading the extracellular matrix. 

ion channel Transmembrane protein complex that forms 
a water-filled channel across the lipid bilayer through which 
specific inorganic ions can diffuse down their electrochemical
gradients. (Figure 11-22) 

ion-channel-coupled receptor (transmitter-gated ion 
channel, ionotropic receptor) Ion channel found at 
chemical synapses in the postsynaptic plasma membranes of
nerve and muscle cells. Opens only in response to the binding
of a specific extracellular neurotransmitter. The resulting inflow
of ions leads to the generation of a local electrical signal in the
postsynaptic cell. (Figures 15-6 and 11-35)

ion-sensitive indicators Molecules whose light emission
reflects the local concentration of a particular ion; some are 
luminescent (emitting light spontaneously) while others are 
fluorescent (emitting light on exposure to light). 

IP₃-gated Ca²⁺-release channel (IP₃ receptor) Gated Ca²⁺
channel in the ER membrane that opens on binding cytosolic
IP₃, releasing stored Ca²⁺ into the cytosol. (Figure 15-29)

iron-sulfur cluster Electron-transporting group consisting 
of either two or four iron atoms bound to an equal number of 
sulfur atoms, found in a class of electron-transport proteins. 
(Figure 14-16) 

J gene segment Short DNA sequence that encodes part
of the variable region of light and heavy immunoglobulin chains
and of α and β chains of T cell receptors. (Figures 24-28 and
24-29) 

JAK-STAT signaling pathway Signaling pathway activated 
by cytokines and some hormones, providing a rapid route 
from the plasma membrane to the nucleus to alter gene
transcription. Involves cytoplasmic Janus kinases (JAKs), and 
signal transducers and activators of transcription (STATs). 

Janus kinases (JAKs) Cytoplasmic tyrosine kinases 
associated with cytokine receptors, which phosphorylate and 
activate transcription regulators called STATs. 

junctional diversification The random loss and gain of 
nucleotides at joining sites during V(D)J recombination that
occurs during B and T cell development when the cells are 
assembling the gene segments that encode their antigen 
receptors. It enormously increases the diversity of V-region 
coding sequences. 

K⁺ leak channel K⁺-transporting ion channel in the plasma
membrane of animal cells that remains open even in a “resting” 
cell. 

karyotype Display of the full set of chromosomes of a cell, 
arranged with respect to size, shape, and number.

keratin Type of intermediate filament, commonly produced by 
epithelial cells.

kinase cascade Intracellular signaling pathway in which one 
protein kinase, activated by phosphorylation, phosphorylates
the next protein kinase in the sequence, and so on, relaying the 
signal onward. 

kinesin Member of one of the two main classes of motor 
proteins that use the energy of ATP hydrolysis to move along 
microtubules. (Figure 16-56) 

kinesin-1 Motor protein associated with microtubules that 
transports cargo within the cell; also called “conventional 
kinesin.” 

kinetic proofreading A principle for increasing the specificity 
of catalysis. In the synthesis of DNA, RNA, and proteins, it 
refers to a time delay that begins with an irreversible step (such
as ATP or GTP hydrolysis) and during which incorrect base 
pairs are more likely to dissociate than correct pairs. 

kinetochore Large protein complex that connects the 
centromere of a chromosome to microtubules of the mitotic 
spindle. (Figure 17-30) 

kinetochore microtubule In the mitotic or meiotic spindle, a 
microtubule that connects the spindle pole to the kinetochore
of a chromosome.

lagging strand One of the two newly synthesized strands of
DNA found at a replication fork. The lagging strand is made in
discontinuous lengths that are later joined covalently.
(Figure 5-7)

lamellipodium (plural lamellipodia) Flattened, sheetlike 
protrusion supported by a meshwork of actin filaments, which 
is extended at the leading edge of a crawling animal cell. 
(Figures 16-77 and 16-79) 

laminin Extracellular matrix fibrous protein found in basal 
laminae, where it forms a sheetlike network. (Figures 19-52 and 
19-53) 

lampbrush chromosome Huge chromosome paired in 
preparation for meiosis, found in immature amphibian eggs;
consisting of large loops of chromatin extending out from a 
linear central axis. (Figure 4-47) 

late endosome Compartment formed from a bulbous,
vacuolar portion of early endosomes by a process called
endosome maturation; late endosomes fuse with one another
and with lysosomes to form endolysosomes that degrade their
contents. 

LDL-receptor-related protein (LRP) Co-receptor bound by 
Wnt proteins in the regulation of β-catenin proteolysis.

leading strand One of the two newly synthesized strands of 
DNA found at a replication fork. The leading strand is made by
continuous synthesis in the 5'-to-3' direction. (Figure 5-7) 

lectin Protein that binds tightly to a specific sugar. Abundant
lectins from plant seeds are used as affinity reagents to purify 
glycoproteins or to detect them on the surface of cells. 

Legionnaire’s disease Type of pneumonia resulting from 
infection with Legionella pneumophila, a parasite of freshwater
amoebae that is spread to humans by air-conditioning systems
that harbor infected amoebae and produce microdroplets of 
water that are easily inhaled. 

lethal factor One of the two A subunits of anthrax toxin; a
protease that cleaves several activated members of the MAP 
kinase kinase family and causes a large fall in blood pressure 
and death on entry into the bloodstream of an animal. 

leucine-rich repeat (LRR) receptor kinases Common 
type of receptor serine/threonine kinase in plants that contains 
a tandem array of leucine-rich repeat sequences in its 
extracellular portion. 

leukemia Cancer of white blood cells. 

leukocyte General name for all the nucleated blood cells
lacking hemoglobin. Also called white blood cells. Includes 
lymphocytes, granulocytes, and monocytes. (Figure 22-27) 

ligand Any molecule that binds to a specific site on a protein
or other molecule. From Latin ligare, to bind.

light microscope One of a class of microscopes that uses 
visible light to create the image. 

lignin Network of cross-linked phenolic compounds that 
forms a supporting network throughout the cell walls of xylem 
and woody tissue in plants. 

limit of resolution In microscopy, the smallest distance 
apart at which two point objects can be resolved as separate. 
Just under 0.2 µm for conventional light microscopy, a limit
determined by the wavelength of light. 

linkage In ligand binding, the conformational coupling 
between two separate ligand-binding sites on a protein, such 
that a conformational change in the protein induced by binding 
of one ligand affects the binding of a second ligand. 


lipid bilayer (phospholipid bilayer) Thin double sheet of 
phospholipid molecules that forms the core structure of all cell
membranes. The two layers of lipid molecules are packed with
their hydrophobic tails pointing inward and their hydrophilic
heads outward, exposed to water. (Figure 10-1 and 
Panel 2-5, pp. 98-99) 

lipid droplets Storage form in cells for excess lipids; 
comprised of a single monolayer of phospholipids and proteins 
that surrounds neutral lipids that can be retrieved from droplets 
as required by the cell. 

lipid raft Small region of a membrane enriched in
sphingolipids and cholesterol. (Figure 10-13)

liposome Artificial phospholipid bilayer vesicle formed from an 
aqueous suspension of phospholipid molecules. (Figure 10-9)

local mediator Extracellular signal molecule that acts on
neighboring cells. 

long noncoding RNA (lncRNA) One of a large group
(~8000 in humans) of RNAs longer than 200 nucleotides and
not coding for protein. The functions, if any, of most lncRNAs
are unknown, but individual lncRNAs are known to play important
roles in the cell, for example, in telomerase function and
genomic imprinting. In a general sense, lncRNAs are believed to
act as scaffolds, holding together proteins and nucleic acids to 
speed up a wide variety of reactions in the cell. 

long-term depression (LTD) A long-lasting (hours or more)
decrease in the sensitivity of certain synapses in the brain 
triggered by NMDA receptor activation. As the opposing 
process to long-term potentiation, it is thought to be involved in 
learning and memory. 

long-term potentiation (LTP) Long-lasting increase (days 
to weeks) in the sensitivity of certain synapses in the brain, 
induced by a short burst of repetitive firing in the presynaptic
neurons. (Figure 11-44) 

loss of heterozygosity The result of errant homologous
recombination that uses the homolog from the other parent 
instead of the sister chromatid as the template, converting the 
sequence of the repaired DNA to that of the other homolog. 

low-density lipoprotein (LDL) Large complex composed 
of a single protein molecule and many esterified cholesterol
molecules, together with other lipids. The form in which
cholesterol is transported in the blood and taken up into cells.
(Figure 13-51) 

lumen The space inside a hollow structure. In cells: the cavity 
enclosed by an organelle membrane. In tissues: the cavity 
enclosed by a sheet of cells. 

lymphocyte White blood cell responsible for the specificity 
of adaptive immune responses. Two main types: B cells, which 
produce antibody, and T cells, which interact directly with 
other effector cells of the immune system and with infected
cells. T cells develop in the thymus and are responsible for 
cell-mediated immunity. B cells develop in the bone marrow in 
mammals and are responsible for the production of circulating 
antibodies. 

lymphoid organ An organ containing large numbers of 
lymphocytes. Lymphocytes are produced in primary lymphoid 
organs and respond to antigen in peripheral lymphoid organs. 
(Figure 24-12) 

lymphoma Cancer of lymphocytes, in which the cancer cells 
are mainly found in lymphoid organs (rather than in the blood, 
as in leukemias).

lysosomal storage diseases Genetic diseases resulting
from defects in or a lack of one or more functional hydrolases in
lysosomes of some cells, leading to accumulation of undigested
substrates in lysosomes and consequent cell pathology.

lysosome Membrane-enclosed organelle in eukaryotic cells 
containing digestive enzymes, which are typically most active at 
the acid pH found in the lumen of lysosomes. (Figure 13-37) 

lysozyme Enzyme that catalyzes the cutting of 
polysaccharide chains in the cell walls of bacteria. 

M-Cdk (M-phase Cdk) Cyclin-Cdk complex formed in 
vertebrate cells by an M-cyclin and the corresponding cyclin- 
dependent kinase (Cdk). (Figure 17-11 and Table 17-1, p. 969) 

M-cyclin A cyclin found in all eukaryotic cells that promotes
the events of mitosis. (Figure 17-11) 

M6P receptor proteins Transmembrane receptor proteins 
present in the trans Golgi network that recognize the mannose
6-phosphate (M6P) groups added exclusively to lysosomal
enzymes, marking the enzymes for packaging and delivery to 
early endosomes. 

macromolecule Polymers constructed of long chains of
covalently linked, small organic (carbon-containing) molecules.
The principal building blocks from which a cell is constructed
and the components that confer the most distinctive properties
of living things. 

macrophage Phagocytic cell derived from blood monocytes, 
resident in most tissues but able to roam. It has both scavenger
and antigen-presenting functions in immune responses. 

macropinocytosis Clathrin-independent, dedicated 
degradative endocytic pathway induced in most cell types by 
cell-surface receptor activation by specific cargoes.

malaria Protozoal disease caused by four species of 
Plasmodium, which are transmitted to humans by the bite of 
the female Anopheles mosquito. 

malignant Of tumors and tumor cells: invasive and/or 
able to undergo metastasis. A malignant tumor is a cancer. 
(Figure 20-3) 

MAP kinase module (mitogen-activated protein kinase
module) An intracellular signaling module composed of
three protein kinases, acting in sequence, with MAP kinase as
the third. Typically activated by a Ras protein in response to
extracellular signals. (Figure 15-49)

master transcription regulator A transcription regulator 
specifically required for formation of a particular cell type. 
Artificial expression of master transcription regulators (alone or 
in combination with others) will often convert one cell type into 
another. 

maternal inheritance A form of inheritance observed
when following mitochondria in animals and plants, where
mitochondrial DNA is inherited only through the female
germ line.

maternal-effect gene Gene that acts in the mother to
specify maternal mRNAs and proteins in the egg. Maternal-
effect mutations affect the development of the embryo even if
the embryo itself has not inherited the mutated gene.

maternal-zygotic transition (MZT) Event in animal
development where the embryo’s own genome largely takes
over control of development from maternally deposited
macromolecules.

matrix metalloprotease Ca2+- or Zn2+-dependent
proteolytic enzyme present in the extracellular matrix that
degrades matrix proteins. Includes the collagenases.

matrix space Large internal compartment of the
mitochondrion.

mechanosensitive channels Transmembrane ion channels 
that open in response to a mechanical stress on the lipid bilayer 
in which they are embedded. 

megakaryocyte Large myeloid cell with a multilobed nucleus
that remains in the bone marrow when mature. Buds off
platelets from long cytoplasmic processes. (Figure 22-29)

meiosis I The first of two rounds of chromosome segregation
following meiotic chromosome duplication; segregates the
homologs, each composed of a tightly linked pair of sister
chromatids.

meiosis II The second of two rounds of chromosome
segregation following meiotic chromosome duplication;
segregates the sister chromatids of each homolog.

membrane potential Voltage difference across a membrane
due to a slight excess of positive ions on one side and of
negative ions on the other. A typical membrane potential for
an animal cell plasma membrane is -60 mV (inside negative
relative to the surrounding fluid). (Figure 11-23)

membrane protein Amphiphilic protein of diverse structure
and function that associates with the lipid bilayer of cell
membranes. (Figure 10-17)

membrane transport protein Membrane protein that
mediates the passage of ions or molecules across a
membrane. The two main classes are transporters (also called
carriers or permeases) and channels. (Figure 11-4)

membrane-associated protein Membrane protein not
extending into the hydrophobic interior of the lipid bilayer
but bound to either face of the membrane by noncovalent
interactions with other membrane proteins. (Figure 10-17)

membrane-bending proteins Attach to specific membrane
regions as needed and act to control local membrane curvature
and thus confer on membranes their characteristic three-
dimensional shapes.

membrane-bound ribosome Ribosome attached to
the cytosolic face of the endoplasmic reticulum. The site of
synthesis of proteins that enter the endoplasmic reticulum.
(Figure 12-38)

memory cell In immunology: a T or B lymphocyte generated 
following antigen stimulation that is more easily and more 
quickly induced to become an effector cell or another memory 
cell by a later encounter with the same antigen. (Figure 24-17) 

mesoderm Embryonic tissue that is the precursor to muscle,
connective tissue, skeleton, and many of the internal organs.
(Figure 21-3)

messenger RNA (mRNA) RNA molecule that specifies the
amino acid sequence of a protein. Produced in eukaryotes by
processing of an RNA molecule made by RNA polymerase as
a complementary copy of DNA. It is translated into protein in a
process catalyzed by ribosomes. (Figure 6-20)

metabolism The sum total of the chemical processes that
take place in living cells. All of catabolism plus anabolism.
(Figure 2-14)

metabotropic receptors Neurotransmitter receptors that
regulate ion channels indirectly through the activation of
second-messenger molecules.

metaphase plate Imaginary plane at right angles to the
mitotic spindle and midway between the spindle poles; the
plane in which chromosomes are positioned at metaphase.
(Panel 17-1, pp. 980-981)

metaphase-to-anaphase transition Transition in the
eukaryotic cell cycle preceding sister-chromatid separation at
anaphase. If the cell is not ready to proceed to anaphase, the
cell cycle is halted at this point. (Figure 17-9 and Panel 17-1,
pp. 980-981)

metastases Secondary tumors, at sites in the body additional
to that of the primary tumor, resulting from cancer cells breaking
loose, entering blood or lymphatic vessels, and colonizing
separate environments.

metastasis The spread of cancer cells from their site of origin 
to other sites in the body. (Figures 20-1 and 20-16) 

MHC complex (major histocompatibility complex) Cluster
of genes in one vertebrate chromosome (chromosome 6 in
humans) that code for a set of highly polymorphic cell-surface
glycoproteins (MHC proteins). (Figure 24-37)

microbiome The combined genomes of the various species 
of a defined microbiota. 

microbiota The collective of microorganisms that reside in or
on an organism.

microelectrode A piece of fine glass tubing, pulled to an
even finer tip, that is used to inject electric current into cells or
to study the intracellular concentrations of common inorganic
ions (such as H+, Na+, K+, Cl-, and Ca2+) in a single living cell
by insertion of its tip directly into the cell interior through the
plasma membrane.

microRNAs (miRNAs) Short (~21 nucleotide) eukaryotic
RNAs, produced by the processing of specialized RNA
transcripts coded in the genome, that regulate gene expression
through base-pairing with mRNA. (Figure 7-75)

microsome Small vesicle derived from endoplasmic reticulum
that is produced by fragmentation when cells are homogenized.
(Figure 12-34)

microtubule flux Movement of individual tubulin molecules
in the microtubules of the spindle toward the poles by loss of
tubulin at their minus ends. Helps to generate the poleward
movement of sister chromatids after they separate in anaphase.
(Figure 17-35)

microtubule-associated protein (MAP) Any protein that
binds to microtubules and modifies their properties. Many
different kinds have been found, including structural proteins,
such as MAP2, and motor proteins, such as dynein. [Not to be
confused with the “MAP” (mitogen-activated protein kinase) of
“MAP kinase.”]

microtubule-organizing center (MTOC) Region in a
cell, such as a centrosome or a basal body, from which
microtubules grow.

midbody Structure formed at the end of cleavage that can
persist for some time as a tether between the two daughter
cells in animals. (Figure 17-43)

mitochondrial hsp70 Part of a multisubunit protein assembly 
bound to the matrix side of the TIM23 complex that acts as a 
motor to pull mitochondrial precursor proteins into the matrix 
space. 

mitochondrial matrix Large internal compartment of
the mitochondrion. The corresponding compartment in a
chloroplast is known as the stroma.


mitochondrial precursor proteins Proteins first fully 
synthesized in the cytosol and then translocated into 
mitochondrial subcompartments as directed by one or more 
signal sequences. 

mitochondrion (plural mitochondria) Membrane-bounded 
organelle, about the size of a bacterium, that carries out 
oxidative phosphorylation and produces most of the ATP in 
eukaryotic cells. (Figure 1-28) 

mitogen Extracellular signal molecule that stimulates cells to
proliferate.

mitotic chromosome Highly condensed duplicated 
chromosome as seen at mitosis, consisting of two sister 
chromatids held together at the centromere. 

mitotic spindle Bipolar array of microtubules and associated
molecules that forms in a eukaryotic cell during mitosis and
serves to move the duplicated chromosomes apart.
(Figure 17-23 and Panel 17-1, pp. 980-981)

model organism A species that has been studied intensively 
over a long period and thus serves as a “model” for deriving 
fundamental biological principles. 

molecular chaperone (chaperone) Protein that helps 
guide the proper folding of other proteins, or helps them avoid 
misfolding. Includes heat-shock proteins (hsp). 

monoallelic gene expression Expression of only one of
the two copies of a gene in a diploid genome, occurring,
for example, as a result of imprinting or X-chromosome
inactivation.

monoclonal antibody Antibody secreted by a hybridoma
cell line. Because the hybridoma is generated by the fusion of a
single B cell with a single tumor cell, each hybridoma produces
antibodies that are all identical. (Page 444)

monocyte Type of white blood cell that leaves the
bloodstream and matures into a macrophage in tissues.
(Figure 22-27)

monomeric GTPase A single-subunit enzyme that converts 
GTP to GDP (also called small monomeric GTP-binding 
proteins). Cycles between an active GTP-bound form and an 
inactive GDP-bound form and frequently acts as a molecular 
switch in intracellular signaling pathways. 

morphogen Diffusible signal molecule that can impose a
pattern on a field of cells by causing cells in different places to
adopt different fates. (Figure 21-8)

morphogenesis Developmental process in which cells
undergo movements and deformations in order to assemble
into tissues and organs with specific shapes and sizes.

motor protein Protein that uses energy derived from
nucleoside triphosphate hydrolysis to propel itself along a linear
track (protein filament or other polymeric molecule).

mRNA degradation control Regulation by a cell of gene
expression by selectively preserving or destroying certain
mRNA molecules in the cytoplasm.

mTOR The mammalian version of the large protein kinase 
called TOR, involved in cell signaling; mTOR exists in two 
functionally distinct multiprotein complexes. 

multidrug resistance An observed phenomenon in which
cells exposed to one anticancer drug evolve a resistance not
only to that drug, but also to other drugs to which they have
never been exposed.

multidrug resistance (MDR) protein Type of ABC
transporter protein that can pump hydrophobic drugs (such as
some anticancer drugs) out of the cytoplasm of eukaryotic cells.

multipass transmembrane protein Membrane protein in
which the polypeptide chain crosses the lipid bilayer more than
once. (Figure 10-17)

multivesicular bodies Intermediates in the endosome 
maturation process; early endosomes that are on their way to 
becoming late endosomes. 

mutation Heritable change in the nucleotide sequence of a
chromosome. (Panel 8-2, pp. 486-487)

mutation rate The rate at which changes (mutations) occur in 
DNA sequences. 

mutualism Ecological relationship between microbes and 
their host in which both the microbe and host benefit. 

Myc Transcription regulatory protein that is activated when a
cell is stimulated to grow and divide by extracellular signals. It
activates the transcription of many genes, including those that
stimulate cell growth. (Figure 17-61)

myelin sheath Insulating layer of specialized cell
membrane wrapped around vertebrate axons. Produced
by oligodendrocytes in the central nervous system and by
Schwann cells in the peripheral nervous system. (Figure 11-33)

myeloid cell Any white blood cell other than a lymphocyte. 
(Figure 22-31) 

myoblast Mononucleated, undifferentiated muscle precursor 
cell. A skeletal muscle cell is formed by the fusion of multiple 
myoblasts. (Figure 22-19) 

myofibril Long, highly organized bundle of actin, myosin, and 
other proteins in the cytoplasm of muscle cells that contracts 
by a sliding filament mechanism. 

myosin Type of motor protein that uses the energy of ATP 
hydrolysis to move along actin filaments. 

Na+-K+ pump (Na+-K+ ATPase) Transmembrane carrier
protein found in the plasma membrane of most animal cells that
pumps Na+ out of and K+ into the cell, using energy derived
from ATP hydrolysis. (Figure 11-15)

NAD+/NADH (nicotinamide adenine dinucleotide/reduced
nicotinamide adenine dinucleotide) Electron carrier system
that participates in oxidation-reduction reactions, such as the
oxidation of food molecules. NAD+ accepts the equivalent of
a hydride ion (H-, a proton plus two electrons) to become the
activated carrier NADH. The NADH formed donates its high-
energy electrons to the ATP-generating process of oxidative
phosphorylation. (Figure 2-36)

NADH dehydrogenase complex First of the three electron-
driven proton pumps in the mitochondrial respiratory chain,
also known as Complex I. It accepts electrons from NADH and
passes them to a quinone. (Figure 14-18)

NADP+/NADPH (nicotinamide adenine dinucleotide
phosphate/reduced nicotinamide adenine dinucleotide
phosphate) Electron carrier system closely related to NAD+/
NADH, but used almost exclusively in reductive biosynthetic,
rather than catabolic, pathways. (Figure 2-36)

naïve cell In immunology: a T or B lymphocyte that
proliferates and differentiates into an effector cell or memory cell
when it encounters its specific foreign antigen for the first time.
(Figure 24-17)

natural killer cell (NK cell) Cytotoxic cell of the innate
immune system that can kill virus-infected cells and some
cancer cells.

natural regulatory T cell A regulatory T cell (Treg cell) that
develops in the thymus and helps maintain self-tolerance.


negative selection Process by which thymocytes expressing
a T cell receptor with high affinity for a self peptide bound to a
self-MHC protein are eliminated by undergoing apoptosis.

negative staining A technique in electron microscopy
enabling fine detail of isolated macromolecules to be seen.
Samples are prepared such that a very thin film of heavy-metal
salt covers everywhere except where excluded by the presence
of macromolecules, which allow electrons to pass through,
creating a reverse or negative image of the molecule.

Nernst equation Equation that computes the
electrical potential (voltage) generated by differences in ion
concentrations across a membrane.
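
As an illustration of this entry, the sketch below evaluates the Nernst equation in its common form, V = (RT/zF) ln(Co/Ci), where Co and Ci are the ion concentrations outside and inside the cell and z is the ion's charge. The function name, the temperature of 37 °C, and the example K+ concentrations (5 mM outside, 140 mM inside) are assumptions chosen for illustration; they are not taken from this glossary.

```python
import math

R = 8.314462618   # gas constant, J/(mol*K)
F = 96485.33212   # Faraday constant, C/mol


def nernst_potential_mv(c_out, c_in, z=1, temp_c=37.0):
    """Equilibrium potential (inside relative to outside) in millivolts.

    c_out and c_in must share the same units; only their ratio matters.
    """
    temp_k = temp_c + 273.15
    return 1000.0 * (R * temp_k) / (z * F) * math.log(c_out / c_in)


# Illustrative (assumed) K+ concentrations: 5 mM outside, 140 mM inside.
print(round(nernst_potential_mv(5.0, 140.0), 1))
```

With these assumed values the result is close to -89 mV, inside negative. Equal concentrations on both sides give 0 mV, since the logarithm of 1 is zero.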

Netrin Signal protein, secreted by cells of the neural tube floor
plate, responsible for attracting growth cones of commissural
axons toward and across the midline.

neural crest Collection of cells located along the line where
the neural tube pinches off from the surrounding epidermis in
the vertebrate embryo. Neural crest cells migrate to give rise to
a variety of tissues, including neurons and glia of the peripheral
nervous system, pigment cells of the skin, and the bones of the
face and jaws. (Figure 19-8)

neural map Regular mapping of neurons of a similar type 
from one territory to another, such that there are orderly 
projections of one array of neurons onto another. 

neural tube Tube of ectoderm that will form the brain and 
spinal cord in a vertebrate embryo. (Figure 21-56) 

neurofilament Type of intermediate filament found in nerve 
cells. (Figure 16-72) 

neuromuscular junction Specialized chemical synapse
between an axon terminal of a motor neuron and a skeletal
muscle cell. (Figure 11-37)

neuron (nerve cell) Impulse-conducting cell of the nervous
system, with extensive processes specialized to receive,
conduct, and transmit signals. (Figures 11-28 and 21-66)

neuronal specificity Nonequivalence among neurons; an 
intrinsic characteristic that guides axons to their appropriate 
target sites. 

neurotransmitter Small signal molecule secreted by the
presynaptic nerve cell at a chemical synapse to relay the
signal to the postsynaptic cell. Examples include acetylcholine,
glutamate, GABA, glycine, and many neuropeptides.

neurotrophic factor Factor released in limited amounts by a 
target tissue that the neurons innervating that tissue require to 
survive. 

neurotrophin Family of signal proteins that promote the
survival and growth of specific classes of neurons.

neutrophil White blood cell that is specialized for the uptake 
of particulate material by phagocytosis. Enters tissues that 
become infected or inflamed. (Figure 24-5) 

NFκB protein Latent transcription regulator that is activated
by various intracellular signaling pathways when cells are
stimulated during immune, inflammatory, or stress responses.
Also has important roles in animal development. (Figure 15-62)

nitric oxide (NO) Gaseous signal molecule that is widely used
in cell-cell communication in both animals and plants.
(Figure 15-40)

nitrogen fixation Biochemical process carried out by certain
bacteria that reduces atmospheric nitrogen (N2) to ammonia,
leading eventually to various nitrogen-containing metabolites.

NMDA receptor Subclass of glutamate-gated ion channel
in the mammalian central nervous system critical for long-
term potentiation and long-term depression. NMDA-receptor
channels are doubly gated, opening only when glutamate is
bound to the receptor and, simultaneously, the membrane is
strongly depolarized.

NO synthase (NOS) Enzyme that synthesizes nitric oxide 
(NO) by the deamination of arginine. (Figure 15-40B) 

NOD-like receptors (NLRs) Large family of pattern
recognition receptors (PRRs) with leucine-rich repeat motifs;
they are exclusively cytoplasmic and recognize a distinct set of
microbial molecules.

nonclassical cadherins Large family of cadherins that are 
more distantly related in sequence than classical cadherins and 
include proteins involved in adhesion (including protocadherins, 
desmocollins, and desmogleins) and signaling. 

noncoding RNA An RNA molecule that is the final product
of a gene and does not code for protein. These RNAs serve as
enzymatic, structural, and regulatory components for a wide
variety of processes in the cell.

nondisjunction Event occurring occasionally during meiosis 
in which a pair of homologous chromosomes fails to separate 
so that the resulting germ cell has either too many or too few 
chromosomes. 

nonenveloped virus Virus consisting of a nucleic acid core 
and a protein capsid only. (Figure 23-18C,D) 

nonhomologous end joining A DNA repair mechanism for
double-strand breaks in which the broken ends of DNA are
brought together and rejoined by DNA ligation, generally with
the loss of one or more nucleotides at the site of joining.

nonretroviral retrotransposons Type of transposable 
element that moves by being first transcribed into an RNA copy 
that is converted to DNA by reverse transcriptase then inserted 
elsewhere in the genome. The mechanism of insertion differs 
from that of the retroviral-like transposons. (Table 5-4, p. 288) 

nonsense-mediated mRNA decay Mechanism for
degrading aberrant mRNAs containing in-frame internal stop
codons before they can be translated into protein. (Figure 6-76)

normal flora The human microbiota consisting of
approximately 10^14 bacterial, fungal, and protozoan cells,
representing thousands of microbial species.

Notch Transmembrane receptor protein (and latent
transcription regulator) involved in many cell-fate choices in
animal development, for example in the specification of nerve
cells from ectodermal epithelium. Its ligands are cell-surface
proteins such as Delta and Serrate. (Figure 15-59)

NSF Hexameric ATPase that disassembles a complex of a 
v-SNARE and a t-SNARE. (Figure 13-20) 

nuclear envelope Double membrane (two bilayers)
surrounding the nucleus. Consists of an outer and inner
membrane and is perforated by nuclear pores. The outer
membrane is continuous with the endoplasmic reticulum.
(Figures 4-9 and 12-7)

nuclear export receptors Bind to both the export signal and 
nuclear pore complex proteins to guide their cargo through the 
nuclear pore complex to the cytosol. 

nuclear export signal Sorting signal contained in the
structure of molecules and complexes, such as nuclear RNPs
and new ribosomal subunits, that are transported from the
nucleus to the cytosol through nuclear pore complexes.
(Figure 12-13)

nuclear import receptors Recognize nuclear localization
signals to initiate nuclear import of proteins containing the
appropriate nuclear localization signal.

nuclear lamin Protein subunit of the intermediate filaments 
that form the nuclear lamina. 

nuclear lamina Fibrous meshwork of proteins on the inner 
surface of the inner nuclear membrane. It is made up of a 
network of intermediate filaments formed from nuclear lamins. 

nuclear localization signal (NLS) Signal sequence or signal
patch found in proteins destined for the nucleus that enables
their selective transport into the nucleus from the cytosol
through the nuclear pore complexes. (Figures 12-9 and 12-13)

nuclear magnetic resonance (NMR) spectroscopy NMR
is the resonant absorption of electromagnetic radiation at a
specific frequency by atomic nuclei in a magnetic field, due to
flipping of the orientation of their magnetic dipole moments.
The NMR spectrum provides information about the chemical
environment of the nuclei. NMR is used widely to determine the
three-dimensional structure of small proteins and other small
molecules. The principles of NMR are also used for medical
diagnostic purposes in magnetic resonance imaging (MRI).
(Figure 8-22)

nuclear pore complex (NPC) Large multiprotein structure
forming an aqueous channel (the nuclear pore) through the
nuclear envelope that allows selected molecules to move
between nucleus and cytoplasm. (Figure 12-8)

nuclear receptor superfamily Intracellular receptors for
hydrophobic signal molecules such as steroid and thyroid
hormones and retinoic acid. The receptor-ligand complex acts
as a transcription factor in the nucleus. (Figure 15-65)

nuclear transport receptor (karyopherin) Protein that
escorts macromolecules either into or out of the nucleus:
nuclear import receptor or nuclear export receptor.
(Figure 12-13)

nucleolus A prominent structure in the nucleus where rRNA is
transcribed and ribosomal subunits are assembled. (Figure 4-9)

nucleoporin Any of a number of different proteins that make 
up nuclear pore complexes. 

nucleosome Beadlike structure in eukaryotic chromatin,
composed of a short length of DNA wrapped around an
octameric core of histone proteins. The fundamental structural
unit of chromatin. (Figures 4-22 and 4-23)

nucleotide Nucleoside with one or more phosphate groups
joined in ester linkages to the sugar moiety. DNA and RNA are
polymers of nucleotides. (Panel 2-6, pp. 100-101)

nucleotide excision repair Type of DNA repair that corrects
damage of the DNA double helix, such as that caused by
chemicals or UV light, by cutting out the damaged region on
one strand and resynthesizing it using the undamaged strand
as template. Compare base excision repair. (Figure 5-41)

O-linked glycosylation Addition of one or more sugars to a 
hydroxyl group on a protein. 

obligate pathogens Bacteria that can only replicate inside 
their host. 

olfactory receptors G-protein-coupled receptors on the
modified cilia of olfactory receptor neurons that recognize
odors. The receptors activate adenylyl cyclase via an olfactory-
specific G protein (Golf) and resultant increases in cAMP open
cyclic-AMP-gated cation channels, allowing Na+ influx and
depolarization and initiation of a nerve impulse.

oligodendrocyte Glial cell in the vertebrate central nervous
system that forms a myelin sheath around axons. Compare
Schwann cell.

oncogene An altered gene whose product can act in a 
dominant fashion to help make a cell cancerous. Typically, an 
oncogene is a mutant form of a normal gene (proto-oncogene) 
involved in the control of cell growth or division. (Figure 20-17) 

open reading frame (ORF) A continuous nucleotide
sequence free from stop codons in at least one of the three
reading frames (and thus with the potential to code for protein).
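
The definition above can be sketched in code: scan each of the three forward reading frames codon by codon and keep the longest stretch that contains no stop codon. The function name and the example sequence are hypothetical, and the sketch makes simplifying assumptions: it uses the standard DNA stop codons (TAA, TAG, TGA), ignores the reverse strand, and does not require a start codon, which practical ORF finders usually do.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code, DNA alphabet


def longest_open_stretch(seq):
    """Return (frame, start_index, codon_count) of the longest stop-free run.

    Only the three forward reading frames are scanned. Ties are resolved
    in favor of the run found first.
    """
    seq = seq.upper()
    best = (0, 0, 0)
    for frame in range(3):
        run_start, run_len = frame, 0
        for i in range(frame, len(seq) - 2, 3):
            if seq[i:i + 3] in STOP_CODONS:
                # A stop codon closes the current run; the next one
                # begins at the following codon.
                run_start, run_len = i + 3, 0
            else:
                run_len += 1
                if run_len > best[2]:
                    best = (frame, run_start, run_len)
    return best


# Hypothetical example: frames 0 and 1 each contain a stop codon;
# frame 2 (GAA ATA GCC) is open across three codons.
print(longest_open_stretch("ATGAAATAGCCC"))
```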

opportunistic pathogens Microbes of the normal flora that
can cause disease only if the immune systems are weakened or
if they gain access to a normally sterile part of the body.

optogenetics Use of genetically engineered
channelrhodopsin and other light-responsive ion channels
and transporters to modulate neuron function and hence
analyze the neurons and circuits underlying complex functions,
including behaviors in whole animals. (Figure 11-32)

organelle Subcellular compartment or large macromolecular
complex, often membrane-enclosed, that has a distinct
structure, composition, and function. Examples are nucleus,
nucleolus, mitochondrion, Golgi apparatus, and centrosomes.
(Figure 1-25)

Organizer Specialized tissue at the dorsal lip of the
blastopore in an amphibian embryo; a source of signals that
help to orchestrate formation of the embryonic body axis.

origin recognition complex (ORC) Large protein complex
that is bound to the DNA at origins of replication in eukaryotic
chromosomes throughout the cell cycle. (Figure 5-31)

orthologs Genes or proteins from different species that are 
similar in sequence because they are descendants of the same 
gene in the last common ancestor of those species. Compare 
paralogs. (Figure 1-21) 

osteoblast Cell that secretes matrix of bone. (Figure 22-14)

osteoclast Macrophage-like cell that erodes bone, enabling 
it to be remodeled during growth and in response to stresses 
throughout life. (Figure 22-16) 

osteocyte Nondividing cell in bone that develops from an 
osteoblast and is embedded in bone matrix. (Figure 22-14) 

outer membrane Mitochondrial membrane that is in contact 
with the cytosol. 

outer mitochondrial membrane Membrane that separates
the organelle from the cytosol.

outer nuclear membrane One of two concentric
membranes comprising the nuclear envelope; surrounds the
inner nuclear membrane and is continuous with the inner
nuclear membrane and the membrane of the endoplasmic
reticulum.

OXA complex Protein translocator in the inner mitochondrial
membrane that mediates insertion of inner membrane proteins.

oxidation (verb oxidize) Loss of electrons from an atom, as
occurs during the addition of oxygen to a molecule or when a
hydrogen is removed. Opposite of reduction. (Figure 2-20)

oxidative phosphorylation Process in bacteria and
mitochondria in which ATP formation is driven by the transfer
of electrons through the electron transport chain to molecular
oxygen. Involves the intermediate generation of a proton
gradient (pH gradient) across a membrane and a chemiosmotic
coupling of that gradient to the ATP synthase. (Figure 14-12)

oxidative phosphorylation Process in bacteria and
mitochondria in which ATP formation is driven by the
transfer of electrons through the electron-transport chain to
molecular oxygen. Involves the intermediate generation of an
electrochemical proton gradient across a membrane and a
chemiosmotic coupling of that gradient to the ATP synthase.
(Figure 14-10)

P-type pumps A class of ATP-driven pumps comprising
structurally and functionally related multipass transmembrane
proteins that phosphorylate themselves during the pumping
cycle. The class includes many of the ion pumps responsible
for setting up and maintaining gradients of Na+, K+, H+, and
Ca2+ across cell membranes. (Figure 11-12)

p53 A transcription regulatory protein that is activated by 
damage to DNA and is involved in blocking further progression 
through the cell cycle. (Figures 20-37 and 20-40) 

p53 Tumor suppressor gene that is mutated in about half 
of human cancers. Encodes a transcription regulator that is 
activated by damage to DNA and is involved in blocking further 
progression through the cell cycle. (Figure 20-27) 

pair-rule gene In Drosophila development, a gene expressed
in a series of regular transverse stripes along the body of the
embryo and which helps to determine its segments.
(Figure 21-19)

pairing In meiosis, the lining up of the two homologous 
chromosomes along their length. (Figure 17-54) 

papillomaviruses Class of viruses responsible for human 
warts and a prime example of DNA tumor viruses, being a 
cause of cancer of the uterine cervix. 

paracrine signaling Short-range cell-cell communication
via secreted signal molecules that act on neighboring cells.
(Figure 15-2)

paralogs Genes or proteins that are similar in sequence
because they are the result of a gene duplication event
occurring in an ancestral organism. Those in two different
organisms are less likely to have the same function than are
orthologs. Compare orthologs. (Figure 1-21)

parasitism Ecological relationship between microbes and
their host in which the microbe benefits to the detriment of the
host, as is often the case for pathogens.

passengers Mutations that have occurred in the same cell as
driver mutations, but which are irrelevant to the development of
the cancer.

passive transport (facilitated diffusion) Transport of a
solute across a membrane down its concentration gradient or
its electrochemical gradient, using only the energy stored in the
gradient. (Figure 11-4)

patch-clamp recording Electrophysiological technique
in which a tiny electrode tip is sealed onto a patch of cell
membrane, thereby making it possible to record the flow of
current through individual ion channels in the patch.
(Figure 11-34)

Patched Transmembrane protein predicted to cross the
plasma membrane 12 times; much is in intracellular vesicles
and some is on the cell surface where it binds the Hedgehog
protein.

pathogen (adjective pathogenic) An organism, cell, virus, or
prion that causes disease.

pathogen-associated molecular patterns (PAMPs)
Microbe-associated molecules, either not present or
sequestered in the host organism, that often occur in repeating
patterns that are recognized by pattern recognition receptors
(PRRs) in or on cells of the innate immune system. PAMPs are
present in various microbial molecules, including nucleic acids,
lipids, polysaccharides, and proteins.

pattern recognition receptor (PRR) Receptor present on
or in cells of the innate immune system that recognizes and is
activated by microbial pathogen-associated molecular patterns
(PAMPs).

PDZ domain Protein-binding domain present in many
scaffold proteins, and often used as a docking site for
intracellular tails of transmembrane proteins. (Figure 19-22)

pectin Mixture of polysaccharides rich in galacturonic acid 
which forms a highly hydrated matrix in which cellulose is 
embedded in plant cell walls. (Figure 19-63) 

peripheral (secondary) lymphoid organ Lymphoid organ 
in which T cells and B cells interact and respond to foreign 
antigens. Examples are spleen, lymph nodes, and mucosal- 
associated lymphoid organs. (Figure 24-12) 

peroxins Form a protein translocator that participates in the
import of proteins into peroxisomes.

peroxisome Small membrane-bounded organelle that uses
molecular oxygen to oxidize organic molecules. Contains some
enzymes that produce and others that degrade hydrogen
peroxide (H2O2). (Figure 12-27)

pH scale Common measure of the acidity of a solution: “p” 
refers to power of 10, “H” to hydrogen. Defined as the negative 
base-10 logarithm of the hydrogen ion concentration in moles per 
liter (M): pH = -log10[H+]. Thus a solution of pH 3 will contain 
10^-3 M hydrogen ions. pH less than 7 is acidic and pH greater 
than 7 is alkaline. 
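
The relationship between pH and hydrogen ion concentration can be checked numerically. The following is a minimal sketch, assuming concentrations are given in mol/L; the function names are illustrative and are not taken from the text.

```python
import math


def ph_from_concentration(h_molar: float) -> float:
    """Return pH = -log10([H+]) for a hydrogen ion concentration in mol/L."""
    if h_molar <= 0:
        raise ValueError("hydrogen ion concentration must be positive")
    return -math.log10(h_molar)


def concentration_from_ph(ph: float) -> float:
    """Invert the definition: [H+] = 10^(-pH), in mol/L."""
    return 10.0 ** (-ph)


def classify(ph: float) -> str:
    """Label a solution as acidic, neutral, or alkaline by its pH."""
    if ph < 7:
        return "acidic"
    if ph > 7:
        return "alkaline"
    return "neutral"


if __name__ == "__main__":
    ph = ph_from_concentration(1e-3)  # the 10^-3 M example from the definition
    print(f"pH = {ph:.2f} ({classify(ph)})")
```

Because the scale is logarithmic, each one-unit drop in pH corresponds to a tenfold increase in hydrogen ion concentration.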

phagocytosis Process by which unwanted cells, debris, and 
other bulky particulate materials are endocytosed (“eaten”) by a 
cell. Prominent in carnivorous cells, such as Amoeba proteus, 
and in vertebrate macrophages and neutrophils. From Greek 
phagein, to eat. 

phagosome Large intracellular membrane-enclosed vesicle 
that is formed as a result of phagocytosis. Contains ingested 
extracellular material. (Figure 13-61) 

phase variation The random switching of phenotype and 
expression of proteins involved in infection at frequencies much 
higher than mutation rates. 

phase-contrast microscope Type of light microscope that 
exploits the interference effects that occur when light passes 
through material of different refractive indices. Used to view 
living cells. (Figure 9-7) 

phenotype The observable character (including both physical 
appearance and behavior) of a cell or organism. 
(Panel 8-2, p. 486) 

phosphatidylinositol 4,5-bisphosphate [PI(4,5)P2, PIP2] 
Membrane inositol phospholipid (a phosphoinositide) that is 
cleaved by phospholipase C into IP3 and diacylglycerol at the 
beginning of the inositol phospholipid signaling pathway. It can 
also be phosphorylated by PI 3-kinase to produce PIP3 docking 
sites for signaling proteins in the PI-3-kinase-Akt signaling 
pathway. (Figures 15-28 and 15-53) 


phosphoglyceride Phospholipid derived from glycerol, 
abundant in biomembranes. (Figures 10-2 and 10-3) 

phosphoinositide A lipid containing a phosphorylated inositol 
derivative. Minor component of the plasma membrane, but 
important in demarking different membranes and for intracellular 
signal transduction in eukaryotic cells. (Figure 15-52) 

phosphoinositide 3-kinase (PI 3-kinase) Membrane-bound 
enzyme that is a component of the PI-3-kinase-Akt intracellular 
signaling pathway. It phosphorylates phosphatidylinositol 
4,5-bisphosphate at the 3 position on the inositol ring to 
produce PIP3 docking sites in the membrane for other 
intracellular signaling proteins. (Figure 15-53) 

phosphoinositides (PIPs; phosphatidylinositol 
phosphates) A lipid containing a phosphorylated inositol 
derivative. Minor component of the plasma membrane, but 
important in demarking different membranes and for intracellular 
signal transduction in eukaryotic cells. (Figure 13-10) 

phospholipase C (PLC) Membrane-bound enzyme that 
cleaves inositol phospholipids to produce IP3 and diacylglycerol 
in the inositol phospholipid signaling pathway. PLCβ is activated 
by GPCRs via specific G proteins, while PLCγ is activated by 
RTKs. (Figure 15-55) 

phospholipid The main category of lipids used to construct 
biomembranes. Generally composed of two fatty acids linked 
through glycerol (or sphingosine) phosphate to one of a variety 
of polar groups. (Figure 10-3, and Panel 2-5, pp. 98-99) 

phosphorylation Reaction in which a phosphate group is 
covalently coupled to another molecule. 

photoactivation Technique for studying intracellular 
processes in which an inactive form of a molecule of interest 
is introduced into the cell, and is then activated by a focused 
beam of light at a precise spot in the cell. (Figure 9-28) 

photochemical reaction center The part of a photosystem 
that converts light energy into chemical energy in 
photosynthesis. (Figure 14-44) 

photosynthetic electron-transfer reactions Light-driven 
reactions in photosynthesis in which electrons move along an 
electron-transport chain in a membrane, generating ATP and 
NADPH. 

photosystem Multiprotein complex involved in 
photosynthesis that captures the energy of sunlight and 
converts it to useful forms of energy: a reaction center plus an 
antenna. (Figure 14-45) 

phototropin Photoprotein associated with the plant plasma 
membrane that senses blue light and is partly responsible for 
phototropism. 

phragmoplast Structure made of microtubules and actin 
filaments that forms in the prospective plane of division of a 
plant cell and guides formation of the cell plate. (Figure 17-49) 

phytochrome Plant photoprotein that senses light via a 
covalently attached light-absorbing chromophore, which 
changes its shape in response to light and then induces a 
change in the protein’s conformation. Plant phytochromes are 
dimeric, cytoplasmic serine/threonine kinases, which respond 
differentially and reversibly to red and far-red light to alter cell 
behavior. 

PI-3-kinase-Akt pathway Intracellular signaling pathway 
that stimulates animal cells to survive and grow. (Figure 15-53) 

pinocytosis Literally, “cell drinking.” Type of endocytosis 
in which soluble materials are continually taken up from the 
environment in small vesicles and moved into endosomes 
along with the membrane-bound molecules. Compare 
phagocytosis. (Figure 13-48) 

piRNAs (piwi-interacting RNAs) A class of small noncoding 
RNAs made in the germ line that, in complex with Piwi proteins, 
keep in check the movement of transposable elements by 
transcriptionally silencing transposon genes and destroying 
RNAs produced by them. 

planar cell polarity Type of cellular asymmetry seen in some 
epithelia, such that each cell has a polarity vector oriented in 
the plane of the epithelium. (Figure 21-51) 

plant growth regulator (plant hormone) Signal molecule 
that helps coordinate growth and development. Examples are 
ethylene, auxins, gibberellins, cytokinins, abscisic acid, and the 
brassinosteroids. 

plasma membrane The membrane that surrounds a living 
cell. (Figure 10-1) 

plasmid vector Small, circular molecules of double-stranded 
DNA derived from plasmids that occur naturally in bacterial 
cells; widely used for gene cloning. 

plasmodesma (plural plasmodesmata) Plant equivalent 
of a gap junction. Communicating cell-cell junction in plants 
in which a channel of cytoplasm lined by plasma membrane 
connects two adjacent cells through a small pore in their cell 
walls. 

platelet Cell fragment, lacking a nucleus, that breaks off from 
a megakaryocyte in the bone marrow and is found in large 
numbers in the bloodstream. Helps initiate blood clotting when 
blood vessels are injured. (Figure 22-29) 

pleckstrin homology domain (PH domain) Protein 
domain found in some intracellular signaling proteins. 
Some PH domains in intracellular signaling proteins bind 
to phosphatidylinositol 3,4,5-trisphosphate produced by 
PI 3-kinase, bringing the signaling protein to the plasma 
membrane when PI 3-kinase is activated. 

pluripotent Describes a cell that has the potential to give rise 
to all or almost all of the cell types of the adult body. 

polarized In epithelia, describes the fact that the basal end of a 
cell, adherent to the basal lamina below, differs from the apical 
end, exposed to the medium above; thus, all epithelia and their 
individual cells are structurally polarized. 

Polycomb group Set of proteins critical for cell memory for 
some genes. They form complexes as part of the chromatin of 
the Hox complex, where they maintain a repressed state in cells 
where Hox genes have not been activated. 

polymerase chain reaction (PCR) Technique for amplifying 
specific regions of DNA by the use of sequence-specific 
primers and multiple cycles of DNA synthesis, each cycle being 
followed by a brief heat treatment to separate complementary 
strands. (Figure 8-36) 

polymorphisms Describes genome sequences that coexist 
as two or more sequence variants at high frequency in a 
population. 

polypeptide Linear polymer of amino acids. Proteins are large 
polypeptides, and the two terms can be used interchangeably. 
(Panel 3-1, pp. 112-113) 

polypeptide backbone Repeating sequence of atoms along 
the core of the polypeptide chain. 

polyribosome mRNA engaged with multiple ribosomes in the 
act of translation. 


polytene chromosome Giant chromosome in which the 
DNA has undergone repeated replication and the many copies 
have stayed together in precise alignment. (Figures 4-50 and 
4-51) 

porin Channel-forming proteins of the outer membranes of 
bacteria, mitochondria, and chloroplasts. 

position effect variegation Alteration in gene 
expression resulting from change in the position of the 
gene in relation to other chromosomal domains, especially 
heterochromatic domains. When an active gene is placed 
next to heterochromatin, the inactivating influence of the 
heterochromatin can spread to affect the gene to a variable 
degree, giving rise to position effect variegation. (Figure 4-31) 

positional value A cell’s internal record of its positional 
information in a multicellular organism; an intrinsic character 
that differs according to a cell’s location. 

positive feedback Control mechanism whereby the end 
product of a reaction or pathway stimulates its own production 
or activation. 

positive selection In immunology: process of thymocyte 
maturation in which thymocytes expressing a T cell receptor 
with appropriate affinity for a self peptide bound to a self MHC 
protein are signaled to survive and continue development. 

post-transcriptional controls Any control on gene 
expression that is exerted at a stage after transcription has 
begun. (Figure 7-54) 

post-translational Occurring after completion of translation. 

preprophase band Circumferential band of microtubules and 
actin filaments that forms around a plant cell under the plasma 
membrane prior to mitosis and cell division. (Figure 17-49) 

prereplicative complex (preRC) Multiprotein complex that 
is assembled at origins of replication during late mitosis and 
early G1 phases of the cell cycle; a prerequisite to license 
the assembly of a preinitiation complex, and the subsequent 
initiation of DNA replication. (Figures 17-17 and 17-18) 

primary cell wall The first cell wall produced by a developing 
plant cell; it is thin and flexible, allowing room for cell growth. 
(Figure 19-63) 

primary cilium Short, single, nonmotile cilium lacking 
dynein that arises from a centriole and projects from the 
surface of many animal cell types. Some signaling proteins are 
concentrated in the primary cilium. (Figure 15-38) 

primary Ig repertoire The billions of IgM and IgD 
immunoglobulin molecules made by the B cells of an adaptive 
immune system in the absence of antigen stimulation. 

primary immune response Adaptive immune response to 
an antigen that is made on first encounter with that antigen. 
(Figure 24-16) 

primary pathogens Pathogens that can cause overt disease 
in most healthy people. Some cause acute, life-threatening 
epidemic infections and spread rapidly between hosts; other 
potential primary pathogens may persistently infect a single 
individual for years without causing overt disease, the host 
often being unaware that they are infected. 

primary structure Linear sequence of monomer units in a 
polymer, such as the amino acid sequence of a protein. 

primary tumor Tumor at the original site at which a cancer 
first arose. Secondary tumors develop elsewhere by metastasis. 
prion disease Transmissible spongiform encephalopathy, 
such as kuru and Creutzfeldt-Jakob disease (CJD) in humans, 
scrapie in sheep, and bovine spongiform encephalopathy (BSE, 
or “mad cow disease”) in cows, that is caused and transmitted 
by an infectious, abnormally folded protein (prion). (Figure 3-33) 

pro-inflammatory cytokine Any cytokine that stimulates an 
inflammatory response. 

programmed cell death A form of cell death in which a cell 
kills itself by activating an intracellular death program. 

prokaryote Single-celled microorganism whose cells lack a 
well-defined, membrane-enclosed nucleus. Either a bacterium 
or an archaeon. (Figure 1-17) 

promoter Nucleotide sequence in DNA to which RNA 
polymerase binds to begin transcription. (Figure 7-17) 

proteasome Large protein complex in the cytosol with 
proteolytic activity that is responsible for degrading proteins that 
have been marked for destruction by ubiquitylation or by some 
other means. (Figures 6-83 and 6-84) 

protein The major macromolecular constituent of cells. A 
linear polymer of amino acids linked together by peptide bonds 
in a specific sequence. (Figure 3-1) 

protein activity control The selective activation, inactivation, 
degradation, or compartmentalization of specific proteins 
after they have been made. One of the means by which a cell 
controls which proteins are active at a given time or location in 
the cell. 

protein domain see domain 

protein glycosylation Process of transferring a single 
saccharide or preformed precursor oligosaccharide to proteins. 

protein kinase Enzyme that transfers the terminal phosphate 
group of ATP to one or more specific amino acids (serine, 
threonine, or tyrosine) of a target protein. 

protein kinase C (PKC) Ca2+-dependent protein kinase 
that, when activated by diacylglycerol and an increase in the 
concentration of cytosolic Ca2+, phosphorylates target proteins 
on specific serine and threonine residues. (Figure 15-29) 

protein phosphatase Enzyme that catalyzes phosphate 
removal from amino acids of a target protein. 

protein subunit An individual protein chain in a protein 
composed of more than one chain. 

protein translocation Process of moving a protein across a 
membrane. 

protein translocator Membrane-bound protein that 
mediates the transport of another protein across a membrane. 
(Figure 12-21) 

protein tyrosine phosphatase Enzyme that removes 
phosphate groups from phosphorylated tyrosine residues on 
proteins. 

proteoglycan Molecule consisting of one or more 
glycosaminoglycan chains attached to a core protein. 
(Figure 19-38) 

proteomics Study of all the proteins, including all the 
covalently modified forms of each, produced by a cell, tissue, or 
organism. Proteomics often investigates changes in this larger 
set of proteins (“the proteome”) caused by changes in the 
environment or by extracellular signals. 

proto-oncogene Normal gene, usually concerned with the 
regulation of cell proliferation, that can be converted into a 
cancer-promoting oncogene by mutation. 

protofilament Linear string of microtubule subunits joined 
end to end; multiple protofilaments associate with one another 
laterally to construct and provide strength and adaptability to 
microtubules. 

proton (H+) Positively charged subatomic particle that forms 
part of an atomic nucleus. Hydrogen has a nucleus composed 
of a single proton (H+). 

proton-motive force The force exerted by the 
electrochemical proton gradient that moves protons across a 
membrane. 

protozoan parasite Parasitic, nonphotosynthetic, single- 
celled, motile eukaryotic organism, for example Plasmodium. 

pseudogene Nucleotide sequence of DNA that has 
accumulated multiple mutations that have rendered an 
ancestral gene inactive and nonfunctional. 

purified cell-free system Fractionated cell homogenate that 
retains a particular biological function of the intact cell, and in 
which biochemical reactions and cell processes can be more 
easily studied. 

purifying selection Natural selection operating in a 
population to slow genome changes and reduce divergence by 
eliminating individuals carrying deleterious mutations. 

quantitative RT-PCR (reverse transcription-polymerase 
chain reaction) Technique in which a population of mRNAs is 
converted into cDNAs via reverse transcription, and the cDNAs 
are then amplified by PCR. The quantitative part relies on a 
direct relationship between the rate at which the PCR product 
is generated and the original concentration of the mRNA 
species of interest. 
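
The "direct relationship" in this definition can be illustrated with an idealized amplification model. The sketch below assumes that each cycle multiplies the amount of product by (1 + efficiency), where an efficiency of 1 means perfect doubling; the function names, the efficiency parameter, and the detection threshold are illustrative assumptions and do not come from the glossary.

```python
import math


def copies_after_cycles(n0: float, cycles: int, efficiency: float = 1.0) -> float:
    """Idealized product amount after a number of PCR cycles."""
    return n0 * (1.0 + efficiency) ** cycles


def threshold_cycle(n0: float, threshold: float, efficiency: float = 1.0) -> float:
    """Fractional cycle at which the product reaches a detection threshold."""
    return math.log(threshold / n0) / math.log(1.0 + efficiency)


def relative_abundance(ct_a: float, ct_b: float, efficiency: float = 1.0) -> float:
    """Starting amount of sample A relative to sample B, from threshold cycles."""
    return (1.0 + efficiency) ** (ct_b - ct_a)


if __name__ == "__main__":
    threshold = 1e9  # hypothetical detection threshold, in copies
    ct_low = threshold_cycle(100, threshold)   # sample with less starting cDNA
    ct_high = threshold_cycle(800, threshold)  # sample with 8x more starting cDNA
    print(f"threshold cycles: {ct_low:.2f} vs {ct_high:.2f}")
    print(f"estimated ratio: {relative_abundance(ct_high, ct_low):.1f}")
```

Under these assumptions, a sample with more starting mRNA reaches the threshold in fewer cycles, and the difference in threshold cycles gives the ratio of starting amounts.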

quaternary structure Three-dimensional relationship of the 
different polypeptide chains in a multisubunit protein or protein 
complex. 

quinone (Q) Small, lipid-soluble, mobile electron carrier 
molécule found in the respiratory and photosynthetic electron- 
transport chains. (Figure 14-17) 

Rab cascade An ordered recruitment of sequentially acting 
Rab proteins into Rab domains on membranes, which changes 
the identity of an organelle and reassigns membrane dynamics. 

Rab effectors Molecules that bind activated, membrane-bound 
Rab proteins and act as downstream mediators of 
vesicle transport, membrane tethering, and fusion. 

Rab proteins Monomeric GTPases in the Ras superfamily, 
present in plasma and organelle membranes in their GTP-bound 
state and as soluble cytosolic proteins in their GDP-bound state. 
Involved in conferring specificity on vesicle docking. 
(Table 15-5, p. 854) 

Rac Member of the Rho family of monomeric GTPases that 
regulate the actin and microtubule cytoskeletons, cell-cycle 
progression, gene transcription, and membrane transport. 

Rad51 Eukaryotic protein that catalyzes synapsis of DNA 
strands during genetic recombination. Called RecA in E. coli. 

Ran (Ran protein) Monomeric GTPase of the Ras superfamily 
present in both cytosol and nucleus. Required for the active 
transport of macromolecules into and out of the nucleus 
through nuclear pore complexes. (Table 15-5, p. 854) 

rapidly inactivating K+ channel Neuronal voltage-gated 
K+ channel, open when the membrane is depolarized, with 
a specific voltage sensitivity and kinetics of inactivation that 
induce a reduced rate of action potential firing at levels of 
stimulation only just above the threshold required, thereby 
resulting in a firing rate proportional to the strength of the 
depolarizing stimulus. 

Ras A small family of proto-oncogenes that are frequently 
mutated in cancers, each of which produces a Ras monomeric 
GTPase. 

Ras (Ras protein) Monomeric GTPase of the Ras superfamily 
that helps to relay signals from cell-surface receptor tyrosine 
kinases to the nucleus, frequently in response to 
signals that stimulate cell division. Named for the ras gene, first 
identified in viruses that cause rat sarcomas. (Figure 3-67) 

Ras superfamily Large superfamily of monomeric GTPases 
(also called small GTP-binding proteins) of which Ras is the 
prototypical member. (Table 15-5, p. 854) 

Ras-GAPs Ras GTPase-activating proteins; increase the rate 
of hydrolysis of bound GTP by Ras, thereby inactivating Ras. 

Ras-GEFs Ras guanine nucleotide exchange factors; 
stimulate the dissociation of GDP and the subsequent uptake 
of GTP from the cytosol, thereby activating Ras. 

Ras-MAP-kinase signaling pathway Intracellular signaling 
pathway that relays signals from activated receptor tyrosine 
kinases to effector proteins in the cell including transcription 
regulators in the nucleus. 

Rb gene The gene that is defective in both copies in 
individuals with retinoblastoma; its protein product plays a 
central role in cell-cycle control. 

reading frame The phase in which nucleotides are read in 
sets of three to encode a protein. An mRNA molecule can be 
read in any one of three reading frames, only one of which will 
give the required protein. (Figure 6-49) 
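
The three possible reading frames can be made concrete by translating one short mRNA from each starting offset. This is a minimal sketch that uses the standard genetic code; the example sequence and the function name are invented for illustration, and stop codons are shown as "*".

```python
from itertools import product

# Standard genetic code, listed with bases ordered U, C, A, G at each position.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(mrna: str, frame: int) -> str:
    """Translate an mRNA string starting at offset `frame` (0, 1, or 2).

    Trailing nucleotides that do not fill a complete codon are ignored.
    """
    if frame not in (0, 1, 2):
        raise ValueError("frame must be 0, 1, or 2")
    codons = (mrna[i:i + 3] for i in range(frame, len(mrna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


if __name__ == "__main__":
    mrna = "AUGGCAUUCUAA"  # hypothetical example sequence
    for frame in range(3):
        print(frame, translate(mrna, frame))
```

The same nucleotides give three unrelated amino acid sequences, which is why only one frame yields the required protein.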

RecA (RecA protein) Prototype for a class of DNA-binding 
proteins that catalyze synapsis of DNA strands during genetic 
recombination. (Figure 5-49) 

receptor Any protein that binds a specific signal molecule 
(ligand) and initiates a response in the cell. Some are on the cell 
surface, while others are inside the cell. (Figure 15-3) 

receptor editing Process by which a developing B cell that 
recognizes a self molecule changes its antigen receptors so 
that the cell no longer does so. 

receptor serine/threonine kinase Cell-surface receptor 
with an extracellular ligand-binding domain and an intracellular 
kinase domain that phosphorylates signaling proteins on serine 
or threonine residues in response to ligand binding. The TGFβ 
receptor is an example. (Figure 15-57) 

receptor tyrosine kinase (RTK) Cell-surface receptor with 
an extracellular ligand-binding domain and an intracellular 
kinase domain that phosphorylates signaling proteins on 
tyrosine residues in response to ligand binding. (Figure 15-43 
and Table 15-4, p. 850) 

receptor-mediated endocytosis Internalization of receptor- 
ligand complexes from the plasma membrane by endocytosis. 
(Figure 13-52) 

recombinant DNA technology Collection of techniques 
by which DNA segments from different sources are combined 
to make a new DNA, often called a recombinant DNA. 
Recombinant DNAs are widely used in the cloning of genes, in 
the genetic modification of organisms, and in the production of 
large amounts of rare proteins. 


recycling endosome Organelle that provides an intermediate 
stage on the passage of recycled receptors back to the cell 
membrane. Regulates plasma membrane insertion of some 
proteins. (Figure 13-58) 

red blood cell Small hemoglobin-containing blood cell of 
vertebrates that transports oxygen to, and carbon dioxide from, 
tissues. Also called an erythrocyte. 

redox pair Pair of molecules in which one acts as an electron 
donor and one as an electron acceptor in an oxidation-reduction 
reaction: for example, NADH (electron donor) and 
NAD+ (electron acceptor). (Panel 14-1, p. 765) 

redox potential The affinity of a redox pair for electrons, 
generally measured as the voltage difference between an 
equimolar mixture of the pair and a standard reference. NADH/ 
NAD+ has a low redox potential and O2/H2O has a high redox 
potential (high affinity for electrons). (Panel 14-1, p. 765) 
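
The practical consequence of a difference in redox potential is the free energy released when electrons pass from the low-potential pair to the high-potential pair. The sketch below uses the standard relation ΔG = -nFΔE and commonly quoted standard potentials of about -0.32 V for NAD+/NADH and +0.82 V for O2/H2O; these numbers are textbook values supplied here as assumptions, not taken from this glossary, and the function names are illustrative.

```python
FARADAY_KJ_PER_V_MOL = 96.485  # Faraday constant in kJ per volt per mole of electrons


def delta_e(donor_potential_v: float, acceptor_potential_v: float) -> float:
    """Potential difference (V) for electrons moving from donor pair to acceptor pair."""
    return acceptor_potential_v - donor_potential_v


def delta_g_kj_per_mol(n_electrons: int, delta_e_v: float) -> float:
    """Free-energy change from delta G = -n * F * delta E, in kJ/mol."""
    return -n_electrons * FARADAY_KJ_PER_V_MOL * delta_e_v


if __name__ == "__main__":
    e_nadh = -0.32    # assumed standard potential of NAD+/NADH, in volts
    e_oxygen = 0.82   # assumed standard potential of O2/H2O, in volts
    de = delta_e(e_nadh, e_oxygen)
    dg = delta_g_kj_per_mol(2, de)  # two electrons transferred per NADH
    print(f"delta E = {de:.2f} V, delta G = {dg:.0f} kJ/mol")
```

A positive ΔE, as in transfer from NADH to oxygen, gives a negative ΔG, so the transfer is energetically favorable; the reverse direction would require an input of energy.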

redox reaction Reaction in which one component becomes 
oxidized and the other reduced; an oxidation-reduction 
reaction. (Panel 14-1, p. 765) 

reduction (verb reduce) Addition of electrons to an atom, as 
occurs during the addition of hydrogen to a biological molecule 
or the removal of oxygen from it. Opposite of oxidation. 
(Figure 2-20) 

regulated nuclear transport Mechanisms controlling export 
of mRNAs from the nucleus to the cytosol that can be used to 
regulate gene expression. Also includes the selective import of 
proteins and RNA molecules into the nucleus. 

regulated secretory pathway A second secretory pathway 
found mainly in cells specialized for secreting products 
rapidly on demand—such as hormones, neurotransmitters, 
or digestive enzymes—in which soluble proteins and other 
substances are initially stored in secretory vesicles for later 
release. (Figure 13-62) 

regulator of G protein signaling (RGS) A GAP protein that 
binds to a trimeric G protein and enhances its GTPase activity, 
thus helping to limit G-protein-mediated signaling. (Figure 15-8) 

regulatory site Region of an enzyme surface to which a 
regulatory molecule binds and thereby influences the catalytic 
events at the separate active site. 

regulatory T cell (Treg) A type of T cell that suppresses the 
development, activation, or function of other immune cells via 
secreted cytokines or cell-surface inhibitory proteins. 

replication fork Y-shaped region of a replicating DNA 
molecule at which the two strands of the DNA are being 
separated and the daughter strands are being formed. (Figures 
5-7 and 5-18) 

replication origin Location on a DNA molecule at which 
duplication of the DNA begins. (Figures 4-19 and 5-23) 

replicative cell senescence Phenomenon observed in 
primary cell cultures in which cell proliferation slows down and 
finally irreversibly halts. 

respiratory chain (electron-transport chain) Electron-transport 
chain present in the inner mitochondrial membrane 
that generates an electrochemical gradient across the 
membrane that is used to drive ATP synthesis. 
(Figures 14-4 and 14-10) 

resting membrane potential Electrical potential across the 
plasma membrane of a cell at rest, i.e., a cell that has not been 
stimulated to open ion channels other than those that are 
normally open. 
restriction nuclease One of a large number of nucleases 
that can cleave a DNA molecule at any site where a specific 
short sequence of nucleotides occurs. Extensively used in 
recombinant DNA technology. (Figure 8-24) 
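
Site-specific cleavage can be illustrated by scanning one strand of a DNA sequence for a recognition site and cutting at a fixed offset within each match. The sketch below uses the EcoRI recognition sequence GAATTC, which is cut after the first G; the example DNA string and the function names are invented for illustration, and only one strand is modeled.

```python
def find_sites(dna: str, site: str) -> list[int]:
    """Return the start index of every occurrence of `site` in `dna`."""
    positions = []
    start = dna.find(site)
    while start != -1:
        positions.append(start)
        start = dna.find(site, start + 1)
    return positions


def digest(dna: str, site: str, cut_offset: int) -> list[str]:
    """Cut `dna` at `cut_offset` bases into each recognition site; return fragments."""
    fragments = []
    previous_cut = 0
    for position in find_sites(dna, site):
        cut = position + cut_offset
        fragments.append(dna[previous_cut:cut])
        previous_cut = cut
    fragments.append(dna[previous_cut:])
    return fragments


if __name__ == "__main__":
    dna = "AAGAATTCGGGAATTCTT"  # hypothetical sequence with two EcoRI sites
    print(find_sites(dna, "GAATTC"))
    print(digest(dna, "GAATTC", cut_offset=1))
```

Because the enzyme cuts wherever the short sequence occurs, the number and sizes of the fragments depend only on where the sites fall in the DNA.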

restriction point Important transition at the end of G1 in the 
eukaryotic cell cycle; commits the cell to enter S phase. The 
term was originally used for this transition in the mammalian cell 
cycle; in this book we use the term Start. (Figure 17-9) 

retinoblastoma A rare type of human cancer arising from 
cells in the retina of the eye that are converted to a cancerous 
state by an unusually small number of mutations. Studies 
of retinoblastoma led to the discovery of the first tumor 
suppressor gene. 

retinoblastoma protein (Rb protein) Tumor suppressor 
protein involved in the regulation of cell division. Mutated in the 
cancer retinoblastoma, as well as in many other tumors. Its 
normal activity is to regulate the eukaryotic cell cycle by binding 
to and inhibiting the E2F proteins, thus blocking progression to 
DNA replication and cell division. (Figure 17-61) 

retroviral-like retrotransposons A large family of 
transposons that move themselves in and out of chromosomes 
by a mechanism similar to that used by retroviruses, being 
first transcribed into an RNA copy that is converted to DNA by 
reverse transcriptase then inserted elsewhere in the genome. 
(Table 5-4, p. 288) 

retrovirus RNA-containing virus that replicates in a cell by first 
making an RNA-DNA intermediate and then a double-strand 
DNA molecule that becomes integrated into the cell’s DNA. 
(Figure 5-62) 

reverse genetics Approach to discovering gene function that 
starts from the DNA (gene) and its protein product and then 
creates mutants to analyze the gene’s function. 

reverse transcriptase Enzyme first discovered in retroviruses 
that makes a double-strand DNA copy from a single-strand 
RNA template molecule. 

RGD sequence Tripeptide sequence of arginine-glycine-aspartic 
acid that forms a binding site for integrins; present 
in fibronectin and some other extracellular proteins. 
(Figure 19-47C) 

Rheb A monomeric Ras-related GTPase that in its active form 
(Rheb-GTP) activates mTOR, which promotes cell growth. 

Rho Member of the Rho family of monomeric GTPases that 
regulate the actin and microtubule cytoskeletons, cell-cycle 
progression, gene transcription, and membrane transport. 

Rho family Family of monomeric GTPases within the Ras 
superfamily involved in signaling the rearrangement of the 
cytoskeleton. Includes Rho, Rac, and Cdc42. (Table 15-5, p. 
854) 

rhodopsin Seven-span membrane protein of the GPCR 
family that acts as a light sensor in rod photoreceptor cells in 
the vertebrate retina. Contains the light-sensitive prosthetic 
group retinal. (Figure 15-39) 

ribosomal RNA (rRNA) Any one of a number of specific RNA 
molecules that form part of the structure of a ribosome and 
participate in the synthesis of proteins. Often distinguished by 
their sedimentation coefficient (e.g., 28S rRNA, 5S rRNA). 

ribosome Particle composed of rRNAs and ribosomal 
proteins that catalyzes the synthesis of protein using 
information provided by mRNA. (Figure 6-64) 


ribozyme An RNA molecule with catalytic activity. 

RNA (ribonucleic acid) Polymer formed from covalently 
linked ribonucleotide monomers. See also messenger RNA, 
ribosomal RNA, transfer RNA. (Figure 6-4) 

RNA editing Type of RNA processing that alters the 
nucleotide sequence of an RNA transcript after it is synthesized 
by inserting, deleting, or altering individual nucleotides. 

RNA interference (RNAi) As originally described, mechanism 
by which an experimentally introduced double-stranded RNA 
induces sequence-specific destruction of complementary 
mRNAs. The term RNAi is often used to include the inhibition 
of gene expression by microRNAs (miRNAs) and piwi RNAs 
(piRNAs), which are encoded in the cell’s own genome. 

RNA polymerase Enzyme that catalyzes the synthesis of 
an RNA molecule on a DNA template from ribonucleoside 
triphosphate precursors. (Figure 6-9) 

RNA primer Short stretch of RNA synthesized on a DNA 
template. It is required by DNA polymerases to start their DNA 
synthesis. 

RNA processing control Regulation by a cell of gene 
expression by controlling the processing of RNA transcripts, 
which includes their splicing. 

RNA splicing Process in which intron sequences are 
excised from RNA transcripts. A major process in the nucleus 
of eukaryotic cells leading to formation of messenger RNAs 
(mRNAs). 

RNA transport and localization control Regulation by a 
cell of gene expression by selecting which completed mRNAs 
are exported from the nucleus to the cytosol and determining 
where in the cytosol they are localized. 

RNA world Hypothesis that early life on Earth was based 
primarily on RNA molecules that both stored genetic 
information and catalyzed biochemical reactions. 

RNA-seq Sequencing the entire repertoire of RNA from a cell 
or tissue; also known as deep RNA sequencing. 

robustness The ability of biological regulatory systems to 
function normally in the face of perturbations such as exposure 
to frequent and/or extreme variations in external conditions or 
the concentrations or activities of key components. 

rod photoreceptor (rod) Photoreceptor cell in the vertebrate 
retina that is responsible for noncolor vision in dim light. 

rough endoplasmic reticulum (rough ER) Endoplasmic 
reticulum with ribosomes on its cytosolic surface. Involved in 
the synthesis of secreted and membrane-bound proteins. 

rRNA gene Gene that specifies a ribosomal RNA (rRNA). 

ryanodine receptor A regulated Ca2+ channel in the ER 
membrane that opens in response to rising Ca2+ levels and 
thus amplifies the Ca2+ signal. 

SAM complex Protein translocator that helps β-barrel 
proteins to fold properly in the outer mitochondrial membrane. 

Sanger sequencing see dideoxy sequencing 

Sar1 protein Monomeric GTPase responsible for regulating 
COPII coat assembly at the endoplasmic reticulum membrane. 

sarcoma Cancer of connective tissue. 

scaffold protein Protein that binds groups of intracellular 
signaling proteins into a signaling complex, often anchoring the 
complex at a specific location in the cell. (Figure 15-10) 
scanning electron microscope Type of electron microscope 
that produces an image of the surface of an object. 
(Figure 9-50) 

S-Cdk Cyclin-Cdk complex formed in vertebrate cells by an 
S-cyclin and the corresponding cyclin-dependent kinase (Cdk). 
(Figure 17-11 and Table 17-1, p. 969) 

SCF Family of ubiquitin ligases formed as a complex of 
several different proteins. One is involved in regulating the 
eukaryotic cell cycle, directing the destruction of inhibitors of 
S-Cdks in late G1 and thus promoting the activation of S-Cdks 
and DNA replication. (Figures 3-71 and 17-15) 

Schwann cell Glial cell responsible for forming myelin 
sheaths in the peripheral nervous system. Compare 
oligodendrocyte. (Figure 11-33) 

S-cyclin Member of a class of cyclins that accumulate during 
late G1 phase and bind Cdks soon after progression through 
Start; they help stimulate DNA replication and chromosome 
duplication. Levels remain high until late mitosis, after which 
these cyclins are destroyed. (Figure 17-11) 

Sec61 complex Three-subunit core of the protein 
translocator that transfers polypeptide chains across the 
endoplasmic reticulum membrane. 

second messenger (small intracellular mediator) Small 
intracellular signaling molecule that is formed or released for 
action in response to an extracellular signal and helps to relay 
the signal within the cell. Examples include cyclic AMP, cyclic 
GMP, IP3, Ca2+, and diacylglycerol. 

secondary cell wall Permanent rigid cell wall that is laid 
down underneath the thin primary cell wall in certain plant cells 
that have completed their growth. 

secondary Ig repertoire Immunoglobulins produced 
by B cells after antigen- and helper-T-cell-induced somatic 
hypermutation and class switching. Compared to the primary Ig 
repertoire, these Igs have a greatly increased diversity of both 
Ig classes and antigen-binding sites and have increased affinity 
for antigen. 

secondary immune response The adaptive immune 
response that occurs in response to a second or subsequent
exposure to an antigen. The response is more rapid in onset 
and stronger than the primary immune response. (Figure 24-16) 

secondary structure Regular local folding pattern of a 
polymeric molecule; in proteins, α helices and
β sheets.

secretion system Specialized bacterial systems that secrete
effector proteins that interact with host cells. 

secretory vesicle Membrane-enclosed organelle in which 
molecules destined for secretion are stored prior to release.
Sometimes called secretory granule because darkly staining
contents make the organelle visible as a small solid object.
(Figure 13-65)

securin Protein that binds to the protease separase and 
thereby prevents its cleavage of the protein linkages that hold 
sister chromatids together in early mitosis. Securin is destroyed 
at the metaphase-to-anaphase transition. (Figure 17-38) 

segment Divisions of an insect body along its anteroposterior 
axis, each forming highly specialized structures, but all built
according to a similar fundamental plan. 

segment-polarity gene In Drosophila development, a gene 
involved in specifying the anteroposterior organization of each 
body segment. (Figure 21-19) 


segmentation clock The gene-expression oscillator 
controlling regular segmentation during vertebrate embryonic 
development. 

segmentation genes Genes expressed by subsets of cells 
in the embryo that refine the pattern of gene expression so as 
to define the boundaries and ground plan of the individual body 
segments. 

selectin Member of a family of cell-surface
carbohydrate-binding proteins that mediate transient,
Ca²⁺-dependent cell-cell adhesion in the bloodstream, for example
between white blood cells and the endothelium of the blood vessel wall.
(Figure 19-28) 

selectivity filter The part of an ion channel structure that 
determines which ions it can transport. (Figures 11-24 and
11-25) 

sensory bristles Miniature sense organs present on most
exposed surfaces of Drosophila, consisting of a sensory neuron
and supporting cells and responding to chemical or mechanical
stimuli. 

separase Protease that cleaves the cohesin protein linkages 
that hold sister chromatids together. Acts at anaphase, 
enabling chromatid separation and segregation. (Figure 17-38)

septum Structure formed during bacterial cell division by the 
inward growth of the cell wall and plasma membrane and that 
divides the cell into two. 

sequential induction Development process that generates 
a progressively more complicated pattern. A series of
local inductions whereby one of two cell types present in a
developing tissue can produce a signal to induce neighboring 
cells to specialize in a third way; the third cell type can then 
signal back to the other two cell types nearby to generate a 
fourth and a fifth cell type, and so on. 

serine protease Type of protease that has a reactive serine in 
the active site. (Figures 3-12 and 3-39) 

serine/threonine kinase Enzyme that phosphorylates
specific proteins on serines or threonines.

SH2 domain Src homology region 2, a protein domain
present in many signaling proteins. Binds a short amino acid
sequence containing a phosphotyrosine. (Panel 3-2, 
pp. 142-143) 

side chain The part of an amino acid that differs between 
amino acid types. The side chains give each type of amino acid 
its unique physical and chemical properties. (Panel 3-1,
pp. 112-113) 

signal patch Protein-sorting signal that consists of a specific
three-dimensional arrangement of atoms on the folded protein’s 
surface. (Figure 13-46) 

signal peptidase Enzyme that removes a terminal signal 
sequence from a protein once the sorting process is complete.
(Figure 12-35) 

signal sequence Short continuous sequence of amino acids 
that determines the eventual location of a protein in the cell. An
example is the N-terminal sequence of 20 or so amino acids
that directs nascent secretory and transmembrane proteins to
the endoplasmic reticulum. (Table 12-3, p. 648)

signal-recognition particle (SRP) Ribonucleoprotein particle
that binds an ER signal sequence on a partially synthesized
polypeptide chain and directs the polypeptide and its attached
ribosome to the endoplasmic reticulum. (Figure 12-36)

signaling center Cluster of specialized cells in developing
tissues that serves as a source of developmental signals, for
example, the generation of a morphogen gradient.

single-nucleotide polymorphism (SNP) A variation between 
individuals in a population due to a relatively common difference
in a specific nucleotide at a defined point in the DNA sequence.

single-particle reconstruction Computational procedure 
in electron microscopy in which images of many identical
molecules are obtained and digitally combined to produce an
averaged three-dimensional image, thereby revealing structural
details that are hidden by noise in the original images.
(Figures 9-54 and 9-55)

single-pass transmembrane protein Membrane protein in 
which the polypeptide chain crosses the lipid bilayer only once. 
(Figure 10-24) 

single-strand DNA-binding (SSB) protein Protein that 
binds to the single strands of the opened-up DNA double helix,
preventing helical structures from reforming while the DNA is 
being replicated. (Figure 5-15) 

sister chromatids Tightly linked pair of chromosomes that 
arise from chromosome duplication during S phase. They 
separate during M phase and segregate into different daughter 
cells. (Figure 17-21) 

sliding clamp Protein complex that holds the DNA 
polymerase on DNA during DNA replication. (Figure 5-17)

Slit Signal protein, secreted by cells of the neural tube floor 
plate, responsible for repelling the growth cones of commissural
axons after they have crossed the midline, thereby ensuring
these neurons do not re-cross the midline. 

Smad family Latent transcription regulators that are 
phosphorylated and activated by receptor serine/threonine 
kinases and carry the signal from the cell surface to the 
nucleus. (Figure 15-57)

small interfering RNAs (siRNAs) Short (21-26 nucleotide)
double-stranded RNAs that inhibit gene expression by directing 
destruction of complementary mRNAs. Production of siRNAs is 
usually triggered by exogenously introduced double-stranded 
RNA. (Figure 7-77) 

small nuclear RNA (snRNA) Small RNA molecules that are
complexed with proteins to form the ribonucleoprotein particles 
(snRNPs) involved in RNA splicing. (Figures 6-28 and 6-29) 

small nucleolar RNA (snoRNA) Small RNAs found in 
the nucleolus, with various functions, including guiding the 
modifications of precursor rRNA. (Table 6-1, p. 305, and 
Figure 6-41) 

smooth endoplasmic reticulum (smooth ER) Region of the
endoplasmic reticulum not associated with ribosomes. Involved
in detoxification reactions, Ca²⁺ storage, and lipid synthesis.
(Figure 12-33) 

Smoothened Seven-pass transmembrane protein with a 
structure very similar to a GPCR but does not seem to act 
as a Hedgehog receptor or as an activator of G proteins; it is 
controlled by the Patched and iHog proteins. 

SNARE proteins (SNAREs) Members of a large family of 
transmembrane proteins present in organelle membranes
and the vesicles derived from them. SNAREs catalyze the 
many membrane fusion events in cells. They exist in pairs—a 
v-SNARE in the vesicle membrane that binds specifically to a 
complementary t-SNARE in the target membrane. 


sodium dodecyl sulfate polyacrylamide-gel 
electrophoresis (SDS-PAGE) Type of electrophoresis 
used to separate proteins by size. The protein mixture 
to be separated is first treated with a powerful negatively 
charged detergent (SDS) and with a reducing agent 
(β-mercaptoethanol), before being run through a polyacrylamide
gel. The detergent and reducing agent unfold the proteins, free
them from association with other molecules, and separate the
polypeptide subunits. 

somatic cell Any cell of a plant or animal other than cells of 
the germ line. From Greek soma, body.

somatic hypermutation In immunology: accumulation 
of point mutations in the assembled variable-region-coding 
sequences of immunoglobulin genes that occurs when B cells 
are activated to form memory cells. Results in the production 
of antibodies with altered antigen-binding sites, some of which 
bind antigen with increased affinity; it is responsible for affinity 
maturation in antibody responses. 

somatic mutations In cancer, one or more detectable
abnormalities in the DNA sequence of tumor cells that 
distinguish them from the normal somatic cells surrounding the 
tumor. 

somite One of a series of paired blocks of mesoderm
that form during early development and lie on either side of 
the notochord in a vertebrate embryo. They give rise to the 
segments of the body axis, including the vertebrae, muscles, 
and associated connective tissue. (Figure 21-38) 

sorting signal Signal sequence or signal patch that directs 
the delivery of a protein to a specific location, such as a
particular intracellular compartment. 

spectrin Abundant protein associated with the cytosolic side 
of the plasma membrane in red blood cells, forming a network 
that supports the membrane. Also present in other cells.
(Figure 10-38)

S phase Period of a eukaryotic cell cycle in which DNA is 
synthesized. (Figure 17-4) 

spinal cord Bundle of neurons and support cells that extends 
from the brain. 

spindle assembly checkpoint Regulatory system that
operates during mitosis to ensure that all chromosomes
are properly attached to the spindle before sister-chromatid
separation starts. (Figure 17-9 and Panel 17-1, pp. 980-981)

spliceosome Large assembly of RNA and protein molecules
that performs pre-mRNA splicing in eukaryotic cells. 

Src (Src protein family) Family of cytoplasmic tyrosine 
kinases (pronounced “sark”) that associate with the cytoplasmic 
domains of some enzyme-linked cell-surface receptors 
(for example, the T cell antigen receptor) that lack intrinsic 
tyrosine kinase activity. They transmit a signal onward by 
phosphorylating the receptor itself and specific intracellular
signaling proteins on tyrosines. (Figure 3-10) 

SRP (signal-recognition particle) receptor Component
in the endoplasmic reticulum (ER) membrane that guides the
signal-recognition particle to the ER membrane.

starch Polysaccharide composed exclusively of glucose units, 
used as an energy-storage material in plant cells. (Figure 2-51) 

Start (restriction point) Important transition at the end of 
G₁ in the eukaryotic cell cycle. Passage through Start commits
the cell to enter S phase. The term was originally used for this
point in the yeast cell cycle only; the equivalent point in the
mammalian cell cycle was called the restriction point. In this
book we use Start for both. (Figure 17-9) 

start-transfer signal Short amino acid sequence that 
enables a polypeptide chain to start being translocated across 
the endoplasmic reticulum membrane through a protein
translocator. Multipass membrane proteins sometimes have
both N-terminal (signal sequence) and internal start-transfer
signals. (Figure 12-42)

STAT (signal transducer and activator of transcription)
Latent transcription regulator that is activated by
phosphorylation by Janus kinases (JAKs) and enters the
nucleus in response to signaling from receptors of the cytokine
receptor family. (Figure 15-56) 

stem cell Undifferentiated cell that can continue dividing 
indefinitely, throwing off daughter cells that can either commit 
to differentiation or remain a stem cell (in the process of
self-renewal). (Figure 22-3)

stem-cell niche The specialized microenvironment in a tissue 
in which self-renewing stem cells can be maintained. 

steroid hormones Hormones, including cortisol, estrogen, 
and testosterone, that are hydrophobic lipid molecules derived
from cholesterol that activate intracellular nuclear receptors.

stimulatory G protein (Gs) G protein that, when activated,
activates the enzyme adenylyl cyclase and thus stimulates
the production of cyclic AMP. See also G protein.
(Table 15-3, p. 846)

stochastic Random. Involving chance, probability, or random 
variables. 

stop-transfer signal Hydrophobic amino acid sequence
that halts translocation of a polypeptide chain through the
endoplasmic reticulum membrane, thus anchoring the protein
chain in the membrane. (Figure 12-42) 

strand exchange Reaction in which one of the single-strand 
3' ends from one duplex DNA molecule penetrates another
duplex and searches it for homologous sequences through 
base-pairing. Also called strand invasion. 

strand-directed mismatch repair A proofreading system
that removes DNA replication errors missed by the DNA
polymerase proofreading exonuclease. Detects the potential for
DNA helix distortion from noncomplementary base pairs, then
recognizes and excises the mismatch in the newly synthesized 
strand and resynthesizes the excised segment using the old 
strand as a template. 

stress fibers Cortical fibers of contractile actin-myosin II 
bundles that connect the cell to the extracellular matrix or 
adjacent cells through focal adhesions or a circumferential belt 
and adherens junctions. 

stroma (1) “Bedding”: the connective tissue in which a 
glandular or other epithelium is embedded. Stromal cells 
provide the environment necessary for the development of 
other cells within the tissue. (2) The large interior space of a 
chloroplast, containing enzymes that incorporate CO₂ into
sugars. (Figure 14-38) 

substrate Molecule on which an enzyme acts.

superresolution Describes several approaches in light 
microscopy that bypass the limit imposed by the diffraction of 
light and successfully allow objects as small as 20 nm to be 
imaged and clearly resolved. 

survival factor Extracellular signal that promotes cell survival
by inhibiting apoptosis. (Figure 18-12) 


symporter Carrier protein that transports two types of solute
across the membrane in the same direction. (Figure 11-8) 

synapse Communicating cell-cell junction that allows signals
to pass from a nerve cell to another cell. In a chemical synapse,
the signal is carried by a diffusible neurotransmitter. (Figure
19-22) In an electrical synapse, a direct connection is made
between the cytoplasms of the two cells via gap junctions.
(Figures 11-34 and 19-23)

synapse elimination Process by which each muscle cell
at first receives synapses from several motor neurons, but is 
ultimately left innervated by only one. 

synaptic plasticity Changes in the strength with which 
a chemical synapse transmits a signal. It is thought to be
important in memory formation, where concentrations of 
postsynaptic AMPA receptor are modulated in response to a 
synapse’s activity. 

synaptic signaling Intercellular signaling performed by 
neurons that transmit signals electrically along their axons and
release neurotransmitters at synapses, which are often located 
far away from the neuronal cell body. 

synaptic vesicle Small neurotransmitter-filled secretory 
vesicle found at the axon terminals of nerve cells. Its contents
are released into the synaptic cleft by exocytosis when an 
action potential reaches the axon terminal. 

synaptonemal complex Structure that holds paired 
homologous chromosomes tightly together in pachytene of 
prophase I in meiosis and promotes the final steps of
crossing-over. (Figures 17-55 and 17-56)

syncytium Mass of cytoplasm containing many nuclei 
enclosed by a single plasma membrane. Typically the result
either of cell fusion or of a series of incomplete division cycles in
which the nuclei divide but the cell does not. 

TATA box Sequence in the promoter region of many
eukaryotic genes that binds a general transcription factor
(TFIID) and hence specifies the position at which transcription is
initiated. (Figure 6-14) 

T cell receptor (TCR) Transmembrane receptor for 
antigen on the surface of T lymphocytes, consisting of an 
immunoglobulin-like heterodimer. (Figure 24-32) 

T-cell-mediated immune response Any adaptive immune 
response mediated by antigen-specific T cells. 

telomerase Enzyme that elongates telomere sequences in 
DNA, which occur at the ends of eukaryotic chromosomes. 

telomere End of a chromosome, associated with a 
characteristic DNA sequence that is replicated in a special way.
Counteracts the tendency of the chromosome otherwise to
shorten with each round of replication. From Greek telos, end,
and meros, portion. 

telophase Final stage of mitosis in which the two sets of 
separated chromosomes decondense and become enclosed
by nuclear envelopes. (Panel 17-1, pp. 980-981) 

template Single strand of DNA or RNA whose nucleotide
sequence acts as a guide for the synthesis of a complementary
strand. (Figure 1-3)

terminal differentiation The limit of cell determination when
a cell forms one of the highly specialized cell types of the adult 
body. 

terminally differentiated A cell at the limit of cell 
determination, being one of the highly specialized cell types of
the adult body. 

terminator Signal in bacterial DNA that halts transcription; 
in eukaryotes, transcription terminates after cleavage and
polyadenylation of the newly synthesized RNA. 

tertiary structure Complex three-dimensional form of a 
folded polymer chain, especially a protein or RNA molecule.

Tfh cell see follicular helper T cell 

Th1 cell A type of effector helper T cell that secretes
interferon-γ to help activate macrophages and induces B cells
to switch the class of antibody they make. (Figure 24-44) 

Th17 cell A type of effector helper T cell that secretes IL17,
which recruits neutrophils and stimulates an inflammatory
response. (Figure 24-44) 

Th2 cell A type of effector helper T cell that helps activate
B cells to produce antibodies, to undergo somatic 
hypermutation, and switch the class of immunoglobulin 
produced. (Figure 24-44) 

thylakoid Flattened sac of membrane in a chloroplast that 
contains chlorophyll and other pigments and carries out the 
light-trapping reactions of photosynthesis. Stacks of thylakoids 
form the grana of chloroplasts. (Figures 14-35 and 14-36) 

thylakoid membrane Chloroplast membrane system
that contains the large membrane protein complexes for 
photosynthesis and photophosphorylation. 

thymocytes Developing T cells in the thymus. 

tight junction Cell-cell junction that seals adjacent epithelial
cells together, preventing the passage of most dissolved
molecules from one side of the epithelial sheet to the other.
(Figures 19-2 and 19-21) 

TIM complexes Protein translocators in the mitochondrial 
inner membrane. The TIM23 complex mediates the transport of
proteins into the matrix and the insertion of some proteins into
the inner membrane; the TIM22 complex mediates the insertion
of a subgroup of proteins into the inner membrane.
(Figure 12-21)

Toll A transmembrane receptor protein. On the ventral side 
of the Drosophila egg membrane, its activation controls the
distribution of Dorsal, a transcription regulator of the NFκB
family. 

Toll-like receptors (TLRs) Family of pattern recognition
receptors (PRRs) on or in cells of the innate immune system.
They recognize pathogen-associated immunostimulants 
(PAMPs) associated with microbes. (Figure 24-4) 

TOM complex Multisubunit protein complex that transports 
proteins across the mitochondrial outer membrane.
(Figure 12-21)

TOR Large, serine/threonine protein kinase that is activated 
by the PI-3-kinase-Akt signaling pathway and promotes cell
growth. 

totipotent Describes a cell that is able to give rise to all the
different cell types in an organism. 

trans face Face on the other (far) side. 

trans Golgi network (TGN) Network of interconnected
tubular and cisternal structures closely associated with the
trans face of the Golgi apparatus and the compartment from 
which proteins and lipids exit the Golgi, bound for the cell 
surface or another compartment. 

transcellular transport Transport of solutes, such as
nutrients, across an epithelium, by means of membrane
transport proteins in the apical and basal faces of the epithelial
cells. (Figure 11-11) 

transcription (DNA transcription) Copying of one strand of 
DNA into a complementary RNA sequence by the enzyme RNA 
polymerase. (Figures 6-1 and 6-8) 

transcription regulators General name for any protein that 
binds to a specific DNA sequence (known as a cis-regulatory
sequence) to influence the transcription of a gene. 

transcriptional control Regulation by a cell of gene
expression by controlling when and how often a given gene is 
transcribed. 

transcytosis Uptake of material at one face of a cell by 
endocytosis, its transfer across a cell in vesicles, and discharge 
from another face by exocytosis. (Figure 13-58) 

transfer RNA (tRNA) Set of small RNA molecules used in
protein synthesis as an interface (adaptor) between mRNA and
amino acids. Each type of tRNA molecule is covalently linked to
a particular amino acid. (Figure 6-50) 

transferrin receptor Cell-surface receptor for transferrin (a 
soluble protein that carries iron) that delivers iron to the cell 
interior via receptor-mediated endocytosis and recycling of the 
receptor-transferrin complex. 

transformed A cell with an altered phenotype that behaves
in many ways like a cancer cell (i.e., unregulated proliferation,
anchorage-independent growth in culture). 

transforming growth factor-β superfamily (TGFβ
superfamily) Large family of structurally related secreted
proteins that act as hormones and local mediators to
control a wide range of functions in animals, including
during development. It includes the TGFβ/activin and bone
morphogenetic protein (BMP) subfamilies. (Figure 15-57) 

transgene The foreign or modified gene that has been added 
to create a transgenic organism. 

transgenic organism Plant or animal that has stably 
incorporated one or more genes from another cell or organism 
(through insertion, deletion, and/or replacement) and can pass
them on to successive generations. (Figures 8-53 and 8-70)

transit amplifying cell Cell derived from a stem cell 
that divides a limited number of times before terminally 
differentiating. 

transition state Structure that forms transiently in the course
of a chemical reaction and has the highest free energy of any
reaction intermediate. Its formation is a rate-limiting step in the 
reaction. (Figure 3-47) 

translation (RNA translation) Process by which the 
sequence of nucleotides in an mRNA molecule directs
the incorporation of amino acids into protein. Occurs on a 
ribosome. (Figures 6-1 and 6-64) 

translational control Regulation by a cell of gene expression
by selecting which mRNAs in the cytoplasm are translated by 
ribosomes. 

translocon The assembly of a translocator associated with 
other membrane complexes, such as enzymes that modify the 
growing polypeptide chain.

transmembrane adhesion proteins Cytoskeleton-linked
transmembrane molecules with one end linking to the
cytoskeleton inside the cell and the other end linking to other 
structures outside it. 

transmembrane protein Membrane protein that extends 
through the lipid bilayer, with part of its mass on either side of 
the membrane. (Figure 10-17) 

transmitter-gated ion channel (ion-channel-coupled 
receptor, ionotropic receptor) Ion channel found at 
chemical synapses in the postsynaptic plasma membranes of
nerve and muscle cells. Opens only in response to the binding
of a specific extracellular neurotransmitter. The resulting inflow
of ions leads to the generation of a local electrical signal in the
postsynaptic cell. (Figures 11-36 and 15-6) 

transport vesicle Membrane-enclosed transport containers 
that bud from specialized coated regions of donor membrane
and pass from one cell compartment to another as part of the 
cell’s membrane transport processes; vesicles can be spherical, 
tubular, or irregularly shaped. 

transporter (carrier protein, permease) Membrane 
transport protein that binds to a solute and transports it across
the membrane by undergoing a series of conformational
changes. Transporters can transport ions or molecules passively
down an electrochemical gradient or can link the conformational 
changes to a source of metabolic energy such as ATP 
hydrolysis to drive active transport. Compare channel protein. 
See also membrane transport protein. (Figure 11-3) 

transposable element (transposon) Segment of DNA 
that can move from one genome position to another by 
transposition. (Table 5-4, p. 288) 

transposition (transpositional recombination) Movement 
of a DNA sequence from one genome site to another.
(Table 5-4, p. 288)

transposon see transposable element 

treadmilling Process by which a polymeric protein filament is 
maintained at constant length by addition of protein subunits at 
one end and loss of subunits at the other. (Panel 16-2, 
pp. 902-903) 

trimeric GTP-binding protein see G protein 

Trithorax group Set of proteins critical for cell memory 
that maintains the transcription of Hox genes in cells where 
transcription has already been switched on. 

t-SNAREs Transmembrane SNARE protein, usually 
composed of three proteins and found on target membranes 
where it interacts with v-SNAREs on vesicle membranes. 

tubulin The protein subunit of microtubules. (Panel 16-1, 
p. 891, and Figure 16-42) 

γ-tubulin ring complex (γ-TuRC) Protein complex
containing γ-tubulin and other proteins that is an efficient
nucleator of microtubules and caps their minus ends. 

tumor progression Process by which an initial mildly 
disordered cell behavior gradually evolves into a full-blown 
cancer. (Figures 20-8 and 20-9) 

tumor suppressor gene Gene that appears to help prevent 
formation of a cancer. Loss-of-function mutations in such 
genes favor the development of cancer. (Figure 20-17) 

tumor virus Virus that can help make the cell it infects 
cancerous. 

turgor pressure Large hydrostatic pressure developed inside
a plant cell as the result of the intake of water by osmosis; it is
the force driving cell expansion in plant growth and it maintains 
the rigidity of plant stems and leaves. 

two-dimensional gel electrophoresis Technique combining 
two different separation procedures, separation by charge
(isoelectric focusing) in the first dimension, then separation by
size in a direction at a right angle to that of the first step, to
resolve up to 2000 proteins in the form of a two-dimensional
protein map. 

type III fibronectin repeat The major repeat domain in 
fibronectin, it is about 90 amino acids long and occurs at 
least 15 times in each subunit. The repeat is among the most 
common of all protein domains in vertebrates.

type III secretion system One of several secretion systems
in Gram-negative bacteria; delivers effector proteins into host
cells in a contact-dependent manner. (Figure 23-7) 

type IV collagen An essential component of mature basal 
laminae consisting of three long protein chains twisted into a 
ropelike superhelix with multiple bends. Separate molecules
assemble into a flexible, felt-like network that gives the basal 
lamina tensile strength. 

tyrosine kinase Enzyme that phosphorylates specific
proteins on tyrosines. See also cytoplasmic tyrosine kinase. 

tyrosine-kinase-associated receptor Cell-surface receptor 
that functions similarly to RTKs, except that the kinase domain 
is encoded by a separate gene and is noncovalently associated 
with the receptor polypeptide chain. 

ubiquitin Small, highly conserved protein present in all
eukaryotic cells that becomes covalently attached to lysines of
other proteins. Attachment of a short chain of ubiquitins to such
a lysine can tag a protein for intracellular proteolytic destruction
by a proteasome. (Figure 3-69)

ubiquitin ligase Any one of a large number of enzymes that 
attach ubiquitin to a protein, often marking it for destruction in 
a proteasome. The process catalyzed by a ubiquitin ligase is
called ubiquitylation. (Figure 3-71) 

unfolded protein response Cellular response triggered 
by an accumulation of misfolded proteins in the endoplasmic 
reticulum. Involves expansion of the ER and increased
transcription of genes that code for endoplasmic reticulum
chaperones and degradative enzymes. (Figure 12-51) 

uniporter Carrier protein that transports a single solute from
one side of the membrane to the other. (Figure 11-8) 

V(D)J recombination Somatic recombination process by 
which gene segments are brought together to form a functional 
gene for a polypeptide chain of an immunoglobulin or T cell 
receptor. (Figure 24-28) 

vacuole Large fluid-filled compartment found in most plant 
and fungal cells, typically occupying more than a third of the cell 
volume. (Figure 13-41) 

van der Waals attraction Type of (individually weak) 
noncovalent bond that is formed at close range between 
nonpolar atoms. (Table 2-1, p. 45 and Panel 2-3, pp. 94-95) 

variable region Region of an immunoglobulin or T cell
receptor polypeptide chain that is the most variable and 
contributes to the antigen-binding site. (Figures 24-25 and 
24-32) 

vascular endothelial growth factor (VEGF) Secreted
protein that stimulates the growth of blood vessels.
(Table 15-4, p. 850, and Figure 22-26)

vesicle transport model One hypothesis for how the Golgi
apparatus achieves and maintains its polarized structure and
how molecules move from one cisterna to another. This model
holds that Golgi cisternae are long-lived structures that retain 
their characteristic set of Golgi-resident proteins firmly in place, 
and cargo proteins are transported from one cisterna to the 
next by transport vesicles. 

vesicular transport Transport of proteins from one cell 
compartment to another by means of membrane-bounded
intermediaries such as vesicles or organelle fragments. 

V gene segment A DNA sequence encoding most of 
the variable region of an immunoglobulin or T cell receptor
polypeptide chain. There are many different V gene segments, 
one of which becomes joined to a D or J gene segment by 
somatic recombination when an individual lymphoid progenitor 
cell begins to differentiate into a B or T lymphocyte.
(Figure 24-28)

virulence factor Protein, encoded by a virulence gene, that 
contributes to an organism’s ability to cause disease. 

virulence gene Gene that contributes to an organism’s ability 
to cause disease. 

virus Particle consisting of nucleic acid (RNA or DNA) 
enclosed in a protein coat and capable of replicating within a 
host cell and spreading from cell to cell. (Figure 23-11) 

virus receptor Molecule on the host cell surface to which
virus surface proteins bind to enable binding of virus to the cell 
surface. 

voltage-gated cation channel Type of ion channel found 
in the membranes of electrically excitable cells (such as nerve, 
endocrine, egg, and muscle cells). Opens in response to a shift 
in membrane potential past a threshold value. 

voltage-gated K⁺ channel Ion channel in the membrane of
nerve cells that opens in response to membrane depolarization,
enabling K⁺ efflux and rapid restoration of the negative
membrane potential. 

voltage-gated Na⁺ channel Ion channel in the membrane 
of nerve and skeletal muscle cells that opens in response to a 
stimulus causing sufficient depolarization, allowing Na⁺ to enter 
the cell down its electrochemical gradient. 

v-SNAREs Transmembrane SNARE protein, comprising a 
single polypeptide chain, usually found in vesicle membranes 
where it interacts with t-SNAREs in target membranes. 


V-type pumps Turbine-like protein machines constructed 
from multiple different subunits that use the energy of ATP 
hydrolysis to drive transport across a membrane. The V-type 
proton pump transfers H⁺ into organelles such as lysosomes to 
acidify their interior. (Figure 11-12) 

WASp protein Key target of activated Cdc42. Exists in 
an inactive folded conformation and an activated open 
conformation; association with Cdc42 stabilizes the open form, 
enabling binding to the Arp2/3 complex and enhancing actin- 
nucleating activity. 

Wee1 Protein kinase that inhibits Cdk activity by 
phosphorylating amino acids in the Cdk active site. Important in 
regulating entry into M phase of the cell cycle. 

Western blotting Technique by which proteins are separated 
by electrophoresis and immobilized on a paper sheet and then 
analyzed, usually by means of a labeled antibody. Also called 
immunoblotting. 

white blood cell General name for all the nucleated blood 
cells lacking hemoglobin. Also called leukocytes. Includes 
lymphocytes, granulocytes, and monocytes. (Figure 22-27) 

Wnt protein Member of a family of secreted signal proteins 
that have many different roles in controlling cell differentiation, 
proliferation, and gene expression in animal embryos and adult 
tissues. 

Wnt/β-catenin pathway Signaling pathway activated by 
binding of a Wnt protein to its cell-surface receptors. The 
pathway has several branches. In the major (canonical) branch, 
activation causes increased amounts of β-catenin to enter the 
nucleus, where it regulates the transcription of genes controlling 
cell differentiation and proliferation. Overactivation of the 
Wnt/β-catenin pathway can lead to cancer. (Figure 15-60) 

X-inactivation Inactivation of one copy of the X chromosome 
in the somatic cells of female mammals. 

X-inactivation center (XIC) Site in an X chromosome at 
which inactivation is initiated and spreads outward. 

x-ray crystallography Technique for determining the three- 
dimensional arrangement of atoms in a molecule based on the 
diffraction pattern of x-rays passing through a crystal of the 
molecule. (Figure 8-21) 

zygote Diploid cell produced by fusion of a male and female 
gamete. A fertilized egg. 

The index covers the main text but not the problems or summaries. F indicates coverage in a Figure only, on a page separated 
from the indexed text treatment and T indicates inclusion in a Table. Figures on the same page as relevant text are not indexed 
separately. 

Acronyms have generally been preferred to their expansions as being easier to construct. Common names have been 
preferred for organs, so “stomach cancer” not “gastric cancer.” 

Sorting is word-by-word, ignoring spaces and punctuation, so that “ATP synthesis” will appear before “ATPases.” 


A 

A-kinase see cyclic-AMP-dependent 
protein kinase 

A-P (anteroposterior) axis 1162-1164, 
1169-1170 

A-to-I editing (adenine to inosine) 418 
A-type lamins 948 

A-V (animal-vegetal) axis 1155-1156, 
1167-1168 

AAA proteins 358, 359F, 709 
ABC (ATP-binding cassette) transporters 
18F, 163-164, 606, 609-611, 
1139 

Abcb1 gene 1139 
Abl gene, in Bcr-Abl hybrid 1135 
abortive initiation 306 
absorptive cells, intestinal 1218, 1219F, 
1221, 1223-1224, 1225F 
ABT-737 1032F 

Acanthamoeba (A. castellanii) 801F, 805F, 
924 

accessory proteins 

actin accessory proteins 899, 
904-906, 908F, 909-911, 913, 
921, 958 

cytoskeletal filament assembly and 
889, 894-896 

intermediate filaments 946 
membrane transport 655, 677 
microtubules 929-930, 941 
motor proteins as 889 
accumulation delays 1176, 1178 
acetyl CoA (acetyl coenzyme A) 
from β-oxidation 667 
in the citric acid cycle 106, 758-760 
in glycolysis 75 

from oxidation of fats 81-82, 83F 
pyruvate conversion to 75, 81-82 
structure and biosynthetic role 69 
acetylation see histones; lysine; N-termini 
acetylcholine 

as an excitatory neurotransmitter 629 
effects on different target cells 
816-817, 837T, 843 
effects on nitric oxide synthesis 846 


GPCRs activated by 832 
structure 817F 
acetylcholine receptors 
as drug targets 632 
as muscarinic or nicotinic 843 
at neuromuscular junctions 630-631, 
633, 1072F, 1210F 
acetylcholinesterase 143, 630 
N-acetylgalactosamine 718, 1058, 1060F 
N-acetylglucosamine (GlcNAc) 
in GAGs 1058 

GlcNAc phosphotransferase defects 
728-729 

in Golgi apparatus and ER 716, 
717-718F, 720 

in mammalian glycoproteins 717F 
Achaete/Scute family 1171-1172 
achondroplastic dwarfism 1196 
acid catalysis 144,145F 
acidic amino acid side chains 113 
acidification, in secretory vesicles 743 
acidity, of lysosomes 722-723 
acids, defined 46, 93, 763-764 
aconitase 106, 427 
ACTH (adrenocorticotrophic hormone) 
744F, 835T 

actin 

and actin-binding proteins 898-914 
bacterial homologs 896-897 
chemical inhibitors 904T 
cross-linking proteins 911F 
F-actin and G-actin 898 
monomer availability and 906 
myosin and, in contractile rings 
996-997 

myosin and, in muscle contraction 
916-920 

myosin as associated motor protein 
890 

in non-muscle cells 923-925 
polymerization in cell migration 951 
polymerization in vitro 899, 900F 
separation from samples 440 
actin-binding proteins 904-906, 907-909, 
1079 


actin depolymerizing factor (cofilin) 905, 
910, 914, 954, 957, 958F 
actin filaments 

in adherens junctions 1036, 1042 
arrays 911-913 

“arrowhead” conformation 898, 899F, 
911F, 913F 

bacterial effectors and 1281 
bundles 912F 
capping 903, 909, 910F 
in the cell cortex 592 
confocal microscopy 541F 
D- and T-forms 901, 904F, 910, 927, 
954 

half-lives 896-897, 906, 919 
helical structure 124, 899F 
integrin links to 1075 
negative staining 559-561 
neuronal membranes 591 
nucleation in formation of 899-900, 
902, 906-907, 908-909F, 953, 
954F, 1289F 

and plant cell walls 1087 
plus and minus ends 898, 900, 902, 
904F 

treadmilling 901, 903-904, 953-954 
tube and vesicle formation 1192 
underlying the plasma membrane 890 
visualizing with TIRF 548 
actin gene 371F, 923 
actin mRNA 422 

actin polymerization by pathogens 1278, 
1281-1282, 1287-1288, 1289F 
actin-related proteins see Arp2/3 complex 
actin-severing proteins 909-910 
actin subunits 

assembly 898-899 
as asymmetrical 894 
actin webs 908F, 953F, 959 
α-actinin 905, 911-912, 919, 920F, 1079 
action potentials 

firing frequency and distance 634, 
636 

firing frequency and PSPs 633 
gap junctions and 1051 
in muscle contraction 920 
plasma membrane depolarization by 
607, 621-624, 627-629, 632, 
634-636 

propagation 624F, 625 
activated amino acids 338F 
activated carriers 

carboxylated biotin 69T, 70F 
coenzyme A as 68-69 
coupling with favorable reactions 
64-65 

energy storage by 63-64 
FADH₂ as electron carrier 69, 83 
NADH and NADPH as electron carriers 
67-68 

and their functions 69T 
see also ATP; GTP 

activation, enzyme-coupled receptors and 
GPCRs 850-851, 852F 
activation energies 

enzyme action and 57-58, 141-145 
of glycolysis 74 

activator proteins see transcription 
activators 
active sites 

elastase and chymotrypsin 119F 
in enzyme activity 59, 136, 147 
lysozyme 145, 146F 
and regulatory sites in allostery 151 
see also binding sites 
active transport 

ion-concentration gradients 601-604 
primary and secondary active 
transport 602 
three methods 601 
transporters and 600-611 
activity-dependent synaptic change 
1211-1212 

acute lymphocytic leukemias 1117 
acyl transferases 689 
adaptation 

of neurons to prolonged stimulation 
635 

of system response to signals 825, 
830-831, 848-849 
in the visual system 846 
adaptive immune system 

antigenic variation and 1290-1291 
immunological self-tolerance 
1313-1315 
overview 1307-1315 
pathogen selection through 1290 
two classes of response 1298F 
see also B cells; T cells 
adaptor molecules 334, 338F, 341 
adaptor proteins 

actin-binding 956F 
for cadherins 1044 
catenins as 1042 
clathrin coated vesicles 698-699 
Grb2 824F, 855, 862F 
interaction domains as 823 
nuclear import receptors 652 
SH2 and SH3 domains in 854 
talin as 1075, 1080 
vesicle transport model 721 
ADARs (adenosine deaminases acting on 
RNA) 418 
adducin 592F 
adenine 

1-methyl- 271 


deamination to hypoxanthine 271, 
272F 

DNA base pairing with thymine 176, 
177F 

structure 100 

adenocarcinomas, as malignant 1092, 
1093F 

adenomas, as benign 1092, 1093F, 1220 
adenomatous polyps 1123 
adenosine deamination 335, 336, 337F 
S-adenosylmethionine 

as an activated carrier 69T 
DNA methylation damage from 267T, 
268 

transcription regulator binding 377 
adenovirus 1273T, 1274F, 1280F, 1281, 
1289 

adenylyl cyclase 834, 836, 843-844, 
846T, 848, 1278 
adherens junctions 

actin filaments in 1036, 1042, 1192 
assembly 1043F 
classical cadherins and 1037T 
in epithelial cells 893F, 924, 1036F 
response to force 1042-1043 
tissue remodeling 1043-1045 
adhesins 1277-1278 
adhesion belt 1036F, 1044, 1045F 
adipocytes 573 
adjuvants 1307 

ADP/ATP carrier protein 779, 780F 
adrenaline, GPCR mediated effects 827, 
832, 835T 

adrenoleukodystrophy 300F 
Aequorea victoria 543, 547 
aequorin 547, 839 

aerobic bacteria and mitochondria 25 
aerobic lithotrophs 11 
aerobic respiration 55 
Aeropyrum pernix 21T 
affinity chromatography 448-451, 459, 
484 

affinity maturation 1320-1321 

aflatoxins 1128-1129 

AFM (atomic force microscopy) 307F, 
481, 548-549, 587F, 895, 913 
Africa, human origins 232 
agarose, in gel electrophoresis 465-466 
age 

and cancer incidence 1094-1095, 
1111 

and DNA repair errors 274 
and mitochondrial mutations 808 
see also premature aging 
agent-based simulations 524 
aggrecan 1058F, 1060, 1061F 
agrin signal protein 1072, 1210 
Agrobacterium 508F 
AID (activation-induced deaminase) 
1322-1323 

AIDS (acquired immune deficiency 
syndrome) 
mortality 1263 
see also HIV 

AIRE (autoimmune regulator) 1333 
AKAPs (A-kinase anchoring proteins) 835 
Akt protein kinase 860, 1030F, 1114-1115 
alanine, structure 113 
albinism 490F, 729, 1186 
alcohol detoxification 667 
aldolase 104 


aldoses and ketoses 98 

Alexa dyes 537, 538F 

algae, as eukaryotes 30 

alkalis see bases 

alkylation lesions see methylation 

all-or-none responses 827-829, 830F, 857 

alleles, defined 286, 486, 490 

allergic reactions 1317 

allostery 

activation of Ca²⁺/calmodulin 841 
allosteric enzymes 151-153 
EF-Tu conformational change 
160-161 

in GPCR kinases 848-849 
inducing protein activation 828F 
inducing protein degradation 360 
integrin activation by 1077 
in ion channels 618, 825 
membrane transport proteins 
163-164 

in motor proteins and protein 
machines 162, 164 
in proton pumps 773-774 
and second messengers 848 
tryptophan repressor 381 
α and β subunits, ATP synthase 777 
α2 protein 120 
alpha satellite DNA 203, 204 
α-chains, collagen 1061-1062, 1063T 
α-chains, integrins 1075, 1078 
α-chains, laminins 1070F 
α-helices 

in bacteriorhodopsin 586-587 
discovery and description 115-116 
in the histone fold 188 
in intermediate filaments 945 
ion channels 617-618, 631F 
membrane anchoring 682 
membrane-spanning 579-581, 677, 
679, 680 

myosin lever arm 916 
in nascent proteins 349, 354 
in protein structural motifs 376-377 
as secondary structure elements 
117-118 

switch helices 161 
in transporters 603 
Alport syndrome 1071 
ALS (amyotrophic lateral sclerosis) 947 
alternative pathway, complement system 
1302-1303 
alternative splicing 

constitutive 324, 416 
definition of the gene 416-417 
multiple protein forms from 415 
positive and negative control 416F 
RNA sequencing and 482 
tropomyosin gene 319F, 416 
and voltage-gated ion channels 627 
Alu sequences 212F, 223F, 291-292 
Alzheimer’s disease 130, 868 
Amanita mushrooms 904 
see also phalloidins 
Ames test for mutagenicity 1128 
amino acid sequences 
alignments 463 
implying protein function 121, 
462-463 

specifying protein structure 109-114 
see also signal sequences 
amino acid starvation 425 
amino acids 

abbreviations 110F, 112 
activated amino acids 338F 
addition to polypeptide chains 
339-344 

codons specifying 7, 334 
coupling to transfer RNAs 336-338 
essential amino acids 86, 87F 
glutamine synthesis 66 
mitochondrial conversion to acetyl 
CoA 84 

multisite post-translational 
modification 166 
in the nitrogen cycle 85-86 
optical isomers 112 
polarity and ionization state 109-110, 
112-113 

at protein active sites 135-136 
as protein monomers 6 
selenocysteine 350 
side-chain modifications 196-197 
aminoacyl-tRNA synthetases 336-338, 
339F 

amoebae 182, 997F 

AMPA receptors (α-amino-3-hydroxy- 
5-methyl-4-isoxazolepropionic 
acid) 636-637 
amphibians 

metamorphosis 1182 
newts 1248-1249 
see also frogs 
amphipathicity 

of defensins 1298 

NADH dehydrogenase α-helices 768 
amphiphilicity of membrane molecules 
9, 566, 576 

amplification (of DNA) see DNA cloning 
amplification (of signals) 

in the caspase cascade 1023 
in intracellular signaling 824, 848, 857 
in microscopy 539-540 
of nerve impulses 621 
amygdala 935 
amyloid fibrils 130-133 
Anabaena cylindrica 14F 
anabolic reactions, NADPH in 68 
anabolism as biosynthesis 51-52 
anaerobic organisms 

anaerobic lithotrophs 11 
ATP synthase role 778 
glycolysis in 74-75 
proton gradients in 781 
analytical methods 
for DNA 463-466 
for proteins 452-462 
analytical ultracentrifuges 455 
anaphase 

chromatid separation 964, 994 
in meiosis 1005, 1009 
in mitosis 981 

anaphase A and anaphase B 994, 995F 
anaphase chromosomes 988F 
anchorage dependence 1079 
anchoring fibrils 1062, 1063T 
anchoring junctions 1036, 1037T 
AND and AND NOT logic 521-522, 523F, 
825 

androgen receptor signaling 1117 
anemia, spectrin deficiency 591 
Angelman syndrome 407 
angina 847 


angiogenesis 

endothelial tip cells 1236-1237 
involvement of VEGF and HIF1α 1237 
and metastasis 1120 
and vasculogenesis 1236 
Ångstrom units 531 
animal-vegetal (A-V) axis 1155-1156, 
1167-1168 

animals 

body plan inversion 1169 
body plans as conserved 1147-1148, 
1174 

extracellular matrix in 1057-1074 
model organisms 33 
regenerative ability 1247 
regulatory DNA and morphology 
1149, 1174-1175 
size differences 1010 
ankyrin 592F 

antenna complexes 788-789, 794F 
Antennapedia gene/complex 1161F, 
1162-1163, 1164F, 1169, 1170F 
anteroposterior (A-P) axis 1147, 1155, 
1157-1160, 1161F, 1162-1164, 
1169, 1206 
anthrax 1270 
anti-IAP proteins 1029 
anti-inflammatory drugs 838 
antibiotic resistance 

DNA-only transposons in 288 
horizontal gene transfer and 19, 1269, 
1292-1293 

plasmid segregation and 897 
Staphylococci (MRSA) 1276 
three mechanisms 1293F 
antibiotics 

bacterial targets 1293F 
as inhibitors of protein synthesis 351, 
352T 

lysozyme as 144 
misuse 1293 

penicillin 1267, 1291, 1293 
ribosome response to 800 
selective toxicity 1292 
vancomycin 1292-1293 
antibodies 

in affinity chromatography 449, 450 
anti-BrdU 966 
blotting techniques 454-455 
delivery of poisons 1137 
as immunoglobulins 1315-1316 
immunogold electron microscopy 557 
labeled, in electron microscopy 539 
labeled, in fluorescence microscopy 
537, 538F, 539-540 
major classes in humans 1318T 
number of potential antibodies 1309 
polyadenylation and release 417-418 
secreted by B lymphocytes 1297, 
1307, 1309 

transport in newborns 737 
as triggers of phagocytosis 739 
see also immunoglobulins; monoclonal 
antibodies 

anticancer drugs see cancer treatment 
anticodons, tRNA 7, 335-337, 338F, 339, 
341-342, 343F, 344-345 
antidepressants 632 
antigen presentation 1240, 1305, 1330F 
antigenic determinants 1316F, 1318, 
1337, 1338F 


antigenic variation, and pathogen 
evolution 1289-1291 

antigens 

adaptive immune system 1307 
dendritic cell processing 1305, 1330F 
Langerhans cells presentation 1240 
in the lymphatic system 1311 
recognition 138-139 
antiporters 602, 604, 607-608, 747F, 
769F, 781 

Antirrhinum 30F, 488F 
AP2 (adaptor protein 2) 699, 700F, 701, 
733-734 

Apaf1 (apoptotic protease activating 
factor-1) 1025, 1026F 
APC/C (anaphase-promoting complex/ 
cyclosome) 

Cdh1-APC/C 1003-1004, 1014 
completion of mitosis 992-993 
M-Cdk inactivation 1002-1003 
metaphase-anaphase transition 
970-971, 972F, 973, 975, 978, 
1009 

as a ubiquitin ligase 360 
Apc protein (adenomatous polyposis coli) 
870-871, 1112F, 1124 
Apc tumor suppressor gene 1123-1124, 
1220 

APCs (antigen-presenting cells) 1307, 
1311, 1328 

see also dendritic cells 
apical meristems see meristems 
apolipoprotein B 418-419 
apoptosis 

Bcl2 family proteins and 860F, 1026, 
1027-1028F 
in C. elegans 1194 
caspase cascade and 1022-1023 
cytotoxic T cells and 1334 
DNA fragmentation in 1024F 
in embryonic development 1022 
epithelial cells 1219 
extent of 1021-1022 
extracellular signals and 816 
extrinsic and intrinsic activation 
pathways 1023-1028 
Golgi apparatus fragmentation 722 
macrophages and 739 
mitochondrial proteins in 802 
p53 pathway and 1115-1116 
phagocytes and 1030-1031 
phosphatidylserine in 574, 690, 740 
reduction in cancer cells 1099, 1103 
response to irreparable DNA damage 
1015 

response to viral infection 1304 
suppression by survival factors 1011, 
1029-1030 

apoptosome 1025, 1026F 
aquaporins 580F, 599, 612-613 
Arabidopsis (A. thaliana) 

cell and organ size 1195F 
flowering 1183F, 1195F 
generating mutations in 488 
genome 880-881, 1084 
gravitropism 884F 
mitochondrial genome 805F 
as model organism 29T, 32-33 
mutant libraries 498 
totipotent cells 507 
vacuoles 724F 


Page numbers with an F refer to a figure; page numbers with a T refer to a table. 

arabinose metabolism 521 
arachidonic acid 837 
AraJ gene (E. coli) 521 
archaea 

bacteriorhodopsin from 586 
CRISPR system 434 
cytokinesis in 737 
eukaryote origins and 26, 27F 
Methanococcus jannaschii 16F, 676F 
as prokaryotes 14-15 
Sec61 complex 676F 
thermophilic 572 
Archaeoglobus fulgidus 21T 
Arf proteins 703, 854T, 1016 
arginine 

in ATP synthases 777 
deamination to NO 847 
in nuclear localization signals 650 
in the nucleosome 188 
structure 112 

Argonaute protein 428F, 430-434 
Arp2/3 complex 

actin filament nucleation 905-906, 
908F 

actin networks and 911, 953, 954F 
bacterial recruitment 913-914, 1281, 
1288, 1289F 
Rac activation 957-959 
Arp1 (actin-related protein 1) 939 
arrestins 845, 849 

arthritis, as a multigenic condition 493 
Ascaris (A. lumbricoides) 1265-1266 
asparagine 

in aquaporins 613 
homeodomain arginine contact 376 
N-linked oligosaccharides 683, 1057 
structure 113 
aspartic acid 

in P-type ATPases 607, 608F 
structure 113 

targeting by caspases 1022 
Aspergillus flavus oryzae 1129F 
aspirin 838 
assembly factors 130 
association constant 139F, 140 
astral microtubules 

dyneins and 984-985, 994 
spindle positioning 982, 983F, 987, 
1001-1002 

astral relaxation model 999 
astral stimulation model 998-999 
asymmetric cell division 1001-1002, 
1153, 1173-1174, 1222, 1223F 
ataxia telangiectasia (AT) 266T, 276, 1015 
ATF6 protein 688 
ATG9 protein 726 
atherosclerosis 733, 1265-1266 
ATM protein 266T, 276, 1014-1015 
atomic force microscopy (AFM) 307F, 
481, 548-549, 587F, 895, 913 
atomic number and electron density 556 
atoms and cells, scale of 530F 
ATP (adenosine triphosphate) 
as an activated carrier 64, 65 
as an energy carrier 8 
binding in cyclin/Cdk complex 970F 
daily turnover in humans 774 
powering condensation reactions 
65-66, 70-73 

production by fermentations 75 
production by glycolysis 74-78, 85 


production by mitochondria and 
chloroplasts 753 
production by the citric acid cycle 
84-85 

substrate of Src protein kinase 74-78, 
85 

turnover and metabolic rate 148 
ATP-dependent chromatin remodeling 
complexes 190-193 
ATP-dependent proteases 357, 358F 
ATP-driven pumps 601-602, 606-607 
ATP hydrolysis 

actin catalysis of 894, 901, 904, 954F 
in active transport 60, 602, 605-606, 
608F, 838 

aminoacyl-tRNA synthetases 337 
chaperone operation 356 
DNA helicase operation 246F 
DNA ligase operation 246 
by dyneins 938F 
by kinesins 936, 937F 
in lysosomes 723 
macromolecular synthesis 70-73 
by myosin heads 916 
NPC translocation 655 
overall usefulness 65 
in phospholipid translocators 690 
in the proteasome 358, 359F 
protein import into mitochondria 661 
protein import into peroxisomes 667 
protein import into the ER 682 
spliceosome RNA rearrangements 
321 

ATP synthases 586, 590, 606 
aggregation 590 
bacterial 780-781 
c subunits 777-778, 804 
of chloroplasts 787, 793, 794 
dimerization 778, 779F 
in mitochondria and chloroplasts 794, 
795F 

Na⁺-driven 781 

powered by electrochemical gradients 
586, 606, 774 

as protein machines 164, 754, 
776-778 

related enzymes 778, 781 
reversibility as a proton pump 778 
ATP synthesis 

driven by an electrochemical gradient 
761, 763 

electron-transport chain and 84-85 
in glycolysis 781 
in mitochondria 774-782 
in mitochondria and chloroplasts 658, 
758 

thylakoid membrane as site of 
786-787 

ATPases, F-type see ATP synthases 
ATR protein 1014, 1015F 
attachment to surfaces (cell migration) 
951, 952F, 955 
auditory epithelium 1227 
auditory System 1204 
AUG codons (translation initiation/ 
methionine) 334, 347-348, 
424-425 

Aurora kinases (Aurora-A and -B) 978, 
985-986, 990 
autism 494 

autocatalytic process, life as 7F 


autocrine action, Type I interferons 1304 
autocrine signaling 815 
autofluorescence 544F 
autoimmune diseases 
AIRE regulator 1333 
bullous pemphigoid 1076F 
FoxP3 gene and 1336 
lymphocyte survival and 1031 
myasthenia gravis 1315 
Type I diabetes 1315 
autoimmunity, T-cell inhibitory receptors 
1138 

autophagosomes 725F, 726-727 
autophagy 

function 726-727 

as a lysosome delivery pathway 725 
as selective or nonselective 726 
of sperm-derived mitochondria 807 
autophosphorylation 364F, 688, 841, 
842F, 850, 878 
plant photoproteins 884 
transautophosphorylation 851-852F 
autoradiography 452, 454, 466F, 1212F 
autosomes, defined 486 
autostimulation 518 
auxilin 702 

auxin signaling in plants 881-883, 1087 
axin protein 870 
axonal neurofilaments 944T, 947 
axonemal dyneins (ciliary dyneins) 
937-938, 942 

axonemes 927F, 931F, 938F, 941-943, 
950F 

axons 

electrical activity and synapse 
modification 1211-1212 
elongation 844, 1202 
growth cones 858, 943, 951, 
1201-1204, 1206, 1208-1211 
guidance mechanisms 1202-1204, 
1206 

initial segment (axon hillock) 634 
microtubule orientation 940 
in neural development 1198, 1199F, 
1200-1202, 1203F, 1204-1212 
retrograde and anterograde axonal 
transport 938 

role in neurons 620-621, 940 
self-avoidance 1206-1207 
spectrin in 913 
azides 772 

AZT reverse transcriptase inhibitor 1292 

B 


B7 proteins 1337-1338 
B-cell inhibitory receptors 1337 
B cell lymphoma 1031 
B cell receptors (BCRs) 1315-1317, 
1321-1322, 1336-1338 
B cells (B lymphocytes) 

antibody secretion by 1297, 1307 
apoptosis inhibition by Bcl2 
1246-1247 

bone marrow origin 1308 
in Burkitt’s lymphoma 1106 
class switching 1320, 1322-1323, 
1335-1336, 1338F 

control of antibody forms 417-418 
extracellular signal and activation 
1336-1337 
gene segments 1319-1320, 1321F, 
1325, 1332 

immunoglobulins and 1315-1324 
and monoclonal antibodies 444 
rough endoplasmic reticulum in 1309 
B-Raf oncoprotein 1136 
babies see infants 

BAC (bacterial artificial chromosome) 
469, 471, 479 
Bacillus anthracis 1270 
Bacillus subtilis 

actin homologs in 897F 
gene families 17, 18F, 21T 
“backstitching” mechanism 242, 244 
bacteria 

ABC transporters in 163F 
aminoacyl-tRNA synthetases 336 
classification by shape 1267F 
control of translation 422-423 
DNA replication in 253-255 
Gram-positive and Gram-negative 
610F, 1267F 

green sulfur bacteria 796 
ion channels 617-620 
largest 13F 

mutation rates 237-238 
N-formylated peptide markers 958 
phagocytosis by host cells 
1281-1282 

as prokaryotes 14-15, 1266 
structure 13F 
thermophilic 473F, 483 
transcription in 306, 307F 
transposon frequency in 288 
transposon types, characteristic 292 
use in DNA cloning 467-469 
use of snRNAs against viruses 
433-434 

see also Escherichia coli 
bacterial cells, sizes 644 
bacterial cytoskeleton 896-897 
bacterial flagella 942 
bacterial origins 

of chemiosmosis 780-781 
of mitochondria and chloroplasts 
25-28, 644, 798F, 806-807 
bacterial pathogens 

cytoskeleton hijacking by 913-914 
extracellular bacterial pathogens 
1269-1271 

bacteriophage lambda 

Cro repressor protein 123F 
virus receptors and 1279 
bacteriophages 
CTXφ 1269 

T4 bacteriophage 19F, 324 
T7 bacteriophage 243 
virulence genes 1268 
as viruses 18, 19F 
bacteriorhodopsin 

hydropathy plot 579F 
structure and function 586-588, 591F 
Bad proteins 860F, 1030 
Bak protein 1027-1028 
Balbiani Ring genes 326 
BAM complex 662F 
band 3 protein 592F, 605 
band 4.1 protein 592F 
BAR domains 701, 702F 
“barcoding” mutant organisms 498, 499F 
Bardet-Biedl syndrome 943 


barrier DNA sequences 195F, 202, 210, 
391 

basal and basolateral surfaces, epithelial 
cells 590, 605 
basal bodies 943 
basal cell carcinomas 873, 1092 
basal lamina 

and endothelia 1235 
and epithelia 749, 1035, 1062, 
1068-1069 

and epithelial cancers 1093F, 
1096-1097F, 1123 
functions 1070-1072 
organization 1069-1070 
as specialized extracellular matrix 
1068-1069 
in synapses 1209 
base catalysis 144, 145F 
base excision repair 269, 270F 
base-pairing 

bond strengths 255 
and edge recognition 374 
and homologous recombination 
277-278, 280 

limitations of complementary pairing 
345 

role in RNA folding and templating 
363-364 

role in RNA interference 429 
wobble base-pairing 335, 336-337F, 
342, 804 

base-pairing, in DNA 

antiparallel strand arrangement 176 
in DNA synthesis 4 
in replication and repair 239-240, 255 
RNA as complementary 302-303 
basement membrane see basal lamina 
bases (nucleotide) 

in DNA 175-176, 177F 
in RNA 302 
structures of 100 
tautomeric forms 242 
unnatural, from DNA deamination 
271-273 

unusual, in tRNAs 335F, 337F 
bases (proton acceptors) defined 46, 93, 
763-764 

basic amino acid side chains 112 
basophils 1239, 1240F, 1241T, 1245, 
1317 

Bax protein 1027-1028 
Bcl2 family 

ABT-737 and 1032F 
as apoptosis-inhibitory proteins 860F 
in B cell lymphoma 1031, 1047 
pro- and anti-apoptotic 1026, 
1027-1028F 

regulation of the intrinsic pathway 
1025-1028 

survival factors and 1030 
BclXL protein 1026-1027, 1030-1031, 
1032F 

Bcr gene, in Bcr-Abl hybrid 1135 
BCRs (B cell receptors) 1315-1317, 
1321-1322, 1336-1338 
Beggiatoa 14F 
benign tumors 1092, 1093F 
benzo[a]pyrene 270, 1128 
β-adrenergic receptors, structure 
832-833F 
β-lactamases 1293 


β₂-microglobulin 1327, 1328F, 1330T, 
1339 

β-barrels 

in porins 758 

transmembrane proteins 579-581, 
659, 662-663 

β-oxidation, fatty acids 667 
β-sheets 

amyloid fibrils as 130-131, 133 
discovery and description 115-116 
in DNA recognition proteins 377 
immunoglobulin (Ig) domains 
1318-1319 

as secondary structure 117-118 
BH domains (Bcl2 homology) 1026, 
1027F 

BH3-only proteins 1027-1028, 
1030-1031, 1032F 
bi-orientation, chromatids 988-990, 
993-994, 1006, 1009 
Bicoid gene 1158-1159 
Bicoid transcription activator 393F, 394, 
395F, 422, 1158-1160 
Bid protein 1028 
bilayers, lipid see lipid 
bimetallic centers, cytochrome c oxidase 
772 

binding constant (Km) 601-602 
binding interactions 

cooperative binding 516-517 
proteins with promoters 510-512, 
515F 

binding sites 

allostery 151-153 
Ca²⁺/calmodulin 841F 
cryptic binding sites 1043, 1068 
disordered regions as 126 
equilibrium constants and 138-140 
evolutionary tracing of 136-137 
integrins 1075 

loop regions as 118F, 121, 138 
multienzyme complexes 148 
for nucleotides in G proteins 833F 
for phosphorylated amino acids 154 
polypeptide subunits 123 
RGD sequence 1067-1068, 1075, 
1078 

ribosomes, for antibiotics 351F 
ribosomes, for RNAs 341-342, 347 
specificity 134-135 
transporters 600 

see also active sites; docking sites; 
ligands 

biofilms, and amyloid fibrils 132 

bioluminescence 547 

biotin 

carboxylated biotin 69T, 70F 
as a coenzyme 147 
nucleotide labeling 467 
BiP protein (binding protein) 677, 678F, 
683, 712-713 

BIR domains (baculovirus IAP repeat) 
1029 

1,3-bisphosphoglycerate 55-59F, 77-79F, 
105, 760, 786F 
bistability 

and positive feedback loops 518-520, 
829 

and robustness 520 
Bithorax gene/complex 1162-1163, 
1164F, 1169, 1170F 
bivalents 1006, 1007-1009F 
BLAST program 462, 463F 
blastemas 1249 
blastocyst stage 1253, 1254 
blastoderm stage 1159F, 1165F 
cellular blastoderms 1157 
syncytial blastoderms 1157-1158, 
1157-1161, 1165 
blastomeres 

differentiation 1148F 
maternal-zygotic transition 1181 
mouse 1156 
blastula stage 

fate maps 1159F, 1167 
gastrulation process 1147 
germ layers 1148, 1167 
pluripotency 1148 
blebbing 953-953, 1186 
blood cell formation 

in a hierarchical stem cell system 
1239-1247 
listed 1241T 
myeloid cells 1240 
see also red; white blood cells 
blood clotting 1067-1068, 1077 
blood sinuses 1242 
blood vessels 
arteries 1235F 
capillary growth in bone 1231 
elastin in 1065 

endothelial cells in 1235-1238 
smooth muscle relaxation 847F 
VEGF and 1237 
walls 1238 
bloodstream 

cell-cell adhesion 1054-1055 
metastases use of 1101-1102 
BMP (bone morphogenetic protein) family 
865, 1168-1169, 1191F, 1199, 
1200F 

body patterning see spatial patterning 
body size 

cell numbers or cell sizes 1193 
IGF1 and 1196 

bond angles, in polypeptides 110, 111F, 
112 

bond energies 

in activated carriers 63-64 
in citric acid cycle 759 
of phosphate bonds 78, 79F 
bond strengths 

base-pairing 255 
bond types 44F, 45T, 92 
bone 

remodeling and repair 1230-1232 
resorption 1231 

trabecular and compact 1230-1231 
bone marrow 

B cells migration to 1312 
B cells origin in 1308 
stromal cells 1229 

bone matrix and osteoblasts 1229, 1230F 
Boolean networks 524 
Bordetella pertussis 1270F, 1277-1278 
Borrelia burgdorferi 1264F, 1286F 
BP230 adaptor protein 1037T, 1076 
brain 

CaM-kinase II in 841 
cerebral cortex 1200, 1201F, 1205, 
1211 

human, number of neurons 627, 1198 


monkey brain 1212F 
protein abundances in liver and 372F 
transmitter-gated ion channels 636 
see also amygdala; hippocampus 
brain atlas project 502F 
“brainbow mice” 502F 
branch-migration reactions 284-285 
branching morphogenesis 1190, 1191F 
brassinosteroids 881 
Brca1 and Brca2 genes 267, 1116, 
1133-1135 

Brca1 and Brca2 proteins 281-282, 1134 
BrdU (bromodeoxyuridine) 966, 967F 
breast cancer 

Brca1 and Brca2 proteins and 
281-282, 1134 

chromosome abnormalities 1097F, 
1111 

E-cadherin gene and 1122 
epigenetic changes 1110F 
genomes 1118, 1119F 
Her2 kinase and 1137 
incidences 1093F 
tumor growth 1094F 
variable differentiation in 1122 
Bri1 protein 881 

bright-field microscopy 533-534, 535F, 
940F 

bristles, mechanosensory (Drosophila) 
1172-1173 
bromodomains 388F 
brown fat cells 780 
brush-border cells see absorptive cells 
buffers 46 
bullfrogs 560F 
bullous pemphigoid 1076F 
bundling proteins 911-912 
Burkholderia pseudomallei 1287, 1288F 
Burkitt’s lymphoma 1107, 1128F, 1130T 
burns, treatment 1250 

C 

C. elegans see Caenorhabditis 
c-Cbl protein 853 

C3 complément component (C3a and 
C3b) 1302-1303 

C9 complement component 1303, 1334
C-H bond reduction and oxidation 56,

78F 

c-Src gene 1105 

c subunits, ATP synthase 777-778, 804 
C-terminal domain (CTD) 310F, 311-312, 
311T, 316-317

C-termini 

attachment of GPI anchors 688 
membrane anchoring 682 
polypeptide backbones 110F, 339 
signal sequences at 647, 667 
soluble and membrane-bound 
immunoglobulins 1317 
transcript cleavage site and 417-418 
C-to-U editing (cytosine to uracil) 418, 
419F 

CA repeats, as genetic markers 233 
CAD endonuclease 1024F 
cadherins 

cadherin superfamily members 1039F 
cell-cell junctions mediated by 1037F, 
1038-1046 

in cell sorting 1187-1188 


E-cadherin gene and breast cancer 

1120, 1122

E-cadherins 1038, 1041-1042, 1045F, 
1281, 1284F 

in embryonic development 
1040-1041, 1190 

and homophilic adhesion 1038-1042 
M-cadherins 1234F 
N-cadherins 1038, 1041, 1056, 1202 
P-cadherins 1038 
structure and function 1040F 
as transmembrane adhesion proteins 
1037 

cadmium selenide 538F 
Caenorhabditis elegans 
adult worm 1180F 
astral relaxation model 999 
asymmetric cell division 1002F 
behavioral changes 488F 
cell numbers 1194 
genes for voltage-gated ion channels 
627 

heterochronic mutants 1180, 1181F 
human Bcl2 and 1025 
kinesins 936 

loss of miRNA genes 1149 
as model organism 29T, 33 
MTOC 931F 
mutant libraries 498 
neurons 913 
RNA interference 499
sarcomeres 920
caged molecules 544, 545F
Cajal bodies 213, 331-332, 544F 
CAK (Cdk-activating kinase) 970, 973T, 
979 

calcineurin 655F 
calcium (Ca2+) ions

ATPase pump in muscle SR 606-607, 
920 

buffering 761 

Ca2+-activated K+ channels 635, 636F
Ca2+-release channels 607, 632F, 633
Ca2+ spikes 839-840
in cell adhesion 440, 1038-1039, 

1054 

in cell wall cross-linking 1084 
fertilization and 547F, 839F 
integrin binding and 1075 
IP3 receptors and 837
LTP and LTD and 637 
monitoring with indicators 546-547 
in muscle contraction 920-923 
PKC and 837, 838F, 852
in the regulated secretory pathway 
744-745 

release from ER 574 
as a second messenger 819-820 
as a signal mediator 838, 920 
storage by ER 669, 670-671 
triggering membrane repair 748 
calcium (Ca2+) pumps 607-608, 671, 838,
840, 920 
calcium oscillators 

delayed negative feedback 516,
839-840 

frequency decoding 842-843 
calcium phosphate (hydroxylapatite) 1229 
calculus 512 
calluses 442, 507 

calmodulin 840-841, 842F, 843, 846,
921-922 
calnexin 685, 712
calreticulin 685 

Calvin cycle (carbon fixation cycle) 755F, 
785-786, 787F 

CaM-kinases (Ca2+/calmodulin-dependent
kinases) 841-843
CaM-kinase II 841, 842F, 843
cancer cells 

aberrant properties required 1103 
abnormal behavior in culture 1098 
abnormal glucose metabolism 1098 
abnormal proliferation 1092, 1098,
1099-1100 

abnormal surface MHC proteins 
1304, 1305F 

cross-presentation and 1329 
defining behaviors 1092 
dendritic cell cross-presentation 1329 
epigenetic changes 1094, 1096-1097, 
1109-1111, 1125-1126 
genetic analysis 1102 
genetic instability 1097, 1103,
1111-1112, 1116, 1125,
1133-1134

genome sequencing 1095, 

1109-1111, 1119, 1137, 1141 
immune defenses 1137-1138 
immunity and T-cell inhibitory 
receptors 1138 

loss of anchorage dependence 1079 
NK cells/T cells attacking 1304F, 
1334F 

oncogene dependence 1135 
somatic mutations in 1094, 1104, 

1112 

transformed cell lines from 443 
variable differentiation in 1121, 1122
cancer-critical genes 

cancer genome changes and 1111, 
1141 

discovery and effects of 1104-1126 
mutational landscapes 1112F 
proportion of human genome 

1112-1113

Ras-MAP kinase pathway 1137F 
studies in mice 1117-1118 
see also oncogenes; tumor suppressor 
genes 

cancer (generally) 

Apc gene mutations and 871, 1124,
1125 

apoptosis and B cell lymphoma 1031 
environmental and lifestyle factors 
contributing to 1127-1129 
epithelial-mesenchymal transitions 
1042 

five-year survival rates 1128 
Hedgehog hyperactivity and 873 
incidence and mortality 1091-1092, 
1093F, 1095F, 1127, 1128-1129F 
keratins in diagnosis 946 
as a microevolutionary process 
1091-1103 

Myc gene mutations and 1016 
need for multiple mutations 1118, 
1125-1126 

p53 gene mutations and 871, 1016, 
1031, 1115, 1125-1126 
pathogens contributing to 
1265-1266, 1289 
pathways commonly disrupted 

1113-1116


PTEN phosphatase mutations and 
859 

Ras hyperactivity and 854-855, 1016, 
1106, 1123, 1125 
Rb gene mutations and 1108 
RNA splicing errors and 324 
RTK regulation breakdown and 853
T cell inhibition 1337 
telomerase production 10165 
see also cancers; carcinogenesis 
cancer stem cells 1120-1122, 1124 
cancer treatment 

cancer stem cells and 1121-1122 
combination therapies 1139-1140
curable cancers 1132 
cytotoxic drugs 1132-1133, 1140 
drug resistance 1135-1136, 1139
drug targets and drug nomenclature 
1137F 

drugs targeting Bcl2 family proteins 
1032F 

drugs targeting topoisomerases 253F 
immune response enhancement 
1137-1139 

personalized 1139, 1140
present and future 1132-1141
radiation 1132 

sequencing tumor genomes and 506 
targeted, synthetic-lethal treatments 
1133 

targeting oncogenic proteins 
1135-1137 

targeting rapid division 1122 
targeting the Ras-MAP-kinase 
pathway 1137F 
Taxol® usefulness 929 
cancers (specific instances)

arising in self-renewing tissues 1120 
caused by infectious agents 
1105-1106, 1129-1132 
cell motion in metastasis 951, 952
classification by appearance 1097 
classification by causative mutation 
1092 

classification by cell type 1092 
derived from one abnormal cell 
1093-1094 

genetic diversity 1096, 1118-1119 
hereditary 250, 282-282, 1107-1108, 
1124-1125 

incubation period 1095 
matrix degradation 1072
multidrug resistant 610
origins in mutant clones 1091-1092 
preventable 1127-1132 
see also breast; colorectal; tumors
Candida albicans 349, 1286F
canonical Wnt pathway 869-870 
CAP (catabolite activator) protein 
382-383, 522-523 
cap snatching, mRNAs 1288 
capillaries 1231, 1235-1237F 
capping 

actin filaments 903, 909, 910F, 918 
decapping 426-428 
microtubules 927 
of mRNA 5' ends 315-317, 422
CapZ capping protein 909, 914, 919, 
920F 
carbohydrates 

Golgi apparatus as site of synthesis 
711 


plasma membrane protective layer 
582, 583F 

carbon cycle 55, 785 
carbon fixation 12 

in chloroplasts 783-786 
by cyanobacteria 782 
carbon fixation cycle 755F, 785-786, 

787F 

carbonyl oxygen, in ion channels 612, 
618, 619F 

carboxylated biotin 69T, 70F 
carboxylation stage, carbon cycle 785 
carcinogenesis 

chemical carcinogens 270,
1094-1095, 1127-1129 
link to mutagenesis 1094 
radiation and 1094, 1128F, 

1132-1133 

viral carcinogens 1130-1132 
carcinomas, defined 1092 
cardiac muscle see heart 
cardiac myopathy 948 
cardiolipin 760, 772 
CARDs (caspase recruitment domains) 
1026F 

cargo receptors 698, 711 
cargoes 

cytoskeletal filaments 896, 924-925 
of motor proteins 896 
and transport vesicles 695 
β-carotene 508
carotenoids 789, 791F
carrier proteins 816F, 876 
carriers see transporters 
cartilage 1057-1058, 1060, 1061F, 1063T, 
1064, 1229 

cartoon summaries 509-510 
Cas9 497-498 

Cas (CRISPR-associated) proteins 434 
Cas transcription regulator 1179 
casein kinase 1 (CK1) 870 
caspase cascades 1023-1025, 
1028-1029, 1301, 1334 
caspases 

in apoptosis 1022-1023, 1334 
caspase-8 1024-1025, 1026F, 1028 
caspase-9 1025, 1026F
cytotoxic T cells and 1334 
executioner caspases 1022-1025, 
1026F 

IAPs and 1029

in inflammatory response 1301 
initiator caspases 1022-1025, 1028
Caspr protein 625F 
catabolism 

as food breakdown 51, 52F
NADH in 68 

of sugars by glycolysis 74-78 
catalase, in peroxisomes 666-667 
catalysis 

and energy use 51-73 
enzymes as catalysts 48, 51, 57,
140-146

regulation of enzyme activity 149-151
ribosomes as ribozymes 346-347 
ribozymes as catalysts 51, 69,
363-364 

rotary catalysis in ATP synthases 
776-778 

simultaneous acid and base 144, 
145F 
speed of protein and ribozyme 363
catalytic cascades (enzyme cascades) 

848, 873, 881 

catastrophe, microtubules 927-928, 
932-935, 986 

catastrophe factors 934-935, 986 
catenins 

as adaptor proteins 1042 
α-catenin 1042-1043, 1044F
β-catenin 868-871, 1037T, 1042,

1045, 1124 

p120-catenin 1042, 1046F 
γ-catenin (plakoglobin) 1037T, 1042,
1046F 

Caulobacter crescentus 897, 898F
caveolae 

in endocytosis 572 
as pinocytic vesicles 731-732 
TGFβ inactivation 866
CBC (cap-binding complex) 317, 323F, 
325, 327F 

CBP (CREB-binding protein) 836 
CCR5 receptor 1279 
CD4 co-receptors 1330T, 1331-1333, 
1335-1338 
CD4 receptor 1279 
CD8 co-receptors 1330T, 1331-1333, 
1335-1336, 1337F, 1338 
CD31 protein 1312 
CD40 receptors 1337 
Cdc2 protein (now Cdk1) 463F, 969T
Cdc3 protein 950F 
Cdc6 protein 975, 976F 
Cdc12 protein 950F
Cdc14 phosphatase 995
Cdc20 protein 971, 972F, 973, 993-994,
1003 

Cdc25 phosphatase 970, 973T, 979, 1014 
Cdc28 protein (now Cdk1) 969T
Cdc42, as Rho family member 854T, 858, 
956-957 

Cdh1 protein 971, 973T, 1003-1004,
1013-1014 

Cdks (cyclin-dependent kinases) 

and cyclins in vertebrates and yeast 
969T 

inactivation 1002, 1116
role in cell cycle control system 968
cDNA (complementary DNA) 
cloning 470-471 
DNA microarrays and 503, 504F 
RNA sequencing using 477, 503-504 
CDP-choline (cytidine-diphosphocholine) 
689F 

Cdt1 protein 975, 976F
cell adhesion 

at cell-matrix junctions 1074 
cell-substratum adhesion 954-955, 
956F 

loss of, in apoptosis 1023 
protein CD31 1312 
see also cell-cell adhesion 
cell cannibalism 1248 
cell-cell adhesion 
β-catenin and 870
in the bloodstream 1054-1055 
control of 582-583, 870 
desmosomes in 946 
embryonic cell sorting 1187-1188 
immunoglobulins in 1055-1056 
T cell binding 1325-1326 


tissue remodeling 1043-1045 
vertebrate embryo patterning 1178 
cell-cell contact 

contact-dependent signaling 815F 
desmosomes 946 
hematopoietic stem cells 1244 
immunological synapses 1344F 
lateral inhibition dependent on 1152,
1173F 

Rac and Rho in actin organization at 
957, 958F 

signaling dependent on 1150
cell-cell junctions 

anchoring junctions 1036, 1037T 
cadherin mediation 1037F, 1188
focal adhesions 863 
intermediate filaments 891 
major forms 1038-1056 
microtubules 931-932 
planar cell polarity 1190 
repulsive interactions 1188, 1206
see also cell-matrix junctions; gap 
junctions; tight junctions 
cell coats 582, 583F 
cell cortex 

actin filaments beneath 592, 891, 907
cortical cytoskeleton 591-593 
in mitosis 913 
cell cycle 

accessory proteins and 895 
Caenorhabditis elegans 33 
changes in the nucleolus 330F 
eukaryotic 185, 258F, 964-966 
homologous recombination use 275 
length and cell type 1012 
M and S phase 963 
mitosis 978-995 
model organisms 966 
overview 963-967 
permanent arrest 1016 
regulatory transitions 967 
response to DNA damage 276, 
1014-1015 

Saccharomyces cerevisiae 31-32 
subnuclear structures and 331 
suggested timekeeping role 1180
temperature-sensitive mutations and 
489 

withdrawal of sympathetic neurons 
1018 

see also cell division 
cell-cycle control system
overview 967-974 
regulatory proteins 973T, 1108, 1115 
resetting 975, 1003 
transcriptional regulation in 971
cell-cycle network, feedback 516 
cell death, programmed 

apoptosis and necroptosis 1021 
contrasted with necrosis 1021, 1022F, 
1099, 1115-1116 
see also apoptosis 
cell determination 1148
cell division 

asymmetric 1001-1002, 1153,
1173-1174, 1222 

control of cell growth and 1010-1018 
cytoskeleton in 890, 892F 
Lgr5 cell cycle times 1221 
limits, for human cells 1016, 
1099-1100 


mitogens in 1011-1012 
rate in hematopoietic stem cells 1243 
see also meiosis; mitosis 
the cell doctrine 529 
cell fate determinants 1002, 1243F, 1246
cell-free cloning 474
cell-free systems 451, 673, 1224F
cell fusion prevention, by membrane lipids
570 

cell growth 

control and the PI-3-kinase-Akt 
pathway 861 
control of, in plants 1085 
control of cell division and 1010-1018 
distinguished from cell proliferation
1011 

key regulatory pathways 1113-1114 
see also growth factors 
cell lines 

eukaryotic 442-444 
mouse fibroblast 1106 
RNA interference 499
cell mass regulation 1194-1196
cell-matrix junctions 
actin-linked 1036 
anchoring junctions 1037T 
epithelial tissue 1035-1036
integrin mediation 1037F, 1079
response to mechanical forces 
1080-1081 

transmembrane receptors 1074-1081 
cell membranes 

composition 571T 
proportion of protein 576 
three classes of lipids 566-568 
see also plasma membranes 
cell memory 

and chromatin structure 194, 197, 

206, 387 

and differentiation 392, 397F, 401-402
in embryonic development 1148, 

1150, 1162, 1164 
Hox complexes and 1164 
intracellular signaling pathways 825, 
829, 843F

overriding with cell reprogramming 
1251 

and positive feedback 520, 829 
reinforcement mechanisms 404-413 
see also epigenetic inheritance 
cell migration 

cell sorting in 1188 
chemotaxis and 958-960 
cytoskeletal coordination in 959-960 
endothelial cells 1235
environmental cues 1185-1186
neurons 1200, 1201F
polarization and 951-960 
survival factors and 1186-1187 
three components 951 
cell numbers 

body size correlation with 1193
regulation as cell mass 1194-1196
cell plates 1000-1001, 1082
cell polarization 748-750 
in epithelia 749, 1047 
microtubules 927, 940F 
and migration 951-960 
planar cell polarity 1189-1190 
and Rho family proteins 955-959 
role of the cytoskeleton 892-893
cell proliferation

accompanied by cell growth 
1016-1017 

in cancers 1092, 1098, 1099-1100, 
1104 

by clonal expansion 1310 
dependent on mitogens 1017
distinguished from cell growth 1011 
integrins in the control of 1079 
in intestinal crypts 1218-1219 
cell recognition

glycolipids in 575 

surface oligosaccharides in 582, 720 
cell reprogramming 1251-1253 
cell signaling 

alternative routes in gene regulation
867-880 

coordination of spatial patterning 
1150 

corralling and 593 
disordered regions in 126F
extracellular matrix role 1073
by germ layer 1187 
in plants 880-885 
principles of 813-831 
through enzyme-coupled receptors 
850-867 

through G protein-coupled receptors 
832-849 

see also inductive signaling 
cell size 

body size correlation with 1193
and ploidy 1194 

regulation as cell mass 1194-1196
regulation by growth factors and
mitogens 1018F
regulation by vacuoles 725F
cell sorting 1041, 1122, 1188-1189 
cell stress see stress 

cell-surface immunofluorescence staining 
591F 

cell-surface proteins 

displayed by dendritic cells 1306F, 
1326, 1327F, 1330 
Ig superfamily 1338-1339 
oligosaccharides on 720 
and phagocytosis 740 
cell-surface protrusions 

bacterial replication 1287, 1288F
blebbing 953-953, 1186
in cell migration 951-953, 954F, 955, 
956-957F, 959, 1185-1186 
pedestals 1278 

role of the cytoskeleton 892-893
stereocilia 924 
cell-surface receptors 

death receptors 1024-1025 
immunoglobulins as 1315-1316 
inhibitory receptors 1245, 1304,
1305F 

intracellular signaling molecules
819-820 

lymphocyte differentiation 1309
survival factor binding 1030 
use by phagocytic cells 1301 
virus receptors 1279 
see also TCRs 
cell survival 

integrins in the control of 1079 
mouse epithelial cell lifetimes 1219
PI 3-kinase and 860 
regulation of 1246-1247


cell types 

in blood 1239 
and cDNA libraries 470 
and cell cycle length 1012 
and cell reprogramming 1251 
and cell wall composition 1082 
characteristics preserved in cancers 
1092 

different effects of acetylcholine
816-817 

embryonic development 1047 
from induced pluripotent stem cells 
1257 

interconvertibility in connective tissues 
1228 

number in the human body 1217 
RNA splicing and protein variants 416 
segregation in the gut 1224
specialized for contraction 1232 
from stem cells in culture 1169 
transdifferentiation 1258 
see also differentiation; tissues
cell walls 

in bacteria 896, 1292 
in plants 26, 1000, 1053, 1081-1087 
primary cell walls 1082-1085 
in prokaryotes 13 
secondary cell walls 1082-1083, 
1085-1086 

cells 

appearance at different magnifications 
529, 530F 

biochemical similarity 8 
catalysis and energy use 51-73 
centering, by microtubules 931, 932F
chemical components of 43-51
comparative sizes 29F, 529 
composition by weight 48F 
eukaryotic and bacterial, sizes 644, 
645F 

extracts 447-448 
growing in culture 440-445 
introducing altered genes 495 
isolation from tissues 440 
locomotion by crawling and swimming 
951 

mechanical interaction with the ECM 
1064 

number in human body 2 
number of proteins in eukaryotic 641 
regeneration and repair 1247-1251
subcellular fractionation 445-447 
universality of 1-2 
water content of 535 
cellular blastoderms 1157 
cellularization 748, 1002, 1003F, 1160, 
1165 
cellulase 461 
cellulose 1083-1086 
cellulose synthase 1085 
CENP-A (centromere protein-A) variant of 
histone H3 198F, 203-204 
central dogma 299 
central nervous system

asymmetric segregation in 1173-1174
Drosophila 1179 
neural stem cell treatments 1250 
origins 1199 
synapse formation 1210 
see also brain; spinal cord 
central spindle 997, 999, 1000F 


central spindle stimulation model 999 
centrifugation and the ultracentrifuge 
445-447, 455 

centrioles 930, 931F, 943, 982, 985F 
centromeres 

centromeric chromatin 203-204 
creation of human centromeres 204
fluorescence microscopy 538F 
heterochromatin maintenance 
432-433 

positions in human chromosomes 
181F 

role and size 186
centrosome maturation 985 
centrosome positioning 949, 959-960 
centrosomes 891, 930-932, 933F,

935-939, 943, 981-982, 983F 
duplication 984-985 
ceramide 690 

cerebral cortex 1200, 1201F, 1205, 1211
cervical cancers (uterine cervix) 1093F, 
1096F, 1129, 1131, 1265 
CESA (cellulose synthase) genes 1085 
cesium chloride 447
CFTR (cystic fibrosis transmembrane 
conductance regulator) gene/ 
protein 225F, 611 
CG islands 406-407 
CG sequences (CpG sequences)
loss of, in vertebrates 406-407 
methylation and its inheritance 404
TLR9 recognition 1300, 1304
Chagas disease (Trypanosoma cruzi) 
1283-1284, 1290 
channelrhodopsins 588, 623-624 
channels 

distinguished from transporters 599 
membrane electrical properties and 
611-637 

as water channels or ion-channels 
611 

see also ion channels 
chaperones, histone 190, 192F, 198, 262, 
313 

and transcription activators 386-387 
chaperones, molecular 

BiP (binding protein) 677, 678F, 683, 
712-713 

calnexin and calreticulin 685, 712 
hsp60 355-357, 662 
hsp70 family 355-357, 659-662, 683, 
702 

mitochondrial precursor proteins and 
660-662, 664 

preventing folding in the ER and 
cytosol 677, 686, 711F
protecting protein N-termini 361
and protein folding 114, 354-357, 
1329F 

recognizing unfolded proteins 683 
chaperonins 356 

charge, on phosphatidylserines 567F, 574 
charge-separation in photosynthesis 
788-789, 792-793 
checkpoints 

cancer therapy and 1132-1133 
cell size 1174 
spindle assembly 993-994 
chemical biology 459
chemical bonds

and chemical groups 47, 90-91
double bonds 90, 98, 566, 571
chemical carcinogens 270, 1094-1095,
1127-1129 

chemical components of cells 12, 43-51
see also elements
chemical synapses 627-630
chemiosmotic processes 
bacterial origins 780-781 
in the citric acid cycle 759, 761-763 
defined 753-754 
evolution 794-796
use by chloroplasts 782-783, 786 
chemokine receptors 1279 
chemokines 1241 

chemotaxis 958-960, 1185, 1202, 1237F, 
1241F 

chiasmata 1006F, 1007, 1008F 
chick embryos 

fibroblasts and collagen 1062F, 1064F, 
1069F 

neural tube 1041F, 1192F 
somite formation 1177F 
spinal cord 1200F 
chimeric animals 1254
chimeric proteins 1135 
chimpanzees 17, 217-219, 221, 224,

225F, 226, 228, 231 
ChIP (chromatin immunoprecipitation)
analysis 210 

Chk1 protein kinase 1014, 1015F
Chk2 protein kinase 1014, 1015F 
Chlamydia pneumoniae 1266 
Chlamydomonas (C. reinhardtii) 938F, 

941F, 943

chlorophyll 754-755 

energy/electron transfer 787-788 
ionization, initiating photosynthetic 
electron-transfer 783 
P680 chlorophyll 790, 791F, 794F
special pairs 788-790, 791-792F, 793,
794F 

structure 787F 
chlorophyll A0 792

chlorophyll-protein complexes 787-788 
photosystems as 788 
reaction centers and antenna 
complexes 788-789 
chloroplast genome 782, 802F, 806-807 
chloroplast precursor proteins 665F 
chloroplasts 

among intracellular compartments 
642 

collaboration with mitochondria 787F 
compared with mitochondria 782-783 
in cytokinesis 1001 
electron transport in mitochondria and
755F 

energy conversion in 784F 
energy storage in 80-81 
GFP-tagged mitochondria and 544F 
origins and features 26-28, 644, 798F, 
806-807 

in photosynthesis 782-799 
as plastids 642 

protein transport into 658, 664-666 
self-splicing RNAs 324 
structure 658F 

thylakoid membranes 606, 658, 664, 
686 

chloroquine 610 
cholecystokinin 1219F 


cholera 576, 732, 834, 1265, 1266F,
1269-1270 

cholera toxin 1270, 1278
cholesterol

in cell membranes 568, 571-572 
latent gene regulation 655, 656F
NADPH in biosynthesis 68F 
receptor-mediated endocytosis of 733 
and the steroid hormones 875 
structure 99, 568F 
in synaptic vesicles 747F 
chondroblasts 1057 
chondrocytes 1229 
chondroitin sulfate 1058, 1060-1061F,
1185, 1202 

chondromas, as benign 1092 
chondrosarcomas, as malignant 1092 
Chordin protein 1168-1169 
chromatids see sister chromatids 
chromatin 

activating and repressive 205, 210
changes following nuclear 

transplantation 1252-1253 
DNA packaging in 179-193, 259 
heterochromatin and euchromatin 
194, 976 

insertion of histone variants 198 
position and gene expression 212, 
213F 

propagation of changes 199-201 
structure and function 194-207 
types of protein in 187 
zigzag model 192 
chromatin assembly factors see 
chaperones, histone 
chromatin domains 

and barrier sequences 202, 210, 391 
histone variants and 198 
reader-writer complexes 199-201, 

205 

chromatin immunoprecipitation technique
505, 506F 

chromatin modification 

by reader-writer complexes 205, 406F 
timing of plant flowering 1183-1184 
by transcription activators 386-388 
chromatin remodeling 

fibroblast reprogramming and 1257F 
timescale of 1177, 1179
chromatin remodeling complexes 
for DNA replication 261
nucleosome changes and 190-193
required by RNA polymerase II 
312-313 

and transcription regulators 380, 386, 
388, 390 

and transcription repressors 390 
chromatin structure 

changes in packing 206, 214-215 
and chromosome duplication 
975-977 

and epigenetic inheritance 194, 
204-206, 409-411 

and induced pluripotency 1255-1256 
loop structures 207-208, 211-212,

391 

multiple forms 210-211 
and replication initiation 259
and RNA splicing 323 
chromatography 

affinity chromatography 448-450, 

459, 484 


column chromatography of proteins 
448-449 

gel-filtration chromatography 
448-449, 450F, 455 
HPLC (high-performance liquid 
chromatography) 449, 457 
hydrophobic chromatography 448,
452 

ion-exchange chromatography 
448-449, 452 
chromokinesins 984 
chromophores 

plant photoproteins 884 
retinal, in bacteriorhodopsin 587 
chromosomal translocation, reciprocal 
182F 

chromosome abnormalities 

aberrant human chromosomes 182 
breast cancer cells 1097F, 1111F 
in CML 1093

nonhomologous end joining and 
274-275 

ovarian cancer cells 1116 
Philadelphia chromosome 1093,
1094F, 1095, 1135 

chromosome banding 181-182, 209, 
210F, 211, 391F

chromosome condensation 978-979 
chromosome conformation capture (3C) 
method 209F, 212 
chromosome cross-overs 
crossover control 285 
frequency in humans 492 
in meiosis 1006, 1007F, 1009-1010 
as a result of homologous

recombination 282-283, 486 
chromosome deletions, specific to
humans 226 
chromosome duplication 

chromatin structures and 975-977 
matrix proteins 1067 
chromosome painting 180-182 
chromosome puffs 211 
chromosome segregation

errors and cancer 1097, 1109, 1132 
in meiosis 1004-1006, 1008, 1010
in mitosis 978, 994, 1097 
as universal 963 
chromosome size, Drosophila 
melanogaster 33-34 
chromosome translocations 
breast cancer 1111F 
globin gene family 230 
leading to CML 1093-1094 
chromosomes 

control of duplication 974, 975F 
DNA packaging at mitosis 214 
DNA packaging in chromatin 187-193 
DNA replication within 254-266
early discoveries 173-174 
electrophoresis of whole 
chromosomes 466 
essential components of 185-186 
forces on, in the mitotic spindles 
990-992 

global structures 207-216 
Ig chain loci 1320
lampbrush chromosomes 207-209, 
211 

number of nucleotide pairs, human
257 
polytene chromosomes 208-211
rRNA genes 181F, 330
chromosomes, homologous see homologs 
chromosomes, numbered human see 
human chromosome 
chronic myelogenous leukemia (CML) 
1093-1095, 1135-1136 
chymotrypsin, compared with elastase 
119F 

Ci (Cubitus interruptus) transcription 
regulator 871-873 

CICR (Ca2+-induced calcium release) 838
cilia 

built from microtubules 941-942 
microtubules and 890-891 
olfactory receptors 843, 844F 
planar cell polarity and 1189 
primary cilia 824F, 845F, 873 
stereocilia 890, 892, 924, 1189 
see also axonemes 
ciliary dyneins (axonemal dyneins) 
937-938, 942 

circadian clocks 876-879, 1183
circular DNA 

conservative site-specific 
recombination 293F 
in mitochondria 804 
in prokaryotes 23F 
replication 242F, 255F
circumferential belts 924 
cis-acting lncRNAs 435F, 436
cis-double bonds, in phospholipid tails
566, 571 

cis-Golgi network (CGN) 716
cis-regulatory sequences 402, 406-407,
409 

CRE as 836 
as enhancers 386 
Eve gene 393, 395 
in gene control regions 384-385
and genome annotation 477 
immunoglobulin chains 1320 
insulators and 391 
master regulators and 399-401 
nucleosomes and 379-380
occupation by transcription regulators 
505 

reporter genes and 501 
as sequence logos 375, 378F 
transcription regulator recognition
and binding 373-375, 379-381, 
383-384 

cisternae, Golgi apparatus 642, 715-716 

cisternae, rough ER 671F 

cisternal maturation model 720-721 

cisternal space 669 

citrate synthase 106 

citric acid cycle 

excess citrate from 760 
macromolecule precursors from 85F 
mitochondria and 664, 758-759 
overview 106-107 
in plants 786 
CK1 (casein kinase 1) 870 
CKIs (Cdk inhibitor proteins) 970-971, 
972F, 973T, 1003-1004, 
1013-1014, 1015F 

Cl- (chloride ion) channels 612-613, 629
clamp loader complex 247-249 
class switch recombination 1323 
class switching, B lymphocytes 1320, 
1322-1323, 1335-1336, 1338F 


classical cadherins 1037T, 1038, 

1039-1040F, 1042 

classical pathway, complement system
1302-1303, 1318T 

clathrin-coated pits, plasma-membrane 
553F, 731, 734, 1280
clathrin coated vesicles 

adaptor proteins 698-699 
assembly 697-698, 700F, 703, 727 
coat structure 699F 
protein delivery to 736, 853 
claudins 1048-1051 
cleavage furrows 996-1000 
clonal analysis 1221F, 1222 
clonal deletion

absence of co-stimulation 1337 
immunological self-tolerance 
1313-1314, 1321 
negative selection 1332
clonal expansion 808, 1309-1311
clonal inactivation/clonal suppression 
immunological self-tolerance 1314 
clones 

the body as a clone of the egg 1091 
mutant, and cancers 1091-1094, 
1097F 

subclones in cancer 1096, 1097F, 
1118-1119, 1123 
Clostridium difficile 1264 
cloverleaf structures 335-336 
CLRs (C-type lectin receptors) 1300 
cluster analysis approach 504 
CMC (critical micelle concentration) 583, 
584F 

CNS see central nervous System 
CNVs (copy number variations) 232, 492 
co-immunoprecipitation 457, 505, 506F 
co-repressors 385, 386F, 390, 392, 394, 
395F 

co-stimulatory proteins/signals 
B7 proteins 1337-1338 
dendritic cell expression 1305, 1306F, 
1314, 1326, 1327F, 1337, 1338F 
helper T cells 1314, 1335
naïve lymphocyte response 
1310-1311F, 1316F, 1332
co-translational processes 

contrasted with post-translational 
670, 678F 

protein import into ER 670, 674 
size limits 744 

co-transporters see symporters 
coactivators 385-386, 388-389, 392, 

394, 395F 

coat-recruitment GTPases 703 
coated vesicles 

coat assembly switching, Golgi 
apparatus 713 
three types 697 
coatomer 713 
“coccus” ending 1267 
Cockayne syndrome 271 
codon-anticodon matches 345, 804 
codon bias 482 
codons (nucleic acids) 

amino acid equivalents 7, 334
mitochondrial usage 804-805 
stop and start codons 347-349 
synonymous codons 219-220 
coenzyme A 68-69 
see also acetyl CoA 


coenzyme Q (ubiquinone) 765-766, 767F, 
768-770, 771F, 772-773 
coenzymes 

nucleotides as 101
and vitamins 146-148 
see also activated carriers 
cofactors, electron-carrying 764 
cofilin (actin depolymerizing factor) 905, 
910, 914, 954, 957, 958F 
cohesins 215, 550, 977-979, 982, 992, 
993F, 1004, 1007-1008F, 1009
coiled coils 

cytoskeletal filaments 915F, 931F,

936, 937-938F, 945, 950F 
derived from a-helices 116-117, 124, 
137 

in intermediate filaments 894, 
945-946, 950F 
in kinesin tails 936 
leucine zipper motif 376 
in myosin II 915 
see also DNA supercoiling 
coincidence detectors 699, 733, 825 
colchicine 459, 904T, 929, 935, 939, 993 
collaboration, in multicellular organisms 
1091 

collagen 

assembly 130 

triple helix 124-125, 1061-1062
Type IV collagen 1058F, 1062, 
1069-1073 

Type XVII collagen 1037T, 1062, 1070, 
1076F 

vesicular transport of procollagen 704 
collagen family proteins 

as extracellular matrix macromolecules
1057, 1061-1064 

fibril-associated collagens 1062-1064 
fibrillar collagens 1058F, 1062-1064 
nonfibrillar collagens 1063T 
and their properties 1063T 
collagen fibers 1058, 1062, 1064
collagen fibrils 1057F, 1058, 1060, 
1062-1065, 1069F 
color blindness 300F 
colorectal cancers 

common genetic abnormalities 
1123-1124 

driver mutations 1112F, 1117 
epigenetic changes 1110F 
evidence on stem cells from 1220 
tumor progression example 
1122-1125 

combination therapies, for cancer and
AIDS 1139-1140 
combinatorial controls
Eve gene 394-395 
gene expression and cell type 
396-398, 399, 1150, 1160 
logic functions governing 430 
miRNAs 430 

of transcription regulators 396, 397F, 
399, 520-521

combinatorial diversification 1320 
combinatorial regulatory codes 166 
commensalism 1264, 1277, 1298 
commissural neurons 1202-1204 
committed precursors see transit 
amplifying cells 
committed progenitor cells 
fate of 1245-1246
lymphoid and myeloid 1243
compact bone 1230-1231 
comparative genomics 218, 482, 1269F 
compartmentalization see organelles 
compartments, maintenance of diversity 
697-710 

complement system

alternative pathway 1302-1303 
classical pathway 1302-1303, 1318T 
early complement components
1302-1303 

immunoglobulin activation 1317 
inactivation 1303 

and the inflammatory response 1241, 
1303 

late complement components 1303
lectin pathway 1302-1303 
and phagocytosis 740, 1302-1303 
complementary DNA see cDNA 
complementation tests 487, 490
Complex I (NADH dehydrogenase 
complex) 767-770, 769F 
Complex II (succinate dehydrogenase) 
767-768F, 772-773, 775 
complex oligosaccharides 717-718 
complexins 745 

compression resistance 1036F, 1082,
1084F 

computer analogy, cell signaling 831 
computer models see models 
(simulations) 
computer techniques 

need for quantitative analysis 38 
protein structure displays 117 
in proteomics 167 
in structured illumination microscopy 
550 

concentration gradients 

driving passive transport 599 
germ cell migration 1186F 
morphogens 1158, 1165-1166F, 1200 
signal molecules in development
1151 

concentration thresholds, signal- 
processing 827-831 
condensation reactions 

as energetically unfavorable 66 
formation of macromolecules by 49, 
71F 

hydrolysis as the reverse 49F 
powered by ATP 65-66, 70-73 
condensins 209F, 215, 979-982 
conditional mutations 487, 489 
confluence 1098 

confocal microscopy 540-542, 544F, 546, 
724F, 800F, 875F 
conformational changes 

actin filament elongation 902 
in allosteric enzymes 151 
in AP2 adaptor protein 700 
in calmodulin 840 
complement system 1303F 
in dyneins 938F 
generating movement 160-163 
in GPCRs 833 

in integrin activation 1077-1078 

in ion channels 618, 630 

in kinases 835 

in myosin 916 

retinoic acid activation 877F 

in RTK activation 851 


in SecA ATPase 677 
in transcription elongation 312 
in transcription initiation 307 
in transporters 599, 601, 607 
virus entry 1281 
conformations 
and energy 114 

of macromolecules generally 49, 50F 
of proteins 110 
see also protein structure 
conjoined twins 1167 
conjugation, and horizontal gene transfer 
1268 

connective tissues 

collagens in 1061, 1063-1064 
derivation from extracellular matrix 
1035, 1228 

fibroblasts in 1057F, 1228 
myoblast patterning 1185 
connexins 1051, 1052F 
connexons (hemichannels) 1051-1052 
conoids 1282, 1283F 
consensus nucleotide sequences 
marking introns 319F 
for RNA splicing 319 
in transcription 308, 311F, 325F 
in translation 348 

consensus recognition sequences 348 
conservative site-specific recombination 
292-295 
conserved DNA 
coding 1149 

multispecies conserved sequences 
225-226 

noncoding 224-225, 1149 
X-chromosome sample 300F 
conserved genes 15-16, 216-217 
biochemical importance 21-22 
common to all domains 20, 21T 
eukaryotic cell cycle 32 
conserved proteins 
actin as 898 
in apoptosis 1025 
eukaryotic cell cycle 966 
histones as 190 
Sec61 complex 676 
structure 120 
conserved RNA motifs 363 
conserved systems 

in cell polarization 956 
in cell signaling 814, 852, 855, 1150, 
1154 

cytochrome c oxidase 771F 
cytochrome c reductase 769 
in early development 1166-1167 
Hox complex serial gene expression 
1164 

N-linked glycosylation 719 
signal-recognition particles 673-674, 
675F, 677-680 

constant region, Ig light and heavy chains 
1318-1319 

constitutive secretory pathway 741, 742F, 
746 

contact-dependent signaling 814, 815F, 
867 

contact inhibition 1098F 
contractile rings 

in cell division 890, 892F 
in cytokinesis 924, 996-1001 
septins and 949 


in telophase 981 
contrast enhancement 534-535 
convergent extension 1188, 1189F 
Coolair RNA 1183 

cooperative allosteric transition 152-153 
cooperative binding 

by repressors and activators 
516-517, 519 

transcription regulators 378-380 
coordination 

of cell growth and cell division 1018 
of multiple cell signaling responses 
825 

COPI-coated vesicles 

budding from the Golgi apparatus 
698, 713F 
COPII and 697 
GTPase control 703 
interaction with ER retrieval signals 
713-714 

vesicle transport model 721 
COPII-coated vesicles 

budding from the ER 698, 711 
COPI and 697 
formation 704F 
GTPase control 703, 705-706 
procollagen packaging 704, 705F 
protein packaging 711 
uncoating coupled to delivery 
706-707 

copper 

in ethylene receptors 881 
ions, in cytochrome c oxidase 771F 
corrals 593 

cortical cytoskeleton 591-593 
cortical rotation 1156, 1167 
cortisol 400, 835T, 875-876 
Costal2 scaffold protein 871-873 
coupled reactions 

energetically unfavorable reactions 
60-61, 63, 76-78, 102 
favorable reactions with activated 
carriers 64-65 
mechanical model 64F 
coupled transporters 601-604 
coupling 

between binding sites 151-152 
heat production to increasing order 
53-54 

transcription and excision repair 271 
covalent bonds 43, 45T, 90 
disulfide bonds 127 
DNA-topoisomerase 252-253 
double bonds 90 
polar covalent bonds 56 
covalent modifications 

in the Golgi apparatus 716 
histone amino acid side chains 
196-197 

post-translational protein modification 
165-166 
tRNAs 336 

Coxiella burnetii 1284F 
CpG sequences see CG sequences 
CPSF (cleavage and polyadenylation 
specificity factor) 324-325 
CRE (cyclic AMP response element) 836 
Cre recombinase 496-497 
CREB (CRE-binding protein) 836, 841 
CreERT2 gene 1221-1222F 
crescentin 897, 898F 
crime see forensic science 
CRISPR (clustered regularly interspersed 
short palindromic repeat) system 
434, 497-498 
crista junctions 757-758 
crista space 757, 762, 767-770, 772, 
777-779, 802 

cristae 658, 757-758, 759-761F, 778-779 
critical concentration, Cc 900-903, 904F, 
906, 927-928 
Crm1 receptor 420 
Cro repressor protein 123 
Crohn’s disease 1300 
cross-beta filaments 130-131 
cross-linking 

of cellulose by glycans 1083 
with glutaraldehyde 555 
cross-presentation, dendritic cells 1329 
cross-species transmission 1279, 1291 
cross-strand exchanges (Holliday 
junctions) 283F, 284-285 
cross-talk 202, 391, 821-822, 857 
crossover control 285 
crossover interference 1010 
crossovers see chromosome cross-overs 
crRNAs (CRISPR RNAs) 434 
cryoelectron microscopy 559-561, 562F 
actin helices 910F 
clathrin coats 699F 
respiratory chain supercomplex 772, 
773F 

cryoelectron tomography 559F, 705F, 778 
cryptic binding sites 1043, 1068 
“cryptic” splice sites/signals 321, 322F, 
323, 324F 
cryptochromes 885 

crypts, intestinal 1122, 1124, 1218-1219 
CSFs (colony-stimulating factors) 
1244-1246, 1257F 

CstF protein (cleavage stimulation factor) 
324-325, 417-418 

CT (computed x-ray tomography) 1092F 
CTCF protein 409F 
CTCs (circulating tumor cells) 1102 
CTD (C-terminal domain) 310F, 311-312, 
311T, 316-317 

CTLA4 (cytotoxic T-lymphocyte- 
associated protein 4) 

1138-1139, 1337 
CTR1 protein 881-882 
cullins 160, 164 
“culture shock” 443 
cultured cells 

abnormal behavior of cancer cells 
1098, 1099-1100 
eukaryotic cell lines 442-444 
fibroblast proliferation 966F 
hematopoiesis regulating factors 
1244 

light microscopy 440F, 442F 
limitations of laboratory culture 14 
myoblast fusion 1234 
need for support 441 
organoids from stem cells 1223 
population homogeneity 440, 442 
curare 632 

cut-and-paste transposition 289-291 
CXCL12 ligand 1185-1186 
CXCR4 receptor 1279 
CXCR4 receptors 1185 
Cy3 and Cy5 dyes 537 


cyanides 772 
cyanobacteria 

advanced photosynthesis in 782, 790 
ATP synthases 778F 
circadian clocks 878-879 
and the origins of aerobic life 796-797 
and the origins of chloroplasts 28F, 
798F 

cyclic AMP (cAMP) 

AraC/AraJ genes and 521 
binding site example 135F 
CAP protein and 382, 522 
chemotaxis by Dictyostelium 958 
in cholera 576, 834 
fish melanocytes and 940 
G protein regulation 833-834 
heart muscle effects 835 
as a second messenger 819-820, 

827, 833 

structure 101, 834 

cyclic-AMP-dependent protein kinase 
(PKA) 827, 834-837, 841, 843, 
845T, 848 

cyclic-AMP-gated cation channels 844 
cyclic AMP phosphodiesterases 834 
cyclic GMP (cGMP) 844-848, 880 

cyclic-GMP-dependent kinase 155F 
cyclic GMP phosphodiesterases 
844-845, 846T, 847-848 
cyclic-nucleotide-gated ion channels 
843-844 

cyclin-Cdk complexes 968, 969F 
cyclin-dependent kinases (Cdk) 

phosphorylation of nucleoporins and 
lamins 656 

in protein kinase evolutionary tree 
155F 

Cyclin E and growth inhibition 1197 
cyclins 

as conditionally short-lived 359 
four classes 968-969 
cycloheximide 351, 352T, 546F 
cyclopamine 873 
cyclosome see APC/C 
cyclosporin A 655F 
Cyk4 protein 1000F 
cysteine 

selenocysteine 350 
structure 113 
tetracysteine tags 1053F 
see also disulfide bonds 
cystic fibrosis 324, 611, 712 
cystinuria 598-599 
cytochalasins 904 
cytochrome b562 118F, 354F 
cytochrome b6-f complex 787, 789, 
791-794, 797 
cytochrome c 

in intrinsic apoptosis 1025-1027 
structure of heme group 766 
cytochrome c oxidase 767, 768F, 

770-773, 774F 

cytochrome c reductase 767-769, 
770-771F, 773, 797 
cytochrome oxidases 659F, 797 
cytochrome P450 family, in detoxification 
670, 1128 

cytokine receptors 863-864 
cytokines 

class switching 1323 
co-stimulatory signals from 1337 


helper T cell fate 1335 
IL1 and TNFα as 873 
in the inflammatory response 1301 
mammalian response to dsRNA 1304 
see also chemokines; interferon-γ; 
interleukins 
cytokinesis 981 

in cell cycle M phase 964, 965F, 
966-1004 

contractile ring and 949 
ESCRT mechanism 737 
membrane-enclosed organelles 1001 
in plants 1000-1001 
plasma membrane enlargement 748 
cytokinins 881-882 
cytoplasm 

defined 642 
macromolecules in 60F 
nuclear reprogramming 1252 
cytoplasmic dyneins 937-939, 943 
cytoplasmic inheritance 807 
cytoplasmic proteins in vesicle formation 
701-702 

cytoplasmic tyrosine kinases 852, 853F, 
858, 862-863 

Janus kinases (JAKs) as 863 
cytosine 

3-methyl- 271 
5-methyl- 404, 406 
deamination of 5-methyl- 405 
deamination to uracil 267T, 268, 269F, 
1322-1323 

DNA base pairing with guanine 176, 
177F 

DNA methylation at 404 
structure 100 
cytoskeleton 

accessory proteins 889, 894-896 
actin and actin-binding proteins 891, 
898-914 

in axon growth 1201 
bacterial and eukaryotic 24, 896-897 
bacterial and viral hijacking 913-914, 
1286-1288 

cell migration and 951-960 
coordination among elements 
959-960 

dynamic behavior 890-892, 895F 
extracellular matrix connections 1035 
filament assembly 893 
function and origin 889-898 
integrin links to 1075 
intermediate filaments 891, 944-950 
linker proteins 948-949 
and membrane protein diffusion 
591-593 

microtubules in 891, 925-944 
mitochondria association with 755 
motor proteins 896 
myosin and actin 914-925 
receptor coupling by Rho family 858 
thermal stability 895F 
three elements of 889 
see also microtubules 
cytosol 

pH regulation 604-605, 608 
RNA virus replication 1278 
as site of protein synthesis 641 
transport between nucleus and 
649-658 
cytosolic surfaces 

coated vesicles and 697-698 
RAB proteins on 705 
cytotoxic T lymphocytes 611 
CD8 expression 1331-1332 
class I MHC presentation 1326, 

1329F, 1330T 

dendritic cell activation 1329, 1333, 
1335 

foreign protein recognition 1327F, 

1328 

killing target cells 1333-1334 
NK cells resembling 1304-1305, 1333 
as a T cell class 1325 

D 

D gene segments 1320 
D-V (dorsoventral) axis 
animal body plan 1147 
cortical rotation, Xenopus 1167 
Dorsal transcription regulator and 
1164-1165, 1166F 
nervous system development 1199, 
1200F 

polarization of the embryo 1155-1158 
retinotopic map 1206 
vertebrate equivalent 1169 
Dally and Dally-like proteoglycans 1073 
Danio rerio see zebrafish 
DAPI dye 537-538F, 800F 
dark-field microscopes 533-534, 535F 
databases, protein 

mass spectrometry matching 
456-457 

and protein structure prediction 462 
DDK kinase 975, 976F 
ddNTP (dideoxy nucleoside triphosphates) 
478 

de-differentiation 398, 1249 
de novo methyl transferases 404-405, 
406F 

de novo mutations 494 
deamination 

of 5-methylcytosine 405 
adenine to inosine 335, 336-337F, 

418 

arginine to NO 847 
cytosine to uracil 267T, 268, 269F, 
272F, 418, 1322-1323 
of DNA yielding unnatural bases 
271-273 

RNA editing 418-419 
death receptors 1024-1025 
decapping mRNA 426-428 
decorin 1058F, 1060 
default secretory pathway 741 
defensins 1298, 1302 
degenerative disease treatment 
1258-1259 

degradation complex 870 
degrons 158 
degrowth 1248 

dehydrogenation and hydrogenation 56 
Deinococcus radiodurans 483 
delayed K+ channels 634-635 
delayed negative feedback 516, 517F 
delayed response genes 1013F 
deletion mutations 487 
Delta protein 867, 868F, 1172, 1173F, 
1178-1179, 1224 
denaturing of DNA 472 
denaturing of proteins 114, 453, 584 


dendrites 

in neural development 1198, 1199F, 
1200-1201, 1206-1207, 1208F 
role in neurons 621, 940 
self-avoidance 1206-1207 
dendritic cells 

antigen processing 1330F 
co-stimulatory protein expression 
1306F, 1314, 1326, 1337, 1338F 
cytotoxic T cells and 1333, 1335 
derived from monocytes 1240, 1243F 
Langerhans cells as 1226F, 1240 
linking innate and adaptive immune 
systems 1305, 1306F 
negative selection in thymocytes 
1332 

as professional APCs 1307, 1328 
role as antigen peptide presenters 
1240, 1305 

surface proteins displayed 1306F, 
1326, 1327F, 1330 
T cell binding and activation 
1324-1326, 1331 
use of cross-presentation against 
viruses and tumors 1329 
dense-core secretory granules see 
secretory vesicles 
deoxyribose 

stability conferred by 366 
structure 100 

dephosphorylation, M-Cdk and 970-971, 
978, 993-995, 998 
depth of field 533F, 558-559 
depurination 267T, 268-269, 270F 
dermatan sulfate 1058 
desensitization see adaptation 
desmin 944T, 948-949, 1046 
desmocollins 1037T, 1038, 1039F, 1046F 
desmogleins 1037T, 1038, 1039F, 1046F 
desmoplakin 1037T, 1046F 
desmosomes 893F, 946, 1036, 1226F 
desmotubules 1053, 1054F 
detergent solubilization, membrane 
proteins 583-586, 766 
deterministic models 524 
detoxification 

carcinogen activation and 1128 
of oxygen 796 
by peroxisomes 667 
in smooth ER 670 
developing tissues see embryonic 
development 

developmental complexity and gene 
repression 390 

diabetes 

β-cell renewal and 1226 
insulin levels and cancer risk 1115 
as a multigenic condition 493 
Type I as an autoimmune disease 
1315, 1332-1333 

diacylglycerol 819, 837, 838F, 859, 862F 

Diap gene 1197F 

Dicer enzyme 430F, 431-433 

Dictyostelium 559F, 958 

dideoxy sequencing 466F, 477-478 

Didinium 25F 

differential equations 

coupled differential equations 515 
and deterministic models 524 
for positive feedback 518 
protein concentrations and 513-514 
and transient behavior 512-513 


differential-interference-contrast 

microscopes 533-535, 560F, 
1002F 

differentiation 

from the blastomere 1148F 
embryoid bodies from iPSs 1257F 
epigenetic inheritance and 205 
four general statements 371 
genetic mechanisms maintaining 
392-404 

reprogramming cell types 396, 398 
retention in culture 441-442 
stepwise commitment in 

hematopoiesis 1243-1244 
terminally differentiated cells 400, 

816, 1012, 1121 

transcription regulator activation 
1170-1171 

variations in cancer cells 1121, 1122 
without changes in the genome 369 
diffraction effects, in microscopy 531, 
539F 

diffraction limit to resolution 532, 549, 
551, 554 

diffusion 

within lipid layers 569-570, 601 
passive diffusion into nuclei 650 
and reaction rates 59-60 
and signal molecule gradients 
1151-1153, 1158F, 1166 
of small molecules across membranes 
598 

diffusion coefficients 570, 589 
diffusion-limited reaction rates 143, 148 
digestion, enzymatic 74 
digital cameras in microscopy 555 
digital image enhancement 534 
digoxigenin 467 
dihydroxyacetone 96 
dihydroxyacetone phosphate 104 
dilated cardiomyopathy 923 
dilution effects 513-514 
dimerization/dimers 
α- and β-tubulin 925 
ATP synthases 778, 779F 
caspases 1023-1025, 1026F 
fibronectin 1067 
helix-loop-helix proteins 377 
integrins 1075 
of MHC proteins 1327 
polypeptide 123 
of RTKs on ligand binding 851 
single-pass transmembrane proteins 
580 

T cell receptors 1325 
TGFβ superfamily 865 
transcription regulators 375-378 
tyrosine-kinase-associated receptors 
862 

dimorphism, pathogenic fungi 1271-1272 
diplotene 207F, 1006, 1007F, 1010 
DISC (death-inducing signaling complex) 
1024-1025 

discontinuous responses 827, 829 
see also switches 
diseases 

analysis using stem cells 1258-1259 
linked to DNA repair 266T 
linked to integrin defects 1076-1077 
linked to mitochondrial mutations 
807-808 
linked to mutations 479, 493-494, 

627, 668 

linked to X-chromosome changes 
300F 

lysosomal storage diseases 728-729 
membrane transport protein mutations 
598-599 

microbiotic imbalance and 1264 
potential of RNA interference 433 
RNA splicing errors and 323-324 
spectrin deficiency 591 
Dishevelled protein 870-871 
disorder (thermodynamic) see entropy 
disordered regions (proteins) 125-126, 

149 

disulfide bonds 127, 452 
in fibronectin 1067-1068 
immunoglobulin (Ig) domains 1319 
in keratins 946 
in laminin 1070 

mitochondrial protein import 664 
proteins in lumen and cytosol 682 
single-pass transmembrane proteins 
582 

diurnal rhythms (circadian clocks) 

876-879, 1183 
Dlg (discs-large protein) 165 
DNA 

analytical methods 463-466 
as carrier of genetic information 174, 
177-178 

as constituent of chromosomes 
173-174, 179-182 
content measurement 966, 967F 
distinctions from RNA 4-5, 302, 366 
fragmentation in apoptosis 1024F 
localization in eukaryotes 178-179 
manipulation 467-485 
in mitochondria and chloroplasts 753 
packaging within chromosomes 
179-193 
polarity 175 

structure and function 3, 175-179 

synthesis 240-241F 

see also double helix; genes; 

mitochondrial DNA; recombinant 
DNA 

DNA catenation 977, 979 
DNA cleavage by restriction nucleases 
464-465 
DNA cloning 

cDNA cloning 470-471, 475F 
DNA libraries from 469-470 
genomic DNA cloning 470-471, 475F 
proteins in quantity from 483-484 
in recombinant DNA technology 464 
two meanings 467 
using PCR 473-474 
DNA damage 

apoptosis response 1022, 1028 
cell cycle response 276, 1014-1015 
by chemical carcinogens 1127-1128 
p53 regulatory pathway 1113, 
1115-1116 

response in cancer cells 1099, 1113, 
1115-1116, 1132 

susceptible sites and typical lesions 
267 

telomeres and response 264 
see also DNA repair 
DNA demethylases 404, 406 


DNA duplication, collagen α-chains 1062 
DNA fingerprinting 475-477 
DNA glycosylases 269, 270F, 271, 273 
DNA gyrases 315 
DNA helicases 

nucleotide excision repair 270 
production 484F 
replication role 246, 249, 255 
in S-phase 974-975, 976F 
structure 247F 
in TFIIH 311 

Xpd knockout mice 497F 
DNA injection by bactériophages 19F 
DNA inversion 294 
DNA labeling 466 
DNA libraries 469-471 
DNA ligases 

in disease 266T 

in recombinant DNA technology 464 
repair function 269-270, 280, 289F 
replication function 245, 246F, 250 
use in DNA cloning 468, 469F 
DNA looping 383-384, 385F, 386, 391 
DNA-only transposons 218F, 288-290 
DNA polymerase α-primase 253 
DNA polymerases 

attachment 246-247 
discovery 240 
and PCR 473 
Polδ 246, 261 
Polε 246 

RNA polymerases compared 304-305 
as self-correcting 243 
synthetic action 241 
translesion 273, 274F 
viral use of host 1289 
DNA primases 245, 249, 253, 255, 256F, 
263F 

DNA primers, in PCR 473 
DNA probes 472-473, 502 
DNA repair 266-276 

base excision and nucleotide excision 
269-271 

contrasted with RNA 271-273 
defects in cancers 1097, 1124-1125, 

1132-1133 

diseases linked to 266T 
double strand breaks 273-275, 

1133-1134 

in eukaryotes and bacteria 271 
by homologous recombination 
1133-1134 

see also DNA damage 
DNA replication 4-5 

and the cell cycle 258-260, 974-977 
end-replication problem 262 
errors in cancer 1106F, 1107 
initiation and completion 254-266 
mechanisms 239-254 
proofreading 242-244, 250-251, 257F 
as semiconservative 240, 242F, 447 
temperature-sensitive mutations and 
489 

topoisomerase and the winding 
problem 251-253 
two stages 974, 976F 
see also replication forks; replication 
origins 

DNA segment shuffling 16, 17F 
DNA sequences 

Alu sequences 212F, 223F 


barcoding mutants 498, 499F 
barrier DNA sequences 195F, 202 
centromeric 203 

changes in cancer cells 1094, 1097, 
1106, 1109-1110, 1111-1112F 
deducing gene function 20, 216-217 
human β-globin gene 179F 
inferring for ancient genomes 
223-224 

maintenance of 237-239 
rate of change 218 
see also genome sequencing; human 
genome 

DNA sequencing methods 

dideoxy sequencing 466F, 477-478 
Illumina® sequencing 480 
ion torrent™ sequencing 481 
second-generation sequencing 
methods 479-480 
shotgun sequencing 479 
third-generation sequencing methods 
481 

DNA supercoiling 

created by RNA polymerases 
314-315 

positive and negative 315 
the replication winding problem 251, 
253 

DNA transfection for oncogene 
identification 1104 
DNA transferases 

de novo methyl transferases 404-405, 
406F 

maintenance methyl transferases 404 
docking sites 

PI(3,4,5)P3 as 859 
plasma membrane 859, 860F 
protein tyrosine kinases 1080 
receptor tyrosine kinases 849-850, 
851F, 852, 853F 
specificity of 821-823 
STAT proteins and cytokines 863 
dogs 

olfactory bulb neurons 1199F 
size differences 1193, 1196 
dolichol/dolichol phosphate 99, 684 
“Dolly the sheep” 1252 
domain shuffling 121-123 
domains (lipid bilayers) 572-573, 

590-591, 593, 749 

domains (protein) see protein domains 
domains (taxonomic) 

bacteria, archaea, and eukaryotes 15 
common gene families 20, 21T 
dominant mutations 

cancer-critical genes 1005F, 1104 
gain-of-function as typically 489 
dominant negative mutations 487, 1116 
Dorsal transcription regulator 655F, 873, 
1164-1165, 1166F 
dorsoventral (D-V) axis 
animal body plan 1147 
cortical rotation, Xenopus 1167 
Dorsal transcription regulator and 
1164-1165, 1166F 
nervous system development 1199, 
1200F 

polarization of the embryo 1155-1158 
retinotopic map 1206 
vertebrate equivalent 1169 
dorsoventral inversion 1169 
double bonds 
alternating 90 
cis-double bonds 566, 571 
unsaturated fatty acids 98 
double helix 

DNA 3F, 174, 176 
hydrogen bonding 175-176 
as key to DNA repair 268 
superhelical tension, DNA 314-315 
double-negative activation 820, 821F 
double-pass transmembrane proteins 
679, 681F 

double-reciprocal plots 143 
double-strand breaks (DNA) 
in cancers 1111, 1116 
homologous recombination 278-279, 
1116 

in meiosis 282-284, 1009-1010 
repair mechanisms 273-275, 497, 
1323 

segmental duplications from 228 
topoisomerase II and 252-253 
double-stranded RNA (dsRNA) 

RNA interference and 431, 499-500 
TLR3 recognition 1299, 1304 
as viral characteristic 1304 
Down syndrome 1010 
doxycycline 495F 

Dpp gene (Decapentaplegic) 1165-1166, 
1169 

Dpp signaling protein 1073 
driver mutations 1104, 1112-1113, 1117, 
1119F 

Drosophila (D. melanogaster) 
airways 1191F 
alternative splicing in 415 
body segmentation 1159-1163 
Branchless gene 1191F 
cell cycle 966 

central nervous system 1179 
characteristic transposon types 292 
chromosome and genome sizes 
33-34 

circadian clock 877-878 
Dlg protein discovery 165 
Engrailed protein 120, 536F 
Eve (even-skipped) gene 393-394, 
395F 

Eyeless transcription regulator 
397-398F 

fungal infection 1299F 
gene expression by in situ 
hybridization 536F 
genetic switches 392-395 
Hedgehog proteins in 871 
homeodomain motif 376 
imaginal discs 1195 
insulator-binding protein 391F 
mitochondrial DNA 805 
as model organism 29T, 33-34, 417 
mRNA localization in 422 
mutant libraries 498 
neurogenesis 1173-1174 
Notch receptor in 867-868 
P element 416, 486 
photosensing neurons 1208F 
polytene chromosomes 208-211, 
391F, 540F 

position effects in 194, 195F 
RNA interference 499 
Sev, Sos and Grb2 Ras-GEFs 855 
shibire mutant 702F 


Drosophila embryos 

control of development 1157 
discovery of signaling pathways 1154 
germ-band extension 1045 
mitosis without cytokinesis 1002, 
1003F 

syncytium development 1157 
transcription regulators 392-395 
drug discovery 

computer-based 463 
stem cells and 1258-1259 
drug resistance 

cancer treatment 1135-1136, 1139 
malaria 610 

multidrug resistant cancers 610 
pathogens 1291-1294 
see also antibiotic resistance 
drug targets 

antibiotics 1293F 
transmitter-gated ion channels 
631-632 

drug treatment, predicting individual 
responses 506 
Dscam gene 415F, 1208F 
DSCAM proteins 415, 1207 
duplication in tissue renewal 1226 
dwarfism, pituitary and achondroplastic 
1196 
dynactin 939 

dynamic instability, actin 903 
dynamic instability, microtubules 
927-929, 935 

dynamic range, intracellular signaling 
824-825 

dynamic systems and differential 
equations 512 

dynamin and dynamin-related proteins 
701-702, 803, 804F, 806 
dyneins 936-943, 959 

axonemal dyneins 937-938, 942 
cytoplasmic dyneins 937-939, 943 
and the mitotic spindle 983-985 
viruses and 1288 
dysbiosis 1264 
dyskeratosis congenita 265 
dystroglycan 1070, 1071F 
dystrophin 1234 

E 

E1 (ubiquitin-activating enzyme) 158, 
159F 

E2 (ubiquitin-conjugating enzyme) 
158-159, 160F 
E3 see ubiquitin ligases 
early endosomes 

delivery to 732-733, 738 
and late endosomes 696F, 707 
maturation 707, 730, 735-736 
retrieval from 734-735 
transcytosis and 738 
EB1 proteins 935 

EC (extracellular cadherin) domains 1039, 
1040F 
ecdysone 875 
ECM see extracellular matrix 
EcoRI nuclease 465-466F, 468F 
ectoderm 1147-1148, 1167-1168 
neurogenic ectoderm 1166, 
1173-1174, 1179, 1186 
eczema 947 


editing pockets 339 
EF-G elongation factor 343, 344F 
E2F proteins 1012-1014 
EF-Tu elongation factor 160-161, 163, 
343-344 

effector B and T cells 
compared 1309 
effector B cells 1309-1310, 

1312-1313, 1316, 1322, 1324 
effector T cells 1309-1310, 1312- 
1313, 1324, 1328, 1333-1335, 
1337 

and immunological memory 
1310-1311 

in lymphocyte différentiation 1309 
effector Bcl2 family proteins 1027, 1028F 
effector proteins 

extracellular bacterial pathogens 
1269-1271, 1278, 1281-1282, 
1283F, 1285-1286 
in intracellular signaling 814 
efflux pumps 883, 884F, 1293 
EGF (epidermal growth factor) 

action via receptor tyrosine kinases 
850T 

broad specificity 1011 
production of embryoid bodies 1257F 
resemblances to matrix proteins 
1066F 

vertebrate embryo segmentation 1178 
EGF receptors 
activation 851 
lysosomal dégradation 735 
mutation in glioblastoma 1107 
targeting in lung cancers 1136 
egg-polarity genes 1157-1159, 

1160-1161F, 1163, 1165 

eggs 

growth without cell division 1018 
similarity of egg cells 2F 
eggs, fertilized see zygotes 
eicosanoids 837 

eIFs (eukaryotic initiation factors) 347, 
348F, 423-428, 1304 
eIF4E translation initiation factor 347, 
425-426, 1017 
EIN3 protein 882 

EJCs (exon junction complexes) 320F, 
321, 352, 353F 

elastase, compared with chymotrypsin 
119F 

elastic fibers 1065-1066 
elastin 

as disordered 125 
in the extracellular matrix 1058, 
1065-1066 

electrochemical gradients 

in chemiosmotic coupling 754 
composed of membrane potentials 
and concentration gradients 
599, 662, 762-763 
membrane transport and 599-600, 
602, 612, 662 

mitochondrial membranes 758 
powering ATP synthase 774, 776 
thylakoid membrane 784F 
electrogenic effects 608, 615 
electron carriers 

NADH and NADPH as 67-68, 762 
plastoquinone, plastocyanin, and 
ferredoxin as 793 
electron crystallography 580, 586 
electron microscopy 

cryoelectron microscopy 559-561, 
562F 

EM tomography 558, 779, 988F 
freeze-etch 937F, 942F, 948F 
freeze-fracture electron microscopy 
1047, 1049F, 1051F 
immunogold electron microscopy 
556-557 

negative staining 559-561 
resolution 554, 559, 560F, 562 
single-particle reconstruction 
561-562 

staining 555-556 

three-dimensional imaging 557-558 
transmission electron microscopy 
305F, 554, 555-556F, 558, 560F 
see also SEM 
electron transfers 

chlorophyll 787-788, 792 
NADH dehydrogenase complex 768 
oxidation and reduction as 55-56 
photosynthetic reactions 783-784 
see also light reactions 
electron-transport chain 

in ATP synthesis 84-85, 658 
in chemiosmotic coupling 754 
in chloroplasts 789 
the citric acid cycle and 83 
in early living cells 795-797 
location in mitochondrial cristae 
757-758 

in mitochondria and chloroplasts 
755F 

mitochondrial protein imports 659, 
662, 663F, 664 
proton gradients and 86F 
proton pumps of 763-774 
electrons 

damage to proteins 561 
productivity in terms of ATP 775 
wavelength 554 

electrophoresis see gel electrophoresis 
electroporation 495 
electrospray ion source 456-457 
electrostatic attractions 

atomic force microscopy and 548 
binding site example 135F 
as noncovalent 44, 95 
elements 

periodic table 43F 
required for living cells 12, 43 
elongation factors 
EF-G 343, 344F 
EF-Tu 160-161, 163, 343-344 
loading onto RNA polymerase 388 
role 313-314, 343-344 
elongation phase, DNA replication 974 
EM (electron microscope) tomography 
558 

embryogenesis, cell migration in 951 
embryoid bodies, from iPSs 1257F 
embryonic development 
apoptosis in 1022 
asymmetries 1151-1152 
blastomere to differentiation 1148F 
cadherin-dependent cell-cell adhesion 
in 1040-1041 

cell memory in 1148, 1150, 1162, 
1164 


control of timing 1176-1184 
epithelial-mesenchymal transitions 
1042 

fundamental processes in animals 
1145 

germ cells and somatic cells 1158 
intracellular control programs 1179 
morphogenesis 1184-1193 
neural development 1198-1213 
overview 1147-1155 
retina 1236 

spatial patterning in 1150-1154 
specific genes in animals 1149 
tissue morphogenesis 1059 
embryonic stem cells 
Nanog regulator 378F 
transgenic mice 496 
embryos 

cell-cycle control System 967, 971 
genome activation 1147 
parthenogenetic 987 
see also blastula 
embryos, Drosophila 

control of nuclear transport 655F 
discovery of signaling pathways 1154 
germ-band extension 1045 
mitosis without cytokinesis 1002, 
1003F 

syncytium development 1157 
transcription regulators 392-395 
embryos, frog 

blastula stage components 1147F 
epigenetic inheritance and 205-206 
evidence for differentiation without 
gene loss 369-370 
reprogramming by donor nuclei 
205-206 

embryos, vertebrate 

inductive signaling 1166, 1167F, 1177, 
1184, 1198 

nervous systems 1041F 
spatial patterning 1167-1169 
EMT (epithelial-mesenchymal transitions) 
1042, 1101 

endocrine cells, in signaling 815 
endocytic-exocytic cycle 731 
endocytic vesicles 

endosome delivery 696F, 707 
endosome formation and 730-732, 
735 

phagosomes as 738 
synaptic vesicles from 746 
endocytosis 

of activated TGFβ receptors 865 
defined 695 

influenza virus infection by 709, 1280 
as a lysosome delivery pathway 725 
pathways 730-741 
phagocytosis as 738 
and receptor down-regulation 830 
receptor-mediated 709, 727, 

732-735, 849, 1281 
endoderm 1147-1148, 1156, 1158, 
1167-1168 

endolysosomes 723, 724F, 730, 734F, 

736, 1299, 1300T 
endonucleases 

AP endonuclease 269 
mRNA destruction 427 


tRNA splicing 336F 
endoplasmic reticulum (ER) 

among intracellular compartments 
642 

area compared with plasma 
membrane 643 

ER resident proteins 647, 682-683, 
685F, 711, 714 
ER retention signals 682 
ER retrieval signals 713 
ER tail-anchored proteins 682 
functions 669-691 

junction complexes with mitochondria 
691 

membrane asymmetry 681 
microtubules in organization 939 
mitochondrial contacts with 755, 

757F 

mRNA localization and 421 
nuclear envelope connection to 179, 
180F 

protein retrotranslocation 358, 686 
protein transport to the Golgi 
apparatus 710-722 
rough and smooth ER 642, 670, 671F 
rough ER in B cells 1309 
source of lipid droplets 573 
source of microsomes 445, 671-673 
source of single pass membrane 
proteins 577 

see also sarcoplasmic reticulum 
endoreplication 1194 
endosome maturation 707, 730, 735-736 
endosomes 

among intracellular compartments 
642 

early and late 696F, 707, 730, 735, 
1280-1281, 1285 
Rab5 domain formation 707 
recycling endosomes 696F, 706T, 730, 
737-738, 739F 
Toll-like receptors 1299F 
tubule formation 705 
endosymbiont hypothesis 800 
endothelial cells, in blood vessels 
1235-1238, 1311-1312 
endothelin-3 1186 

energetically favorable and unfavorable 
reactions 

carbon fixation as unfavorable 784 
condensation reactions as unfavorable 
66 

coupling 60-61, 63-65, 76-78, 102 
defined 57 

DNA supercoiling as favorable 314 
electron transfer as favorable 764 
membrane transport of glucose as 
favorable 603F 
energetics 

of active transport 600 
of biological reactions 102 
bond strengths 44F 
catalysis and energy use 51-73 
extraction of energy from food 73-88 
geochemical energy 12 
of glycolysis and oxidative 
phosphorylation 756 
hydrogen burning and oxidative 
phosphorylation 761 
membrane transport proteins 163 
mitochondrial protein import 661F 


Page numbers with an F refer to a figure; page numbers with a T refer to a table. 





oxidation of organic molecules 54-55 
of protein folding 114-115, 549 
see also free energy 
energy conservation 52-54 
energy storage 
as fat 78-79 
as starch 80-81 

through coupled reactions 76-78 
Engrailed gene (Drosophila) 1160-1161,
1162F

engrailed protein (Drosophila) 120
enhancers 312, 1175 

see also cis-regulatory sequences
enkephalins 744 
enolase 105 

enterocytes see absorptive cells 
enteroendocrine cells 1218, 1219F, 1221, 
1223-1224 
entrainment 877-878 
entropy 52-53, 60, 103 
enveloped viruses 1274, 1275F, 1280, 
1286, 1287F, 1288 
viral envelopes 1275F, 1280-1281, 
1287F 

environmental cues

cell migration 1185-1186 
metamorphosis of amphibians 1182 
plant flowering times 1182-1184 
environmental factors 

identical twin studies 412 
stem cell fates 1222 
enzyme amplification in microscopy 540 
enzyme cascades 848, 873, 881 
enzyme-coupled receptors 

as cell-surface receptor class 818F, 
819 

insulin receptor as 824F 
receptor serine/threonine kinases as 
865 

RTKs as 837F
enzyme kinetics 141-144 
enzyme-substrate complexes 
formation by collision 59-60 
lysozyme example 146F 
enzymes 

activation energy effects 57-58 
activation sequence for Src kinases 
156 

allostery 151-153 
cellular compartmentalization 

148-149

choice of reaction pathways 58, 59F 
classes and nomenclature of 140T 
concentrations and metabolic rate 
148 

DNA repair function 266 
electrostatic attractions in 95 
multienzyme complexes 148 
“perfect enzymes” 143 
positive and negative regulation

149-151, 152F

as protein catalysts 6, 51 
as proteins 48 

specificity as catalysts 140-141 
speed of molecular motions and 
59-60 

zymogens as proenzymes 736 
eosinophils 1239, 1240F, 1241, 1245, 
1302F, 1317 

EPEC (enteropathogenic E. coli) 1278 
ephrin-Eph binding/EphrinB 1224 


ephrin proteins 850T, 858, 1188, 1206, 
1207F 

epidermal stem cells 1225-1226 
epidermolysis bullosa 947, 949, 1069 
epigenetic changes 

in cancer cells 1094, 1096-1097, 
1109-1111, 1125-1126 
nuclear reprogramming and 
1252-1253 

tumor suppressor gene inactivation 
1109 

epigenetic inheritance 

and chromatin structure 194, 

204-206, 409-411 
of gene expression 411-412 
mechanisms acting in cis and trans
412, 413F 
epilepsy 627, 913F
epinephrine see adrenaline
epistasis analysis 490 
epithelia 

basal lamina and 1035, 1062, 
1068-1069 

cell-cell junctions 1044-1049 
cell-matrix attachments 1036F 
hemidesmosomes in 1076 
mesenchymal interactions 1190 
planar cell polarity 1189-1190 
protection by mucus 719-720, 749, 
1276-1277, 1298 

simple columnar epithelia 1036, 1047 
stem cells in 1217-1227 
epithelial barrier to infection 1265,
1276-1277, 1298
epithelial cells

adherens junctions 1044 
apical and basolateral domains 749, 
893, 1036, 1047 
aquaporins 612 
carcinomas as cancers of 1092 
keratin filaments in 946 
keratocytes as 953
polarization 749, 1047 
renewal, in colon and rectum 1122 
transport of solutes 605
epithelial tubes 1045
epithelial-mesenchymal transitions 1042, 
1101 

epitope tagging 450-451 
Epulopiscium fishelsoni 13F
equilibrium binding experiments 458, 

459F 

equilibrium centrifugation 672 
equilibrium constants 

actin polymerization 900, 902 
deriving from standard free energy 
changes ΔG° 62-63, 63T, 139F
protein binding strength and 138-140, 
458 

protein-promoter complexes 511 
equilibrium potential, Nernst equation 616
equilibrium reactions 

ATP synthesis and hydrolysis 
775-776 

energetics of 61-63, 103
enzymes and 58
see also reversible reactions
equilibrium sedimentation 447
ER see endoplasmic reticulum
eraser enzymes 201 
ERk (MAP kinases) 856-857, 861F 


ERM protein family (ezrin, radixin, and 
moesin) 905, 913
error correction 

in DNA synthesis 243-245, 250-251 
proofreading 339, 344-345 
by tRNA synthetases 338-339 
see also quality control 
error rates 

DNA replication and RNA synthesis
244, 244T 
meiosis 1010 
and viral evolution 1291
errors in data 525 
erythrocytes see red blood cells
erythropoietin 864T, 1011, 1244-1245 
ES cells see stem cells, embryonic 
Escherichia coli (E. coli) 13F, 16F
ABC transporters 609 
arabinose metabolism 521-523 
DNA replication 255, 256F
enteropathogenic (EPEC) 1278 
F plasmid 469 
gene transcription 380-381 
genome 22, 23F, 1269 
as Gram-negative 1267F 
historical importance 22 
homologous recombination 279 
horizontal gene transfer in 19 
Lac operon 382-383 
mutation rates 237 
promoters 308F 
rRNA genes 327

strand-directed mismatch repair 250 
universal gene families and 21T 
ESCRT complexes (Endosome Sorting 

Complex Required for Transport) 
intraluminal vesicle formation 
736-737 

ubiquitin-binding proteins 735 
ESR (electron spin resonance)
spectroscopy 570 
essential amino acids 86, 87F 
estradiol 876F 
ethidium bromide 466, 1024 
ethylene, plant growth regulator 881-882, 
1087 

eubacteria 15F 

euchromatin and heterochromatin 194 
eukaryotes 

ABC transporters in 163F, 609F, 610 
cell cycle 185, 258, 964-966
cell features 24-25 
cell lines 442-444 

chromatin structure and chromosome 
function 206 

distinction from prokaryotes 12-15 
DNA localization and packaging 
178-182 

DNA repair in 271, 281
DNA replication in 253-254
genomes 23-39 
Homo sapiens as 16F 
last common ancestor 880 
mRNA compared to prokaryotic 316F 
mutation rates 234 
numbers of ribosomes 340 
organelles common to all 641-643
plasma membrane composition 568,
572, 575-576
probable origins 24-26 
protein kinase numbers 154-155 
regulation of protein synthesis 361F,
405 

RNA interference in 429F
RNA polymerases in 390 
single-celled 24, 25F, 30 
transcription control 384-392 
translation initiation 422-425 
eukaryotic cell sizes 644 
eukaryotic cytoskeleton 896-897 
eukaryotic initiation factors (eIFs) 347,
348F 

eukaryotic parasites 1277, 1282-1284, 
1290 

eukaryotic pathogens 1266, 1271-1273, 
1282-1283, 1286F, 1290 
Eve (even-skipped) gene 392-394, 395F 
evolution

cancer as a microevolutionary process 
1092, 1119 

of chemiosmotic processes 794-796 
convergent evolution 665
critical stages in human evolution 226
of energy-conversion 753 
of eukaryotic cell and its membranes 
643-645 

of genomes 216-234, 804
infection as a driver 1331 
of ion channels 626-627, 630 
molecular clock 220 
nonsense-mediated decay in 352 
of NPCs and vesicle coats 650 
of photosynthetic reaction centers 
793, 794F, 796 

in plants compared with animals
880-881 

of protein kinases 154-155 
of protein synthesis 365-366 
and RNA splicing patterns 323 
sources of genetic variation 16-17, 
217-218, 221, 227-232
of vertebrates 227 
of viruses 1291, 1292F 
see also natural selection
evolutionary tracing 136-137 
exchangers see antiporters 
excitatory neurotransmitters 629-631 
excitatory postsynaptic potential 633 
executioner caspases 1022-1025, 1026F 
exocrine cells 

aquaporins 612 
pancreatic 671F 
exocytosis 
defined 695 

in endocytic-exocytic cycle 731 
of lysosomes and multivesicular 
bodies 729 
of residual bodies 739 
synaptic vesicles 744-746 
exon definition 322, 323F
exon skipping 321-323, 324F
exons (expressed sequences)
evolution rate 220F
in human genome 183F, 184, 318F 
recombination 230 
size range 322 
exonucleases 

exonucleolytic proofreading 243-244, 
250 

mRNA destruction 426 
exosomes 326, 729 
explants 441


expression vectors 483 
see also plasmids 
extracellular bacterial pathogens 
1269-1271 

extracellular cadherin (EC) domains 1039, 
1040F 

extracellular matrix 

in animals 1057-1074
basal lamina as 1068-1069 
cell-matrix interactions 1064 
cell-matrix junctions 1035-1038, 
1044F, 1074-1080, 1081F 
degradation 1072-1073
fibroblast differentiation and 1229
fibrous proteins 124 
general features 1035-1038 
isolating and culturing cells 440, 441 
migrating cells and 955 
modification by cancers 1101 
neuronal growth and 1202 
plant cell walls as 1053 
three classes of macromolecules 
1057-1058 

variety of forms 1057-1058 
extracellular signal molecules

in intercellular communication 813 
responses and concentration 828 
transcription regulation response
867-880 

extracellular signals
range 814-815 

response to concentration changes 
830-831 

responses as programmed 816-817 
response speeds 826F
transcription regulator activation by 
395, 399-400
extremophiles 10 

extrinsic activation pathway, apoptosis 
1023-1025, 1028-1029 
Eyeless transcription regulator (Pax6) 
397-398F, 1146F, 1171 

eyes 

cornea 1057, 1063-1064, 1069F 
lens 1038, 1065-1066 
segregation of inputs 1212
see also retina 

F 

F-box proteins 159-160, 167, 168F, 971, 
972F 

F-type ATPases see ATP synthases 
FACS (fluorescence-activated cell sorters) 
440, 441F, 967F

Factors V and VIII, lectin binding 712 
Factor VIII gene 301F, 318F 
FADD (Fas-associated death domain) 
1025F 

FADH2 (reduced flavin adenine
dinucleotide) 

as an activated carrier 69 
in the citric acid cycle 83 
in the respiratory chain 768F, 772, 

774, 775T 

FAK (focal adhesion kinase) 863, 1037T, 
1079-1080 

familial hypertrophic cardiomyopathy 923
FAP (familial adenomatous polyposis coli) 
1123-1124 

Fas death receptors 1024, 1025F, 1031 
Fas ligand 1024, 1025F, 1029, 1031, 1334 


Fat cadherin 1039F 
fate maps 1159F, 1167 
fats 

energy storage as 78-79, 81, 82F
energy storage in plants 785-786 
oxidation to acetyl CoA 81-82, 83F 
fatty acids 

breakdown in peroxisomes 667 
as a mitochondrial fuel 758, 759F 
and other lipids 98-99 
in phospholipid tails 566 
product yields from oxidation 775T 
unsaturated fatty acids 98 
favorable and unfavorable reactions see 
energetically favorable 
Fbx15 gene 1255F 

Fc receptor, and phagocytosis 738F, 739, 
1317 

FDG (fluorodeoxyglucose) 1092F 
feed-forward loops/motifs 401-402, 403F 
incoherent and coherent 522-523
feedback loops 

and cell memory 401-402 
mitogen-activated transcription 
1012-1013 

need for quantitative analysis 38 
in signaling systems 825, 828, 829F,
830-831, 856-857
feedback regulation
of cell size 1193 
cholesterol 655, 656F
enzyme inhibition 150-151 
generating Ca2+ waves 838-840
in photosynthesis 784 
in stem-cell division 1121 
see also negative; positive feedback
FepA protein 581 
fermentations 75-76, 780 
ferredoxin 790F, 792-793 
ferritin 427 
fertilization 

Ca2+ changes 547F, 839F
centrosome 987 
membrane fusion in 709 
Xenopus egg rotation 1156, 1167 
fetal hemoglobin 229
FG-repeats (phenylalanine glycine) 652, 
653F, 654 

FGF (fibroblast growth factor) 

action via receptor tyrosine kinases 
850T 

in branching morphogenesis 
1190-1191 

interaction with heparan sulfate 1073 
production of embryoid bodies 1257F 
FGF receptor and achondroplasia 1196 
fibril-associated collagens 1062-1064 
fibrillar collagens 1058F, 1062-1064 
fibrillin 1065-1066 
Fibrinogen 1076T, 1077 
fibroblasts 

clathrin coated vesicles 699F 
cytoskeleton in cell division 890, 892F 
endocytosis 731, 732F
extracellular matrix organization 1057, 
1062F, 1064, 1228 
Golgi apparatus 715F, 720 
inducing cell-cell adhesion 1041 
intermediate filaments 946 
irradiation damage to DNA 282F 
mitosis 966F 
motion 951, 957F
nucleus of human 180F, 329-330F
in rat cornea 1057F 
replicative cell senescence 265
reprogramming 396-397, 398F, 
1254-1256, 1258 

and their transformations 1228-1232 
tumor-associated 1101 
viewed in different microscope types 
535F 

fibronectin 

in basal lamina 1070 
fibronectin type 3 domains 122F, 1073 
integrin binding 1067-1068 
matrix organization 1066-1067 
RGD sequence in 1067, 1075, 1078
secretion and assembly 1064, 1068
size and shape 1058F
syndecan interaction 1061 
fibronectin-coated substrata 1079-1080F 
fibronectin fibrils 1068 
fibronectin receptors 1076T 
fibronectin repeats 1066F, 1067-1068, 
1073 

fibrous proteins 124-125 

field emission guns 554, 559, 561F

filaggrin 946-947 

filamin 905, 911F, 912, 913F, 957, 958F 
filopodia 

actin filaments in 890, 951 
axon growth cones 952, 1201, 1202F
Cdc42 and 956, 957F
endothelial tip cells 1236
of fibroblasts 892F, 911F 
profilin and 906 
fimbrin 905, 911-912
first law of thermodynamics 53, 102-103 
fish 

medaka fish 547F
melanocytes in 939, 940F
sticklebacks 1174-1175 
see also zebrafish 

FISH (fluorescence in situ hybridization) 
472F 

5' end, DNA 175, 177F
fixatives 

electron microscopy 555-556, 559
light microscopy 535-536 
fixed anions 612, 615
FK506 655F 
flagellae 

bacterial 942, 1267, 1271 
built from microtubules 941-942 
flagellar axoneme 756F 
flagellar motors 778, 781 
flagellins 294, 1290 
Flamingo cadherin 1039F, 1190 
flavin adenine dinucleotide see FADH2
Flc (flowering locus C) gene 1183-1184
fleas 1276F 

flip-flops between lipid layers 570, 588, 
689-690 
FLIP protein 1025 

flippases (phospholipid translocators) 

570, 574, 690 

flow cytometry 524, 966, 967F 
flower spike, wheat 559F 
flowering times, plants 1182-1184 
fluorescein 537, 544, 545F 
fluorescence-activated cell sorters (FACS) 
440, 441F, 967F, 1242 


fluorescence anisotropy, protein 
interactions 458, 459F 
fluorescence microscopy 503, 524, 547 
asymmetric cell division 1002F 
confocal microscopy 541 
contractile ring 997F 
endoplasmic reticulum 669F, 757F
kinetochores 988F 
Listeria-infected cell 914F
localization of specific molecules
536-537 

migrating fibroblast 1068F 
mitochondrial reticulum 803F
mitotic spindles and chromosomes 
980-981, 986F, 991F, 994F
myelinated axons 625F 
neurons 634F 

optic cup derived from ES cells 1258F 
pre-cellularization Drosophila embryo 
1003F 

superresolution techniques 549-551 
time-lapse 757F, 803F, 875F, 935F, 
991F, 1178 
TIRF 547-548
use of multiple probes 538F 
vesicle budding events 705 
zebrafish neural map 1204F 
fluorescence recovery after 

photobleaching (FRAP) 545-546, 
588-589 

fluorescence resonance energy transfer
(FRET) 459, 543-544, 545F, 

855F 

fluorescent antibodies 

against axon terminals 634F
against cofilin 954F 
DNA probes and 472F 
Golgi apparatus 715F 
against microtubules 756F 
against mitochondrial membranes 
800F 

against myosin 1234F 
against phosphotyrosine and actin 
1080F 

in structured illumination microscopy 
550F 

visualizing cell components 538-539F 
fluorescent dyes 

antibody blotting techniques 454 
DNA labeling 466, 467F, 478, 503-504 
examples 537 

excitation and emission wavelengths
537F 

Illumina® sequencing 480 
labeling cancer cells 1120 
photoactivation 544 
protein arrays 458 
quantitative RT-PCR 502-503 
in single-particle tracking 589 
fluorescent indicators 546-547 
fluorescent labeling and microscope 
resolution 532 

fluorescent molecules, imaging using
PALM 553F 
fluorescent proteins 

gene fusion and 495, 501F, 502, 504F
in microscopy 537F, 542-546 
see also GFP 

fluorescent reporters 1178 
fluorochromes, inorganic 538 
FNR (ferredoxin-NADP+ reductase) 790F,
792-793 


foam cells 1266 
focal adhesions 

actin network and 955, 956F 
fibronectin and 1068F, 1079 
integrins and 863, 952F, 957, 959, 
1075, 1079 

stress fibers and 891, 924, 957
vimentins and 959 
folding see histone; immunoglobulin; 

protein structure; RNA 
follicle cells 1158, 1165 
food 

authenticating, using PCR 475 
extraction of energy from 73-88 
storage as fats, starch and glycogen 
78-81 

forensic science 233, 473-477 
formins 905, 906-907, 909F, 911, 957,
996, 997 

N-formylmethionine 347, 800, 958, 1031
Förster resonance energy transfer see
FRET 

fossil record 

phylogenetic trees and 219 
sequence information 223 
founder effects 231-232 
4E-BP protein 1017F 
FoxP3 transcription regulator 1336 
fractal globules 212 
FRAP (fluorescence recovery after 
photobleaching) 545-546, 
588-589 
free energy 

of accurate translation 345-346 
definition 103

and living things 8, 102-103 
protein folding and 114-115, 354 
sources 11 

see also activation energies 
free energy changes ΔG

actin filament elongation 901-902
ATP formation and hydrolysis 65, 71, 
775 

concentration of reactants 61, 
775-776 

electron transport chain 763-764
favorable and unfavorable reactions 
57, 60-61 

microtubule polymerization 927 
Nernst equation derivation 616, 762
oxidative phosphorylation 761 
redox potential difference and
763-765, 767 

RNA polymerase binding 382 
transcriptional synergy 388-389 
see also standard free energy changes 
free radicals 1115 

freeze-etch electron microscopy 937F,
942F, 948F, 1085F 

freeze-fracture electron microscopy 1047,
1049F, 1051F 
freeze substitution 1061F 
frequency see oscillations 
FRET (fluorescence/Förster resonance
energy transfer) 459, 543-544, 
545F, 855F 

Fringe family transferases 868 
Frizzled receptors 868-871 
frog eggs 996F, 1156F 
frog embryos 

blastula stage components 1147F 
epigenetic inheritance and 205-206
evidence for differentiation without
gene loss 369-370 
reprogramming by donor nuclei 
205-206 

frogs 

myosin thick filaments 915, 919
neuromuscular junctions 630-631 
Rana pipiens 36F
tadpole skin collagen 1063F 
see also Xenopus 
frontotemporal dementia 324 
fructose 

1,6-bisphosphate in glycolysis 75F,

104 

6-phosphate in glycolysis 85F, 104 
structure 96 
fruit fly see Drosophila 
FT (flowering locus T) gene 1183 
FtsZ proteins 806, 896-897 
Fugu rubripes (puffer fish) 29, 223
fumarase as a “perfect enzyme” 143 
fumarate 107 

functional reconstitution, membrane 
proteins 585, 586F 

fungi 

bacterial inhibitors from 351 
as eukaryotes 26-27 
genetic code variation 349 
Neurospora and defining the gene 
416-417 

pathogenic 1271-1272, 1299F 
see also yeasts 
fura-2 indicator 547, 839 
FUS protein 133 
Fused protein kinase 871-872 
fusion proteins (made by fusion) 451 
Cas9 with activators and repressors 
497 

in FRET 545 

with GFP 501F, 502, 504F, 543, 589, 
715F, 802-803, 884F 
with red fluorescent protein 875F 
fusion proteins (promoting membrane 
fusion) 571, 708

G 

G0 phase

initiation factors and 424 
as a resting state 965, 1012
G1-Cdk 972, 1012-1014
G1 cyclins 969
G1 phase

Cdk inactivity in 1002-1004 
cell cycle position 964-965 
chromosomes 486 
G2 phase, cell cycle 964
G2/M transition 968-970, 973, 978, 1014
G418 antibiotic 1255F 
G-protein coupled receptors (GPCRs) 
adrenaline effects 827, 832, 835T
bacteriorhodopsin and 588 
CXCR4 as 1185 

desensitization by phosphorylation 
848-849 
effects 818, 832 

signaling overlaps with RTKs 861-862 
in smell and vision 843-846 
G proteins (trimeric GTP-binding proteins) 
588, 820, 832-834 


four major families 846T 
G12 form 843, 846T, 959F
G13 form 846T, 959F
Gi form 833F, 834, 836, 843, 846T,
848, 862, 959F, 1278
Go form 846T
Golf form 844, 846T
GPCR activation of 833F 
Gq form 836, 838F, 846T, 847F, 859
Gs form 833F, 834, 836, 843, 846T,
848, 862 

Gt (transducin) 845, 846T, 848
regulation of cyclic AMP 833-834
regulation of ion channels 843
signaling via phospholipids 836-838 
subunits 832-834 

G1/S-Cdk 971, 984, 1004, 1012-1014
G1/S cyclins 969, 1013
γ-TuRC (γ-tubulin ring complex) 929,
930-931F, 933
GABA (γ-aminobutyric acid)

as an inhibitory neurotransmitter 629 
receptors as drug targets 632 
Gag gene, HIV 420F 
GAGs (glycosaminoglycans) 

as extracellular matrix macromolecules
1057-1058 

glycosaminoglycan chains 711, 719,
728 

pectins as resembling 1084 
in proteoglycans 1059-1060, 1061F, 
1070-1071 

gain-of-function mutations 
in cancer 1104, 1105F 
defined 487 
Lin14 gene 1181F
as typically dominant 489 
Ubx genes 1162F 
galactocerebroside 575F 
galactose 96 

β-galactosidase (β-gal) 282F, 394F, 501F
galactosyltransferase 546F
galacturonic acid 1083-1084 
GalS repressor protein 522 
gametes 

abnormal 286, 1010 
defined 1004 

genetic recombination with formation 
231 

mitochondria in 807 
in plants 1183 

see also eggs; meiosis; spermatozoa 
ganglion mother cells 1174, 1179
gangliosides 575 
gap genes 1159-1160, 1162 
gap junctions 

distinguished from channels 611 
in epithelial cells 1036F, 1037
structure and function 1050-1053 
gap phases, cell cycle 964 
GAPs (GTPase-activating proteins) 157, 
158F 

coat recruitment 703 
control of GTP-binding proteins 820, 
821F 

Ran-GAP 653-654, 656 
Ras-GAPs 855 
Rho-GAPs 858 

gastrula-stage embryos 36F, 205, 541, 
1045, 1147-1148 


gastrulation 

cell adhesion and migration 1185, 
1187-1188 

pluripotency loss 1148 
preservation of gene expression
patterns 1161, 1187 
variability 1155 
zebrafish 1189F 
Gata4 transcription factor 1259 
GATC residues, methylation 250, 251F,
257F 

gated transport 646 
gating 

gap junctions 1052 
ion channels 604, 614, 618-620, 636F 
lateral gating of protein translocators
678-679 
Gcn4 protein 425 

GDI (Rab-GDP dissociation inhibitor) 705, 
706F, 707 

GDIs (guanine nucleotide dissociation
inhibitor) 858 

GDP (guanosine diphosphate) eIF2
binding 424 

GEFs (guanine nucleotide exchange
factors) 157, 158F 
chromosome stabilization of 
microtubules 986 
coat recruitment 703 
control of GTP-binding proteins 820, 
821F 

Rab-GEFs 706 
Rac-GEFs 960 
Ran-GEF 653, 656 
Ras-GEFs 855, 860, 862F 
Rho-GEFs 858, 958, 997 
gel electrophoresis 
in DNA cloning 469 
DNA fragmentation in apoptosis 
1024F 

of DNA molecules 465-466
pulsed-field gel electrophoresis 466 
SDS-PAGE 452, 453F, 454, 584 
two-dimensional 452-454 
gel-filtration chromatography 448-449, 
450F, 455 

gel-forming polysaccharides 1058 
gel-forming proteins 911, 957
gelsolin 905, 909-910
geminin 975 

gene amplification and cancer 1107, 1111 
gene control regions 384-385, 392-394
gene conversion 

homologous recombination and 282, 
285-286 

tumor suppressor genes and 1109 
gene duplication 

diversification in signaling 1150 
in Drosophila melanogaster 34 
evolution of globins 229-230
frequency in vertebrates 34-35 
as mechanism for innovation 16-18, 
227-228 

transporters and ion channels 
603-604, 622 

whole-genome duplications 35, 228 
gene expression 

chromatin loops and 211-212 
chromatin position and 212, 213F 
chromosome-wide changes 409-411 
circadian clocks and 876-878
combinatorial controls 394-398, 400F,
520-521 

conservative site-specific 
recombination and 294 
control of 369-373, 405 
efficiency of 301F 
enzyme regulation through 150
epigenetic inheritance of 411-412 
and gene function 485-509 
heterochromatin limitation of 194 
localization with in situ hybridization 
502, 536

localizing using reporter genes 501 
monitoring in Saccharomyces 
cerevisiae 32 
monoallelic 411 
oscillations 1177 
patterns in development 1149F 
post-transcriptional controls 413-428
quantitative measurements 502-503 
Ras-MAP kinases signaling and 856 
Ras proteins and 854 
regulation at six points 372, 373F
regulation by ncRNAs 429-436
representing a low response 825, 

826F 

response to external signals 372, 522,
867-874 

riboswitches and 414-415 
serial organization 1163-1164 
time required for 1176 
transcription and translation in 7, 306 
variability between cells 523, 524F 
and vertebrate evolution 227
visualizing 536 
gene families 

arising from gene duplication 17-18, 
228 

common to all biological domains 20
evolution of globins 229-230
gene function 

classical genetics studies 485-488 
cluster analysis and 504 
deducing from DNA sequences 20, 
216-217 

deducing from mutations 21-22, 496, 
498-499 

testing with RNA interference
499-501

gene fusion 495 

gene inactivation by RNAi 433, 499 
gene knockouts 494-496, 1117 
gene products, epistasis analysis and 490 
gene regulation see transcription
regulators 

gene segments, B cells 1319-1320, 

1321F, 1325, 1332 
gene silencing 

Polycomb proteins and 206 
position effects 194 
gene transcription 

Ca 2+ spikes and 840 
cyclic AMP effects 836F 
JAK-STAT signaling pathway and 863 
response to phytochromes 884 
gene transfer from organelles 801-802 
see also horizontal gene transfer 
general recombination see homologous 
recombination 

general transcription factors 309-312, 
384-387, 388F, 390F 


post-transcriptional control 405, 425 
TFII numbering 310 
TFIID and poliovirus 1288 
general translation factors 425 
genes 

coding for multiple proteins 318 
definition of 182, 416-417
distribution in the nucleus 211-212
essential genes of unknown function 
499 

horizontal and vertical transfer 16 
identifying new genes by ribosome 
profiling 506, 507F 
isolation and over-expression 464 
mechanisms for innovation in 16-17 
mitochondrial, in different species 
801F 

multiple proteins from 415 
nature of hereditary information 
173-174 

number and density in different 
species 182F, 415-416 
numbers coding proteins 184T, 185 
numbers in bacteria and archaea 16 
proteins encoded by 7 
rapidly evolving and conserved 15-16 
specific deletion 294-295
specific to animal development 1149
see also cancer-critical genes 
genetic analysis, of tumor cells 1102 
genetic code 

amino acid equivalents of codons 334
history 7, 178 
possible origins 365 
universality 2-3 
variants 349-351, 805
genetic engineering 

conservative site-specific 
recombination 294-295 
conservative site-specific 

recombination in 294-295 
embryonic stem cells and 1253 
epitope tagging 450-451 
of fluorescent proteins 545 
transgenic organisms 495-497 
transgenic plants 507-508 
using the CRISPR system 497-498
genetic innovation 16-17, 217-218, 221, 
227-232 

genetic instability, cancer cells 1097, 

1103, 1111-1112, 1116, 1125, 
1133-1134, 1139 
genetic marker proteins 1221 
genetic recombination 175, 231, 318,

486, 1005-1006 

see also homologous recombination 
genetic screening 

for mutant phenotypes 488-490 
RNA interference and 500
timing of embryonic development 
1180 

genome annotation 477-483 
genome editing/genome engineering 494 
genome maintenance genes 1104, 1110F 
genome map distance 486 
genome sequencing 
bacteria 1268-1269 
cancer cells 1095, 1109-1111, 1119,
1137 

and conserved regions 217, 224-225
cost of 481 


and evolutionary tracing 136-137, 

292 

exome sequencing 1109 
progress in 439, 477 
resequencing 479 
speed of 464, 477 
tumor biopsies 1141 
viruses 1273 
genome sizes 

in amoebae 182 

in bacteria 1267 

chromosome numbers and 183 

compared 28-29, 33, 182, 223-224

in Danio rerio 35 

Drosophila melanogaster 34 

in E. coli 22, 23F 

in Fugu rubripes 223

in humans 178, 179 

in mammals 221-222 

minimum 9 

in vertebrates 222-223 
genome-wide association studies 
493-494 

genome-wide screening, by RNAi 500F 
genomes 

aggregated, human microbiome 1264 
ancient 223-224 
of chloroplasts 800-809 
diversity of 10-23 
eukaryotes 23-39 
evolution 216-234
genes important for multicellularity 
1149 

hybrid 27-28 

mitochondrial 27, 800-809 
proportion encoding ABC transporters 
609 

see also chloroplast genome; human 
genome; mitochondrial DNA 
genomic analysis, and the tree of life 14, 
218-219 

genomic imprinting 407-409 
genomic libraries 469,471,479 
genomic plasticity, viruses 1291 
genotypes defined 485-486
germ-band extension 1045 
germ cells 

embryonic development 1158 
migration in zebrafish 1185F 
germ layers 1148, 1167-1168, 1187 
see also ectoderm; endoderm; 
mesoderm 

germ line 

introducing altered genes 495-497 
mutation rates 238 
RNAi protection 433 
germinal centers 1313F, 1322-1323 
Get3 ATPase 682F
Get1-Get2 receptor complex 682F
GFP (green fluorescent protein) 459F, 
501F, 502, 504F, 543, 546F 
fusion proteins 501F, 502, 504F, 543, 
589, 715F, 802-803 
structure 543F 
GGGTTA repeats 262 
gibberellins 881 

Gibbs free energy, G see free energy 
gigantism, pituitary 1196 
GK domains 1050 
Glanzmann’s disease 1076T, 1077 
GlcNAc see acetylglucosamine
Gleevec® 1135, 1136F
Gli1, Gli2 and Gli3 regulator proteins 873
glial cells 625, 1173-1174, 1179, 1186, 
1198-1202, 1210F 
from neural stem cells 1250 
radial glial cells 1200, 1201F 
glioblastoma 

EGF receptor mutation in 1107, 1117
Rb pathway disruption 1113
β-globin

DNA sequence of human gene 179F, 
223F, 318F 

DNA sequence of mouse gene 223F 
HS4 barrier sequences and 202
β-thalassemia and 324F
globins, evolution 229-230
globular proteins, filaments from 123-124 
glucagon 132F, 835T 
glucocorticoid receptor 400 
glucocorticoids, liver response 372 
glucose 

abnormal metabolism in cancer cells 
1098 

ATP yield from oxidation 775 
structure 96 

transcellular transport 605F 
see also glycolysis

glucose 6-phosphate, in glycolysis 79, 
80F, 85F, 87F, 104 
glucose residues in cellulose 1083 
glucose transport, and Na+ gradients
603F 

glucose trimming 683F, 684-685 
glucosidases 685 
glucosyl transferases 685 
glucuronic acid 1058, 1060F
glutamate-gated ion channels 631 
glutamate neurotransmitter 629, 636 
glutamic acid structure 113 
glutamine 

structure 113 
synthesis 66, 70 
glutamine synthetase 66 
glutaraldehyde fixative 535, 555 
glutathione S-transferase (GST) 451 
glycan cross-linking 1083-1084 
glyceraldehyde 96 

glyceraldehyde 3-phosphate 75F, 76, 77F, 
78, 104-105, 760 
in photosynthesis 784-786, 792 
glyceraldehyde 3-phosphate 
dehydrogenase 105 

glycine 

as an inhibitory neurotransmitter 629 
in collagens 1062 
in elastin 1065 
structure 113 
glycobiology 718 
glycocalyx 582 
glycogen 

breakdown 827-828, 835T, 837T 
polysaccharides 71F, 79-81, 87, 97
glycogen synthase kinase 3 (GSK3) 870 
glycolipids 

as membrane constituents 568, 
575-576 
structure 99 
glycolysis 

ATP production by 74-78, 781 
in cancer cells 1098-1099, 1103, 

1115 


in plants 786 
stages of 104-105 
superiority of oxidative 
phosphorylation 756 
glycophorin 579F, 592F 
glycophosphatidylinositol (GPI) 573, 577F, 
582 

GPI-anchors 573F, 577, 582, 688-689,
749, 1039, 1060
glycoproteins, noncollagen 

distinguishing proteoglycans 1059 
as extracellular matrix macromolecules
1057, 1073-1074 
laminin 1058F, 1069-1073, 

1075-1077 

matrix organization by 1066-1067 
nidogen 1058F, 1070, 1071F 
variant-specific glycoprotein (VSG) 
1290 

see also fibronectin 
glycosaminoglycans see GAGs 
glycosidases 716, 718, 722
glycosphingolipids 690, 749 
glycosyl transferases 716, 718-720, 868, 
1059 
glycosylation 

membrane proteins 582-583, 
683-684, 723 
of Notch protein 868 
in protein folding 685 
purpose of 719-720
glyoxylate cycle 667 
glyoxysomes 667 
glypicans 1073 
GM1 ganglioside 575F, 576
GM (granulocyte/macrophage) progenitor 
cells 1245 

GMCSF (granulocyte-macrophage-colony-stimulating
factor) 864T
goblet cells, intestinal 1218, 1219F, 1221, 
1223-1225 

gold, colloidal 539, 652F
gold particles 

anti-plectin antibodies 949T 
gold nanoparticles 589 
nuclear pore complex investigations 
651 

“golden rice” 508 
Golgi apparatus 

among intracellular compartments 
642 

cis- and trans-faces 715F, 716
localization 715F 
matrix proteins 721-722 
microscopy and 546F, 556F, 558, 

715F 

microtubules in organization 939 
models of transport 720-721 
oligosaccharide processing 716-718
protein transport from the ER to 
710-722 

proteoglycan assembly 718-719, 

1059 

transport to lysosomes 722-730 
see also TGN
golgins 721-722 
gonorrhea 19, 288, 1303 
gout 1301

GPCR kinases (GRKs) 848, 849F 
GPCRs see G-protein coupled receptors 


GPI-anchors (glycophosphatidylinositol)
573F, 577, 582, 688-689, 749, 
1039, 1060 

Gram-positive and Gram-negative 
bacteria 610F, 1267 
grana, thylakoid membrane 783, 789 
granulocytes 864T, 1239, 1241T, 1245
granzymes 1334 
graphene 554F 

gravity and plant growth 881F, 883, 884F 
Grb2 adaptor protein 824F, 855, 862F 
great apes 218, 219F, 220 
green algae 588, 623 
green fluorescent protein (GFP) 459F, 
501F, 502, 504F 
Grim protein 1029 
GRKs (GPCR kinases) 848, 849F 
Groucho co-repressor 870F, 871 
growth 

of animals and organs 1193-1198
and degrowth 1248 
growth cones, axons 858, 943, 951,
1201-1204, 1206, 1208-1211
endothelial tip cells resembling 1237
growth factors 

control of cell growth 1011, 1017,
1114F 

definition 1011
mTOR and 861 
PI 3-kinase and 1017 
growth hormone 

gigantism and dwarfism 1196 
JAK-STAT signaling pathway and 
864T 

see also plant growth regulators
growth inhibition by TGFβ 1012
GSK3 (glycogen synthase kinase 3) 
870-871, 872F, 881 
GST (glutathione S-transferase) 451 
GTP-binding proteins see GTPases 
GTP cap, microtubules 927-929 
GTP (guanosine triphosphate) 
in the citric acid cycle 83, 84F, 
106-107 
structure 85F 
GTP hydrolysis 

microtubules 928 
in nuclear import 654 
in protein synthesis 343-344, 
345-346 

in Rab5 domains 707 
tubulin catalysis of 894 
GTPases 

as cell regulators 156-157 
EF-Tu 160-161, 163
G protein α-subunit as 820, 832
monomeric 703, 820, 854 
Rab and Ras families 278, 854 
Rho family as 997 
septins as 949 
guanine 

deamination to xanthine 272F, 273 
DNA base pairing with cytosine 176, 
177F 

O⁶-methyl- 271
structure 100 

synthesis and riboswitches 414F 
guanosine 

7-methyl- 316-317F 
N,N-dimethyl- 337F
guanyl transferases 316 
guanylyl cyclase 844, 846-847

guide RNAs 328, 329F, 429, 435, 497-498

gut lining see small intestine 

H 

H+ see proton

HaeIII nuclease 465, 468F

Haemanthus 992F 

hair cells, auditory 560F, 619, 890, 924, 
1171, 1189 

as non-renewable 1227 
hairpin helices, DNA 246, 247F 
hairpin helices, RNA 307, 326 
Hairy protein (Drosophila) 422F 
half-lives 

actin filaments 896-897, 906, 919 
connexins 1052, 1053F 
extracellular, of nitric oxide (NO) 847 
and time to steady state 826, 827F
Halobacterium salinarum 587, 588F 
handedness of helices 124-125 
haploid-diploid cycle 4860 
haplotype blocks 492-494 
HARs (human accelerated regions) 226
HATs (histone acetyl transferases) 196 
HDACs (histone deacetylase complexes)
196, 201, 390, 1257
HDL (high-density lipoproteins) in 
nanodiscs 586 

head and tail polymerization 72-73, 340 
heart attacks 

apoptosis and 1031-1032 
muscle replacement by scar tissue 
1247 

muscle replacement by 

transdifferentiation 1258 
heart muscle 

actin and myosin isoforms 923 
actin and myosin mutations 948 
effects of acetylcholine 843
effects of cyclic AMP 835 
localization of mitochondria 755, 756F 
myosin II in 916-918 
transdifferentiation of fibroblasts 1258 
heart muscle cells 1233F, 1247, 
1258-1259 

heart tissue morphogenesis 1059 
heat 

generation by brown fat 780
release by biological reactions 61, 65,
76, 102 

heat shock proteins 
hsp60 355-357, 662 
hsp70 355-357, 659-662, 683, 702 
as molecular chaperones 355 
heavy chains 

dynein 938F, 942 
immunoglobulin 118F, 1316-1318, 
1320-1322 
kinesin 936

myosin (MHC) 915, 923, 958F
Hedgehog genes 1160-1161F
Hedgehog pathway 1150, 1154, 1160 
Hedgehog proteins 871-873, 1160 
see also Sonic hedgehog 
helicase loaders 255, 256F 
helices 

closed ring alternative 124 
collagen triple helix 124-125, 1061,
1070 


cytoskeletal filaments 893-894 
DNA double helix 3F
handedness 124-125 
microtubules 926 
reasons for abundance 124 
S4 helix 622

see also α-helices; double helix
Helicobacter pylori 1129F, 1132, 1263,
1265, 1277 

helix-loop-helix motifs 377, 1171
helix-turn-helix motifs 376 
helper T cells (Th cells)

B cell activation 1316-1317, 1320, 
1322-1323, 1335 

CD4 expression on 1331-1332, 1335 
co-stimulatory proteins/signals 1314 
cytotoxic T cells and 1333 
effector helper T cells 1326, 1335,
1338F 

follicular helper T cells 1336 
HIV invasion of 1332 
macrophage activation 1335 
possible differentiation 1335-1336
presentation of peptide-MHC

complexes 1130, 1327F, 1338F 
T cell classes 1325 
hematopoiesis 1241, 1243F
hematopoietic cell survival 1246-1247 
hematopoietic progenitor cells 1243, 
1245-1246 

hematopoietic stem cells 1239, 
1242-1243, 1308 
hematoxylin 535, 536F 
heme group 

biosynthesis 760 
in cytochrome c 766 
and iron, in hemoglobin 147F, 148 
hemichannels (connexons) 1051-1052 
hemidesmosomes 893F, 946, 947F, 1036, 
1076 

hemoglobin 

expression in blood cells 371 
fetal 229

heme groups in 147F, 148 
interspecies comparisons 37F 
as a multisubunit protein 123 
see also globins 
hemophilia 291, 300F
heparan sulfate 871, 1058, 1060F, 1073, 
1151, 1279

hepatitis-B and -C viruses 1129-1130, 
1132 

hepatitis delta virus 5F 
hepatocytes, I-cell disease 728-729
Her2 protein 1137 
Herceptin® 1137 

hereditary cancers 282-282, 1107-1108 
hereditary nonpolyposis colorectal cancer 
(HNPCC) 250, 1124-1125 

heredity 

as characteristic of life 2 
DNA and the mechanism of 174, 
177-178 

epigenetic inheritance 194 
hermaphroditism, C. elegans 33, 1180F, 
1194 

herpes viruses/herpes simplex virus 
1273T, 1279, 1286, 1288 
Hes genes 1178 
heterochromatin 

around centromeres 432-433 


in cancer cells 1097, 1109
and euchromatin 194 
and gene silencing 206-207 
localization 211-212 
pericentric heterochromatin 204F 
RNA interference and 432-433
in X-inactivation 410 
see also HP1

heteroduplex DNA 278, 280F, 284-286 
heterokaryons 444F, 588 
heterophilic binding 1039F, 1055, 1056F 
heterotypic membrane fusion 712 
heterozygosity, loss of 281 
hexokinase specificity 141 
HGF (hepatocyte growth factor) 1073
HHV-8 human herpesvirus 1132 
Hid protein 1029-1030
HIFs (hypoxia-inducible factors) 1191, 
1237 

high-mannose oligosaccharides 717, 

718F 

Hill coefficients 517-518 
HindIII nuclease 465-466F
Hippo pathway 1197 
hippocampus 

LTP (long-term potentiation) 636-637 
microtubule-associated proteins 932F 
neuron turnover rate 1250 
histamine 742, 1239, 1241T, 1317 
histidine 

chlorophyll-protein complex 787-788 
metal-ion binding 450 
structure 112 
histone acetylases 202 
“histone code” 198 
histone demethylases 196, 201, 1257F 
histone folds 188, 189F
histone H1 (linker histone) 192-193, 1252 
histone methyl transferases 196, 390, 
1257F 

histone modification factors 1257F 
histone-modifications, chromatin 
immunoprecipitation 505
histone-modifying enzymes 

and transcription activators 386 
in transcription initiation 313 
in X-inactivation 411 
histone octamers 188, 189F, 261, 976
histone proteins, in nucleosome 187-188
histone tail side chain modifications 
196-197 

histone tails, N-terminal 189F, 190 
in chromatin compaction 193F 
inheritance of specific modifications
205F 

meaning of specific modifications
200F 

in nucleosome stacking 192
histone variants 

fibroblast reprogramming and 1257F 
histone variant H3.3 205-206 
incompatibility of methylation and
acetylation 207
site-specific insertion 198 
histones 

and non-histone proteins 187 
nuclear reprogramming 1252-1253 
separation from samples 440
side-chain modifications 196-197 
synthesis and the cell cycle 261 
Histoplasma capsulatum 1272 
HIV (human immunodeficiency virus)
and cancer 1130T, 1132 
capsid structure 562F 
combination therapies 1140
detecting in samples 475F 
diversification from SIV 1291F 
genome 420F 

infection and membrane fusion 709, 
710F 

invasion of helper T cells 1332
regulated nuclear transport and 
419-421 

replication errors and evolution 1291
transcription attenuation 414
HIV-1 1273T, 1286, 1289, 1291
receptors required 1279-1280 
HLA proteins (human leukocyte 
associated) 1327 
HNPCC (hereditary nonpolyposis 
colorectal cancer) 250, 
1124-1125 

hnRNPs (heterogeneous nuclear 

ribonucleoproteins) 323F, 326 
Holliday junctions 283F, 284-285 
homeodomain family 120, 122, 378F 
homeodomain motif 376, 1163 
homeostatic mechanisms 1195, 1217,
1226, 1247 

homeotic selector see Hox 
Homo sapiens 

as a eukaryote 16F 
as a model organism 29T, 33, 
491-492 
homogenates 

cell-free systems 451
from subcellular fractionation 445 
homolog pairing 1005F, 1006-1007 
homologous chromosomes 
and imprinting 408F 
segregation in meiosis I 1008
homologous genes 216 
homologous protein species 

interchangeability 1146 
homologous recombination 276-287 
BRCA1 and BRCA2 defects 266T,
267 

cellular regulation of 280-282
double strand break repair 275, 
278-279, 1116, 1133-1134 
in meiosis 282-285 
in transgenic organisms 495, 497, 
498F 

use by cancer cells 1100 
homologs 

in human cells 180 
orthologs and paralogs as 17-18 
homophilic binding 
axon growth 1202 
cadherins and 1038-1042, 1202
contrasted with heterophilic 1039F 
DSCAM proteins 1207 
immunoglobulin superfamily and 
1055, 1056F, 1202 
sealing strands 1048 
homotypic membrane fusion 712, 721, 
748 

homunculi 1155, 1205
Hooke, Robert 439, 1081 
horizontal gene transfer 

antibiotic resistance 19, 289, 1269,
1292-1293 


in prokaryotes 18-20 
sexual reproduction and 19-20 
as source of innovation 16, 17F 
three mechanisms 1268 
hormone-secreting pituitary cells 840 
hormones 

in extracellular signaling 815 
gene expression response 372, 400 
in hematopoiesis 1244 
melatonin 877 

moderated by cyclic AMP 835T 
production through DNA cloning 484 
somatostatin 835-836 
steroid hormones 875-876 
timing of developmental transitions 
1182 

horseradish peroxidase 735F 
horseshoe crab 223 
housekeeping genes 406, 407F 
Hox (homeotic selector) complex 
cell memory and 1164 
in humans 1169 

serial gene expression 1163-1164, 
1169 

Hox genes 1162-1164 

in pattern refinement 1161-1164, 

1169 

rhombomeres and 1188 
HP1 (heterochromatin-specific protein) 
197, 200F, 202F, 204F, 206, 
210-211 

HPLC (high-performance liquid 

chromatography) 449, 457 
HPV (human papillomaviruses) 1131, 
1265, 1273T 

HS4 barrier sequences 202 
hsp60 (heat shock protein) 355-357, 662 
hsp70 protein family 
BiP 683 

clathrin removal 702 
hsp60 and 355-357 
mitochondrial 659, 660-661F, 662
human body 

daily ATP turnover in 774 
as a microbiological ecosystem 
1263-1264 

mutation rate 1091, 1094, 1095F
number of cells and cell types 2, 

1091, 1217, 1264
number of lymphocytes 1308 
proteins unique to humans 122 
human brain, number of neurons 627, 
1198 

human cells 

limit on divisions 1016 
tRNA and anticodon numbers 336 
human chromosome 5 472F 
human chromosome 6 132F, 1327 
human chromosome 9 472F, 1093, 1094F, 
1135F 

human chromosome 13 1108 
human chromosome 22 

compaction in mitosis 187 
sample section 183F 
translocation with chromosome 9 
1093, 1094F, 1135F 
human female, errors in meiosis 1010 
human genome 

analysis, and medical treatment 506 
conserved regions as functional 217
evidence of migrations 232 


gene density and gene size in 182, 
184 

as genetic model 36-37
haplotype blocks 492 
human mitochondrial genome 
801-802F, 804-805 
MHC polymorphisms 1331 
mobile genetic elements 218F
mobile genetic elements in 218F, 287,
291-292 

mouse genome compared with 220F, 
221-222 

mutation rates 238 
number of genes and proportion 
expressed 371 

number of genes coding for collagens 
1062 

number of genes coding for Golgi 
enzymes 718 

number of genes coding for 

intermediate filaments 944, 946 
number of genes coding for ion 
channels 635 

number of genes coding for orphan 
receptors 875 
number of genes coding for 

phosphatases and kinases 819 
number of genes coding for receptor 
proteins 814, 832 
number of ncRNAs produced 429,
435 

number of nucleotides 28, 37, 175,
183F, 184T 
other statistics 184T 
proportion of cancer-critical genes 
1112-1113 

protein-coding genes 120, 122-123 
replication origins 260-261
sample section of chromosome 22 
183F 

sample section of X-chromosome 
300F 

size 178, 179, 184T 
variability 38, 232-234 
human genome project 

genomic DNA cloning 471 
repetitive DNA problem 479
human microbiota 1263-1275, 1277 
human population 

as a genetic model 491-492 
genome-wide association studies 493 
population growth and genetic 
variation 231 

Hunchback transcription activator 393F, 
394, 395F, 1161F, 1179 
huntingtin gene 224F 
hyaluronan 1057F, 1058-1060, 1061F 
hybridization 

nucleotide sequences from 472
in recombinant DNA technology 464, 
473 

in situ hybridization 502, 536 
hybridization (DNA) 

and homologous recombination 
277-278 

in microarrays 257, 258F 
hybridomas 444 

hydride ions, NADH and NADPH 67-68 
hydrocarbon tails, fats 83, 98-99 
hydrocarbons 

as hydrophobic 44, 92
structures 90

hydrodynamic measurements 455 
hydrogen bonding 

in the α-helix and β-sheet 116,
579-580
in β-barrels 582
in cellulose 1083-1084 
codon-anticodon matches 345F 
in the DNA double helix 175, 255
edge recognition of base pairs 374
glycolipids 575 
in hybridization 472 
transmembrane proteins 579-580 
in tRNA 335 
hydrogen bonds 

binding site example 135F 
as noncovalent attractions 44, 94 
in the nucleosome 188
polar amino acids 114 
structure of 45, 92 
in water 44, 94, 613 
hydrogen ion release 481 
hydrogen nuclei in NMR 461 
hydrogen peroxide in peroxisomes 
666-667 

hydrogenation and dehydrogenation 56
hydrolases 

activity in endosomes 736 
in lysosomes 722-723, 727 
transport to endosomes 728-729F 
hydrolysis 

DNA damage from 267T 
as the reverse of condensation 
reactions 49F 

see also ATP hydrolysis; GTP 
hydrolysis 

hydrolytic editing 339F 
hydronium ions 46, 613, 773 
hydropathy plots 579-580, 681F 
hydrophilicity 

contrasted with hydrophobicity 44, 92 
soluble immunoglobulins 1317 
hydrophobic chromatography 448, 452
hydrophobic forces
in lipid bilayers 99 

as noncovalent attractions 44-45, 95 
and protein structure 111, 114, 355 
hydrophobicity 

contrasted with hydrophilicity 44, 92 
ligand-modulated transcription 
regulators 874 

as low in disordered regions 126
membrane-bound immunoglobulins 
1317 

nonribosomal proteins 808 
and quality control mechanisms 357, 
359 

transmembrane proteins 577, 579F, 
593 

hydrothermal vents 11-12 
hydroxyl groups, α- and β-links 97
hydroxyl ions 46, 93 

hydroxylapatite (calcium phosphate) 1229 
hydroxyproline 1062F, 1084 
hygromycin 351F 
hyperpolarization 844-845 
hypervariable immunoglobulin regions/
loops 1318-1319, 1321 
hypochlorite 1239 
hypothalamus 877 
hypoxanthine 271, 272F


hypoxia 1115, 1116F, 1191, 1238F 
hypoxia-inducible factors (HIFs) 1191, 
1237 

hysteresis, in positive feedback 518

I

I-cell disease (inclusion-cell disease)
728-729 

IAPs (inhibitors of apoptosis) 1029-1030
iCAD inhibitor (of CAD) 1024F
ICAM1 1055F, 1076T, 1077
ICAMs (intercellular cell adhesion
molecules) 1055, 1056F
identical twins 412, 477 
iduronic acid 1058, 1060F
IFN see interferon 
IFT (intraflagellar transport) 943 
Ig see immunoglobulins 
IGF family (insulin-like growth factors) 

860, 1114 

IGF1 (insulin-like growth factor-1) 850T, 
852, 1196 

Igf2 (insulin-like growth factor-2) 407, 409 
IGFBP (insulin-like growth factor-binding 
protein) 1066F 
iHog protein 872 
IκB kinase kinase (IKK) 875F
IκB proteins 874

IL1, 5, 6, 10, 12, 13, 17 and 21 see
interleukins 

ILK (integrin-linked kinase) 1079 
Illumina® sequencing 480 
image enhancement 534, 561 
imaginal discs (Drosophila) 1195
immediate early genes 856, 1013F
immortalized cell lines 442-443 
immune rejection, transdifferentiation and 
iPS cells 1259
immune response 

cancer treatment using 1137-1139 
phase variation in bacteria 294 
see also adaptive; innate immune 
system
immune system

nonsense-mediated decay in 352-353 
see also CRISPR system
immunization 

as the basis of vaccination 1307 
primary and secondary immune 
responses 1310, 1311F 
“self” molecules 1313
tumor cells and 1324 
immunoblotting (Western blotting) 455 
immunodeficiency 957 
see also HIV 

immunofluorescence 539F 
immunofluorescence microscopy 
cytotoxic T cells 1334F 
dendritic cell 1326F 
embryonic nervous systems 1041F
epithelial cells 946F, 967F
mitochondria 756F 
neurons 932F 

nuclear localization signals 652F
Sordaria prophase I 1007F 
spermatozoa 590 
synaptonemal complexes 1009F 
immunoglobulin fold 121 
immunoglobulin heavy chains 118F, 
1316-1318, 1320-1322 


immunoglobulin (Ig) domains 1318-1319, 
1321F, 1338 

Ig-like domains 1325, 1338
immunoglobulin (Ig) superfamily 

in antigen recognition 1338-1339
IgA 1316-1317, 1318T, 1320, 1322, 
1323F 

IgE 1316-1317, 1318T, 1319F, 1320, 
1322, 1323F, 1336 
IgG 1316-1317, 1318T, 1320, 1322, 
1323F 

IgM 1316-1317, 1318T, 1319-1320, 
1322-1323 

immunoglobulin light chains 1316, 1317F, 
1318, 1319F, 1320-1322 
V region 1320, 1321F, 1323F
immunoglobulins (Ig) 

B and T cells distinguished 
1308-1309, 1324 
and B lymphocytes 1315-1324 
binding site versatility 138, 1320
as bivalent molecules 1316
in cell-cell adhesion 1055-1056, 1077 
diversification mechanisms 1323F 
evolution 1338-1339
exon recombination in 230 
five classes of mammalian 1316-1317 
Ig-like domains 1325, 1338
primary human repertoire 1319
secondary classes 1322-1323
secondary human repertoire 1320
soluble and membrane-bound 
1315-1316, 1317 
V(D)J recombination in 290
see also antibodies 
immunogold electron microscopy
556-557 

immunological memory 1309-1311, 1326 
immunological self-tolerance 1307, 
1336-1337 
AIRE gene in 1333 
central and peripheral tolerance 1314
FoxP3 gene in 1336 
mechanisms of 1313-1315 
MHC-peptide complexes in 
1328-1329 

immunological synapses 1333, 1334F
immunoprecipitation 449-450

co-immunoprecipitation 457, 505, 
506F 

immunostaining 1202F 
immunosuppression 

immunosuppressive drugs 655F 
tumor microenvironments 1137, 

1138F 

importins see nuclear import 
imprinting, genomic 407-409 
in silico, meaning 472 
in vitro, meanings 440-441 
in vivo, meanings 440-441 
inactivation, and signal response speeds 
825-826, 848 

incontinentia pigmenti 300F 
indels 492 

independent-choice theory (stem cells) 
1222, 1223F 

indirect immunocytochemistry 539 
indole-3-acetic acid (auxin) 882, 883F 
induced-fit models 153, 345
induced proximity 822 
inductive signaling 1150-1151 
morphogens in 1151, 1153 
sequential induction 1153-1154, 1160 
in vertebrate embryos 1167F, 1177, 
1184, 1198 

infants 

antibody transport in newborns 737 
birth defects 1154 
infection strategies 1276-1294 
infectious diseases 

immunity and the adaptive immune 
system 1297
mortality 1263 
see also pathogens 
inflammasomes 1301 
inflammatory response 
and cancer 1132 
and pathogens 1300-1301 
and white blood cells 1240 
influenza virus 

as a disease-causing virus 1265, 
1273T 

effect on host-cell transcription 1288 
genome reassortment 1291, 1292F
infection by endocytosis 709, 
1280-1281 

influenza A 1273T, 1280 
negative strand genome 1289
nomenclature 1292F 
pandemics 1291, 1292F
structures 1274 
inheritance 

chromosome-wide gene expression 
changes 409-411 
of organelles 648, 807 
see also heredity 

inhibitors of protein function 459-460 
inhibitory neurotransmitters 629 
inhibitory postsynaptic potential 633 
inhibitory signals, sequential 820, 821F
initiation factors 

eIFs (eukaryotic initiation factors) 347,
348F, 423-428 
for IRES 425 
phosphorylation 423-424 
initiator caspases 1022-1025, 1028
initiator proteins 254-255, 256-257F, 259 
innate immune responses 
speed of 1298 

Toll and Toll-like receptors 1165 
innate immune system

as first line of defense 1298-1306 
natural killer cells in 1304 
innate lymphoid cells 1326 
innexins 1051 
inosine, in tRNA 336, 337F 
inositol 1,4,5-trisphosphate (IP3) 837, 847
inositol phospholipid signaling pathways 
836, 848, 852, 853F, 861 
inositol phospholipids (phosphoinositides) 
572, 574, 836 

phosphatidylinositol (PI) 377F, 574 
profilin binding 906 
see also PIPs 
insect vectors 1276 
insertional mutagenesis 488, 491 
insulator DNA sequences 391, 409
insulin 

action via receptor tyrosine kinases 
850T 

assembly 130 


deletion of thymocytes recognizing
1332 

PI 3-kinase-Akt signaling pathway and 
860, 1114-1115
recycling endosomes and 738 
secretion by pancreatic β-cells 1226
insulin receptor 823, 850T, 851-852 
Int1 gene 869
integrases 291 

integration, intracellular signaling systems
825 

integrin clusters and 1079 
integrins 

active and inactive conformations 
1077-1078 
binding sites 1075 

in cell adhesion and motility 955-956, 
959 

cytoplasmic tyrosine kinases and 
862-863 

fibronectin binding 1067-1068 
intracellular signaling proteins and 
1079-1080 

αLβ2 integrin (LFA1) 1055F, 1076T,
1077 

as matrix receptors 1074 
mediation of cell-cell adhesion 1038,
1054, 1077 

mediation of cell-matrix junctions
1037F, 1038 

α- and β-subunits 1075
as transmembrane adhesion proteins 
1037 

types of 1076T 

interacting proteins, identifying 457-458 
interaction domains 822-824, 852-853, 
860 

interchromatin granule clusters 213, 
331-333 

interference effects, in microscopy 531,
532F, 550 
interferon-α (IFNα)
as a cytokine 1304 
JAK-STAT signaling pathway and 
864T, 1304 

interferon-β (IFNβ) 1304
interferon-γ (IFNγ)

from helper T effector cells 
1335-1336 

JAK-STAT signaling pathway and 
864T 

as a Type II interferon 1304 
interleukins 

IL1 (interleukin-1) 873, 1257F, 1301 
IL3 1257F 
IL4 1336 
IL5 1335F 

IL6 1301, 1335F, 1336
IL10 1335F, 1336 
IL12 1301, 1335 
IL13 1335F 

IL17 1301, 1335F, 1336 
IL21 1335F, 1336 
intermediate filaments 
assembly 124, 946
in cell migration 959 
desmosomes and 1036, 1045-1046
flexibility and stretch 895F 
hemidesmosomes and 1076 
and the nuclear envelope 890 
and the nuclear lamina 891, 944


and septins 944-950 
intermembrane space, mitochondria 658, 
663-664 

internal membranes
chloroplasts 27F 
eukaryotic cells 24 
mitochondria 26F 
interphase 

as cell cycle stage 185, 964 
chromosome state in 185-188, 198,
207-208, 209F 
components 964, 965F 
interpolar microtubules 982 
intestinal crypts 1122, 1124, 1218-1219 
intracellular compartments see organelles 
intracellular development programs 1179, 
1200 

intracellular pathogens 1278-1279, 1282, 
1283F, 1284-1285 
intracellular signaling cascades see 
enzyme cascades 

intracellular signaling complexes 822, 
823F, 851F, 859 

intracellular signaling pathways 814 
combating noise 820-822 
interaction domains 822-824, 
852-853, 860 
in plants 880-885 
response behaviors 824-825 
response speeds and turnover rate 
825-826 

see also signaling pathways 
intracellular signaling proteins 578, 700, 
814, 1079-1080 
intraepithelial neoplasia 1131
intragenic mutations 16, 17F 
intralumenal vesicles 724F, 729-730, 
735-737, 738F 

intrinsic activation pathway, apoptosis 
1023-1029 

anti-cancer drugs binding to Bcl2 
1031 

Bcl2 regulation 1025-1028
mitochondrial involvement 1025 
intron sequence ambiguities 416 
introns (intervening sequences) 

consensus nucleotide sequences
319F 

discovery 417 
evolution rate 220F
in human genome 183F, 184, 224F, 
318F 

self-splicing 324 
size range 319, 321 
transcribed, removal by RNA splicing 
315-316, 317-318, 320F, 336
unique removal by IRE1 688 
invadopodia 952 

invariant membrane proteins 1316-1317, 
1326, 1336 

invariant polypeptide chains, BCR/TCR 

1327F, 1328-1329, 1330F, 1337F, 
1338 

invariant sections, MHC protein 1331 
invasin 1281 
inversion mutations 487 
inverted repeats 289, 603, 604F, 676F 
ion-channel-coupled receptors 817F, 818, 
843 

ion channels 

all-or-nothing opening 626 
cyclic-nucleotide-gated 843-844
depolarizing and hyperpolarizing 627 
G protein direct regulation 843
gating 604, 614, 618-620, 636F 
ion selectivity 613-614, 617-618 
mechanosensitive 619-620 
paracellular pores as 1049 
photosensitive 623 
sequential activation in neuromuscular 
transmission 632-633 
ion-concentration gradients 601-604 
ion-exchange chromatography 448-449, 
452 

ion-gated channels 614 
ion pumps 164 

ion-sensitive indicators 546-547 
ion sources, mass spectrometry 456-457 
Ion Torrent™ sequencing 481
ionotropic receptors see ion-channel- 

coupled receptors; transmitter- 
gated ion channels 

ions, inorganic, inside and outside cells 
598T 

IP3 see inositol 1,4,5-trisphosphate
IP3 receptors (IP3-gated Ca2+-release
channels) 837-840 
ipilimumab 1138 

iPS cells (induced pluripotent stem cells) 
398, 401, 1254-1259
IRE1 protein kinase 687-688 
IRES (internal ribosome entry sites)
425-426, 1288 
iron 

and atmospheric oxygen levels 797 
in cytochrome c and hemoglobin 766 
iron-sulfur clusters 

chloroplasts 787, 792-794F 
mitochondria 760, 766-768, 

769-771F, 773F
irradiation see radiation 
IRS1 (insulin receptor substrate 1) 824F, 
852 

isocitrate 106-107 
isoelectric focusing 453-454 
isoforms 

of actin 898

of collagen 1070, 1071F, 1072 
of connexins 1051
and defining the gene 417 
of DSCAM proteins 415, 1207 
of fibronectin 1067 
of histone H3 1253 
of laminin 1070, 1072F
of tubulin 925
isoleucine, structure 113 
isomers/isomerizations 
amino acids 112 
citric acid cycle 106 
in glycolysis 104 
monosaccharides 96 
isopeptide bonds 157, 159F
isotopic labeling, equilibrium 
sedimentation 447
radioisotope labeling 452, 454, 
466-467, 1219 
Ixodes scapularis 1264F 

J

J gene segments 1320, 1321F, 1325 
JAK-STAT signaling pathway 863-864, 
1304 


Janus kinases (JAKs) 863 
jellyfish 543, 547 

“jumping genes” see mobile genetic 
elements

junction complexes/junctional complexes 
ER with mitochondria 691 
integrin clusters and 1079 
tight junctions 1049-1050 
junctional diversification 1321 
junctional epidermolysis bullosa 1069 

K 

K+ channel types 634-636
K+ gradients 607, 615, 617
K+ leak channels 614-615, 617-618,

619F 

K-Ras proto-oncogene 1123, 1125
Kai proteins 878 
Kartagener’s syndrome 942 
karyopherins (nuclear transport receptors) 
326, 653 
karyotypes 

abnormalities and cancer 1097, 1108, 
1111, 1125F

human chromosome set 181F, 182 
KASH proteins 949, 959 
katanin 933, 935-936 
Kcnq1 gene 409

KDEL receptors/sequences 713-714 
keratan sulfate 1058, 1060-1061F 
keratins 944T, 946-947, 949, 953, 1046 
α-keratin 115, 117, 124
keratocytes 951, 953, 954-955F
α-ketoglutarate 84, 106-107
ketoses 98 

kidney cells 536F, 558F 
kidney disease 1071 
kidney glomeruli 1063T, 1069-1071 
killer (cytotoxic) lymphocytes 1024, 1025F 
see also cytotoxic T lymphocytes 
kinase cascades 820 
kindlin 1037T, 1075F, 1078F 
kinesin-1 936, 937F, 939 
kinesin-2 943 
kinesin-4 983, 991, 992F
kinesin-5 983, 985, 987F, 994 
kinesin-10 983, 991, 992F
kinesin-13 933-934, 936
kinesin-14 983, 985, 987F 
kinesins 163, 459, 460F, 936-937, 939, 
1288 

kinesin-related proteins 983
kinetic binding experiments 458 
kinetic energy 54F 
kinetic proofreading 345 
kinetic rate constants 458, 516F 
kinetics of enzyme 141-144 
kinetochore fibers 989F, 990 
kinetochore microtubules 982 
kinetochores 

bi-orientation 988-989 
in meiosis 1008 
microtubule attachment 990F 
in mitosis 980-981, 988F
as protein complexes 186 
sister-chromatid attachment 987 
Skp1 protein in 167-168
species differences 203
Kit gene/kinase

mutation effects in human and mouse 
37F, 1187 


stem cell factor and 1244 
KKXX sequences 713 
Klf4 transcription regulators 398-399F, 
1254-1255 

see also OSKM factors
Km (half-maximal reaction rate) 141, 143 
Krebs cycle see citric acid cycle 
kringle domains 122 
Krüppel gene 1159 
Krüppel protein 393F, 394, 395F, 1179 
Ku protein 275F 

L 

L1 element see LINEs 
L15 protein 347F 
labeling see isotopic labeling 
laboratory culture see cultured cells 
Lac operon 381F, 382-383
β-lactamases 1293

lactate production in cancer cells 1098 
lactic dehydrogenase 118F 
LacZ gene/protein 382F, 384F, 1162F, 
1221-1222F 

lag phase, actin filament growth 899-900, 
902 

lagging strand 
defined 242 

RNA primer synthesis 245 
lakritz mutant 488 
lamellae 1083-1086 
lamellipodia 890, 892F, 906, 912, 

951-957, 958F, 959 
axon growth cones 1201, 1202F
laminins 1058F, 1069-1073, 1075-1077 
cultured cells and 1223 
structure 1070F 
laminopathies 948 

lampbrush chromosomes 207-209, 211
Langerhans, islets/cells 1226, 1240 
“lariat” formation, RNA splicing 318-319,
320F, 321, 324

lasers 

confocal microscopy 541 
in FRAP 545, 546F, 589 
multiphoton imaging 542F 
photoactivation/photoswitching of 
fluorescent tags 544, 546F, 
551-552 

in TIRF microscopy 548 
last common ancestor 

cancer diversification from 1119F 
eukaryotes 880 

of humans and chimpanzees 218, 
219F 

of humans and mice 221 
species divergence and 16F, 17, 37F, 
226 

late endosomes 

delivery to lysosomes 696F, 723, 727, 
733 

early endosome maturation 730, 
735-736 

Rab proteins 706T, 707 
latent transcription regulators 
Hedgehog as 871 
NFκB proteins as 873
Notch protein as 868 
plant photoproteins and 884 
in regulated proteolysis 867 
Smad family proteins as 865 
STAT proteins as 863, 865 
lateral diffusion in bilayers 570, 588, 589F
lateral inhibition 867, 868F, 1151-1152,
1174F, 1178-1179 
Notch mediated 1171-1173, 
1224-1225 
latrunculin 904, 906 
Lck protein kinase 862, 1332, 1337F 
LDLs (low-density lipoproteins) 733-734 
LDV sequence (Leu-Asp-Val) 1075 
leading strand, defined 242 
“leaky scanning” 348, 424 
lectin pathway, complement system
1302-1303 

lectins 

carbohydrate layer affinity 582 
in cell recognition 575-576
ER chaperones as 685-686 
export from ER 711-712, 720
mannose-binding lectin (MBL) 711, 
1301, 1302F 

receptors as PRRs 1300 
selectins as 1054, 1312
LEF1/TCF transcription regulator 870F, 
871 

Lefty protein 1168 

Legionella (L. pneumophila) 739, 1284F,
1285-1286 

Legionnaire’s disease 1263, 1285
Leishmania (L. tarentolae) 804, 1284F,
1286F 

leptin genes 219-220F 

leptotene 1006, 1007F 

Let7 gene 1180-1181 

lethal mutations 487, 488, 491, 496

leucine-rich repeats 1299-1300 

leucine structure 113 

leukemias 

acute lymphocytic leukemias 1117 
Bcl2 overproduction and 1246-1247 
as cancers of white blood cells 1092 
chronic myelogenous leukemia (CML) 
1093-1095, 1135-1136 
leukocyte adhesion deficiency 1076T, 
1077 

leukocytes see white blood cells 
LeuT protein 604F 
lever arm, myosin 916, 917F, 925F 
LFA1 (αLβ2 integrin) 1055F, 1076T, 1077
Lgr5 gene 1221-1223, 1224F 
LHC (light-harvesting complexes/antenna 
complexes) 788-789, 794F 
licensing, of replication origins 974-975
life, origins of 362-366 
ligand-gated channels 614, 629, 636
ligand-modulated transcription regulators 
874-876, 877F 

ligands 

binding and allostery 152-153 
binding and linkage 151 
binding and RTK dimerization 851 
defined 134 

in extracellular signaling 815-816 
see also binding sites 
light chains 
dyneins 937 

immunoglobulin 1316, 1317F, 1318, 
1319F, 1320-1322, 1323F 
kinesin 936, 939 

myosin (MLC) 915-916, 922, 957,
958F, 997 

light-driven cation channels 623 


light-driven proton pumps 586-588, 602F, 
606 

light microscopy 

asymmetric cell division 1002F 
bright- and dark-field 533-534, 535F 
confocal microscopes 540-542, 544F 
continuing utility 529 
differential-interference-contrast 
microscopy 533-535, 560F, 
1002F, 1202F 

image deconvolution 540, 542 
image enhancement 534-535 
of living cells 533-534 
of mitochondria 756, 802 
multiphoton imaging 542 
plant cell in telophase 1000F 
resolution 530-532 
sister chromatid separation 992F
specimen preparation 535-536
stains for 529, 535, 536F 
superresolution techniques 549-554 
three-dimensional imaging 540, 541F, 
542, 550F, 553F 
typical design 531F 
see also fluorescence microscopy 
light reactions 783-785, 787F, 793 
light-sensitive phytochromes 883-885 
lignin 1082-1083 
Lin4 gene 1180-1181 
Lin14 gene 1181F
Lincoln, Abraham 1066 
LINEs (long interspersed nuclear 
elements) 218F, 291
linkage and ligand binding 151 
linkage tetrasaccharides 1059 
linker DNA in nucleosomes 187-188
linker histones (histone H1) 192-193, 

1252 
linker proteins 

in cartilage 1061F 
cytoskeleton 948-949, 959 
lipid anchors 577-578, 593 
see also GPI-anchors
lipid bilayers 99, 566-576 

assembly in the ER 689-691 
association of membrane proteins 
with 576-577 

asymmetry 573-574, 590, 681, 690 
composition and its effects 571T 
deformation by membrane-bending 
proteins 573-574 
domains 572-573 
electrochemical gradients 612 
flip-flops between monolayers 
570-588 

as fluids/solvents 569-573, 588 
fusion 708F 

lateral diffusion in 570, 588, 589F
membrane transport proteins in 597 
osmium tetroxide binding 555 
overview 99, 566-576 
as self-sealing 568, 569F 
small molecule diffusion across 598
spontaneous assembly 566 
spontaneous formation 9, 568-569 
synthetic 569-571, 597-598
lipid droplets 573 
lipid kinases 574 

lipid rafts/raft domains 572, 573F, 575, 
590, 689, 749-750
caveolae from 731, 750 
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as amphiphilic 566 
density in cell membranes 566 
dolichol as 684 
fatty acids and 98 

lipoprotein production in smooth ER 670 
liposomes 569-570, 572F, 585 
lissencephaly 939 
Listeria (L. monocytogenes) 423F, 

914-915, 953, 1281, 1284, 1287, 
1288-1289F 
listeriolysin O 1284 

lithotrophic organisms 11, 12F, 13, 14F 
liver 

carcinogen activation by 1128 
conversion to nerve cells 396, 397F 
glucocorticoid response 372 
hepatocyte renewal 1226-1227
hepatocytes and I-cell disease
728-729 

membrane enclosed organelles 643T, 
644F 

peroxisomes 667F 

protein abundances in brain and 372F 
size regulation 1022
urea cycle 760 

vasopressin-induced Ca2+ oscillations
840F 

liver cancer and hepatitis viruses 1132 
liverworts 806F 
living cells 

bilayer formation and 569 
light microscopy of 533-534, 

538-539, 541F, 542-546 
monitoring ion concentrations 
546-547 

lncRNAs see long noncoding RNAs
local mediators, cell signaling 815 
localization 

auxin transporters 883 
protein machines 164 
loci, defined 486 
logic functions 

AND logic 521-522, 523F 
AND NOT logic 521-522 
logic operations see switches 
long noncoding RNAs (lncRNAs)

conserved sequences producing 225 
function 305T, 435-436 
in imprinting 409 
Xist as 411 
loop structures 

chromatin 207-208, 211-212
DNA looping 383-384, 385F, 386, 391 
ion channels 618 
see also feedback loops 
loss-of-function mutations 
defined 487 

disease predisposition and 494
following gene duplication 228-229
Lin14 gene 1181F
tumor suppressor genes and 1104
as typically recessive 489
Ubx genes 1162F 
loss of heterozygosity 281 
Lou Gehrig’s disease 947 
low complexity domains 132-133 
low-light photography 534 
lox sites 496-497 

LPSs (lipopolysaccharides) 1267, 1269 
LRP (LDL-receptor-related protein) 
870-871
LRP4 receptors 1210
LRR (leucine-rich repeat) receptor kinases 
881 

LTD (long-term depression) 637
LTP (long-term potentiation) 636-637 
lumens 

defined 644 

endoplasmic reticulum 669
small intestine 1037 
luminescent proteins 547 
luminescent reporters 1117F 
lung cancer 1095, 1110F, 1129F, 1136 
lungs, formation and structure 1190 
luteinizing hormone 835T 
Lyme disease 1264F, 1286F 
lymph nodes (lymph glands) 1313F 
lymphatic system

antigen presentation 1305, 1306F
endothelial cells in 1236
lymphocyte recirculation 1311-1313
metastases use of 1101-1102, 1123,
1236 

see also lymphoid organs 
lymphocytes 

autoimmune diseases and 1031 
B and T lymphocytes 1240, 1297 
differentiation in immunological
memory 1309 

elimination by apoptosis 1022
killer (cytotoxic) lymphocytes 1024, 
1025F 

L-selectin and 1055 
maturation 1310 
migration 1185-1186 
number in the human body 1308 
tyrosine-kinase-associated receptors 
and 862 

see also B cells; T cells 
lymphoid follicles 1312, 1313F, 1322, 
1336 

lymphoid organs 

human lymphoid organs 1308F, 1309 
peripheral, and antigen display 1324, 
1326, 1332 

peripheral, and self-tolerance 1314 
peripheral, antigen binding in 1308-1312,
1313F, 1317, 1322, 1323F
lymphomas 1092 
lysine 

acetylation during transcription 387,
388F 

elastin cross-linking 1065 
histone modifications 196-197, 200F 
incompatibility of methylation and
acetylation 207
KKXX sequences 713
in nuclear localization signals 650
in the nucleosome 188, 196-197
regulatory use of methylation 165T
structure 112 

lysosomal storage diseases 728-729 
lysosome-dependent/independent 
pathways 1283 
lysosomes 

among intracellular compartments 
642 

delivery pathways 725, 727-728 
exocytosis 729 

fusion with the autophagosome 726 
heterogeneity 723-724 
maturation 724F 


plant and fungal vacuoles 724-725 
structure and function 722-724 
transport from Golgi apparatus to 
722-730 
lysozyme 

catalysis illustrated 6F, 144-146 
disulfide bonds in 127 


M-Cdk 973, 978-979, 982, 985-986, 

1003 

M-cyclins 969, 971, 978, 993, 1004
M phase 

chromosome changes 215 
position in cell cycle 963, 964F 
macromolecules 

complex formation 50F 
conformations 49 
in the cytoplasm 60F 
extracellular matrix 1057-1058 
formation by condensation reactions 
49 

head and tail polymerization 72-73 
importance in biology 43, 47-49 
localization using immunogold
electron microscopy 556-557
precursors from the citric acid cycle 
85F 

relative sizes 1059-1060F 
self-assembly 128-129 
and small molecule monomers 47
synthesis driven by ATP hydrolysis 
70-73 

visualization using AFM 548-549 
visualization using negative staining
or cryoelectron microscopy 
559-561 

visualization using TIRF 547-548 
macrophages 

CSFs and production of 1245 
derived from monocytes 1239 
inhibitory receptors 1245 
Legionella pneumophila and 1286
as professional phagocytes 739, 1301 
selectivity 1031 

macropinocytosis 725, 732, 733F, 738 
as a lysosome delivery pathway 725 
virus entry 1280 
Mad2 protein 994 
magnesium in chlorophyll 766, 787
magnesium (Mg2+) ions
integrin binding 1075 
NMDA receptors and 636 
magnification and appearance of cells 
529, 530F 

maintenance methyl transferases 
404-405 

maize 

pachytene chromosomes 550F 
regulatory DNA influence 1175 
major groove, DNA double helix 177F,
373-374F 

location of methylated cytosines 405 
transcription regulator binding 
373-377, 405 

malaria 610, 804, 1272-1273, 1276, 

1282, 1331
see also Plasmodium 
malate 107 

MALDI (matrix-assisted laser desorption 
ionization) 456-457 


malignancy, defined 1092, 1093F 

maltoporin 581 

mammals 

conserved body plan 227 
hematopoietic stem cells 1244 
imprinting restricted to 409 
ion concentrations inside and outside 
cells 598T 

limited regenerative abilities 1249 
LTP in the hippocampus 636-637 
membrane lipids 567 
the mouse as a model organism 
35-36 

phylogenetic tree 221F 
manganese clusters 790, 791F, 793-794F
mannose 96 

mannose-6-phosphate (M6P) 727, 
728-729F, 742F 

MAP kinases (mitogen-activated protein 
kinases) 

activation as all-or-none 830F 
mammalian Raf, Mek and Erk kinases 
856 

MAP kinase module 856 
in protein kinase evolutionary tree 
155F 

Ras activation 855-856, 1012
see also Ras-MAP kinase 
MAPKK (MAP kinase kinase) 856-857, 
1271 

MAPKKK (MAP kinase kinase kinase)
856-857

MAPs (microtubule-associated proteins) 
932-934, 986 
MAP2 932-934 
see also centrosomes 
Marchantia 801F, 805F
Marfan’s syndrome 1066 
mass spectrometry (MS) 455-457 
mast cells 1239, 1240, 1317 
master transcription regulators 398-399, 
1254-1256 

maternal-effect genes 1158-1159 
maternal-zygotic transition 1181 
mathematics in biology 509-525 
see also models 

mating factors/receptors/pheromones 
813-814, 832, 857 
matinib 1135-1136 
matrix metalloproteases 1072-1073
matrix receptors and co-receptors 1074 
matrix space, mitochondria 658, 660, 
661-662, 757-759 

MBL (mannose-binding lectin) 1301, 
1302F, 1303 
Mbl protein 896-897 
MCSF (macrophage-colony-stimulating 
factor) 850T 

Mdm2 protein 1014, 1015F, 1016, 1017F 
Mdr1 gene 1139

MDR (multidrug resistance) protein 610,
1293 

mean lifetimes 513F, 514 
mechanical stability 

compression resistance 1036F, 1082,
1084F 

intermediate filaments 944, 946-948 
tensile strength 1046, 1057, 1063,
1070, 1082-1083, 1084-1085F 
mechanically gated channels 614, 619 
mechanotransduction 1043, 1044F, 1074, 
1080-1081
medaka fish 547F
Mediator protein 

as a coactivator 386 
in transcription initiation 312, 313F, 
385F 

Mef2c transcription factor 1259 
megacolon 1186 

megakaryocytes 180, 1002, 1239, 1241T 
platelet derivation from 1239
polyploid nucleus 1242
meiosis 

chromosome behavior in 1004-1010 
comparison with mitosis 1005F, 
1008F 

genetic recombination in 486 
Holliday junctions in 284 
homologous recombination and 277, 
282-285 

pachytene chromosomes 550F 
in Saccharomyces cerevisiae 31 
meiosis I 1004-1006, 1008-1010
meiosis II 1005, 1008F, 1009-1010 
Mek (MAP kinase kinase) 856
melanocytes 729, 939, 940F
melanomas 1092, 1093F, 1136, 1138 
melatonin 877 

membrane-associated proteins 577 
membrane attack complexes 1303 
membrane-bending proteins 573-574, 
699, 701-702, 732, 737, 896 
membrane-bound antibodies 417 
membrane disruption, by viruses 1280F 
membrane fusion 

endolysosomes 723 
homotypic and heterotypic 712 
SNARE proteins in 708-709 
membrane markers 697 
membrane potentials 

changes mediating fast responses 
825, 826F 

contribution to electrochemical 
gradients 599, 662, 762-763 
ionic basis 617F 
protein import into mitochondria 
661-662 

resting membrane potentials 615, 
617, 629-630 

role of K+ leak channels 614-615
see also voltage-gated channels 
membrane proteins 576-594 
association with lipid bilayers 
576-577 

asymmetry 590, 681 
ATP synthases as 776-778 
attachment of GPI anchors 688 
BCRs and TCRs 1317, 1325-1326,
1337 

cellulose synthase 1085 
complement system 1302
densities 758-759 

detergent solubilization 583-586, 766 
diffusion 588-593 
functional reconstitution 585, 586F 
glycosylation 582-583, 723 
in large complexes 588-589 
lysosomal 723 

peripheral membrane proteins 577 
phospholipase Cβ as 836
phospholipid translocators as 570 
proportion of all proteins 576, 580
regulation by transcytosis 738
see also transmembrane proteins 


membrane-spanning α-helices 677, 679,
680, 777 

membrane traffic disruption 1284-1286 
membrane transport proteins 

active and passive transport 599-600 
energetics 163 

for membrane-bound organelles 641 
as multipass transmembrane proteins 
599 

as proportion of membrane proteins 
597 

role 9, 597-600

transporters and channels 597-599 
membranes, biological 

double membranes of mitochondria, 
chloroplasts and bacteria 753F 
lipid bilayers and membrane proteins 
in 565-566 

mitochondria in biosynthesis 760-761 
phospholipids in 98-99 
recycling 743 

types, in liver and pancreas 643T
see also cell membranes 
memory cells 1311-1313, 1314F 
memory B cells 1312, 1316, 1322
memory T cells 1312, 1322, 1326, 
1332 

and transmembrane polypeptides 
1336 

memory formation 836, 841 

see also cell memory; molecular 
memory 

Mendelian inheritance 408F, 493 
contrasted with cytoplasmic 
inheritance 807-808 
β-mercaptoethanol 452-453
meristems 442, 443T, 1082, 1085, 1183 
mesenchymal cells 

epithelial-mesenchymal transitions 
1042, 1101 

mesenchymal stem cells 1229 
mesenchyme-epithelial interactions 1190 
mesoderm 1147-1148, 1156, 1158,
1167-1169, 1187F, 1189 
connective tissue derived from 1228 
presomitic mesoderm 1177-1178 
Twist specificity 1165 
messenger RNAs (mRNAs) 

5'-end cap snatching 1288 
3'-end processing 324
accumulation delays 1176 
analysis with microarrays and 
RNA-seq 503-504 
bacterial as polycistronic 348 
capping and polyadenylation 315-316 
cloning 469 

degradation and gene expression
372, 373F 

eukaryotic and prokaryotic 316F 
export from the nucleus 325-327, 655
“factories” 332 

in gene expression monitoring 32 
localization in eggs 1156 
localization in the cytosol 421-422 
localization with in situ hybridization 
502 

polyribosome formation 675 
quantifying with RT-PCR 503 
regulation of transport from the
nucleus 419-421
ribosome profiling and 505-506 


RNA editing 418-419 
signal initiated translation 506 
stability and gene expression 
426-428 

in translation 343-344, 351-352
untranslated regions (UTRs) 184T,
421-423, 430 
see also pre-mRNA; UTRs 
metabolic pathways 

enzyme catalysis in 51, 52F 
epistasis analysis and 490 
glycolysis and the citric acid cycle 
among 87F 
metabolic processes 

carcinogen activation by 1128 
catabolism and anabolism 51-52 
rate and ATP utilization 148 
metabotropic receptors 629-630 
metamorphosis 1182 
metaphase 

chromatid alignment 964 
in mitosis 980 

metaphase chromosomes 214, 486, 988F 
metaphase plates 990-991, 992F
metaphase-to-anaphase transition 965, 
968, 969F, 973T, 977 
proteolysis in 970 

sister-chromatid separation 992-994
spindle assembly checkpoints 
993-994 
metastases

cross-section 1100F 
defined 1092 

invasiveness resembling EMT 1101, 
1120 

micrometastases 1102-1103, 1120
monitoring in mice 1117-1118 
and natural selection 1096
progress of 1102F, 1119-1120 
uncertainties over 1119-1120 
Methanococcus jannaschii 16F, 676F 
methionine

as initiator of protein synthesis 347, 
800 

N-formyl- 347, 800, 958, 1031
SRP protein 673 
structure 113 

methionine, S-adenosyl- see
adenosylmethionine 
methylation

of CG sequences and differentiation
404 

changes in nuclear reprogramming 
1252-1253 

DNA damage from nonenzymatic 
267T, 271 

of GATC residues in mismatch repair 
250, 255, 257F 

and genomic imprinting 407-409 
and induced pluripotency 1256 
inheritance of DNA methylation
patterns 404-413 
of ribose in mRNA 316F 
of ribose in rRNA 328 
of tumor suppressor genes 1109 
see also cytosine; lysine 
Mg2+ ions

integrin binding 1075 
NMDA receptors and 636 
MHC (major histocompatibility complex) 
122 


Page numbers with an F refer to a figure; page numbers with a T refer to a table. 





class I, recognition by NK cells
1304-1305 

class I and class II 1326-1327, 1328F, 
1330T 

polymorphisms 1328, 1330-1331 
in T-cell-mediated immune responses 
1307, 1324-1339 
Mia40 protein 663F, 664 
micelles 99 

bilayers and 568, 569F 
critical micelle concentration 583, 
584F 

Michaelis-Menten equation 142-143
microarrays, DNA 

mRNA analysis 503-504 
replication studies 257, 258F
microbiota 1263-1275, 1277
microelectrodes 546 
microfibrils 1065 
microfilaments see actin filaments 
β2-microglobulin 1327, 1328F, 1330T,

1339 

microinjection of altered genes 495 
micrometastases 1102-1103, 1120
microorganisms, preponderance 10
“microprocessor” activity of proteins 
155-156 

microRNAs (miRNAs)

regulatory role 429-431
timing of embryonic development 
1180-1182 

microscopy see AFM; electron
microscopy; light microscopy
microsomes 445, 448, 671-673 
microtomes 535, 555 
microtubule-associated proteins (MAPs) 
932-934 

see also centrosomes 
microtubule depolymerization 990-991, 
994 

microtubule flux 991, 992F, 994
microtubule-organizing centers (MTOCs)
891, 929-930, 931-932F, 936
microtubules 

astral microtubules 982-985, 987, 

992, 994, 999, 1000F, 1001-1002 
building cilia and flagella 941-942 
in cell migration 959-960 
in cell wall orientation 1086-1087 
chemical inhibitors 459, 904T
cortical arrays 1086 
in the cytoskeleton 891, 925-944
drug effects 929 

dynamic instability 927-929, 935, 986 
fluorescence microscopy 538F, 553F 
Golgi apparatus 713, 715 
mitochondria and 755, 756F 
mitotic spindle formation 889-890, 
940-941, 982-984 
motor proteins and 936-938 
nucleation 929, 930F
plus and minus ends 927 
severing and tubulin sequestering 
proteins 935-936 
subunits as asymmetrical 894 
see also tubulins 
microvilli 890-892, 893F, 924 
midbodies 997, 998F 
minor groove, DNA double helix 177F,

190, 373-374F 

minus-end-binding proteins 931, 932, 934


minus-end-directed transport 937-938 
miRNAs (microRNAs), function 305T, 

1149 

misfolded proteins see protein misfolding 
mismatch proofreading 250-251, 257F
mismatch repair system

colorectal cancer, defects in 1116, 
1124-1125 

gene conversion from 286F 
MutS in 549F

strand-directed mismatch repair 244T, 
245, 250-251
mitochondria 

acetyl CoA production 81-82 
among intracellular compartments 
642 

ATP production in 774-783 
ATP production in plants 81 
biochemical fractionation 758 
chloroplasts compared with 782-783, 
797 

contacts with ER 755, 757F 
in cytokinesis 1001 
cytoskeletal microtubule association 
755, 756F

electron transport in chloroplasts and
755F 

features and origins 25, 26-27F, 644, 
663 

heat-shock proteins 355 
human mitochondrial genome 
801-802F, 804-805
intrinsic activation pathway, apoptosis 
1025, 1026F 

junction complexes with ER 691 
lipid imports 691 
overview 755-763 
peroxisome function and 666 
protein transport into 658-664 
pyruvate conversion 75, 81-82 
roles in cellular metabolism 759-761
self-splicing RNAs 324 
separation by centrifugation 445
site of the citric acid cycle 83 
structure 658F, 757F 
tobacco 544F 
urea cycle 760 
yeasts 451F 

mitochondrial activation pathway, 

apoptosis see intrinsic activation 
mitochondrial carrier family 779-780 
mitochondrial DNA 

in different cells and tissues 802 
genetic code variants 334, 349-350 
location 759 

mitochondrial genomes 27, 800-809 
mutation effects 807-808 
mutation rate 220 

mitochondrial fission 755-756, 802-803 
mitochondrial fusion 802-803, 804F, 808 
mitochondrial membranes 

inner and outer membranes 658, 
663-664, 757-758 
inner membrane transport proteins 
779-780 

proton gradient across 103 
small molecule transport across 664
mitochondrial precursor proteins 659-662 
mitochondrial reticulum 757F, 803, 808
mitogens 

in cell division 1011-1012, 1013F 


effects on G1-Cdk and G1/S-Cdk
1012-1014 

response to excessive stimulation 
1017F, 1114F 
mitophagy 727 
mitosis 

cell cortex in 913 
in the cell cycle 978-995 
chromosome compaction in 185, 

186F, 187, 215-215 
chromosome painting in 180-181 
comparison with meiosis 1005F 
five stages of 978 
M-Cdk and 978-979, 982, 985-986 
nuclear envelope in 656-657 
in Saccharomyces cerevisiae 31 
without cytokinesis 1002, 1003F
mitotic chromosomes 979F 
mitotic index 966 
mitotic recombination 1109 
mitotic segregation in mitochondrial
inheritance 807-808 
mitotic spindles 

acentrosomal 987 

Apc protein and 1124 

assembly 984-985 

in asymmetric cell division 1001-1002 

bipolarity 986-987 

centrosome role 930

chromosome attachment 988, 989F 

disassembly 995 

fluorescence microscopy 538F, 545F 
forces on chromosomes 990-992 
kinesins and 936 
metaphase 983F 

microtubules and 889-890, 940-941, 
982-984

monastrol effects 460F 
positioning role in cytokinesis
997-999, 1001 

MLCK (myosin light-chain kinase) 922, 
923F, 958F 

mobile genetic elements
conservative site-specific 
recombination 292-297 
human genome 218F, 287, 292 
transposition and 287-292 
viruses as 290 
model organisms 
cell cycle 966 

in embryonic development studies 
1148 

examples 29T, 33 
genome sizes 28F, 29T 
mutation libraries 498-499 
protein-coding genes 122-123 
models (protein structure) 115F, 460, 

461F 

models (simulations) 524, 1178
modulated repetition 1162
moiré patterns 550 
mold-yeast transitions 1271 
molecular clocks 220-221
molecular dynamics calculations 584F 
molecular memory devices 840-841 
molecular motions, speed of 59-60 
molecular oxygen 

in chemiosmotic coupling 754, 
761-762 

end of electron transport chain 767,
771
in evolution of large organisms 771
from photosynthesis 782, 790, 791F
production by cyanobacteria 782
use by peroxisomes 666-667
molecular switches see switches 
molecular weights 

analytical ultracentrifugation 455 
gap junctions and 1051F 
plasmodesmata and 1054 
SDS-PAGE separation 452
see also macromolecules 
molecule localization with fluorescence
microscopy 536-537 
molten globules 354 
monastrol 460F 

monoallelic gene expression 411 
monoclonal antibodies 

fractionating tumor cells 1122 
from hybridomas 444-445 
ipilimumab 1138 
in microscopy 539, 591F 
against T cell inhibitory proteins 1337
trastuzumab 1137 

monocytes 1231F, 1239-1240, 1241T,
1243F 

monomeric GTPases 820, 821F 
Rho family 843, 956 

monosaccharides, as aldoses and ketoses 
98 

monoubiquitylation 735 
morphogenesis 

branching morphogenesis 1190, 
1191F 

defined 1146 
in embryonic development 
1184-1193 
morphogens 

Bicoid as 1159-1160 
Dorsal as 1165 
graded effects 866, 1151 
neural development 1200 
range 1153 
response timing 824 
Wnt and Hedgehog as 868, 871 
morphological diversity, sticklebacks 
1174-1175 
mosaicism 410-411 

motor neurons 630, 633, 634F, 858, 1199, 
1208-1209, 1211 
motor proteins 161-163 
cytoskeleton 896 
and microtubules 936-938, 1288 
and mitotic spindle 982, 983-984 
Rab effectors as 706 
Toxoplasma gondii 1282 
viruses and 1288 
visualizing with TIRF 548 
see also myosin 
mouse (Mus musculus)

brain 502F, 542F, 624, 1250F 
cancer-critical gene function 
1118-1118 

embryonic development of paw 1022 
epidermis 1226F 
epithelial cell lifetimes 1219
evolution rate 220F
genome comparison with human 
220F, 221-222, 292 
Hox gene effects 1171F 
imprinting in 408F 
iPS cell incorporation 1254 


liver 1197 
macrophage 739F 
as model organism 29T, 33, 35-36 
myosin mutation effects 923F 
transposons in 292 
X-inactivation in 411F 
mouse, transgenic 

“brainbow mice” 502F 
knockout mice and 495-496, 1117
oncogene collaboration 1117-1118 
RNA interference in 500
telomere studies 265 
mouse-human hybrid cells 589F 
movement, and protein conformation 
changes 160-163 
M6P receptor proteins 727, 728F 
Mre11 complex 282-283F
MreB protein 896-897 
mRNAs see messenger RNAs 
mRNPs (messenger ribonucleoproteins) 
655 

MRSA (methicillin-resistant
Staphylococcus aureus) 1276
MS/MS (tandem mass spectrometry) 457 
MscL and MscS channels 620 
MTOCs (microtubule-organizing centers) 
891, 929-930, 931-932F, 936
see also centrosomes 
mTOR (mammalian target of rapamycin) 
860F, 861, 864T, 1114-1115
mucins 718 

mucus 719-720, 749, 1276-1277, 1298 
multicellularity 

defined by the genome 29-30 
evolution in plants and animals
880-881 

noncoding DNA in 182 
SH2 domains in 122 
multidrug resistant cancers 1139
multidrug treatments, for cancer and AIDS 
1139-1140 

multienzyme complexes 148-149 
multinucleate cells 1249 
see also syncytia 

multipass transmembrane proteins 578, 
581-582 

combinations of start- and stop- 
transfer 679-681 

double-pass transmembrane proteins 
679, 681F 

ion-channel-coupled receptors as 818 
KDEL receptor as 714 
membrane transport proteins as 599, 
664 

Patched receptor as 872 
rhodopsin/bacteriorhodopsin as 
586-588, 681F 

seven-pass transmembrane proteins 
870, 872 

multiphoton imaging 542 
multiple sclerosis 625 
multispecies conserved sequences 
225-226 

multiubiquitylation 735 
multivesicular bodies 

ESCRT complexes and 736-737 
exocytosis 729 
fusion 736 
muntjac deer 183 
Mus musculus see mouse 
muscarinic acetylcholine receptors 843,
847F 


muscle cells 

Ca2+ pump in SR 606-607
insect flight muscle 919F 
muscle types 917, 1232, 1233F 
myoblast migration 1185-1186 
myotendinous junctions 1075, 1076T 
neuromuscular junctions 630 
muscle contraction 

actin and myosin in 916-920 
calcium (Ca2+) ions in 920-923
smooth muscle 921-923 
speed of 919 

muscular dystrophy 948-949, 1072,
1077, 1234 

MuSK receptor tyrosine kinase 1210 
mutagenesis 

Ames test for 1128 
carcinogenesis link to 1094 
mutational landscapes 1112F 
mutations 

characteristic order in colorectal 
cancer 1125-1126 
classical genetics approaches 
485-488 

complementation tests 487, 490
deducing gene function from 21-22, 
496, 498-499 

disease risk and 479, 493-494 
effects in mitochondrial genomes 
807-808 

in germ cells and somatic cells 
238-239 

heterochronic mutants 1180, 1181F 
identification through DNA analysis 
491 

intragenic mutations 16, 17F
libraries of 498-499 
missense, in cancer-critical genes 
1110F 

mobile genetic elements and 287,
292 

neutral mutations and population size 
231-232 

rates of in eukaryotes 234, 237 
rates of in mitochondrial genomes 
803-804 

and RNA splicing patterns 323 
S. pombe mutant phenotype 21F 
somatic, in cancer cells 1094, 1104 
types of 487, 1104 
see also gain-of-function mutations; 
loss-of-function mutations 
mutator genes 250 
MutH, MutL and MutS proteins 251F
MutL and MutS genes 1124 
MutS protein 251F, 549F 
mutualism, microbes with hosts 1264 
myasthenia gravis 1315 
Myc regulating protein 
in cancers 1107, 1118 
fibroblast reprogramming 1254-1255 
p53 response 1115
see also OSKM factors 
Mycobacterium tuberculosis 1265, 1281, 
1284F, 1285 

Mycoplasma genitalium 9, 10F, 168, 499 
myelination of neurons 625, 667 
myeloid cells 1240 

myoblasts 1185-1186, 1196, 1233-1234 
MyoD gene 206F 

MyoD transcription regulator 396, 399, 
1170-1171, 1254, 1258
myoepithelial cells 1232, 1233F

myofibrils 918-921 

myosin 

access to actin bundles 912 
actin and, in muscle contraction 
916-920 

in actin arrowheads 898 
ATP hydrolysis by 916 
in cell migration 954-955 
in the contractile ring 890 
fusion with GFP 548F 
as a motor protein 162 
in non-muscle cells 923-925 
myosin gene mutations 923 
myosin protein superfamily 924-925 
myosin I 924 

myosin II 911, 915-918, 923-924, 936,
948, 953, 955-957 
adherens junction links 1042, 1044 
in cell motility 955-957 
light and heavy chains 915, 922-923, 
923F, 957, 958F 
myosin XIV 924F, 1282 
myostatin 1196-1197 
myotendinous junctions 1075, 1076T 
myotonia 627 
myotonic dystrophy 324
myristic acid 578 

N 

N-formylated peptides 958
N-linked oligosaccharides 683-684,
685-686, 716-720 
complex and high-mannose 717 
N-terminal histone tails see histone tails 
N-termini 

acetylation and degradation 360-361
anchoring Src kinases 155 
cadherin linking 1040F 
polypeptide backbone 110F 
pro-peptides, protein precursors 744 
signal sequences at 647, 663, 679 
Na+-Ca2+ exchanger 607
Na+-driven Cl−-HCO3− exchanger 604
Na+ gradient 602, 603F, 604, 605F, 608,
616 

Na+-H+ antiporters 781
Na+-independent Cl−-HCO3− exchanger
604-605 

Na+-K+ pump proteins 585F, 607-608,
615 

Na+-linked symporters 603, 604F, 605,
632 

NAD+ (nicotinamide adenine dinucleotide)
as an electron carrier 754
regeneration 760
NADH dehydrogenase complex 

(Complex I) 767-770, 769F 
NADH (reduced nicotinamide adenine 
dinucleotide) 

as an activated carrier 64 
as an electron donor 67-68, 764
in catabolic reactions 68, 74 
production by glycolysis 74-78 
production in the citric acid cycle 
82-84, 759 

respiratory-chain complexes and 
766-768 

NADPH oxidase complex 1301 
NADPH (reduced nicotinamide adenine 
dinucleotide phosphate) 


as an activated carrier 64 
as an electron carrier 67-68, 760
in anabolic reactions 68 
in photosynthesis 755, 784 
as a reducing agent 68F 
naïve B and T cells 

co-stimulatory signal requirement 
1314 

locating APCs 1311 
in lymphocyte differentiation 1310
naïve B cells 1316-1317, 1322
naïve T cells 1314, 1326, 1328,
1332-1333, 1335-1337, 1338F
NANA (N-acetylneuraminic acid) 575F,

717F 

nanodiscs 586 
Nanog gene 1254F 
Nanog protein 378F, 506F 
nanomachines see protein machines 
nanoparticles 

immunogold electron microscopy 557
quantum dots as 538 
Nanos protein 1158F 
natural killer (NK) cells 1240, 1241T, 1304,
1333-1334
natural selection

mutation and 15-16 
and pathogens 1289 
protein structure and 119-120 
purifying selection 219-220, 221F,
223, 225, 231 

small selective advantages 226
and tumor progression 1091-1092, 
1096, 1104, 1118, 1119F, 1125 
NCAMs (neural cell adhesion molecules)
1055-1056, 1338F 

Ndc80 complex 987, 988F, 990F, 991 
Neanderthals 220, 223-224, 232F, 300F, 
479 

nebulin 919, 920F 
necroptosis 1021 
necrosis 

contrasted with programmed cell 
death 1021, 1022F
as typical of cancer cells 1099, 1110F, 
1116 

negative feedback
Cdc20-APC/C 993
in cell regulation 515-516
in cell signaling 829-830, 853
circadian clocks and 876-878
delayed negative feedback 516, 517F,
1179F 

Hedgehog pathway 873 
Hes and Her genes 1178, 1179F
JAK-STAT signaling pathway 863
NFκB pathway 874, 875F
rate constant fluctuations 516F 
Smad signaling pathway 866 
Wnt signaling pathway 871 
negative feedback loops 402
common to all cells 402
rod cells 845 

negative regulation of enzymes 151, 152F
negative selection 1332, 1333F
negative staining 559-561
Neisseria gonorrhoeae
antibiotic resistance 19
protection against the complement
system 1303

Neisseria meningitidis 1291 


Neisseria spp. 

phase variation 1290 
nematodes see Ascaris; Caenorhabditis
elegans

neoblasts 1248-1249 
neoplasms, defined 1092 
see also cancers; tumors
Nernst equation 615-616
nerve cells see neurons 
nerve impulses see action potentials 
nerve terminals 1209
nesprins (KASH proteins) 949, 959 
netrins 1202-1204 
network motifs 402 
see also feedback

neural crest cells 951, 959, 1041-1042,
1076T, 1186 

neural development 1198-1213 
four phases 1199F 
vertebrates 1199-1200 
neural map formation 1204-1206 
neural stem cells 1201F, 1250-1251 
neural tube 1040F, 1041, 1045F, 1186, 
1192, 1199 

neuraminic acid, N-acetyl- (NANA) 575F
neurites 940, 1202 

neuroblasts 1173-1174, 1179-1180, 1200 
neurodegenerative diseases 
amyloid fibrils in 130-131 
neurofilaments in 947 
plectin gene mutations and 949 
stem cell potential 1250-1251 
neurofilament proteins (NF-L, NF-M and 
NF-H) 944T, 947 

neurogenesis, in Drosophila 1173-1174 
neurogenic ectoderm 1166, 1173-1174, 
1179, 1186 

neurological diseases 939 
neuromuscular junctions 

acétylcholine receptors 630-631 
regeneration 1071-1072
synapse formation 1209, 1210F, 1211 
neuromuscular transmission and ion 
channels 632-633 
neuronal doctrine 441 
neuronal migration 913F 
neuronal specificity 1205 
neurons 

absence of energy storage 87 
activity-dependent synaptic change 
1211-1212 
basket cell 1198F 
commissural neurons 1202-1204 
computation by single neurons 
633-636 

as examples of long-distance 
signaling 815 
firing rate 623, 635 
firing rule 1212 

fluorescence microscopy 543F 
growth cone 858, 943, 951
intermediate filaments 947, 948F 
lateral inhibition 867, 868F
liver cell conversion to 396, 397F 
microtubule-associated proteins 932, 
940F 

migration 1200, 1201F
myelin sheathing 625 
neurotrophic factors and 1208-1209 
normal neuronal death 1208 
olfactory receptor 844F
optogenetic control in mice 624
presynaptic and postsynaptic 
628-629 

programmed production 1201F 
RNA localization 421 
secretory vesicles 744-746 
stathmin role in amygdala 935
structure and function 620-621, 940
survival factors 1030F 
sympathetic 1018 
turnover rate 1250 
types and firing properties 627 
types of ion channel 614, 631 
visualizing in mouse brain 502F 
see also axons; central nervous
system; dendrites; glial cells;
synapses 

“neurospheres” 1250, 1251F 
Neurospora 416-417 
neurotransmitter receptors 

ionotropic and metabotropic 630 
neurotransmitter transporters 603, 604F, 
628 

neurotransmitters 

binding to transmitter-gated channels 
629 

excitatory and inhibitory 629-630 
as extracellular signaling molecules
815-818, 825 

in synaptic vesicles 745-746 
see also transmitter-gated ion 
channels 

neurotrophic factors 1208-1209 
neurotropic alpha herpesviruses 1288 
neutrophils 1239, 1240F, 1241, 

1244-1246 

chemotaxis 958, 959F 
CSFs and production of 1245 
cytoskeletal rearrangements 890, 892 
as professional phagocytes 739, 

740F, 1301 

newts 

limb regeneration 1248-1249
lung cells 991 

NFκB signaling pathway 873-874, 1301
NGF (nerve growth factor) 

action via MAP kinases 856 
action via receptor tyrosine kinases 
850T, 853-854 

as a neurotrophic factor 1208-1209 
sympathetic neuron growth and 1018 
NHEJ see nonhomologous end joining 
NHL (non-Hodgkin’s lymphoma) 1092F 
nicotinamide derivatives see NADH;
NADPH 

nidogen 1058F, 1070, 1071F 
nitric oxide (NO) 846-847 
nitrogen cycle 85-86 
nitrogen fixation 12, 14F, 86 
nitrogen in C-N chemical groups 91
nitroglycerine 847

nitrosated polyamines and peptides 267T 
nitroxide spin labels 569-570 
NLRs (NOD-like receptors) 1300-1301 
NMDA receptors (N-methyl-D-aspartate)
636-637 

NMR (nuclear magnetic resonance)
studies 

of human brain 913F 
of protein structure 461-462 
of transcription regulators 375 


nocodazole 904T 
NOD-like receptors (NLRs) 1300 
Nodal protein 1168-1169 
nodes of Ranvier 625 
Noggin protein 1168-1169 
noise 

in data points 525 
in genetic control systems 1246
in intracellular signaling 820-822, 831 
in microscope images 532-533, 561 
non-crossovers 284-286 
non-genetic variability 524 
non-histone proteins 187, 209-210 
nonclassical cadherins 1037T, 1038, 
1039F, 1046F 
noncoding DNA 

conserved 224-225 
in humans and other species 28-29, 
182 

in the nucleolus 330 
regulatory DNA as 7, 29 
see also introns 
noncoding RNAs 305, 327 

localization with in situ hybridization 
502 

regulation of gene expression
429-436 

RNA sequencing and 482 
small ncRNAs and RNA interference
429-431 

noncovalent interactions/noncovalent 
bonds 

in biological membranes 565 
cytoskeletal filaments 893 
four types 44-45, 45T, 94-95 
ligand binding 134, 139F, 140 
in macromolecules 49-50 
membrane-associated proteins 577 
in protein folding 110-111, 114
nondisjunction 1010, 1109F
nonenveloped viruses 1274, 1280-1281 
nonfibrillar collagens 1063T 
nonhomologous end joining (NHEJ) 
274-275, 282, 289 
B cell switch sequences 1323 
homologous recombination and 275F, 
278 

nonretroviral retrotransposons 291-292 
nonsense-mediated mRNA decay 327F, 
352-353 

normal flora 1264-1265, 1276, 1293, 

1298 

NOS (NO synthases) 847 
Notch protein 

activation by proteolysis 869F 
O-glycosylation 720, 868 
Notch receptor 867-868, 869F, 1117 
Notch signaling pathway 
diversity 1150 
Hes genes 1178 

lateral inhibition 1152, 1171-1173,
1224-1225 

stem cell maintenance 1224-1225 
notochords 1147F, 1167, 1185F, 1189F, 
1200F 

Noxa protein 1028 

NPCs see nuclear pore complexes 

NPV (human papillomavirus) 1131-1132 

NSF protein 709, 712F

NtrC protein 383F 

nuclear envelope 

DNA localization and 178-179 


as feature of eukaryotes 24 
inner and outer membranes 649 
intermediate filaments and 890 
linker proteins 948-949 
in mitosis 656-657, 978, 980-981, 
985-986, 995 

repair factory tethering 213-214 
nuclear export signals/receptors 652-655 
nuclear import signals/receptors 650, 

652, 653F, 654-656 
nuclear lamina 

heterochromatin and 211-212 
intermediate filaments and 179, 180F, 
891, 944

in mitosis 656, 986, 995 
role 649, 650F
nuclear lamins 

A-type lamins 948 
caspase cleavage of 1023 
nuclear localization signals 650-651,
652-653F, 654-655, 657 
amino acid phosphorylation 655 
import receptor binding 652 
nuclear matrix or scaffold 214 
nuclear pore complexes (NPCs) 
arrangement 651F 
export of mRNA-protein complexes 
326-327 

gated transport 646 
import receptor binding 652 
phosphorylation in mitosis 985 
Ran-GTPase and 653-654 
virus entry 1281 

nuclear pores 179, 180F, 202F, 213, 327, 
559F, 561F 

nuclear receptor superfamily 875-876, 
877F 

nuclear reprogramming 1252-1253 
nuclear-to-cytoplasmic ratio 1182 
nuclear transplantation 1252 
nuclear transport receptors 326, 653 
nucleation, actin filament formation
899-900, 902, 906-907, 

908-909F, 953, 954F, 1289F 

nuclei 

as characteristic of eukaryotes 13 
DNA virus replication 1278
human chromosome localization 
211-212, 213F 

as intracellular compartments 642 
loss from red blood cells 1245F 
noncoding RNAs 327-330 
protein import 649 
RNA export 325-327, 419-421, 649,
655 

subcompartments and structures 
213-214, 331-333 
transplanted into enucleated eggs 
205, 206F, 369-370 
transport between cytosol and 
649-658 

nucleic acid structures 1101 
nucleic acid synthesis 71-72 
nucleoids 759, 800F 
nucleolus 213, 329-330, 331F, 340 
snoRNAs 305T, 328 
nucleoporins 126, 649-650, 651F, 656 
nucleoside triphosphate hydrolysis 894, 
896, 902-903 

see also ATP hydrolysis; GTP 
hydrolysis 
nucleosome assembly factors 976
nucleosome core particles
DNA packaging in 188-190 
histone proteins in 187-188, 189F 
susceptible sites to DNA damage 
267F 

nucleosome mark recognition 199F
nucleosome sliding 190, 191F
nucleosomes

assembly behind replication forks
261-265 

as basic chromosome structures 
187-188 

chromatin remodeling and 190-193, 
380 

cooperative binding and 379-380 
DNA replication and 254
fibroblast reprogramming as iPS 1256 
histone H1 binding 193F 
nucleotide excision repair 266T, 270-271
nucleotide-gated channels 614 
nucleotide sequences see DNA
sequences 

nucleotide-sugar intermediates, in 
glycosylation 684 
nucleotides

in activated carriers 69 
biosynthesis 86 
complementarity 3F 
functions 101 
in the nitrogen cycle 85-86 
nomenclature 101 
as nucleic acid monomers 3 
number in human genome 28 
number in the nucleosome 188
as phosphorylated nucleosides 101 
structures 3-4, 100-101 
see also bases 
nucleus see nuclei
nullcline analyses 518, 519F 
numerical aperture and resolution 532, 
533F 

numerical integration 512

O

O-linked glycosylation 719 
O-linked oligosaccharides 684, 718, 868 
O6-methylguanine 271
obesity 1115, 1128F, 1129, 1264 
occludin 1049-1050 
Oct4 transcription regulators 398-399F, 
506F, 1254-1255 
see also OSKM factors
β-octylglucoside 583, 584F
ocular dominance columns 1212 
Okazaki fragments 241-242, 243F, 

245-250, 253-254, 255F, 261 
oleic acid 98 

olfactory neurons 943, 1227
olfactory receptors 832, 843-844, 846T, 
1250F 

oligodendrocytes 625 
oligonucleotides, affinity chromatography 
449 

oligosaccharides 97 

glucose trimming 683F, 684-685 
mannose trimming 686 
N-linked 683

processing in the ER and Golgi
apparatus 718F 


processing in the Golgi apparatus
716-718 

oligosaccharyltransferases 677, 683F, 
684, 717F 
Omi protein 1029 
OMPLA protein 581F
oncogene dependence 1135 
oncogenes 

collaborative action 1118 
discovery 1105-1107 
DNA sequence changes 1110F 
gain-of-function mutations and 1104 
v-Ras 1106 

from viruses 1131-1132 
see also proto-oncogenes 
oocytes see eggs 

open reading frames (ORFs) 424, 457, 

482 

ribosome profiling technique 505-506 
operons 380-385. 385

Lac operon 381F, 382-383
opportunistic pathogens 1265, 1268, 

1276 

opsin 845 
opsonization 1301F 
optic cup 1258F 
optic tectum 1204-1206, 1207F 
optical isomers, amino acids 112 
optical microscopy see light microscopy 
optical sections 540 
optical techniques, protein interactions 
458 

optogenetics 624-625 
OR logic 521F 

ORC (origin recognition complex)
259-261, 974-975, 976F

order 

in biological structures 51F 
disorder as entropy 52-53, 60, 103 
thermodynamics of 52-54 
organelles 

biparental and maternal inheritance
807 

endosymbiont hypothesis 800 
energy-converting 753 
four families of 645 
gene transfer to the nucleus 801-802
growth and proliferation 648, 658,
800, 806 

lipid droplets 573 

membrane-enclosed 565, 641-649, 
889, 896, 938-939, 1001 
not constructed de novo 648, 658 
protein movements between 645-647 
Rab family GTPases in 706T 
secretory and endocytic pathways 
646F 

from subcellular fractionation 445 
volumes in a liver cell 643T 
organic chemistry 

biological importance 47 
chemical bonds and groups 47,

90-91 

organisms 

shared proteins 482 
size differences 1010
see also species 

Organizer signaling center 1167-1168 
organoids 1223 
organotrophic organisms 11 


organs 

generation from stem cells

1256-1257, 1258F, 1266-1267 
grafted/transplanted 1329, 1331 
growth of animals and 1193-1198
regeneration of 1247-1249
size regulation 1226
sizes of transplanted 1193 
transcription regulators and creation of
1170-1171 

see also lymphoid organs 
origins of replication see replication origins
orphan receptors 832, 875 
orthologs 17-18, 36
oscillations 

Ca2+ waves 838-840, 842, 843F
circadian clocks 876-878
metaphase chromosomes 991 
negative feedback effects 516, 517F,
829F, 830, 875F, 876-878 
NFκB activation 874, 875F
and robustness 520 
vertebrate segmentation 1177 
OSKM factors (Oct4, Sox2, Klf4, and Myc) 
1254-1255, 1256F 
osmium tetroxide 555-556, 716F 
osmotic equilibrium 1083 
osmotic gradients 612 
osmotic pressure 612, 619-620, 724 
osmotic stress 857, 1115 
osteoblasts 1057, 1229-1232 
osteoclasts 1230-1232, 1239, 1243F 
osteocytes 1229-1230
osteoporosis 1232 
ovarian cancer 1113, 1116
OXA complex (cytochrome oxidase 

activity) 659, 660F, 663F, 664 
oxaloacetate 70F, 83-84, 85F, 87, 

106-107 

oxidation 

as an electron transfer 55-56
DNA damage from 267T 
of organic molecules 54-55
oxidative phosphorylation 84-85, 86F, 

753, 759F, 761-763 
in cancer cells 1098 
in plants 786 
oxidative stress 1115 
oxygen 

C-O chemical groups 91
carbonyl oxygen atoms 612, 618, 

619F 

detoxification 796

origins of atmospheric 11, 26,

796-797 

utilization in peroxisomes 666 
see also molecular oxygen 
oxygen-carrying molecules, evolution
229-230 

P 

P-bodies (processing bodies) 427-428, 
430 

P680 chlorophyll 790, 791F, 794F
P element, Drosophila 416, 486
P-glycoprotein (MDR) 610, 1293
P-granules 1002F 
32P labeling 466-467
p21 protein 1014, 1116 
p27 protein 971F, 973T, 1004 
p53 protein

apoptosis-promoting function 1015, 
1028, 1115-1116 
cell-cycle arrest 1016 
loss of function in cancers 1126, 
1132-1133 

post-translational modification 166 
target of Chk1 and Chk2 kinases
1014 

as transcription regulator 1116 
p53 regulatory pathway in cancers 
1113-1116, 1123 
P-type pumps 606-608, 690 
pachytene 550F, 1006, 1007F 
packaging proteins 704 
PAGE (polyacrylamide-gel electrophoresis) 
see polyacrylamide-gel; 
SDS-PAGE 

pair-rule genes 1159-1160, 1162 
pairing, homologs 1005F, 1006-1007 
PALM (photoactivated localization 
microscopy) 552, 553F 
palmitic acid 98 

PAMPs (pathogen-associated molecular 
patterns) 

dendritic cell activation 1306F, 1326 
peptidoglycans and LPSs as 1267 
recognition by PRRs 1298, 1300
PAMs (protospacer adjacent motifs) 434F, 
497 

pancreas

acinar cells 748 
β-cell renewal 1226-1227
exocrine cells 671F 
membrane types in liver and 643T 
negative selection to protect 1332-1333

pancreatic cancers 1117 
pandoraviruses 1274 
Paneth cells 1218-1219, 1221-1225
PAP (poly-A polymerase) 325 
papillomaviruses 1129, 1130T, 

1131-1132, 1276 
PAPS (3'-phosphoadenosine- 

5'-phosphosulfate) 719 
paracellular pores 1049 
paracellular transport 1047 
paracrine action, Type I interferons 1304 
paracrine signaling 815 
paralogs 

distinguished from homologs and 
orthologs 17-18 
in vertebrate genomes 34, 120 
Paramecium 614, 941
parasites, eukaryotic 1277, 1282-1284, 
1290 

parasitism 1264 
parathormone 835T 
Parkin ubiquitin ligase 727 
Parkinson’s disease 130, 324, 727, 1251 
ParM protein 897 

PARP (PolyADP-ribose polymerase) 
inhibitors 1133-1135, 1139 
parthenogenicity 987 
parvin 1079 
parvoviruses 1274 

passenger mutations 1104, 1111-1112, 
1119F, 1137 

patch-clamp recording 626, 627F 
Patched receptors 871-873 
paternity testing 476 


pathogenicity islands 1268-1269, 1282F 
pathogens 

cross-species transmission 1279, 

1291 

drug resistance 1291-1294
engulfment by phagocytic cells 
1301-1302 

epithelial barrier to infection 1265,
1276-1277 

evolution by antigenic variation
1289-1291 

extracellular 1269, 1277-1278
facultative and obligate 1268 
fungal and protozoan 1271-1273 
host interactions 1264-1265 
host specificity 1265 
infection strategies 1276-1294 
insect vectors 1276 
intracellular 1278-1279, 1282, 1283F, 
1284-1285 

opportunistic pathogens 1265, 1268, 
1276 

primary pathogens 1265, 1276
use of the host’s cytoskeleton 
913-914, 1286-1288 
viral, bacterial and eukaryotic 1266 
viruses causing human diseases 
1273T 

see also bacteria; parasites; viruses
patterns, biological see order 
patterns, in development see spatial 
patterning 

Pax6 gene (Eyeless) 397-398F, 1146F,
1171 

paxillin 1037T, 1080 
PCNA 247, 262 

PCR (polymerase chain reaction)

chromosome structure analysis 209F 
DNA cloning using 473-477 
ion torrent™ sequencing 481 
quantitative RT-PCR 502-503 
PD1 receptors 1138F, 1139, 1337
PDGF (platelet-derived growth factor) 
action via receptor tyrosine kinases 
850T, 853F 

as a mitogen 1011-1012 
pericyte and smooth muscle 
recruitment 1238 

production of embryoid bodies 1257F 
PDI (protein disulfide isomerase) 682, 686 
PDK1 (phosphoinositide-dependent 
protein kinase 1) 860 
Pdm transcription regulator 1179 
PDZ domains 1050 
pectins 1083-1084 
pemphigus 1046 
penicillin 1267, 1291, 1293
penis, cyclic GMP in 847
pentose phosphate pathway 760 
peptide-binding grooves, MHC proteins 
1327 

peptide bonds 

in the α-helix and β-sheet 116F
as C-N bonds 91 
energetics of 61, 346
hydrogen bonding among 94, 

110-112 

in peptides and proteins 110-112 
in protein synthesis 339-344 
see also polypeptides
peptide synthetase 365 


peptidoglycans 1267, 1292 
peptidyl-transferases 343, 348, 352T 
peptidyl-tRNAs 339, 340F, 342F, 348, 
351F 

Per protein 878 
“perfect enzymes” 143 
perforin 1334 

pericentric heterochromatin 204F 
pericentriolar material 930, 931F, 943F 
pericentriolar matrix 982, 983F, 984, 985F 
pericytes 1235, 1236F, 1238 
perinuclear space 649, 650F 
periodic table of the elements 43F
peripheral tolerance 1314
periplasmic substrate-binding protein 
610F 

periplasms 1267F 

periventricular heterotopia 912, 913F 
PERK protein kinase 688 
perlecan 1058F, 1070, 1071F 
permeases see transporters 
peroxidases 735F 
peroxins 667 

peroxisome proliferation-activated 
receptors (PPARs) 875 
peroxisomes 

Abcd1 gene and 301F
among intracellular compartments 
642 

as organelles 666-669 
persistence length 898, 926, 945 
persistence of responses, intracellular 
signaling 825 

persistent inputs, and feed-forward motifs 
522-523 

pertussis (whooping cough) 834, 1277 
PET (positron emission tomography)
1092F 

Pex1 and Pex6 ATPase 668
Pex5 import receptor 667-668 
Peyer’s patches 1308F, 1311 
pH 

acidity of lysosomes 722-723 
KDEL receptor affinity and 714 
pH scale 46, 93 
regulation by vacuoles 724
regulation of cytosolic 604-605
pH changes, ion torrent™ sequencing 
481 

pH gradients 

contribution to electrochemical 
gradients 662, 762-763 
isoelectric focusing 453 
see also proton gradients
PH (pleckstrin homology) domains 822, 
824F, 859F, 860 
phagocytosis 

of bacteria by host cells 1281-1282 
coiling phagocytes 1286 
defined 730 

as feature of eukaryotes 24, 25F 
as a lysosome delivery pathway 725 
plasma membrane enlargement 748 
professional phagocytes 739 
see also macrophages; neutrophils
phagolysosomes 1284, 1301, 1306F,

1329 

phagosomes 725, 738-740 

autophagosomes 725F, 726-727 
Listeria monocytogenes and 1284F 
PRRs and 1298
Salmonella enterica and 1285F
phalloidins 904, 916F, 953-954F, 957F 
pharmaceutical production through DNA 
cloning 484, 506 
see also drug discovery
phase, light waves 531, 532F
phase-contrast microscopes 533-535, 
591F 

phase transitions, lipid bilayers 571, 572F 
phase variation, bacteria 294, 1290 
phencyclidine 636 
phenobarbital 1022
phenotypes 

behavioral changes 488 
defined 485-486 
stochastic effects on 523-524 
synthetic 491 
phenylalanine 
structure 113 
tRNA for 335F
pheromones 844, 857 
Philadelphia chromosome 1093, 1094F, 
1095, 1135 
phosphate bonds 

bond energies 78, 79F 
phosphates and phosphoanhydrides 
65-66F, 78, 79F, 91, 101 
phosphates, position in nucleotides 100
phosphatidylcholine 567, 570F, 571T,
572F, 573 
synthesis 689 

phosphatidylethanolamine 566-567, 

571T, 574, 689-690
phosphatidylinositol (PI) 574, 577F 
interconversion with PIPs 700, 701F, 
737, 859 

phosphatidylserine 566-567, 571T, 574,
689-690 

and phagocytosis 740, 1030-1031 
and PKC 837 
phosphodiester bonds 

reformation by DNA ligase 246F, 
269-270 

reformation by DNA topoisomerase 
251-252 
in RNA 302F 

phosphodiesterases see cyclic AMP; 
cyclic GMP 

phosphoenolpyruvate 79F, 85F, 105 
phosphofructokinase 104 
phosphoglucose isomerase 104 
3-phosphoglycerate 76, 77-78F, 85F, 105, 
785 

phosphoglycerate kinase 77F, 78, 105 
phosphoglycerate mutase 105 
phosphoglycerides, cell membrane 
566-567 

phosphoinositide phosphatases 859 
phosphoinositides 
AP2 binding to 699 
marking organelles and membrane 
domains 700 

membrane bending and 594 
in signaling complex formation 822, 
823F 

phospholipases 574 
phospholipase C 574 
phospholipase C-β (PLCβ) 836, 837T,
838F, 840F, 859, 862F 
phospholipase C-γ (PLCγ) 852, 853F,

859, 862F 


phospholipase C-ζ (PLCζ) 839F
phospholipid exchange/transfer proteins 
691 

phospholipid translocators 570, 574, 690 
phospholipids 

G protein signaling through 836-838 
mobility 569-571 
in plasma membranes 9, 98 
sites of synthesis 760 
spontaneous bilayer formation 
568-569 

structures 566-567 
phosphorylation 

autophosphorylation 364F, 688, 841, 
842F

in the cell-cycle control system 968,
970F 

initiation factors 423-425 
nuclear localization signals 655
regulating protein degradation 360
regulating protein function 153-156, 
165T 

regulating signal responses 826 
self-phosphorylation of P-type pumps 
607 

of serine in RNA polymerase tails 312, 
316-317 

of serine in the nucleosome 196-197
phosphorylation cycles 156 
phosphotyrosine docking/binding sites 
824F, 850, 852-853, 863 
photoactivation, fluorescent dyes 544, 
546F 

photochemical reaction centers 783-784, 
788-790, 793, 796 
photolyases 885 
photoproteins 884 

photoreceptors, rod and cone 844-846,
848 

photorespiration, peroxisomes in 667, 
668F 

photoswitchable probes 551 
photosynthesis 

and atmospheric oxygen 796-797 
ATP production by 753 
categories of reaction 783-784 
charge-separation in 788-789, 
792-793 

as complementary to respiration 55 
electron-transfer process 788 
energetics of 54F 
green sulfur bacteria 796 
role of chloroplasts in 782-799
thylakoid membrane as site of 
786-787 

see also chloroplasts
photosynthetic reaction centers 588 
photosystems 

in photosynthetic organisms 754-755, 
789F 

of the thylakoid membrane 786 
photosystem I 755F, 789, 790F, 792-793, 
794F 

photosystem II 755F, 789-791, 792F, 793,
794F 

photosystem II complex 588, 791F 
phototrophic organisms 11, 13, 14F
phototropin 885 
phragmoplasts 1000, 1001F
phylogenetic classification of bacteria 
1267 


phylogenetic trees (tree of life) 218-220, 
221F 

construction 10 
genomic analysis and 14 
primary branches 14-15, 220 
phytochromes 883-885 
PI 3-kinase/Akt/mTOR signaling pathway 
1114-1115 

PI 3-kinase-Akt signaling pathway 
860-861, 1030F 
TOR and 861, 1017
see also RTK/Ras/PI3K pathway
PI 3-kinase (phosphoinositide 3-kinase) 
and cell survival 860-861 
in chemotaxis 958-959 
classes Ia and Ib 859
as a growth factor 1017 
signaling protein recruitment 574, 852 
pili 1267, 1277
pilin gene/protein 1290 
pinch 1079 
pineal gland 877 
Pink1 protein kinase 727
pinocytosis 730-732 

macropinocytosis 725, 732, 733F, 738 
PIPs (phosphatidylinositol phosphates) 
700, 701F, 703 
PI(3)P 701F, 707, 737 
PI(3,4)P2 701F

PI(3,4,5)P3 701F, 739, 740F, 859-860,
958, 1017F, 1115, 1117F 
PI(4)P 701F 

PI(4,5)P2 701-702, 733, 739,

836-837, 859-860, 959, 

1017F, 1078F 
PI(5)P 701F 

piRNAs (Piwi-interacting RNAs) 305T, 

429, 433 

pituitary glands 1196 
Pitx1 gene expression 1174-1175
PKA (protein kinase A) see cyclic-AMP- 
dependent protein kinase 
PKB (protein kinase B) see Akt 
PKC see protein kinase C 
plague 1276, 1281 
plakins 948-949, 959 
plakoglobin (γ-catenin) 1037T, 1042,
1046F 

plakophilin 1037T, 1046F 
planar cell polarity 1189-1190 
planar polarity pathway 869 
planarian worms 1247-1249 
plant growth regulators 881, 1087 
plants 

cell growth and cell wall orientation 
1085-1087 

cell size and ploidy 1194 
cell walls 26, 1000, 1053, 1081-1087 
cytokinesis in 1000-1001 
energy storage as starch 80-81 
flowering times 1182-1184 
intracellular signaling pathways 880-885

model organism 32 
and nitrogen-fixing bacteria 12 
peroxisomes in 667, 668F 
receptor serine/threonine kinases in 
881 

regeneration in culture 442
RNAi and viruses 431 
transgenic plants 507-508 
plasma, distinguished from serum 1011
plasma cells, effector B cells as 1309 
plasma membrane repair 748 
plasma membranes 

Ca2+ gradient/pumps 607, 838
carbohydrate layers protecting 582 
cell polarization and 748-750 
clathrin-coated pits 553F 
composition in eukaryotes 572 
depolarization by action potentials 
607, 621-624, 627-629, 632, 
634-636 

depolarization by neurotransmitters 
629, 631, 633 
endocytosis 730-741 
endosome recycling 737-738 
hyperpolarization 844-845 
pinocytic vesicle formation 731 
potential enlargement by secretory 
vesicles 746-748 
recruitment of intracellular signaling 
proteins 700 
red blood cell 565F 
soluble and membrane-bound 
immunoglobulins 1317 
synaptic vesicles from 746 
universality of 8-9 
virus fusion 1280 
plasmalogens 667 
plasmid vectors 468-469, 508F 
plasmids 
origins 18 
segregation 897
tumor viruses 1130-1131
virulence plasmids 1268 
plasmodesmata 1053-1054, 1082F 
Plasmodium (P. falciparum) 610, 801F,
804, 805F, 1272-1273, 1276 
life cycle 1272F 
plastids see chloroplasts 
plastocyanin 789, 792-793, 794F 
plastoquinol 791 

plastoquinones 767F, 789, 791-792F, 793 
platelets 1002, 1011, 1054-1055, 1076T, 
1077-1078, 1241T 
derivation from megakaryocytes
1002, 1239

PLCs see phospholipase C 
plectin 933, 948-949, 1076 
Plk (Polo-like kinase) 978-979, 985 
ploidy, and cell size 1194 
pluripotency 

blastula stage 1148 
fertilized egg 1253 
neoblasts 1248 
see also stem cells

plus-end-depolymerization, kinetochores 
991 

plus-end-directed transport 938 
plus-end-polymerization, kinetochores 
987-988 

+TIPs (plus-end tracking proteins) 933, 
935, 958, 960 
podosomes 952 
point mutations 218, 487 

cancer genomes 1111, 1116 
immunoglobulins 1322 
Ras oncogenes 1106 
RNA splicing errors and 324 
tumor suppressor genes 1109 
point spread functions 540, 551-552, 
553F 


polar amino acids 109-110 
polar covalent bonds 56 
polar ejection force 991-992
polarity 

actin filaments 892F, 894, 896, 898, 
899F 

spindle bipolarity 986-987 
polarization of the embryo 1155-1157 
polarized light 458, 459F 
Polδ and Polε 246

poliomyelitis 1266F, 1273T, 1275, 1281 
poliovirus 1266F, 1273T, 1274, 1281, 
1288-1289 

polyacrylamide-gel electrophoresis 
for DNA 465, 466F 
protein phosphorylation 879F 
SDS-PAGE 452, 453F, 454 
polyacrylamide-gel electrophoresis (PAGE) 
372F, 454, 455F, 466F, 879F 
see also SDS-PAGE
polyadenylation 

effects on resultant protein 417-418
of mRNA 3' ends 315-316, 324-326, 
327F 

poly-A shortening 426-427 
polycistronic transcripts 348, 806-807 
polycomb chromosomes 208-211 
polycomb group proteins 1164, 1165F, 
1183 

polycomb protein group (PcG) 206, 211 
polyethylene glycol 444F 
polyisoprenoids 99 
polymers see macromolecules 
polymorphisms 

MHC genes/proteins 1328,

1330-1331 

SNPs 232-233, 492, 493F 
types 492 

polymorphonuclear leukocytes see 
neutrophils 
polypeptides 

amino acid addition 339-344 
disordered 125 

loops as binding sites 118F, 121, 138
mode of ER membrane transport 
675-677 

multiple chains in large proteins 123 
number of possible variations 
118-119 

proteins as 6, 109 
side chains of 109-110 
synthesis outside ribosomes 365 
polyploidy, megakaryocyte nucleus 1242
polyprotein precursors 744 
polyps, adenomatous 1123 
polyribosomes 349, 675 
polysaccharides 97 
glycogen as 79 
Golgi apparatus synthesis 711 
lysozyme action on 144 
pectins 1083-1084 
synthesis by condensation reactions 
71F 

see also cellulose

polytene chromosomes 208-211, 391F,
540F 

polyubiquitin chains 157-158 
polyubiquitylation 
caspases 1029 

distinguished from monoubiquitylation 
735 


population size 

and neutral mutations 231 
and response smoothing 829 
porins 

aquaporins 580F, 599, 612-613 
in bacteria and mitochondria 662, 758 
barrel formation 580F, 581 
distinguished from channels 611 
porphyrin rings 766, 787 
Porphyromonas gingivalis 1266 
position effect variegation 195-197, 
201-202, 205 
position effects 

embryonic transcription regulators 
393 

and gene silencing 194 
positional labels 1161, 1206
positional values 1163, 1169, 1205
positive feedback 

cell memory and 520 
CICR (Ca2+-induced calcium release)
838 

cooperative binding and 517, 519F 
heterochromatin formation 195 
iPS conversion 1255 
in M-Cdk activation 979 
pattern generation 1152
self-amplification of cell differences
1152 

self-amplification of nerve impulses 
621 

vesicle recruitment to membranes 
707 

positive feedback loops 

cell memory and 401-402, 412, 413F, 
432 

helper T cells 1337 
M-Cdk in 1004 

switches and bistability 517, 518-520, 
827-829 

positive regulation of enzymes 151, 152F
positive selection 1332
post-transcriptional controls 413-428
post-translational and co-translational 
processes 670, 677 
post-translational import 

endoplasmic reticulum 677, 683, 685
mitochondria and chloroplasts 659, 
664, 670 

post-translational modifications 
covalent modification 165-166 
of GFP 543

mapping using tandem mass 
spectrometry 457 
multisite 166 

post-translational translocation 677, 678F 
postcapillary venules 1311-1312 
potassium see K+
poxviruses 1274, 1280, 1289 
PPARs (peroxisome proliferation-activated 
receptors) 875 

pre-mRNA modification and RNA splicing 
316F, 317-321, 323F
pre-synaptic terminals 747F
precursor oligosaccharides 683-684 
premature aging 265

RNA splicing errors and 324 
Xpd knockout mice 497F 
prenyl chains 572, 577F, 578 
preprophase 999, 1001F 
prereplicative complexes (preRCs) 

259-260, 974-975, 976F, 1002 
presenilin 868

presomitic mesoderm 1177-1178 
prickle cells 1226F 
primary axes, embryo polarization 
1155-1157 

primary cell walls 1082-1085 
primary cilia 824F, 845F, 873, 942-943, 
949, 950F 

primary ciliary dyskinesia 942 
primary cultures 441 
primary pathogens 1265, 1276
primary tumors 

derived from one abnormal cell 
1093-1094 

genomes of metastases and 1119
primer strand, DNA 240-241F, 243, 255,
262 

primers, RNA 245, 247, 249, 253 
prion diseases 130-131, 132F
probability 

statistical methods 524-525 
thermodynamics and 52, 60, 102-103 
procollagen 704, 705F, 720 
professional antigen-presenting cells 
(APCs) 1307, 1328 
professional phagocytic cells 

macrophages and neutrophils as 739, 
1298 

PRRs on 1298
profilins 905-907, 909F 
progenitor cell commitment 1243 
proinsulin 130 
prokaryotes 

as bacteria and archaea 14-15 
circular DNA in 23F 
distinction from eukaryotes 12-13 
diversity 12-14 
horizontal gene transfer 18-19 
mRNA compared to eukaryotic 316F 
prolactin 864T 
proline 

in collagens 1062 
in elastin 1065 
hydroxyproline 1062F, 1084 
structure 113 

prometaphase 978, 980, 986, 990-991, 
992F, 994F 

promoter complexes 510-512 
promoters 

in CG islands 406-407 
fraction bound by activators and 
repressors 511-512, 517F 
gene clusters from 380 
in gene control régions 384 
orientation and DNA inversion 294 
protein concentrations and 513 
repressors and activators 381-383, 
386 

in transcription 306, 308-309, 388 
proneural genes/cells 1172 
proofreading 

aminoacyl-tRNAs 339, 344-345 
kinetic proofreading 345 
proofreading, DNA replication 242-244,
250-251, 257F

mismatch proofreading 250-251, 
257F 

translesion polymerases 273 
see also error correction; quality 
control 

proopiomelanocortin 744F 


prophase 

formation of sister chromatids 964 
M-Cdk activity 978, 985-986 
in meiosis 1006-1007 
in mitosis 978-980 
preprophase 999, 1001F 
spindle assembly 985 
prostaglandins 837 
prostate cancer 1117, 1128-1129F 
proteases, serine, and matrix 

metalloproteases 1072
proteasomes

function 357-359, 685 
ubiquitylation and 157, 358F
protein activity control 

gene expression 372, 373F 
hyperactivity in cancer 1106-1107 
protein arrays 458 
protein assemblies 

cooperative allosteric transition 
152-153 

as molecular machines 164 
tetranucleosomes 192 
for transcription initiation 313 
protein-coding genes 120, 122-123 
protein complexes 

aggregation to achieve retention 714
cellulose synthase rosettes 
1085-1086 

intracellular signaling complexes 822, 
823F, 851F, 859
membrane proteins 588-589 
see also Arp2/3 complex 
protein concentrations 513-514 
protein degradation

effects on mean lifetimes 514 
N-terminus acetylation and 360-361
phosphorylation regulating 360 
protein domains 

ABC transporters 609F 
domain shuffling 121-122 
and genetic recombination 318 
in immunoglobulins 230 
as modules 117-118, 121-122
as secondary structure 117-119 
SH2 example 115
transmembrane proteins 579-580 
Type III fibronectin repeats 1066F, 
1067-1068, 1073 
protein families 

evolution of kinases 154-155
examples 119 

ligand binding sites 136-137 
mitochondrial carrier family 779-780 
trimeric GTP-binding proteins 846T 
protein folding 

co-translational 353-354, 355F 
as conserved 460 
energetics of 114-115, 354
formation of binding sites 6, 135F 
molecular chaperones and 114 
multipass transmembrane proteins 
581F 

NMR spectroscopy and 462
noncovalent bonds in 50, 110-114 
oligosaccharide tagging 685 
see also α-helices; β-sheets; protein
misfolding; protein unfolding 
protein function 

binding ability and 134-169 


FRET monitoring of dynamics 
543-546 

inferring from interaction maps 168 
investigation using site-specific 
recombination 294-295 
phosphorylation in regulating 153-156 
proteins structure and 462-463, 483 
proteins with unknown function 123 
protein kinase A (PKA) see cyclic-AMP- 
dependent protein kinase 
protein kinase B (PKB) see Akt 
protein kinase C (PKC) 

as calcium dependent 837, 852
plasma membrane binding 574 
protein kinases 

cyclic-AMP-dependent (PKA) 827, 
834-837, 841, 843, 845T, 848 
as enzyme-coupled receptors 819 
evolutionary tree in eukaryotes 
154-155 

illustrating feedback mechanisms 
829F 

initiating autophagy 726 
regulation of molecular switches 819
and replication origins 260
as serine/threonine or tyrosine kinases 
819 

see also cyclin-dependent kinases; 
MAP kinases; serine kinases; 
tyrosine kinases 
protein machines 

ATP synthase as 754, 776-778 
coordination in 164 
recombination complexes 1006 
replication machine 249-250
V-type pumps as 608 
protein misfolding 

BiP chaperones and 683, 712 
ER export and degradation 685-686,
712 

prion diseases 130-131 
retrotranslocation 358, 686 
protein-only inheritance 131 
protein phosphatases and molecular 
switches 819 

protein phosphorylation, enzymes 
154-156 

protein-protein interactions 
aquaporins 580F 
biochemical and optical methods 
457-459 

cytoskeletal filaments 894 
identifying interacting proteins 
457-458 

interaction mapping 166-168 
interface types 137 
involving transcription regulators 385 
mediated by interaction domains 852 
regulation by 157
transmembrane proteins 580, 590 
use of FRET 459 
see also protein subunits 
protein sequences, interspecies 
comparisons 36, 37F 
protein sorting 

pathways in the TGN 742 
in polarized epithelial cells 749F
see also signal sequences; sorting 
signals
protein structure 

coiled coils 116-117 
immunoglobulin fold 121
models and representations of 115F,
460, 461F 

primary, secondary, tertiary and 
quaternary 117, 841
and protein function 462-463, 483 
size limits of analytical techniques 
462 

specified by amino acid sequence 
109-114 

three-dimensional 462-463, 579, 586, 
587F, 607 

using NMR spectroscopy 461-462 
using x-ray diffraction 460, 461F 
see also α-helices; β-sheets
protein subunits 123, 127-128
actin and tubulin 893-894 
activating subunits in mitosis 971 
symmetrical assemblies 152-153 
trimeric GTP-binding proteins 
832-834 
protein synthesis 

by condensation reactions 71F 
cytosol as location of 641 
global regulation 423-424
inhibitors 351, 352T
on polyribosomes 349 
possible evolution 365-366
quality control and regulation
351-353, 357, 361F 
speed of 341 

as transcription and translation 299 
transfer RNA role 334-340
protein translocation 646 

in mitochondria and chloroplasts 
658-666 
roadmap 646 
three routes 678F 
protein translocators 580 

aqueous channels in 675-677 
in the endoplasmic reticulum 677F
lateral gating 678-679
in mitochondria 659, 800, 802F 
see also phospholipid translocators
protein tyrosine kinases 1080 
see also Src protein kinases
protein tyrosine phosphatases 864-865 
protein unfolding and AFM 549F 
proteins 

abundance 785 
accumulation delays 1176 
analytical methods 452-462 
availability in quantity 483-484 
conditionally short-lived 359 
conformation and chemistry 135-136 
crystallizing 561 
encoded by genes 7, 178 
fluorescent tagging in living cells 
542-546 
functions 5-6 

generating movement 160-163 

identifying 455-458 

multiplicity of functions 48-49, 109 

number in a eukaryotic cell 641 

numbers in human cells 168 

in plant primary cell walls 1084 

purifying 445-451 

role as catalysts 5-6

transcription regulators as 373 

see also enzymes; membrane proteins


α-proteobacteria 798F
proteoglycans 582 

aggrecan 1058F, 1060, 1061F 
decorin 1058F, 1060 
in the extracellular matrix 1057-1061, 
1073-1074 

GAGs occurring as 1057, 1059-1061 
Golgi apparatus assembly 718-719, 
1059 

as matrix receptors 1074 
perlecan 1058F, 1070, 1071F 
proteolysis 

the caspase cascade 1022-1023 
in cell cycle control 970, 972F 
enzyme regulation through 150
in insulin and collagen assembly 130 
isolating cells from tissues 440
proteolytic cascades 1022-1023, 1025, 
1302 

see also caspase cascades
proteomics, defined 167 
proto-oncogenes 

conversion into oncogenes 
1106-1107 

gain-of-function mutations in 1104 
protocadherins 1038, 1207 
protofilaments 
actin 899F 

cytoskeletal 894, 895F 
intermediate filaments 945 
microtubules 894, 926, 928-929, 

930F, 934-935, 937 
protofilament curvature 928-929, 934 
proton gradients (H+ gradients)

active transport in bacteria, yeasts 
and plants 586 
bacteriorhodopsin and 587 
electron-transport chain and 86F 
mitochondrial membrane 103, 662,
727 

thylakoid membrane 791, 793-794
use by ATP synthases 586, 606, 779, 
793 

proton-motive force 754, 762F, 767F, 768, 
803 

chemiosmosis predating 780-781 
in mitochondria and chloroplasts 794, 
795F 

proton pumps (H+ pumps)

ATP synthases reversibility as 778 
bacteriorhodopsin as 586-588 
cytochrome c oxidase 770-773 
cytochrome c reductase 768-770 
of the electron-transport chain 
763-774, 767, 774F 
endosomes 736 
in the first living cells 795 
lysosomes 723 
mitochondria 754, 762 
NADH dehydrogenase complex 768 
thylakoid membrane 783-784 
proton-translocating proteins 773 
“proton wires” 773 
protons 

behavior in water 45-46, 613, 773 
membrane transport 164 
protozoa 

as eukaryotes 24, 25F, 30 
pathogenic 1272-1273, 1282
variety of 31F 

protrusions see cell-surface protrusions 


PRRs (pattern recognition receptors)
classes 1299-1300 
dendritic cell expression 1305 
inflammatory response 1300 
recognition of PAMPs 1298, 1326
recognition of viruses 1303-1304
see also Toll; Toll-like receptors
pseudogenes 

globin family 230 
in the human genome 184T 
loss-of-function mutations 229 
and purifying selection 221
Pseudomonas aeruginosa 1268 
pseudopods 739, 740F
pseudosymmetry 603, 604F 
pseudouridine 328, 329F, 335F 
PSP (postsynaptic potential) 633-634, 
635F, 636 

PTB (phosphotyrosine-binding) domains 
822, 824F, 852-853 
PTEN phosphatases 859, 1115, 1117 
puffer fish (Fugu rubripes) 29, 223
pulsed-field gel electrophoresis 466 
pulses of gene activation 522
Puma protein 1028 

pump proteins see membrane transport 
proteins 

“purifying” DNA 473 
purifying selection 219-220, 221F, 223,
225, 231 

purines 

in complementary base-pairing 176, 
177F 

structures of adenine and guanine 
100 

purines, hydrolysis see depurination 

Purkinje cells 547F, 650 

puromycin 351, 352T

purple bacteria 794F, 797, 798F 

“purple membrane” 586, 587F, 590, 591F 

pus 1301 

pyrene fluorescent probe 900F 
pyrimidines 

in complementary base-pairing 176, 
177F 

structures of thymine, cytosine and 
uracil 100 

pyrophosphate hydrolysis in biosynthesis 
71-72 

pyruvate 

anaerobic breakdown 76F
conversion to acetyl CoA 75, 81-82 
oxidation in the citric acid cycle 
82-84, 758, 759F 
produced in glycolysis 75, 105 
as a substrate of several enzymes 87 
pyruvate carboxylase 70F 
pyruvate dehydrogenase complex 82, 
107, 149F 

pyruvate kinase 105 

Q 

Q cycle 770, 771F, 791, 792F
quality control 

aspect of apoptosis 1022 
protein exit from the ER 711-712 
RNA splicing 323-324, 336 
RNA transport from the nucleus
419-421 

translation 351-353, 357 
quantitative analysis 38
quantitative approaches 

importance for biology 509 
transcription activators 510-514 
transcription repressors 514 
quantitative measurements 
of gene expression 502-503 
quantitative RT-PCR 502-503
quantum dots 538 
quantum mechanics 532 
quaternary structures 117, 841, 842F
quinones 

as electron carriers 764-766, 767F,
768, 788 

plastoquinones 767F, 789, 791-792F, 
793 

ubiquinone 765-766, 767F, 768-770, 
771F, 772-773 
quorum sensing 813 

R 

R-spondin protein 1222 
Rab cascades 707, 708F, 721 
Rab effectors 706-708 
Rab family GTPases 278 

as Ras superfamily members 854T 
subcellular location 706T 
in vesicular transport 705-709 
Rab proteins, Legionella modification 739
rabies 1273T, 1274F, 1279 
Rac protein 

adherens junction assembly and 
1043F 

as Rho family members 854T, 858, 
956-957 

Rad51 protein 279, 281F, 282 
Rad52 protein 281 
radial glial cells 1200, 1201F 
radiation 

and cancer treatment 1132 
and carcinogenesis 1094, 1128F, 
1132-1133 

Schmidtea mediterranea response 
1248 

ultraviolet light 267-268, 269F, 1094 
x-rays and hematopoietic stem cells 
1242 

radioisotope labeling 452, 454, 466-467, 
1219 

radioresistant bacteria 483
Raf (MAP kinase kinase kinase) 856-857, 
882 

Ramachandran plots 111F 
Ran-GAP protein 653-654, 656 
Ran-GEF protein 653, 656 
Ran-GTPases 

as chromatin positional marker 656, 
986 

and nuclear pore complexes 653-654 
as Ras superfamily members 854T 
Rana pipiens 36F 
“random coils” 1065, 1066F
random mutagenesis 485 
random walks 59, 60F, 231, 652
Rap1 GTPase 1078F
rapid freezing 556, 561F 
rapidly inactivating K+ channels 635
Ras family GTPases 278, 854T 

Ras oncogenes 1106, 1118, 1123,
1125 


Ras-MAP kinase signaling pathway 
anticancer drugs targeting 1137F 
integrins and 1079 
Myc transcription and 1012 
Raf, Mek and Erk 856 
see also RTK/Ras/PI3K pathway 
Ras-PI3K pathway 1114 
Ras proteins 

activation mechanisms 855F 
as GTPases 157, 158F, 161 
signaling via MAP kinases 855-857 
three human types 854 
rate constants 

actin polymerization 900, 902 
association and dissociation 510F, 

511 

transcription rate constants 513 
Rb gene mutations 1108-1109 
Rb regulatory pathway 1113-1114 
Rb (retinoblastoma) protein family 
1012-1014 

papillomavirus inactivation 1132 
reaction centers, photochemical 783-784, 
788-790, 796 
evolution 793, 794F
reaction-diffusion systems 1152, 1153F
reaction rates 

diffusion-limited rates 143 
enzymatic acceleration 144
Km (half-maximal rate) 141, 143
and speed of molecular motions 
59-60, 143 

reader-writer complexes 

chromatin domains and 202 
nucleosome modifications 199-201,
406F 
reading frames 

and genome translation in silico 
477, 482 

Ig diversification and 1321 
translation and 334, 342, 347 
see also open reading frames 
Reaper protein 1029 
RecA protein 279-281 
receptor-activated Smads (R-Smads) 
865-866 

receptor down-regulation 830, 848, 853 
receptor editing 1313-1314, 1321 
receptor inactivation 848 
receptor-mediated endocytosis 709, 727, 
732-735, 849 

receptor sequestration 848
receptor serine/threonine kinases 
865-866, 881 

receptor tyrosine kinases (RTKs) 

dimerization on ligand binding 851 
extracellular signaling and 850T 
IGF1 action 850T, 852 
insulin receptor as 824F 
signaling overlaps with GPCRs 
861-862 

subfamilies 851F 
see also RTK/Ras/PI3K pathway 
receptors 

cell-surface and intracellular 816F 
in extracellular signaling 815-816, 
831F 

matrix receptors and co-receptors 
1074 

recessive mutations

cancer-critical genes 1005F, 1104 


complementation tests 490
loss-of-function, as typically 489 
Reclinomonas (R. americana) 801F, 805
recombinant DNA technology 
elements of 464, 484F
and reverse genetics 494 
recombination events 1291, 1292F
recombination mechanisms 287 
conservative site-specific 
recombination 292-295 
transposons 288-292 
recycling endosomes 696F, 706T, 730, 
737-738, 739F 
red blood cells 

asymmetry of lipid bilayers 573, 574F 
band 3 protein 605 
cytoskeleton 591, 592F, 912
life-span 1244 
plasma membrane 565F 
see also erythropoietin 
redox-driven pumps 601 
redox pairs 764-765 
redox potentials 

along the respiratory chain 767, 768F, 
879 

chlorophyll Ao 792 
defined 764 

as measure of electron affinity
763-764 

measurement and calculation of ΔG°
765 

in photosynthesis 790F 
reducing agents, in photosynthesis 796 
reduction, as an electron transfer 55-56
refractive indices 532 
refractory period, ion channels 622-623 
regions of attraction 519F, 520
regulated exocytosis 748 
regulated nuclear transport 419 
regulated proteolysis 396F, 867, 970 
regulated secretory pathway 

distinguished from constitutive
secretory pathway 741, 742F
signaling in 744 
regulatory DNA 

as conserved 217 
differences between animal species
1149, 1174-1175 
in the human genome 185 
as noncoding 7, 29 
regulatory genes 

multicellularity and 29-30 
in vertebrate evolution 227F
regulatory networks, mathematics of 
509-512 

regulatory RNA, miRNAs as 1180 
regulatory sites, allosteric enzymes 151 
regulatory T cells 1314, 1325-1326,
1327F, 1328, 1331-1332, 1333F,
1335F

in immunological self-tolerance 1336 
induced regulatory T cells 1336 
release factors 344, 348, 562F 
renaturation see hybridization 
“repair factories” 213-214 
repeated sequences 289 
see also tandem repeats 
replication bubbles 254F, 257-258
“replication factories” 1287
replication forks

asymmetry 240-244, 246 
bacterial 249F

eukaryotic 253-255, 257-259, 261 
failed, cell cycle response 1014 
nucleosome assembly behind
261-265 

repair of stalled or broken 277, 280, 
281F, 1134 
in S-phase 974-977 
replication machines 249-250, 259
replication origins

in bacteria 254-255 
in eukaryotes 186, 254-255, 258-260
in human cell division 257 
methylation 257
S-phase 974-975, 976F 
replicative cell senescence
avoidance by ES cells 1254 
cultured cells 442 
macrophages and 739 
and mitochondrial DNA 808 
p53 pathway and 1116 
stem cells and 1243 
telomere function and 264-265, 1016, 
1100 

reporter genes 393, 394F, 501-502, 884F 
reporter proteins 543 
repressor proteins see transcription 
repressors 

repulsive interactions, cell-cell junctions
1188, 1206 

rescue, microtubules 927-928, 932-935, 
986 

residual bodies 739 
resolution 

distinguished from detection 532
electron microscopy 554, 559, 560F,
562 

light microscopy 530-532 
superresolution techniques 549-551 
resolution limits 

light microscopy 532, 549 
and wavelength 529, 533F 
resonance energy transfer 787-788, 789F
see also FRET
respiratory burst 1301-1302 
respiratory chain 

and electron donation 764
malfunctions 808 
and oxidative phosphorylation 761 
protein biogenesis 802F, 804 
respiratory chain complexes 758, 762, 
764, 766, 779 

redox potentials 767, 768F 
supercomplex 772-773 
the three complexes 767 
three constituents 767 
response timing, intracellular signaling 
824 

resting membrane potentials 615, 617, 
629-630 

restriction factors 1297 
restriction nucleases 

in DNA cloning 468-469 
interphase chromosome structures 
209F 

in recombinant DNA technology 
464-466 

restriction points see Start 
Ret cadherin 1039F 
retina 

embryonic, tip cells and 1236 


ocular dominance columns 1212 
photoreceptive epithelium 1227 
RGCs (retinal ganglion cells) 
1204-1206 
retinal 147 

as (bacterio)rhodopsin chromophore 
587, 845 

in channelrhodopsin 623 
retinitis pigmentosa 324 
retinoblastomas 1013, 1107-1108, 1111 
see also Rb proteins
retinoids/retinoic acid 876, 877F, 1257F 
retinotopic maps 1205, 1206, 1207F, 

1211 

retrieval pathways 695-696, 713-714, 
726, 743 

retromers 727, 728F 
retrotranslocation, damaged proteins 
358, 686 

retroviral-like elements 218F
retroviral-like transposons 291 
retroviruses

and cancers 1130T 
defined 290 

as oncogene vectors 1105-1106 
RNA editing and 419 
as transdifferentiation vectors 1258 
see also HIV
Rev protein 420-421 
reverse genetics 494-495, 500 
reverse transcriptases 
AZT inhibition 1292 
in cDNA cloning 470 
quantitative RT-PCR 502-503 
in retrovirus infection 290, 1105, 1289 
telomerases as resembling 262 
reversible amyloids 132-133
reversible reactions 61

see also equilibrium reactions
RGCs (retinal ganglion cells) 1204-1206 
RGD sequence (Arg-Gly-Asp) 1067-1068, 
1075, 1078, 1281 

RGSs (regulators of G protein signaling) 
832, 845

Rheb GTPase 854T, 861 
Rho family GTPases 

activation by extracellular signals 958
adherens junction assembly and 
1043F 

and bacterial entry to hosts 1281 
in cell polarization 955-959 
cytoskeleton and 858, 861 
neuron growth cones and 858
pseudopod shaping 739 
as Ras superfamily members 854T, 
858 

RhoA 

activation in cytokinesis 997, 998F, 
999 

localization in cytokinesis 1000F 
rhodamine 537, 545F, 756F, 936F 
rhodopsin 

as a GPCR 844-845 
retinal and 147, 845
signal amplification 848-849 
rhodopsins 

ER membrane insertion 681 
as GPCRs 588, 832, 845 
rhombomeres 1169, 1188
RIAM protein 1078F 
ribbon models, introduced 115 


ribonucleases, ribosome profiling 
505-506, 507F 

ribose 

methylation in mRNA 316F
methylation in rRNA 328
structure 96, 100, 302 
synthesis 86, 366 
ribosomal RNA (rRNA) 
abundance 327 
assembly 331F, 347 
codon-anticodon matches 345F 
evolution of genes for 220
four types 328 

interspecies comparisons 15-16 
“S” values 309T, 328 
transcription process 305F 
ribosome profiling technique 505-506, 
507F 

ribosomes 

among intracellular compartments 
642 

antibiotic binding sites 351F 
bacterial and eukaryotic compared 
341 

characteristic of rough ER 670 
free and membrane-bound 674-675 
large subunit 331F, 341-343,
346-347F, 349, 423, 673 
as macromolecular complexes 50F 
negative staining 560
with a release factor 562F 
response to antibiotics 800 
as ribozymes 346-347 
RNA binding sites 341-342, 347
role in translation 7, 340-343
self-assembly 128, 649
separation by centrifugation 445
small subunit 331F, 341-344, 345F, 
346, 347F, 348, 423 
structure 346F 
subunit assembly 329, 340 
TOR and S6K effects 1017 
riboswitches 414-415, 423F 
ribozymes 

as catalysts 51, 69
ribosomes as 346-347 
structure 363F 
in vitro selected 363-364 
ribulose, structure 96 
ribulose 1,5-bisphosphate 785, 786F 
ribulose bisphosphate carboxylase 
(Rubisco) 

in carbon fixation 785 
as example enzyme 48 
x-ray crystallography 461F 
Rickettsia rickettsii 1287, 1288-1289F 
Rickettsia spp. 798F, 805F, 1287-1288, 
1289F 

Rieske protein 770F 
Riftia pachyptila 12F
RISC (RNA-induced silencing complex) 
429-431, 432F

RITS (RNA-induced transcriptional 
silencing) complex 432 
RK (rhodopsin kinase) 845, 848 
RLRs (RIG-like receptors) 1300 
RNA catalysis 

and the RNA world hypothesis 
362-363 

in the spliceosome 321 
see also ribozymes
RNA cleavage 417-418
RNA editing 418-419, 806
RNA folding 302, 303F, 335F, 363 
RNA genes 

in the human genome 185 
for rRNA 327, 330 
in S. cerevisiae 305 
RNA helicases 1300 
RNA interference (RNAi)

as an experimental tool 433 
as a defense mechanism 431-432 
and heterochromatin formation 
432-433 
limitations 501 
small ncRNAs and 429-431 
testing gene function 499-501 
RNA ligases 688 
RNA localization 421-422 
RNA polymerase holoenzyme 306 
RNA polymerase I 327 
RNA polymerase II 

as an “RNA factory” 317F 
gene control regions and 384-386
general transcription factors and 
309-312, 384-385 
modifying proteins 312-313 
similarity with bacterial polymerase 
309, 310F

snoRNAs and 328-329 
RNA polymerase III 328 
RNA polymerases 303-305 

DNA polymerases compared 304-305 
in eukaryotes 309T 
paused polymerases 388 
RNA-dependent 431 
RNA primers 245, 247, 249, 253 
RNA processing 

in chloroplasts 806 
control of gene expression 372, 373F 
RNA-seq method 371F
RNA sequencing (RNA-seq) 
alternative splicing and 482 
deep RNA sequencing 477 
mRNA analysis 503-504 
RNA splicing 

and chromatin structure 323 
consensus nucleotide sequence for
319 

coordination with transcription 322 
“cryptic” splice sites/signals 321, 
322F, 323, 324F 
errors and disease 323-325 
as a function of snRNAs 305T 
intron sequence removal by 315-316, 
317-318, 320F, 336 
regulation 416
spliceosome role 319-320
see also alternative splicing 
RNA transport and localization 372, 373F 
RNA tumor viruses 1105 
“RNA world” 69, 362-366, 415 
RNAs

antisense 423F, 435 
categories and proportions of 305T, 
306 

conformations 5F 

distinctions from DNA 4-5, 302, 366 
double-stranded, as viral 
characteristic 1304 
rearrangements in the spliceosome 
321 


regulation of transport from the
nucleus 419-421

self-replicating potential 364, 365F 
as single-stranded 302, 363 
size of molecule 303
in telomerase 263T, 429, 435 
in transcription and translation 4-5 
see also long noncoding; messenger; 
ribosomal; small noncoding; 
snRNAs; transfer 

RNPs (ribonucleoproteins) see hnRNPs; 
mRNPs; snRNPs 

Robo1, Robo2 and Robo3 receptors
1203F, 1204 

robustness, in biological networks 520, 
822 

Rock kinase 958F, 997, 998F 
rod photoreceptors 844-846, 848, 943 
ROS (reactive oxygen species) 808 
see also superoxides 
rosettes, cellulose synthase 1085-1086 
rotary catalysis, ATP synthases 776-778 
rough ER, protein glycosylation 683-684 
roundworms see Caenorhabditis elegans 
Rous sarcoma virus 1105, 1265
RRE (Rev responsive element) 420 
RTK/Ras/PI3K pathway 1113 
RTKs see receptor tyrosine kinases 
Rubisco see ribulose bisphosphate 
carboxylase 

ruffles, in macropinocytosis 732, 733F, 
1280-1281, 1282F 
ruthenium red dye 582, 583F
ryanodine receptors 838-840 

S

S. cerevisiae see Saccharomyces 
S-Cdks 974, 979, 1014 
S-cyclins 969, 971, 993, 1013-1014
S4 helix 622
S phase, cell cycle 

centrosome duplication 984-985 
DNA replication in 258-260, 963,
964F, 974-977 
visualization 966 

“S” values (sedimentation coefficients)
rRNA 309T, 328 
ultracentrifugation 446, 455 
Saccharomyces cerevisiae 21T, 31F
cell cycle in 966 
centromeres 203 
DNA replication in 253-254, 259
gene density 182F 
genes essential to growth 499 
genes for voltage-gated ion channels 
627 

intercellular communication 813 
kinesins 936 

mitochondrial inheritance 807 
as model organism 29T, 966 
mutant libraries 498 
myosin V in 925 
RNA coding genes 305 
RNA rearrangements at the 
spliceosome 321F 
separated chromosomes 466F 
see also yeasts, budding 
salamanders 1194-1195 
Salmonella enterica 1268, 1281, 1282F, 
1284F, 1285, 1290 


Salmonella spp.

as Gram-negative 1267F 
use of phase variation 294 
SAM complex (sorting and assembly 
machinery) 659, 660F, 662 
Sanger sequencing see dideoxy 
sequencing 

Sar1 proteins 703, 704-705F
sarcomas 1092 
sarcomeres 918-920, 948
sarcoplasmic reticulum (SR)

Ca2+ pump 606-607, 608F, 632F, 633,
920 

in muscle cells 671, 920, 921F
satellite cells, skeletal muscle 1234 
scaffold proteins 126 
cullins as 160, 164 
Dishevelled as 870 
in intracellular signaling complexes 
822, 823F, 857 
membrane bending 594 
in protein machines and biochemical 
factories 164, 332F
septins as 949, 999 
in tight junctions 1049-1050 
scaffold RNA molecules 165, 435
scanning electron microscopy see SEM
SCAP (SREBP cleavage activation protein)
656F

SCF (stem cell factor)/Steel protein 1187, 
1244 

SCF ubiquitin ligase 159-160, 164, 

167-168, 971, 972F, 973, 1004 
schistosomes 1302F 
schizophrenia 494 
Schizosaccharomyces (S. pombe) 
microtubule +TIP proteins 935F 
mitochondrial genome 801F, 805F 
as a model organism 966, 1271 
mutant phenotype 21F 
Schmidtea mediterranea 1247-1249 
Schwann cells 625 
Sciara 987F 

SCN cells (suprachiasmatic nucleus) 877
“scramblases” 574, 690 
SDS-PAGE (sodium dodecyl 

sulfate-polyacrylamide-gel 
electrophoresis) 452, 453F, 454F, 
584 

SDS (sodium dodecyl sulfate) 583, 584F 
sea urchin embryo 403F, 524 
sealing strands 1047-1048, 1049F, 1050 
Sec61 complex 676-678 
Sec pathway 665F 
Sec23/Sec24 and Sec13/31 proteins
705F 

SecA ATPase 677, 678F 
second-generation sequencing methods 
479-480 

second law of thermodynamics 52, 53F, 
60, 102-103 

second messengers 819-820, 824, 827, 
833 

in intracellular signal amplification 848 
IP3 and diacylglycerol as 837, 838F
secondary cell walls 1082-1083, 
1085-1086 

secretion systems, bacterial 1271, 1278,
1281, 1282F, 1285-1286 
secretory and endocytic pathways 
constitutive and regulated secretory 
pathways 741, 742F 
introduced 695, 696F
secretory granules 132 
secretory proteins 

active precursors 743-744 
aggregation in TGN 742-743 
secretory vesicles 

budding from TGN 716, 742
endocytosis 744F 
exocytosis 741-744 
localization 744 

maturation and acidification 743 
membrane removal by endocytosis 
746-748 

phosphoinositides in 701F 
Rab3a in 706T 
synaptic vesicles as 745 
securin 971, 973, 978, 992-994, 1009
sedimentation velocity and equilibrium
446-447 

segment-polarity genes 1159-1162 
segmental duplications, double strand
breaks 228 

segmentation clock 1178-1179 
segmentation genes 

insect bodies 1159, 1160F
regulatory hierarchy 1161F 
in vertebrates 1177 
selectins 720 

cell-cell adhesion 1054-1055 
E-, L- and P-selectin 1054-1055 
homing receptors 1312 
structure and function 1055F 
selective advantage see natural selection
selectivity filters, ion channels 613, 618, 
619F, 622F 
selenocysteine 350 
self-amplification 

caspase cascade in apoptosis 1023 
complement cascade 1303
of embryonic asymmetry 1152 
of nerve impulses 621, 623 
of Rab domains 707 
self-assembly 

in cells 9, 128-129 
glycolipids 575 
lipid bilayers 566 

self-association, cytoskeletal filaments 
893, 897 

self-organization, embryonic development 
1145 

self-phosphorylation, P-type pumps 607 
self-renewing tissues 1120, 1217
self-sealing, lipid bilayers 568, 569F 
self-tolerance see immunological 
self-tolerance 

“selfish DNA” see mobile genetic elements
SEM (scanning electron microscopy)
chick embryo neural tube 1192F 
frog neuromuscular junction 630F 
fungal infection on Drosophila 1299F 
NK cell attacking a cancer cell 1304F 
olfactory neuron cilia 844F 
osteoclast on bone matrix 1231F 
pericytes 1236F 
principles 558-561 
semiconservative replication
of centrosomes 984 
of DNA 240, 242F, 447 
Semliki forest virus 1275F
senescence see replicative cell
senescence 


sensitive period, visual system 1212
sensitivity of target cells 824 
sensory mother cells 1172, 1173F 
sensory neurons 1172, 1208-1209
separases 993, 1009 
septins 949-950, 959, 999 
septum, Z-ring and 896 
sequence logos 308, 375, 378F 
sequential induction 1153-1154, 1160 
serine 

phosphatidyl- 566-567, 571T, 574
phosphorylation during transcription 
388F 

phosphorylation in RNA polymerase 
tails 312, 316-317
phosphorylation in the nucleosome
196-197 
structure 113 

serine kinases in protein kinase evolution
155F 

serine proteases 1072 

Asp-His-Ser catalytic triad in 136F 
complement system 1302
domain shuffling in 121F, 123 
elastase and chymotrypsin compared 
119F 

as a protein family 119, 121F
serine/threonine kinases 
AKT as 860 

distinguished from tyrosine kinases 
819-820 

PKA as 834-835 
plant photoproteins 884 
serotonin 

as an excitatory neurotransmitter 629 
cyclic AMP response 834F 
GPCRs activated by 832 
serotonin secretion 1218, 1219F, 1239,
1241T 

serum, distinguished from plasma 1011
seven-pass transmembrane proteins 
Frizzled as 870 
Smoothened as 872 
see also G-protein coupled receptors
Sevenless (Sev) RTK 855 
sex chromosomes 

in different organisms 486 
dosage compensation and 410 
as nonhomologous 180 
X-chromosome sample sequence 
300F 

X-inactivation 410-411, 412F, 1252
sex hormones 99, 875-876 
sexual reproduction 

haploid-diploid cycle 486 
homologous recombination and 277, 
282 

and horizontal gene transfer 19-20 
see also meiosis

shadowing (electron microscopy) 560
SH2 (Src homology 2) domains 

as an interaction domain 822, 824F 
in cytoplasmic tyrosine kinases 862 
evolutionary tracing 136-137 
multicellularity and 122 
phosphorylation and 154, 156
phosphotyrosine binding via 852-855, 
859, 863, 864F 
positioning role 135
structure 115-116, 118F 
versatility 121-122 


SH3 (Src homology 3) domains 

as an interaction domain 822, 824F 
binding to proline-rich domains 853 
inhibitory action 118F, 122, 156, 157F 
tyrosine kinases and 854-855, 863, 
864F 

in ZO scaffold proteins 1050F 
shelterins 263 

shibire mutant, Drosophila 702F 
Shigella flexneri 1268, 1284F, 1287, 
1288-1289F 

Shine-Dalgarno sequence 348, 349F,
422, 423F

shotgun sequencing 479 

shugoshin 1009 

shuttle System, NADH 760 

shuttling proteins 654 

sialic acids 575, 717, 1056, 1279, 1303 

Sic1 protein 973T, 1003-1004

sigma (σ) factors 306, 307-308F,
309-310, 384

sigmoidal relationships 517 
sigmoidal responses 827-828, 841 
signal hypothesis 672, 673F 
signal patches 647, 728, 729F 
signal peptidases 647, 664, 673, 677 
signal processing, intracellular 825
signal-processing proteins 

Src kinase example 155-156, 157F 
types of response to signal 
concentrations 827-831 
signal-recognition particles see SRPs
signal sequences 

amino acid sequences 647, 648T 
discovery 672 

internal 659, 663F, 664, 677-679,
680-681F 

within sorting signals 647
for successful gene transfer 801 
translocation into chloroplasts and 
664-666 

translocation into endoplasmic 
reticulum 672-675, 677
translocation into mitochondria and 
659, 661F, 663

translocation into peroxisomes and 
667-668 

signal transduction, cell-surface receptors 
818 

signal variability 

among cell populations 829 
as noise 822 

signaling centers 1153, 1160, 1167-1168, 
1178 

signaling hubs, Ras and Rho as 854 
signaling mechanisms 
compared 158F 
in nerve cells 621 
see also cell signaling
signaling molecules

response and turnover 825-826 
see also hormones
signaling pathways 

discovery in Drosophila embryos 
1154 

Hedgehog pathway 1150, 1154, 1160 
JAK-STAT signaling pathway 
863-864, 1304 

NFκB signaling pathway 873-874,
1301 

PI 3-kinase-Akt signaling pathway 
860-861, 1017, 1030F 
TGFβ signaling pathway 1117
Wnt signaling pathway 1124, 1157, 
1160, 1179, 1190, 1220-1221 
see also intracellular signaling; Notch; 
Ras-MAP kinase; RTK/Ras/PI3K 
pathway 
signaling proteins 

inducing differentiation in culture 1169
lysosomal degradation 735
membrane localization 577-579 
vertebrate embryo patterning 1168 
see also intracellular signaling proteins 
“signatures” of transposition 289 
silent mutations 238 
SIM (structured illumination microscopy) 
550 

simulations 524, 1178 
SINEs (short interspersed nuclear 
elements) 218F, 291
see also Alu sequences 
single-molecule localization methods 
551-553 

single-particle reconstruction/tracking 
559F, 561-562, 586, 589, 593, 
773F 

single-pass transmembrane proteins 
CD4 and CD8 co-receptors 1331 
enzyme-coupled receptors as 819 
Golgi apparatus 716 
Notch and Delta as 867-868 
structure 579, 582F, 677, 679F, 716 
single-strand binding (SSB) proteins 246, 
247-248F, 253, 256F 
single-strand breaks, mismatch repair 
250 

single-stranded DNAs as probes 472 
siRNAs (small interfering RNAs) 305T, 

429, 431 

sister-chromatid cohesion 964, 977, 992,
994, 1009 

sister-chromatid resolution 979, 982 
sister chromatids 

bi-orientation 988-989 
formation in interphase 185 
formation in prophase 964 
in homologous recombination 275 
kinetochore attachment 987 
at metaphase 214-215 
nonhomologous end joining and 275F, 
278 

separation 992, 993F, 994
site-directed mutagenesis 543 
site-specific recombination 
conservative site-specific 
recombination 292-295 
TCR assembly 1325 
transgenic mice 496 
SIV (simian immunodeficiency virus) 

1291F 

S6K (S6 kinase) 1017 
skeletal muscle 

cells as syncytia 1233 
genesis and regeneration of
1232-1235 
myosin II in 915-916 
organization of 917-919 
from reprogrammed fibroblasts 
396-397, 398F 
see also muscle contraction 
skin see epidermal 

Skp1 protein (adaptor protein 2) 167-168


sliding clamps 247, 248-249F, 262, 274F 

Slit protein 1203 

Slug protein 1042 

Smac/Diablo protein 1029 

Smad family proteins 865-866, 1126F 

Smad pathway 1197 

small intestine 

fastest self-renewal 1217, 1218F
stem cell location 1219 
tight junctions 1037, 1049F
small molecules

in cancer treatment 1135-1137 
coenzymes 146-148 
diffusion 59, 598 

inhibitors of protein function 459-460 
intracellular signaling by 874-876 
major families 47, 48F 
passive diffusion into nuclei 650 
second messengers as 819 
shedding by stressed bacteria 620 
transport by ABC transporters 609F 
see also neurotransmitters 
small noncoding RNAs 429-431 
bacterial defense against viruses 
433-434 

miRNAs (microRNAs) 305T, 429-431, 
1149 

piRNAs (Piwi-interacting RNAs) 305T, 
429, 433

regulatory rôle of three types 429 
and RNA interference 429-431
siRNAs (small interfering RNAs) 305T, 
429, 431 

smallpox (variola) 1265-1266, 1273T, 
1275, 1278, 1287F 

SMC proteins (Structural Maintenance of 
Chromosomes) 977, 982 
smell (olfactory receptors) 824, 832, 
843-846, 1250F 
smoking see tobacco 
smooth muscle 916-918, 921-923, 948 
smooth muscle cells 1232, 1233F, 
1235-1236, 1238 
Smoothened protein 871-873 
Smurf ubiquitin ligase 866 
Snail protein 1042 
snake venom 1068
SNARE complexes 709, 745 
SNARE proteins 705, 707 
in autophagy 726 

in membrane fusion 708-709, 1280 
subunit anchoring to ER 682 
t-SNAREs and v-SNAREs 708-709, 
712, 714, 745F, 747F, 751 
snoRNAs (small nucleolar RNAs) 305T, 
328 

SNPs (single-nucleotide polymorphisms) 
disease predisposition and 493F
in population studies 232-233, 492 
snRNAs (small nuclear RNAs) 
function 305T 

snoRNAs (small nucleolar RNAs) 

305T, 328 

as spliceosome components 319-320 
snRNPs (small nuclear ribonucleoproteins) 
and Cajal bodies 331-332 
and the nucleolus 330 
as spliceosome components 

319-322, 323F, 325-326, 329F 
sodium see Na+

sodium dodecyl sulfate (SDS) 583, 584F 


see also SDS-PAGE 

Sog gene (Short gastrulation) 1165-1166, 
1169 
somatic cells 

embryonic development 1158 
mutation frequency 238-239 
regeneration from 1247-1248, 1249F
self-sacrifice of 1091 
somatic hypermutations 1321-1323, 
1325, 1335-1336, 1338F 
somatic mutations 

in cancer cells 1094, 1104, 1112
see also driver; passenger mutations 
somatosensory system 1204-1205
somatostatin 835-836 
somites 1177-1179, 1185, 1187, 1189F, 
1233 

Sonic hedgehog (Shh) protein 1191, 

1199, 1200F, 1202 
Sordaria 1007F
sorting receptors 647-648 
sorting signals

basolateral proteins 750 
nuclear localization signals 651
and protein movements between 
compartments 645-647 
retrieval pathway to ER 713-714 
secretory proteins 743 
see also signal sequences 
Sos protein (Son-of-sevenless) 824F, 855, 
860 

Sox2 transcription regulators 398-399F, 
506F, 1254-1255 
see also OSKM factors 
spatial patterning 

in embryonic development 
1150-1154 

lateral inhibition and 1171-1173
mechanisms 1155-1175 
polarization as first step 1155 
transient patterns 1160-1162 
vertebrate embryos 1167-1169, 
1177-1178 

special pairs, chlorophyll 788-790,

791-792F, 793, 794F 
speciation

conserved DNA sequences and 226 
transposed DNA sequences and 292 
species 

antiviral defenses 432 
characteristic transposon types 292 
cross-species transmission 1279, 
1291 

different sex chromosomes in 486 
exporting the CRISPR system to
497-498 

extinct, genomes 479 
gene number and complexity 
415-416 

interchangeability of homologous 
proteins 1146 

mitochondrial genomes compared 
801F, 805F

number living today 2 
protein sequences compared 36, 37F 
regulatory DNA as distinguishing 
1149, 1174-1175 
size differences 1010, 1193-1197
tissue regenerative abilities 1247 
variation of exon and intron lengths 
322F
specimen preparation

electron microscopy 555-556, 559
light microscopy 535-536 
speckles, nuclear (interchromatin granule 
clusters) 213, 331-333
speckles, tubulin 991F 
spectinomycin 351F 
spectral karyotyping 181F 
spectrin 591, 592F, 905, 911F, 912-913
spermatozoa 

as centrosome source 987 
elimination of mitochondria from 807
localization of mitochondria 755, 756F
membrane domains 590, 591F
see also fertilization 
sphingolipids 567-568, 573, 575 
sphingomyelin 567, 571T, 572F, 574-575, 
690 

sphingosine 690 
spin labels 569-570 
spinal cord 

development 1170F, 1186, 

1199-1200, 1202, 1203F, 1209F 
injury 1251 

spinal muscular atrophy 324 
spindle see central spindle; mitotic 
spindles 

spindle assembly checkpoints 993-994 
spindle pole body 557F, 930
Spirulina platensis 778F 
spleen 

antigen removal 1311 
cells, transplanted in mice 1193 
erythrocyte removal 1244
hematopoietic stem cells in 1244
lymphocyte accumulation 1031, 1055
lymphocyte activation 1308-1309, 
1311 

spliceosomes 

catalysis 321, 324
GFP-tagged 544F 
intron sequence ambiguities 416 
role 319-320
splicing enhancers 322 
Spo11 protein 282-285
SR proteins 322, 323F, 326 
Src protein kinases 117, 118F, 155-156, 
157F, 578 

binding via SH2 domains 852 
as cytoplasmic tyrosine kinases 862, 
1080, 1336 

Lck protein as 1331-1332 
regulation 155-156
viral cancers and 1265 
SREBP (sterol response element binding 
protein) 656F 
SRP-like pathway 665F 
SRP receptor 673-674, 678F 
SRPs (signal-recognition particles)
673-674, 675F, 677-680 
and the nucleolus 330 
SSB (single-strand binding) proteins 246, 
247-248F, 253, 256F 

staining 

cell cycle stages 966 
electron microscopy 555-556
Gram-positive and Gram-negative 
610F, 1267F 

light microscopy 529, 535, 536F 
white blood cells 1240F 
see also fluorescent dyes 


standard free energy changes, ΔG°
comparing reactions 61-62, 765 
deriving equilibrium constants 62-63, 
63T, 139F 

phosphate bond breakage 78, 79F 
viability of reactions 71-72 
standard redox potentials 765 
Staphylococcus (S. aureus) 

antibiotic-resistant (MRSA) 1276 
as opportunists 1276 
Staphylococcus spp. as Gram-positive 
1267F 

starch 80-81, 785-786
starfish eggs 839F 
start codons 347-348 
start-transfer signals 678-680, 681F
Start transition 965, 966F, 968-970, 972, 
1014 

STAT proteins (signal transducers and 

activators of transcription) 863 
stathmin (Op18) 933, 935, 959 
statins 733 

statistical methods in biology 524-525 
steady-state concentrations 513F, 514 
steady-state kinetics 142 
steady states

actin filament growth 901, 902, 904
bistability 518-520
negative feedback effects 516
protein-promoter complexes 511
stable and unstable 519
time to attain 512, 514, 516
stearic acid 98 

STED (stimulated emission depletion
microscopy) 551-552
Steel/SCF protein 1187, 1244
stem-cell niche 1222 
stem cells 

in artificial replacement 1249-1250 
asymmetric cell division idea 1222, 
1223F 

blood cell formation from 1239-1247 
cancer stem cells 1120-1122, 1124
defining properties 1219, 1220F 
epidermal stem cells 1225-1226 
in epithelial tissues 1217-1227
guided generation of cells and organs
1266-1267 

hematopoietic 1239, 1242-1243, 1308

independent-choice theory 1222, 
1223F 

in intestinal crypts 1122, 1124, 
1219-1220, 1224 
mesenchymal 1229 
multipotent 1219F, 1220-1221, 1222F, 
1229, 1239, 1242-1244 
mutation risk 1243 
myoblasts persisting as 1234 
neoblasts resembling 1248-1249 
neural stem cells 1201F, 1250, 1251F 
organoids from 1223 
restricted possibilities 1251 
therapeutic use 1249-1251 
types 1220 
stem cells, embryonic 

in drug discovery 1258-1259 
guided generation of cells and organs
1266-1267 

production and pluripotency 1253 
transcription network 399F, 1254 


stem cells, pluripotent 

induced pluripotency 398, 401,
1254-1259 

through reprogramming 398 
stem-loop structures, RNA 414, 423F, 
427F, 435 

stereocilia 890, 892, 924, 1189-1190 
“steric zipper” 131F 
steroids 99, 875-876 
brassinosteroids 881 
sterols, cholesterol as 568
sticklebacks 1174-1175 
“sticky ends” 465F 
stochastic effects 523-524 
stomach cancer 1093, 1110F, 1120, 
1129F, 1132, 1265, 1277 
stop codons 348-349, 350 

nonsense-mediated mRNA decay 
327F, 352-353 
premature 418
relative scarcity in ORFs 482 
tumor suppressor gene mutations 
1110F 

stop-transfer sequences 663, 679-680, 
681F 

STORM (stochastic optical reconstruction 
microscopy) 552, 553F 
strand-directed mismatch repair 244T, 
245, 250-251 

strand exchange (strand invasion), in 
homologous recombination 
264F, 278-282, 283F, 284 
streptavidin 538F, 549F 
Streptococcus (S. pneumoniae) 
as an opportunist 1276 
as Gram-positive 1267F 
transformation with alien DNA 174F 
streptogramin 351F 
Streptomyces spp. 1293 
stress 

MAP kinase response 857 
NFκB signaling and 873-874
in plants 881 

survival by cancer cells 1099, 1103, 
1115-1116, 1133 

stress fibers 891, 911, 923, 955-957,
958F, 996 

stress granules 427-428 
striated muscle 891, 918
stripes, Drosophila embryo 393-395 
strokes, apoptosis and 1031-1032 
stroma, basal lamina 1069, 1100-1101, 
1228 

stroma, chloroplasts 658, 665, 782 
stromal cell contact, stem cells 1244 
STRs (short tandem repeats) 476 
strychnine 629 
subclones 

in adenomatous polyps 1123 
in cancer 1096, 1097F, 1118-1119, 
1123 

substrate affinity, Km as measuring 143
substrates 

direction into reaction pathways 58, 
59F 

in enzyme catalysis 51, 57, 140-141
see also enzyme-substrate complexes 
succinate dehydrogenase (Complex II) 
767-768F, 772-773, 775 
succinyl-CoA 107 
sucrose 784, 786
sugars

in GAGs 1058 
glycolysis 74-78 

product yields from oxidation 775T 
ring and complex formation 96-97 
ring numbering 100 
storage as starch and fat 785-786 
see also glucose; monosaccharides; 
oligosaccharides; 
polysaccharides 

sulfur 

sulfate assimilation pathway 86 
sulfhydryl groups 91 
see also disulfide bonds 
SUMO (small ubiquitin-related modifier) 
158 

SUN proteins 948-949 
superhelical tension, DNA 314-315 
superhelices, collagen 1061, 1070 
superoxides 771, 808, 1239, 1301
superresolution techniques (microscopy) 
549-553 

suppressor mutations 487 
surface imaging, by SEM 558 
survival factors 

apoptosis suppression by 1011, 
1029-1030 

and cell migration 1186-1187 
SV40 simian virus 128F, 732, 1279 
switch helices 161 
switches 

allosteric proteins as 152 
cell-cycle control system 967, 972-973

cooperative binding and 517 
GTP-binding proteins as 157F, 703, 
820, 854 

GTPases (Ran, Ras, Rho) as 653, 854, 
956 

kinases and phosphatases and 819 
Lac operons as 382-383
M-Cdk activation 979 
microtubule catastrophes and rescues 
934 

operation in Drosophila 392-395 
positive feedback loops and 517, 
518-520 

protein kinases as 829F, 841, 843F,
846, 852, 857 

riboswitches 414-415, 423F 
Smads in 866 

transcription activation and repression 
871, 873, 878

transcription control in eukaryotes 
384-392 

transcription regulators as 384-392, 
402-403 

transcription repressors, to activators 
868, 871

switching times, gene expression 
1176-1177 
symbiosis 

endosymbiont hypothesis 800 
in eukaryotes 25 

plants and nitrogen-fixing bacteria 12 
in tube worms 12F
sympathetic neurons 
NGF and 1208 

withdrawal from the cell cycle 1018 
symporters 602-603, 604F, 605, 628, 632 


synapses 

adjustment depending on activity 
1198 

chemical synapses 627-630
described 627
electrical and chemical 628
elimination 1211
formation 1209-1212 
number per neuron 633 
psychoactive drugs at 631-632 
scaffold proteins 1050F 
synapsis and desynapsis 1006-1007 
synaptic clefts 628, 632-633, 1209 
synaptic plasticity 636-637, 1212-1213 
synaptic signaling 815 
synaptic vesicles 628, 744-746, 747F 
synaptobrevin 745F, 747F 
synaptonemal complexes 1006-1007, 
1008-1009F, 1010 
synaptotagmin 745 
syncytia 393F, 655F, 748, 1002, 1003F, 
1157 

skeletal muscle cells as 1233 
syncytial blastoderms 1157-1158, 
1157-1161, 1165 
syndecans 1060-1061 
Synechococcus elongatus 778F, 878 
synonymous codons 219-220 
syntaxin 745F
synteny 221-222, 224-225 
“synthetic-lethal” treatments 491, 1133, 
1140 

synthetic ribozymes 363-364 

T 

T-antigen, SV40 virus 652F 
T cell activation 

control of nuclear import 655F 
dendritic cells and 1324-1326, 1331
extracellular signals and 1336-1337
T-cell-mediated immune responses 
1307-1308 
T-cell receptors 

accessory receptors and co-receptors 
1304, 1325, 1330T, 1331-1333, 
1335-1337 

inhibitory receptors 1138, 1337
misfolding 712 
T cells 

antigen presentation by dendritic cells
1305 

co-stimulatory proteins 1305, 1306F, 
1326, 1327F, 1335, 1337-1338 
effector T cells 1309-1311, 1313,
1324, 1328, 1333-1335, 1337
γ/δ T cells 1326
main classes of 1325 
MHC proteins and 1324-1339 
origin in the thymus 1308 
recognition of foreign peptides 1327F
thymocytes as developing T cells 
1308F, 1332-1333, 1336 
t-loops 263 

T tubules (transverse tubules) 

in muscle contraction 920, 921F 
voltage-gated Ca2+ channels in 633
T4 bacteriophage 19F, 324
T7 bacteriophage 243
tail polymerization 72-73 
talin 1037T, 1075, 1078-1081 


tamoxifen 1221-1222F 
tandem duplication 

and genetic variation 228 
linking protein domains 122 
tandem repeats 

RNA polymerase tails 317F 
STRs (short tandem repeats) 476 
telomeres 262 

TAP-tagging (tandem affinity purification 
tagging) 451 

TAP (transporter associated with antigen 
processing) 611
TAR RNA 414 

TAT pathway (twin arginine translocation) 
665F 

TAT protein 414, 420-421
TATA boxes 310-311, 312F, 385F
tau protein 932-933, 934F 
Taxol® (paclitaxel) 904T, 929, 936F 
TBP (TATA-binding protein) 310,
311-312F, 311T

TBSV (tobacco bushy stunt virus) 129F 
Tbx5 transcription factor 1259 
TCRs (T-cell receptors) 1304, 1325-1326, 
1328, 1331-1333, 1336, 1337F, 
1338 

telomerases 

cancer cells 1100 
expression by ES cells 1254 
human cells not producing 1016, 

1100 

lncRNA in 429, 435
and the nucleolus 330 
resembling reverse transcriptases 
262, 263F
telomeres 

as chromosomal structures 186, 263 
end-replication problem 262 
length and its regulation 264-265
replicative cell senescence 442, 1016,
1100 

telophase 964, 981, 995
plant cell 1000F 

TEM (transmission electron microscopy)
305F, 554, 555-556F, 558, 560F
temperature effects, lipid bilayers 571
temperature-sensitive mutations 489 
templated polymerization 3-4 
templating 

active sites 145 
assembly factors 130 
in DNA repair 275 

in DNA replication 177-178, 239-240
in PCR 474 
RNA 364 

tensile strength 1046, 1057, 1063, 1070, 
1082-1083, 1084-1085F 
tension sensing 1043, 1044F, 1068F, 

1080, 1081F, 1229 
teratomas 1257 

terminally differentiated cells 400, 816,
1012, 1021, 1148
skeletal muscle 1233 
small intestinal villi 1218-1219 
termination (of transcription) 306 
testes 

Leydig cells 671F 
piRNA in 433

testosterone, structure 99, 876F 
Tet repressor 495F
tethering

by disordered regions 126
by dynamin 701 
Rab effectors in 706 
repair factories 213-214 
by RNA polymerase II CTDs 316 
TetR efflux pump 1293F 
tetracysteine tags 1053F 
Tetrahymena 263T, 324, 805F 
tetranucleosomes 192 
TFII see general transcription factors 
TGFβ (transforming growth factor-β)

BMPs as family members 1168
growth inhibition by 1012
induced regulatory T cells 1336
morphogen gradients 1166 
myostatin as family member 1196 
TGFβ/activin family 865
TGFβ signaling pathway
in cancer 1117, 1123F
in development 1150
TGFβ superfamily 865-866, 1168
TGN (trans-Golgi network) 716, 722,
727-729, 730F, 731, 736,
749-750 

exocytosis and 741-744 
β-thalassemia 323-324
Thale cress see Arabidopsis
thermodynamics 
laws of 52-54 

of living cells 53-54, 102-103 
thermophilic bacteria 473F, 483, 572 
thick filaments 915-916, 918-920, 923F, 
936F 

thin filaments 916, 918-921, 922F
third-generation sequencing methods 481
thoracic duct, lymphatic system 1236,
1312 

3C (chromosome conformation capture) 
method 209F, 212 
three-dimensional imaging 

electron microscopy 557-558, 560F
light microscopy 540, 541F, 542, 

550F, 553F 

3' end, DNA 175, 177F, 480 
threonine, structure 113 
thrombin 141, 837T, 1078
thylakoid membranes 606, 658, 664-665, 
783 

reaction centers 783-784, 788-790, 
793, 796 

site of photosynthesis and ATP 
generation 786-787
site of photosystems I and II 789-790 
thylakoid space 783, 784F, 791-792, 

793F, 794, 795F 
thymidine, radioactive 1219 
thymine 

from deamination of 5-methylcytosine
405 

DNA base pairing with adenine 176, 
177F 

structure 100 
thymine dimers 268, 269F 
thymocytes 1308F, 1332-1333, 1336 
thymosin 905-907 

thymus, origins of T cells 1308, 1325,
1336 

thyroid hormones 835T, 874, 876, 1182 
thyroxine 876F 


tight junctions 

in epithelial cells 1036F, 1037,

1048-1049F 
permeability 1047 
polarized cells 749 
preventing lipid diffusion at 590 
structure and function 1047-1049 
tiling 1207 

TIM complex (translocator of the inner 
mitochondrial membrane) 659, 
660F, 661-662 
TIM22 659, 660F, 663F, 664 
TIM23 659, 660F, 661-662, 663F, 664 
Tim protein (timeless) 878 
time delays, kinetic proofreading 345 
time-lapse imaging 1249 

fluorescence microscopy 757F, 803F, 
875F, 935F, 991F, 1178 
timing in embryonic development 
1176-1184 

Timothy syndrome 1258 
tip cells, endothelial 1236-1237
TIRF (total internal reflection fluorescence
microscopy) 547-548, 553F 
tissue maintenance and repair 1154 

independent of stem cells 1226-1227 
tissue morphogenesis 1059 
tissue remodeling/regeneration 
1043-1045, 1071 
regeneration of organs 1247
self-renewal and cancer 1120-1121
see also wound healing
tissue segregation and homophilic binding
1041 

tissue specialization 

duplicated genes 229 
E. coli adhesins 1277 
tissues 

derived from ectoderm, endoderm and 
mesoderm 1147, 1167
isolating cells from 440 
neurotrophic factor release 
1208-1209 

non-renewable 1227, 1247 
protein abundances in brain and liver 
372F 

RNAi resistant 501

self-renewing, and cancer 1120-1121
specimen preparation for microscopy
535-536 

titin protein 549F, 920 
TLRs (Toll-like receptors) 873, 1165, 
1299-1300, 1304 

TMV (tobacco mosaic virus) 128-129 
TNF (tumor necrosis factor) receptor 
family 1024 

TNFα (tumor necrosis factor α) 873, 1301
tobacco cells 

GFP-tagged organelles 544F 
vacuoles 724F 
Western blotting 455F 
tobacco smoking 1095, 1127-1128,

1129F 

TOF (time of flight) analyzers 456F, 457 

Toll gene 1158F, 1165 

Toll-like receptors (TLRs) 873, 1165, 

1299, 1300T, 1304 

Toll receptors 873, 1158F, 1165, 1299 
TOM complex (translocator of the outer 
membrane) 659-662, 663F, 664 
tomograms 558, 559F 


topoisomerases 

recombinases resembling 293 
superhelical tension removal 315 
topoisomerase I 251-253 
topoisomerase II 195F, 252-253, 977 
topological relationships/equivalents 
644-647 

TOR (target of rapamycin) 861, 1017
mTOR 860F, 861, 864T, 1114-1115
torque, ATP synthase 777F
Torso receptors 1158F
total internal reflection fluorescence
microscopy (TIRF) 547-548 
totipotency 

fertilized egg 1253 
totipotent cells 507, 1148, 1248 
toxins 

antibody response 1307 
of bacterial pathogens 1270, 1278 
entry points 576 
G protein targets 834 
and neurotransmitters 629 
Toxoplasma gondii 1282, 1283F
trabecular bone 1230
trace elements 43F
tracheae and tracheoles 1191
traction, in cell migration 951, 952F, 955,
956F 

trajectories of points 519-520 
trans-acting epigenetic mechanisms 412,
413F

trans-acting lncRNAs 435F, 436
trans-Golgi network (TGN) 716, 727-729,
730F, 731, 736, 741-744,
749-750

trans-SNARE complexes 708
transcellular transport 605, 1047, 1048F 
transcription 301-317 

in bacteria and eukaryotes 306 
coordination with splicing 322 
direction of 309 
“factories” 332 
histone acetylation 202
nucleotide excision repair coupled to
271 

process described 302-305, 315F, 
510F 

speed of 304 

as templated polymerization 4 
transcription activators (activator proteins) 
chromatin modification 386-388 
coactivators 385-386, 388-389, 392, 
394, 395F 

combinatorial controls 394-395
fractional promoter occupation 
511-512 

fusion with bacterial Cas9 497 
in positive feedback 518 
repressors and 381-383, 394 
RNA polymerase II and 312-313 
switch from repressors 868, 871 
transcription attenuation 414
transcription bubbles 306 
transcription circuits, as switches 
402-403 

transcription control 

in the cell-cycle control system 971
in eukaryotes 384-392, 405 
of gene expression 372, 373F 
transcription elongation

accessory proteins 313-314 


Page numbers with an F refer to a figure; page numbers with a T refer to a table. 





and riboswitches 415 
and RNA processing 315-316 
transcription factors see transcription 
regulators 

transcription initiation 

chromatin modification using 387-388 
in eukaryotes 310-313, 387
and termination 306-309 
transcription initiation complexes 311 
transcription rate constants 513 
transcription regulators (transcription 
factors) 

ability to read DNA sequences 
374-375 

activating differentiation 1170-1171
in bacteria 380-383
chromatin immunoprecipitation and
505 

in chromatin modification 196-197, 
199-201, 206

combinatorial gene control 396, 397F, 
399, 520-521 

controlled activation 395, 396F 
cooperative binding 378-380, 517F 
dimerization 375-378 
in eukaryotes 384-392 
extracellular signaling 867-880 
Hox modulated 1163 
ligand-modulated 874-876, 877F 
maintaining embryonic stem cells 
1254-1255 

master transcription regulators 
398-399, 1171 
neurons expressing 1200F 
p53 protein as 1116 
as proteins 30 

role and mode of operation 373-380,
389-390 

sequential expression 1179 
in specialized cells 399-400 
as switches 384-392, 402-403 
in the syncytial blastoderm 1157 
see also latent transcription regulators; 
transcription activators; 
transcription repressors 
transcription repressors (repressor 
proteins) 

activators and 381-383, 394 
binding site occupancy 514 
co-repressors 385, 386F, 392, 394, 
395F 

combinatorial controls 394-395
DNA inversion and 294F
DNA methylation and 405
fusion with bacterial Cas9 497
negative feedback 402, 515
switch to activators 868, 871 
tryptophan repressor 376, 380-383, 
392 

transcription units 306 
transcriptional activators, regulators,
repressors see transcription 
activators etc. 

transcriptional synergy 388-389 
transcytosis 732, 737, 738F, 749F 
transdifferentiation 1258-1259 
transducin (Gt) 845, 846T, 848
transduction and horizontal gene transfer 
1268 

transesterifications in RNA splicing 318,
321 


transfer RNAs (tRNAs) 
function 7, 305, 334 
initiator tRNA 347-348 
and the nucleolus 330 
ribosomal binding sites 347 
structure 335F 
unusual bases 335F, 337F 
transferrin 427, 553F 
transferrin receptor 734-735 
transformation, horizontal gene transfer 
by 1268 

transformed phenotypes 1098, 1106 
transforming growth factor-β see TGFβ
transgenic organisms 495-497, 500, 506, 
543, 1221F 

transgenic plants 507-508 
transit amplifying cells 1121-1122, 1219, 
1220F, 1223, 1225, 1226F 
transition-metal ions 764-766
transition states, enzyme stabilization of
141-146 
translation 

accuracy and free energy 345-346 
co-translational protein folding 
353-354, 355F 

conserved gene families 20, 21T 
control with 3' and 5' UTRs 422-423 
control with IRES 425-426 
converting RNA into proteins 6-7, 
315F, 333-362 

initiation and termination 347-349 
messenger RNA in 343-344 
quality control 351-353 
ribosome profiling technique 505-506 
speed of 346 
stepwise 343-344F 
translation initiation mechanisms 425F, 
1288 

translation recoding 350 
translational control, gene expression 
372, 373F

translational repressors 623 
translesion polymerases 273, 274F 
translocation steps, ribosomal 342 
translocations 487 
translocator proteins 1270-1271F 
the translocon 677 
transmembrane proteins 
adhesion proteins 1037 
association with the lipid bilayer 577F 
ATG9 as 726 

β-barrels in 579-581, 659, 662-663
cadherins and integrins as 1037 
death receptors as 1024 
enzyme-coupled receptors as 849 
ER capture 672 
fates of endocytosed 738F 
hydrophobicity 577, 579F, 593 
integration process 678
IRE1 as a transmembrane protein 
kinase 687 

most receptors as 816 
Notch as 720 

PERK as a transmembrane protein 
kinase 688 

single-pass and multipass 578-579 
SRP receptor as 674 
see also multipass; single-pass 
transmission electron microscopy (TEM)
305F, 554, 555-556F, 558, 1210F 
transmitter-gated ion channels 614, 
627-632, 636 


A-to-l editing 418 
acetylcholine receptors as 630-631
allostery in 618, 825
ion-channel-coupled receptors 817F, 
818, 843 

types in neurons 631, 636
transport ATPases (ATP-driven pumps) 
601-602, 606-607 

transport proteins, inner mitochondrial 
membrane 779-780 
transport vesicles 644-648, 650, 656F, 
670, 691 

coated vesicles 697 
formation 701 
scope 695 

see also vesicular transport 
transporter (protein) 599 
in epithelial cells 605
in plants 883

regulation of cytosolic pH 604-605
transposable elements (transposons)
conservative site-specific 
recombination and 293 
DNA-only transposons 288-290 
double-stranded RNA in 421 
eukaryotic genomes 28 
and genome changes 217-218, 222 
nonretroviral retrotransposons 291 
retroviral-like transposons 291 
RNA interference and 429, 432-433
three classes 288T, 292 
virulence genes 1268 
transposition 287-292 
trastuzumab 1137 
treadmilling, actin filaments 901, 
903-904, 953-954 
tree of life see phylogenetic trees 
triacylglycerols 78, 81, 83F, 98-99
tricarboxylic acid cycle see citric acid 
cycle 
tricellulin 1049 
trichothiodystrophy 497F 
trigger mechanism of invasion 1281, 
1282F, 1285F 

triskelions, clathrin 698, 699F 
Trithorax group proteins 1164 
Triton X-100 583, 584F 
TrkA kinase 853-854, 1209 
tropoelastin 1065 
tropomodulin 905, 909, 919, 920F
tropomyosin 416, 592F, 905, 907-910,
920F, 921, 922F, 923
α-tropomyosin gene 319F
troponins 921-923 
trypanosomes 689, 805, 1290 

Trypanosoma brucei 1266, 1290
Trypanosoma cruzi 1283-1284, 1290
trypsin 119, 132T, 440, 456-457, 942F 
tryptophan, structure 113 
tryptophan repressor 376, 380-383, 392 
TSH (thyroid-stimulating hormone) 835T 
tube and vesicle formation 1192 
tube worms 12F 
tubules 

branching tubular structures 1190 
vesicle budding events 705 
see also microtubules 
tubulin subunits 893-894, 904T, 927, 
928-929, 934-935 
α-tubulin 925, 926
β-tubulin 926F, 927
γ-tubulin 929, 930F, 982
tubulins 459 

consequences of mutations 925
fluorescence-tagging 544, 545F 
FtsZ as homolog 896 
immunofluorescence 539F 
in microtubules 891,925 
separation from samples 440
sequestration 935
tumor biopsies, genome sequencing 
1141 

tumor cells see cancer cells 
tumor microenvironments 1100-1101, 
1137, 1138F 
tumor progression 

colorectal cancers 1122-1126 
correlation with mutation sequence
1125-1126 

immune system and 1137, 1138F
loss of p53 function as essential 1126 
metastatic and primary genomes 
1119 

natural selection in 1091-1092, 1096,
1104, 1118, 1119F, 1125 
process 1095-1097, 1109, 1114F, 
1115-1116, 1119 
tumor suppressor genes 
discovery 1107-1109 
DNA sequence changes 1110F 
examples 1110F, 1115 
genetic and epigenetic inactivation 
1108-1109, 1110F 
loss-of-function mutation and 1104 
tumor viruses 1105-1106, 1129-1132 
tumors 

benign and malignant 1092, 1093F 
cell types in 1093, 1101F
defined 1092 
growth rate 1094F 

TUNEL technique (TdT-mediated dUTP 
nick end labeling) 1024F 
turbine analogy, ATP synthase 776-778 
γ-TuRC (γ-tubulin ring complex) 929,
930-931F, 933
turgor pressure 

extracellular matrix and 1058 
plant cell walls and 1083, 1085 
vacuoles and 724-725 
turnover numbers 141, 142-143
turnover rates 

connexins 1052, 1053F 
Myc regulating protein 1107 
and signal response speeds 825-826, 
827F 

Twist genes 1165, 1166F 
Twist protein 1042 
two-dimensional gel electrophoresis 
452-454 

two-photon effect 542 
Tyk2 as a JAK kinase 863, 864T
Type III fibronectin repeats 1066F, 1067-1068,
1073

Type IV collagen 1058F, 1062, 1069-1073 
Type XVII collagen 1037T, 1062, 1070, 
1076F 

tyrosine 

structure 113 
sulfation 719 

thyroid hormones from 876 
see also phosphotyrosine 
tyrosine aminotransferases 371-372, 400 


tyrosine-kinase-associated receptors 862 
tyrosine kinase domains 849 
tyrosine kinases 

distinguished from serine/threonine 
kinases 819-820 
in protein kinase evolutionary tree 
155F 

regulation of Src kinase 155-156
see also cytoplasmic; receptor 
tyrosine kinases 


U

U1, U2, U4, U5 and U6 see snRNAs
ubiquinone 765-766, 767F, 768-770, 

771F, 772-773

ubiquitin 

APC/C ubiquitylation in mitosis 992 
histone side chain modifications 197F
monoubiquitylation, multiubiquitylation
and polyubiquitylation 735
and the proteasome 358F, 359
regulatory function 157-159, 165T 
structure 159F 

tagging proteins for degradation 736,
882, 992

see also E1; E2; SUMO
ubiquitin ligases (E3) 158, 359-360 
APC/C as 360, 970 
Parkin 727 

SCF ubiquitin ligase 159-160, 164, 
167-168, 971, 972F, 973, 1004
Smurf 866 

Ubx (Ultrabithorax) gene/complex 1162F,
1164F 

UIMs (ubiquitin-interaction motifs) 853 
ultracentrifuges 
analytical 455 
preparative 445-447, 455
ultraviolet radiation 267-268, 269F, 1094 
uncoupling protein 780 
“unfoldases” 358, 359F 
unfolded protein response 686-687 
unidirectional ratchets 162
uniporters 602, 605 
units of length 531 
universal gene families 20, 21T 
unsaturated fatty acids 98 
unsaturated lipids 571F 
unstructured regions (proteins) see
disordered regions

untranslated regions, mRNA see UTRs
uORFs (upstream open reading frames) 
424 

uracil 

from cytosine deamination 267T, 268, 
269F, 1322-1323 
in RNA 302, 303F 
structure 100 

urate oxidase in peroxisomes 666 
urea as denaturing agent 453 
urea cycle 760 
uric acid 1301 
uridine 

dihydro- 335F, 337F 
isomerization to pseudouridine 328 
4-thio- 337F 

uridine diphosphate glucose 69T 
uronic acids see galacturonic; glucuronic; 
iduronic 

uterine cervix, cancers of 1093F, 1096F, 
1129, 1131, 1265 


UTRs (untranslated regions, mRNA) 184T
3' and 5', control of translation
422-423

3'-UTRs (untranslated regions, mRNA)
421-422, 430 

V

V gene segments 1320-1321 
v-Ras oncogene 1106 

V region, Ig light chains 1320, 1321F,

1323F 

v-Src gene 1105 

V-type ATPases 723, 736, 743, 747F, 778 

V-type pumps 606 

vaccination 

and the adaptive immune system
1297 

immunization as the basis 1307 
against Neisseria 1291 
vaccine production through DNA 
cloning 484 

against virus diseases 1131, 1274,
1275F, 1291 

Vaccinia virus 939, 1278, 1287F 
vacuolar H+ ATPases 723
vacuoles 

as lysosomes 724-725 
parasite containing 1283F, 1285, 
1286F 

products from 725
valine, structure 113 
van der Waals attractions 

atomic force microscopy and 548 
lipid bilayers 572, 575 
as noncovalent 44, 94 
van der Waals radii 45F, 94, 110 
van Leeuwenhoek, Antonie 440 
vancomycin 1292-1293 
variable regions, Ig light and heavy chains
1318-1319 
variegation 807 

position effect 195-197, 199-202,

205 

variola (smallpox) virus 1265, 1273T 
vasculogenesis and angiogenesis 1236 
vasopressin 835T, 837T, 840T 
VCAMs (vascular cell adhesion molecules)
1055 

V(D)J recombination 275F, 290, 

1320-1321, 1323, 1325, 1332 
VEGF (vascular endothelial growth factor)
action via receptor tyrosine kinases 
850T 

angiogenesis and 1120, 1237
interaction with matrix proteins 1073 
VegT transcription regulator 1156, 1173 
“Velcro principle” 1039, 1079 
velocity sedimentation 446, 447F
vernalization 1183-1184 
vertebrate embryos 

inductive signaling 1166, 1167F, 1177, 
1184, 1198 

nervous systems 1041F
spatial patterning 1167-1169 
vertebrates 

adaptive immune system 1307-1315
body plan inversion 1169 
cyclins and Cdks 969T 
desmosomes in 1046 
Drosophila melanogaster as model 33
evolution of 227

extracellular matrix in 1057-1074 
gene duplication frequency 34-35 
innate and adaptive immune response 
1297 

intermediate filament types 944T 
loss of CG sequences 406-407 
mitochondrial genomes 803-804 
neural development 1199-1200 
organ regeneration in 1248-1249
times of divergence 37F 
vertical gene transfer 1268 
vesicle coat proteins 650 
vesicle transport model 720-721 
vesicles 9, 585, 594 

budding and fusion 647, 648 
caveolae as pinocytic vesicles 
731-732 

formation of tubes and 1192 
intralumenal vesicles 724F, 729-730, 
735-737, 738F 

synaptic vesicles 628, 744-746, 747F 
vesicles, endocytic see endocytic vesicles 
vesicles, secretory see secretory vesicles 
vesicular traffic, disruption by pathogens 
1284-1286 
vesicular transport 

bidirectionality 730-731 
Rab family GTPases in 705 
retrieval pathways 695-696, 713-714, 
726 

sorting signais and 646 
steps in 696F, 698F 
see also transport vesicles
vesicular tubular clusters 712-713 
Viagra® 847 

Vibrio cholerae 13F, 1265, 1266F, 
1269-1270 

vimentin-like filaments 944T, 946, 948, 

959 

vinblastine 993

vinculin 1037T, 1042F, 1043, 1044F,

1075F, 1078F, 1079-1081 
viral capsids 128, 129F, 290, 562F, 1266F, 
1274, 1280F 

nucleocapsids 1274, 1275F, 1286, 
1287F, 1288 

viral carcinogens 1130-1132 
viral envelopes 1275F, 1280-1281, 1287F 
virions 1274, 1281, 1287F, 1288 
viroids 363F 

virulence genes/virulence factors 609, 
610F, 1268-1270 
virulence plasmids 1268 
virus receptors 1279 
virus types and strains 

adenovirus 1273T, 1274F, 1280F, 

1281, 1289 

baculoviruses 1029, 1287-1288 
DNA viruses 1130, 1273, 1274F, 1278, 
1289 

enveloped viruses 1274, 1275F, 1280, 
1286, 1287F, 1288 

hepatitis-B and -C 1129-1130, 1132
herpes simplex virus 1273T, 1279 
measles 1273T, 1274, 1289 
nonenveloped viruses 1274, 
1280-1281 

not using double-stranded DNA 268 
pandoraviruses 1274 
papillomaviruses 732, 1129, 1130T, 
1131-1132 


parvoviruses 1274 
poxviruses 1274, 1280, 1289 
rabies 1273T, 1274F, 1279 
RNA viruses 1273, 1274F, 1278, 1289 
SV40 simian virus 128F, 732, 1279 
see also bacteriophages; HIV;
retroviruses; tumor viruses 
viruses (generally) 

bacterial defenses 433-434 
budding 738F, 1274, 1275F, 1286, 
1287F 

causing human diseases 1273T 
class I MHC suppression 1304-1305 
cross-presentation and 1329 
detecting in samples 475F 
entry points and mechanisms 576, 
732, 1280-1281
and gene transfer 18 
host-cell responses 1303-1305 
infection and membrane fusion 709 
introducing altered genes 495, 
1105-1106 
latent phases 1131 
and microtubular transport 939 
morphology 1274F 
negative staining 560
receptor-mediated endocytosis 1281
replication errors and evolution 1291
RNA interference and 431
self-assembly 128-129 
transdifferentiation vectors 1258 
use of conservative site-specific 
recombination 293 
use of host cell machinery 
1273-1275, 1288-1289 
use of host cytoskeleton 1286-1288 
use of IRES 425F, 426 
viewed as mobile genetic elements 
290 

vision 825, 843-846, 1204-1206 
visual system electrical activity 
1211-1212 

vitamins 

as coenzymes 146-148 
vitamin A 876 
vitamin D 874-876 
vitamin D3 876F 
see also biotin 
vitreous ice 556, 560 
voltage-gated channels 614, 621-622, 
623F, 625F, 626-630, 632-635 
voltage-gated Ca2+ channels 626, 
628, 632-633, 635, 745 
voltage-gated cation channels 621, 
626-627, 629-630 

voltage-gated K+ channels 622, 625F, 
627, 634 

voltage-gated Na+ channels 621-622, 
623F, 626-627, 633-634 
voltage sensors 622 
VSG (variant-specific glycoprotein) 1290 

W 

Warburg effect 1098-1099, 1115 
WASp proteins 957, 958F, 1278F, 1281, 
1288, 1289F 

water 

as an electron donor 762, 788, 790 
cage structures 568, 569F 
content of cells 535 


excluding from binding sites 136 
formation of acids and bases in 45-46 
hydrogen bonding in 44, 92, 94, 136 
hydrophilic and hydrophobic 
molecules 569F 

proton behavior in 45-46, 613, 773 
as a solvent 93 
structure 92 
vitreous ice 556, 560 
see also hydrophilicity; hydrophobicity 
water channels see aquaporins 
water-soluble proteins, ER capture 672 
wavelength 

of electrons 554 
excitation and emission, of fluorescent 
dyes 537F 

fluorescent protein excitation 545 
multiphoton microscopes 542 
resolution limits and 529-530, 533F, 
554 

Wee1 kinase 970, 973T, 979 
Western blotting 455 
wheat 559F, 783F 
white blood cells 

chemokine attraction of 1241 
inflammatory response and 1240 
leukemias as cancers of 1092 
major categories of 1239 
matrix degradation 1072 
selectin and integrin action 1055, 
1077 
use of lymphatic system 1236 
White gene, Drosophila 195F 
whole-genome analysis see genome 
sequencing 

whole-genome duplications 35, 228 
whooping cough (pertussis) 834, 1277 
wing hairs 1189 
Wingless gene 536F, 869, 1161 
Wingless protein 1160 
Wiskott-Aldrich Syndrome 967 
Wnt proteins 868-871, 1199, 1200F 
Wnt11 1156-1157 
see also Wingless 

Wnt signaling pathways 1124, 1157, 
1160, 1178, 1190 
canonical Wnt pathway 869-870 
planar-cell-polarity signaling pathway 
869, 1189F 

stem cell maintenance 1220-1221, 
1224 

Wnt/β-catenin pathway 869-870, 
1117 

wobble base-pairing 335, 336-337F, 342 
in mitochondrial genomes 804 
worms see Ascaris; Caenorhabditis 
wound healing 504, 953, 1011, 1059 
VEGF and 1237 
see also tissue remodeling/ 
regeneration 
wound repair 

fibroblasts in 1228 
plasma membrane enlargement 748 
wounds, and infection 1276 
writer proteins 199-200, 406F 

X 

X-chromosome sample sequence 300F 
X-inactivation 410-411, 412F, 1252 
x-ray crystallography/x-ray diffraction 
of ADP/ATP carrier protein 780F 
amyloid fibrils 131F 
of an SH2 domain 853F 
of ATP synthase 777F 
of bacterial K+ channel 617 
of bacteriorhodopsin 587 
of Bcr-Abl-imatinib 1136F 
of Ca2+/calmodulin/CaM-kinase II 
841-842F 
coiled coils 117 
of a connexin 1052F 
of a cyclin/Cdk complex 970F 
of DNA and RNA polymerases 304 
electron microscopy compared 562 
of Ig domains 1319F 
of integrins 1077 
of lysozyme 144 
of nucleosomes 189F, 192 
of protein conformations 120F, 121, 
144 

protein structure from 460, 461F 
of respiratory chain complexes 768 
of ribosomes 346F, 460 
ribulose bisphosphate carboxylase 
461F 

of transcription regulators 375 
of transfer RNA 335F 
of transmembrane proteins 579-581 
viral capsids 128-129F 
x-ray exposure see radiation 
Xenopus microtubule-associated protein 
(XMAP215) 933-935 
Xenopus ruwenzoriensis 35 
Xenopus tropicalis 35 
Xenopus (X. laevis) 

cell cycle model 966 
embryo cortical rotation 1156, 1167 
embryo patterning 1167 
embryo reprogramming experiments 
205, 1252 


gene duplication among 35 
globin gene family in 230 
as laboratory species 35 
nuclear organelles 213F 
oocyte asymmetries 1156 
oocyte extracts 448, 545 
oocyte nuclear lamina 656F 
rRNA genes 328 

xeroderma pigmentosum (XP) 266T, 267, 
1094 

XIAP protein 1029 
XIC (X-inactivation center) 411, 412F 
Xist lncRNA 411, 412F, 435 
XMAP215 (Xenopus microtubule-associated protein) 933-935 

Y 

Yap/Yorkie gene 1197F 
yeasts 

barcoded mutants 499F 
characteristic transposon types 292 
control of cell cycle in 186, 1018F 
electron microscope image 556F 
endoplasmic reticulum 671F 
MAP kinases 857 
mitochondrion 451F 
mold transitions 1271 
and prion diseases 131 
protein interaction mapping 166-167, 
168F 

protein purification 451 
as single-celled eukaryotes 30 
yeasts, budding 925, 949, 950F, 966F 
cell cycle proteins 969T, 971 
cyclins and Cdks 969T 
see also Saccharomyces 
yeasts, fission see Schizosaccharomyces 
Yersinia pestis 1276, 1281 
Yersinia pseudotuberculosis 1281 


Z 

Z discs 918-920, 948 
Z-ring 896 

Z schemes 789-790, 792-793 
zebrafish (Danio rerio) 

convergent extension 1189F 
genome size 35 
germ cell migration 1185F 
Her gene expression 1179F 
lakritz mutant 488 
maternal-zygotic transition 1181 
as model organism 29T, 33, 35, 36F 
neural map 1204F 
whole-genome duplications 228 
Zellweger syndrome 668 
zigzag model, chromatin 192 
zinc finger proteins 377 
Zinnia elegans 783F 
zipper mechanism of invasion 1281, 
1282F, 1284 

ZO proteins (zonula occludens) 1049 
zonula adherens (adhesion belt) 1036F, 
1044, 1045F 

zonula occludens/ZO proteins 1049 

zygotene 1006, 1007F 

zygotes 

as beginning of development 1145, 
1147 

as pluripotent or totipotent 1253 
zygotic-effect genes 1159, 1165, 1181F, 
2282 

zymogens 736 
The Genetic Code 

1st Position              2nd Position               3rd Position 
(5' end)          U       C       A       G          (3' end) 

U                 Phe     Ser     Tyr     Cys        U 
                  Phe     Ser     Tyr     Cys        C 
                  Leu     Ser     STOP    STOP       A 
                  Leu     Ser     STOP    Trp        G 

C                 Leu     Pro     His     Arg        U 
                  Leu     Pro     His     Arg        C 
                  Leu     Pro     Gln     Arg        A 
                  Leu     Pro     Gln     Arg        G 

A                 Ile     Thr     Asn     Ser        U 
                  Ile     Thr     Asn     Ser        C 
                  Ile     Thr     Lys     Arg        A 
                  Met     Thr     Lys     Arg        G 

G                 Val     Ala     Asp     Gly        U 
                  Val     Ala     Asp     Gly        C 
                  Val     Ala     Glu     Gly        A 
                  Val     Ala     Glu     Gly        G 


Amino Acids and Their Symbols       Codons 

A    Ala    Alanine                 GCA GCC GCG GCU 
C    Cys    Cysteine                UGC UGU 
D    Asp    Aspartic acid           GAC GAU 
E    Glu    Glutamic acid           GAA GAG 
F    Phe    Phenylalanine           UUC UUU 
G    Gly    Glycine                 GGA GGC GGG GGU 
H    His    Histidine               CAC CAU 
I    Ile    Isoleucine              AUA AUC AUU 
K    Lys    Lysine                  AAA AAG 
L    Leu    Leucine                 UUA UUG CUA CUC CUG CUU 
M    Met    Methionine              AUG 
N    Asn    Asparagine              AAC AAU 
P    Pro    Proline                 CCA CCC CCG CCU 
Q    Gln    Glutamine               CAA CAG 
R    Arg    Arginine                AGA AGG CGA CGC CGG CGU 
S    Ser    Serine                  AGC AGU UCA UCC UCG UCU 
T    Thr    Threonine               ACA ACC ACG ACU 
V    Val    Valine                  GUA GUC GUG GUU 
W    Trp    Tryptophan              UGG 
Y    Tyr    Tyrosine                UAC UAU 
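
As an illustration of how the two tables above relate, here is a minimal Python sketch that treats the genetic code as a lookup from codon to one-letter amino acid symbol. It is not part of the original text: the names CODON_TABLE, translate, and codons_for are hypothetical, "*" is used here as an assumed marker for the three STOP codons, and the sequence is assumed to be an in-frame mRNA written with U, C, A, and G only.

```python
from itertools import product

# Bases in the row/column order of the table: U, C, A, G.
BASES = "UCAG"

# One-letter symbols for all 64 codons, ordered by 1st, 2nd, then 3rd
# position (UUU, UUC, UUA, UUG, UCU, ...). "*" marks a STOP codon
# (an assumption of this sketch, not notation used in the tables).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each three-base codon to its amino acid symbol.
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(mrna):
    """Translate an in-frame mRNA string, stopping at the first STOP codon."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        aa = CODON_TABLE[mrna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def codons_for(symbol):
    """Return the sorted list of codons that encode a one-letter symbol."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == symbol)
```

For example, translate("AUGGCUUGGUAA") reads AUG, GCU, UGG and then stops at UAA, and codons_for("L") returns the six leucine codons listed in the second table.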


















































